.. |
block
|
6ac658b0d6
some small performance improvements
|
5 maanden geleden |
__init__.py
|
ac1d46a2ec
feat: begin work on the engine
|
1 jaar geleden |
block_manager_v1.py
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
5 maanden geleden |
block_manager_v2.py
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
5 maanden geleden |
embedding_model_block_manager.py
|
237fa59aea
feat: support CPU/GPU swapping in BlockManagerV2
|
5 maanden geleden |
evictor_v1.py
|
5fecc6b025
when was this deprecated?
|
5 maanden geleden |
evictor_v2.py
|
5fecc6b025
when was this deprecated?
|
5 maanden geleden |
interfaces.py
|
237fa59aea
feat: support CPU/GPU swapping in BlockManagerV2
|
5 maanden geleden |
policy.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
6 maanden geleden |
scheduler.py
|
e76bbe72eb
chore: handle aborted requests for jamba
|
5 maanden geleden |