| Name | Commit | Message | Updated |
|---|---|---|---|
| block | 63b735bc2a | chore: optimize v2 block manager to match the performance of v1 | 7 months ago |
| __init__.py | ac1d46a2ec | feat: begin work on the engine | 1 year ago |
| block_manager_v1.py | ae04f57ec1 | feat: Pipeline Parallel support (#581) | 7 months ago |
| block_manager_v2.py | ae04f57ec1 | feat: Pipeline Parallel support (#581) | 7 months ago |
| embedding_model_block_manager.py | 237fa59aea | feat: support CPU/GPU swapping in BlockManagerV2 | 7 months ago |
| evictor_v1.py | 5fecc6b025 | when was this deprecated? | 7 months ago |
| evictor_v2.py | 5fecc6b025 | when was this deprecated? | 7 months ago |
| interfaces.py | 237fa59aea | feat: support CPU/GPU swapping in BlockManagerV2 | 7 months ago |
| policy.py | fca911ee0a | vLLM Upstream Sync (#526) | 8 months ago |
| scheduler.py | 99680b2d23 | feat: soft prompts (#589) | 6 months ago |