block | 9c9b2dd843 core: improve warmup times for prefix caching in block manager v2 (#920) | 2 weeks ago |
__init__.py | ac1d46a2ec feat: begin work on the engine | 1 year ago |
block_manager_v1.py | f7f3fed265 feat: add async postprocessor (#925) | 2 weeks ago |
block_manager_v2.py | f7f3fed265 feat: add async postprocessor (#925) | 2 weeks ago |
evictor_v1.py | f1d0b77c92 [0.6.0] Release Candidate (#481) | 4 months ago |
evictor_v2.py | 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) | 1 month ago |
interfaces.py | f7f3fed265 feat: add async postprocessor (#925) | 2 weeks ago |
placeholder_block_space_manager.py | abfd4465ca feat: add support for chunked prefill + prefix caching (#871) | 1 month ago |
scheduler.py | f561a54a43 core: fix async postprocessor in case of preemption (#1000) | 1 week ago |