.. |
block
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
__init__.py
|
ac1d46a2ec
feat: begin work on the engine
|
1 year ago |
block_manager_v1.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 month ago |
block_manager_v2.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 month ago |
evictor_v1.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
evictor_v2.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
interfaces.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
placeholder_block_space_manager.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 month ago |
scheduler.py
|
82128ec843
fix: over-processing with chunked prefill + prefix caching
|
3 weeks ago |