.. |
block
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 ヶ月 前 |
__init__.py
|
ac1d46a2ec
feat: begin work on the engine
|
1 年間 前 |
block_manager_v1.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 ヶ月 前 |
block_manager_v2.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 ヶ月 前 |
evictor_v1.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
evictor_v2.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 ヶ月 前 |
interfaces.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 ヶ月 前 |
placeholder_block_space_manager.py
|
abfd4465ca
feat: add support for chunked prefill + prefix caching (#871)
|
1 ヶ月 前 |
scheduler.py
|
bc1a2bdf98
do not use cached chunks for prompt_logprobs
|
3 週間 前 |