__init__.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
block_table.py
|
79d603954e
fix: chunked prefill with v2 block manager (#679)
|
4 months ago |
common.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
cpu_gpu_block_allocator.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
interfaces.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
naive_block.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
prefix_caching_block.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
utils.py
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |