.. |
block
|
6f6bf568e5
enable prefix caching with v2 block manager for spec decoding
|
7 月之前 |
__init__.py
|
ac1d46a2ec
feat: begin work on the engine
|
1 年之前 |
block_manager_v1.py
|
6f6bf568e5
enable prefix caching with v2 block manager for spec decoding
|
7 月之前 |
block_manager_v2.py
|
6f6bf568e5
enable prefix caching with v2 block manager for spec decoding
|
7 月之前 |
evictor_v1.py
|
6f6bf568e5
enable prefix caching with v2 block manager for spec decoding
|
7 月之前 |
evictor_v2.py
|
6f6bf568e5
enable prefix caching with v2 block manager for spec decoding
|
7 月之前 |
interfaces.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 月之前 |
policy.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 月之前 |
scheduler.py
|
5529304d1f
fix sampling with n>1
|
7 月之前 |