.. |
block
|
df7ae8ce01
fix spec_decode and block imports
|
преди 9 месеца |
__init__.py
|
ac1d46a2ec
feat: begin work on the engine
|
преди 1 година |
block_manager_v1.py
|
f52aa64fe6
use the get_len() method instead of manual len calculation
|
преди 9 месеца |
block_manager_v2.py
|
fa083286e3
Speculative Decoding Part 4: Lookahead scheduling (#402)
|
преди 9 месеца |
evictor.py
|
375f24ccca
fix: optimize context shift performance (#380)
|
преди 9 месеца |
interfaces.py
|
fa083286e3
Speculative Decoding Part 4: Lookahead scheduling (#402)
|
преди 9 месеца |
policy.py
|
6f00203041
refactor scheduler for chunked prefill, remove reorder policy for now
|
преди 9 месеца |
scheduler.py
|
fe17712f29
fully working chunked prefill
|
преди 9 месеца |