.. |
__init__.py
|
04b53d2db5
chore: add initializer files
|
1 anno fa |
cache_engine.py
|
f40b809d3b
allow using v2 block manager with sliding window
|
7 mesi fa |
cpu_model_runner.py
|
8d77c69cbd
feat: support image processor and add llava example
|
7 mesi fa |
cpu_worker.py
|
50b7c13db0
refactor: attention selector (#552)
|
7 mesi fa |
embedding_model_runner.py
|
8d77c69cbd
feat: support image processor and add llava example
|
7 mesi fa |
model_runner.py
|
e321d80e4e
fix: `prompt_logprobs==0` case
|
7 mesi fa |
neuron_model_runner.py
|
35ae01d7ba
refactor: attention metadata term
|
8 mesi fa |
neuron_worker.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 mesi fa |
worker.py
|
eb2c5c77df
feat: enforce the max possible seqlen
|
7 mesi fa |
worker_base.py
|
7194047318
remove vllm-nccl
|
7 mesi fa |