.. |
__init__.py
|
04b53d2db5
chore: add initializer files
|
1 년 전 |
cache_engine.py
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
7 달 전 |
cpu_model_runner.py
|
fdabb55a4d
fix: wrong multi_modal_input format for CPU
|
7 달 전 |
cpu_worker.py
|
50b7c13db0
refactor: attention selector (#552)
|
7 달 전 |
embedding_model_runner.py
|
8d77c69cbd
feat: support image processor and add llava example
|
7 달 전 |
model_runner.py
|
34b41e0a87
chore: add coordinator to reduce code duplication in tp and pp
|
7 달 전 |
neuron_model_runner.py
|
35ae01d7ba
refactor: attention metadata term
|
8 달 전 |
neuron_worker.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 달 전 |
tpu_model_runner.py
|
fe21123a1c
feat: TPU support (#570)
|
7 달 전 |
tpu_worker.py
|
a524667db0
fix: device assertion for sdpa backend; fix env for tpu worker
|
7 달 전 |
worker.py
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
7 달 전 |
worker_base.py
|
7194047318
remove vllm-nccl
|
7 달 전 |
xpu_model_runner.py
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
7 달 전 |
xpu_worker.py
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
7 달 전 |