AlpinDale c975bba905 fix: sharded state loader with lora 7 月之前
..
__init__.py 04b53d2db5 chore: add initializer files 1 年之前
cache_engine.py f40b809d3b allow using v2 block manager with sliding window 7 月之前
cpu_model_runner.py 8d77c69cbd feat: support image processor and add llava example 7 月之前
cpu_worker.py 50b7c13db0 refactor: attention selector (#552) 8 月之前
embedding_model_runner.py 8d77c69cbd feat: support image processor and add llava example 7 月之前
model_runner.py c975bba905 fix: sharded state loader with lora 7 月之前
neuron_model_runner.py 35ae01d7ba refactor: attention metadata term 8 月之前
neuron_worker.py fca911ee0a vLLM Upstream Sync (#526) 8 月之前
worker.py eb2c5c77df feat: enforce the max possible seqlen 7 月之前
worker_base.py 7194047318 remove vllm-nccl 7 月之前