AlpinDale 5b0c11d190 support pipeline parallel pynccl groups 6 ヶ月 前
..
__init__.py 04b53d2db5 chore: add initializer files 1 年間 前
cache_engine.py 50b7c13db0 refactor: attention selector (#552) 6 ヶ月 前
cpu_model_runner.py f6250c5516 move dockerfiles to root; fix cpu build 6 ヶ月 前
cpu_worker.py 50b7c13db0 refactor: attention selector (#552) 6 ヶ月 前
embedding_model_runner.py de62ceb18c refactor: eliminate parallel worker per-step task scheduling overhead 6 ヶ月 前
model_runner.py 5b0c11d190 support pipeline parallel pynccl groups 6 ヶ月 前
neuron_model_runner.py 35ae01d7ba refactor: attention metadata term 6 ヶ月 前
neuron_worker.py fca911ee0a vLLM Upstream Sync (#526) 7 ヶ月 前
worker.py eb2c5c77df feat: enforce the max possible seqlen 6 ヶ月 前
worker_base.py de62ceb18c refactor: eliminate parallel worker per-step task scheduling overhead 6 ヶ月 前