AlpinDale 008e646c7e chore: add support for up to 2048 block size (#715) 4 ヶ月 前
..
__init__.py 04b53d2db5 chore: add initializer files 1 年間 前
cache_engine.py bf88c8567e feat: mamba model support (#674) 4 ヶ月 前
cpu_model_runner.py bf88c8567e feat: mamba model support (#674) 4 ヶ月 前
cpu_worker.py bf88c8567e feat: mamba model support (#674) 4 ヶ月 前
embedding_model_runner.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 ヶ月 前
enc_dec_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 ヶ月 前
model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 ヶ月 前
model_runner_base.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 ヶ月 前
neuron_model_runner.py 008e646c7e chore: add support for up to 2048 block size (#715) 4 ヶ月 前
neuron_worker.py 008e646c7e chore: add support for up to 2048 block size (#715) 4 ヶ月 前
openvino_model_runner.py bf88c8567e feat: mamba model support (#674) 4 ヶ月 前
openvino_worker.py bf88c8567e feat: mamba model support (#674) 4 ヶ月 前
tpu_model_runner.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) 4 ヶ月 前
tpu_worker.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) 4 ヶ月 前
utils.py a0e446a17d feat: initial encoder-decoder support with BART model (#633) 4 ヶ月 前
worker.py b03fa02397 refactor: base worker input refactor for multi-step (#683) 4 ヶ月 前
worker_base.py f76f2a5af0 feat: add aphrodite plugin system (#705) 4 ヶ月 前
xpu_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 ヶ月 前
xpu_worker.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 ヶ月 前