AlpinDale 008e646c7e chore: add support for up to 2048 block size (#715) 4 months ago
..
__init__.py 04b53d2db5 chore: add initializer files 1 year ago
cache_engine.py bf88c8567e feat: mamba model support (#674) 4 months ago
cpu_model_runner.py bf88c8567e feat: mamba model support (#674) 4 months ago
cpu_worker.py bf88c8567e feat: mamba model support (#674) 4 months ago
embedding_model_runner.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
enc_dec_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 months ago
model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 months ago
model_runner_base.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
neuron_model_runner.py 008e646c7e chore: add support for up to 2048 block size (#715) 4 months ago
neuron_worker.py 008e646c7e chore: add support for up to 2048 block size (#715) 4 months ago
openvino_model_runner.py bf88c8567e feat: mamba model support (#674) 4 months ago
openvino_worker.py bf88c8567e feat: mamba model support (#674) 4 months ago
tpu_model_runner.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) 4 months ago
tpu_worker.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) 4 months ago
utils.py a0e446a17d feat: initial encoder-decoder support with BART model (#633) 4 months ago
worker.py b03fa02397 refactor: base worker input refactor for multi-step (#683) 4 months ago
worker_base.py f76f2a5af0 feat: add aphrodite plugin system (#705) 4 months ago
xpu_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 4 months ago
xpu_worker.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago