AlpinDale 008e646c7e chore: add support for up to 2048 block size (#715) há 4 meses atrás
..
__init__.py 04b53d2db5 chore: add initializer files há 1 ano atrás
cache_engine.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
cpu_model_runner.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
cpu_worker.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
embedding_model_runner.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
enc_dec_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) há 4 meses atrás
model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) há 4 meses atrás
model_runner_base.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
neuron_model_runner.py 008e646c7e chore: add support for up to 2048 block size (#715) há 4 meses atrás
neuron_worker.py 008e646c7e chore: add support for up to 2048 block size (#715) há 4 meses atrás
openvino_model_runner.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
openvino_worker.py bf88c8567e feat: mamba model support (#674) há 4 meses atrás
tpu_model_runner.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) há 4 meses atrás
tpu_worker.py 1c519cc6ac chore: set per-rank XLA cache for TPU (#714) há 4 meses atrás
utils.py a0e446a17d feat: initial encoder-decoder support with BART model (#633) há 4 meses atrás
worker.py b03fa02397 refactor: base worker input refactor for multi-step (#683) há 4 meses atrás
worker_base.py f76f2a5af0 feat: add aphrodite plugin system (#705) há 4 meses atrás
xpu_model_runner.py 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) há 4 meses atrás
xpu_worker.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás