AlpinDale a985143768 core: add cuda graph support for encoder-decoder models (#1051) il y a 2 semaines
..
__init__.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
cache_engine.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
cpu_model_runner.py 65a59bbb6b cpu: raise error if using encoder-decoder models (#1027) il y a 2 semaines
cpu_worker.py f2b6dc3872 cpu: add support for W8A8 quantization via compressed-tensor (#1017) il y a 2 semaines
embedding_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
enc_dec_model_runner.py a985143768 core: add cuda graph support for encoder-decoder models (#1051) il y a 2 semaines
model_runner.py a985143768 core: add cuda graph support for encoder-decoder models (#1051) il y a 2 semaines
model_runner_base.py 304e1e5a8a core: dump model runner inputs during crash (#1023) il y a 2 semaines
multi_step_model_runner.py 1390915778 multi-step: add support for flashinfer attention backend (#1033) il y a 2 semaines
multi_step_tpu_worker.py 4b1b658855 tpu: implement multi-step scheduling (#1046) il y a 2 semaines
multi_step_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
neuron_model_runner.py 145e554a4d neuron: add 8bit quantization for Neuron (#994) il y a 2 semaines
neuron_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
openvino_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
openvino_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
tpu_model_runner.py 4b1b658855 tpu: implement multi-step scheduling (#1046) il y a 2 semaines
tpu_worker.py a50548c0b9 tpu: use XLA rank for persistent cache path (#989) il y a 2 semaines
utils.py a985143768 core: add cuda graph support for encoder-decoder models (#1051) il y a 2 semaines
worker.py a113309876 kernel: add meta functions for ops to prevent graph breaks (#1019) il y a 2 semaines
worker_base.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
xpu_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
xpu_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines