AlpinDale 5c3b94de45 spec decode: move ops.advance_step to flash attention backend (#1005) 2 months ago
__init__.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
cache_engine.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
cpu_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
cpu_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
embedding_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
enc_dec_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
model_runner_base.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
multi_step_model_runner.py 5c3b94de45 spec decode: move ops.advance_step to flash attention backend (#1005) 2 months ago
multi_step_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
neuron_model_runner.py 145e554a4d neuron: add 8bit quantization for Neuron (#994) 2 months ago
neuron_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
openvino_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
openvino_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
tpu_model_runner.py 0c56d23ece tpu: fix outputs by correcting the next_token_ids shape (#986) 2 months ago
tpu_worker.py a50548c0b9 tpu: use XLA rank for persistent cache path (#989) 2 months ago
utils.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
worker_base.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
xpu_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago
xpu_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 months ago