AlpinDale 5c3b94de45 spec decode: move ops.advane_step to flash attention backend (#1005) 2 달 전
..
__init__.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
cache_engine.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
cpu_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
cpu_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
embedding_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
enc_dec_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
model_runner_base.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
multi_step_model_runner.py 5c3b94de45 spec decode: move ops.advane_step to flash attention backend (#1005) 2 달 전
multi_step_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
neuron_model_runner.py 145e554a4d neuron: add 8bit quantization for Neuron (#994) 2 달 전
neuron_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
openvino_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
openvino_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
tpu_model_runner.py 0c56d23ece tpu: fix outputs by correcting the next_token_ids shape (#986) 2 달 전
tpu_worker.py a50548c0b9 tpu: use XLA rank for persistent cache path (#989) 2 달 전
utils.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
worker_base.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
xpu_model_runner.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전
xpu_worker.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) 2 달 전