AlpinDale f2b6dc3872 cpu: add support for W8A8 quantization via compressed-tensor (#1017) il y a 2 semaines
..
__init__.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) il y a 9 mois
cpu_executor.py f2b6dc3872 cpu: add support for W8A8 quantization via compressed-tensor (#1017) il y a 2 semaines
distributed_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) il y a 2 semaines
executor_base.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) il y a 2 semaines
gpu_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
msgspec_utils.py 2f61644f6e SPMD optimizations (#824) il y a 1 mois
multiproc_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) il y a 2 semaines
multiproc_worker_utils.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
multiproc_xpu_executor.py 15cb8d5c26 xpu: support pipeline parallel (#932) il y a 2 semaines
neuron_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
openvino_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
ray_gpu_executor.py 4737c22ab3 fix: pass `APHRODITE_ATTENTION_BACKEND` to ray workers (#1009) il y a 2 semaines
ray_tpu_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
ray_utils.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
ray_xpu_executor.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) il y a 3 semaines
tpu_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines
xpu_executor.py 3bb0f07461 chore: rename `task_handler` to `worker` (#985) il y a 2 semaines