AlpinDale 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 ヶ月 前
..
__init__.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 ヶ月 前
cpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
distributed_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
executor_base.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
msgspec_utils.py 2f61644f6e SPMD optimizations (#824) 2 ヶ月 前
multiproc_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
multiproc_worker_utils.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 ヶ月 前
multiproc_xpu_executor.py 15cb8d5c26 xpu: support pipeline parallel (#932) 1 ヶ月 前
neuron_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
openvino_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
ray_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
ray_tpu_executor.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 ヶ月 前
ray_utils.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 ヶ月 前
ray_xpu_executor.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 ヶ月 前
tpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前
xpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 ヶ月 前