AlpinDale 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 mesiac pred
..
__init__.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 10 mesiacov pred
cpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
distributed_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
executor_base.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
msgspec_utils.py 2f61644f6e SPMD optimizations (#824) 2 mesiacov pred
multiproc_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
multiproc_worker_utils.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 mesiac pred
multiproc_xpu_executor.py 15cb8d5c26 xpu: support pipeline parallel (#932) 1 mesiac pred
neuron_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
openvino_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
ray_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
ray_tpu_executor.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 mesiac pred
ray_utils.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 mesiac pred
ray_xpu_executor.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 mesiac pred
tpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred
xpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 mesiac pred