AlpinDale 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 month ago
..
__init__.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 10 months ago
cpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
distributed_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
executor_base.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
msgspec_utils.py 2f61644f6e SPMD optimizations (#824) 2 months ago
multiproc_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
multiproc_worker_utils.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 month ago
multiproc_xpu_executor.py 15cb8d5c26 xpu: support pipeline parallel (#932) 1 month ago
neuron_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
openvino_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
ray_gpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
ray_tpu_executor.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 month ago
ray_utils.py 61103b92d4 tpu: support single and multi-host TPUs on GKE and RayServe (#970) 1 month ago
ray_xpu_executor.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 month ago
tpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago
xpu_executor.py 0dfa6b60ec core: support logprobs with multi-step scheduling (#963) 1 month ago