AlpinDale
|
fbec255dc1
chore: enable tpu tensor parallel in async engine
|
hace 4 meses |
AlpinDale
|
8d88814475
chore: reduce XLA compile times
|
hace 4 meses |
AlpinDale
|
e1475fbec7
feat: MoE support with Pallas GMM kernel for TPUs
|
hace 5 meses |
AlpinDale
|
1cb06835a0
fix: TPU multimodal kwargs and outlines installation in TPU docker
|
hace 5 meses |
AlpinDale
|
fe21123a1c
feat: TPU support (#570)
|
hace 5 meses |