1
0
AlpinDale ec5b99d075 fix: use named args 7 сар өмнө
..
attention 75f97bc25d bump flash-attn to remove unnecessary copies in the backend 7 сар өмнө
common 237fa59aea feat: support CPU/GPU swapping in BlockManagerV2 7 сар өмнө
distributed b2fd915c35 improve p2p access check 7 сар өмнө
endpoints d00a7517e6 fix: tokenizer delay with using LLM class 7 сар өмнө
engine ec5b99d075 fix: use named args 7 сар өмнө
executor 05d6e43244 fix: `torch.compile()` with mp executor backend 7 сар өмнө
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 1 жил өмнө
lora 5fecc6b025 when was this deprecated? 7 сар өмнө
modeling 39b36efabf fix: mixtral fp8 ckpt loading 7 сар өмнө
multimodal 75f97bc25d bump flash-attn to remove unnecessary copies in the backend 7 сар өмнө
processing 237fa59aea feat: support CPU/GPU swapping in BlockManagerV2 7 сар өмнө
quantization 39b36efabf fix: mixtral fp8 ckpt loading 7 сар өмнө
spec_decode ec5b99d075 fix: use named args 7 сар өмнө
task_handler e321d80e4e fix: `prompt_logprobs==0` case 7 сар өмнө
transformers_utils 8d77c69cbd feat: support image processor and add llava example 7 сар өмнө
__init__.py be8154a8a0 feat: proper embeddings API with e5-mistral-7b support 7 сар өмнө
py.typed 1c988a48b2 fix logging and add py.typed 1 жил өмнө