AlpinDale 3f92035bf1 fix: add `ignored_seq_groups` in `_schedule_chunked_prefill` 7 months ago
..
attention 75f97bc25d bump flash-attn to remove unnecessary copies in the backend 7 months ago
common 76d6f49bbb fix: modelscope downloads 7 months ago
distributed b2fd915c35 improve p2p access check 7 months ago
endpoints 1d7f5c45b0 feat: add stream_options for chat completions 7 months ago
engine d7ebffe2f0 chore: re-add the graceful engine shutdown 7 months ago
executor 17eb1b7eb9 chore: remove ray health check 7 months ago
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
lora c975bba905 fix: sharded state loader with lora 7 months ago
modeling c975bba905 fix: sharded state loader with lora 7 months ago
multimodal f2e94e2184 chore: minor llava cleanups in preparation for llava-next 7 months ago
processing 3f92035bf1 fix: add `ignored_seq_groups` in `_schedule_chunked_prefill` 7 months ago
quantization 40bc98b363 chore: use cutlass kernels for fp8 if supported 7 months ago
spec_decode ec5b99d075 fix: use named args 7 months ago
task_handler c975bba905 fix: sharded state loader with lora 7 months ago
transformers_utils 76d6f49bbb fix: modelscope downloads 7 months ago
__init__.py be8154a8a0 feat: proper embeddings API with e5-mistral-7b support 7 months ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago