AlpinDale 7d0884de9a fix mistral v0.3 weight loading vor 7 Monaten
..
attention f6250c5516 move dockerfiles to root; fix cpu build vor 7 Monaten
common e8b7f53321 allow prompt token IDs in the logits processor api vor 7 Monaten
distributed 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
endpoints e8b7f53321 allow prompt token IDs in the logits processor api vor 7 Monaten
engine 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
executor 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
kv_quant e42a78381a feat: switch from pylint to ruff (#322) vor 1 Jahr
lora 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
modeling 7d0884de9a fix mistral v0.3 weight loading vor 7 Monaten
processing 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
quantization e8b7f53321 allow prompt token IDs in the logits processor api vor 7 Monaten
spec_decode 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
task_handler 5b0c11d190 support pipeline parallel pynccl groups vor 7 Monaten
transformers_utils 60e74e92fd add rope_scaling arg vor 7 Monaten
__init__.py be8154a8a0 feat: proper embeddings API with e5-mistral-7b support vor 7 Monaten
py.typed 1c988a48b2 fix logging and add py.typed vor 1 Jahr