AlpinDale
|
0e6c400b13
feat: re-add GGUF (#600)
|
преди 4 месеца |
AlpinDale
|
9be43994fe
feat: fbgemm quantization support (#601)
|
преди 4 месеца |
AlpinDale
|
5289c14b24
feat: Asymmetric Tensor Parallel (#594)
|
преди 4 месеца |
AlpinDale
|
0f4a9ee77b
quantized lm_head (#582)
|
преди 4 месеца |
AlpinDale
|
ecd4460d55
fix: support 2D inputs for embeddings
|
преди 5 месеца |
AlpinDale
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
преди 5 месеца |
AlpinDale
|
c975bba905
fix: sharded state loader with lora
|
преди 5 месеца |
AlpinDale
|
6fc1ec6e9a
fix redirects and improve low level debugging
|
преди 5 месеца |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
преди 6 месеца |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
преди 8 месеца |
AlpinDale
|
968bde81bf
fix: tensor parallel with GPTQ and AWQ quants (#307)
|
преди 10 месеца |
AlpinDale
|
c41462cfcd
feat: exllamav2 quantization (#305)
|
преди 10 месеца |
AlpinDale
|
705821a7fe
feat: AQLM quantization support (#293)
|
преди 10 месеца |
TearGosling
|
80e8a14949
feat: add pygchat Jinja template (#218)
|
преди 11 месеца |
AlpinDale
|
8635901c76
fix: s-lora vocab embeddings
|
преди 11 месеца |
AlpinDale
|
ea0f57b233
feat: allow further support for non-cuda devices (#247)
|
преди 11 месеца |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
преди 11 месеца |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
преди 11 месеца |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
преди 11 месеца |
AlpinDale
|
2755a48d51
merge dev branch into main (#153)
|
преди 1 година |