AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
il y a 1 an |
AlpinDale
|
31c95011a6
feat: FP8 E5M2 KV Cache (#226)
|
il y a 1 an |
AlpinDale
|
641bb0f6e9
feat: add custom allreduce kernels (#224)
|
il y a 1 an |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
il y a 1 an |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
il y a 1 an |
AlpinDale
|
d54791aaa8
feat: reduce sampler overhead by making it less blocking (#198)
|
il y a 1 an |
AlpinDale
|
7d91e9e0f2
feat: CUDA graphs (#172)
|
il y a 1 an |
g4rg
|
2aab3da9bd
chore: fix Python 3.8+ compatibility (#170)
|
il y a 1 an |
AlpinDale
|
ae57df0f44
fix: sliding window for mistral/mixtral (#163)
|
il y a 1 an |
AlpinDale
|
653da510d1
chore: rewrite InputMetadata (#143)
|
il y a 1 an |