AlpinDale
|
842912d022
feat: on-the-fly gguf conversion (#250)
|
1 year ago |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
1 year ago |
AlpinDale
|
c0aac15421
feat: S-LoRA support (#222)
|
1 year ago |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
1 year ago |
AlpinDale
|
f013d714c0
chore: merge dev branch into main (#177)
|
1 year ago |
AlpinDale
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
AlpinDale
|
887e03669a
feat: add exllamav2 for GPTQ (#99)
|
1 year ago |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
a6a4220fa6
feat: refactor megatron and quants (#57)
|
1 year ago |
50h100a
|
1da9efbcc7
restore rope scaling (#25)
|
1 year ago |
AlpinDale
|
0495c50a3e
GPTQ+exllama support (#21)
|
1 year ago |
AlpinDale
|
1ab15649ae
fix column parallelism in quantized layers
|
1 year ago |
AlpinDale
|
779148bfc3
fix missing import in llama modeling
|
1 year ago |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 year ago |
AlpinDale
|
280583cd29
forgot import in llama code
|
1 year ago |
AlpinDale
|
d9c1d4f6e5
add awq support
|
1 year ago |
AlpinDale
|
22fe51d5c3
remove revision from llama for now
|
1 year ago |
AlpinDale
|
39beed0b87
Revert "Refactor AWQ support."
|
1 year ago |
AlpinDale
|
d09e27f5d4
Refactor AWQ support.
|
1 year ago |
AlpinDale
|
6b9561ef07
adapt TGI incremental detokenization
|
1 year ago |
AlpinDale
|
45f6d9f923
initial refactor commit
|
1 year ago |
AlpinDale
|
bdf264880f
clean up safetensors support
|
1 year ago |
AlpinDale
|
aba8b0b17a
add rope theta support and bump transformers
|
1 year ago |
AlpinDale
|
8c2353e803
llama support for safetensors
|
1 year ago |
AlpinDale
|
76b2e4a445
Merge dev branch into main (#7)
|
1 year ago |
AlpinDale
|
6de30f43a4
fix: typos and refactors for llama
|
1 year ago |
AlpinDale
|
394006f964
llama: vocab padding support
|
1 year ago |
AlpinDale
|
0715cc1958
fix: typo in llama modeling file
|
1 year ago |
AlpinDale
|
42682fdaf9
feat: adapt modeling_llama.py
|
1 year ago |