AlpinDale
|
9183c13b5b
add gumbel softmax
|
11 months ago |
AlpinDale
|
b3315c9a4c
update the readme (#207)
|
11 months ago |
AlpinDale
|
9f77f35ff5
bump version to 0.4.6 (#204)
|
1 year ago |
AlpinDale
|
fe70c6e8d5
feat: bump cuda and pytorch (#205)
|
1 year ago |
AlpinDale
|
c5802b2bd5
fix: remove windows specific files
|
1 year ago |
AlpinDale
|
193287b2ef
fix: mixtral unused import
|
1 year ago |
AlpinDale
|
53d391e1f2
merge 'dev' into 'main'
|
1 year ago |
AlpinDale
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
AlpinDale
|
d54791aaa8
feat: reduce sampler overhead by making it less blocking (#198)
|
1 year ago |
AlpinDale
|
871c0ce8e4
fix: triton compile error (#200)
|
1 year ago |
AlpinDale
|
7e72ce0a73
feat: mixtral tensor parallelism (#193)
|
1 year ago |
AlpinDale
|
d7f113c3ff
readme: add acknowledgements
|
1 year ago |
AlpinDale
|
95bdd35ec9
feat: rejection sampler (#197)
|
1 year ago |
AlpinDale
|
f121a5edd8
feat: tokenizer endpoint for OpenAI API (#195)
|
1 year ago |
AlpinDale
|
15a0454172
feat: FP8 KV Cache (#185)
|
1 year ago |
AlpinDale
|
801eda0b7a
feat: support GPTQ 2, 3, and 8bit quants (#181)
|
1 year ago |
AlpinDale
|
b9b295d74e
chore: backlogs 1 (#191)
|
1 year ago |
AlpinDale
|
17cdc5ac23
yapf
|
1 year ago |
KaraKaraWitch
|
9a0b5a197d
fix: set CPU Affinity (#187)
|
1 year ago |
AlpinDale
|
68c2083adb
fix includes in wheel
|
1 year ago |
AlpinDale
|
11af9b796e
bump version to 0.4.5
|
1 year ago |
AlpinDale
|
3736d831f2
fix klite embed
|
1 year ago |
AlpinDale
|
81d7a8d323
bump version to 0.4.4
|
1 year ago |
AlpinDale
|
f013d714c0
chore: merge dev branch into main (#177)
|
1 year ago |
AlpinDale
|
6eb0b926fd
chore: make openai api key optional (#176)
|
1 year ago |
g4rg
|
fe57bb7ad2
feat: add rope scaling to mixtral (#174)
|
1 year ago |
AlpinDale
|
7d91e9e0f2
feat: CUDA graphs (#172)
|
1 year ago |
AlpinDale
|
725be3e0de
feat: mixtral HF with expert parallelism (#167)
|
1 year ago |
AlpinDale
|
6c50f5b067
chore: include stop strings in output (#168)
|
1 year ago |
g4rg
|
2aab3da9bd
chore: fix Python 3.8+ compatibility (#170)
|
1 year ago |