AlpinDale
|
145e554a4d
neuron: add 8bit quantization for Neuron (#994)
|
2 weeks ago |
AlpinDale
|
0dfa6b60ec
core: support logprobs with multi-step scheduling (#963)
|
2 weeks ago |
AlpinDale
|
ba6d798784
neuron: support for context length and token bucketing (#960)
|
2 weeks ago |
AlpinDale
|
0e558e9b2f
fix: loading chameleon model with TP>1 (#695)
|
4 months ago |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |