| Author | SHA1 | Message | Date |
|---|---|---|---|
| | 145e554a4d | neuron: add 8bit quantization for Neuron (#994) | 1 month ago |
| | 0dfa6b60ec | core: support logprobs with multi-step scheduling (#963) | 1 month ago |
| | ba6d798784 | neuron: support for context length and token bucketing (#960) | 1 month ago |
| | 0e558e9b2f | fix: loading chameleon model with TP>1 (#695) | 5 months ago |
| | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 5 months ago |