AlpinDale
|
2dfa4e47e6
chore: set seed for dummy weights init
|
5 kuukautta sitten |
AlpinDale
|
9d7beaa5b9
chore: separate kv_scale into k_scale and v_scale
|
5 kuukautta sitten |
AlpinDale
|
b82c39772c
chore: allow quantizing all layers of deepseek-v2
|
5 kuukautta sitten |
AlpinDale
|
517676249c
chore: update the compressed-tensors config
|
6 kuukautta sitten |
AlpinDale
|
690110a051
feat: bitsandbytes quantization
|
6 kuukautta sitten |
AlpinDale
|
7d0884de9a
fix mistral v0.3 weight loading
|
6 kuukautta sitten |
AlpinDale
|
f4ea11b982
feat: initial support for activation quantization
|
6 kuukautta sitten |
AlpinDale
|
94ba676ee0
fix: torch.uniform_ doesn't support FP8, fix for dummy weights
|
6 kuukautta sitten |
AlpinDale
|
3bbfd65549
feat: support hub model ID when offline
|
6 kuukautta sitten |
AlpinDale
|
46159b107a
formatting: pt1
|
7 kuukautta sitten |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
7 kuukautta sitten |