.. |
__init__.py
|
50b7c13db0
refactor: attention selector (#552)
|
7 months ago |
loader.py
|
690110a051
feat: bitsandbytes quantization
|
7 months ago |
neuron.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 months ago |
tensorizer.py
|
0cea453d36
automatically detect tensorized models
|
7 months ago |
utils.py
|
fca911ee0a
vLLM Upstream Sync (#526)
|
8 months ago |
weight_utils.py
|
690110a051
feat: bitsandbytes quantization
|
7 months ago |