AlpinDale
|
b178ae4b4a
chore: generalize linear_method to be quant_method (#540)
|
vor 5 Monaten |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
vor 6 Monaten |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
vor 8 Monaten |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
vor 9 Monaten |
AlpinDale
|
da223153c6
feat&fix: cohere support and missing GPU blocks (#333)
|
vor 9 Monaten |
AlpinDale
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
vor 10 Monaten |
AlpinDale
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
vor 10 Monaten |
AlpinDale
|
842912d022
feat: on-the-fly gguf conversion (#250)
|
vor 11 Monaten |
AlpinDale
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
vor 11 Monaten |