AlpinDale
|
d9f4c36edd
feat: Medusa speculative decoding support (#590)
|
5 months ago |
AlpinDale
|
7b04361934
fix: support getting `eos_token_id` from the config file
|
5 months ago |
AlpinDale
|
af43576da0
feat: add MLPSpeculator speculative decoding support (#572)
|
5 months ago |
AlpinDale
|
bba89fc6d3
chore: make the automatic rope scaling behave properly with rope_scaling arg, add rope theta
|
5 months ago |
AlpinDale
|
4d1e613804
chore: minor simplifications
|
5 months ago |
AlpinDale
|
76d6f49bbb
fix: modelscope downloads
|
5 months ago |
AlpinDale
|
60e74e92fd
add rope_scaling arg
|
6 months ago |
AlpinDale
|
fca911ee0a
vLLM Upstream Sync (#526)
|
6 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
AlpinDale
|
da223153c6
feat&fix: cohere support and missing GPU blocks (#333)
|
10 months ago |
AlpinDale
|
89c32b40ec
chore: add new imatrix quants (#320)
|
10 months ago |
AlpinDale
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
11 months ago |
AlpinDale
|
842912d022
feat: on-the-fly gguf conversion (#250)
|
11 months ago |
AlpinDale
|
e59e7f0a99
feat: yi support (#104)
|
1 year ago |
AlpinDale
|
efc6f7fbec
chore: reformats (#90)
|
1 year ago |
AlpinDale
|
9c353a0e02
fix: unnecessary import
|
1 year ago |
AlpinDale
|
28db67fd78
fix: mistral support
|
1 year ago |
AlpinDale
|
cbeeabeb9a
feat: mistral support (#20)
|
1 year ago |
AlpinDale
|
c95d80da39
fix revision issues
|
1 year ago |
AlpinDale
|
6b9561ef07
adapt TGI incremental detokenization
|
1 year ago |
AlpinDale
|
2cdfc45a40
fix: trust_remote_code fixes
|
1 year ago |