AlpinDale | 5761ef8c35 | feat: gemma-2 support | 5 months ago
AlpinDale | 0f4a9ee77b | quantized lm_head (#582) | 5 months ago
AlpinDale | c7bddcdef1 | chore: skip for logits_scale == 1.0 | 6 months ago
AlpinDale | e8b7f53321 | allow prompt token IDs in the logits processor api | 6 months ago
AlpinDale | 6fc1ec6e9a | fix redirects and improve low level debugging | 6 months ago
AlpinDale | aed64884c6 | feat: prompt logprobs with chunked prefill (#539) | 6 months ago
AlpinDale | fca911ee0a | vLLM Upstream Sync (#526) | 7 months ago
AlpinDale | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 9 months ago