AlpinDale
|
0e5bb11503
fix: make `merge_async_iterators.is_cancelled()` optional (#656)
|
4 months ago |
AlpinDale
|
3170c0d4c6
fix: GPTQ/AWQ on Colab (#655)
|
4 months ago |
AlpinDale
|
83bcb9119a
fix: multiprocessing timeout (#654)
|
4 months ago |
AlpinDale
|
1e119cbeb6
fix: input processor in internvl2 (#653)
|
4 months ago |
AlpinDale
|
a2344d3617
fix: move zeromq rpc frontend to IPC instead of TCP (#652)
|
4 months ago |
AlpinDale
|
f1e1d0bd3d
feat: introduce `BaseAphroditeParameter` (#646)
|
4 months ago |
AlpinDale
|
47ac074937
fix: RSLoRA support (#647)
|
4 months ago |
50h100a
|
b96ba9930e
Merge pull request #644 from 50h100a/quadfix
|
4 months ago |
AlpinDale
|
59264d32e9
fix: hardcoded float16 in embedding mode check (#645)
|
4 months ago |
50h100a
|
cbdf2d986f
quadratic sampling: separate diff from logits to avoid NaNs.
|
4 months ago |
AlpinDale
|
31f82da8bd
chore: deduplicate nvlink check to cuda platform (#643)
|
4 months ago |
AlpinDale
|
3648170750
fix: gracefully handle missing chat template (#642)
|
4 months ago |
AlpinDale
|
77c4fbd5c9
fix: better async request cancellation (#641)
|
4 months ago |
AlpinDale
|
a03e0e2ea4
ci: exclude cu118 and cu121 from build and add py_limited_api (#639)
|
4 months ago |
AlpinDale
|
db81a67c54
bump to v0.6.0.post1 (#635)
|
4 months ago |
AlpinDale
|
c147670c13
fix: clean up incorrect log in worker (#636)
|
4 months ago |
AlpinDale
|
308501daa5
fix: default api port and attention selector (#634)
|
4 months ago |
AlpinDale
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
AlpinDale
|
337071f484
chore: optimize evictor v2 performance (#631)
|
4 months ago |
AlpinDale
|
a401f8e05d
feat: per-tensor token epilogue kernels (#630)
|
4 months ago |
AlpinDale
|
09b82f9963
feat: Add support for GPU device selection in SpecDecodeBaseSampler (#629)
|
4 months ago |
AlpinDale
|
f5cca12da8
feat: multi-image input for minicpmv (#628)
|
4 months ago |
Trapper4888
|
ba848b00f3
readme: fix model name typo (#627)
|
4 months ago |
AlpinDale
|
30d7effc7c
feat: add siglip encoder for llava family (#626)
|
4 months ago |
AlpinDale
|
4b02629c6a
add FUNDING.yml
|
4 months ago |
AlpinDale
|
627dd86948
readme: update for v0.6.0
|
4 months ago |
AlpinDale
|
54d6d87f0c
ci: fix the base url
|
4 months ago |
AlpinDale
|
d8d0b9cf26
Create CNAME
|
4 months ago |
AlpinDale
|
32fc9c21f4
ci: deploy under subpath
|
4 months ago |
AlpinDale
|
9f38ec855e
ci: specify pnpm lock path
|
4 months ago |