AlpinDale
|
926ccfd387
exponent is 1.0 by default
|
4 months ago |
AlpinDale
|
48f7216c49
add to procotol
|
4 months ago |
50h100a
|
8f5adc5a5e
make function consistent with comments:
|
4 months ago |
50h100a
|
93f82c8c01
bringing dynatemp back
|
4 months ago |
50h100a
|
b96ba9930e
Merge pull request #644 from 50h100a/quadfix
|
4 months ago |
AlpinDale
|
59264d32e9
fix: hardcoded float16 in embedding mode check (#645)
|
4 months ago |
50h100a
|
cbdf2d986f
quadratic sampling: separate diff from logits to avoid NaNs.
|
4 months ago |
AlpinDale
|
31f82da8bd
chore: deduplicate nvlink check to cuda platform (#643)
|
4 months ago |
AlpinDale
|
3648170750
fix: gracefully handle missing chat template (#642)
|
4 months ago |
AlpinDale
|
77c4fbd5c9
fix: better async request cancellation (#641)
|
4 months ago |
AlpinDale
|
a03e0e2ea4
ci: exclude cu118 and cu121 from build and add py_limited_api (#639)
|
4 months ago |
AlpinDale
|
db81a67c54
bump to v0.6.0.post1 (#635)
|
4 months ago |
AlpinDale
|
c147670c13
fix: clean up incorrect log in worker (#636)
|
4 months ago |
AlpinDale
|
308501daa5
fix: default api port and attention selector (#634)
|
4 months ago |
AlpinDale
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
AlpinDale
|
337071f484
chore: optimize evictor v2 performance (#631)
|
4 months ago |
AlpinDale
|
a401f8e05d
feat: per-tensor token epilogue kernels (#630)
|
4 months ago |
AlpinDale
|
09b82f9963
feat: Add support for GPU device selection in SpecDecodeBaseSampler (#629)
|
4 months ago |
AlpinDale
|
f5cca12da8
feat: multi-image input for minicpmv (#628)
|
4 months ago |
Trapper4888
|
ba848b00f3
readme: fix model name typo (#627)
|
4 months ago |
AlpinDale
|
30d7effc7c
feat: add siglip encoder for llava family (#626)
|
4 months ago |
AlpinDale
|
4b02629c6a
add FUNDING.yml
|
4 months ago |
AlpinDale
|
627dd86948
readme: update for v0.6.0
|
4 months ago |
AlpinDale
|
54d6d87f0c
ci: fix the base url
|
4 months ago |
AlpinDale
|
d8d0b9cf26
Create CNAME
|
4 months ago |
AlpinDale
|
32fc9c21f4
ci: deploy under subpath
|
4 months ago |
AlpinDale
|
9f38ec855e
ci: specify pnpm lock path
|
4 months ago |
AlpinDale
|
b21169f161
ci: specify pnpm version
|
4 months ago |
AlpinDale
|
b135e1428e
ci: one last try
|
4 months ago |
AlpinDale
|
ac83688fc5
ci: try again
|
4 months ago |