.. |
adapter_commons
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
attention
|
308501daa5
fix: default api port and attention selector (#634)
|
4 months ago |
common
|
926ccfd387
exponent is 1.0 by default
|
4 months ago |
distributed
|
31f82da8bd
chore: deduplicate nvlink check to cuda platform (#643)
|
4 months ago |
endpoints
|
48f7216c49
add to procotol
|
4 months ago |
engine
|
77c4fbd5c9
fix: better async request cancellation (#641)
|
4 months ago |
executor
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
inputs
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
kv_quant
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
9 months ago |
lora
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
modeling
|
8f5adc5a5e
make function consistent with comments:
|
4 months ago |
multimodal
|
f5cca12da8
feat: multi-image input for minicpmv (#628)
|
4 months ago |
platforms
|
31f82da8bd
chore: deduplicate nvlink check to cuda platform (#643)
|
4 months ago |
processing
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
prompt_adapter
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
quantization
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
server
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
spec_decode
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
task_handler
|
c147670c13
fix: clean up incorrect log in worker (#636)
|
4 months ago |
transformers_utils
|
3648170750
fix: gracefully handle missing chat template (#642)
|
4 months ago |
triton_utils
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
__init__.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
_core_ext.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
_custom_ops.py
|
a401f8e05d
feat: per-tensor token epilogue kernels (#630)
|
4 months ago |
_ipex_ops.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
py.typed
|
1c988a48b2
fix logging and add py.typed
|
1 year ago |
scalar_type.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
version.py
|
db81a67c54
bump to v0.6.0.post1 (#635)
|
4 months ago |