.. |
adapter_commons
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
attention
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
common
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
distributed
|
0e558e9b2f
fix: loading chameleon model with TP>1 (#695)
|
3 months ago |
endpoints
|
3693028340
feat: support for Audio modality (#698)
|
3 months ago |
engine
|
7debd35ca2
fix: shut down ray dag workers cleanly (#692)
|
3 months ago |
executor
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
inputs
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
kv_quant
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
9 months ago |
lora
|
1394008421
chore: decouple `should_modify_greedy_probs_inplace (#671)
|
4 months ago |
modeling
|
3693028340
feat: support for Audio modality (#698)
|
3 months ago |
multimodal
|
b9b5e352cb
typo
|
3 months ago |
platforms
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
processing
|
79d603954e
fix: chunked prefill with v2 block manager (#679)
|
3 months ago |
prompt_adapter
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
quantization
|
3f49a55f82
feat: add INT8 W8A16 quant for TPU (#663)
|
4 months ago |
server
|
ed9a6f97c1
fix: kill api server when pinging dead engine (#660)
|
4 months ago |
spec_decode
|
1394008421
chore: decouple `should_modify_greedy_probs_inplace (#671)
|
4 months ago |
task_handler
|
3693028340
feat: support for Audio modality (#698)
|
3 months ago |
transformers_utils
|
0e08cb1c12
add ultravox config file
|
3 months ago |
triton_utils
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
__init__.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
_core_ext.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
_custom_ops.py
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
_ipex_ops.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
py.typed
|
1c988a48b2
fix logging and add py.typed
|
1 year ago |
scalar_type.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
version.py
|
db81a67c54
bump to v0.6.0.post1 (#635)
|
4 months ago |