AlpinDale f5bbf07c90 chore: use the `compressed-tensors` library to avoid code reuse (#704) 6 月之前
..
adapter_commons f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
attention 5d37ec1016 suppress tpu import warning (#696) 6 月之前
common 5d37ec1016 suppress tpu import warning (#696) 6 月之前
distributed 0e558e9b2f fix: loading chameleon model with TP>1 (#695) 6 月之前
endpoints 2d044af0e1 chore: spawn engine process from api server process (#703) 6 月之前
engine 7debd35ca2 fix: shut down ray dag workers cleanly (#692) 6 月之前
executor 5d37ec1016 suppress tpu import warning (#696) 6 月之前
inputs 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 6 月之前
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 1 年之前
lora 1394008421 chore: decouple `should_modify_greedy_probs_inplace (#671) 6 月之前
modeling edec2e9a9e feat: migrate awq and awq_marlin to AphroditeParameter (#702) 6 月之前
multimodal 3693028340 feat: support for Audio modality (#698) 6 月之前
platforms 5d37ec1016 suppress tpu import warning (#696) 6 月之前
processing 79d603954e fix: chunked prefill with v2 block manager (#679) 6 月之前
prompt_adapter f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
quantization f5bbf07c90 chore: use the `compressed-tensors` library to avoid code reuse (#704) 6 月之前
server ed9a6f97c1 fix: kill api server when pinging dead engine (#660) 6 月之前
spec_decode 1394008421 chore: decouple `should_modify_greedy_probs_inplace (#671) 6 月之前
task_handler 3693028340 feat: support for Audio modality (#698) 6 月之前
transformers_utils 3648170750 fix: gracefully handle missing chat template (#642) 6 月之前
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
_core_ext.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
_custom_ops.py 5d37ec1016 suppress tpu import warning (#696) 6 月之前
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
py.typed 1c988a48b2 fix logging and add py.typed 1 年之前
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 月之前
version.py db81a67c54 bump to v0.6.0.post1 (#635) 6 月之前