AlpinDale bf88c8567e feat: mamba model support (#674) 6 months ago
adapter_commons f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
attention bf88c8567e feat: mamba model support (#674) 6 months ago
common bf88c8567e feat: mamba model support (#674) 6 months ago
distributed 31f82da8bd chore: deduplicate nvlink check to cuda platform (#643) 6 months ago
endpoints 2da6a3ec2b feat: option to apply temperature scaling last (#670) 6 months ago
engine bf88c8567e feat: mamba model support (#674) 6 months ago
executor f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
inputs 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 6 months ago
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
lora 1394008421 chore: decouple `should_modify_greedy_probs_inplace (#671) 6 months ago
modeling bf88c8567e feat: mamba model support (#674) 6 months ago
multimodal 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 6 months ago
platforms 31f82da8bd chore: deduplicate nvlink check to cuda platform (#643) 6 months ago
processing bf88c8567e feat: mamba model support (#674) 6 months ago
prompt_adapter f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
quantization 3f49a55f82 feat: add INT8 W8A16 quant for TPU (#663) 6 months ago
server ed9a6f97c1 fix: kill api server when pinging dead engine (#660) 6 months ago
spec_decode 1394008421 chore: decouple `should_modify_greedy_probs_inplace (#671) 6 months ago
task_handler bf88c8567e feat: mamba model support (#674) 6 months ago
transformers_utils 3648170750 fix: gracefully handle missing chat template (#642) 6 months ago
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
_core_ext.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
_custom_ops.py a401f8e05d feat: per-tensor token epilogue kernels (#630) 6 months ago
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
version.py db81a67c54 bump to v0.6.0.post1 (#635) 6 months ago