1
0
AlpinDale 709628a74d fix 5 сар өмнө
..
adapter_commons 99680b2d23 feat: soft prompts (#589) 5 сар өмнө
attention fa15bad2ea chore: minor AMD fixes 5 сар өмнө
common ba371fbbbd feat: AWQ marlin kernels (#603) 5 сар өмнө
distributed f91991f584 fix: f-string fixes 5 сар өмнө
endpoints f91991f584 fix: f-string fixes 5 сар өмнө
engine a4cbcfe59f feat: disable logprob serialization to CPU for spec decode 5 сар өмнө
executor 45a004874c chore: allow specifying custom Executor 5 сар өмнө
inputs 4f7d212b70 feat: remove vision language config 5 сар өмнө
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 10 сар өмнө
lora f91991f584 fix: f-string fixes 5 сар өмнө
modeling 709628a74d fix 5 сар өмнө
multimodal f91991f584 fix: f-string fixes 5 сар өмнө
platforms a3e26391e4 chore: add a wrapper for torch.inference_mode decorator 5 сар өмнө
processing 6ac658b0d6 some small performance improvements 5 сар өмнө
prompt_adapter f91991f584 fix: f-string fixes 5 сар өмнө
quantization ba371fbbbd feat: AWQ marlin kernels (#603) 5 сар өмнө
spec_decode a4cbcfe59f feat: disable logprob serialization to CPU for spec decode 5 сар өмнө
task_handler f91991f584 fix: f-string fixes 5 сар өмнө
transformers_utils 45a004874c chore: allow specifying custom Executor 5 сар өмнө
triton_utils c8d398a4ae feat: add custom triton cache manager 5 сар өмнө
__init__.py 0c17c2a8a7 chore: add commit hash, clean up engine logs 5 сар өмнө
_custom_ops.py 815736fc54 feat: add cuda kernels for sampling 5 сар өмнө
_ipex_ops.py 9d7beaa5b9 chore: separate kv_scale into k_scale and v_scale 5 сар өмнө
py.typed 1c988a48b2 fix logging and add py.typed 1 жил өмнө
version.py 9038dea2df fix: short commit hash import error 5 сар өмнө