1
0
drummerv e59dd4a90d fix: openai gguf chat template (#312) 10 сар өмнө
..
common c41462cfcd feat: exllamav2 quantization (#305) 10 сар өмнө
endpoints e59dd4a90d fix: openai gguf chat template (#312) 10 сар өмнө
engine c41462cfcd feat: exllamav2 quantization (#305) 10 сар өмнө
kv_quant 9810daa699 feat: INT8 KV Cache (#298) 10 сар өмнө
lora a1d8ab9f3e fix: lora on quantized models (barred gguf) (#292) 10 сар өмнө
modeling 968bde81bf fix: tensor parallel with GPTQ and AWQ quants (#307) 10 сар өмнө
processing c2d77b1822 chore: logging refactor (#302) 10 сар өмнө
task_handler c2d77b1822 chore: logging refactor (#302) 10 сар өмнө
transformers_utils e59dd4a90d fix: openai gguf chat template (#312) 10 сар өмнө
__init__.py ff898c2c80 bump version to 0.5.0 (#303) 10 сар өмнө
py.typed 1c988a48b2 fix logging and add py.typed 1 жил өмнө