AlpinDale c1c37c755d bump version to 0.6.0 il y a 4 mois
..
adapter_commons 6b1fdd07bd chore: add isort and refactor formatting script and utils il y a 4 mois
attention 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
common 2b85ffb1a5 chore: minor cleanups il y a 4 mois
distributed 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
endpoints 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
engine 28946766fb fix: allow loading GGUF model without .gguf extension il y a 4 mois
executor 87694c8aba feat: add RPC server and client via ZMQ (#615) il y a 4 mois
inputs 1ab2dad198 Refactor prompt processing (#605) il y a 4 mois
kv_quant e42a78381a feat: switch from pylint to ruff (#322) il y a 10 mois
lora 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
modeling 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
multimodal 705e50f4bd fix: broadcasting logic for multi_modal_kwargs il y a 4 mois
platforms a3e26391e4 chore: add a wrapper for torch.inference_mode decorator il y a 4 mois
processing 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
prompt_adapter 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
quantization 208cd5405f fix: cpu offloading with gptq il y a 4 mois
server 040e5af52b refactor: factor out code for running uvicorn again il y a 4 mois
spec_decode 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
task_handler 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
transformers_utils 28946766fb fix: allow loading GGUF model without .gguf extension il y a 4 mois
triton_utils 4d4e767838 ci: take one of fixing lint issues il y a 4 mois
__init__.py 0c17c2a8a7 chore: add commit hash, clean up engine logs il y a 4 mois
_core_ext.py 141672a0d4 kernels: disambiguate quantized types via a new ScalarType il y a 4 mois
_custom_ops.py 0e6c400b13 feat: re-add GGUF (#600) il y a 4 mois
_ipex_ops.py 9d7beaa5b9 chore: separate kv_scale into k_scale and v_scale il y a 4 mois
py.typed 1c988a48b2 fix logging and add py.typed il y a 1 an
scalar_type.py 141672a0d4 kernels: disambiguate quantized types via a new ScalarType il y a 4 mois
version.py c1c37c755d bump version to 0.6.0 il y a 4 mois