AlpinDale a11dee6352 wip 3 miesięcy temu
..
adapter_commons f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
attention 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) 3 miesięcy temu
common a11dee6352 wip 3 miesięcy temu
distributed ef3a0f4cb1 fix: `custom_ar` check (#737) 3 miesięcy temu
endpoints a11dee6352 wip 3 miesięcy temu
engine 28b6397188 chore: quant config for speculative draft models (#719) 3 miesięcy temu
executor 008e646c7e chore: add support for up to 2048 block size (#715) 3 miesięcy temu
inputs 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 3 miesięcy temu
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 10 miesięcy temu
lora d9d85eeb6e chore: register lora functions as torch ops (#732) 3 miesięcy temu
modeling a11dee6352 wip 3 miesięcy temu
multimodal 0b8b407b6d feat: support profiling with multiple multi-modal inputs per prompt (#712) 3 miesięcy temu
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 miesięcy temu
plugins f76f2a5af0 feat: add aphrodite plugin system (#705) 3 miesięcy temu
processing 577586309d chore: multi-step args and sequence modifications (#713) 3 miesięcy temu
prompt_adapter f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
quantization ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 miesięcy temu
server ed9a6f97c1 fix: kill api server when pinging dead engine (#660) 4 miesięcy temu
spec_decode 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) 3 miesięcy temu
task_handler 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 miesięcy temu
transformers_utils 4648f16c84 chore: fix return statement in Detokenizer class (#727) 3 miesięcy temu
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 3 miesięcy temu
_custom_ops.py ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 miesięcy temu
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
py.typed 1c988a48b2 fix logging and add py.typed 1 rok temu
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 miesięcy temu
version.py c744443679 ci: bump to 0.6.1.post1 (#728) 3 miesięcy temu