50h100a 273c61d406 guard against nan temperature from dynatemp (or anywhere else). 3 months ago
..
adapter_commons f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
attention 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) 5 months ago
common 0f1af04cf5 frontend: minor logging improvements (#787) 4 months ago
distributed 0f1af04cf5 frontend: minor logging improvements (#787) 4 months ago
endpoints 2fa112f86b feat: update to serviceinfo v0.2 (#808) 4 months ago
engine 0256ed236b feat: windows support (#790) 4 months ago
executor 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 5 months ago
inputs 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 5 months ago
kv_quant 8a71788372 Add OLMoE (#772) 5 months ago
lora d9d85eeb6e chore: register lora functions as torch ops (#732) 5 months ago
modeling 273c61d406 guard against nan temperature from dynatemp (or anywhere else). 3 months ago
multimodal 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 5 months ago
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 5 months ago
plugins f76f2a5af0 feat: add aphrodite plugin system (#705) 6 months ago
processing 577586309d chore: multi-step args and sequence modifications (#713) 6 months ago
prompt_adapter f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
quantization f98e7b2f8c feat: add HQQ quantization support (#795) 4 months ago
server 0256ed236b feat: windows support (#790) 4 months ago
spec_decode 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 5 months ago
task_handler 0f1af04cf5 frontend: minor logging improvements (#787) 4 months ago
transformers_utils fb96041ae3 fix: demote skip_special_tokens assertion to logger error (#778) 4 months ago
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 5 months ago
_custom_ops.py f98e7b2f8c feat: add HQQ quantization support (#795) 4 months ago
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 months ago
version.py f0e00f1b43 ci: bump to 0.6.3.post1 (#801) 4 months ago