AlpinDale 1405051912 attention: add `AttentionState` abstraction (#863) 3 mesiacov pred
..
adapter_commons 2f61644f6e SPMD optimizations (#824) 3 mesiacov pred
assets c6c91edab7 ci: update & overhaul test units (#769) 3 mesiacov pred
attention 1405051912 attention: add `AttentionState` abstraction (#863) 3 mesiacov pred
common 3392b81bf9 sampler: allow parsing sampler order using strings (#858) 3 mesiacov pred
distributed 0f1af04cf5 frontend: minor logging improvements (#787) 4 mesiacov pred
endpoints 3392b81bf9 sampler: allow parsing sampler order using strings (#858) 3 mesiacov pred
engine 48a8693aed feat: multi-step scheduling (#831) 3 mesiacov pred
executor 9094a8a2a3 xpu: refactor XPU worker & executor (#861) 3 mesiacov pred
inputs 2f61644f6e SPMD optimizations (#824) 3 mesiacov pred
kv_quant 8a71788372 Add OLMoE (#772) 4 mesiacov pred
lora 2f61644f6e SPMD optimizations (#824) 3 mesiacov pred
modeling 0035dc42ed sampler: optimize DRY performance using z-algorithm (#856) 3 mesiacov pred
multimodal 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 5 mesiacov pred
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 5 mesiacov pred
plugins f76f2a5af0 feat: add aphrodite plugin system (#705) 6 mesiacov pred
processing 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) 3 mesiacov pred
prompt_adapter 2f61644f6e SPMD optimizations (#824) 3 mesiacov pred
quantization 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 3 mesiacov pred
server 0256ed236b feat: windows support (#790) 4 mesiacov pred
spec_decode 1405051912 attention: add `AttentionState` abstraction (#863) 3 mesiacov pred
task_handler 1405051912 attention: add `AttentionState` abstraction (#863) 3 mesiacov pred
transformers_utils fb96041ae3 fix: demote skip_special_tokens assertion to logger error (#778) 4 mesiacov pred
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 6 mesiacov pred
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 mesiacov pred
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 5 mesiacov pred
_custom_ops.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 3 mesiacov pred
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 mesiacov pred
connections.py c6c91edab7 ci: update & overhaul test units (#769) 3 mesiacov pred
constants.py 2f61644f6e SPMD optimizations (#824) 3 mesiacov pred
py.typed 1c988a48b2 fix logging and add py.typed 1 rok pred
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 6 mesiacov pred
version.py 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) 3 mesiacov pred