AlpinDale 7c7ec12f36 chore: refactor executor classes for easier inheritance (#840) 1 month ago
adapter_commons 2f61644f6e SPMD optimizations (#824) 2 months ago
assets c6c91edab7 ci: update & overhaul test units (#769) 2 months ago
attention 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) 4 months ago
common 60f7b828d5 feat: add skew sampling (#834) 2 months ago
distributed 0f1af04cf5 frontend: minor logging improvements (#787) 2 months ago
endpoints 60f7b828d5 feat: add skew sampling (#834) 2 months ago
engine 48a8693aed feat: multi-step scheduling (#831) 2 months ago
executor 7c7ec12f36 chore: refactor executor classes for easier inheritance (#840) 1 month ago
inputs 2f61644f6e SPMD optimizations (#824) 2 months ago
kv_quant 8a71788372 Add OLMoE (#772) 3 months ago
lora 2f61644f6e SPMD optimizations (#824) 2 months ago
modeling 60f7b828d5 feat: add skew sampling (#834) 2 months ago
multimodal 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 4 months ago
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 4 months ago
plugins f76f2a5af0 feat: add aphrodite plugin system (#705) 4 months ago
processing 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) 2 months ago
prompt_adapter 2f61644f6e SPMD optimizations (#824) 2 months ago
quantization f98e7b2f8c feat: add HQQ quantization support (#795) 2 months ago
server 0256ed236b feat: windows support (#790) 2 months ago
spec_decode 16b587c104 fix: hidden states handling in batch expansion for spec decoding (#839) 1 month ago
task_handler 48a8693aed feat: multi-step scheduling (#831) 2 months ago
transformers_utils fb96041ae3 fix: demote skip_special_tokens assertion to logger error (#778) 3 months ago
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 4 months ago
_custom_ops.py bfc8988116 feat: add cuda sampling kernels for top_k and top_p (#828) 2 months ago
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
connections.py c6c91edab7 ci: update & overhaul test units (#769) 2 months ago
constants.py 2f61644f6e SPMD optimizations (#824) 2 months ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
version.py f0e00f1b43 ci: bump to 0.6.3.post1 (#801) 2 months ago