AlpinDale 2d7929d3b7 fix: flashinfer crash with uneven attention group_size 1 month ago
..
adapter_commons 2f61644f6e SPMD optimizations (#824) 1 month ago
assets c6c91edab7 ci: update & overhaul test units (#769) 1 month ago
attention 2d7929d3b7 fix: flashinfer crash with uneven attention group_size 1 month ago
common 9fc6473b18 server: log the process occupying our port (#866) 1 month ago
distributed 0f1af04cf5 frontend: minor logging improvements (#787) 2 months ago
endpoints 3392b81bf9 sampler: allow parsing sampler order using strings (#858) 1 month ago
engine 5bd4473bb6 async: avoid premature exit in the async generator (#872) 1 month ago
executor db96c2daa3 executor: pipe `worker_class_fn` arg in executor (#865) 1 month ago
inputs 2f61644f6e SPMD optimizations (#824) 1 month ago
kv_quant 8a71788372 Add OLMoE (#772) 2 months ago
lora 369600855a xpu: disable punica kernels for XPU (#864) 1 month ago
modeling ef99a567b6 fix: temp_last warning being repeated for every output token (#869) 1 month ago
multimodal 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 3 months ago
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 months ago
plugins f76f2a5af0 feat: add aphrodite plugin system (#705) 3 months ago
processing abfd4465ca feat: add support for chunked prefill + prefix caching (#871) 1 month ago
prompt_adapter 2f61644f6e SPMD optimizations (#824) 1 month ago
quantization 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
server 9fc6473b18 server: log the process occupying our port (#866) 1 month ago
spec_decode 1405051912 attention: add `AttentionState` abstraction (#863) 1 month ago
task_handler abfd4465ca feat: add support for chunked prefill + prefix caching (#871) 1 month ago
transformers_utils fb96041ae3 fix: demote skip_special_tokens assertion to logger error (#778) 2 months ago
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 3 months ago
_custom_ops.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
connections.py c6c91edab7 ci: update & overhaul test units (#769) 1 month ago
constants.py 2f61644f6e SPMD optimizations (#824) 1 month ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
version.py 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) 1 month ago