AlpinDale 0bf916eabd Revert "feat: add support for chunked prefill + prefix caching (#871)" hace 3 semanas
..
adapter_commons 2f61644f6e SPMD optimizations (#824) hace 1 mes
assets 653d1a08d4 feat: add support for audio models (#891) hace 3 semanas
attention 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) hace 3 semanas
common 132aa2abe4 spec decode: add support for EAGLE (#899) hace 3 semanas
distributed 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) hace 3 semanas
endpoints a00ab49e21 api: add client timeouts for the ZeroMQ server (#897) hace 3 semanas
engine 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) hace 3 semanas
executor 65b71f5fcc distributed: fix issue for when nodes have multiple network interfaces (#892) hace 3 semanas
inputs 908ff753a1 fix: phi_3.5_v loading (#896) hace 3 semanas
kv_quant 8a71788372 Add OLMoE (#772) hace 2 meses
lora 369600855a xpu: disable punica kernels for XPU (#864) hace 1 mes
modeling afc9a28aa0 chore: add AphroditeParameter support for FP8 quant (#902) hace 3 semanas
multimodal 653d1a08d4 feat: add support for audio models (#891) hace 3 semanas
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) hace 3 meses
plugins 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) hace 3 semanas
processing 0bf916eabd Revert "feat: add support for chunked prefill + prefix caching (#871)" hace 3 semanas
prompt_adapter 2f61644f6e SPMD optimizations (#824) hace 1 mes
quantization afc9a28aa0 chore: add AphroditeParameter support for FP8 quant (#902) hace 3 semanas
server 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) hace 3 semanas
spec_decode 132aa2abe4 spec decode: add support for EAGLE (#899) hace 3 semanas
task_handler 0bf916eabd Revert "feat: add support for chunked prefill + prefix caching (#871)" hace 3 semanas
transformers_utils 132aa2abe4 spec decode: add support for EAGLE (#899) hace 3 semanas
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) hace 4 meses
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 4 meses
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) hace 3 meses
_custom_ops.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) hace 1 mes
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 4 meses
connections.py c6c91edab7 ci: update & overhaul test units (#769) hace 1 mes
constants.py 2f61644f6e SPMD optimizations (#824) hace 1 mes
py.typed 1c988a48b2 fix logging and add py.typed hace 1 año
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) hace 4 meses
version.py 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) hace 1 mes