50h100a d49ead7334 reduce sampler peak memory usage 3 minggu lalu
..
adapter_commons 2f61644f6e SPMD optimizations (#824) 1 bulan lalu
assets 653d1a08d4 feat: add support for audio models (#891) 3 minggu lalu
attention 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 minggu lalu
common 132aa2abe4 spec decode: add support for EAGLE (#899) 3 minggu lalu
distributed 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 minggu lalu
endpoints 6fbab320e7 api: error suppression cleanup + timeout suppression on aborts (#905) 3 minggu lalu
engine 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 minggu lalu
executor d69273bd2b ray: better error when placement group topology is incorrect (#906) 3 minggu lalu
inputs 908ff753a1 fix: phi_3.5_v loading (#896) 3 minggu lalu
kv_quant 8a71788372 Add OLMoE (#772) 2 bulan lalu
lora 369600855a xpu: disable punica kernels for XPU (#864) 1 bulan lalu
modeling d49ead7334 reduce sampler peak memory usage 3 minggu lalu
multimodal 653d1a08d4 feat: add support for audio models (#891) 3 minggu lalu
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 bulan lalu
plugins 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 minggu lalu
processing bc1a2bdf98 do not use cached chunks for prompt_logprobs 3 minggu lalu
prompt_adapter 2f61644f6e SPMD optimizations (#824) 1 bulan lalu
quantization afc9a28aa0 chore: add AphroditeParameter support for FP8 quant (#902) 3 minggu lalu
server 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 minggu lalu
spec_decode ab533e0e60 spec decode: fix logprobs when using speculative decoding (#904) 3 minggu lalu
task_handler 132aa2abe4 spec decode: add support for EAGLE (#899) 3 minggu lalu
transformers_utils 132aa2abe4 spec decode: add support for EAGLE (#899) 3 minggu lalu
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 4 bulan lalu
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 bulan lalu
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 3 bulan lalu
_custom_ops.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 bulan lalu
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 bulan lalu
connections.py c6c91edab7 ci: update & overhaul test units (#769) 1 bulan lalu
constants.py 2f61644f6e SPMD optimizations (#824) 1 bulan lalu
py.typed 1c988a48b2 fix logging and add py.typed 1 tahun lalu
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 bulan lalu
version.py 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) 1 bulan lalu