AlpinDale b0a8169b54 core: do not compile for profiling 2 weeks ago
..
adapter_commons 2f61644f6e SPMD optimizations (#824) 1 month ago
assets 653d1a08d4 feat: add support for audio models (#891) 3 weeks ago
attention f7f3fed265 feat: add async postprocessor (#925) 2 weeks ago
common d46e70ac98 api: add inline model loading (#928) 2 weeks ago
distributed 2940da2c7b distributed: fix custom allreduce p2p cache file generation (#922) 3 weeks ago
endpoints a3c03db735 fix: inline model loading conflicts with lora (#930) 2 weeks ago
engine 8d9f1fd4e6 feat: add single user mode (#927) 2 weeks ago
executor 0c6d90dade neuron: add support for tensor parallelism (#923) 3 weeks ago
inputs 908ff753a1 fix: phi_3.5_v loading (#896) 3 weeks ago
kv_quant 8a71788372 Add OLMoE (#772) 2 months ago
lora 369600855a xpu: disable punica kernels for XPU (#864) 1 month ago
modeling 59d1d59028 api: support aphrodite_config.yaml with inline loading (#929) 2 weeks ago
multimodal 653d1a08d4 feat: add support for audio models (#891) 3 weeks ago
platforms 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 months ago
plugins 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago
processing f7f3fed265 feat: add async postprocessor (#925) 2 weeks ago
prompt_adapter 2f61644f6e SPMD optimizations (#824) 1 month ago
quantization f7f3fed265 feat: add async postprocessor (#925) 2 weeks ago
server 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago
spec_decode 3b684a8a54 spec decode: streamline batch expansion tensor manipulation (#918) 3 weeks ago
task_handler b0a8169b54 core: do not compile for profiling 2 weeks ago
transformers_utils 132aa2abe4 spec decode: add support for EAGLE (#899) 3 weeks ago
triton_utils f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
__init__.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
_core_ext.py 9296d4b25d feat: dynamo support for ScalarType (#733) 3 months ago
_custom_ops.py 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) 1 month ago
_ipex_ops.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
connections.py c6c91edab7 ci: update & overhaul test units (#769) 1 month ago
constants.py 2f61644f6e SPMD optimizations (#824) 1 month ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago
scalar_type.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
version.py 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) 1 month ago