Commit History

Author SHA1 Message Date
  AlpinDale 80be38ca6f chore: expose phi3_v num_crops as an mm_processor_kwargs (#1117) 1 week ago
  AlpinDale 18e0b0e932 chore: support loading weights by ID within models (#1116) 1 week ago
  AlpinDale 949f974c59 (1/N) XQA: integrate the XQA CUDA kernels within Aphrodite (#1115) 1 week ago
  AlpinDale 2cd6ef2d5a misc: skip dumping inputs when unpicklable 2 weeks ago
  AlpinDale be0b0c13ca tests: update scheduler tests (#1113) 2 weeks ago
  AlpinDale 5b03d67abb Core: add output streaming support to multi-step + async (#1112) 2 weeks ago
  AlpinDale b20c4570d2 CI: bump aphrodite-engine to v0.6.6 (#1111) 2 weeks ago
  AlpinDale 7c825e50be fix: correct FP8 support check on Ada+ GPUs by using compressed-tensors (#1110) 2 weeks ago
  AlpinDale 2bb9c9c399 Revert "CI: use self-hosted runner for the build job" 2 weeks ago
  AlpinDale c71d2cf814 CI: use self-hosted runner for the build job 2 weeks ago
  AlpinDale 5c00851691 tests: fix ruff for llava onevision tests 2 weeks ago
  AlpinDale 6d8df254c7 LoRA: skip loading unsupported weight modules (#1109) 2 weeks ago
  AlpinDale f20f5c3491 samplers: improved DRY performance (#1108) 2 weeks ago
  AlpinDale 2dc917fcfd fix: install the headless opencv 2 weeks ago
  AlpinDale eb1ffacf74 Spec Decoding: fix typical acceptance sampler with correct recovered tok IDs (#1106) 2 weeks ago
  AlpinDale 76088aa43a distributed: allow IPv6 in APHRODITE_HOST_IP with ZMQ (#1105) 2 weeks ago
  AlpinDale 69cf654901 LoRA: add assertions for SGMV kernels to avoid incorrect results (#1104) 2 weeks ago
  AlpinDale c90abcc603 VLM: add pipeline parallelism support for Qwen2-VL (#1103) 2 weeks ago
  AlpinDale cc5e185795 VLM: support passing multimodal processor kwargs (#1102) 2 weeks ago
  AlpinDale 1448857bd3 XPU: fix docker build 2 weeks ago
  AlpinDale c36dd3a4b6 build: fix CPU CMake compilation 2 weeks ago
  AlpinDale 98e174b1f4 build: fix cutlass fetch warning 2 weeks ago
  AlpinDale a0f0160b79 spec decode: remove dead code from draft bonus tokens (#1101) 2 weeks ago
  AlpinDale a5bfc2bc3d VLM: add support for LLaVA-Onevision model (#1100) 2 weeks ago
  AlpinDale d44da0332c misc: rename `CudaMemoryProfiler` to `DeviceMemoryProfiler` (#1099) 2 weeks ago
  AlpinDale 7ce3174039 VLM: refactor blip models to support composite weight loading (#1098) 2 weeks ago
  AlpinDale 91d03c04d2 VLM: refactor composite weight loading logic (#1097) 2 weeks ago
  AlpinDale b65449b5ad moe: refactor DBRX experts to support FusedMoE (#1095) 2 weeks ago
  AlpinDale ed63c079f7 Triton: remove atomic add op from awq triton (#1094) 2 weeks ago
  AlpinDale 651678d2df VLM: use `SequenceData.from_token_counts` to create dummy data (#1093) 2 weeks ago