Commit History

Author SHA1 Message Date
  AlpinDale 46159b107a formatting: pt1 9 months ago
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) 9 months ago
  AlpinDale f894f7b176 Revert "reduce dedupe by wrapping in general worker class" 10 months ago
  AlpinDale 9fff6fb507 reduce dedupe by wrapping in general worker class 10 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
  AlpinDale e3252edd07 fix: remove event and stream, add typing (#382) 11 months ago
  AlpinDale 33b3786175 fix: cache neuron checks (#379) 11 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 1 year ago
  AlpinDale e53842bd5d fix: cuda home detection for fp8 kv cache 1 year ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
  AlpinDale c2d77b1822 chore: logging refactor (#302) 1 year ago
  AlpinDale 9810daa699 feat: INT8 KV Cache (#298) 1 year ago
  AlpinDale ea0f57b233 feat: allow further support for non-cuda devices (#247) 1 year ago
  AlpinDale 31c95011a6 feat: FP8 E5M2 KV Cache (#226) 1 year ago
  AlpinDale c0aac15421 feat: S-LoRA support (#222) 1 year ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
  AlpinDale f013d714c0 chore: merge dev branch into main (#177) 1 year ago
  AlpinDale 2755a48d51 merge dev branch into main (#153) 1 year ago
  g4rg 28fa4ae213 fix: cpu memory limit detection for containers (#103) 1 year ago
  AlpinDale efc6f7fbec chore: reformats (#90) 1 year ago
  AlpinDale 75c27d3e65 massive overhaul 1 year ago
  AlpinDale b8f4337c5b chore: various fixes 1 year ago
  AlpinDale fefbf029c9 revert previous commit 1 year ago
  AlpinDale 964ac344b2 Deploying to main from @ PygmalionAI/aphrodite-engine@9ae65dd2fe38acf8186d4a8d9ea3e54fc8e523e9 🚀 1 year ago
  AlpinDale b02c4f6060 chore: re-arranging 1 year ago