Commit History

Author SHA1 Message Date
  AlpinDale 709628a74d fix 4 months ago
  AlpinDale b0e420e711 some debug statements 4 months ago
  AlpinDale 31b8636e89 don't do regular top-k/top-p sampling if kernels are enabled 4 months ago
  AlpinDale 815736fc54 feat: add cuda kernels for sampling 4 months ago
  AlpinDale a8d10fcfee chore: add contribution guidelines + Code of Conduct (#507) 6 months ago
  AlpinDale 30f9e1f7c9 remove docker 4 months ago
  AlpinDale 206261a3fb chore: add contribution guidelines + Code of Conduct (#507) 6 months ago
  AlpinDale 9371a33e90 docs: add installation guides 4 months ago
  Ahmed da67a9c8d1 docs: add doc site with example content 4 months ago
  AlpinDale f91991f584 fix: f-string fixes 4 months ago
  AlpinDale acbdc50a71 fix: `vocab_size` field access in llava 4 months ago
  AlpinDale a3e26391e4 chore: add a wrapper for torch.inference_mode decorator 4 months ago
  AlpinDale 79e56506d7 clean up requirements 4 months ago
  AlpinDale a0d031efcc feat: initial text-to-text support for Chameleon model 4 months ago
  AlpinDale 9038dea2df fix: short commit hash import error 4 months ago
  AlpinDale ba371fbbbd feat: AWQ marlin kernels (#603) 4 months ago
  AlpinDale 08373fd1ee fix: asymmetric TP changes breaking the gptq and awq quants (#602) 4 months ago
  AlpinDale ad68d149d8 chore: refactor and decouple phi3v image embedding 4 months ago
  AlpinDale a4cbcfe59f feat: disable logprob serialization to CPU for spec decode 4 months ago
  AlpinDale f6d2f3c4ba move `aphrodite.endpoints.openai.chat_utils` -> `aphrodite.endpoints.chat_utils` 4 months ago
  AlpinDale 055963b252 fix: channel-wise fp8 marlin 4 months ago
  AlpinDale 5d98b7ead1 fix: input_scale for w8a8 is optional 4 months ago
  AlpinDale 408ca43d2e feat: support fbgemm_fp8 quant on ampere 4 months ago
  AlpinDale fa15bad2ea chore: minor AMD fixes 4 months ago
  AlpinDale 9be43994fe feat: fbgemm quantization support (#601) 4 months ago
  AlpinDale cac8163f77 fix: request abort crashing pipeline parallel 4 months ago
  AlpinDale 45a004874c chore: allow specifying custom Executor 4 months ago
  AlpinDale c8f5424d72 add scale_ub inputs to fp8 dynamic per-token quant 4 months ago
  AlpinDale 5f84f0651c docker: install libibverbs by default 4 months ago
  AlpinDale d3c474d219 chore: enable dynamic per-token `fp8` 4 months ago