Commit History

Author              | SHA1       | Message | Date
50h100a             | bd564148e2 | Merge branch 'main' of https://github.com/PygmalionAI/aphrodite-engine into ffs | 9 months ago
50h100a             | d3dd170a7d | merge main | 9 months ago
AlpinDale           | 78d66f16d1 | Chunked Prefill Part 1 (#384) | 9 months ago
AlpinDale           | 9181fa0396 | feat: Triton kernels for sampling (#383) | 9 months ago
50h100a             | dc09dc2b4d | Merge branch 'main' into pr_samplers | 9 months ago
AlpinDale           | f8dfac6372 | chore: attention refactor and upstream sync apr01 (#365) | 9 months ago
50h100a             | 7ed57e318d | Overhauled SamplingTensors construction. | 9 months ago
50h100a             | d5dbd29db4 | hoist sampler internals into a single function. | 9 months ago
AlpinDale           | da223153c6 | feat&fix: cohere support and missing GPU blocks (#333) | 9 months ago
AlpinDale           | e42a78381a | feat: switch from pylint to ruff (#322) | 10 months ago
AlpinDale           | 9fa99215f8 | feat: add cubic sampling (#280) | 10 months ago
AlpinDale           | 657aec0cbd | refactor: OpenAI endpoint (#261) | 10 months ago
AlpinDale           | 4d04ade9ef | feat: fine-grained seeds (#279) | 10 months ago
anon998             | 35b9033782 | fix: crash in quadratic sampling when batch > 1 (#253) | 11 months ago
50h100a             | f619c96c79 | fix: zero token output due to temperature bias (#243) | 11 months ago
50h100a             | 53a9c60442 | fix: logit processor declarations and application (#242) | 11 months ago
50h100a             | 2e3318c1fa | yapf considers this space to be CRITICAL | 11 months ago
50h100a             | 25acebe33d | better variable naming | 11 months ago
50h100a             | 7b3bb995c1 | topk as linear write | 11 months ago
AlpinDale           | e73a92ad2f | fix: remove the mask for quadratic sampling (#236) | 11 months ago
AlpinDale           | 1c46fa31ad | feat: add quadratic sampling (#233) | 11 months ago
AlpinDale           | c3a221eb02 | feat: GGUF, QuIP#, and Marlin support (#228) | 11 months ago
AlpinDale           | c0aac15421 | feat: S-LoRA support (#222) | 11 months ago
AlpinDale           | 8fa608aeb7 | feat: replace Ray with NCCL for control plane comms (#221) | 11 months ago
Stefan Gligorijevic | 9e7e108dc8 | chore: clamp dynatemp_min (#214) | 11 months ago
Stefan Gligorijevic | 56446a04bb | feat: dynamic temperature (#209) | 11 months ago
AlpinDale           | d54791aaa8 | feat: reduce sampler overhead by making it less blocking (#198) | 1 year ago
AlpinDale           | 653da510d1 | chore: rewrite InputMetadata (#143) | 1 year ago
AlpinDale           | 6c914ea0e4 | fix: `SequenceOutputs` -> `SequenceOutput` (#133) | 1 year ago
Stefan Gligorijevic | a5255901c6 | feat: min_p sampling (#106) | 1 year ago