Historique des commits

Auteur SHA1 Message Date
  AlpinDale d8c4193704 feat: Speculative Decoding using a draft model (#432) il y a 9 mois
  50h100a f67b5be198 chore: port sampler+metadata changes from main to dev (#427) il y a 9 mois
  AlpinDale fa083286e3 Speculative Decoding Part 4: Lookahead scheduling (#402) il y a 10 mois
  AlpinDale 3abc641d68 directly use in forward pass il y a 10 mois
  AlpinDale c3c374396b logprobs fixes il y a 10 mois
  AlpinDale 2efee6bcc6 optimize logprob ranks il y a 10 mois
  AlpinDale 777b6f6d51 add logprob ranks il y a 10 mois
  AlpinDale 0c4ead5e9f min_tokens il y a 10 mois
  AlpinDale d1786645a3 fix formatting il y a 10 mois
  AlpinDale f01c668259 clean up sampler il y a 10 mois
  AlpinDale 78d66f16d1 Chunked Prefill Part 1 (#384) il y a 10 mois
  AlpinDale 9181fa0396 feat: Triton kernels for sampling (#383) il y a 10 mois
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) il y a 10 mois
  50h100a d5dbd29db4 hoist sampler internals into a single function. il y a 10 mois
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) il y a 11 mois
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) il y a 11 mois
  AlpinDale 9fa99215f8 feat: add cubic sampling (#280) il y a 11 mois
  AlpinDale 657aec0cbd refactor: OpenAI endpoint (#261) il y a 11 mois
  AlpinDale 4d04ade9ef feat: fine-grained seeds (#279) il y a 11 mois
  anon998 35b9033782 fix: crash in quadratic sampling when batch > 1 (#253) il y a 1 an
  50h100a f619c96c79 fix: zero token output due to temperature bias (#243) il y a 1 an
  50h100a 53a9c60442 fix: logit processor declarations and application (#242) il y a 1 an
  AlpinDale e73a92ad2f fix: remove the mask for quadratic sampling (#236) il y a 1 an
  AlpinDale 1c46fa31ad feat: add quadratic sampling (#233) il y a 1 an
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) il y a 1 an
  AlpinDale c0aac15421 feat: S-LoRA support (#222) il y a 1 an
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) il y a 1 an
  Stefan Gligorijevic 9e7e108dc8 chore: clamp dynatemp_min (#214) il y a 1 an
  Stefan Gligorijevic 56446a04bb feat: dynamic temperature (#209) il y a 1 an
  AlpinDale d54791aaa8 feat: reduce sampler overhead by making it less blocking (#198) il y a 1 an