Commit History

Autor SHA1 Mensaxe Data
  AlpinDale e75daadfbd don't copy LogitsProcessors hai 10 meses
  AlpinDale fd2c06e6b4 defensively copy hai 10 meses
  AlpinDale 5638e08d83 Merge branch 'main' into sampler_order_v2 hai 10 meses
  AlpinDale 16615784b3 fix: prefix cache for turing gpus hai 10 meses
  AlpinDale 7dc73a779a fix: properly perform garbage collection for lora (#277) hai 10 meses
  AlpinDale 697c06c4f5 fix: LoRA support for mixtral (#276) hai 10 meses
  AlpinDale 4b80b42362 fix: memory leaks due to nccl cuda graphs (#275) hai 10 meses
  AlpinDale e31c6f0b45 feat: refactor modeling logic and support more models (#274) hai 10 meses
  AlpinDale 7d6ba53602 feat: fused top-k kernels for MoE (#273) hai 10 meses
  AlpinDale a3cab09b69 chore: logging env variable hai 10 meses
  AlpinDale 2c08aa5af4 chore: remove eos token from output (#272) hai 10 meses
  AlpinDale 8e1cd54497 fix: do not include fp8 for rocm (#271) hai 10 meses
  AlpinDale 6a63ab4ec3 fix: remote encode request if using ray (#270) hai 10 meses
  AlpinDale 224b87b484 feat: add fused mixtral moe support (#238) hai 10 meses
  Thomas Xin 43cf0e98a0 fix: worker initialization on WSL (#260) hai 10 meses
  swadical 0527131e93 fix: grammar logits processor (#268) hai 10 meses
  Stefan Gligorijevic a41366fea1 multiple fixes hai 10 meses
  AlpinDale 2370dbcfd8 feat: OPT model support (#266) hai 10 meses
  AlpinDale 6b8faa7ee8 yapf hai 10 meses
  AlpinDale 177646864a Merge branch 'main' into sampler_order_v2 hai 10 meses
  AlpinDale 4360684667 fix: cuda version in wheel hai 10 meses
  TearGosling 80e8a14949 feat: add pygchat Jinja template (#218) hai 11 meses
  Stefan Gligorijevic af72d6bf73 silly oversight hai 11 meses
  Stefan Gligorijevic a744bf95d9 better(and correct) fix for quad hai 11 meses
  Stefan Gligorijevic bc040c78b7 only apply quadratic sampling when requested hai 11 meses
  Stefan Gligorijevic 0cfc7e364f properly set default order hai 11 meses
  Stefan Gligorijevic 57d822b4c8 another bit missed in merge conflict resolution hai 11 meses
  Stefan Gligorijevic d6282a2f65 incorrectly resolved merge hai 11 meses
  Stefan Gligorijevic 916b252524 fix more missing bits hai 11 meses
  Stefan Gligorijevic 53017ce4f3 fix? sampling params for quadratic hai 11 meses