Commit History

Author SHA1 Message Date
  AlpinDale e75daadfbd don't copy LogitsProcessors 10 months ago
  AlpinDale fd2c06e6b4 defensively copy 10 months ago
  AlpinDale 5638e08d83 Merge branch 'main' into sampler_order_v2 10 months ago
  AlpinDale 16615784b3 fix: prefix cache for turing gpus 10 months ago
  AlpinDale 7dc73a779a fix: properly perform garbage collection for lora (#277) 10 months ago
  AlpinDale 697c06c4f5 fix: LoRA support for mixtral (#276) 10 months ago
  AlpinDale 4b80b42362 fix: memory leaks due to nccl cuda graphs (#275) 10 months ago
  AlpinDale e31c6f0b45 feat: refactor modeling logic and support more models (#274) 10 months ago
  AlpinDale 7d6ba53602 feat: fused top-k kernels for MoE (#273) 10 months ago
  AlpinDale a3cab09b69 chore: logging env variable 10 months ago
  AlpinDale 2c08aa5af4 chore: remove eos token from output (#272) 10 months ago
  AlpinDale 8e1cd54497 fix: do not include fp8 for rocm (#271) 10 months ago
  AlpinDale 6a63ab4ec3 fix: remote encode request if using ray (#270) 10 months ago
  AlpinDale 224b87b484 feat: add fused mixtral moe support (#238) 10 months ago
  Thomas Xin 43cf0e98a0 fix: worker initialization on WSL (#260) 10 months ago
  swadical 0527131e93 fix: grammar logits processor (#268) 10 months ago
  Stefan Gligorijevic a41366fea1 multiple fixes 10 months ago
  AlpinDale 2370dbcfd8 feat: OPT model support (#266) 10 months ago
  AlpinDale 6b8faa7ee8 yapf 10 months ago
  AlpinDale 177646864a Merge branch 'main' into sampler_order_v2 10 months ago
  AlpinDale 4360684667 fix: cuda version in wheel 10 months ago
  TearGosling 80e8a14949 feat: add pygchat Jinja template (#218) 11 months ago
  Stefan Gligorijevic af72d6bf73 silly oversight 11 months ago
  Stefan Gligorijevic a744bf95d9 better(and correct) fix for quad 11 months ago
  Stefan Gligorijevic bc040c78b7 only apply quadratic sampling when requested 11 months ago
  Stefan Gligorijevic 0cfc7e364f properly set default order 11 months ago
  Stefan Gligorijevic 57d822b4c8 another bit missed in merge conflict resolution 11 months ago
  Stefan Gligorijevic d6282a2f65 incorrectly resolved merge 11 months ago
  Stefan Gligorijevic 916b252524 fix more missing bits 11 months ago
  Stefan Gligorijevic 53017ce4f3 fix? sampling params for quadratic 11 months ago