Commit History

Author SHA1 Message Date
  AlpinDale 4d04ade9ef feat: fine-grained seeds (#279) 11 months ago
  Stefan Daniel Schwarz 810ca83066 fix+feat: docker compose (#264) 11 months ago
  AlpinDale 16615784b3 fix: prefix cache for turing gpus 11 months ago
  AlpinDale 7dc73a779a fix: properly perform garbage collection for lora (#277) 11 months ago
  AlpinDale 697c06c4f5 fix: LoRA support for mixtral (#276) 11 months ago
  AlpinDale 4b80b42362 fix: memory leaks due to nccl cuda graphs (#275) 11 months ago
  AlpinDale e31c6f0b45 feat: refactor modeling logic and support more models (#274) 11 months ago
  AlpinDale 7d6ba53602 feat: fused top-k kernels for MoE (#273) 11 months ago
  AlpinDale a3cab09b69 chore: logging env variable 11 months ago
  AlpinDale 2c08aa5af4 chore: remove eos token from output (#272) 11 months ago
  AlpinDale 8e1cd54497 fix: do not include fp8 for rocm (#271) 11 months ago
  AlpinDale 6a63ab4ec3 fix: remote encode request if using ray (#270) 11 months ago
  AlpinDale 224b87b484 feat: add fused mixtral moe support (#238) 11 months ago
  Thomas Xin 43cf0e98a0 fix: worker initialization on WSL (#260) 11 months ago
  swadical 0527131e93 fix: grammar logits processor (#268) 11 months ago
  AlpinDale 2370dbcfd8 feat: OPT model support (#266) 11 months ago
  AlpinDale 4360684667 fix: cuda version in wheel 11 months ago
  TearGosling 80e8a14949 feat: add pygchat Jinja template (#218) 11 months ago
  sgsdxzy fe7844f2ef feat: sharding and safetensors support for gguf conversion (#256) 11 months ago
  AlpinDale 8635901c76 fix: s-lora vocab embeddings 11 months ago
  AlpinDale c76b611021 docker: update the Dockerfile and push the latest image (#254) 11 months ago
  anon998 35b9033782 fix: crash in quadratic sampling when batch > 1 (#253) 11 months ago
  AlpinDale 842912d022 feat: on-the-fly gguf conversion (#250) 11 months ago
  AlpinDale faca8745d6 fix: linting issue (#249) 11 months ago
  AlpinDale 3163839c88 bump version to 0.4.9 11 months ago
  AlpinDale f99eb2c874 fix: hadamard tensors not included in wheel 11 months ago
  AlpinDale 8b6790d504 fix: gguf config not recognized 11 months ago
  AlpinDale a1836a40e2 bump version to v0.4.8 11 months ago
  AlpinDale 2bd6c92f73 fix: lora inclusion in wheels 11 months ago
  AlpinDale 8da2be03ce feat: bump version to v0.4.7 (#248) 11 months ago