Commit History

Author SHA1 Message Date
  AlpinDale 867939a6db bring back cuda kernels for lroa 3 months ago
  AlpinDale 0e0bd02b52 ci: bump version to 0.6.2 (#758) 3 months ago
  AlpinDale 5878e887f2 docs: update readme and docs (#757) 3 months ago
  AlpinDale d7309453f6 fix: add pandas to requirements (#756) 3 months ago
  AlpinDale 73177656ed feat: quant_llm support (#755) 3 months ago
  AlpinDale ad181e3fef feat: bring back dynatemp (#754) 3 months ago
  AlpinDale 6329c2d53f chore: re-enable custom token bans (#751) 3 months ago
  Ahmed 55261b09d6 ci: fix docs deployment (#750) 3 months ago
  Ahmed aecd80a47b Merge pull request #749 from PygmalionAI/ci/fix-pnpm-install 3 months ago
  Ahmed 4435a443e1 ci: fix dep install using pnpm 3 months ago
  AlpinDale abd9d5799a feat: add XTC Sampling (#740) 3 months ago
  AlpinDale 4434c4db84 chore: refactor llama3 rope (#748) 3 months ago
  AlpinDale 9d9722b1c1 fix: metrics endpoint with RPC server (#747) 3 months ago
  AlpinDale 81c5f196eb chore: various TPU fixes and optimizations (#746) 3 months ago
  AlpinDale 89a2c6dee1 chore: refactor `MultiModalConfig` initialization and profiling (#745) 3 months ago
  AlpinDale 1068597e8a fix: minor bug fixes & clean-ups (#744) 3 months ago
  Geun, Lim 08711d2ac9 feat: add Exaone model support (#743) 3 months ago
  AlpinDale 81c28d2a7f fix: use nvml to get consistent device names (#739) 3 months ago
  AlpinDale 5559c5886f fix: clear engine ref in RPC server (#738) 3 months ago
  AlpinDale ef3a0f4cb1 fix: `custom_ar` check (#737) 3 months ago
  AlpinDale ccbda97416 fix: types in AQLM and GGUF for dynamo support (#736) 3 months ago
  AlpinDale 9296d4b25d feat: dynamo support for ScalarType (#733) 3 months ago
  AlpinDale d9d85eeb6e chore: register lora functions as torch ops (#732) 3 months ago
  AlpinDale 7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731) 3 months ago
  AlpinDale d34e083c48 feat: add experts_int8 support (#730) 3 months ago
  AlpinDale b0f262eec1 feat: FP8 quantization support for AMD ROCm (#729) 3 months ago
  AlpinDale c744443679 ci: bump to 0.6.1.post1 (#728) 3 months ago
  miku448 9c0e7d95c8 fix: libcudart path for some versions of pytorch (#726) 3 months ago
  AlpinDale 4648f16c84 chore: fix return statement in Detokenizer class (#727) 3 months ago
  AlpinDale a286adaeaa feat: launch API server with uvloop (#725) 3 months ago