Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 22422543ce feat: add no_repeat_ngram sampler hai 1 mes
  Selali 4c4a365f77 feat: Add DRY (Don't Repeat Yourself) sampling (#827) hai 1 mes
  AlpinDale 48a8693aed feat: multi-step scheduling (#831) hai 1 mes
  AlpinDale 2242cb25dc fix: unbound tokenizer error hai 1 mes
  AlpinDale 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) hai 1 mes
  AlpinDale 22425b689d fix: XPU build hai 1 mes
  AlpinDale bfc8988116 feat: add cuda sampling kernels for top_k and top_p (#828) hai 1 mes
  AlpinDale 22427602eb feat: add top-nsigma sampling method hai 1 mes
  AlpinDale 22429e4a10 fix: sampler test with new transformers version hai 1 mes
  AlpinDale 2f61644f6e SPMD optimizations (#824) hai 1 mes
  AlpinDale 32a37e8107 tests: partially fix tensorizer and logprobs tests hai 1 mes
  AlpinDale 7f1c9af5e2 fix: fp8 quant test hai 1 mes
  AlpinDale 173ac23399 fix: experts int8 quant test hai 1 mes
  AlpinDale 68f050129d fix: lora worker manager test import hai 1 mes
  AlpinDale 3661de812d fix: lora layer test hai 1 mes
  AlpinDale 0a369f9171 feat: support chunked prefill with LoRA (#823) hai 1 mes
  AlpinDale e5b1afe625 feat: add chat method for LLM class (#822) hai 1 mes
  AlpinDale 262cbc63b7 fix: vision api test template path hai 1 mes
  AlpinDale b0113a1eaa fix: tokenization api test (#821) hai 1 mes
  AlpinDale c6c91edab7 ci: update & overhaul test units (#769) hai 1 mes
  AlpinDale f088ea81c7 fix: --max-seq-len-to-capture arg (#818) hai 1 mes
  50h100a a5346b2ea5 Merge pull request #814 from PygmalionAI/50h100a-temp-fix hai 1 mes
  50h100a 273c61d406 guard against nan temperature from dynatemp (or anywhere else). hai 1 mes
  50h100a a22e887319 why we don't use the github website editor to make changes hai 1 mes
  50h100a 54a8320638 logit shenanigans to prevent even worse shenanigans hai 1 mes
  50h100a b6a897d2a1 fix temperature, and address those pernicious dynatemp NaNs hai 1 mes
  50h100a a61d00fad7 Merge pull request #813 from PygmalionAI/50h100a-patch-1 hai 1 mes
  50h100a 83040c6389 Mask dynatemp using min/max, rather than exp hai 1 mes
  AlpinDale 2fa112f86b feat: update to serviceinfo v0.2 (#808) hai 2 meses
  AlpinDale 72fbfa1b5b feat: add serviceinfo endpoint (#807) hai 2 meses