Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 2242b356a0 apply dry first hai 1 mes
  AlpinDale 2242c5756a add co-author hai 1 mes
  AlpinDale 224219dd6d fix dry penalties hai 1 mes
  AlpinDale 22424ec91c fix: init the prompt and output token tensors for dry hai 1 mes
  AlpinDale 2242958d91 fix: sequence breaker ids are a list of int hai 1 mes
  AlpinDale 22428f934d formatting hai 1 mes
  AlpinDale 224256d679 take sequence breakers as both a string literal list and a List[int] hai 1 mes
  AlpinDale 22427af50b Merge branch 'main' into dry-sampler hai 1 mes
  AlpinDale 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) hai 1 mes
  AlpinDale 22425b689d fix: XPU build hai 1 mes
  AlpinDale bfc8988116 feat: add cuda sampling kernels for top_k and top_p (#828) hai 1 mes
  Selali Adobor aa7e47cf92 Revert "Refactor based on closer examination of other examples" hai 1 mes
  Selali Adobor d9562b96f8 Refactor based on closer examination of other examples hai 1 mes
  Selali Adobor 07e1175587 Fix naming bug hai 1 mes
  Selali Adobor c63672dce3 Fix refactor bug hai 1 mes
  Selali Adobor 81ae51b1e8 Set useful defaults for DRY API hai 1 mes
  Selali Adobor 5c6fa79193 feat(sampling): Add DRY (Do not Repeat Yourself) sampling hai 1 mes
  AlpinDale 22427602eb feat: add top-nsigma sampling method hai 1 mes
  AlpinDale 22429e4a10 fix: sampler test with new transformers version hai 1 mes
  AlpinDale 2f61644f6e SPMD optimizations (#824) hai 1 mes
  AlpinDale 32a37e8107 tests: partially fix tensorizer and logprobs tests hai 1 mes
  AlpinDale 7f1c9af5e2 fix: fp8 quant test hai 1 mes
  AlpinDale 173ac23399 fix: experts int8 quant test hai 1 mes
  AlpinDale 68f050129d fix: lora worker manager test import hai 1 mes
  AlpinDale 3661de812d fix: lora layer test hai 1 mes
  AlpinDale 0a369f9171 feat: support chunked prefill with LoRA (#823) hai 1 mes
  AlpinDale e5b1afe625 feat: add chat method for LLM class (#822) hai 1 mes
  AlpinDale 262cbc63b7 fix: vision api test template path hai 1 mes
  AlpinDale b0113a1eaa fix: tokenization api test (#821) hai 1 mes
  AlpinDale c6c91edab7 ci: update & overhaul test units (#769) hai 1 mes