Historique des commits

Auteur SHA1 Message Date
  AlpinDale 3c4335ce02 async: avoid premature exit in the async generator il y a 1 mois
  AlpinDale abfd4465ca feat: add support for chunked prefill + prefix caching (#871) il y a 1 mois
  AlpinDale ef99a567b6 fix: temp_last warning being repeated for every output token (#869) il y a 1 mois
  Naomiusearch 4f9fea4c4d fix: ROCm build (#817) il y a 1 mois
  50h100a 9b569279fd Merge pull request #868 from PygmalionAI/dry_zoom il y a 1 mois
  50h100a fc3c1cd5a5 this is getting its own commit because lint failures like that are exactly why people stop using linters il y a 1 mois
  50h100a 60a5d0fb80 rewrite DRY to be a lot faster il y a 1 mois
  AlpinDale e182d00256 feat: AWQ quantization for InternVL (#867) il y a 1 mois
  AlpinDale 9fc6473b18 server: log the process occupying our port (#866) il y a 1 mois
  AlpinDale db96c2daa3 executor: pipe `worker_class_fn` arg in executor (#865) il y a 1 mois
  AlpinDale 369600855a xpu: disable punica kernels for XPU (#864) il y a 1 mois
  AlpinDale 1405051912 attention: add `AttentionState` abstraction (#863) il y a 1 mois
  AlpinDale 82eabb6aa7 build: add jinja2 to requirements file (#862) il y a 1 mois
  AlpinDale 9094a8a2a3 xpu: refactor XPU worker & executor (#861) il y a 1 mois
  AlpinDale 8b8d2ce7e2 ci: bump aphrodite version to 0.6.4.post1 (#859) il y a 1 mois
  AlpinDale 3392b81bf9 sampler: allow parsing sampler order using strings (#858) il y a 1 mois
  AlpinDale 0035dc42ed sampler: optimize DRY performance using z-algorithm (#856) il y a 1 mois
  AlpinDale 2150bb5019 sampler: add range parameter for DRY (#855) il y a 1 mois
  AlpinDale 72c505ad84 sampler: fix dry concurrency issue (#852) il y a 1 mois
  Selali 14ac216498 sampler: add output_tokens to DRY sampler (#849) il y a 1 mois
  Luke Harold Miles d486d7ac01 docs: add linux arm64/aarch64/GH200 installation tips (#851) il y a 1 mois
  AlpinDale d2971a6831 ci: bump version to 0.6.4 (#845) il y a 1 mois
  AlpinDale 538471f76e chore: bump mistral_common to 1.5.0 (#844) il y a 1 mois
  AlpinDale 483c9e6e59 fix: disable awq_marlin override for awq models (#843) il y a 1 mois
  AlpinDale dfa34d1b24 feat: add sampler_priorty (#837) il y a 1 mois
  AlpinDale 93bc863591 feat: Machete Kernels for Hopper GPUs (#842) il y a 1 mois
  AlpinDale 563e8f7ac8 fix: latency and serving benchmarks (#841) il y a 1 mois
  AlpinDale 7c7ec12f36 chore: refactor executor classes for easier inheritance (#840) il y a 1 mois
  AlpinDale 16b587c104 fix: hidden states handling in batch expansion for spec decoding (#839) il y a 1 mois
  AlpinDale 60f7b828d5 feat: add skew sampling (#834) il y a 1 mois