Historique des commits

Auteur SHA1 Message Date
  AlpinDale 60ca1e1e5e feat: add ngram prompt lookup decoding for speculative decoding (#438) il y a 9 mois
  AlpinDale d8c4193704 feat: Speculative Decoding using a draft model (#432) il y a 9 mois
  AlpinDale 8d26cf3876 simplify model_executor logic il y a 9 mois
  AlpinDale 4d33ce60da feat: Triton flash attention backend for ROCm (#407) il y a 9 mois
  AlpinDale 9aaeb5d349 add speculative config and arg for later il y a 9 mois
  AlpinDale 753f6dc51b add v2 block manager il y a 10 mois
  AlpinDale 7b9c08afae vision model support il y a 10 mois
  AlpinDale d1786645a3 fix formatting il y a 10 mois
  AlpinDale 2319b411ce refactor: neuron support il y a 10 mois
  AlpinDale 0f6d56b07f feat: model executor refactor (#367) il y a 10 mois
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) il y a 10 mois