Commit History

Author     SHA1        Message                                                   Date
AlpinDale  d8c4193704  feat: Speculative Decoding using a draft model (#432)     9 months ago
AlpinDale  4d33ce60da  feat: Triton flash attention backend for ROCm (#407)      9 months ago
AlpinDale  2319b411ce  refactor: neuron support                                  9 months ago
AlpinDale  f8dfac6372  chore: attention refactor and upstream sync apr01 (#365)  9 months ago