Commit History

Author     SHA1        Message                                     Date
AlpinDale  24c78e7306  optimization: multi-query attention kernel  1 year ago
AlpinDale  081545bde6  fix: various CUDA kernel tweaks             1 year ago
AlpinDale  05d0a7e763  feat: adapt the attention kernels           1 year ago