Commit History

Author SHA1 Message Date
  Tri Dao 601b4dc48d Bump to v2.3.0 1 year ago
  Tri Dao 0a1d03c7ea Bump to v2.2.5 1 year ago
  Tri Dao bff3147175 Re-enable compilation for Hopper 1 year ago
  Tri Dao 229080b9d2 Bump to v2.2.4 1 year ago
  Tri Dao 43617deab9 Remove template for (IsEvenMN=T, IsEvenK=F) to speed up compilation 1 year ago
  Tri Dao 799f56fa90 Don't compile for Pytorch 2.1 on CUDA 12.1 due to nvcc segfaults 1 year ago
  Tri Dao c984208ddb Set block size to 64 x 64 for kvcache to avoid nvcc segfaults 1 year ago
  Tri Dao 8c8b4d36e1 Bump to v2.2.3 1 year ago
  Tri Dao 08c295c043 Bump to v2.2.2 1 year ago
  Tri Dao a1576ad1e8 Bump to v2.2.1 1 year ago
  Tri Dao 6d673cd961 Bump to v2.2.0 1 year ago
  Tri Dao 4976650f74 Set single threaded compilation for CUDA 12.2 so CI doesn't OOM 1 year ago
  Tri Dao 6a89b2f121 Remove constexpr in launch template to fix CI compilation 1 year ago
  Tri Dao 97ba7a62e9 Try switching back to Cutlass 3.2.0 1 year ago
  Tri Dao 1dc1b6c8f2 Bump to v2.1.2 1 year ago
  Tri Dao 757058d4d3 Update Cutlass to v3.2.0 1 year ago
  Tri Dao 9e5e8bc91e Change causal mask to be aligned to bottom-right instead of top-left 1 year ago
  Tri Dao 6711b3bc40 Bump version to 2.0.9 1 year ago
  Tri Dao c5e87b11e9 Bump to v2.0.5 1 year ago
  Tri Dao d30f2e1cd5 Bump to v2.0.4 1 year ago
  Tri Dao a4e5d1eddd Bump to v2.0.3 1 year ago
  Kirthi Shankar Sivamani 32a953f486 Request for v2.0.2 (#388) 1 year ago
  Tri Dao b252072409 Bump to v2.0.1 1 year ago
  chuanli11 30fd8c17d8 remove checkout v2.0.0.post1 from dockerfile 1 year ago
  Tri Dao 4f285b3547 FlashAttention-2 release 1 year ago
  Tri Dao 6d48e14a6c Bump to v1.0.9 1 year ago
  Tri Dao 9610114ce8 Bump to v1.0.8 1 year ago
  Tri Dao 85b51d61ee Bump version to 1.0.7 1 year ago
  Kirthi Shankar Sivamani dd9c3a1fc2 bump to v1.0.6 1 year ago
  Tri Dao eff9fe6b80 Add ninja to pyproject.toml build-system, bump to v1.0.5 1 year ago