david/flash-attention: flash-attention from https://github.com/Dao-AILab/flash-attention @ fa3-kvcache-gqa

espejo de https://github.com/Dao-AILab/flash-attention

Tri Dao 65205d350e [CI] Compile for pytorch 2.4.0		hace 4 meses
..
publish.yml	65205d350e [CI] Compile for pytorch 2.4.0	hace 4 meses