david/flash-attention

Mirror von https://github.com/Dao-AILab/flash-attention

Autor	SHA1 Nachricht	Datum
Chirag Jain	50896ec574 Make nvcc threads configurable via environment variable (#885)	vor 9 Monaten
Tri Dao	dc08ea1c33 Support H100 for other CUDA extensions	vor 1 Jahr
Tri Dao	ca81f32e04 Implement rotary embedding in CUDA	vor 2 Jahren