Tri Dao | a157cc8c9b [FT] Implement MQA/GQA | 1 year ago |
Tri Dao | 62e9814466 [Rotary] Make sure frequency calculation is in fp32 | 1 year ago |
Tri Dao | 48bc6eacd6 [Gen] Add rotary base as an argument to FT attention kernel | 1 year ago |
Tri Dao | a01d1213d7 [Gen] Add kernel from FasterTransformer for benchmarking | 1 year ago |