AlpinDale 39beed0b87 Revert "Refactor AWQ support." 1 year ago
attention 45f6d9f923 initial refactor commit 1 year ago
activation.cpp 32844c1522 add GELU kernels and remove compile bloat 1 year ago
activation_kernels.cu 32844c1522 add GELU kernels and remove compile bloat 1 year ago
attention.cpp 24c78e7306 optimization: multi-query attention kernel 1 year ago
cache.cpp 081545bde6 fix: various CUDA kernel tweaks 1 year ago
cache_kernels.cu 32844c1522 add GELU kernels and remove compile bloat 1 year ago
dispatch_utils.h 32844c1522 add GELU kernels and remove compile bloat 1 year ago
layernorm.cpp 081545bde6 fix: various CUDA kernel tweaks 1 year ago
layernorm_kernels.cu 32844c1522 add GELU kernels and remove compile bloat 1 year ago
pos_encoding.cpp 45f6d9f923 initial refactor commit 1 year ago
pos_encoding_kernels.cu 45f6d9f923 initial refactor commit 1 year ago
reduction.cuh 081545bde6 fix: various CUDA kernel tweaks 1 year ago