AlpinDale 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) 1 year ago
attention b9b295d74e chore: backlogs 1 (#191) 1 year ago
quantization 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) 1 year ago
activation_kernels.cu b9b295d74e chore: backlogs 1 (#191) 1 year ago
cache.h 1aab8a7d6f feat: speedup compilation times by 3x (#130) 1 year ago
cache_kernels.cu b9b295d74e chore: backlogs 1 (#191) 1 year ago
cuda_compat.h 1334a833a4 feat: AMD ROCm support (#95) 1 year ago
cuda_utils.h 1aab8a7d6f feat: speedup compilation times by 3x (#130) 1 year ago
cuda_utils_kernels.cu 1334a833a4 feat: AMD ROCm support (#95) 1 year ago
dispatch_utils.h 32844c1522 add GELU kernels and remove compile bloat 1 year ago
layernorm_kernels.cu b9b295d74e chore: backlogs 1 (#191) 1 year ago
ops.h 801eda0b7a feat: support GPTQ 2, 3, and 8bit quants (#181) 1 year ago
pos_encoding_kernels.cu b9b295d74e chore: backlogs 1 (#191) 1 year ago
pybind.cpp 62b2c4119d feat: re-write GPTQ and refactor exllama kernels (#152) 1 year ago
reduction.cuh 1334a833a4 feat: AMD ROCm support (#95) 1 year ago