AlpinDale fca911ee0a vLLM Upstream Sync (#526) 8 months ago
common.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
cta_iterator.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
format.cu a98babfb74 fix: bnb on Turing GPUs (#299) 1 year ago
format.h fca911ee0a vLLM Upstream Sync (#526) 8 months ago
gemm_s4_f16.cu a98babfb74 fix: bnb on Turing GPUs (#299) 1 year ago
gemm_s4_f16.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
gemm_s4_f16_kernel.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
gemm_template.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
int4_fp16_gemm_kernels.cu a98babfb74 fix: bnb on Turing GPUs (#299) 1 year ago
metric.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago
warp_iterator.h e0c35bb353 feat: bitsandbytes and `--load-in{4,8}bit` support (#294) 1 year ago