AlpinDale fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden
..
marlin_kernels fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden
align_block_size_kernel.cu f1d0b77c92 [0.6.0] Release Candidate (#481) 5 maanden geleden
marlin_moe_ops.cu fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden
marlin_moe_ops.h fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden
moe_ops.h f1d0b77c92 [0.6.0] Release Candidate (#481) 5 maanden geleden
topk_softmax_kernels.cu fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden
torch_bindings.cpp fa84f8102e kernels: split marlin kernels for faster compile, fix MoE, temporarily remove HQQ (#1119) 2 weken geleden