AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
..
README f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
attention_dtypes.h f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
attention_kernels.cu f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
cache.h f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
cache_kernels.cu f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
dispatch_utils.h f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
dtype_float32.cuh f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
dtype_int8.cuh f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ
ops.h f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 mēneši atpakaļ

README

Backup of attention and cache kernels from INT8 KV Cache. Will be restored soon.