JDKWangGuan 0d810cfb73 Fix KeyError handling for non-existing key in state_dict.pop() (#898) 6 months ago
__init__.py ece539abd6 Add __init__.py files to subdirectories for installation 2 years ago
baichuan.py 3f7d5786ba Pass alibi slopes to flash_attn_with_kvcache during generation 1 year ago
bert.py abbc131173 [LayerNorm] Switch from CUDA to Triton implementation 1 year ago
bigcode.py 07005806ff Add BigCode converters (#532) 1 year ago
btlm.py 7ffba9a501 Implement BTLM model 1 year ago
falcon.py f1a73d0740 Run isort and black on python files 1 year ago
gpt.py 0d810cfb73 Fix KeyError handling for non-existing key in state_dict.pop() (#898) 6 months ago
gpt_neox.py 0a146185d6 [Gen] Remove minor dead code 1 year ago
gptj.py f1a73d0740 Run isort and black on python files 1 year ago
llama.py 187c2a0635 Fix E1136 (#563) 1 year ago
opt.py f1a73d0740 Run isort and black on python files 1 year ago
vit.py abbc131173 [LayerNorm] Switch from CUDA to Triton implementation 1 year ago