AlpinDale a788ca33bf hack in custom bias for attention kernels 10 달 전
..
__init__.py a788ca33bf hack in custom bias for attention kernels 10 달 전
baichuan.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
bloom.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
chatglm.py e31c6f0b45 feat: refactor modeling logic and support more models (#274) 11 달 전
cohere.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
decilm.py e31c6f0b45 feat: refactor modeling logic and support more models (#274) 11 달 전
deepseek.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
falcon.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
gemma.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
gpt2.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
gpt_bigcode.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
gpt_j.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
gpt_neox.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
internlm2.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
llama.py 6ebac34dc1 chore: cleaner pre-llamafied Yi implementation (#352) 10 달 전
mixtral.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
mixtral_quant.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
mpt.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
olmo.py e42a78381a feat: switch from pylint to ruff (#322) 10 달 전
opt.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
phi.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
qwen.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
qwen2.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
stablelm.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 달 전
t5.py f009f94ffd update modeling code 10 달 전