AlpinDale 3ed4cc431c enc_dec attention code преди 9 месеца
..
quantization f8652c8e99 fix: optimize aqlm dequantization (#325) преди 10 месеца
triton_kernel e42a78381a feat: switch from pylint to ruff (#322) преди 10 месеца
__init__.py 07aa2a492f upstream: add option to specify tokenizer преди 1 година
activation.py e31c6f0b45 feat: refactor modeling logic and support more models (#274) преди 10 месеца
attention.py 58e89e29d9 add custom bias to attention.py преди 9 месеца
enc_dec_attention.py 3ed4cc431c enc_dec attention code преди 9 месеца
layernorm.py e31c6f0b45 feat: refactor modeling logic and support more models (#274) преди 10 месеца
linear.py e42a78381a feat: switch from pylint to ruff (#322) преди 10 месеца
rejection.py 95bdd35ec9 feat: rejection sampler (#197) преди 1 година
rotary_embedding.py e42a78381a feat: switch from pylint to ruff (#322) преди 10 месеца
sampler.py da223153c6 feat&fix: cohere support and missing GPU blocks (#333) преди 9 месеца
vocab_parallel_embedding.py 968bde81bf fix: tensor parallel with GPTQ and AWQ quants (#307) преди 10 месеца