.. |
fused_moe
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
ops
|
9181fa0396
feat: Triton kernels for sampling (#383)
|
9 months ago |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
activation.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
layernorm.py
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
11 months ago |
linear.py
|
7589af22c9
Try to fix gguf.
|
8 months ago |
logits_processor.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
rejection.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
rotary_embedding.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
sampler.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
vocab_parallel_embedding.py
|
7589af22c9
Try to fix gguf.
|
8 months ago |