.. |
fused_moe
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
ops
|
9181fa0396
feat: Triton kernels for sampling (#383)
|
hace 9 meses |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
hace 1 año |
activation.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
layernorm.py
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
hace 11 meses |
linear.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
logits_processor.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
rejection.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
rotary_embedding.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
sampler.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |
vocab_parallel_embedding.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
hace 8 meses |