.. |
layers
|
bd0ddf1cfe
feat: EETQ quantization (#408)
|
há 8 meses atrás |
models
|
50c2434267
move megatron to a top-level directory
|
há 9 meses atrás |
__init__.py
|
0f1399c135
feat: attention refactor part 2
|
há 9 meses atrás |
hf_downloader.py
|
fcfb72af24
Support arbitrary model in GGUF. (#381)
|
há 8 meses atrás |
loader.py
|
50c2434267
move megatron to a top-level directory
|
há 9 meses atrás |
neuron_loader.py
|
d1786645a3
fix formatting
|
há 9 meses atrás |
outlines_decoding.py
|
63c2508ab4
no key sorting for outlines
|
há 9 meses atrás |
outlines_logits_processors.py
|
0b35176089
feat: add context-free grammars (#376)
|
há 9 meses atrás |
sampling_metadata.py
|
2319b411ce
refactor: neuron support
|
há 9 meses atrás |
utils.py
|
d1786645a3
fix formatting
|
há 9 meses atrás |