.. |
layers
|
bd0ddf1cfe
feat: EETQ quantization (#408)
|
hai 8 meses |
models
|
50c2434267
move megatron to a top-level directory
|
hai 9 meses |
__init__.py
|
0f1399c135
feat: attention refactor part 2
|
hai 9 meses |
hf_downloader.py
|
fcfb72af24
Support arbitrary model in GGUF. (#381)
|
hai 8 meses |
loader.py
|
50c2434267
move megatron to a top-level directory
|
hai 9 meses |
neuron_loader.py
|
d1786645a3
fix formatting
|
hai 9 meses |
outlines_decoding.py
|
63c2508ab4
no key sorting for outlines
|
hai 9 meses |
outlines_logits_processors.py
|
0b35176089
feat: add context-free grammars (#376)
|
hai 9 meses |
sampling_metadata.py
|
2319b411ce
refactor: neuron support
|
hai 9 meses |
utils.py
|
d1786645a3
fix formatting
|
hai 9 meses |