sgsdxzy fcfb72af24 Support arbitrary model in GGUF. (#381) 8 months ago
..
layers bd0ddf1cfe feat: EETQ quantization (#408) 8 months ago
models 50c2434267 move megatron to a top-level directory 8 months ago
__init__.py 0f1399c135 feat: attention refactor part 2 9 months ago
hf_downloader.py fcfb72af24 Support arbitrary model in GGUF. (#381) 8 months ago
loader.py 50c2434267 move megatron to a top-level directory 8 months ago
neuron_loader.py d1786645a3 fix formatting 9 months ago
outlines_decoding.py 63c2508ab4 no key sorting for outlines 8 months ago
outlines_logits_processors.py 0b35176089 feat: add context-free grammars (#376) 9 months ago
sampling_metadata.py 2319b411ce refactor: neuron support 9 months ago
utils.py d1786645a3 fix formatting 9 months ago