AlpinDale 72659e5cad separate prompt and genned tokens for enc-dec 9 months ago
..
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 year ago
block.py ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 months ago
config.py c41462cfcd feat: exllamav2 quantization (#305) 10 months ago
gguf.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
grammar.py 0527131e93 fix: grammar logits processor (#268) 10 months ago
logger.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
logits_processor.py 3df36ee07d fix: logit bias logitproc (#278) 10 months ago
outputs.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
sampling_params.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
sequence.py 72659e5cad separate prompt and genned tokens for enc-dec 9 months ago
test_utils.py 641bb0f6e9 feat: add custom allreduce kernels (#224) 11 months ago
utils.py e53842bd5d fix: cuda home detection for fp8 kv cache 9 months ago