AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
..
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 year ago
block.py ac82b67f75 feat: naive context shift and various QoL changes (#289) 1 year ago
config.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
gguf.py e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
grammar.py 0527131e93 fix: grammar logits processor (#268) 1 year ago
logger.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
logits_processor.py 3df36ee07d fix: logit bias logitproc (#278) 1 year ago
outputs.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
sampling_params.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
sequence.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago
test_utils.py 641bb0f6e9 feat: add custom allreduce kernels (#224) 1 year ago
utils.py f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 11 months ago