AlpinDale c577c31aaa feat: tree attention 8 mēneši atpakaļ
..
gguf fcfb72af24 Support arbitrary model in GGUF. (#381) 8 mēneši atpakaļ
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 gadu atpakaļ
block.py ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 mēneši atpakaļ
config.py a3b1602391 fix: rope scaling for cohere and qwen (#436) 8 mēneši atpakaļ
grammar.py 0527131e93 fix: grammar logits processor (#268) 10 mēneši atpakaļ
logger.py 58b0616dd3 feat: support sharded ggufs (#420) 8 mēneši atpakaļ
logits_processor.py f67b5be198 chore: port sampler+metadata changes from main to dev (#427) 8 mēneši atpakaļ
outputs.py c18bf116da fix stop strings not being excluded from outputs 9 mēneši atpakaļ
sampling_params.py c18bf116da fix stop strings not being excluded from outputs 9 mēneši atpakaļ
sequence.py c577c31aaa feat: tree attention 8 mēneši atpakaļ
test_utils.py 50c2434267 move megatron to a top-level directory 9 mēneši atpakaļ
utils.py 1528ce50e5 fix: abort requests when the connection to /v1/completions is interrupted (#431) 8 mēneši atpakaļ