.. |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
vor 1 Jahr |
block.py
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
vor 1 Jahr |
config.py
|
656459fd84
make fp8_e4m3 work on nvidia
|
vor 8 Monaten |
grammar.py
|
0527131e93
fix: grammar logits processor (#268)
|
vor 1 Jahr |
logger.py
|
46159b107a
formatting: pt1
|
vor 9 Monaten |
logits_processor.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
vor 10 Monaten |
outputs.py
|
be8154a8a0
feat: proper embeddings API with e5-mistral-7b support
|
vor 8 Monaten |
pooling_params.py
|
be8154a8a0
feat: proper embeddings API with e5-mistral-7b support
|
vor 8 Monaten |
sampling_params.py
|
e8b7f53321
allow prompt token IDs in the logits processor api
|
vor 8 Monaten |
sequence.py
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
vor 8 Monaten |
test_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
vor 10 Monaten |
utils.py
|
656459fd84
make fp8_e4m3 work on nvidia
|
vor 8 Monaten |