__init__.py | 07aa2a492f | upstream: add option to specify tokenizer | 1 year ago
block.py | b5694be865 | chore: use a pool to reuse LogicalTokenBlock.token_ids | 7 months ago
config.py | 0613d91551 | fix: kv head calculation with MPT GQA | 7 months ago
grammar.py | 0527131e93 | fix: grammar logits processor (#268) | 1 year ago
inputs.py | 8004c9f782 | fix: import for multimodaldata | 7 months ago
logger.py | 46159b107a | formatting: pt1 | 8 months ago
logits_processor.py | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 10 months ago
outputs.py | 90ceab32ff | refactor: consolidate prompt args to LLM engines | 7 months ago
pooling_params.py | be8154a8a0 | feat: proper embeddings API with e5-mistral-7b support | 7 months ago
sampling_params.py | e8b7f53321 | allow prompt token IDs in the logits processor api | 7 months ago
sequence.py | 8d77c69cbd | feat: support image processor and add llava example | 7 months ago
test_utils.py | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 10 months ago
utils.py | 6a57861fca | feat: initial XPU support via intel_extension_for_pytorch (#571) | 7 months ago