AlpinDale 0613d91551 fix: kv head calculation with MPT GQA 7 months ago
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 year ago
block.py b5694be865 chore: use a pool to reuse LogicalTokenBlock.token_ids 7 months ago
config.py 0613d91551 fix: kv head calculation with MPT GQA 7 months ago
grammar.py 0527131e93 fix: grammar logits processor (#268) 1 year ago
inputs.py 8004c9f782 fix: import for multimodaldata 7 months ago
logger.py 46159b107a formatting: pt1 8 months ago
logits_processor.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
outputs.py 90ceab32ff refactor: consolidate prompt args to LLM engines 7 months ago
pooling_params.py be8154a8a0 feat: proper embeddings API with e5-mistral-7b support 7 months ago
sampling_params.py e8b7f53321 allow prompt token IDs in the logits processor api 7 months ago
sequence.py 8d77c69cbd feat: support image processor and add llava example 7 months ago
test_utils.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
utils.py 6a57861fca feat: initial XPU support via intel_extension_for_pytorch (#571) 7 months ago