tool_parsers
|
a56bce4c94
fix: remove duplicate assignment in Hermes2ProToolParser
|
4 weeks ago |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
api_server.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
args.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
logits_processors.py
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
protocol.py
|
055c8905a3
api: add sampling/engine option to return only deltas or final output (#1035)
|
4 weeks ago |
run_batch.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
samplers.json
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
10 months ago |
serving_chat.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
serving_completions.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
serving_embedding.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
serving_engine.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |
serving_tokenization.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
3 weeks ago |