.. |
tool_parsers
|
a56bce4c94
fix: remove duplicate assignment in Hermes2ProToolParser
|
1 month ago |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
api_server.py
|
d96c363301
api: fix admin key being required for authentication (#1091)
|
1 week ago |
args.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
logits_processors.py
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
protocol.py
|
f20f5c3491
samplers: improved DRY performance (#1108)
|
1 week ago |
run_batch.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
samplers.json
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
11 months ago |
serving_chat.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
serving_completions.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
serving_embedding.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
serving_engine.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |
serving_tokenization.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
1 month ago |