.. |
tool_parsers
|
a56bce4c94
fix: remove duplicate assignment in Hermes2ProToolParser
|
1 week ago |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
api_server.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
args.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
logits_processors.py
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
protocol.py
|
055c8905a3
api: add sampling/engine option to return only deltas or final output (#1035)
|
1 week ago |
run_batch.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
samplers.json
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
10 months ago |
serving_chat.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
serving_completions.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
serving_embedding.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
serving_engine.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |
serving_tokenization.py
|
6212072245
api: support LoRA lineage and base model metadata management (#1072)
|
5 days ago |