AlpinDale 1264e0b5d8 api: add mistral function calling format to all models loaded with "mistral" format (#1053) преди 3 седмици
..
rpc 638c08d9dc fix: clean shutdown issues (#1047) преди 4 седмици
tool_parsers a56bce4c94 fix: remove duplicate assignment in Hermes2ProToolParser преди 4 седмици
__init__.py 07aa2a492f upstream: add option to specify tokenizer преди 1 година
api_server.py 638c08d9dc fix: clean shutdown issues (#1047) преди 4 седмици
args.py 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) преди 1 месец
logits_processors.py 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) преди 4 месеца
protocol.py 055c8905a3 api: add sampling/engine option to return only deltas or final output (#1035) преди 4 седмици
run_batch.py 81fa31bcaf feat: embeddings support for batched OAI endpoint (#676) преди 4 месеца
samplers.json ac82b67f75 feat: naive context shift and various QoL changes (#289) преди 10 месеца
serving_chat.py 1264e0b5d8 api: add mistral function calling format to all models loaded with "mistral" format (#1053) преди 3 седмици
serving_completions.py 055c8905a3 api: add sampling/engine option to return only deltas or final output (#1035) преди 4 седмици
serving_embedding.py 0c162c8dad api: use fp32 for base64 embeddings (#919) преди 1 месец
serving_engine.py c5c09720b0 api: log prompt truncation (#940) преди 1 месец
serving_tokenization.py 055c8905a3 api: add sampling/engine option to return only deltas or final output (#1035) преди 4 седмици