AlpinDale a56bce4c94 fix: remove duplicate assignment in Hermes2ProToolParser 1 bulan lalu
..
rpc 39b2e83ac3 api: optimize zeromq frontend performance (#951) 1 bulan lalu
tool_parsers a56bce4c94 fix: remove duplicate assignment in Hermes2ProToolParser 1 bulan lalu
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 tahun lalu
api_server.py 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) 1 bulan lalu
args.py 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) 1 bulan lalu
logits_processors.py 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 4 bulan lalu
protocol.py 0191c5efd1 tools: fix tool calls to more strictly follow OpenAI format (#1003) 1 bulan lalu
run_batch.py 81fa31bcaf feat: embeddings support for batched OAI endpoint (#676) 4 bulan lalu
samplers.json ac82b67f75 feat: naive context shift and various QoL changes (#289) 11 bulan lalu
serving_chat.py 7d5feaa037 api: fix logic for deciding if tool parser is used (#1025) 1 bulan lalu
serving_completions.py 61c7182491 feat: enable prompt logprobs in OpenAI API (#720) 4 bulan lalu
serving_embedding.py 0c162c8dad api: use fp32 for base64 embeddings (#919) 1 bulan lalu
serving_engine.py c5c09720b0 api: log prompt truncation (#940) 1 bulan lalu
serving_tokenization.py 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) 1 bulan lalu