AlpinDale 0247bdcd27 inline model switching 3 weeks ago
..
tool_parsers a56bce4c94 fix: remove duplicate assignment in Hermes2ProToolParser 4 weeks ago
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 year ago
api_server.py 0247bdcd27 inline model switching 3 weeks ago
args.py 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) 1 month ago
logits_processors.py 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 4 months ago
protocol.py 118bbfec5a take more args in model load field 3 weeks ago
run_batch.py 81fa31bcaf feat: embeddings support for batched OAI endpoint (#676) 4 months ago
samplers.json ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 months ago
serving_chat.py b47a39026d feat: introduce MQAphroditeEngine 3 weeks ago
serving_completions.py b47a39026d feat: introduce MQAphroditeEngine 3 weeks ago
serving_embedding.py b47a39026d feat: introduce MQAphroditeEngine 3 weeks ago
serving_engine.py b47a39026d feat: introduce MQAphroditeEngine 3 weeks ago
serving_tokenization.py b47a39026d feat: introduce MQAphroditeEngine 3 weeks ago