__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
api_server.py
|
a07fc83bc8
chore: proper util for aphrodite version
|
7 months ago |
args.py
|
8f9cb7235c
chore: allow multiple served model names
|
10 months ago |
protocol.py
|
1d7f5c45b0
feat: add stream_options for chat completions
|
7 months ago |
samplers.json
|
ac82b67f75
feat: naive context shift and various QoL changes (#289)
|
1 year ago |
serving_chat.py
|
1d7f5c45b0
feat: add stream_options for chat completions
|
7 months ago |
serving_completions.py
|
e321d80e4e
fix: `prompt_logprobs==0` case
|
7 months ago |
serving_embedding.py
|
90ceab32ff
refactor: consolidate prompt args to LLM engines
|
7 months ago |
serving_engine.py
|
78de98463b
feat: return max_model_len in /v1/models
|
7 months ago |