.. |
output_processor | 09dab16f82 | core: improve async postproc + multi-step performance (#983) | 2 months ago |
__init__.py | 04b53d2db5 | chore: add initializer files | 1 year ago |
aphrodite_engine.py | 05be6085ec | core: factor out input preprocessing into a separate class (#1039) | 2 months ago |
args_tools.py | 271879a4a5 | fix: disable chunked prefill and prefix caching for multimodal models (#1037) | 2 months ago |
async_aphrodite.py | 05be6085ec | core: factor out input preprocessing into a separate class (#1039) | 2 months ago |
async_timeout.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 6 months ago |
metrics.py | 3d83e64f8e | feat: add metrics for prefix cache hit rate (#829) | 3 months ago |
metrics_types.py | 3d83e64f8e | feat: add metrics for prefix cache hit rate (#829) | 3 months ago |
protocol.py | 0dfa6b60ec | core: support logprobs with multi-step scheduling (#963) | 2 months ago |