..
multiprocessing | 12b0059b47 | api: enable MQAphroditeEngine for embedding models (#1065) | 1 week ago
output_processor | 09dab16f82 | core: improve async postproc + multi-step performance (#983) | 2 weeks ago
__init__.py | 04b53d2db5 | chore: add initializer files | 1 year ago
aphrodite_engine.py | 9a7d5514c4 | feat: introduce MQAphroditeEngine (#1056) | 1 week ago
args_tools.py | a985143768 | core: add cuda graph support for encoder-decoder models (#1051) | 1 week ago
async_aphrodite.py | 9a7d5514c4 | feat: introduce MQAphroditeEngine (#1056) | 1 week ago
async_timeout.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
metrics.py | 3d83e64f8e | feat: add metrics for prefix cache hit rate (#829) | 1 month ago
metrics_types.py | 3d83e64f8e | feat: add metrics for prefix cache hit rate (#829) | 1 month ago
protocol.py | 9a7d5514c4 | feat: introduce MQAphroditeEngine (#1056) | 1 week ago