.. |
multiprocessing
|
12b0059b47
api: enable MQAphroditeEngine for embedding models (#1065)
|
4 days ago |
output_processor
|
09dab16f82
core: improve async postproc + multi-step performance (#983)
|
2 weeks ago |
__init__.py
|
04b53d2db5
chore: add initializer files
|
1 year ago |
aphrodite_engine.py
|
9a7d5514c4
feat: introduce MQAphroditeEngine (#1056)
|
1 week ago |
args_tools.py
|
a985143768
core: add cuda graph support for encoder-decoder models (#1051)
|
1 week ago |
async_aphrodite.py
|
9a7d5514c4
feat: introduce MQAphroditeEngine (#1056)
|
1 week ago |
async_timeout.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
metrics.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
metrics_types.py
|
3d83e64f8e
feat: add metrics for prefix cache hit rate (#829)
|
1 month ago |
protocol.py
|
9a7d5514c4
feat: introduce MQAphroditeEngine (#1056)
|
1 week ago |