.. |
__init__.py | 04b53d2db5 | chore: add initializer files | 1 year ago |
aphrodite_engine.py | a1f18f17e6 | modify the cache engine and model runner/worker to support mamba states | 9 months ago |
args_tools.py | bd0ddf1cfe | feat: EETQ quantization (#408) | 9 months ago |
async_aphrodite.py | 9aaeb5d349 | add speculative config and arg for later | 9 months ago |
metrics.py | b1caee23a6 | cache the p2p access check for memory saving | 9 months ago |
ray_tools.py | 8c9cabf4c8 | fix: display error in ray before deadlock (#378) | 9 months ago |