AlpinDale c577c31aaa feat: tree attention há 9 meses atrás
..
output_processor c577c31aaa feat: tree attention há 9 meses atrás
__init__.py 04b53d2db5 chore: add initializer files há 1 ano atrás
aphrodite_engine.py a3b1602391 fix: rope scaling for cohere and qwen (#436) há 9 meses atrás
args_tools.py 60ca1e1e5e feat: add ngram prompt lookup decoding for speculative decoding (#438) há 9 meses atrás
async_aphrodite.py f216601f18 fix: logging in the API server há 9 meses atrás
metrics.py d8c4193704 feat: Speculative Decoding using a draft model (#432) há 9 meses atrás
ray_tools.py 8c9cabf4c8 fix: display error in ray before deadlock (#378) há 9 meses atrás