AlpinDale c577c31aaa feat: tree attention vor 9 Monaten
..
output_processor c577c31aaa feat: tree attention vor 9 Monaten
__init__.py 04b53d2db5 chore: add initializer files vor 1 Jahr
aphrodite_engine.py a3b1602391 fix: rope scaling for cohere and qwen (#436) vor 9 Monaten
args_tools.py 60ca1e1e5e feat: add ngram prompt lookup decoding for speculative decoding (#438) vor 9 Monaten
async_aphrodite.py f216601f18 fix: logging in the API server vor 9 Monaten
metrics.py d8c4193704 feat: Speculative Decoding using a draft model (#432) vor 9 Monaten
ray_tools.py 8c9cabf4c8 fix: display error in ray before deadlock (#378) vor 10 Monaten