AlpinDale
|
e0886ee929
feat: add `ProposerWorkerBase` abstract class
|
7 months ago |
AlpinDale
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
7 months ago |
AlpinDale
|
16f345c29a
fix circular reference with weakref
|
7 months ago |
AlpinDale
|
ef733aee43
implement ExecuteModelData to reduce executor complexity
|
8 months ago |
AlpinDale
|
723c6acb84
re-add ngram speculative decoding
|
8 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 months ago |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
11 months ago |