Author | SHA1 | Message | Date |
---|---|---|---|
AlpinDale | d8a51d05a7 | fix: seeded gens with pipeline parallel | 4 months ago |
AlpinDale | 2c653a2268 | fix: make speculative decoding work with per-request seed | 4 months ago |
AlpinDale | 7253e9052d | feat: integrate typical acceptance sampling for spec decoding | 4 months ago |
AlpinDale | 313e6e1ec7 | feat: add typical acceptance sampling | 5 months ago |