Author | SHA1 Message | Date |
---|---|---|
AlpinDale | d8c4193704 feat: Speculative Decoding using a draft model (#432) | 9 months ago |
AlpinDale | 4d33ce60da feat: Triton flash attention backend for ROCm (#407) | 9 months ago |
AlpinDale | 2319b411ce refactor: neuron support | 9 months ago |
AlpinDale | f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) | 9 months ago |