david/aphrodite-engine

Author	SHA1 Message	Date
AlpinDale	1390915778 multi-step: add support for flashinfer attention backend (#1033)	4 weeks ago
AlpinDale	6f59024522 torch.compile: hide slicing under custom op for inductor (#1029)	4 weeks ago
AlpinDale	de341ffb00 fix: ensure multistep lookahead allocation is compatible with cugraph max capture (#1008)	4 weeks ago
AlpinDale	5c3b94de45 spec decode: move ops.advane_step to flash attention backend (#1005)	4 weeks ago
AlpinDale	3bb0f07461 chore: rename `task_handler` to `worker` (#985)	1 month ago
AlpinDale	1405051912 attention: add `AttentionState` abstraction (#863)	1 month ago
AlpinDale	7a313483f1 chore: move update_flash_attn_metadata to attn backend (#731)	4 months ago
AlpinDale	60b702a827 chore: register custom torch ops for flash-attn and flashinfer (#724)	4 months ago
AlpinDale	24456206a9 fix: logit softcapping in flash-attn (#688)	4 months ago
AlpinDale	7df7b8ca53 optimization: reduce end-to-end overhead from python obj allocation (#666)	4 months ago
AlpinDale	f1d0b77c92 [0.6.0] Release Candidate (#481)	4 months ago
AlpinDale	9d81716bfd [v0.5.3] Release Candidate (#388)	8 months ago