.. |
backends
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
ops
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
__init__.py
|
a94de94c44
refactor: combine the prefill and decode into a single API (#553)
|
7 months ago |
layer.py
|
ac79d115b3
add guards for prefix caching, fp8, chunked, etc
|
7 months ago |
selector.py
|
696f2cd59c
add phi3_small support with blocksparse attention
|
7 months ago |