AlpinDale
|
5289c14b24
feat: Asymmetric Tensor Parallel (#594)
|
4 månader sedan |
AlpinDale
|
5be90c3859
Mamba infrastrucuture support (#586)
|
4 månader sedan |
AlpinDale
|
ae04f57ec1
feat: Pipeline Parallel support (#581)
|
4 månader sedan |
AlpinDale
|
6a57861fca
feat: initial XPU support via intel_extension_for_pytorch (#571)
|
5 månader sedan |
AlpinDale
|
fe21123a1c
feat: TPU support (#570)
|
5 månader sedan |
AlpinDale
|
f40b809d3b
allow using v2 block manager with sliding window
|
5 månader sedan |
AlpinDale
|
50b7c13db0
refactor: attention selector (#552)
|
5 månader sedan |
AlpinDale
|
8b56dc4347
dict -> torch.Tensor for blocks_to_swap
|
5 månader sedan |
AlpinDale
|
21ce19b3ea
blocks_to_copy dict -> torch.Tensor
|
5 månader sedan |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 månader sedan |
AlpinDale
|
e3252edd07
fix: remove event and stream, add typing (#382)
|
9 månader sedan |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
9 månader sedan |
AlpinDale
|
c2d77b1822
chore: logging refactor (#302)
|
10 månader sedan |
AlpinDale
|
ea0f57b233
feat: allow further support for non-cuda devices (#247)
|
11 månader sedan |
AlpinDale
|
31c95011a6
feat: FP8 E5M2 KV Cache (#226)
|
11 månader sedan |
AlpinDale
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 månader sedan |
AlpinDale
|
15a0454172
feat: FP8 KV Cache (#185)
|
1 år sedan |
AlpinDale
|
1aab8a7d6f
feat: speedup compilation times by 3x (#130)
|
1 år sedan |
AlpinDale
|
74604eb252
fix: pylint complaints (#91)
|
1 år sedan |
AlpinDale
|
75c27d3e65
massive overhaul
|
1 år sedan |
AlpinDale
|
525edab7cc
fix: logger in cache engine
|
1 år sedan |
AlpinDale
|
b8f4337c5b
chore: various fixes
|
1 år sedan |
AlpinDale
|
a409431c40
feat: draft for cuda kernels
|
1 år sedan |