AlpinDale | 9a7d5514c4 | feat: introduce MQAphroditeEngine (#1056) | 1 week ago
AlpinDale | ddaefd8d38 | chore: remove engine_use_ray (#1024) | 1 week ago
AlpinDale | a113309876 | kernel: add meta functions for ops to prevent graph breaks (#1019) | 1 week ago
AlpinDale | 0e5cf7f840 | tpu: avoid dynamo guard eval overhead (#949) | 2 weeks ago
AlpinDale | fcfcfc65e1 | quants: add triton kernels for AWQ (#946) | 2 weeks ago
AlpinDale | f1ea7711bd | core: do not compile ScalarType for torch < 2.4.0 (#938) | 2 weeks ago
AlpinDale | d46e70ac98 | api: add inline model loading (#928) | 2 weeks ago
AlpinDale | 8d9f1fd4e6 | feat: add single user mode (#927) | 2 weeks ago
AlpinDale | a00ab49e21 | api: add client timeouts for the ZeroMQ server (#897) | 3 weeks ago
AlpinDale | 65b71f5fcc | distributed: fix issue for when nodes have multiple network interfaces (#892) | 3 weeks ago
AlpinDale | 22a4cd4595 | core: fix spec decode metrics and envs circular import (#889) | 3 weeks ago