.. |
ops
|
bf4a4d8516
fix: do not register punica with torch if using older torch (#948)
|
il y a 1 mois |
__init__.py
|
2bd6c92f73
fix: lora inclusion in wheels
|
il y a 11 mois |
fully_sharded_layers.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
il y a 4 mois |
layers.py
|
1394008421
chore: decouple `should_modify_greedy_probs_inplace (#671)
|
il y a 4 mois |
lora.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
il y a 4 mois |
models.py
|
9f3e7c86e2
feat: add fused Marlin MoE kernel (#934)
|
il y a 1 mois |
punica.py
|
bf4a4d8516
fix: do not register punica with torch if using older torch (#948)
|
il y a 1 mois |
request.py
|
2f61644f6e
SPMD optimizations (#824)
|
il y a 2 mois |
utils.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
il y a 4 mois |
worker_manager.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
il y a 4 mois |