.. |
attention.py
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
backend_request_func.py
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 year ago |
latency.py
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
launch_tgi.sh
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
1 year ago |
moe_config.py
|
6d2f00d728
benchmark script for fp8 MoE
|
7 months ago |
serving.py
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 year ago |
throughput.py
|
f22b700ee4
feat: marlin kernels for GPTQ (#547)
|
7 months ago |