.. |
attention.py
|
e1f3fd1e02
fix: test units (#201)
|
1 年間 前 |
backend_request_func.py
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 年間 前 |
hashing.py
|
c6a501f682
add multiprocessing executor; make ray optional
|
7 ヶ月 前 |
latency.py
|
e1f3fd1e02
fix: test units (#201)
|
1 年間 前 |
launch_tgi.sh
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
1 年間 前 |
moe_config.py
|
6d2f00d728
benchmark script for fp8 MoE
|
7 ヶ月 前 |
serving.py
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 年間 前 |
throughput.py
|
033797fd55
refactor throughput benchmark script
|
7 ヶ月 前 |