AlpinDale 033797fd55 refactor throughput benchmark script 7 months ago
attention.py e1f3fd1e02 fix: test units (#201) 1 year ago
backend_request_func.py e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
hashing.py c6a501f682 add multiprocessing executor; make ray optional 7 months ago
latency.py e1f3fd1e02 fix: test units (#201) 1 year ago
launch_tgi.sh 4d04ade9ef feat: fine-grained seeds (#279) 1 year ago
moe_config.py 6d2f00d728 benchmark script for fp8 MoE 7 months ago
serving.py e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
throughput.py 033797fd55 refactor throughput benchmark script 7 months ago