.. |
cutlass_benchmarks
|
765adcfba1
chore: add w8a8 benchmark scripts
|
il y a 7 mois |
attention.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
il y a 7 mois |
backend_request_func.py
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
il y a 7 mois |
benchmark_moe.py
|
5b5e6dc359
chore: add batch size 1536 and 3072 to moe benchmark
|
il y a 7 mois |
hashing.py
|
c6a501f682
add multiprocessing executor; make ray optional
|
il y a 7 mois |
latency.py
|
e1f3fd1e02
fix: test units (#201)
|
il y a 1 an |
launch_tgi.sh
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
il y a 1 an |
serving.py
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
il y a 7 mois |
sonnet.txt
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
il y a 7 mois |
throughput.py
|
0886c361f4
feat: OpenVINO CPU backend (#576)
|
il y a 7 mois |