attention.py
|
156f577f79
feat: switch from `PYBIND11_MODULE` to `TORCH_LIBRARY` (#569)
|
7 months ago |
backend_request_func.py
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
7 months ago |
benchmark_moe.py
|
5b5e6dc359
chore: add batch size 1536 and 3072 to moe benchmark
|
7 months ago |
hashing.py
|
c6a501f682
add multiprocessing executor; make ray optional
|
7 months ago |
latency.py
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
launch_tgi.sh
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
1 year ago |
serving.py
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
7 months ago |
sonnet.txt
|
89ee54dcff
update dockerfile and enhance serving benchmark
|
7 months ago |
throughput.py
|
033797fd55
refactor throughput benchmark script
|
7 months ago |