.. |
async_engine
|
9082ac7b7a
add async engine test units
|
8 months ago |
basic_correctness
|
3a206f9e11
add chunked prefill correctness test
|
8 months ago |
benchmarks
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
9 months ago |
distributed
|
9e1cea354c
add distributed system tests
|
8 months ago |
endpoints
|
e28c8496b2
endpoint tests
|
8 months ago |
engine
|
ce10891496
add more engine-related tests:
|
8 months ago |
fp8_kv
|
2172a9c374
add fp8_e4m3fn scales for llama2 7b and 70b
|
8 months ago |
kernels
|
e177788401
add moe tests
|
8 months ago |
models
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
processing
|
54ed7adef3
add core processor tests
|
8 months ago |
prompts
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
samplers
|
4d04ade9ef
feat: fine-grained seeds (#279)
|
10 months ago |
worker
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |
__init__.py
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
conftest.py
|
54ed7adef3
add core processor tests
|
8 months ago |
test_regression.py
|
e1f3fd1e02
fix: test units (#201)
|
1 year ago |