AlpinDale abbb730607 feat: support draft model on different tensor parallel size 7 months ago
..
benchmarks abbb730607 feat: support draft model on different tensor parallel size 7 months ago
endpoints 46159b107a formatting: pt1 8 months ago
engine e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
kernels e1f3fd1e02 fix: test units (#201) 1 year ago
models f970f3f3fb add base class for VLMs 7 months ago
prompts e1f3fd1e02 fix: test units (#201) 1 year ago
samplers 313e6e1ec7 feat: add typical acceptance sampling 7 months ago
worker e1f3fd1e02 fix: test units (#201) 1 year ago
__init__.py 2755a48d51 merge dev branch into main (#153) 1 year ago
conftest.py 72229a94da feat: better marlin kernels (#285) 1 year ago
test_regression.py e1f3fd1e02 fix: test units (#201) 1 year ago