.. |
common
|
0f6d56b07f
feat: model executor refactor (#367)
|
11 months ago |
endpoints
|
b361096463
fix: tokenizer when using ray (#366)
|
11 months ago |
engine
|
0f6d56b07f
feat: model executor refactor (#367)
|
11 months ago |
executor
|
0f6d56b07f
feat: model executor refactor (#367)
|
11 months ago |
kv_quant
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
1 year ago |
lora
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
11 months ago |
modeling
|
72cd8494aa
feat: mistral neuron support (#368)
|
11 months ago |
processing
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
11 months ago |
spec_decode
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
11 months ago |
task_handler
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
11 months ago |
transformers_utils
|
b361096463
fix: tokenizer when using ray (#366)
|
11 months ago |
__init__.py
|
0f6d56b07f
feat: model executor refactor (#367)
|
11 months ago |
py.typed
|
1c988a48b2
fix logging and add py.typed
|
1 year ago |