.. |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 year ago |
block.py
|
7df7b8ca53
optimization: reduce end-to-end overhead from python obj allocation (#666)
|
4 months ago |
config.py
|
867939a6db
bring back cuda kernels for lroa
|
3 months ago |
connections.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
grammar.py
|
0527131e93
fix: grammar logits processor (#268)
|
10 months ago |
logger.py
|
867939a6db
bring back cuda kernels for lroa
|
3 months ago |
logits_processor.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
outputs.py
|
0e558e9b2f
fix: loading chameleon model with TP>1 (#695)
|
4 months ago |
pooling_params.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
sampling_params.py
|
ad181e3fef
feat: bring back dynatemp (#754)
|
3 months ago |
sequence.py
|
577586309d
chore: multi-step args and sequence modifications (#713)
|
3 months ago |
test_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
utils.py
|
0b8b407b6d
feat: support profiling with multiple multi-modal inputs per prompt (#712)
|
3 months ago |