.. |
__init__.py
|
07aa2a492f
upstream: add option to specify tokenizer
|
1 ano atrás |
block.py
|
7df7b8ca53
optimization: reduce end-to-end overhead from python obj allocation (#666)
|
4 meses atrás |
config.py
|
5d37ec1016
suppress tpu import warning (#696)
|
4 meses atrás |
connections.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 meses atrás |
grammar.py
|
0527131e93
fix: grammar logits processor (#268)
|
10 meses atrás |
logger.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 meses atrás |
logits_processor.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
outputs.py
|
0e558e9b2f
fix: loading chameleon model with TP>1 (#695)
|
4 meses atrás |
pooling_params.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 meses atrás |
sampling_params.py
|
2da6a3ec2b
feat: option to apply temperature scaling last (#670)
|
4 meses atrás |
sequence.py
|
ef40c05cd3
fix: minor adjustments to scheduler and block manager (#667)
|
4 meses atrás |
test_utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 meses atrás |
utils.py
|
5d37ec1016
suppress tpu import warning (#696)
|
4 meses atrás |