.. |
guided_decoding
|
f61acdd3ec
api: add json_schema to OpenAI server (#915)
|
3 weeks ago |
layers
|
5cb2e998d8
quants: update compressed tensors lifecycle to remove `prefix` from `create_weights` (#924)
|
3 weeks ago |
model_loader
|
59d1d59028
api: support aphrodite_config.yaml with inline loading (#929)
|
2 weeks ago |
models
|
fce970a846
feat: multi-image input support for Phi3V (#917)
|
3 weeks ago |
__init__.py
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
3 weeks ago |
_custom_op.py
|
5d37ec1016
suppress tpu import warning (#696)
|
3 months ago |
parameter.py
|
afc9a28aa0
chore: add AphroditeParameter support for FP8 quant (#902)
|
3 weeks ago |
pooling_metadata.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
sampling_metadata.py
|
f0cc35befe
sampler: pad dry sequence breakers tensor (#875)
|
1 month ago |
utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |