.. |
guided_decoding
|
313e198557
api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993)
|
2 ヶ月 前 |
layers
|
6951928522
xpu: bump IPEX to 2.3, support GQA (#1042)
|
2 ヶ月 前 |
model_loader
|
f2b6dc3872
cpu: add support for W8A8 quantization via compressed-tensor (#1017)
|
2 ヶ月 前 |
models
|
e3f5bae2cc
fix: skip loading extra bias for Qwen2-VL GPTQ (#1040)
|
2 ヶ月 前 |
__init__.py
|
22a4cd4595
core: fix spec decode metrics and envs circular import (#889)
|
2 ヶ月 前 |
_custom_op.py
|
5d37ec1016
suppress tpu import warning (#696)
|
6 ヶ月 前 |
parameter.py
|
83af2524f3
quants: add GPTQ and FBGEMM to AphroditeParameters (#987)
|
2 ヶ月 前 |
pooling_metadata.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
6 ヶ月 前 |
sampling_metadata.py
|
f0cc35befe
sampler: pad dry sequence breakers tensor (#875)
|
3 ヶ月 前 |
utils.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
10 ヶ月 前 |