.. |
arctic_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
cached_prefix_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
embedding_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
encoder_decoder_inference.py
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 ヶ月 前 |
gguf_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
mlpspeculator_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
neuron_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
offline_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
ray_distributed_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |
slora_inference.py
|
8a71788372
Add OLMoE (#772)
|
2 ヶ月 前 |
soft_prompt_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 ヶ月 前 |