.. |
arctic_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
cached_prefix_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
embedding_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
encoder_decoder_inference.py
|
a0e446a17d
feat: initial encoder-decoder support with BART model (#633)
|
4 months ago |
gguf_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
mlpspeculator_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
neuron_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
offline_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
ray_distributed_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
slora_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
soft_prompt_inference.py
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |