Fizz~ 8a71788372 Add OLMoE (#772) 2 months ago
..
arctic_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
cached_prefix_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
embedding_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
encoder_decoder_inference.py 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 4 months ago
gguf_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
mlpspeculator_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
neuron_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
offline_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
ray_distributed_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
slora_inference.py 8a71788372 Add OLMoE (#772) 2 months ago
soft_prompt_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago