AlpinDale 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 kuukausi sitten
..
arctic_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
cached_prefix_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
embedding_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
encoder_decoder_inference.py 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 4 kuukautta sitten
gguf_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
lora_aphrodite_engine.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 kuukausi sitten
lora_async_aphrodite.py 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 kuukausi sitten
mlpspeculator_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
neuron_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
offline_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
ray_distributed_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten
soft_prompt_inference.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 kuukautta sitten