AlpinDale 4ea4f5943e refactor openai endpoints and add function calls 11 months ago
..
alpaca_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
aphrodite_engine_example.py f32d57ed04 add inference examples 1 year ago
api_client.py f32d57ed04 add inference examples 1 year ago
chatml_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
function_call.py 4ea4f5943e refactor openai endpoints and add function calls 11 months ago
gguf_to_torch.py c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 11 months ago
gradio_server.py 551c4280cf chore: change default port to 2242 1 year ago
offline_inference.py f32d57ed04 add inference examples 1 year ago
prefix_cache_example.py 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
slora_inference.py c0aac15421 feat: S-LoRA support (#222) 11 months ago