.. |
alpaca_template.jinja
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
aphrodite_engine_example.py
|
f32d57ed04
add inference examples
|
1 year ago |
api_client.py
|
f32d57ed04
add inference examples
|
1 year ago |
chatml_template.jinja
|
2755a48d51
merge dev branch into main (#153)
|
1 year ago |
function_call.py
|
4ea4f5943e
refactor openai endpoints and add function calls
|
11 months ago |
gguf_to_torch.py
|
c3a221eb02
feat: GGUF, QuIP#, and Marlin support (#228)
|
11 months ago |
gradio_server.py
|
551c4280cf
chore: change default port to 2242
|
1 year ago |
offline_inference.py
|
f32d57ed04
add inference examples
|
1 year ago |
prefix_cache_example.py
|
8fa608aeb7
feat: replace Ray with NCCL for control plane comms (#221)
|
11 months ago |
slora_inference.py
|
c0aac15421
feat: S-LoRA support (#222)
|
11 months ago |