sgsdxzy 58b0616dd3 feat: support sharded ggufs (#420) 8 months ago
..
marlin e42a78381a feat: switch from pylint to ruff (#322) 9 months ago
monitoring 070c1cef8c fix: explicit RFC3986 for prometheus_client asgi (#344) 9 months ago
alpaca_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
aphrodite_engine_example.py f32d57ed04 add inference examples 1 year ago
api_client.py f32d57ed04 add inference examples 1 year ago
chatml_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
gguf_to_torch.py 58b0616dd3 feat: support sharded ggufs (#420) 8 months ago
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 9 months ago
offline_inference.py e42a78381a feat: switch from pylint to ruff (#322) 9 months ago
prefix_cache_example.py 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
pygchat_template.jinja 80e8a14949 feat: add pygchat Jinja template (#218) 11 months ago
slora_inference.py e42a78381a feat: switch from pylint to ruff (#322) 9 months ago