AlpinDale e13a66925c feat: add fuyu vision model and persimmon language model support 6 months ago
..
chat_templates e1c4cf1d50 chore: organize chat templates 7 months ago
fp8 d63690a0df chore: add fp8 examples 7 months ago
marlin e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
monitoring b1555eb208 add new grafana metrics 7 months ago
offline_inference a3b56353fa fix: another one missed 6 months ago
vision e13a66925c feat: add fuyu vision model and persimmon language model support 6 months ago
aphrodite_engine_example.py f32d57ed04 add inference examples 1 year ago
api_client.py f32d57ed04 add inference examples 1 year ago
gguf_to_torch.py 9d81716bfd [v0.5.3] Release Candidate (#388) 10 months ago
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 1 year ago
save_sharded_state.py 7bcff4ac03 implement sharded state dict 7 months ago
tensorize_aphrodite_model.py d0cca80b8b feat: support sharded tensorizer models 7 months ago