AlpinDale b26a014b12 fix: prometheus.yaml path in monitoring example (#969) 2 weeks ago
..
audio 653d1a08d4 feat: add support for audio models (#891) 3 weeks ago
chat_templates f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
fp8 f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
marlin 8a71788372 Add OLMoE (#772) 2 months ago
monitoring b26a014b12 fix: prometheus.yaml path in monitoring example (#969) 2 weeks ago
offline_inference ba6d798784 neuron: support for context length and token bucketing (#960) 2 weeks ago
openai_api 0c162c8dad api: use fp32 for base64 embeddings (#919) 3 weeks ago
vision 03bd85c950 chore: multi-image support for llava-next (#935) 2 weeks ago
aphrodite_engine_example.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
api_client.py f32d57ed04 add inference examples 1 year ago
gguf_to_torch.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 9 months ago
run_cluster.sh f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
save_sharded_state.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
tensorize_aphrodite_model.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 3 weeks ago