Name | Last commit | Commit message | Last updated
audio | 8eb4a3cfd3 | vlm: support multiple audios per prompt for Ultravox (#990) | 1 week ago
chat_templates | 485d1de42e | fix: hermes tool call chat template (#999) | 1 week ago
fp8 | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
marlin | 8a71788372 | Add OLMoE (#772) | 2 months ago
monitoring | b26a014b12 | fix: prometheus.yaml path in monitoring example (#969) | 1 week ago
offline_inference | 145e554a4d | neuron: add 8bit quantization for Neuron (#994) | 1 week ago
openai_api | 313e198557 | api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) | 1 week ago
vision | acc0c727c8 | vlm: add support for molmo vision model (#1069) | 3 days ago
aphrodite_engine_example.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
api_client.py | f32d57ed04 | add inference examples | 1 year ago
gguf_to_torch.py | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 7 months ago
gradio_server.py | e42a78381a | feat: switch from pylint to ruff (#322) | 9 months ago
run_cluster.sh | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
save_sharded_state.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
tensorize_aphrodite_model.py | 22a4cd4595 | core: fix spec decode metrics and envs circular import (#889) | 3 weeks ago