AlpinDale 411ac4f405 vlm: add support for Qwen2-VL model (#1015) 4 недель назад
..
audio 8eb4a3cfd3 vlm: support multiple audios per prompt for Ultravox (#990) 1 месяц назад
chat_templates 485d1de42e fix: hermes tool call chat template (#999) 1 месяц назад
fp8 f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
marlin 8a71788372 Add OLMoE (#772) 3 месяцев назад
monitoring b26a014b12 fix: prometheus.yaml path in monitoring example (#969) 1 месяц назад
offline_inference 145e554a4d neuron: add 8bit quantization for Neuron (#994) 1 месяц назад
openai_api 313e198557 api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993) 1 месяц назад
vision 411ac4f405 vlm: add support for Qwen2-VL model (#1015) 4 недель назад
aphrodite_engine_example.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
api_client.py f32d57ed04 add inference examples 1 год назад
gguf_to_torch.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 месяцев назад
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 10 месяцев назад
run_cluster.sh f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
save_sharded_state.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
tensorize_aphrodite_model.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 месяц назад