AlpinDale 0c162c8dad api: use fp32 for base64 embeddings (#919) 1 месяц назад
..
audio 653d1a08d4 feat: add support for audio models (#891) 1 месяц назад
chat_templates f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
fp8 f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
marlin 8a71788372 Add OLMoE (#772) 3 месяцев назад
monitoring 12e40ae6fd chore: update grafana template (#721) 4 месяцев назад
offline_inference 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) 1 месяц назад
openai_api 0c162c8dad api: use fp32 for base64 embeddings (#919) 1 месяц назад
vision 908ff753a1 fix: phi_3.5_v loading (#896) 1 месяц назад
aphrodite_engine_example.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
api_client.py f32d57ed04 add inference examples 1 год назад
gguf_to_torch.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 месяцев назад
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 10 месяцев назад
run_cluster.sh f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
save_sharded_state.py f1d0b77c92 [0.6.0] Release Candidate (#481) 4 месяцев назад
tensorize_aphrodite_model.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) 1 месяц назад