AlpinDale 0c162c8dad api: use fp32 for base64 embeddings (#919) há 1 mês atrás
..
audio 653d1a08d4 feat: add support for audio models (#891) há 1 mês atrás
chat_templates f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
fp8 f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
marlin 8a71788372 Add OLMoE (#772) há 3 meses atrás
monitoring 12e40ae6fd chore: update grafana template (#721) há 4 meses atrás
offline_inference 673621a3d2 xpu: refactor the model runner for tensor parallelism (#910) há 1 mês atrás
openai_api 0c162c8dad api: use fp32 for base64 embeddings (#919) há 1 mês atrás
vision 908ff753a1 fix: phi_3.5_v loading (#896) há 1 mês atrás
aphrodite_engine_example.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
api_client.py f32d57ed04 add inference examples há 1 ano atrás
gguf_to_torch.py 9d81716bfd [v0.5.3] Release Candidate (#388) há 8 meses atrás
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás
run_cluster.sh f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
save_sharded_state.py f1d0b77c92 [0.6.0] Release Candidate (#481) há 4 meses atrás
tensorize_aphrodite_model.py 22a4cd4595 core: fix spec decode metrics and envs circular import (#889) há 1 mês atrás