audio | 653d1a08d4 | feat: add support for audio models (#891) | 3 weeks ago
chat_templates | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
fp8 | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
marlin | 8a71788372 | Add OLMoE (#772) | 2 months ago
monitoring | 12e40ae6fd | chore: update grafana template (#721) | 3 months ago
offline_inference | 673621a3d2 | xpu: refactor the model runner for tensor parallelism (#910) | 3 weeks ago
openai_api | 653d1a08d4 | feat: add support for audio models (#891) | 3 weeks ago
vision | 908ff753a1 | fix: phi_3.5_v loading (#896) | 3 weeks ago
aphrodite_engine_example.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
api_client.py | f32d57ed04 | add inference examples | 1 year ago
gguf_to_torch.py | 9d81716bfd | [v0.5.3] Release Candidate (#388) | 8 months ago
gradio_server.py | e42a78381a | feat: switch from pylint to ruff (#322) | 10 months ago
hadamard_example.py | d91e4e98e1 | attempt adding kernels back | 3 weeks ago
run_cluster.sh | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
save_sharded_state.py | f1d0b77c92 | [0.6.0] Release Candidate (#481) | 4 months ago
tensorize_aphrodite_model.py | 22a4cd4595 | core: fix spec decode metrics and envs circular import (#889) | 3 weeks ago