.. |
chat_templates
|
e1c4cf1d50
chore: organize chat templates
|
4 months ago |
fp8
|
d63690a0df
chore: add fp8 examples
|
4 months ago |
marlin
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
9 months ago |
monitoring
|
b1555eb208
add new grafana metrics
|
5 months ago |
offline_inference
|
0e6c400b13
feat: re-add GGUF (#600)
|
4 months ago |
openai_api
|
6b1fdd07bd
chore: add isort and refactor formatting script and utils
|
4 months ago |
vision
|
c3ee71a437
feat: port SiglipVisionModel from transformers
|
4 months ago |
aphrodite_engine_example.py
|
4d4e767838
ci: take one of fixing lint issues
|
4 months ago |
api_client.py
|
f32d57ed04
add inference examples
|
1 year ago |
gguf_to_torch.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
gradio_server.py
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
9 months ago |
run_cluster.sh
|
b21f947a63
feat: script for multi-node cluster setup
|
4 months ago |
save_sharded_state.py
|
7bcff4ac03
implement sharded state dict
|
5 months ago |
tensorize_aphrodite_model.py
|
d0cca80b8b
feat: support sharded tensorizer models
|
5 months ago |