sgsdxzy 214151b04c fix: max_num_batched_tokens for chunked_prefill (#412) 8 months ago
marlin e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
monitoring 070c1cef8c fix: explicit RFC3986 for prometheus_client asgi (#344) 9 months ago
alpaca_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
aphrodite_engine_example.py f32d57ed04 add inference examples 1 year ago
api_client.py f32d57ed04 add inference examples 1 year ago
chatml_template.jinja 2755a48d51 merge dev branch into main (#153) 1 year ago
gguf_to_torch.py 214151b04c fix: max_num_batched_tokens for chunked_prefill (#412) 8 months ago
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
offline_inference.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
prefix_cache_example.py 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
pygchat_template.jinja 80e8a14949 feat: add pygchat Jinja template (#218) 11 months ago
slora_inference.py e42a78381a feat: switch from pylint to ruff (#322) 10 months ago