sgsdxzy 214151b04c fix: max_num_batched_tokens for chunked_prefill (#412) há 8 meses atrás
..
marlin e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás
monitoring 070c1cef8c fix: explicit RFC3986 for prometheus_client asgi (#344) há 9 meses atrás
alpaca_template.jinja 2755a48d51 merge dev branch into main (#153) há 1 ano atrás
aphrodite_engine_example.py f32d57ed04 add inference examples há 1 ano atrás
api_client.py f32d57ed04 add inference examples há 1 ano atrás
chatml_template.jinja 2755a48d51 merge dev branch into main (#153) há 1 ano atrás
gguf_to_torch.py 214151b04c fix: max_num_batched_tokens for chunked_prefill (#412) há 8 meses atrás
gradio_server.py e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás
offline_inference.py e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás
prefix_cache_example.py 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) há 11 meses atrás
pygchat_template.jinja 80e8a14949 feat: add pygchat Jinja template (#218) há 11 meses atrás
slora_inference.py e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás