Commit History

Autor SHA1 Mensaxe Data
  AlpinDale 737f1c3351 part 1 hai 6 meses
  AlpinDale ed759f065d chore: tokenizer_revision -> revision hai 6 meses
  AlpinDale 2e0b115ce1 move func tracing to utils hai 6 meses
  AlpinDale 41338053e7 feat: add shutdown method to engine hai 6 meses
  AlpinDale 199e776722 chore: move ray utils to executor dir hai 6 meses
  AlpinDale e7b1368156 feat: Phi3 support hai 6 meses
  AlpinDale 1225c4dfd6 fix: illegal mem access crash for marlin hai 6 meses
  AlpinDale d1a3c7bc2c chore: simplify try-finally logic in pynccl hai 6 meses
  AlpinDale 440384d776 chore: use nvidia-ml-py instead of pynvml hai 6 meses
  AlpinDale 46159b107a formatting: pt1 hai 6 meses
  AlpinDale 4c746d8baa chore: init nccl using the gloo backend hai 6 meses
  AlpinDale bf2dd2bee9 feat: allow multiple sampling params in LLM class hai 6 meses
  Orion a2a24e9b0d feat: list support in message.content (#503) hai 6 meses
  Bruno Renié 9c45fe9a2a openai: fix metrics endpoint (#512) hai 6 meses
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) hai 6 meses
  AlpinDale 8be299e78b fix: lora load check hai 7 meses
  AlpinDale 096d9eb6c5 enhance nvlink detection hai 7 meses
  AlpinDale fb7825df8f squash logprobs hai 7 meses
  AlpinDale 66b7bc4415 sliding window in prefix kernel hai 7 meses
  AlpinDale 42998e423c better quant verification hai 7 meses
  AlpinDale 483c95a2f8 fix ops in gptq and awq hai 7 meses
  AlpinDale 8f9cb7235c chore: allow multiple served model names hai 7 meses
  AlpinDale fc80f57967 fix: correct file name for qwen2 moe hai 7 meses
  AlpinDale f894f7b176 Revert "reduce dedupe by wrapping in general worker class" hai 7 meses
  AlpinDale 082b0b03bc Revert "actually run the workers" hai 7 meses
  AlpinDale 36cf32649d actually run the workers hai 7 meses
  AlpinDale 9fff6fb507 reduce dedupe by wrapping in general worker class hai 7 meses
  AlpinDale b92bddafe9 time.monotonic() -> time.time() hai 7 meses
  AlpinDale 0178b4d976 docker: add AWS Neuron Docker image hai 7 meses
  AlpinDale 949f0445de readme: update installation command hai 8 meses