Commit History

Author SHA1 Message Date
  AlpinDale c2be1b9f29 formatting 6 months ago
  AlpinDale cbfc7f96d6 potential fix 6 months ago
  AlpinDale 7d3194e7f4 revert #244 6 months ago
  AlpinDale 3ab36e6b2d feat: extended RoPE for Llama 3.1 (#543) 6 months ago
  AlpinDale c9d6f9f164 fix formatting 6 months ago
  AlpinDale 0e75803a50 why was this ignored by git? 6 months ago
  AlpinDale 0d3562a7f9 MQA in triton FA 6 months ago
  AlpinDale 8de8034f8b include fp8 compilation in rocm 6 months ago
  AlpinDale 0f7ef9ef7c fix: import in selector 6 months ago
  AlpinDale 36660b55c2 chore: mixtral fp8 w/ static scales (#542) 6 months ago
  AlpinDale c21af7acad feat: `DistributedGPUExecutor` abstract class (#541) 6 months ago
  AlpinDale b178ae4b4a chore: generalize linear_method to be quant_method (#540) 6 months ago
  AlpinDale a6a627d745 fix aqlm compilation 6 months ago
  AlpinDale aed64884c6 feat: prompt logprobs with chunked prefill (#539) 6 months ago
  Naomiusearch 9bcbf61296 There's no aphrodite.py in outlines repo (#531) 6 months ago
  cloud11665 cc9a801eed [bugfix] change c++ std to 20 (#529) 6 months ago
  AlpinDale ed759f065d chore: tokenizer_revision -> revision 6 months ago
  AlpinDale 2e0b115ce1 move func tracing to utils 6 months ago
  AlpinDale 41338053e7 feat: add shutdown method to engine 6 months ago
  AlpinDale 199e776722 chore: move ray utils to executor dir 6 months ago
  AlpinDale e7b1368156 feat: Phi3 support 6 months ago
  AlpinDale 1225c4dfd6 fix: illegal mem access crash for marlin 6 months ago
  AlpinDale d1a3c7bc2c chore: simplify try-finally logic in pynccl 6 months ago
  AlpinDale 440384d776 chore: use nvidia-ml-py instead of pynvml 6 months ago
  AlpinDale 46159b107a formatting: pt1 6 months ago
  AlpinDale 4c746d8baa chore: init nccl using the gloo backend 6 months ago
  AlpinDale bf2dd2bee9 feat: allow multiple sampling params in LLM class 6 months ago
  Orion a2a24e9b0d feat: list support in message.content (#503) 6 months ago
  Bruno Renié 9c45fe9a2a openai: fix metrics endpoint (#512) 6 months ago
  AlpinDale fca911ee0a vLLM Upstream Sync (#526) 6 months ago