Commit History

Author SHA1 Message Date
  AlpinDale 50c2434267 move megatron to a top-level directory 9 months ago
  AlpinDale 4d33ce60da feat: Triton flash attention backend for ROCm (#407) 9 months ago
  AlpinDale 41beab5dc1 add exllamav2 tensor paralell, fused MoE for GPTQ/AWQ 9 months ago
  AlpinDale 0f1399c135 feat: attention refactor part 2 9 months ago
  AlpinDale d1786645a3 fix formatting 9 months ago
  AlpinDale 688d56993a add logit scale for command-r 9 months ago
  AlpinDale f1ea36a445 add some imports 9 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 months ago
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 10 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
  AlpinDale e31c6f0b45 feat: refactor modeling logic and support more models (#274) 11 months ago
  AlpinDale 842912d022 feat: on-the-fly gguf conversion (#250) 11 months ago
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 1 year ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
  AlpinDale b9b295d74e chore: backlogs 1 (#191) 1 year ago
  AlpinDale 7d91e9e0f2 feat: CUDA graphs (#172) 1 year ago
  AlpinDale 653da510d1 chore: rewrite InputMetadata (#143) 1 year ago
  AlpinDale 8b2bbbd98b chore: attention rewrite + models (#135) 1 year ago
  AlpinDale 0d51eac374 feat: awq for all models (#124) 1 year ago
  AlpinDale e7b6a2d5a0 chore: tensor parallel refactors part 2 (#116) 1 year ago
  AlpinDale 74604eb252 fix: pylint complaints (#91) 1 year ago
  AlpinDale efc6f7fbec chore: reformats (#90) 1 year ago
  AlpinDale a6a4220fa6 feat: refactor megatron and quants (#57) 1 year ago
  AlpinDale 0495c50a3e GPTQ+exllama support (#21) 1 year ago
  AlpinDale 75c27d3e65 massive overhaul 1 year ago
  AlpinDale 6b9561ef07 adapt TGI incremental detokenization 1 year ago
  AlpinDale 45f6d9f923 initial refactor commit 1 year ago
  AlpinDale 06e71fc492 feat: add GPT-NeoX support for testing purposes 1 year ago