Commit History

Author SHA1 Message Date
  50h100a 9022c6d869 remove progress_bar imports 2 months ago
  50h100a 9576096b9d iterate over weights normally 2 months ago
  AlpinDale 0e558e9b2f fix: loading chameleon model with TP>1 (#695) 4 months ago
  AlpinDale 3f712cd287 feat: add progress bar for loading individual weight modules (#640) 4 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 months ago
  AlpinDale da223153c6 feat&fix: cohere support and missing GPU blocks (#333) 9 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
  AlpinDale e31c6f0b45 feat: refactor modeling logic and support more models (#274) 10 months ago
  AlpinDale 842912d022 feat: on-the-fly gguf conversion (#250) 11 months ago
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 11 months ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 11 months ago
  AlpinDale b9b295d74e chore: backlogs 1 (#191) 1 year ago
  AlpinDale 7d91e9e0f2 feat: CUDA graphs (#172) 1 year ago
  AlpinDale 653da510d1 chore: rewrite InputMetadata (#143) 1 year ago
  AlpinDale 8b2bbbd98b chore: attention rewrite + models (#135) 1 year ago
  AlpinDale 0d51eac374 feat: awq for all models (#124) 1 year ago
  AlpinDale e7b6a2d5a0 chore: tensor parallel refactors part 2 (#116) 1 year ago
  AlpinDale 74604eb252 fix: pylint complaints (#91) 1 year ago
  AlpinDale efc6f7fbec chore: reformats (#90) 1 year ago
  AlpinDale a6a4220fa6 feat: refactor megatron and quants (#57) 1 year ago
  AlpinDale 0495c50a3e GPTQ+exllama support (#21) 1 year ago
  AlpinDale 75c27d3e65 massive overhaul 1 year ago
  AlpinDale 6b9561ef07 adapt TGI incremental detokenization 1 year ago
  AlpinDale 45f6d9f923 initial refactor commit 1 year ago
  AlpinDale 06e71fc492 feat: add GPT-NeoX support for testing purposes 1 year ago