Histórico de Commits

Autor SHA1 Mensagem Data
  AlpinDale c577c31aaa feat: tree attention há 9 meses atrás
  50h100a f67b5be198 chore: port sampler+metadata changes from main to dev (#427) há 9 meses atrás
  AlpinDale 8c67b37131 fix docstrings há 9 meses atrás
  AlpinDale fe17712f29 fully working chunked prefill há 9 meses atrás
  AlpinDale 50c2434267 move megatron to a top-level directory há 9 meses atrás
  AlpinDale 071269e406 feat: FP8 E4M3 KV Cache (#405) há 9 meses atrás
  AlpinDale f845a661dd Chunked Prefill Part 2: data update há 9 meses atrás
  AlpinDale 5f851e45e5 ruff há 9 meses atrás
  AlpinDale 41f5af0426 add python nccl wrapper, remove cupy há 9 meses atrás
  AlpinDale 7b9c08afae vision model support há 9 meses atrás
  AlpinDale 0f1399c135 feat: attention refactor part 2 há 9 meses atrás
  AlpinDale 2319b411ce refactor: neuron support há 9 meses atrás
  AlpinDale 15308ffb5b compute logits in model_runner há 9 meses atrás
  AlpinDale 78d66f16d1 Chunked Prefill Part 1 (#384) há 9 meses atrás
  AlpinDale 9181fa0396 feat: Triton kernels for sampling (#383) há 9 meses atrás
  AlpinDale 4b99ac15b7 fix: do not deepcopy metadata há 9 meses atrás
  AlpinDale 17b034613d chore: make metadata a dataclass (#377) há 9 meses atrás
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) há 10 meses atrás
  50h100a b9e0ae87c5 fix fine-grained seeding. há 10 meses atrás
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) há 10 meses atrás
  sgsdxzy 50c0875c32 chore: log total memory usage (#316) há 10 meses atrás
  AlpinDale c2d77b1822 chore: logging refactor (#302) há 10 meses atrás
  AlpinDale 9810daa699 feat: INT8 KV Cache (#298) há 10 meses atrás
  AlpinDale ac82b67f75 feat: naive context shift and various QoL changes (#289) há 10 meses atrás
  AlpinDale 4d04ade9ef feat: fine-grained seeds (#279) há 11 meses atrás
  AlpinDale 697c06c4f5 fix: LoRA support for mixtral (#276) há 11 meses atrás
  AlpinDale 4b80b42362 fix: memory leaks due to nccl cuda graphs (#275) há 11 meses atrás
  AlpinDale ea0f57b233 feat: allow further support for non-cuda devices (#247) há 11 meses atrás
  AlpinDale 1a94ccf3cf fix: prefix cache fail with lora (#239) há 11 meses atrás
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) há 1 ano atrás