Commit History

Author SHA1 Message Date
  AlpinDale 62111fab17 feat: allow serving encoder-decoder models in the API server (#664) 4 months ago
  AlpinDale a0e446a17d feat: initial encoder-decoder support with BART model (#633) 4 months ago
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) 4 months ago
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) 9 months ago
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
  AlpinDale ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 months ago
  AlpinDale c3a221eb02 feat: GGUF, QuIP#, and Marlin support (#228) 1 year ago
  AlpinDale 641bb0f6e9 feat: add custom allreduce kernels (#224) 1 year ago
  AlpinDale c0aac15421 feat: S-LoRA support (#222) 1 year ago
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) 1 year ago
  AlpinDale f013d714c0 chore: merge dev branch into main (#177) 1 year ago
  AlpinDale 2755a48d51 merge dev branch into main (#153) 1 year ago
  AlpinDale 8834ecf9de chore: clean up refactor endpoints (#98) 1 year ago
  AlpinDale c70abc7522 fix the LLM class for quantization 1 year ago
  AlpinDale 6b9561ef07 adapt TGI incremental detokenization 1 year ago
  AlpinDale 388d7545dd fix: circular import 1 year ago
  AlpinDale c761d38c69 fix: sort outputs and avoid unwanted list copy 1 year ago
  AlpinDale 56077f0f29 upstream: trust remote code 1 year ago
  AlpinDale 724852dc31 chore: refactoring cont. 1 year ago
  AlpinDale 5169163403 chore: add tokenizer mode for slow/fast tokenizers 1 year ago
  AlpinDale 07aa2a492f upstream: add option to specify tokenizer 1 year ago
  AlpinDale 20a8235114 upstream: add 1 year ago
  AlpinDale e52de7de70 feat: add API endpoint with FastAPI 1 year ago