Commit History

Autor SHA1 Mensaxe Data
  AlpinDale f561a54a43 core: fix async postprocessor in case of preemption (#1000) hai 1 mes
  AlpinDale df5bf69938 core: slightly improve chunked prefill performance (#981) hai 1 mes
  AlpinDale 5be6225f38 core: support multi-step scheduling w/ async post-processor (#955) hai 1 mes
  AlpinDale 0e2bfccda0 core: add virtual engine for async outproc (#939) hai 1 mes
  AlpinDale f7f3fed265 feat: add async postprocessor (#925) hai 1 mes
  AlpinDale abfd4465ca feat: add support for chunked prefill + prefix caching (#871) hai 1 mes
  AlpinDale 3d83e64f8e feat: add metrics for prefix cache hit rate (#829) hai 2 meses
  AlpinDale 2f61644f6e SPMD optimizations (#824) hai 2 meses
  AlpinDale 0a369f9171 feat: support chunked prefill with LoRA (#823) hai 2 meses
  AlpinDale 577586309d chore: multi-step args and sequence modifications (#713) hai 4 meses
  AlpinDale bf88c8567e feat: mamba model support (#674) hai 4 meses
  AlpinDale 7df7b8ca53 optimization: reduce end-to-end overhead from python obj allocation (#666) hai 4 meses
  AlpinDale a0e446a17d feat: initial encoder-decoder support with BART model (#633) hai 5 meses
  AlpinDale f1d0b77c92 [0.6.0] Release Candidate (#481) hai 5 meses
  AlpinDale 9d81716bfd [v0.5.3] Release Candidate (#388) hai 8 meses
  AlpinDale 78d66f16d1 Chunked Prefill Part 1 (#384) hai 10 meses
  AlpinDale f8dfac6372 chore: attention refactor and upstream sync apr01 (#365) hai 10 meses
  AlpinDale e42a78381a feat: switch from pylint to ruff (#322) hai 10 meses
  AlpinDale c2d77b1822 chore: logging refactor (#302) hai 10 meses
  AlpinDale ac82b67f75 feat: naive context shift and various QoL changes (#289) hai 11 meses
  AlpinDale 4d04ade9ef feat: fine-grained seeds (#279) hai 11 meses
  AlpinDale c0aac15421 feat: S-LoRA support (#222) hai 1 ano
  AlpinDale 8fa608aeb7 feat: replace Ray with NCCL for control plane comms (#221) hai 1 ano
  AlpinDale b9b295d74e chore: backlogs 1 (#191) hai 1 ano
  g4rg 2aab3da9bd chore: fix Python 3.8+ compatibility (#170) hai 1 ano
  AlpinDale 9ec4e08ade fix: cpu sync delay fix (#127) hai 1 ano
  AlpinDale 13901af940 fix: scheduler hang with long prompts (#126) hai 1 ano
  50h100a fa0ae5a2c9 feat: new mirostatv2 implementation (#96) hai 1 ano
  AlpinDale efc6f7fbec chore: reformats (#90) hai 1 ano
  AlpinDale 3d72f05c7b feat: flattened 1D tensor -> 2D tensor (#85) hai 1 ano