AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) hai 7 meses
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) hai 10 meses
batch_expansion.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) hai 7 meses
draft_model_runner.py ae04f57ec1 feat: Pipeline Parallel support (#581) hai 7 meses
interfaces.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) hai 7 meses
metrics.py 7253e9052d feat: integrate typical acceptance sampling for spec decoding hai 7 meses
mlp_speculator_worker.py 405bb74612 Control plane comms refactor (#573) hai 7 meses
multi_step_worker.py cdff8e89f9 feat: introduce `DraftModelRunner` hai 7 meses
ngram_worker.py e0886ee929 feat: add `ProposerWorkerBase` abstract class hai 7 meses
proposer_worker_base.py abbb730607 feat: support draft model on different tensor parallel size hai 7 meses
smaller_tp_proposer_worker.py b6ff0623a6 chore: clean up branding hai 7 meses
spec_decode_worker.py dd378ea063 feat: MLPSpeculator with tensor parallel hai 7 meses
top1_proposer.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) hai 7 meses
util.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) hai 7 meses