AlpinDale ae04f57ec1 feat: Pipeline Parallel support (#581) há 7 meses atrás
..
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) há 10 meses atrás
batch_expansion.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) há 7 meses atrás
draft_model_runner.py ae04f57ec1 feat: Pipeline Parallel support (#581) há 7 meses atrás
interfaces.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) há 7 meses atrás
metrics.py 7253e9052d feat: integrate typical acceptance sampling for spec decoding há 7 meses atrás
mlp_speculator_worker.py 405bb74612 Control plane comms refactor (#573) há 7 meses atrás
multi_step_worker.py cdff8e89f9 feat: introduce `DraftModelRunner` há 7 meses atrás
ngram_worker.py e0886ee929 feat: add `ProposerWorkerBase` abstract class há 7 meses atrás
proposer_worker_base.py abbb730607 feat: support draft model on different tensor parallel size há 7 meses atrás
smaller_tp_proposer_worker.py b6ff0623a6 chore: clean up branding há 7 meses atrás
spec_decode_worker.py dd378ea063 feat: MLPSpeculator with tensor parallel há 7 meses atrás
top1_proposer.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) há 7 meses atrás
util.py af43576da0 feat: add MLPSpeculator speculative decoding support (#572) há 7 meses atrás