AlpinDale 4d4e767838 ci: take one of fixing lint issues 4 months ago
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
batch_expansion.py d8a51d05a7 fix: seeded gens with pipeline parallel 4 months ago
draft_model_runner.py 4d4e767838 ci: take one of fixing lint issues 4 months ago
interfaces.py 3a53ff1e01 fix: raise an error for no draft token case when draft_tp>1 5 months ago
medusa_worker.py d8a51d05a7 fix: seeded gens with pipeline parallel 4 months ago
metrics.py 4d4e767838 ci: take one of fixing lint issues 4 months ago
mlp_speculator_worker.py d8a51d05a7 fix: seeded gens with pipeline parallel 4 months ago
multi_step_worker.py 84a9cd25c9 fix: some naming issues 4 months ago
ngram_worker.py 6b1fdd07bd chore: add isort and refactor formatting script and utils 4 months ago
proposer_worker_base.py d638dc592d fix: some minor typing issues in spec decode 5 months ago
smaller_tp_proposer_worker.py 16dff9babc chore: enable bonus token in spec decoding for KV cache based models 5 months ago
spec_decode_worker.py 4d4e767838 ci: take one of fixing lint issues 4 months ago
target_model_runner.py a4cbcfe59f feat: disable logprob serialization to CPU for spec decode 5 months ago
top1_proposer.py 3a53ff1e01 fix: raise an error for no draft token case when draft_tp>1 5 months ago
util.py edffcecc67 chore: add proper logging for spec decoding verification 4 months ago