AlpinDale 344ddaac5a properly disable speculative decoding 5 months ago
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
batch_expansion.py a94de94c44 refactor: combine the prefill and decode into a single API (#553) 5 months ago
interfaces.py ef733aee43 implement ExecuteModelData to reduce executor complexity 5 months ago
metrics.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
multi_step_worker.py a94de94c44 refactor: combine the prefill and decode into a single API (#553) 5 months ago
ngram_worker.py de62ceb18c refactor: eliminate parallel worker per-step task scheduling overhead 5 months ago
spec_decode_worker.py 344ddaac5a properly disable speculative decoding 5 months ago
top1_proposer.py e42d0b3455 possibly improve ngram efficiency 5 months ago
util.py 5b0c11d190 support pipeline parallel pynccl groups 5 months ago