AlpinDale c20073824a cleanup 7 months ago
..
attention 66b7bc4415 sliding window in prefix kernel 7 months ago
common 42998e423c better quant verification 7 months ago
distributed 096d9eb6c5 enhance nvlink detection 7 months ago
endpoints fb7825df8f squash logprobs 7 months ago
engine 42998e423c better quant verification 7 months ago
executor f894f7b176 Revert "reduce dedupe by wrapping in general worker class" 8 months ago
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 10 months ago
lora 8be299e78b fix: lora load check 7 months ago
modeling 85a865cc00 feat: fp8 quant 7 months ago
processing 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
quantization c20073824a cleanup 7 months ago
spec_decode 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
task_handler f894f7b176 Revert "reduce dedupe by wrapping in general worker class" 8 months ago
transformers_utils 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 months ago
py.typed 1c988a48b2 fix logging and add py.typed 1 year ago