1
0
AlpinDale c20073824a cleanup 7 сар өмнө
..
attention 66b7bc4415 sliding window in prefix kernel 7 сар өмнө
common 42998e423c better quant verification 7 сар өмнө
distributed 096d9eb6c5 enhance nvlink detection 7 сар өмнө
endpoints fb7825df8f squash logprobs 7 сар өмнө
engine 42998e423c better quant verification 7 сар өмнө
executor f894f7b176 Revert "reduce dedupe by wrapping in general worker class" 7 сар өмнө
kv_quant e42a78381a feat: switch from pylint to ruff (#322) 10 сар өмнө
lora 8be299e78b fix: lora load check 7 сар өмнө
modeling 85a865cc00 feat: fp8 quant 7 сар өмнө
processing 9d81716bfd [v0.5.3] Release Candidate (#388) 8 сар өмнө
quantization c20073824a cleanup 7 сар өмнө
spec_decode 9d81716bfd [v0.5.3] Release Candidate (#388) 8 сар өмнө
task_handler f894f7b176 Revert "reduce dedupe by wrapping in general worker class" 7 сар өмнө
transformers_utils 9d81716bfd [v0.5.3] Release Candidate (#388) 8 сар өмнө
__init__.py 9d81716bfd [v0.5.3] Release Candidate (#388) 8 сар өмнө
py.typed 1c988a48b2 fix logging and add py.typed 1 жил өмнө