.. |
attention
|
66b7bc4415
sliding window in prefix kernel
|
7 mēneši atpakaļ |
common
|
42998e423c
better quant verification
|
7 mēneši atpakaļ |
distributed
|
096d9eb6c5
enhance nvlink detection
|
7 mēneši atpakaļ |
endpoints
|
fb7825df8f
squash logprobs
|
7 mēneši atpakaļ |
engine
|
42998e423c
better quant verification
|
7 mēneši atpakaļ |
executor
|
f894f7b176
Revert "reduce dedupe by wrapping in general worker class"
|
7 mēneši atpakaļ |
kv_quant
|
e42a78381a
feat: switch from pylint to ruff (#322)
|
10 mēneši atpakaļ |
lora
|
8be299e78b
fix: lora load check
|
7 mēneši atpakaļ |
modeling
|
85a865cc00
feat: fp8 quant
|
7 mēneši atpakaļ |
processing
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 mēneši atpakaļ |
quantization
|
c20073824a
cleanup
|
7 mēneši atpakaļ |
spec_decode
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 mēneši atpakaļ |
task_handler
|
f894f7b176
Revert "reduce dedupe by wrapping in general worker class"
|
7 mēneši atpakaļ |
transformers_utils
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 mēneši atpakaļ |
__init__.py
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 mēneši atpakaļ |
py.typed
|
1c988a48b2
fix logging and add py.typed
|
1 gadu atpakaļ |