AlpinDale
|
e75daadfbd
don't copy LogitsProcessors
|
10 months ago |
AlpinDale
|
fd2c06e6b4
defensively copy
|
10 months ago |
AlpinDale
|
5638e08d83
Merge branch 'main' into sampler_order_v2
|
10 months ago |
AlpinDale
|
16615784b3
fix: prefix cache for turing gpus
|
10 months ago |
AlpinDale
|
7dc73a779a
fix: properly perform garbage collection for lora (#277)
|
10 months ago |
AlpinDale
|
697c06c4f5
fix: LoRA support for mixtral (#276)
|
10 months ago |
AlpinDale
|
4b80b42362
fix: memory leaks due to nccl cuda graphs (#275)
|
10 months ago |
AlpinDale
|
e31c6f0b45
feat: refactor modeling logic and support more models (#274)
|
10 months ago |
AlpinDale
|
7d6ba53602
feat: fused top-k kernels for MoE (#273)
|
10 months ago |
AlpinDale
|
a3cab09b69
chore: logging env variable
|
10 months ago |
AlpinDale
|
2c08aa5af4
chore: remove eos token from output (#272)
|
10 months ago |
AlpinDale
|
8e1cd54497
fix: do not include fp8 for rocm (#271)
|
10 months ago |
AlpinDale
|
6a63ab4ec3
fix: remote encode request if using ray (#270)
|
10 months ago |
AlpinDale
|
224b87b484
feat: add fused mixtral moe support (#238)
|
10 months ago |
Thomas Xin
|
43cf0e98a0
fix: worker initialization on WSL (#260)
|
10 months ago |
swadical
|
0527131e93
fix: grammar logits processor (#268)
|
10 months ago |
Stefan Gligorijevic
|
a41366fea1
multiple fixes
|
10 months ago |
AlpinDale
|
2370dbcfd8
feat: OPT model support (#266)
|
10 months ago |
AlpinDale
|
6b8faa7ee8
yapf
|
10 months ago |
AlpinDale
|
177646864a
Merge branch 'main' into sampler_order_v2
|
10 months ago |
AlpinDale
|
4360684667
fix: cuda version in wheel
|
10 months ago |
TearGosling
|
80e8a14949
feat: add pygchat Jinja template (#218)
|
11 months ago |
Stefan Gligorijevic
|
af72d6bf73
silly oversight
|
11 months ago |
Stefan Gligorijevic
|
a744bf95d9
better(and correct) fix for quad
|
11 months ago |
Stefan Gligorijevic
|
bc040c78b7
only apply quadratic sampling when requested
|
11 months ago |
Stefan Gligorijevic
|
0cfc7e364f
properly set default order
|
11 months ago |
Stefan Gligorijevic
|
57d822b4c8
another bit missed in merge conflict resolution
|
11 months ago |
Stefan Gligorijevic
|
d6282a2f65
incorrectly resolved merge
|
11 months ago |
Stefan Gligorijevic
|
916b252524
fix more missing bits
|
11 months ago |
Stefan Gligorijevic
|
53017ce4f3
fix? sampling params for quadratic
|
11 months ago |