AlpinDale
|
9181fa0396
feat: Triton kernels for sampling (#383)
|
9 months ago |
AlpinDale
|
e3252edd07
fix: remove event and stream, add typing (#382)
|
9 months ago |
AlpinDale
|
375f24ccca
fix: optimize context shift performance (#380)
|
9 months ago |
AlpinDale
|
33b3786175
fix: cache neuron checks (#379)
|
9 months ago |
AlpinDale
|
8c9cabf4c8
fix: display error in ray before deadlock (#378)
|
9 months ago |
50h100a
|
dc09dc2b4d
Merge branch 'main' into pr_samplers
|
9 months ago |
AlpinDale
|
f587953f46
fix: yapf
|
9 months ago |
AlpinDale
|
4b99ac15b7
fix: do not deepcopy metadata
|
9 months ago |
AlpinDale
|
17b034613d
chore: make metadata a dataclass (#377)
|
9 months ago |
AlpinDale
|
9534fcfb7b
fix: build error
|
9 months ago |
AlpinDale
|
0b35176089
feat: add context-free grammars (#376)
|
9 months ago |
AlpinDale
|
feb5840f2a
feat: async tokenization (#374)
|
9 months ago |
IggoOnCode
|
2aec297c55
feat: add embeddings endpoint to openai rest-api server. (#363)
|
9 months ago |
AlpinDale
|
29c241c115
fix: explicitly disallow installation on non-linux platforms (#373)
|
9 months ago |
AlpinDale
|
439a826712
fix: broadcast group
|
9 months ago |
AlpinDale
|
935027bdcc
feat: dynamic shared memory allocation for moe align block size (#372)
|
9 months ago |
AlpinDale
|
97a2b26c97
fix: assertion error when use_sliding_window is present
|
9 months ago |
AlpinDale
|
e702f587cf
feat: add batched RoPE kernels (#371)
|
9 months ago |
AlpinDale
|
3d6695cfbb
feat: add approximate gelu activation kernels (#370)
|
9 months ago |
AlpinDale
|
5fa15b4435
fix: double free with sliding window (#369)
|
9 months ago |
AlpinDale
|
72cd8494aa
feat: mistral neuron support (#368)
|
9 months ago |
AlpinDale
|
0f6d56b07f
feat: model executor refactor (#367)
|
9 months ago |
AlpinDale
|
b361096463
fix: tokenizer when using ray (#366)
|
9 months ago |
AlpinDale
|
f8dfac6372
chore: attention refactor and upstream sync apr01 (#365)
|
9 months ago |
50h100a
|
35b4aa7da5
Fix logitproc for logit_bias in OAI endpoints.
|
9 months ago |
50h100a
|
7ed57e318d
Overhauled SamplingTensors construction.
|
9 months ago |
50h100a
|
a39920bc99
Merge pull request #355 from 50h100a/pr_seedfix
|
9 months ago |
50h100a
|
051c60736e
Merge pull request #356 from 50h100a/pr_samplerinternals
|
9 months ago |
50h100a
|
d5dbd29db4
hoist sampler internals into a single function.
|
9 months ago |
50h100a
|
b9e0ae87c5
fix fine-grained seeding.
|
9 months ago |