AlpinDale
|
c20073824a
cleanup
|
7 months ago |
AlpinDale
|
85a865cc00
feat: fp8 quant
|
7 months ago |
AlpinDale
|
8be299e78b
fix: lora load check
|
7 months ago |
AlpinDale
|
096d9eb6c5
enhance nvlink detection
|
7 months ago |
AlpinDale
|
fb7825df8f
squash logprobs
|
7 months ago |
AlpinDale
|
66b7bc4415
sliding window in prefix kernel
|
7 months ago |
AlpinDale
|
42998e423c
better quant verification
|
7 months ago |
AlpinDale
|
483c95a2f8
fix ops in gptq and awq
|
7 months ago |
AlpinDale
|
8f9cb7235c
chore: allow multiple served model names
|
7 months ago |
AlpinDale
|
fc80f57967
fix: correct file name for qwen2 moe
|
7 months ago |
AlpinDale
|
f894f7b176
Revert "reduce dedupe by wrapping in general worker class"
|
7 months ago |
AlpinDale
|
082b0b03bc
Revert "actually run the workers"
|
7 months ago |
AlpinDale
|
36cf32649d
actually run the workers
|
7 months ago |
AlpinDale
|
9fff6fb507
reduce dedupe by wrapping in general worker class
|
7 months ago |
AlpinDale
|
b92bddafe9
time.monotonic() -> time.time()
|
7 months ago |
AlpinDale
|
0178b4d976
docker: add AWS Neuron Docker image
|
7 months ago |
AlpinDale
|
949f0445de
readme: update installation command
|
8 months ago |
Naomiusearch
|
893158a4ed
fix: quants installation on ROCm (#469)
|
8 months ago |
AlpinDale
|
5ee79a1692
readme: update for 0.5.3
|
8 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |
Krovius
|
205c8e4106
fix: kobold api /tokencount (#424)
|
8 months ago |
50h100a
|
f663d3fccc
Merge pull request #397 from 50h100a/pr_samplerasserts
|
9 months ago |
50h100a
|
85ae23ac3c
Missed .items() and assert
|
9 months ago |
50h100a
|
43c9858854
Merge pull request #244 from PygmalionAI/faster_topk
|
9 months ago |
50h100a
|
1da7fd64bc
Merge pull request #396 from 50h100a/pr_notneuron
|
9 months ago |
50h100a
|
0634b8a3a6
fix memory pinning conditional
|
9 months ago |
50h100a
|
a0b21a3e42
Merge pull request #358 from 50h100a/pr_samplers
|
9 months ago |
50h100a
|
bd564148e2
Merge branch 'main' of https://github.com/PygmalionAI/aphrodite-engine into ffs
|
9 months ago |
50h100a
|
d3dd170a7d
merge main
|
9 months ago |
AlpinDale
|
78d66f16d1
Chunked Prefill Part 1 (#384)
|
9 months ago |