AlpinDale | c2be1b9f29 | formatting | 5 months ago
AlpinDale | cbfc7f96d6 | potential fix | 5 months ago
AlpinDale | 7d3194e7f4 | revert #244 | 5 months ago
AlpinDale | 3ab36e6b2d | feat: extended RoPE for Llama 3.1 (#543) | 5 months ago
AlpinDale | c9d6f9f164 | fix formatting | 5 months ago
AlpinDale | 0e75803a50 | why was this ignored by git? | 5 months ago
AlpinDale | 0d3562a7f9 | MQA in triton FA | 5 months ago
AlpinDale | 8de8034f8b | include fp8 compilation in rocm | 5 months ago
AlpinDale | 0f7ef9ef7c | fix: import in selector | 5 months ago
AlpinDale | 36660b55c2 | chore: mixtral fp8 w/ static scales (#542) | 5 months ago
AlpinDale | c21af7acad | feat: `DistributedGPUExecutor` abstract class (#541) | 5 months ago
AlpinDale | b178ae4b4a | chore: generalize linear_method to be quant_method (#540) | 5 months ago
AlpinDale | a6a627d745 | fix aqlm compilation | 5 months ago
AlpinDale | aed64884c6 | feat: prompt logprobs with chunked prefill (#539) | 5 months ago
Naomiusearch | 9bcbf61296 | There's no aphrodite.py in outlines repo (#531) | 6 months ago
cloud11665 | cc9a801eed | [bugfix] change c++ std to 20 (#529) | 6 months ago
AlpinDale | ed759f065d | chore: tokenizer_revision -> revision | 6 months ago
AlpinDale | 2e0b115ce1 | move func tracing to utils | 6 months ago
AlpinDale | 41338053e7 | feat: add shutdown method to engine | 6 months ago
AlpinDale | 199e776722 | chore: move ray utils to executor dir | 6 months ago
AlpinDale | e7b1368156 | feat: Phi3 support | 6 months ago
AlpinDale | 1225c4dfd6 | fix: illegal mem access crash for marlin | 6 months ago
AlpinDale | d1a3c7bc2c | chore: simplify try-finally logic in pynccl | 6 months ago
AlpinDale | 440384d776 | chore: use nvidia-ml-py instead of pynvml | 6 months ago
AlpinDale | 46159b107a | formatting: pt1 | 6 months ago
AlpinDale | 4c746d8baa | chore: init nccl using the gloo backend | 6 months ago
AlpinDale | bf2dd2bee9 | feat: allow multiple sampling params in LLM class | 6 months ago
Orion | a2a24e9b0d | feat: list support in message.content (#503) | 6 months ago
Bruno Renié | 9c45fe9a2a | openai: fix metrics endpoint (#512) | 6 months ago
AlpinDale | fca911ee0a | vLLM Upstream Sync (#526) | 6 months ago