AlpinDale | b02e633691 | configure endpoint | 5 months ago
AlpinDale | ed759f065d | chore: tokenizer_revision -> revision | 6 months ago
AlpinDale | 2e0b115ce1 | move func tracing to utils | 6 months ago
AlpinDale | 41338053e7 | feat: add shutdown method to engine | 6 months ago
AlpinDale | 199e776722 | chore: move ray utils to executor dir | 6 months ago
AlpinDale | e7b1368156 | feat: Phi3 support | 6 months ago
AlpinDale | 1225c4dfd6 | fix: illegal mem access crash for marlin | 6 months ago
AlpinDale | d1a3c7bc2c | chore: simplify try-finally logic in pynccl | 6 months ago
AlpinDale | 440384d776 | chore: use nvidia-ml-py instead of pynvml | 6 months ago
AlpinDale | 46159b107a | formatting: pt1 | 6 months ago
AlpinDale | 4c746d8baa | chore: init nccl using the gloo backend | 6 months ago
AlpinDale | bf2dd2bee9 | feat: allow multiple sampling params in LLM class | 6 months ago
Orion | a2a24e9b0d | feat: list support in message.content (#503) | 6 months ago
Bruno Renié | 9c45fe9a2a | openai: fix metrics endpoint (#512) | 6 months ago
AlpinDale | fca911ee0a | vLLM Upstream Sync (#526) | 6 months ago
AlpinDale | 8be299e78b | fix: lora load check | 7 months ago
AlpinDale | 096d9eb6c5 | enhance nvlink detection | 7 months ago
AlpinDale | fb7825df8f | squash logprobs | 7 months ago
AlpinDale | 66b7bc4415 | sliding window in prefix kernel | 7 months ago
AlpinDale | 42998e423c | better quant verification | 7 months ago
AlpinDale | 483c95a2f8 | fix ops in gptq and awq | 7 months ago
AlpinDale | 8f9cb7235c | chore: allow multiple served model names | 7 months ago
AlpinDale | fc80f57967 | fix: correct file name for qwen2 moe | 7 months ago
AlpinDale | f894f7b176 | Revert "reduce dedupe by wrapping in general worker class" | 7 months ago
AlpinDale | 082b0b03bc | Revert "actually run the workers" | 7 months ago
AlpinDale | 36cf32649d | actually run the workers | 7 months ago
AlpinDale | 9fff6fb507 | reduce dedupe by wrapping in general worker class | 7 months ago
AlpinDale | b92bddafe9 | time.monotonic() -> time.time() | 7 months ago
AlpinDale | 0178b4d976 | docker: add AWS Neuron Docker image | 7 months ago
AlpinDale | 949f0445de | readme: update installation command | 8 months ago