AlpinDale
|
411ac4f405
vlm: add support for Qwen2-VL model (#1015)
|
1 week ago |
AlpinDale
|
be59e30139
vlm: add support for video modality + llava next video (#1014)
|
1 week ago |
AlpinDale
|
313e198557
api: implement OpenAI-compatible tools API for Hermes/Mistral models (#993)
|
1 week ago |
AlpinDale
|
9ff3239ce2
fix: gguf vocab embddings in TP (#958)
|
2 weeks ago |
khanonnie
|
e1eb7fbedc
fix: SentencePieceTokenizer error when using mistral tokenizer mode (#943)
|
2 weeks ago |
AlpinDale
|
53d0ba7c7c
api: add endpoint for loading and unloading the model (#926)
|
2 weeks ago |
AlpinDale
|
538471f76e
chore: bump mistral_common to 1.5.0 (#844)
|
1 month ago |
AlpinDale
|
2f61644f6e
SPMD optimizations (#824)
|
1 month ago |
Fizz~
|
8a71788372
Add OLMoE (#772)
|
2 months ago |
AlpinDale
|
d7309453f6
fix: add pandas to requirements (#756)
|
3 months ago |
AlpinDale
|
d289c3855b
fix: install protobuf for cpu (#716)
|
3 months ago |
AlpinDale
|
d5033e12fd
feat: implement mistral tokenizer mode (#711)
|
3 months ago |
AlpinDale
|
04da8c33bd
Revert "chore: use the `compressed-tensors` library to avoid code reuse (#704)" (#706)
|
3 months ago |
AlpinDale
|
f76f2a5af0
feat: add aphrodite plugin system (#705)
|
3 months ago |
AlpinDale
|
f5bbf07c90
chore: use the `compressed-tensors` library to avoid code reuse (#704)
|
3 months ago |
AlpinDale
|
3693028340
feat: support for Audio modality (#698)
|
3 months ago |
AlpinDale
|
ec32f999bc
build: bump cmake to 3.26 (#691)
|
3 months ago |
AlpinDale
|
8cfbe62a7c
chore: bump lmfe to v0.10.6 and include triton for tpu and xpu dockerfiles (#682)
|
3 months ago |
AlpinDale
|
62111fab17
feat: allow serving encoder-decoder models in the API server (#664)
|
4 months ago |
AlpinDale
|
f1d0b77c92
[0.6.0] Release Candidate (#481)
|
4 months ago |
AlpinDale
|
9d81716bfd
[v0.5.3] Release Candidate (#388)
|
8 months ago |