PygmalionAI's large-scale inference engine
pygmalion.chat
It is designed to serve as the inference endpoint for the PygmalionAI website, and to allow serving the Pygmalion models to a large number of users with blazing fast speeds (thanks to vLLM's Paged Attention).
AlpinDale e7ef567c19 stuff | il y a 1 an | |
---|---|---|
geppetto | il y a 1 an |