AlpinDale b1caee23a6 cache the p2p access check for memory saving 9 months ago
..
__init__.py 07aa2a492f upstream: add option to specify tokenizer 1 year ago
api_server.py 2aec297c55 feat: add embeddings endpoint to openai rest-api server. (#363) 9 months ago
embeddings.py 2aec297c55 feat: add embeddings endpoint to openai rest-api server. (#363) 9 months ago
protocol.py b4fcaf7aa3 add sampling param for left-truncating prompt tokens 9 months ago
samplers.json ac82b67f75 feat: naive context shift and various QoL changes (#289) 10 months ago
serving_chat.py 5ab7a159d7 fix formatting for previous commit 9 months ago
serving_completions.py 531969a0b2 move merge_async_iterators to common utils 9 months ago
serving_engine.py b1caee23a6 cache the p2p access check for memory saving 9 months ago