.. |
kobold
|
6c50f5b067
chore: include stop strings in output (#168)
|
il y a 1 an |
ooba
|
63c28919a0
Revert "fix: correct auto ntk scaling_factor for 4k ctx case" (#149)
|
il y a 1 an |
openai
|
b9b295d74e
chore: backlogs 1 (#191)
|
il y a 1 an |
__init__.py
|
e52de7de70
feat: add API endpoint with FastAPI
|
il y a 1 an |
llm.py
|
7d91e9e0f2
feat: CUDA graphs (#172)
|
il y a 1 an |