Commit history

Author SHA1 Message Date
  Nicholas Carlini 16f8d13900 Add a few more recent tests 7 months ago
  Nicholas Carlini be07d215b2 Improve evaluations for several tests 8 months ago
  Nicholas Carlini 2e14f5e5d0 Fix python version errors 8 months ago
  Nicholas Carlini 53d1251d88 A dozen new tests from the last month 8 months ago
  Nicholas Carlini 38d421bf8c Code golf questions! 9 months ago
  Nicholas Carlini 5b7c71d4c4 Add try/catch to stop failures 10 months ago
  Grant Williams b1cfa13350 update RustRun docstring 10 months ago
  Nicholas Carlini baec772990 Fix typo in JSON object conversion command; #1 from Evanc123/patch-1 11 months ago
  Viswa c09ecb6e9d update python env varaible 11 months ago
  Evan Cater da9d53df54 Update evaluator.py 11 months ago
  Nicholas Carlini 872a90b3ae Minor changes 11 months ago
  Nicholas Carlini 23c2965686 Add podman option 11 months ago
  Nicholas Carlini e50ca985fc A bunch of changes for release 11 months ago
  Nicholas Carlini e676a59407 Produce logfile of runs 1 year ago
  Nicholas Carlini 1d173a1627 Prepare description addition 1 year ago
  srxzr 521ed3896c adding preample and also new tests 1 year ago
  Nicholas Carlini 5656488a16 Add ability for llm to work with interactive processes 1 year ago
  Nicholas Carlini e425c714aa More tests, fixes to models 1 year ago
  Nicholas Carlini c1a909f67b Five new tests 1 year ago
  Nicholas Carlini 0737c24c6f Split llms across files, a few new tests 1 year ago
  Nicholas Carlini 70e5ca5889 A bunch of tests 1 year ago
  Nicholas Carlini af7c7c67f7 Add a bunch of evaluators, rewrite the eval interface 1 year ago
  Nicholas Carlini f15c042cf2 Initial commit; framework skeleton 1 year ago