コミット履歴

作者 SHA1 メッセージ 日付
  Nicholas Carlini 16f8d13900 Add a few more recent tests 5 ヶ月 前
  Nicholas Carlini be07d215b2 Improve evaluations for several tests 7 ヶ月 前
  Nicholas Carlini 2e14f5e5d0 Fix python version errors 7 ヶ月 前
  Nicholas Carlini 53d1251d88 A dozen new tests from the last month 7 ヶ月 前
  Nicholas Carlini 38d421bf8c Code golf questions! 8 ヶ月 前
  Nicholas Carlini 5b7c71d4c4 Add try/catch to stop failures 9 ヶ月 前
  Grant Williams b1cfa13350 update RustRun docstring 9 ヶ月 前
  Nicholas Carlini baec772990 Fix typo in JSON object conversion command; #1 from Evanc123/patch-1 9 ヶ月 前
  Viswa c09ecb6e9d update python env varaible 9 ヶ月 前
  Evan Cater da9d53df54 Update evaluator.py 9 ヶ月 前
  Nicholas Carlini 872a90b3ae Minor changes 10 ヶ月 前
  Nicholas Carlini 23c2965686 Add podman option 10 ヶ月 前
  Nicholas Carlini e50ca985fc A bunch of changes for release 10 ヶ月 前
  Nicholas Carlini e676a59407 Produce logfile of runs 11 ヶ月 前
  Nicholas Carlini 1d173a1627 Prepare description addition 11 ヶ月 前
  srxzr 521ed3896c adding preample and also new tests 11 ヶ月 前
  Nicholas Carlini 5656488a16 Add ability for llm to work with interactive processes 11 ヶ月 前
  Nicholas Carlini e425c714aa More tests, fixes to models 11 ヶ月 前
  Nicholas Carlini c1a909f67b Five new tests 11 ヶ月 前
  Nicholas Carlini 0737c24c6f Split llms across files, a few new tests 11 ヶ月 前
  Nicholas Carlini 70e5ca5889 A bunch of tests 11 ヶ月 前
  Nicholas Carlini af7c7c67f7 Add a bunch of evaluators, rewrite the eval interface 11 ヶ月 前
  Nicholas Carlini f15c042cf2 Initial commit; framework skeleton 1 年間 前