Commit History

Author SHA1 Message Date
  Nicholas Carlini 16f8d13900 Add a few more recent tests 5 months ago
  Nicholas Carlini be07d215b2 Improve evaluations for several tests 7 months ago
  Nicholas Carlini 2e14f5e5d0 Fix python version errors 7 months ago
  Nicholas Carlini 53d1251d88 A dozen new tests from the last month 7 months ago
  Nicholas Carlini 38d421bf8c Code golf questions! 8 months ago
  Nicholas Carlini 5b7c71d4c4 Add try/catch to stop failures 9 months ago
  Grant Williams b1cfa13350 update RustRun docstring 9 months ago
  Nicholas Carlini baec772990 Fix typo in JSON object conversion command; #1 from Evanc123/patch-1 9 months ago
  Viswa c09ecb6e9d update python env varaible 9 months ago
  Evan Cater da9d53df54 Update evaluator.py 9 months ago
  Nicholas Carlini 872a90b3ae Minor changes 10 months ago
  Nicholas Carlini 23c2965686 Add podman option 10 months ago
  Nicholas Carlini e50ca985fc A bunch of changes for release 10 months ago
  Nicholas Carlini e676a59407 Produce logfile of runs 11 months ago
  Nicholas Carlini 1d173a1627 Prepare description addition 11 months ago
  srxzr 521ed3896c adding preample and also new tests 11 months ago
  Nicholas Carlini 5656488a16 Add ability for llm to work with interactive processes 11 months ago
  Nicholas Carlini e425c714aa More tests, fixes to models 11 months ago
  Nicholas Carlini c1a909f67b Five new tests 11 months ago
  Nicholas Carlini 0737c24c6f Split llms across files, a few new tests 11 months ago
  Nicholas Carlini 70e5ca5889 A bunch of tests 11 months ago
  Nicholas Carlini af7c7c67f7 Add a bunch of evaluators, rewrite the eval interface 11 months ago
  Nicholas Carlini f15c042cf2 Initial commit; framework skeleton 1 year ago