Commit History

Author | SHA1 | Message | Date
Nicholas Carlini | 16f8d13900 | Add a few more recent tests | 5 months ago
Nicholas Carlini | be07d215b2 | Improve evaluations for several tests | 7 months ago
Nicholas Carlini | 2e14f5e5d0 | Fix python version errors | 7 months ago
Nicholas Carlini | 53d1251d88 | A dozen new tests from the last month | 7 months ago
Nicholas Carlini | 38d421bf8c | Code golf questions! | 8 months ago
Nicholas Carlini | 5b7c71d4c4 | Add try/catch to stop failures | 9 months ago
Grant Williams | b1cfa13350 | update RustRun docstring | 9 months ago
Nicholas Carlini | baec772990 | Fix typo in JSON object conversion command; #1 from Evanc123/patch-1 | 9 months ago
Viswa | c09ecb6e9d | update python env varaible | 9 months ago
Evan Cater | da9d53df54 | Update evaluator.py | 9 months ago
Nicholas Carlini | 872a90b3ae | Minor changes | 10 months ago
Nicholas Carlini | 23c2965686 | Add podman option | 10 months ago
Nicholas Carlini | e50ca985fc | A bunch of changes for release | 10 months ago
Nicholas Carlini | e676a59407 | Produce logfile of runs | 11 months ago
Nicholas Carlini | 1d173a1627 | Prepare description addition | 11 months ago
srxzr | 521ed3896c | adding preample and also new tests | 11 months ago
Nicholas Carlini | 5656488a16 | Add ability for llm to work with interactive processes | 11 months ago
Nicholas Carlini | e425c714aa | More tests, fixes to models | 11 months ago
Nicholas Carlini | c1a909f67b | Five new tests | 11 months ago
Nicholas Carlini | 0737c24c6f | Split llms across files, a few new tests | 11 months ago
Nicholas Carlini | 70e5ca5889 | A bunch of tests | 11 months ago
Nicholas Carlini | af7c7c67f7 | Add a bunch of evaluators, rewrite the eval interface | 11 months ago
Nicholas Carlini | f15c042cf2 | Initial commit; framework skeleton | 1 year ago