Revision history

Author SHA1 Message Date
  david 4d88c30242 conflicts resolved 2 months ago
  Nicholas Carlini a35d306f34 Add O1 option 3 months ago
  Nicholas Carlini 6bb358f269 Add GPT-4o mini result 4 months ago
  Nicholas Carlini b09d7fede5 Merge pull request #17 from simveit/main 5 months ago
  Nicholas Carlini d0ecd8c33b Update with Sonnet 3.5 and Gemini 1.5 Pro results 5 months ago
  Simon Veitner eae650718a Added groq to example config 5 months ago
  Simon Veitner 7b972e43ee Enable support for Groq models 5 months ago
  Your Name dfea2287a9 I don't have blind faith in LLMs 5 months ago
  Nicholas Carlini 16f8d13900 Add a few more recent tests 5 months ago
  Nicholas Carlini 3ad3cfde3c Update README 7 months ago
  Nicholas Carlini be07d215b2 Improve evaluations for several tests 7 months ago
  Nicholas Carlini 2e14f5e5d0 Fix python version errors 7 months ago
  Nicholas Carlini 53d1251d88 A dozen new tests from the last month 7 months ago
  Nicholas Carlini 0d2b4d9e9d Edit README to say how to generate result figures 8 months ago
  Nicholas Carlini 656a597d01 Add support for incremental builds of results 8 months ago
  Nicholas Carlini e98bcc1e22 Fix golfing question again 8 months ago
  Nicholas Carlini 5c9a6521e0 Fix 20 questions 8 months ago
  Nicholas Carlini b0d674b92c Fix golfing question 8 months ago
  Nicholas Carlini 4e890ca464 Fix webgl draw test 8 months ago
  Nicholas Carlini 38d421bf8c Code golf questions! 8 months ago
  david ebeb8876c5 Merge branch 'main' of https://github.com/carlini/yet-another-applied-llm-benchmark 8 months ago
  Nicholas Carlini 49207c3ed7 Add a few new test cases 8 months ago
  Nicholas Carlini 55175af4a1 Update anthropic llm to latest API 8 months ago
  Nicholas Carlini 0e7238803b Six new tests 8 months ago
  Nicholas Carlini 4b56a1e278 Merge pull request #15 from RyanSaxe/fix/incorrect_hparams 8 months ago
  RyanSaxe a6c1e2e36a name written twice 8 months ago
  RyanSaxe 75e11cbf67 I noticed that the hparams in the config file that were being accessed had copy-pasted code that had the wrong llm name. Updated to the right name according to class object 8 months ago
  david d4a50ae671 custom model support by OpenAI compatible API, and fix program_pipes_cpp, program_pipes_python test 8 months ago
  david 30f36116f6 demo config 8 months ago
  david 551ba34582 remove config.json 8 months ago