david
|
4d88c30242
conflicts resolved
|
2 months ago |
Nicholas Carlini
|
a35d306f34
Add O1 option
|
3 months ago |
Nicholas Carlini
|
6bb358f269
Add GPT-4o mini result
|
4 months ago |
Nicholas Carlini
|
b09d7fede5
Merge pull request #17 from simveit/main
|
5 months ago |
Nicholas Carlini
|
d0ecd8c33b
Update with Sonnet 3.5 and Gemini 1.5 Pro results
|
5 months ago |
Simon Veitner
|
eae650718a
Added groq to example config
|
5 months ago |
Simon Veitner
|
7b972e43ee
Enable support for Groq models
|
5 months ago |
Your Name
|
dfea2287a9
I don't have blind faith in LLMs
|
5 months ago |
Nicholas Carlini
|
16f8d13900
Add a few more recent tests
|
5 months ago |
Nicholas Carlini
|
3ad3cfde3c
Update README
|
7 months ago |
Nicholas Carlini
|
be07d215b2
Improve evaluations for several tests
|
7 months ago |
Nicholas Carlini
|
2e14f5e5d0
Fix python version errors
|
7 months ago |
Nicholas Carlini
|
53d1251d88
A dozen new tests from the last month
|
7 months ago |
Nicholas Carlini
|
0d2b4d9e9d
Edit README to say how to generate result figures
|
8 months ago |
Nicholas Carlini
|
656a597d01
Add support for incremental builds of results
|
8 months ago |
Nicholas Carlini
|
e98bcc1e22
Fix golfing question again
|
8 months ago |
Nicholas Carlini
|
5c9a6521e0
Fix 20 questions
|
8 months ago |
Nicholas Carlini
|
b0d674b92c
Fix golfing question
|
8 months ago |
Nicholas Carlini
|
4e890ca464
Fix webgl draw test
|
8 months ago |
Nicholas Carlini
|
38d421bf8c
Code golf questions!
|
8 months ago |
david
|
ebeb8876c5
Merge branch 'main' of https://github.com/carlini/yet-another-applied-llm-benchmark
|
8 months ago |
Nicholas Carlini
|
49207c3ed7
Add a few new test cases
|
8 months ago |
Nicholas Carlini
|
55175af4a1
Update anthropic llm to latest API
|
8 months ago |
Nicholas Carlini
|
0e7238803b
Six new tests
|
8 months ago |
Nicholas Carlini
|
4b56a1e278
Merge pull request #15 from RyanSaxe/fix/incorrect_hparams
|
8 months ago |
RyanSaxe
|
a6c1e2e36a
name written twice
|
8 months ago |
RyanSaxe
|
75e11cbf67
I noticed that the hparams in the config file that were being accessed had copy-pasted code that had the wrong llm name. Updated to the right name according to class object
|
8 months ago |
david
|
d4a50ae671
custom model support by OpenAI compatible API, and fix program_pipes_cpp, program_pipes_python test
|
8 months ago |
david
|
30f36116f6
demo config
|
8 months ago |
david
|
551ba34582
remove config.json
|
8 months ago |