A new test of AI capabilities consists of puzzles that humans are able to solve without too much trouble, but which all leading AI models struggle with. To improve and pass the test, AI companies will need to balance problem-solving abilities with cost.
Requires account
I read it without an account.
I didn’t read it and don’t have an account.
hmm. I probably went over a limit or something then…