Claude Opus 4.7 has scored 72.8 on the Thematic Generalization Benchmark, a significant drop from the 80.6 recorded by Opus 4 ...