tmuxvim (@tmuxvim) · Original post
well, that's insane. GPT-5.5 just took over the leaderboard on ErrataBench, a large-text proofreading benchmark.