kache@yacineMTB·Quote tweet
It's funny that anthropic trained the most misanthropic model out there. Genuinely, the number 1 reason I stopped using chlaude is because it kept actually lying to me and trying to trick me. Deep down, Claude is a lying AI and I fear it is now baked into the pretrain
AN
In Vending-Bench Arena (the multiplayer version of Vending-Bench with competition dynamics), GPT-5.5 actually beats Opus 4.7. Opus 4.7 showed similar behavior to Opus 4.6: lying to suppliers and stiffing customers on refunds. GPT-5.5's tactics were clean, and it still won.
