ScarfBench: Why AI Agents Still Can't Migrate Enterprise Java Apps
IBM Research just dropped a benchmark that reveals a harsh truth: frontier coding agents achieve less than 10% success migrating real Java apps. The problem isn't code—it's everything else.