Lessons from testing three AI agents on the same complex task | Dark Hacker News