Open-Source Research Agents Are Making Proprietary Benchmarks Obsolete
PokeeResearch-7B shows significant performance gains on challenging benchmarks like GAIA and HLE, suggesting that open models are closing the gap with closed systems in complex, multi-step research tasks.