🏆 Gemini3-Pro: 49.7% · Visual Reasoning BabyVision A benchmark for visual reasoning that challenges frontier MLLMs, yet that 3-year-olds can solve.