Apple has released Pico-Banana-400K, a 400,000-image research dataset designed to train AI image editing models, built using Google’s Gemini-2.5 technology. The dataset addresses a critical gap in open AI research by providing high-quality, shareable training data that researchers say has been limited by synthetic generations from proprietary models and inconsistent quality control.
Why this matters: Existing image editing datasets often suffer from domain shifts, unbalanced edit distributions, and quality issues that hinder the development of robust AI models, leaving researchers without adequate training resources.
How Apple built the dataset: The research team systematically created the dataset using a multi-step validation process to ensure quality and diversity.
What’s included: The dataset encompasses multiple types of AI training scenarios to help models learn both successful and unsuccessful editing outcomes.
The bigger picture: Apple’s researchers acknowledge that current image editing models, including Nano-Banana, still struggle with fine-grained spatial editing, layout extrapolation, and typography challenges.
Where to find it: The complete study is available on arXiv, with the full dataset accessible through GitHub for qualifying researchers.