We are proud to offer the Sama-Coco dataset, a relabelling of the Coco-2017 dataset by our own in-house Sama associates (here’s more information about our people!). We invite the Machine Learning (ML) community to use it for anything you would like to do – all free of charge and ungated.
This is part of our ongoing effort to redefine data quality for the modern age, and to contribute to the wider research and development efforts of the ML community. Here are the ungated links to the two datasets (both covered by the Creative Commons license) so that you can get started right away.


In the span of a single generation, the way we consume "entertainment content and popular media" has shifted from a scheduled, shared experience to an on-demand, personalized universe. What was once a passive diversion is now a powerful cultural engine—one that dictates fashion, influences political discourse, and even rewires our neural pathways.
Description: Create a personalized mood board where users can curate and organize their favorite photos, creating a visually appealing and easily accessible collection. xxx.photos.funia.com
Artificial Intelligence is no longer just a backend tool; it is a primary driver of content production and personalization. Beyond the Screen: How Entertainment Content and Popular