Download Community Forensics

(06/07/2025): The small version of the dataset is now available on Hugging Face.
(04/09/2025): The dataset is now available on Hugging Face. Existing links are taken down due to frequent issues during downloading.
(01/16/2025): Model weights released.
(01/13/2025): Filtered training dataset is released.
(11/20/2024): Full training dataset is released. We are also working on releasing the classifier model weights.
(11/06/2024): We currently only release the Train-small subset of our dataset, which is about 10% of the full dataset. The full dataset will be released in the coming weeks. The Public version of our evaluation set is also available for download. The instructions to reconstruct the Comprehensive evaluation set will be released in the future.

The dataset is provided for research purposes only. Each image in this dataset has been generated by the models with their respective licenses. Please refer to the metadata for license information.


Full Dataset: Full dataset including detailed metadata in parquet format. The dataset is available on Hugging Face.
Train-small: A small subset of the training dataset, which is about 11% of the full dataset. This subset contains real data with redistributable licenses, and is intended for easier prototyping and testing.
Model Weights: Classifier model weights. Please check the GitHub repository for the model code.

Download the dataset by clicking the link below:

Download Full Dataset (1.1TB) Download Small Dataset (278GB) Download Model Weights