Galaxy Zoo Evo: A Massive Labeled Dataset for Galaxy Image Analysis

Published:Dec 29, 2025 18:51
1 min read
ArXiv

Analysis

This paper introduces a significant contribution to the field of astronomy and computer vision by providing a large, human-annotated dataset of galaxy images. The dataset, Galaxy Zoo Evo, offers detailed labels for a vast number of images, enabling the development and evaluation of foundation models. The dataset's focus on fine-grained questions and answers, along with specialized subsets for specific astronomical tasks, makes it a valuable resource for researchers. The potential for domain adaptation and learning under uncertainty further enhances its importance. The paper's impact lies in its potential to accelerate the development of AI models for astronomical research, particularly in the context of future space telescopes.

Reference

GZ Evo includes 104M crowdsourced labels for 823k images from four telescopes.