astromorph: self-supervised machine learning pipeline for astronomical morphology analysis
Artikel i vetenskaplig tidskrift, 2026

Context. Modern telescopes generate increasingly large and diverse datasets, often consisting of complex and morphologically rich structures. To efficiently explore such data requires automated methods that can extract and organize physically meaningful information, ideally without the need for extensive manual interaction. Aims. Our aim is to provide a user-friendly implementation of a self-supervised machine learning framework to explore morphological properties of large datasets, based on the Bootstrap Your Own Latents (BYOL) method. By enabling the generation of meaningful image embeddings without manually labelled data, the framework will enable key tasks such as clustering, anomaly detection, and similarity-based exploration. Methods. We present astromorph, a Python package that implements the BYOL method in a way tailored for astronomical imaging. In contrast to existing BYOL implementations, astromorph accommodates data of varying dimensions and resolutions, including both single-channel FITS images and multi-channel spectral cubes. The package is built with usability in mind, and offers streamlined pipeline scripts for ease of use as well as deeper customization options via PyTorch-based classes. Results. To demonstrate the utility of astromorph, we apply it in two contrasting science cases representing different astronomical domains: images of protoplanetary disks observed with the Atacama Large Millimeter/submillimeter Array (ALMA), and infrared dark clouds observed with Spitzer and Herschel. In both cases we demonstrate how astromorph produces scientifically meaningful embeddings that capture morphological differences and similarities across large samples. Conclusions. astromorph enables users to apply a robust, label-free approach for uncovering morphological patterns in astronomical datasets. The successful application to two markedly different datasets suggests that the pipeline is broadly applicable across a wide range of imaging-rich astronomical contexts, providing a user-friendly tool for advancing discovery in observational astronomy.

methods: numerical

methods: data analysis

ISM: clouds

protoplanetary disks

Författare

Per Bjerkeli

Fysik, kemi och bioteknik samt matematik och tekniskt basår

Jouni Kainulainen

Chalmers, Rymd-, geo- och miljövetenskap, Astronomi och plasmafysik

Maria Carmen Toribio Perez

Chalmers, Rymd-, geo- och miljövetenskap, Onsala rymdobservatorium

Leon Boschman

Chalmers, Fysik, E-commons

Otoniel Maya Lucas

Chalmers, Fysik, E-commons

Astronomy and Astrophysics

0004-6361 (ISSN) 1432-0746 (eISSN)

Vol. 708 A156

Ämneskategorier (SSIF 2025)

Programvaruteknik

Datorgrafik och datorseende

Astronomi, astrofysik och kosmologi

Infrastruktur

Chalmers e-Commons (inkl. C3SE, 2020-)

DOI

10.1051/0004-6361/202557638

Mer information

Senast uppdaterat

2026-04-17