Morphological Classification of Galaxies in S-PLUS using an Ensemble of Convolutional Networks

Published in Notas Técnicas do Centro Brasileiro de Pesquisas Físicas - NT-CBPF, 2021

Abstract: The universe is composed of galaxies that have diverse shapes. Once the structure of a galaxy is determined, it is possible to obtain important information on its formation and evolution. Morphological classification is the cataloging of galaxies according to their visual appearance and it is linked to their physical properties. A morphological classification made through visual analysis is sensible to differences in evaluation among diferent human volunteers. For this reason, systematic, objective and easily reproducible classification of galaxies has been gaining importance since the astronomer Edwin Hubble created his famous classification method. In this work, we combine accurate visual classifications of the Galaxy Zoo project with Deep Learning methods. The goal is to find an efficient technique at human performance level classification, but in a systematic and automatic way, for classification of elliptical and spiral galaxies. For this, a neural network model was created through an Ensemble of four other convolutional models, allowing a greater accuracy in the classification than what would be obtained with any one individual. Details of the individual models and improvements made to these are also described. The present work is entirely based on the analysis of images (not parameter tables) from DR1 (www.datalab.noao.edu) of the Southern Photometric Local Universe Survey (S-PLUS). In terms of classification, we achieved, with the Ensemble, an accuracy of $\approx 99 \%$ in the test sample (using pre-trained networks).

Authors: N. M. Cardoso, G. B. O. Schwarz, L. O. Dias, C. R. Bom, L. Sodre Jr, C. Mendes de Oliveira.

Paper (PDF): arXiv:2107.02287

Supplementary Materials & Data Output

Column	Description
ra	right ascension in decimal degrees (J2000)
dec	declination in decimal degrees (J2000)
r_auto	S-PLUS DR1 r-band magnitude (SDSS-like r)
mn170_pE	probability of the object being elliptical given by the classification of the trained model with objects with magnitude $r_{auto} < 17$ (mn stands for “MorphoNet”)
mn170_pS	probability of the object being spiral given by the classification of the trained model with objects with magnitude $r_{auto} < 17$
mn175_pE	probability of the object being elliptical given by the classification of the trained model with objects with magnitude $r_{auto} < 17.5$
mn175_pS	probability of the object being spiral given by the classification of the trained model with objects with magnitude $r_{auto} < 17.5$

Downloads:

Catalog: CSV
Images: Numpy Arrays

Trained network weights

Weights of trained models as described in the paper. The network was trained with TensorFlow v2 and saved in SavedModel format. Each folder contains the model of a member of the Ensemble. A file in Python3 called MorphoNet Class is available that automates the loading of the weights of each member. It is recommended to use this class instead of making direct predictions as it standardizes member predictions with respect to training data as described in the paper.

Downloads:

MN170 Weights (~1GB): Main • Mirror
MN175 Weights (~1GB): Main • Mirror
Prediction Class (for both models): MorphoNet Class

Code Samples

import os
import random
import numpy as np
import matplotlib.pyplot as plt
from mpl_toolkits.axes_grid1 import ImageGrid
import morphonet

# kind can be 'mn170' or 'mn175'
mn = morphonet.MorphoNet(kind='mn170', weights_folder='morphonet170')

# this method load the members and meta-model in memory
mn.load()

# load images to memory, using np.load in this case of .npy images
imgs = [np.load(f'colored_splus_stamps_npy/{f}') for f in os.listdir('colored_splus_stamps_npy')]

# call predict method with a list of numpy arrays of shape (128, 128, 3)
preds = mn.predict(imgs)

# select a random sample to plot
ids = random.sample(range(0, len(imgs)), 100)

# plot the reults in a grid
fig = plt.figure(figsize=(14, 16))
grid = ImageGrid(fig, 111, nrows_ncols=(7, 7), axes_pad=0.3)
for ax, _id in zip(grid, ids):
  im = imgs[_id]
  ax.set_title(f'S={preds[_id][0]:.2f} E={preds[_id][1]:.2f}')
  ax.imshow(im)
plt.show()

Share on

Twitter Facebook LinkedIn

N. M. Cardoso

Supplementary Materials & Data Output

Classifications of blind sample

Trained network weights

Code Samples

Share on