Light-curve classification with recurrent neural networks for GOTO : dealing with imbalanced data

Burhanudin, U F and Maund, Justyn and Killestein, Thomas and Ackley, K and Dyer, Martin J and Lyman, Joe and Ulaczyk, K and Cutter, R. and Mong, Y-L and Steeghs, D and Galloway, D K and Dhillon, Vik and O’Brien, P and Ramsay, G and Noysena, K and Kotak, R and Breton, Rene P and Nuttall, L and Pallé, E and Pollacco, D and Thrane, E and Awiphan, S and Chote, P and Chrimes, A and Daw, E and Duffy, Christopher and Eyles-Ferris, RAJ and Gompertz, Benjamin and Heikkilä, T and Irawati, P and Kennedy, Mark R and Levan, A and Littlefair, S and Makrygianni, L and Mata-Sánchez, D and Mattila, S and McCormac, J and Mkrtichian, D and Mullaney, J and Sawangwit, U and Stanway, Elizabeth and Starling, Rhaana and Strøm, P and Tooke, S and Wiersema, K (2021) Light-curve classification with recurrent neural networks for GOTO : dealing with imbalanced data. Monthly Notices of the Royal Astronomical Society, 505 (3). pp. 4345-4361. ISSN 0035-8711

Full text not available from this repository.

Abstract

The advent of wide-field sky surveys has led to the growth of transient and variable source discoveries. The data deluge produced by these surveys has necessitated the use of machine learning (ML) and deep learning (DL) algorithms to sift through the vast incoming data stream. A problem that arises in real-world applications of learning algorithms for classification is imbalanced data, where a class of objects within the data is underrepresented, leading to a bias for overrepresented classes in the ML and DL classifiers. We present a recurrent neural network (RNN) classifier that takes in photometric time-series data and additional contextual information (such as distance to nearby galaxies and on-sky position) to produce real-time classification of objects observed by the Gravitational-wave Optical Transient Observer, and use an algorithm-level approach for handling imbalance with a focal loss function. The classifier is able to achieve an Area Under the Curve (AUC) score of 0.972 when using all available photometric observations to classify variable stars, supernovae, and active galactic nuclei. The RNN architecture allows us to classify incomplete light curves, and measure how performance improves as more observations are included. We also investigate the role that contextual information plays in producing reliable object classification.

Item Type:
Journal Article
Journal or Publication Title:
Monthly Notices of the Royal Astronomical Society
Uncontrolled Keywords:
/dk/atira/pure/subjectarea/asjc/3100/3103
Subjects:
?? astronomy and astrophysicsspace and planetary science ??
ID Code:
227765
Deposited By:
Deposited On:
25 Feb 2025 09:25
Refereed?:
Yes
Published?:
Published
Last Modified:
25 Feb 2025 09:25