Predicting rice phenotypes with meta and multi-target learning
Journal article, 2020

The features in some machine learning datasets can naturally be divided into groups. This is the case with genomic data, where features can be grouped by chromosome. In many applications it is common for these groupings to be ignored, as interactions may exist between features belonging to different groups. However, including a group that does not influence a response introduces noise when fitting a model, leading to suboptimal predictive accuracy. Here we present two general frameworks for the generation and combination of meta-features when feature groupings are present. Furthermore, we make comparisons to multi-target learning, given that one is typically interested in predicting multiple phenotypes. We evaluated the frameworks and multi-target learning approaches on a genomic rice dataset where the regression task is to predict plant phenotype. Our results demonstrate that there are use cases for both the meta and multi-target approaches, given that overall, they significantly outperform the base case.

Multi-target learning

Bioinformatics

Rice

Machine learning

Meta-learning

Author

Oghenejokpeme I. Orhobor

University of Cambridge

Nickolai N. Alexandrov

International Rice Research Institute

Ross King

Chalmers, Biology and Biological Engineering, Systems and Synthetic Biology

Alan Turing Institute

University of Cambridge

Machine Learning

0885-6125 (ISSN) 1573-0565 (eISSN)

Vol. 109 11 2195-2212

Subject Categories

Language Technology (Computational Linguistics)

Bioinformatics (Computational Biology)

Computer Vision and Robotics (Autonomous Systems)

DOI

10.1007/s10994-020-05881-9

More information

Latest update

12/3/2020