🤖 Artificial Intelligence OpenAlex

A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction

📅 June 27, 2022 👤 Nicholas Pudjihartono, Tayaza Fadason, Andreas W. Kempa-Liehr et al. 📖 Frontiers in Bioinformatics 📊 802 citations

🤖 Plain-English Summary

Machine learning can detect patterns in large, unstructured, and complex datasets, and one promising application is predicting disease risk from patient genetic data. Because genotype datasets typically contain far more features than samples, models generalize better after feature selection, which keeps only the most "informative" features and removes noisy, irrelevant, and redundant ones.

🔑 Key Findings

  • One of the promising applications of machine learning is in precision medicine, where disease risk is predicted using patient genetic data.
  • However, building an accurate prediction model from genotype data remains challenging because of the so-called "curse of dimensionality" (i.e., far more features than samples).
  • Therefore, the generalizability of machine learning models benefits from feature selection, which aims to extract only the most "informative" features and remove noisy "non-informative," irrelevant and redundant features.
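
The idea in the findings above can be illustrated with a minimal sketch. It assumes a simple univariate filter (ranking features by absolute Pearson correlation with the label), which is chosen here for illustration and is not taken from the paper. The data are synthetic, and the names `make_data`, `select_top_k`, and `pearson` are hypothetical helpers. The setup mimics the "more features than samples" situation: 40 samples, 200 features, of which only three carry signal.

```python
import random
import statistics


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / (var_x * var_y) ** 0.5


def make_data(n_samples=40, n_features=200, informative=(0, 1, 2), seed=0):
    """Synthetic data (an assumption for illustration, not real genotypes)."""
    rng = random.Random(seed)
    y = [i % 2 for i in range(n_samples)]  # balanced binary labels
    X = []
    for label in y:
        # Every feature starts as pure noise ...
        row = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
        # ... and only the informative columns depend on the label.
        for j in informative:
            row[j] = 3.0 * label + rng.gauss(0.0, 0.5)
        X.append(row)
    return X, y


def select_top_k(X, y, k):
    """Keep the k features with the largest absolute correlation to the label."""
    scores = []
    for j in range(len(X[0])):
        column = [row[j] for row in X]
        scores.append((abs(pearson(column, y)), j))
    scores.sort(reverse=True)
    return [j for _, j in scores[:k]]


X, y = make_data()
selected = select_top_k(X, y, k=3)
print(sorted(selected))
```

A downstream model would then be trained on only the `selected` columns instead of all 200. A univariate filter like this scores each feature on its own, so it can drop irrelevant features but does not detect redundancy between features.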

💡 Why This Matters

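Disease risk models built on patient genetic data must cope with far more features than samples. Feature selection addresses this problem directly by discarding non-informative features, which helps these models generalize and supports their use in precision medicine.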

📋 Article Details

Category 🤖 Artificial Intelligence
Published Jun 27, 2022
Journal Frontiers in Bioinformatics
Authors Nicholas Pudjihartono, Tayaza Fadason, Andreas W. Kempa-Liehr, Justin M. O’Sullivan
DOI 10.3389/fbinf.2022.927312
Citations 802
Source OpenAlex
