Home / Research Library / Ultrafast one‐pass FASTQ data preprocessing, quali...
🤖 Artificial Intelligence OpenAlex

Ultrafast one‐pass FASTQ data preprocessing, quality control, and deduplication using fastp

📅 May 1, 2023 👤 Shifu Chen 📖 iMeta 📊 1,932 citations

🤖 Plain-English Summary

A large amount of sequencing data is generated and processed every day with the continuous evolution of sequencing technology and the expansion of sequencing applications. For instance, the duplication evaluation module has been improved, and a new deduplication module has been added.

🔑 Key Findings

  • One consequence of such sequencing data explosion is the increasing cost and complexity of data processing.
  • The preprocessing of FASTQ data, which means removing adapter contamination, filtering low-quality reads, and correcting wrongly represented bases, is an indispensable but resource intensive part of sequencing data analysis.
  • Therefore, although a lot of software applications have been developed to solve this problem, bioinformatics scientists and engineers are still pursuing faster, simpler, and more energy-efficient software.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published May 01, 2023
Journal iMeta
Authors Shifu Chen
DOI 10.1002/imt2.107
Citations 1,932
Source OpenAlex

More 🤖 Artificial Intelligence Research