PATHOEXTRACT: A BIOINFORMATIC PIPELINE FOR QUALITY CONTROL AND HOST DNA REMOVAL IN PLASMODIUM FALCIPARUM NGS DATA

- Laboratory of Mechanics and Computer Science (LAMI), Felix Houphouet-Boigny University, Abidjan, Cote dIvoire.
- Parasitology and Mycology Unit, Pasteur Institute (IPCI), Abidjan, Cote dIvoire.
- Genomics and Metagenomics Platform, Pasteur Institute (IPCI), Abidjan, Cote dIvoire.
- Laboratory of Environmental Science and Technology (LSTE), Jean LorougnonGuede University (UJLoG), Daloa, Cote dIvoire.
- Abstract
- Keywords
- Cite This Article as
- Corresponding Author
Malaria, caused by Plasmodium falciparum, is a significant global health burden, particularly in sub-Saharan Africa. Deep sequencing (NGS) of parasite genomes has revolutionized our understanding of its biology and the emergence of drug resistance. However, the presence of host human DNA and other microbial contaminants within patient samples can hinder accurate and efficient parasite genome analysis. To address this challenge, we have developed PathoExtract, a robust bioinformatics pipeline that integrates commonly used tools into a streamlined workflow. PathoExtract leverages Snakemake, a workflow management system, to provide a flexible and reproducible framework for data processing. The pipeline incorporates rigorous quality control steps to identify and remove low-quality reads and contaminants. Host DNA and microbial sequences are effectively filtered out using a combination of alignment-based and alignment-free methods, ensuring that only Plasmodium falciparum reads are retained for downstream analysis.The pipeline offers an intuitive graphical user interface, making it accessible to researchers with varying levels of bioinformatics expertise. This user-friendly interface simplifies the process of running the pipeline, even for those unfamiliar with command-line tools. The code and documentation for PathoExtract are freely available at: https://github.com/stanlasso/DREPAL-PATHOEXTRACT.
[Stanislas Egomli Assohoun, Aristide Berenger Ako, Patrice Nguessan Akoguhi, Paul Christian Abouchou Ako, Medard Brou Kouassi, Jerome Adou Kablan and Ronan Jambou (2024); PATHOEXTRACT: A BIOINFORMATIC PIPELINE FOR QUALITY CONTROL AND HOST DNA REMOVAL IN PLASMODIUM FALCIPARUM NGS DATA Int. J. of Adv. Res. (Sep). 1150-1161] (ISSN 2320-5407). www.journalijar.com
Laboratory of Mechanics and Computer Science (LAMI), Félix Houphouët-Boigny University, Abidjan, Côte d'Ivoire.
Cote d