Texas A&M University · College Station

Nagendra
Pratap Singh

Ph.D. Student in Plant Genetics & Bioinformatics — developing adaptive ML pipelines for genomic prediction, constructing pangenomes, and characterizing organellar genomes in crop species.

180 Citations
6 h-index
10+ Publications
8+ In Review / Prep
📍 Current Position
Graduate Research Assistant
Dept. of Horticultural Sciences, TAMU
🧬 Active Project
GenoBridge v1.0
Adaptive ML pipeline for genomic prediction
🌿 Organism Focus
Grapefruit · Wheat · Chickpea
Citrus × paradisi · Triticum · Cicer arietinum
🏛️ Affiliations
TAMU · TAMU-K · USDA-ARS · CIMMYT
College Station, Kingsville, International
01

About Me

I am a Ph.D. student in Plant Genetics & Bioinformatics at Texas A&M University, College Station, working at the intersection of computational genomics, machine learning, and plant biology. My research spans chloroplast and mitochondrial genomics, transcriptomics, pangenomics, and the development of scalable ML pipelines for crop improvement.

My doctoral work centers on two parallel tracks: Wheat Pangenome and evolution study — Assembling and analyzing pangenomes of Elite North American winter, spring, and durum wheat varieties using whole-genome resequencing. Uncovering selective sweeps, genomic diversity, and functional gene expansion across North American elite wheat germplasm — and GenoBridge, an ensemble ML framework for sample-size-adaptive genomic prediction integrated with GWAS-guided candidate gene discovery.

Before joining TAMU, I spent four years as a CSIR-NET Research Fellow at NIPGR, New Delhi, where I contributed to genomic and epigenomic studies in chickpea, ricebean, and rice, and administered the institute's HPC infrastructure. I also held a Research Assistant position at Montana State University, where I assembled wheat pangenomes and investigated cross-kingdom sRNA regulation.

I am a CIMMYT Academy affiliate (2024–2026) and hold competitive scholarships from TAMU including the Horticultural Sciences Graduate Scholarship and the Steidinger Citrus Scholarship.

🎓
Education Ph.D. Plant Genetics, TAMU (2025–Present)
M.S. Plant & Soil Sci., TAMU-K (2024–2025)
📍
Location College Station, TX, USA
📧
Email nagendra.singh@agnet.tamu.edu
🏆
CSIR-NET All-India Rank 62 (June 2018)
🌐
Languages English · Hindi
02

Research Areas

🍊
Citrus Organellar Genomics

Characterizing complete chloroplast and mitochondrial genomes of red-fleshed grapefruit cultivars (Rio Red, Ruby Red, Texas Red TR-1) via PacBio HiFi WGS. Investigating IR boundary polymorphisms and plastid-to-mitochondria gene transfer relevant to HLB disease.

Chloroplast Genomics PacBio HiFi Citrus × paradisi
🤖
ML-Based Genomic Prediction

Developing GenoBridge — a sample-size-adaptive ensemble ML framework integrating Ridge regression, Random Forest, XGBoost, MLP, and stacking ensembles for genotype-to-phenotype prediction. Introduces CV-R² diagnostics and ML-gated GWAS for improved statistical power allocation.

GenoBridge Ensemble ML GWAS XGBoost
🌾
Wheat Pangenomics

Assembling and analyzing pangenomes of Montana winter, spring, and durum wheat varieties using whole-genome resequencing. Uncovering selective sweeps, genomic diversity, and functional gene expansion across North American elite wheat germplasm.

Pangenomics WGS Triticum
🔗
Cross-Kingdom sRNA Regulation

Investigating microRNA-mediated cross-kingdom gene regulation between wheat and wheat stem sawfly (Cephus cinctus). Identifying host-derived sRNAs that may silence insect target genes as a novel mechanism of crop resistance.

miRNA RNA-seq Insect Resistance
🫘
Legume Transcriptomics & Epigenomics

Contributing to genome sequencing, DNA methylation landscape characterization, and transcriptome-wide association mapping in chickpea (Cicer arietinum) and ricebean (Vigna umbellata) at NIPGR, New Delhi.

Epigenomics TWAS Chickpea
🥜
Peanut Disease Resistance

RNA-seq transcriptomics of peanut seed-coat resistance to Aspergillus flavus. Identified 76 significant phenylpropanoid DEGs; highlighted UGT73C6 (+11.7 log₂FC) and a lignin-building cluster (BBE-like 13, Laccase-15, Dirigent-22) as primary resistance determinants.

RNA-seq Aspergillus Phenylpropanoid
🍄
Fungal Genomics & Pathogenicity

Evolution of pathogenicity-associated genes in Rhizoctonia solani AG1-IA by genome duplication and transposon-mediated gene function alterations.

AG1-IA strains Rhizoctonia solani Transposable element Fungal pathogen Whole-genome duplication Sequential Segmental Duplications
03

Publications

2020 Plant Biotechnology Journal Published
Root-specific expression of chickpea cytokinin oxidase/dehydrogenase 6 leads to enhanced root growth, drought tolerance, and yield without compromising nodulation
Khandal H, Gupta SK, Dwivedi V, …, Singh NP, …, Chattopadhyay D.
🔢 90 citations
2022 Plants Published
Mutation in the endo-β-1,4-glucanase (KORRIGAN) is responsible for the thick leaf phenotype in sorghum
Mendu L, Jalathge G, Dhillon KK, Singh NP, …, Mendu V.
🔢 5 citations
2023 Crop Science Published
A 'solid' solution for wheat stem sawfly (Hymenoptera: Cephidae) resistance: Genetics, breeding, and development of solid stem wheat
Bathini A, Mendu L, Singh NP, Cook J, Weaver D, Sherman J, Hager M, Mondal S, Mendu V.
🔢 8 citations
2023 BMC Biology Published
Evolution of pathogenicity-associated genes in Rhizoctonia solani AG1-IA by genome duplication and transposon-mediated gene function alterations
Francis A, Ghosh S, Tyagi K, Prakasam V, Rani M, Singh NP, …, Jha G.
🔢 22 citations
2023 Plant Biotechnology Journal Published
The ricebean genome provides insight into Vigna genome evolution and facilitates genetic enhancement
Francis A, Singh NP, Singh M, Sharma P, …, Chattopadhyay D.
🔢 12 citations
2023 Planta Published
Differential expression of genes during recovery of Nicotiana tabacum from tomato leaf curl Gujarat virus infection
Namgial T, Singh AK, Singh NP, Francis A, Chattopadhyay D, Voloudakis A, Chakraborty S.
🔢 6 citations
2024 DNA Research Published
Uncovering DNA methylation landscapes to decipher evolutionary footprints of phenotypic diversity in chickpea
Daware A, Mohanty JK, Narnoliya L, …, Singh NP, …, Parida SK.
🔢 4 citations
2024 Journal of Experimental Botany Published
The chickpea WIP2 gene underlying a major QTL contributes to lateral root development
Dwivedi V, Pal L, Singh S, Singh NP, Parida SK, Chattopadhyay D.
🔢 9 citations
2024 BMC Plant Biology Published
Transcriptome-wide association mapping provides insights into the genetic basis and candidate genes governing flowering, maturity, and seed weight in rice bean (Vigna umbellata)
Sahu TK, Verma SK, Gayacharan, Singh NP, …, Singh AK.
🔢 5 citations
2025 Plant Pangenomes and Pangenomics, Elsevier Book Chapter
Technological development for analyzing and constructing plant pangenomes
Singh NP, Mendu L, Thudi M, Mendu V.
2022 bioRxiv Preprint
The extracellular RNA pool within Zea mays apoplast: composition and differential expression during Ustilago maydis infection
Mukherjee D#, Singh NP#, Roy A, Mondal R, Acharya U, Chattopadhyay D, Ghosh A. (#co-first authors)
2025 BMC Plant Biology Under Review
Characterization of complete chloroplast genomes of three red-fleshed grapefruit (Citrus × paradisi Macfad.) cultivars
Singh NP, Mendu V, Setamou M, Kunta MB.
2025 Plant Methods Under Review
GenoBridge: A sample-size-adaptive ensemble machine learning pipeline for genotype-to-phenotype prediction using integrated GWAS-ML hybrid validation
Singh NP, Mendu L, Mendu V.
2025 In Review Under Review
Pangenome and resequencing analysis of elite North American durum wheat uncovers genomic diversity, selective sweeps, and functional gene expansion
Singh NP, Mendu L, Thudi M, Mendu V.
In Prep In Preparation
Haplotype-resolved chromosome-level assembly of 'Rio Red' grapefruit reveals candidate loci for lycopene accumulation and key citrus gene families
Singh NP.
In Prep In Preparation
Plastid-to-mitochondria gene transfer and repeat-mediated mitochondrial genome reconfiguration in grapefruit: Organellar genomic resources for HLB disease research
Singh NP, Mendu V, Setamou M, Kunta MB.
04

Software & Tools

🧬
GenoBridge
v1.0

Sample-size-adaptive ensemble ML pipeline for genotype-to-phenotype prediction using SNP data from VCF files. Integrates Ridge regression, Random Forest, XGBoost, MLP, and stacking. Supports 30–100,000+ samples across any plant species. Distributed as standalone binaries — no Python required.

Platform Linux · macOS · CentOS
Input VCF + CSV
License MIT
🔍
GenoBridge_GWAS_ML
v1.0

Hybrid GWAS + ML pipeline for candidate gene discovery with population structure correction. Takes ML prediction results from GenoBridge as input, applies ML-gated GWAS to focus statistical power on genomically predictable traits, and annotates candidate genes from GFF3.

Input VCF + GFF3
Correction PC-based
License MIT
🌐
NIPGR Web Portals

Developed and deployed three institutional bioinformatics portals during tenure at NIPGR: RiceBean Portal (ricebeanportal.com), Chickpea Genome Browser (http://223.31.159.7/chickpea/), and Cicer Genome Resource (http://223.31.159.7/cicer/). Built with PHP, HTML/CSS, JavaScript, and MySQL.

Stack PHP · MySQL · JS
Org NIPGR, India
05

Research Experience

Aug 2025 – Present
Graduate Research Assistant
Dept. of Horticultural Sciences, Texas A&M University, College Station · Advisor: Dr. Amit Dhingra/Dr. Venugopal Mendu
  • Developing GenoBridge ensemble ML pipeline for sample-size-adaptive genomic prediction with GWAS integration.
  • Characterizing complete chloroplast genomes of three grapefruit cultivars via PacBio HiFi WGS.
  • Constructing haplotype-resolved 'Rio Red' grapefruit assembly for lycopene locus discovery.
Aug 2024 – Aug 2025
Graduate Research Assistant
Texas A&M University–Kingsville · Advisor: Dr. Venugopal Mendu
  • Identified 76 phenylpropanoid DEGs in peanut–Aspergillus flavus resistance.
  • Integrative Transcriptome and Small RNA Sequencing​ Reveal miRNA-Mediated Regulatory Mechanisms in the Wheat Stem Sawfly.
  • Drafted NSF PGRP GenoBridge-2 proposal with four specific aims and multi-institutional collaboration.
2024 – 2026
Visiting Graduate Student / Research Affiliate
CIMMYT Academy, Texcoco, Mexico
  • Independent training and collaborative research in international plant sciences and genomics.
Aug 2022 – Aug 2024
Research Assistant
Dept. of Plant Sciences & Plant Pathology, Montana State University, Bozeman
  • Assembled pangenomes of Montana winter, spring, and durum wheat; characterized selective sweeps and gene expansion.
  • Presented cross-kingdom miRNA regulation research at PAG San Diego (2024) and Montana Wheat & Barley Committee (2024).
  • Co-authored Crop Science paper on solid-stem wheat resistance (Bathini et al., 2023).
2018 – 2022
Junior / Senior Research Fellow (CSIR-NET)
National Institute of Plant Genome Research (NIPGR), New Delhi, India
  • Contributed to 6 peer-reviewed publications (130+ combined citations) across chickpea, ricebean, rice, and maize genomics.
  • Administered 7-node HPC cluster (~6.5 TB RAM) and ~397 TB NAS/SAN storage for institute-wide sequencing.
  • Developed and deployed three public bioinformatics web portals.
06

Technical Skills

🧬 Genome Assembly
FlyeCANUFalcon VerkkoMaSuRCAMEGAHIT metaSPAdesRavenShasta
📝 Annotation
AUGUSTUSBRAKER2MAKER-P PASAEvidenceModelerRepeatMasker InterProScaneggNOG-mapper
📊 Transcriptomics
STARHISAT2DESeq2 edgeRSalmonKallisto SeuratScanpy
🔬 Variant Calling
GATKDeepVariantFreeBayes MantaSnifflesSnpEff VEPANNOVAR
🤖 ML / GWAS
XGBoostRandom ForestRidge MLPPLINKTASSEL GAPITscikit-learn
🌲 Phylogenetics
IQ-TREERAxMLMrBayes MEGAMAFFTMUSCLE OrthoFinderMCScanX
💻 Languages
PythonRBash PerlRustPHP JavaScriptMySQL
⚙️ Workflows & Cloud
NextflowSnakemakeAWS Batch DNAnexusGalaxyHPC/SLURM
07

Get in Touch

Open to collaborations in plant genomics, ML pipelines & crop improvement.

I am always interested in discussing research collaborations, especially around genomic prediction frameworks, organellar genomics, pangenomics, and ML applications in plant breeding. Feel free to reach out via email or connect on GitHub and Google Scholar.