PhD in Bioinformatics, 2014
University of Camerino Italy
MSc. in Bioinformatics, 2008
Nottingham Trent University UK
BSc in Biotechnology, 2006
University of Pune India
Plants have evolved complex bouquets of specialized natural products that are utilized in medicine, agriculture, and industry. Untargeted natural product discovery has benefitted from growing plant omics data resources. Yet, plant genome complexity limits the identification and curation of biosynthetic pathways via single omics. Pairing multi-omics types within experiments provides multiple layers of evidence for biosynthetic pathway mining. The extraction of paired biological information facilitates connecting genes to transcripts and metabolites, especially when captured across time points, conditions and chemotypes. Experimental design requires specific adaptations to enable effective paired-omics analysis. Ultimately, metadata standards are required to support the integration of paired and unpaired public datasets and to accelerate collaborative efforts for natural product discovery in the plant research community.
With the emergence of large amounts of omics data, computational approaches for the identification of plant natural product biosynthetic pathways and their genetic regulation have become increasingly important. While genomes provide clues regarding functional associations between genes based on gene clustering, metabolome mining provides a foundational technology to chart natural product structural diversity in plants, and transcriptomics has been successfully used to identify new members of their biosynthetic pathways based on coexpression. Thus far, most approaches utilizing transcriptomics and metabolomics have been targeted towards specific pathways and use one type of omics data at a time. Recent technological advances now provide new opportunities for integration of multiple omics types and untargeted pathway discovery. Here, we review advances in plant biosynthetic pathway discovery using genomics, transcriptomics, and metabolomics, as well as recent efforts towards omics integration. We highlight how transcriptomics and metabolomics provide complementary information to link genes to metabolites, by associating temporal and spatial gene expression levels with metabolite abundance levels across samples, and by matching mass-spectral features to enzyme families. Furthermore, we suggest that elucidation of gene regulatory networks using time-series data may prove useful for efforts to unwire the complexities of biosynthetic pathway components based on regulatory interactions and events.