TY - JOUR
T1 - The PhytoClust tool for metabolic gene clusters discovery in plant genomes
AU - Topfer, Nadine
AU - Fuchs, Lisa-Maria
AU - Aharoni, Asaph
N1 - Deans of Life Sciences Postdoctoral fellowship; Alternative Energy Research Initiative (AERI) of the Weizmann Institute; Alternative Energy Research Initiative (AERI), Weizmann Institute of Science
PY - 2017/7/7
Y1 - 2017/7/7
N2 - The existence of Metabolic Gene Clusters (MGCs) in plant genomes has recently raised increased interest. Thus far, MGCs were commonly identified for pathways of specialized metabolism, mostly those associated with terpene type products. For efficient identification of novel MGCs, computational approaches are essential. Here, we present Phyto- Clust; a tool for the detection of candidate MGCs in plant genomes. The algorithm employs a collection of enzyme families related to plant specialized metabolism, translated into hidden Markov models, to mine given genome sequences for physically colocalized metabolic enzymes. Our tool accurately identifies previously characterized plant MGCs. An exhaustive search of 31 plant genomes detected 1232 and 5531 putative gene cluster types and candidates, respectively. Clustering analysis of putative MGCs types by species reflected plant taxonomy. Furthermore, enrichment analysis revealed taxa- And species-specific enrichment of certain enzyme families in MGCs. When operating through our webinterface, PhytoClust users can mine a genome either based on a list of known cluster types or by defining new cluster rules. Moreover, for selected plant species, the output can be complemented by coexpression analysis. Altogether, we envisage Phyto-Clust to enhance novel MGCs discovery which will in turn impact the exploration of plant metabolism.
AB - The existence of Metabolic Gene Clusters (MGCs) in plant genomes has recently raised increased interest. Thus far, MGCs were commonly identified for pathways of specialized metabolism, mostly those associated with terpene type products. For efficient identification of novel MGCs, computational approaches are essential. Here, we present Phyto- Clust; a tool for the detection of candidate MGCs in plant genomes. The algorithm employs a collection of enzyme families related to plant specialized metabolism, translated into hidden Markov models, to mine given genome sequences for physically colocalized metabolic enzymes. Our tool accurately identifies previously characterized plant MGCs. An exhaustive search of 31 plant genomes detected 1232 and 5531 putative gene cluster types and candidates, respectively. Clustering analysis of putative MGCs types by species reflected plant taxonomy. Furthermore, enrichment analysis revealed taxa- And species-specific enrichment of certain enzyme families in MGCs. When operating through our webinterface, PhytoClust users can mine a genome either based on a list of known cluster types or by defining new cluster rules. Moreover, for selected plant species, the output can be complemented by coexpression analysis. Altogether, we envisage Phyto-Clust to enhance novel MGCs discovery which will in turn impact the exploration of plant metabolism.
UR - http://www.scopus.com/inward/record.url?scp=85026436241&partnerID=8YFLogxK
U2 - 10.1093/nar/gkx404
DO - 10.1093/nar/gkx404
M3 - مقالة
SN - 0305-1048
VL - 45
SP - 7049
EP - 7063
JO - Nucleic acids research
JF - Nucleic acids research
IS - 12
ER -