A biclique approach to reference anchored gene blocks and its applications to pathogenicity Islands

Arnon Benshahar, Vered Chalifa-Caspi, Danny Hermelin, Michal Ziv-Ukelson

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

Abstract

We formalize a new problem variant in gene-block discovery, denoted Reference-Anchored Gene Blocks (RAGB). The input consists of a query sequence Q of length n, representing the gene array of a DNA element; a window-size bound d on the length of a substring of interest in Q; and a set of target gene sequences T = {T1, …, Tc}. The objective is to identify gene blocks in T that are centered in a subset q of co-localized genes from Q, and contain genomes from T in which the corresponding orthologs of the genes from q are also co-localized. We cast RAGB as a variant of a (colored) biclique problem in bipartite graphs, and analyze its parameterized complexity, as well as the parameterized complexity of other related problems. We give an O(nm + 2^d nm / lg m) time algorithm for the uncolored variant of our biclique problem, where m is the number of areas of interest that are parsed from the target sequences, and n and d are as defined above. Our algorithm can be adapted to compute all maximal bicliques in the graph within the same time complexity, and to handle edge weights with a slight O(lg d) increase to its time complexity. For the colored version of the problem, our algorithm has a time complexity of O(2^d nm). We implement the algorithm and exemplify its application to LEE, a well-known pathogenicity island from the E. coli genome harboring virulence genes. Our code and supplementary materials, including omitted proofs and figures, are available at https://www.cs.bgu.ac.il/~negevcb/RAGB/.
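To make the uncolored biclique formulation concrete, the following is a minimal brute-force sketch in Python, not the algorithm from the paper. It assumes that each "area of interest" parsed from the target sequences is given simply as a set of gene identifiers, and it ignores colors (genomes), edge weights, and maximality. The function name `window_bicliques` and the parameters `min_genes` and `min_support` are hypothetical names introduced only for this illustration.

```python
from itertools import combinations


def window_bicliques(query, intervals, d, min_genes=2, min_support=2):
    """Enumerate gene subsets q drawn from windows of at most d consecutive
    genes of `query`, and keep those contained in at least `min_support`
    target intervals. Each kept pair (q, supporting intervals) is a biclique
    in the bipartite gene/interval containment graph.

    Brute force for illustration only: every window is expanded into all of
    its subsets, and each subset is checked against every interval.
    """
    found = {}
    for start in range(len(query)):
        # Distinct genes in the window starting at this position.
        window = sorted(set(query[start:start + d]))
        for k in range(min_genes, len(window) + 1):
            for q in combinations(window, k):
                if q in found:
                    continue  # already reported from an earlier window
                support = [j for j, genes in enumerate(intervals)
                           if set(q) <= genes]
                if len(support) >= min_support:
                    found[q] = support
    return found


# Toy input (made up for this sketch): a query of four genes and four
# target intervals, each represented as a set of gene identifiers.
query = ["a", "b", "c", "d"]
intervals = [{"a", "b", "x"}, {"a", "b", "c"}, {"c", "d"}, {"a", "b", "c"}]
result = window_bicliques(query, intervals, d=3)
```

In this toy input, the subset `("a", "b", "c")` is supported by the two intervals that contain all three genes, while `("c", "d")` is dropped because only one interval contains both. The exhaustive subset expansion is where a 2^d factor arises in such a naive approach.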

Original language: American English
Title of host publication: Algorithms in Bioinformatics - 16th International Workshop, WABI 2016, Proceedings
Editors: Martin Frith, Christian Nørgaard Storm Pedersen
Publisher: Springer Verlag
Pages: 14-26
Number of pages: 13
ISBN (Print): 9783319436807
State: Published - 1 Jan 2016
Event: 16th International Workshop on Algorithms in Bioinformatics, WABI 2016 - Aarhus, Denmark
Duration: 22 Aug 2016 - 24 Aug 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9838 LNCS

Conference

Conference: 16th International Workshop on Algorithms in Bioinformatics, WABI 2016
Country/Territory: Denmark
City: Aarhus
Period: 22/08/16 - 24/08/16

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science
