Omics Data Management and Annotation

Arye Harel, Irina Dalah, Shmuel Pietrokovski, Marilyn Safran, Doron Lancet

פרסום מחקרי: פרק בספר / בדוח / בכנספרק


Technological lilies breakthroughs, including next generation sequencing, bring avalanches of data which need to undergo effective data management to ensure integrity, security, and maximal knowledge-gleaning. Data management system requirements include flexible input formats, diverse data entry mechanisms and views, user friendliness, attention to standards, hardware and software platform definition, as well as robustness. Relevant solutions elaborated by the scientific community include Laboratory Information Management Systems (LIMS) and standardization protocols facilitating data sharing and managing. In project planning, special consideration has to be made when choosing relevant lilies annotation sources, since many of them overlap and require sophisticated integration heuristics. The data modeling step defines and categorizes the data into objects (e.g., genes, articles, disorders) and creates an application flow. A data storage/warehouse mechanism must be selected, such as file-based systems and relational databases, the latter typically used for larger projects. Omics project life cycle considerations must include the definition and deployment of new versions, incorporating either full or partial updates. Finally, quality assurance (QA) procedures must validate data and feature integrity, as well as system performance expectations. We illustrate these data management principles with examples from the life cycle of the GeneCards lilies project (, a comprehensive, widely used compendium of annotative information about human genes. For example, the GeneCards infrastructure has recently been changed from text files to a relational database, enabling better organization and views of the growing data. Omics data handling benefits from the wealth of Web-based information, the vast amount of public domain software, increasingly affordable hardware, and effective use of data management and annotation principles as outlined in this chapter
שפה מקוריתאנגלית
כותר פרסום המארחBioinformatics For Omics Data
כותר משנה של פרסום המארחMethods And Protocols
מוציא לאורHumana Press
מספר עמודים26
מסת"ב (אלקטרוני)978-1-61779-027-0
מסת"ב (מודפס)978-1-61779-026-3
מזהי עצם דיגיטלי (DOIs)
סטטוס פרסוםפורסם - 29 ינו׳ 2011

סדרות פרסומים

שםMethods in Molecular Biology

טביעת אצבע

להלן מוצגים תחומי המחקר של הפרסום 'Omics Data Management and Annotation'. יחד הם יוצרים טביעת אצבע ייחודית.

פורמט ציטוט ביבליוגרפי