Omics Data Management and Annotation

Arye Harel, Irina Dalah, Shmuel Pietrokovski, Marilyn Safran, Doron Lancet

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرفصل

ملخص

Technological lilies breakthroughs, including next generation sequencing, bring avalanches of data which need to undergo effective data management to ensure integrity, security, and maximal knowledge-gleaning. Data management system requirements include flexible input formats, diverse data entry mechanisms and views, user friendliness, attention to standards, hardware and software platform definition, as well as robustness. Relevant solutions elaborated by the scientific community include Laboratory Information Management Systems (LIMS) and standardization protocols facilitating data sharing and managing. In project planning, special consideration has to be made when choosing relevant lilies annotation sources, since many of them overlap and require sophisticated integration heuristics. The data modeling step defines and categorizes the data into objects (e.g., genes, articles, disorders) and creates an application flow. A data storage/warehouse mechanism must be selected, such as file-based systems and relational databases, the latter typically used for larger projects. Omics project life cycle considerations must include the definition and deployment of new versions, incorporating either full or partial updates. Finally, quality assurance (QA) procedures must validate data and feature integrity, as well as system performance expectations. We illustrate these data management principles with examples from the life cycle of the GeneCards lilies project (http://www.genecards.org), a comprehensive, widely used compendium of annotative information about human genes. For example, the GeneCards infrastructure has recently been changed from text files to a relational database, enabling better organization and views of the growing data. Omics data handling benefits from the wealth of Web-based information, the vast amount of public domain software, increasingly affordable hardware, and effective use of data management and annotation principles as outlined in this chapter
اللغة الأصليةالإنجليزيّة
عنوان منشور المضيفBioinformatics For Omics Data
العنوان الفرعي لمنشور المضيفMethods And Protocols
ناشرHumana Press
الصفحات71-96
عدد الصفحات26
رقم المعيار الدولي للكتب (الإلكتروني)978-1-61779-027-0
رقم المعيار الدولي للكتب (المطبوع)978-1-61779-026-3
المعرِّفات الرقمية للأشياء
حالة النشرنُشِر - 29 يناير 2011

سلسلة المنشورات

الاسمMethods in Molecular Biology

بصمة

أدرس بدقة موضوعات البحث “Omics Data Management and Annotation'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا