Leicester Research Archive

Leicester Research Archive >
College of Medicine, Biological Sciences and Psychology >
Genetics, Department of >
Theses, Dept. of Genetics >

Please use this identifier to cite or link to this item: http://hdl.handle.net/2381/8951

Title: Database federation, resource interoperability and digital identity, for management and exploitation of contemporary biological data
Authors: Thorisson, Gudmundur A.
Supervisors: Brookes, Anthony J.
Award date: 1-Jan-2011
Presented at: University of Leicester
Abstract: Modern research into the genetic basis of human health and disease is increasingly dominated by high-throughput experimentation and routine generation of large volumes of complex genotype to phenotype (G2P) information. Efforts to effectively manage, integrate, analyse and interpret this wealth of data face substantial challenges. This thesis discusses informatics approaches to addressing some of these challenges, primarily in the context of disease genetics. The genome-wide association study (GWAS) is widely used in the field, but translation of findings into scientific knowledge is hampered by heterogeneous and incomplete reporting, restrictions on sharing of primary data, publication bias and other factors. The central focus of the work was design and implementation of a core informatics infrastructure for centralised gathering and presentation of GWAS results. The resulting open-access HGVbaseG2P genetic association database and web-based tools for search, retrieval and graphical genome viewing increase overall usefulness of published GWAS findings. HGVbaseG2P conceptual modelling activities were also merged into a collaborative standardisation effort with international partners. A key outcome of this joint work is a minimal model for phenotype data which, together with ontologies and other standards, lays the foundation for a federated network of semantically and syntactically interoperable, distributed G2P databases. Attempts to gather complete aggregate representations of primary GWAS data into HGVbaseG2P were largely unsuccessful, chiefly due to concerns over re-identification of study participants. This led to a separate line of inquiry which explored - via in-depth field analysis, workshop organisation and other community outreach activities – potential applications of federated identity technologies for unambiguously identifying researchers online. Results suggest two broad use cases for user-centric researcher identities - i) practical, streamlined data access management and ii) tracking digital contributions for the purpose of attribution - which are critical to facilitating and incentivising sharing of GWAS (and other) research data.
Links: http://hdl.handle.net/2381/8951
Type: Thesis
Level: Doctoral
Qualification: PhD
Rights: This work is licensed to the public under the Creative Commons Attribution Non-Commercial 3.0 Unported License. http://creativecommons.org/licenses/by-nc/3.0/
Appears in Collections:Theses, Dept. of Genetics
Leicester Theses

Files in This Item:

File Description SizeFormat
Thorisson PhD Thesis 2010 FINAL.pdfThesis6.65 MBAdobe PDFView/Open
Thorisson PhD Thesis 2010 supplementary materials.zipSupplementary material4.18 MBZip ArchiveView/Open
View Statistics

Items in LRA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

MAINTAINER