Development and Optimization of Bioinformatic Workflows for Forensic Applications

Show simple item record

dc.contributor.advisor Harbison, S en
dc.contributor.author Liu, Yao-Yuan en
dc.date.accessioned 2016-02-23T01:59:54Z en
dc.date.issued 2016 en
dc.identifier.citation 2016 en
dc.identifier.uri http://hdl.handle.net/2292/28280 en
dc.description Full text is available to authenticated members of The University of Auckland only. en
dc.description.abstract Forensic scientists have been using short tandem repeats (STRs) for more than twenty years to analyse DNA. This has been based on the use of electrophoresis, most recently capillary electrophoresis, to find out the lengths of the repeat sequences detected for the forensic STR markers tested. Forensic scientists are now carrying out research into the use of massively parallel sequencing (MPS) technology for forensic testing. This research has and is generating data about the sequence of each STR marker and also additional SNP markers for physical characteristic determination, identity and ancestry and whole mitochondrial genome analysis. All of these advances offer forensic scientists the ability to provide more information about forensic samples in the future. Before this can happen, research needs to be carried out to develop suitable bioinformatics pipelines and to analyse and interpret the sequencing data in order to fully understand and interpret the data. In this study, a bioinformatics pipeline was assembled for the analysis of STR data generated on massively parallel sequencers. The pipeline is composed of tools that were readily available and new tools developed for this study. Verification of the pipeline involved the comparison of the STR profile generated by the pipeline with the results from capillary electrophoresis. The pipeline was used to explore if STR data generated on the Illumina MiSeq, and Ion Torrent Personal Genome Machine (Ion PGM) sequencers produce different results. The results showed that DNA profiles generated from data sequenced on the Ion PGM contained much more incorrectly called loci than that from the MiSeq due to high coverage sequencing errors. These errors result in the production of indel artefact sequences, which confound allele calling. Furthermore, specific locations of the STR were observed to be more prone to sequencing errors. en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof Masters Thesis - University of Auckland en
dc.relation.isreferencedby 99264869812502091 en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights Restricted Item. Available to authenticated members of The University of Auckland. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ en
dc.title Development and Optimization of Bioinformatic Workflows for Forensic Applications en
dc.type Thesis en
thesis.degree.discipline Forensic Science en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Masters en
dc.rights.holder Copyright: The Author en
pubs.author-url http://hdl.handle.net/2292/28280 en
pubs.elements-id 523303 en
pubs.record-created-at-source-date 2016-02-23 en
dc.identifier.wikidata Q112925855


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics