On the transformation of accent

dc.contributor.advisor Riddle, Patricia J. en
dc.contributor.advisor Watson, Catherine I. en
dc.contributor.author Teutenberg, Jonathan en
dc.date.accessioned 2020-07-08T05:00:59Z en
dc.date.available 2020-07-08T05:00:59Z en
dc.date.issued 2010 en
dc.identifier.uri http://hdl.handle.net/2292/52237 en
dc.description Full text is available to authenticated members of The University of Auckland only. en
dc.description.abstract Producing new synthetic voices for speech synthesis systems is expensive in time and requires expert knowledge. This has motivated research into voice conversion: automatically adapting a source speaker into a target speaker based on a small corpus of examples. In this thesis we consider a variant of voice conversion that focuses on the task of localising speech-enabled software: accent transformation. An accent transformation describes a single mapping that can be applied to speech from any voice in a source accent to produce similar speech in a target accent. We consider an approach to accent transformation suitable if it successfully changes the perceived accent of speech, maintains speech quality and naturalness, and requires minimal time and resources to develop. This thesis investigates potential approaches to the modification of pronunciation and intonation for accent transformation, and assesses their suitability. To avoid gross speech errors found to result from modifying formants by linear predictive pole rotation and by frequency warping, we describe an approach to pronunciation modification based on independent exponential functions. Modifications to the formants are determined from linguistic analyses of accents of English, and we detail empirically determined rule-based solutions to handling spectral slope, mapping phone labels to formant space, normalising vocal tract size, and modelling phone targets and transitions across time. We determine appropriate intonation modifications with an instance-based approach using limited data. We propose a representation of fundamental frequency contours with many suitable characteristics for instance-based corpora, based on the discrete cosine transform (DCT). We compare matching based on phrase features against matching DCT coefficients. We show results of a perceptual study indicating that speech intensity is an important feature in accent transformation. Methods for including intensity in intonation transformation are outlined. The overall approach, and the individual pronunciation and intonation transformations, are evaluated in a number of perceptual tests. These show that the transformations are able to make significant changes in perceived accent from British RP toward a variety of target accents. en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof PhD Thesis - University of Auckland en
dc.relation.isreferencedby UoA99225030014002091 en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. en
dc.rights Restricted Item. Full text is available to authenticated members of The University of Auckland only. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title On the transformation of accent en
dc.type Thesis en
thesis.degree.discipline Computer Science en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Doctoral en
thesis.degree.name PhD en
dc.rights.holder Copyright: The author en
dc.identifier.wikidata Q112884696


