Comparison of Vocal Tract Shape Modelling Methods: MRI vs AR

Show simple item record

dc.contributor.advisor Watson, C en
dc.contributor.author Fan, Hsiao-Tien en
dc.date.accessioned 2013-03-08T02:53:58Z en
dc.date.issued 2013 en
dc.identifier.uri http://hdl.handle.net/2292/20186 en
dc.description Full text is available to authenticated members of The University of Auckland only. en
dc.description.abstract Investigation was carried out on the vocal tract structural data obtained with the magnetic resonance imaging and the acoustic reflectometery techniques during the vocalization of vowels. This was carried out as a determination of the merits of the data acquisition techniques in question. This investigation included data for both techniques from 5 speakers. Vocal tract structural data was captured by the magnetic resonance method as a series of sagittal images which is converted into cross-sectional area functions through manipulation of the images in the open source modelling tool of CMGUI [http://www.cmiss.org/cmgui]. The data collected by the acoustic reflectometer is in the form of cross-sectional areas along the vocal tract. Voice recordings for each target vowel were also collected. The cross-sectional area functions obtained by the two methods are processed, and with the application of the lossless tube model and the linear predictive coding method, the spectrum and resonances of a give vocal tract structure was obtained. The length and resonance values of each vowel was recorded and analysed. Vocal tract shapes obtained from the two methods were compared and discussed, and it was found that both methods were successful in capturing the expected vocal tract geometry of the target vowels, though a few compromises to the shapes from the acoustic reflectometery method was observed due to the compromising nature of the measurement mouthpiece. The resonances deduced from the vocal tract shapes were compared to the formants extracted from recorded speech. It was found that the magnetic resonance method yielded a more accurate estimate of the formant values (9 out of 11 monophthong with reasonable estimations) while the acoustic reflectometery method was much less accurate (3 out of 9 monophthongs with reasonable estimation) It was concluded that while the acoustic reflectometery method was able to capture the general shape of the vocal tract, only a number of vowels were able to be modelled accurately enough for the calculated resonances to be comparable to the formants of real speech. However, future research may allow for methods with counter acts the compromising effects of the mouth piece, and yield more promising results. en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof Masters Thesis - University of Auckland en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. en
dc.rights Restricted Item. Available to authenticated members of The University of Auckland. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/nz/ en
dc.title Comparison of Vocal Tract Shape Modelling Methods: MRI vs AR en
dc.type Thesis en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Masters en
dc.rights.holder Copyright: The Author en
pubs.elements-id 374238 en
pubs.record-created-at-source-date 2013-03-08 en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics