Fast string parsing and its application in information and similarity measurement

Yang, Jia, 1975-

Fast string parsing and its application in information and similarity measurement

Yang, Jia, 1975-

Identifier: http://hdl.handle.net/2292/52334

Issue Date: 2005

Degree Name: PhD

Degree Grantor: The University of Auckland

Rights: Copyright: The author

Rights (URI): https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm

Abstract:

T-decomposition was first introduced by Mark Titchener in 1993. It is a string parsing algorithm that has been investigated in the fields of coding and information measures. This thesis shows that the information-measuring capability of T-decomposition compares well with that of the well-accepted Lempel-Ziv parsing algorithms. This thesis also presents a T-decomposition algorithm with O(nlogn) time complexity as one of its core results. This now permits T-decomposition-based information measurements with the same time complexity as the fastest of the Lempel-Ziv parsing algorithms with comparable accuracy. The improved algorithm is applied to similarity measurements on both synthetic data and real-world data (character recognition) with promising results.

Description:

Full text is available to authenticated members of The University of Auckland only.

Show full item record

Files in this item

Name: yang-2005-whole.pdf

Size: 7.891Mb

Format: PDF

This item appears in the following Collection(s)

Doctoral Theses - Authenticated Access [1680]

Fast string parsing and its application in information and similarity measurement

Fast string parsing and its application in information and similarity measurement

Abstract:

Description:

Files in this item

This item appears in the following Collection(s)

Search ResearchSpace

Browse

All of ResearchSpace

This Collection

Statistics

Fast string parsing and its application in information and similarity measurement

Fast string parsing and its application in information and similarity measurement

Abstract:

Description:

Files in this item

This item appears in the following Collection(s)

Share

Search ResearchSpace

Browse

All of ResearchSpace

This Collection

Statistics