Multi-strategic learning, reasoning and searching in the game of Go

Show simple item record

dc.contributor.advisor Guesgen, Hans Werner en
dc.contributor.advisor Baltes, Jacky en
dc.contributor.author Lee, Byung Doo en
dc.date.accessioned 2007-08-04T11:13:38Z en
dc.date.available 2007-08-04T11:13:38Z en
dc.date.issued 2004 en
dc.identifier THESIS 05-020 en
dc.identifier.citation Thesis (PhD--Computer Science)--University of Auckland, 2004 en
dc.identifier.uri http://hdl.handle.net/2292/1241 en
dc.description Full text is available to authenticated members of The University of Auckland only. en
dc.description.abstract Go is a fascinating research domain for exploring new approaches in Artificial Intelligence (AI). In spite of its very simple rules, computer Go is much more difficult to implement than computer chess, because Go is an extremely complex strategic board game and has a vast search spare. The traditional AI approach when faced with a board game problem is to search the space of all possible moves to find the best move sequence that leads to an advantageous position. Heuristic searching methods are often used to tackle the problem. As Go is a complete-knowledge, deterministic and strategic game with extreme complexity, these approaches are not feasible in computer Go. Computer Go must be tackled with Go knowledge-intensive approaches rather than just the heuristic searching methods that are applied to computer chess. A lot of Go knowledge has been accumulated over the past several centuries by human master players. Most of it is implicit in the form of expert games. The patterns recognised by human players are more than just stones and empty spaces. That is, players intuitively perceive the life-and-death status of surrounded groups, and select an effective next move in a certain Go board position with a priori, knowledge of Go concepts such as influence, the safety of groups, connectedness between groups etc. This visual nature in Go is easy for human perception but hard to model with computers. In this thesis, we classify Go knowledge into implicit Go knowledge and explicit Go knowledge. Because implicit Go knowledge is embedded in prototypical professional Go games as pattern knowledge, we use pattern recognition capability to infer implicit knowledge from professional Go games. Meanwhile, explicit Go knowledge is represented as Go propositions (including Go proverbs, Go maxims and Go terms), and thus we apply Go term knowledge to a rule-based fuzzy reasoning model to solve problems in the opening game. This thesis discusses and implements a learning model, a reasoning model and a heuristic searching model. The learning model is a neural network for Temporal Difference (TD) learning with pattern recognition capability. The reasoning model includes a neuro-fuzzy controller for fuzzy reasoning using Go term knowledge based on pattern knowledge. Both models are applied to the full-sized (19x19) opening games of Go instead of a simplified version (e.g. 9x9), which is often used to study AI methods in Go. Additionally, the heuristic searching model is applied to solving life-and-death problems in a local domain. Firstly, we analyse the feasibility of applying a TD(λ) learning model in the opening game of Go. We implement TD(λ) learning with a neural network to predict the next stone moves, to evaluate the different opening styles and to find the most favourable opening game for black, and then evaluate the performance of TD(λ). The empirical result for predicting the next stone moves is promising, but there is no guarantee that TD(λ) will always pick the same opening game, which is one of the most favourable opening games for black, independent of different λ values. The competition between two TD(λ)s shows that it is difficult to clearly say that which TD(λ) is better, with a few games played between two TD(λ)s. Secondly, we discuss the implementation of a novel fuzzy reasoning model, which includes three components: (l) a neural network with supervised learning to generate the candidate moves, (2) a heuristic evaluator of the candidate moves, and (3) an adapted neuro-fuzzy controller to decide the best next move. We also let the fuzzy reasoning model play against TD(λ) learning to test the performance. The experimental result reveals that even the simple fuzzy reasoning model can compete against TD(λ) learning and it shows great potential to be applied to the real game of Go. Finally, we construct a heuristic searching model that enables the reduction of the branching factor in a game tree for solving life-and-death problems. The set of first moves in a game tree composes a set of candidate moves generated by pattern clustering and a set of possible moves generated by eye shape analysis. Then α-β searching is conducted to find the best sequence of moves for solving life-and-death problems. The empirical result shows that when there is a complete boundary between black and white, the sequence of moves generated is almost always correct, except that it does not deal well with ko fighting. Meanwhile, α-β searching does not generate a correct sequence of moves for solving problems that have an incomplete boundary between black and white. Keywords: α-β Searching, Complexity. Computer Go, Distance Transform, Eve Shape Analysis, Explicit Go Knowledge, Go Knowledge, Implicit Go Knowledge, Influence, Learning, Mamdani Fuzzy Model, Neural Network, Neuro-Fuzzy Reasoning. Pattern Clustering, Pattern Recognition, Reasoning, Reinforcement Learning, Searching, Sugeno Fuzzy Model, Supervised Learning, Temporal Difference (TD) Learning, Tsukamoto Fuzzy Model, Unsupervised Learning. en
dc.language.iso en en
dc.publisher ResearchSpace@Auckland en
dc.relation.ispartof PhD Thesis - University of Auckland en
dc.relation.isreferencedby UoA99145137814002091 en
dc.rights Restricted Item. Available to authenticated members of The University of Auckland. en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title Multi-strategic learning, reasoning and searching in the game of Go en
dc.type Thesis en
thesis.degree.discipline Computer Science en
thesis.degree.grantor The University of Auckland en
thesis.degree.level Doctoral en
thesis.degree.name PhD en
dc.rights.holder Copyright: The author en
pubs.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. en
dc.identifier.wikidata Q112859946


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics