The application of TD(?) Learning to the Opening Games of 19x19 Go

ResearchSpace/Manakin Repository

Show simple item record

dc.contributor.author Lee, Byung-Doo en
dc.contributor.author Guesgen, Hans en
dc.date.accessioned 2008-08-21T01:57:29Z en
dc.date.available 2008-08-21T01:57:29Z en
dc.date.issued 2003 en
dc.identifier.citation Communication and Information Technology Research Technical Report 136, (2003) en
dc.identifier.issn 1178-3609 en
dc.identifier.uri http://hdl.handle.net/2292/2830 en
dc.description You are granted permission for the non-commercial reproduction, distribution, display, and performance of this technical report in any format, BUT this permission is only for a period of 45 (forty-five) days from the most recent time that you verified that this technical report is still available from the original CITR web site; http://citr.auckland.ac.nz/techreports/ under terms that include this permission. All other rights are reserved by the author(s). en
dc.description.abstract This paper describes the results of applying Temporal Difference (TD) learning with a network to the opening game problems in Go. The main difference from other research is that this experiment applied TD learning to the fullsized (19x19) game of Go instead of a simple version (e.g., 9x9 game). We discuss and compare TD(?) learning for predicting an opening game's winning and for finding the best game among the prototypical professional opening games. We also tested the performance of TD(?)s by playing against each other and against the commercial Go programs. The empirical result for picking the best game is promising, but there is no guarantee that TD(?) will always pick the identical opening game independent of different values. The competition between two TD(?)s shows that TD(?) with a higher ? has better performance. en
dc.publisher CITR, The University of Auckland, New Zealand en
dc.relation.ispartofseries Communication and Information Technology Research (CITR) Technical Report Series en
dc.rights Copyright CITR, The University of Auckland. You are granted permission for the non-commercial reproduction, distribution, display, and performance of this technical report in any format, BUT this permission is only for a period of 45 (forty-five) days from the most recent time that you verified that this technical report is still available from the original CITR web site under terms that include this permission. All other rights are reserved by the author(s). en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.source.uri http://citr.auckland.ac.nz/techreports/2003/CITR-TR-136.pdf en
dc.title The application of TD(?) Learning to the Opening Games of 19x19 Go en
dc.type Technical Report en
dc.subject.marsden Fields of Research::280000 Information, Computing and Communication Sciences en


Full text options

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Advanced Search

Browse