Performance optimisation strategies for automatically generated FPGA accelerators for biomedical models

Show simple item record

dc.contributor.author Yu, Ting en
dc.contributor.author Oppermann, J en
dc.contributor.author Bradley, Christopher en
dc.contributor.author Sinnen, Oliver en
dc.date.accessioned 2016-09-05T07:40:38Z en
dc.date.issued 2016-04-10 en
dc.identifier.citation Concurrency and Computation: Practice and Experience, 2016, 28(5), pp. 1480 - 1506 en
dc.identifier.issn 1532-0626 en
dc.identifier.uri http://hdl.handle.net/2292/30224 en
dc.description.abstract Biomedical modelling that is mathematically described by ordinary differential equations (ODEs) is often one of the most computationally intensive parts of simulations. With high inherent parallelism, hardware acceleration based on field programmable gate array has great potential to increase the computational performance of the ODE model integration while being very power efficient. ODE-based Domain-specific Synthesis Tool is a tool we proposed previously to automatically generate the complete hardware/software co-design framework for computing biomedical models based on CellML. Although it provides remarkable performance improvement and high energy efficiency compared with CPUs and GPUs, there is still a great potential for optimisation. In this paper, we investigate a set of optimisation strategies including compiler optimisation, resource fitting and balancing, and multiple pipelines. They all have in common that they can be performed automatically and hence can be integrated in our domain-specific high level synthesis tool. We evaluate the optimised hardware accelerator modules generated by ODE-based Domain-specific Synthesis Tool on real hardware based on their resource usage, processing speed and power consumption. The results are compared with single threaded and multi-core CPUs with/without Streaming SIMD Extension (SSE) optimisation and a graphics card. The results show that the proposed optimisation strategies provide significant performance improvement and result in even more energy-efficient hardware accelerator modules. Furthermore, the resources of the target field programmable gate array device can be more efficiently utilised in order to fit larger biomedical models than before. en
dc.description.uri http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)1532-0634 en
dc.publisher Wiley en
dc.relation.ispartofseries Concurrency and Computation: Practice and Experience en
dc.rights Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. Details obtained from http://www.sherpa.ac.uk/romeo/issn/1532-0626/ en
dc.rights.uri https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm en
dc.title Performance optimisation strategies for automatically generated FPGA accelerators for biomedical models en
dc.type Journal Article en
dc.identifier.doi 10.1002/cpe.3699 en
pubs.issue 5 en
pubs.begin-page 1480 en
pubs.volume 28 en
dc.description.version AM - Accepted Manuscript en
dc.rights.holder Copyright: Wiley en
pubs.author-url http://onlinelibrary.wiley.com/doi/10.1002/cpe.3699/full en
pubs.end-page 1506 en
pubs.publication-status Published en
dc.rights.accessrights http://purl.org/eprint/accessRights/OpenAccess en
pubs.subtype Article en
pubs.elements-id 515849 en
pubs.org-id Bioengineering Institute en
pubs.org-id ABI Associates en
pubs.org-id Engineering en
pubs.org-id Department of Electrical, Computer and Software Engineering en
dc.identifier.eissn 1532-0634 en
pubs.record-created-at-source-date 2016-01-05 en


Files in this item

Find Full text

This item appears in the following Collection(s)

Show simple item record

Share

Search ResearchSpace


Browse

Statistics