Michigan Tech Publications, Part 1

A neural network model for cache and memory prediction of neural networks

Sai Sha, Peking University
Yingwei Luo, Peking University
Zhenlin Wang, Michigan Technological UniversityFollow
Xiaolin Wang, Peking University

Document Type

Conference Proceeding

Publication Date

3-21-2019

Department

Department of Computer Science

Abstract

Neural networks have been widely applied to various research and production fields. However, most recent research is focused on the establishment and selection of a specific neural network model. Less attention is paid to their system overhead despite of their massive computing and storage resource demand. This research focuses on a relatively new research direction that models the system-level memory and cache demand of neural networks. We utilize a neural network to learn and predict hit ratio curve and memory footprint of neural networks with their hyper-parameters as input. The prediction result is used to drive cache partitioning and memory partitioning to optimize co-execution of multiple neural networks. To demonstrate effectiveness of our approach, we model four common networks, BP neural network, convolutional neural network, recurrent neural network, and autoencoder. We investigate the influence of hyper-parameters of each model on the last level cache and memory demand. We resort to the BP algorithm as the learning tool to predict last level cache hit ratio curve and memory usage. Our experimental results show that cache and memory allocation schemes guided by our prediction optimize for a wide range of performance targets.

Publisher's Statement

Publication Title

2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Ubiquitous Computing & Communications, Big Data & Cloud Computing, Social Computing & Networking, Sustainable Computing & Communications (ISPA/IUCC/BDCloud/SocialCom/SustainCom)

Recommended Citation

Sha, S., Luo, Y., Wang, Z., & Wang, X. (2019). A neural network model for cache and memory prediction of neural networks. 2018 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Ubiquitous Computing & Communications, Big Data & Cloud Computing, Social Computing & Networking, Sustainable Computing & Communications (ISPA/IUCC/BDCloud/SocialCom/SustainCom), 972-978. http://doi.org/10.1109/BDCloud.2018.00142
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/905

Link to Full Text

COinS

Michigan Tech Publications, Part 1

A neural network model for cache and memory prediction of neural networks

Document Type

Publication Date

Department

Abstract

Publisher's Statement

Publication Title

Recommended Citation

LINKS

Browse

Search

Author Corner

Links

Michigan Tech Publications, Part 1

A neural network model for cache and memory prediction of neural networks

Authors

Document Type

Publication Date

Department

Abstract

Publisher's Statement

Publication Title

Recommended Citation

Share

LINKS

Browse

Search

Author Corner

Links