Please note: In order to keep Hive up to date and provide users with the best features, we are no longer able to fully support Internet Explorer. The site is still available to you, however some sections of the site may appear broken. We would encourage you to move to a more modern browser like Firefox, Edge or Chrome in order to experience the site fully.

Data Orchestration in Deep Learning Accelerators, PDF eBook

Data Orchestration in Deep Learning Accelerators PDF

Part of the Synthesis Lectures on Computer Architecture series

PDF

Please note: eBooks can only be purchased with a UK issued credit card and all our eBooks (ePub and PDF) are DRM protected.

Description

This Synthesis Lecture focuses on techniques for efficient data orchestration within DNN accelerators.

The End of Moore's Law, coupled with the increasing growth in deep learning and other AI applications has led to the emergence of custom Deep Neural Network (DNN) accelerators for energy-efficient inference on edge devices.

Modern DNNs have millions of hyper parameters and involve billions of computations; this necessitates extensive data movement from memory to on-chip processing engines.

It is well known that the cost of data movement today surpasses the cost of the actual computation; therefore, DNN accelerators require careful orchestration of data across on-chip compute, network, and memory elements to minimize the number of accesses to external DRAM.

The book covers DNN dataflows, data reuse, buffer hierarchies, networks-on-chip, and automated design-space exploration.

It concludes with data orchestration challenges with compressed and sparse DNNs and future trends.

The target audience is students, engineers, and researchers interested in designing high-performance and low-energy accelerators for DNN inference.

Information

Other Formats

Information