What is it about?

Nowadays, convolutional neural networks (CNNs) are widely used in computer vision applications. However, the push for higher accuracy and higher resolution yields ever-larger networks, making computation and I/O the key bottlenecks. In this paper, we propose XVDPU, an AI-Engine (AIE)-based CNN accelerator on Versal chips, to meet heavy computation requirements. To alleviate the I/O bottleneck, we adopt several techniques that improve data reuse and reduce I/O traffic. We further propose an Arithmetic Logic Unit (ALU) that better balances resource utilization, support for new features, and whole-system efficiency. We have successfully deployed more than 100 CNN models with our accelerator.
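The data-reuse idea can be illustrated with a toy example: if the output feature map is computed tile by tile, the same kernel weights are reused for every pixel in a tile, so in hardware they need to be fetched from off-chip memory only once per tile rather than once per pixel. The following plain-Python sketch is purely illustrative and is not the paper's AIE implementation; the function name, single-channel layout, and tile size are all hypothetical.

```python
def conv2d_tiled(ifm, weights, tile=4):
    """Naive single-channel 3x3 'valid' convolution, computed tile by tile.

    ifm:     H x W input feature map (list of lists of floats)
    weights: 3 x 3 kernel (list of lists of floats)

    In an accelerator, `weights` would be loaded into local memory once
    and reused for every output pixel of every tile (data reuse).
    """
    H, W = len(ifm), len(ifm[0])
    OH, OW = H - 2, W - 2                      # 'valid' output size
    ofm = [[0.0] * OW for _ in range(OH)]
    for ty in range(0, OH, tile):              # iterate over output tiles
        for tx in range(0, OW, tile):
            for y in range(ty, min(ty + tile, OH)):
                for x in range(tx, min(tx + tile, OW)):
                    acc = 0.0
                    for ky in range(3):        # 3x3 sliding window
                        for kx in range(3):
                            acc += ifm[y + ky][x + kx] * weights[ky][kx]
                    ofm[y][x] = acc
    return ofm
```

The tiled loop produces exactly the same output as an untiled convolution; the benefit is purely in memory traffic, since each tile's inputs and the shared weights can stay in fast local buffers while the tile is computed.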


Why is it important?

Our experimental results show that the 96-AIE-core implementation achieves 1653 frames per second (FPS) for ResNet50 on VCK190, which is 9.8× faster than the design on ZCU102 running at 168.5 FPS. The 256-AIE-core implementation further achieves 4050 FPS. We also propose a tiling strategy that achieves feature-map-stationary (FMS) dataflow for high-definition CNNs (HD-CNNs) on the accelerator, yielding a 3.8× FPS improvement on the Residual Channel Attention Network (RCAN) and 3.1× on Super-Efficient Super-Resolution (SESR). The accelerator can also handle the 3D convolutions used in disparity estimation, achieving end-to-end (E2E) performance of 10.1 FPS with all optimizations applied.
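As a quick sanity check, the quoted 9.8× speedup follows directly from the two throughput figures given above:

```python
# Speedup implied by the reported throughputs (figures taken from the summary).
fps_vck190_96aie = 1653.0   # ResNet50 on VCK190, 96-AIE-core implementation
fps_zcu102 = 168.5          # same model on the ZCU102 design
speedup = fps_vck190_96aie / fps_zcu102
print(f"{speedup:.1f}x")    # prints "9.8x"
```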

Read the Original

This page is a summary of: XVDPU: A High Performance CNN Accelerator on Versal Platform Powered by AI Engine, ACM Transactions on Reconfigurable Technology and Systems, September 2023, ACM (Association for Computing Machinery),
DOI: 10.1145/3617836.
