What is it about?
Most strategies for training deep neural networks (DNNs) currently rely on gradient descent. Although popular, gradient-based methods are far from perfect and suffer from well-known drawbacks such as vanishing gradients, poor conditioning, biological implausibility, and limited parallelism. To address these fundamental drawbacks, a new framework is proposed for training deep neural networks based on the alternating direction method of multipliers (ADMM), with a convergence guarantee regardless of how the method is initialized.
Why is it important?
Gradient-free optimization for training DNNs is a young yet promising research area. For example, we focus on alternating-optimization-based methods for training deep neural networks, which first transform the DNN training problem into an equivalent, decomposable one, and then split it into subproblems, one per layer, each of which can be solved separately and often admits an analytical solution.
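To make the layer-wise splitting concrete, here is a toy, self-contained sketch of penalty-based alternating minimization for a two-layer ReLU network. It is only an illustration of the general idea under assumed names (rho, z1_update, the synthetic data), not the paper's dlADMM algorithm, which additionally maintains dual variables and uses backward-forward sweeps with quadratic approximations.

```python
# Toy sketch: layer-wise alternating minimization for a two-layer ReLU network.
# Constraints z1 = X @ W1, a1 = relu(z1), z2 = a1 @ W2 are relaxed into
# quadratic penalties with weight rho, so each block has a simple minimizer.
import numpy as np

rng = np.random.default_rng(0)

# Synthetic regression data: n samples, d inputs, h hidden units, k outputs.
n, d, h, k = 200, 10, 16, 1
X = rng.normal(size=(n, d))
y = np.sin(X @ rng.normal(size=(d, k)))

relu = lambda t: np.maximum(t, 0.0)

# Initialize weights and the auxiliary per-layer variables.
W1 = 0.1 * rng.normal(size=(d, h))
W2 = 0.1 * rng.normal(size=(h, k))
z1 = X @ W1
a1 = relu(z1)
z2 = a1 @ W2
rho = 1.0

def z1_update(p, a):
    """Element-wise minimizer of (z - p)^2 + (a - relu(z))^2 via case analysis."""
    z_pos = np.maximum((p + a) / 2.0, 0.0)      # branch z >= 0 (relu(z) = z)
    f_pos = (z_pos - p) ** 2 + (a - z_pos) ** 2
    z_neg = np.minimum(p, 0.0)                  # branch z <= 0 (relu(z) = 0)
    f_neg = (z_neg - p) ** 2 + a ** 2
    return np.where(f_pos <= f_neg, z_pos, z_neg)

for it in range(50):
    # Block-coordinate updates: each step exactly minimizes the penalized
    # objective with respect to one block (least squares or case analysis).
    W1 = np.linalg.lstsq(X, z1, rcond=None)[0]          # fit z1 ~= X @ W1
    z1 = z1_update(X @ W1, a1)                          # couple z1 to a1 = relu(z1)
    a1 = (relu(z1) + z2 @ W2.T) @ np.linalg.inv(np.eye(h) + W2 @ W2.T)
    W2 = np.linalg.lstsq(a1, z2, rcond=None)[0]         # fit z2 ~= a1 @ W2
    z2 = (2 * y + rho * a1 @ W2) / (2 + rho)            # balance loss vs. penalty

pred = relu(X @ W1) @ W2
print("feed-forward MSE after alternating updates:", float(np.mean((pred - y) ** 2)))
```

Each update touches only one layer's variables, which is what makes the subproblems small, parallelizable across layers, and free of backpropagated gradients.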
Read the Original
This page is a summary of: ADMM for Efficient Deep Learning with Global Convergence, July 2019, ACM (Association for Computing Machinery),
DOI: 10.1145/3292500.3330936.