Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos

Hadi Alzayer; Zhihao Xia; Xuaner (Cecilia) Zhang; Eli Shechtman; Jia-Bin Huang; Michael Gharbi

doi:10.1145/3750722

What is it about?

We train a diffusion model that takes a roughly edited image and turns it into a photorealistic result while staying faithful to the original image.

Why is it important?

Prior work relies on text to perform an edit, which is difficult to control precisely and struggles to stay faithful to the original image. Our work lets users edit their photos intuitively: they make a quick, rough edit without worrying about precision, and the model does the heavy lifting to make the edit neat and realistic.

Perspectives

The key insight is that we use video data to train an image editing model. Objects move naturally in videos, so video can be used to teach the model how to rearrange or edit a photo realistically.
Hadi Alzayer
University of Maryland at College Park
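
As a rough illustration of how video could supply training examples for an image editing model, the sketch below picks pairs of frame indices from a single clip, so that one frame can play the role of the original photo and a later frame the role of the realistic result. This is only a minimal sketch under stated assumptions: the function name `sample_frame_pairs` and the `min_gap` and `max_gap` parameters are hypothetical and are not taken from the paper, which may construct its training data differently.

```python
import random
from typing import List, Tuple


def sample_frame_pairs(
    num_frames: int,
    num_pairs: int,
    min_gap: int = 5,
    max_gap: int = 30,
    seed: int = 0,
) -> List[Tuple[int, int]]:
    """Pick (reference, target) frame indices from one video clip.

    Hypothetical helper: the gap bounds are illustrative knobs, not
    values from the paper. A larger gap means more object motion
    between the two frames.
    """
    if min_gap > max_gap:
        raise ValueError("min_gap must not exceed max_gap")
    if num_frames <= min_gap:
        raise ValueError("clip is too short for the requested minimum gap")

    rng = random.Random(seed)  # seeded for reproducible sampling
    pairs = []
    for _ in range(num_pairs):
        # Choose a temporal gap that still fits inside the clip.
        gap = rng.randint(min_gap, min(max_gap, num_frames - 1))
        # Choose a reference frame early enough that the target exists.
        ref = rng.randint(0, num_frames - 1 - gap)
        pairs.append((ref, ref + gap))
    return pairs


if __name__ == "__main__":
    for ref, target in sample_frame_pairs(num_frames=120, num_pairs=3):
        print(f"reference frame {ref} -> target frame {target}")
```

In a setup like this, each pair shows the same scene before and after natural motion, which is the kind of supervision the perspective above describes.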

This page is a summary of: Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos, ACM Transactions on Graphics, July 2025, ACM (Association for Computing Machinery), DOI: 10.1145/3750722.

Contributors

The following have contributed to this page

Hadi Alzayer
University of Maryland at College Park
