What is it about?

We train a diffusion model that takes a roughly edited image, and makes it a photorealistic masterpiece while being faithful to the original image!

Featured Image

Why is it important?

Prior work relies on "text" to perform an edit, which is difficult to control precisely, and struggles to be faithful to the original image. With our work, we allow the user to edit their photos intuitively (just do a quick edit without worrying about precision), and the model does the heavy lifting and make your edit neat and realistic.

Perspectives

The key insight is that we used video data to train an "image editing" model. Since in videos, objects move naturally, and it can be used to teach the model on how to re-arrange or edit a photo realistically.

Hadi Alzayer
University of Maryland at College Park

Read the Original

This page is a summary of: Magic Fixup: Streamlining Photo Editing by Watching Dynamic Videos, ACM Transactions on Graphics, July 2025, ACM (Association for Computing Machinery),
DOI: 10.1145/3750722.
You can read the full text:

Read

Contributors

The following have contributed to this page