What is it about?

Coloring black-and-white videos is challenging because there are often many ways a scene can be colored. One approach for image colorization is to use text captions; however, this is too complicated for videos. Our work, called RAGCol, uses the latest advances in machine learning to address this challenge. RAGCol combines video colorization with external knowledge to ground the colorization in real-world knowledge. We test the methodology on a range of videos, where it outperforms the previous best method.

Featured Image

Why is it important?

Colorization allows users to feel more connected to the past, but only if done correctly. Current colorizers, which mostly rely on neural networks, are prone to mistakes and inaccurate colorizations. This work limits this issue by leveraging external knowledge. This work is relevant in the colorization application but also has broader potential in other domains to make artificial intelligence more accurate, trustworthy and robust.

Perspectives

As someone deeply interested in history, this work excites me because it offers a new and improved method for restoring archival material. Enhancing the quality and accuracy of historical video colorization will enable better dissemination and, therefore, connection of people with culture and history. This is particularly relevant for material from a time that may not receive as much attention as it deserves.

Rory Ward
National University of Ireland - Galway

Read the Original

This page is a summary of: RAGCol: RAG-Based Automatic Video Colorization Through Text Caption Generation and Knowledge Enrichment, March 2025, ACM (Association for Computing Machinery),
DOI: 10.1145/3672608.3707748.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page