What is it about?
Coloring black-and-white videos is challenging because a scene can often be plausibly colored in many different ways. One approach for image colorization is to guide the model with text captions; however, writing captions by hand does not scale to video. Our work, called RAGCol, addresses this with retrieval-augmented generation (RAG): it automatically generates text captions for the video, enriches them with external knowledge, and uses the result to ground the colorization in real-world facts. We test the method on a range of videos, where it outperforms the previous best method.
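The flow can be sketched in a few lines. This is an illustrative toy pipeline, not the RAGCol implementation: the captioner, retriever, and colorizer below are hypothetical stubs standing in for the real models.

```python
# Toy sketch of a RAG-style colorization pipeline (hypothetical stubs, not the
# actual RAGCol models).

def generate_caption(frame):
    # Stand-in for an automatic captioning model run on the grayscale frame.
    return "a steam locomotive at a station"

def retrieve_knowledge(caption, knowledge_base):
    # Retrieve external facts whose keywords appear in the caption.
    return [fact for key, fact in knowledge_base.items() if key in caption]

def colorize(frame, caption, facts):
    # Stand-in for the colorizer, conditioned on the knowledge-enriched caption.
    prompt = "; ".join([caption] + facts)
    return {"frame": frame, "conditioning": prompt}

knowledge_base = {"locomotive": "steam locomotives of this era were typically black"}

frame = "grayscale_frame_0"
caption = generate_caption(frame)
facts = retrieve_knowledge(caption, knowledge_base)
result = colorize(frame, caption, facts)
print(result["conditioning"])
```

In the real system each stub would be a learned model; the point is the flow: caption the frame, enrich the caption with retrieved knowledge, and condition the colorization on the result.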
Why is it important?
Colorization allows users to feel more connected to the past, but only if done correctly. Current colorizers mostly rely on neural networks and are prone to inaccurate, implausible colorizations. This work mitigates that issue by grounding the colorization in external knowledge. Beyond colorization, the approach has broader potential in other domains to make artificial intelligence more accurate, trustworthy and robust.
Perspectives
As someone deeply interested in history, this work excites me because it offers a new and improved method for restoring archival material. Enhancing the quality and accuracy of historical video colorization will enable wider dissemination and, in turn, a stronger connection between people and their culture and history. This is particularly relevant for material from a time that may not receive as much attention as it deserves.
Rory Ward
National University of Ireland - Galway
Read the Original
This page is a summary of: RAGCol: RAG-Based Automatic Video Colorization Through Text Caption Generation and Knowledge Enrichment, March 2025, ACM (Association for Computing Machinery),
DOI: 10.1145/3672608.3707748.
Resources
FRCol: Face Recognition Based Speaker Video Colorization
Automatic video colorization has recently gained attention for its ability to adapt old movies for today’s modern entertainment industry. However, there is a significant challenge: limiting unnatural color hallucination. Generative artificial intelligence often produces erroneous results, which in colorization manifest as unnatural colorizations. In this work, we propose to ground our automatic video colorization system in relevant exemplars retrieved from a face database using facial recognition technology. The retrieved exemplar guides the colorization of the latent-diffusion-based speaker video colorizer. We dub our system FRCol. We focus on speakers because humans have evolved to pay particular attention to certain regions of a scene, faces chief among them. We improve the previous state-of-the-art (SOTA) DeOldify by an average of 13% on the standard metrics of PSNR, SSIM, FID, and FVD on the Grid and Lombard Grid datasets. Our user study also consolidates these results: FRCol was preferred to contemporary colorizers 81% of the time.
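The retrieval step described above can be sketched as a nearest-neighbour search over face embeddings. The embeddings, database, and file names below are made-up placeholders for illustration, not FRCol's actual models or data.

```python
# Hypothetical sketch of exemplar retrieval via face recognition: the face in
# the black-and-white frame is matched against a database of color exemplars by
# cosine similarity of face embeddings, and the best match guides colorization.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def retrieve_exemplar(query_embedding, database):
    # database maps speaker_id -> (face embedding, color exemplar image path)
    best_id = max(database, key=lambda k: cosine(query_embedding, database[k][0]))
    return database[best_id][1]

db = {
    "speaker_a": ([1.0, 0.1, 0.0], "exemplar_a.png"),
    "speaker_b": ([0.0, 0.9, 0.4], "exemplar_b.png"),
}
print(retrieve_exemplar([0.95, 0.2, 0.05], db))  # closest to speaker_a
```

In practice the embeddings would come from a trained face-recognition network; the retrieved color exemplar then conditions the diffusion-based colorizer.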
ControlCol: Controllability in Automatic Speaker Video Colorization
Adding color to black-and-white speaker videos automatically is a highly desirable technique. It is an artistic process that requires interactivity with humans for the best results. Many existing automatic video colorization systems provide little opportunity for the user to guide the colorization process. In this work, we introduce a novel automatic speaker video colorization system which provides controllability to the user while also maintaining high colorization quality relative to state-of-the-art techniques. We name this system ControlCol. ControlCol performs 3.5% better than the previous state-of-the-art DeOldify on the Grid and Lombard Grid datasets when PSNR, SSIM, FID and FVD are used as metrics. This result is also supported by our human evaluation, where in a head-to-head comparison, ControlCol is preferred 90% of the time to DeOldify.
LatentColorization: Latent Diffusion-Based Speaker Video Colorization
While current research predominantly focuses on image-based colorization, the domain of video-based colorization remains relatively unexplored. Many existing video colorization techniques operate frame-by-frame, often overlooking the critical aspect of temporal coherence between successive frames. This approach can result in inconsistencies across frames, leading to undesirable effects like flickering or abrupt color transitions between frames. To address these challenges, we combine the generative capabilities of a fine-tuned latent diffusion model with an autoregressive conditioning mechanism to ensure temporal consistency in automatic speaker video colorization. We demonstrate strong improvements on established quality metrics compared to existing methods, namely, PSNR, SSIM, FID, FVD, NIQE and BRISQUE. Specifically, we achieve an 18% improvement in performance when FVD is employed as the evaluation metric. Furthermore, we performed a subjective study, where users preferred LatentColorization to the existing state-of-the-art DeOldify 80% of the time. Our dataset combines conventional datasets and videos from television/movies. A short demonstration of our results can be seen in some example videos available at https://youtu.be/vDbzsZdFuxM.
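The autoregressive conditioning idea can be illustrated with a toy loop: each frame is colorized conditioned on the previously colorized frame, so color decisions propagate forward in time. The stub below is a hypothetical stand-in for the latent diffusion colorizer, not the actual model.

```python
# Sketch of autoregressive conditioning for temporal consistency (hypothetical
# stub in place of the latent diffusion colorizer).

def colorize_frame(gray_frame, previous_color_frame):
    # Stand-in: a real model would generate colors conditioned on the previous
    # colorized frame, keeping colors consistent across time.
    return f"color({gray_frame} | prev={previous_color_frame})"

def colorize_video(gray_frames):
    colored, prev = [], None
    for frame in gray_frames:
        prev = colorize_frame(frame, prev)  # condition on the last output
        colored.append(prev)
    return colored

print(colorize_video(["f0", "f1"]))
```

Because each output depends on the one before it, abrupt color changes between neighbouring frames (flicker) are discouraged by construction.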
Knowledge-Guided Colorization: Overview, Prospects and Challenges
Automatic image colorization is notorious for being an ill-posed problem, i.e., multiple plausible colorizations exist for any given black-and-white image. Current approaches to this task revolve around deep neural network-based systems, which do not incorporate knowledge into their colorizations. We present Knowledge-Guided Colorization as a possible solution to the above-mentioned problems. Knowledge-Guided Colorization combines a deep learning-based colorization system and a knowledge graph to inform its colorizations. This is the first time these two techniques have been combined for colorization. The prospects of knowledge-guided colorization are promising, with various potential application scenarios. However, several associated challenges are also highlighted in this research.
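A knowledge graph can be consulted as a set of (subject, relation, object) triples that constrain the colorizer's palette. The triples and helper below are made-up placeholders sketching the idea, not the paper's actual graph or system.

```python
# Minimal sketch of consulting a knowledge graph to constrain colorization
# (hypothetical triples and helper names).
triples = [
    ("grass", "has_typical_color", "green"),
    ("sky", "has_typical_color", "blue"),
    ("taxi_nyc", "has_typical_color", "yellow"),
]

def typical_color(entity, graph):
    # Look up the entity's typical color; return None if the graph is silent,
    # in which case the neural colorizer's own prediction would be used.
    for s, r, o in graph:
        if s == entity and r == "has_typical_color":
            return o
    return None

for entity in ["grass", "sky", "car"]:
    print(entity, "->", typical_color(entity, triples))
```

The design point is the fallback: the graph supplies hard facts where it has them, and the learned model fills in everything else.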
Towards Temporal Stability in Automatic Video Colourisation
Much research has been carried out into the automatic restoration of archival images. This research ranges from colourisation, to damage restoration, and super-resolution. Conversely, video restoration has remained largely unexplored. Most efforts to date have involved extending a concept from image restoration to video, in a frame-by-frame manner. These methods result in poor temporal consistency between frames, which manifests itself as temporal instability or flicker. The purpose of this work is to improve upon this limitation by employing a hybrid approach of deep-learning and exemplar-based colourisation, informing the current frame's colourisation with its neighbouring frames' colourisations and thereby alleviating inter-frame discrepancies. This paper has two main contributions. Firstly, a novel end-to-end automatic video colourisation technique with enhanced flicker reduction capabilities is proposed. Secondly, six automatic exemplar acquisition algorithms are compared. The combination of these algorithms and techniques allows for an 8.5% increase in non-referenced image quality over the previous state of the art.