Tamil Offensive Language Detection: Supervised versus Unsupervised Learning Approaches

Vimala Balakrishnan; Vithyatheri Govindan; Kumanan N Govaichelvan

doi:10.1145/3575860

What is it about?

Offensive language detection is rarely studied in low-resource languages. This study uses Tamil to detect offensive language patterns with machine learning, comparing supervised and unsupervised approaches.

Why is it important?

The findings show that the unsupervised approach performs considerably better than the supervised approach at detecting offensive patterns in a low-resource language. In addition, unsupervised clustering achieved better accuracy than the human-annotated dataset.

Perspectives

I hope this article gives researchers insight into improving clustering methods on balanced and imbalanced datasets.
Vithya Govindan
University of Malaya

This page is a summary of: Tamil Offensive Language Detection: Supervised versus Unsupervised Learning Approaches, ACM Transactions on Asian and Low-Resource Language Information Processing, December 2022, ACM (Association for Computing Machinery),
DOI: 10.1145/3575860.

Contributors

The following have contributed to this page

Vithya Govindan
University of Malaya
