The Role of Language in the Automatic Coding of Political Texts

Didier Ruedin
  • Swiss Political Science Review, November 2013, Wiley
  • DOI: 10.1111/spsr.12050

We can use automatic approaches to coding party manifestos across languages

What is it about?

Researchers increasingly use automatic approaches to coding party manifestos and other political texts. When these political texts are written in different languages, we might worry whether the results are comparable. Here I use political texts available in two languages to examine the impact of the source language on the results obtained. I use both the popular Wordscores method and a dictionary of keywords. When I use so-called stop words, the results are almost identical in both languages.

Why is it important?

Automatic coding is a blessing to researchers, greatly reducing the work necessary to code political texts like party manifestos. With this, new research designs become possible, many of which are comparative in nature. The validity of these automatic approaches is central to their success. When working with different languages across countries, it is not evident whether differences in numbers are really differences in positions, or whether we merely pick up linguistic differences. This analysis reassures us that we pick up substantive differences.

Perspectives

Didier Ruedin (Author)
University of Neuch√Ętel

In most cases, we can only assert that we pick up substantive differences when political texts are written in different languages. Here we have a particular situation, political manifestos that are available in two languages -- professionally translated with the same contents. It is reassuring to see that stop-words work as advertised, and linguistic differences can be removed.

Read Publication

http://dx.doi.org/10.1111/spsr.12050

The following have contributed to this page: Didier Ruedin