Automatically Extracting Typical Syntactic Differences from Corpora

W. Wiersma; J. Nerbonne; T. Lauttamus

doi:10.1093/llc/fqq017

What is it about?

We compare "styles" of different sorts of speech with respect to their syntax. We do this by comparing the sequences of the parts of speech (POS) used. The sorts of language compared are the conversational English spoken in Australia by first-generation immigrants (after thirty years in the country) and that of their children, who immigrated at age 17 or younger (at an average age of under 10).
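As a rough illustration of what comparing POS sequences can look like, here is a minimal Python sketch. It rests on assumptions not stated in this summary: sequences are taken to be fixed-length n-grams (trigrams by default), the two samples are compared by the difference in relative frequency, and the tag names and toy utterances are invented for the example. The function names are hypothetical and this is not the authors' implementation; a real study would also need a statistical test to decide which differences are reliable.

```python
from collections import Counter


def pos_ngrams(tags, n=3):
    """Return all contiguous n-grams of POS tags from one utterance."""
    return [tuple(tags[i:i + n]) for i in range(len(tags) - n + 1)]


def relative_frequencies(utterances, n=3):
    """Map each POS n-gram to its share of all n-grams in the sample."""
    counts = Counter()
    for tags in utterances:
        counts.update(pos_ngrams(tags, n))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {gram: count / total for gram, count in counts.items()}


def rank_differences(sample_a, sample_b, n=3):
    """Rank n-grams by absolute difference in relative frequency (a minus b)."""
    freq_a = relative_frequencies(sample_a, n)
    freq_b = relative_frequencies(sample_b, n)
    grams = set(freq_a) | set(freq_b)
    diffs = {g: freq_a.get(g, 0.0) - freq_b.get(g, 0.0) for g in grams}
    return sorted(diffs.items(), key=lambda item: abs(item[1]), reverse=True)


# Invented toy data: each utterance is already a list of POS tags.
adults = [
    ["PRON", "VERB", "DET", "NOUN"],
    ["PRON", "VERB", "NOUN"],
]
juveniles = [
    ["PRON", "VERB", "DET", "NOUN"],
    ["PRON", "AUX", "VERB", "DET", "NOUN"],
]

if __name__ == "__main__":
    for gram, diff in rank_differences(adults, juveniles):
        print(" ".join(gram), round(diff, 3))
```

A positive difference marks a tag sequence that is relatively more frequent in the first sample, a negative one a sequence more frequent in the second.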

Why is it important?

This was one of the first papers to show how natural language processing (NLP), here in the form of a part-of-speech (POS) tagger, could play a role in detecting syntactic differences. Applying NLP to conversation was risky, since POS taggers were developed on edited, carefully written texts. Indeed, tagging performance fell massively, but not so much that the project was endangered.

Perspectives

Although the focus was on contact linguistics -- the speech of Finnish immigrants to Australia -- the study showed that NLP could be harnessed for the study of style differences, such as those studied in stylometry (within digital humanities).
Professor John Nerbonne
Rijksuniversiteit Groningen

This page is a summary of: Automatically Extracting Typical Syntactic Differences from Corpora, Literary and Linguistic Computing, October 2010, Oxford University Press (OUP),
DOI: 10.1093/llc/fqq017.

Contributors

The following have contributed to this page

Professor John Nerbonne
Rijksuniversiteit Groningen

Comparing speech with respect to syntax using computational methods.
