Will.Whim

A weblog by Will Fitzgerald

Language Identification: A Computational Linguistics Primer

Slides and results from a talk I gave at Kalamazoo College on language identification.

My co-worker at Powerset, Chris Biemann, has a nice paper on Unsupervised Language Identification
.

Advertisements

One response to “Language Identification: A Computational Linguistics Primer

  1. Daniel Lemire April 27, 2009 at 3:40 pm

    Great. Thanks for sharing.

    I did some vaguely related work hashing n-grams… you may appreciate it:

    Recursive n-gram hashing is pairwise independent, at best
    http://arxiv.org/abs/0705.4676

    (You provided the initial motivation of this paper a long, long, long time ago!)

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: