corpus linguistics

Constructing a Field Dictionary from Scratch

Imagine being the first outsider to document a language with no written form. How would you create its first-ever dictionary?…

3 weeks ago

Quantifying a Dialect’s ‘Distance’

How different are two dialects, really? Linguists can now answer that question with surprising precision. Discover the methods used to…

3 weeks ago

How Algorithms Read Your Resume

Applicant Tracking Systems (ATS) don't "read" your resume; they parse it using strict linguistic rules. To get past this digital…

3 weeks ago

The Anti-Turing Test: What is Author Verification?

Author verification is the "Anti-Turing Test"—a field of linguistic forensics that determines if a specific person wrote a given text.…

4 weeks ago

The Great Manx Comeback

In 1974, UNESCO declared the Manx language extinct with the death of its last native speaker, Ned Maddrell. Yet, this…

4 weeks ago

Digitizing the Dead Sea Scrolls: An OCR Puzzle

Teaching a computer to read is simple, but what if the text is a 2,000-year-old, fragmented manuscript written in an…

4 weeks ago

The Rebirth of Cornish

Once officially declared extinct after the death of its last native speaker in the 18th century, the Cornish language (Kernewek)…

4 months ago

AI’s Linguistic Blind Spots

AI language models can write poetry and translate languages, but their impressive abilities mask significant linguistic blind spots. Inheriting biases…

4 months ago

The Linguist Who Caught the Unabomber

For 17 years, the Unabomber was a ghost, but his most powerful weapon—a 35,000-word manifesto—became his undoing. This is the…

4 months ago

The Politics of Dictionaries

Ever wonder who decides when a word like 'rizz' is official? This post delves into the surprisingly political world of…

4 months ago

This website uses cookies.