LING 073 / CPSC 013 — Spring 2022
Computational Linguistics

Professor: Jonathan North Washington
Office: Pearson 105
Office phone: x6134
Office hours: T 13:30-15:00
& by appointment
also available by messaging on Google Chat/Hangouts
Email/messaging: jwashin1@swarth...more.edu
 
Meeting time: TTh 9:55-11:10
Lab hours: TBD
Meeting modality: Mixed (in person as possible)
Physical classroom: Clothier 16
Online classroom: Gather (see Moodle for meeting URL)
Course website: http://jnw.domains.swarthmore.edu/ling073
Course wiki: http://wikis.swarthmore.edu/ling073
IRC channel: irc.oftc.net#swatling
Course Piazza site: LING 073
Course Moodle site: LING073-01-CPSC013-01-S22

Course Syllabus

Schedule (subject to change)

week | date | topic | to read (by class) / due (on Friday)
1 18 Jan

Preparation week

20 Jan

Preparation week

Environment setup

2 25 Jan

Introductions, syllabus

What (and why) is CL (and NLP)?

Linguistic communities

Models of development, FOSS

Resource identification

Corpus assembly

Long (2007) - Chilean Mapuches in language row with Microsoft

language selection

27 Jan

LAB

lab 1 - documentation of resources + Initial corpus assembly

3 1 Feb

Input methods

Lebedev (2004) - Where once was a comma

3 Feb

LAB

lab 2 - keyboard layout

4 8 Feb

Morphological typology

Grammar documentation

Janhunen & Gruzdeva (2016) - Bringing the orthography of an indigenous language to the digital age: The case of Nivkh in the Russian Far East

10 Feb

LAB

lab 3 - Grammar documentation

5 15 Feb

FSTs and morphology

Analyser evaluation

Bird (2009) - Natural Language Processing and Linguistic Fieldwork

17 Feb

LAB

lab 4 - Basic morphological analyser

6 22 Feb

FSTs and phonology

Generator evaluation

Kornai (2013) - Digital Language Death

24 Feb

LAB

lab 5 - Basic morphological generator

7 1 Mar

Morphological disambiguation

Manual disambiguation

Disambiguator evaluation

Moshagen & Trosterud (2019) - Rich Morphology, No Corpus – And We Still Made It. The Sámi Experience

3 Mar

LAB

lab 6 - Basic CG disambiguator

8 Mar

Spring break!

10 Mar

Spring break!

8 15 Mar

midterm project demos

17 Mar

TBD

9 22 Mar

Machine translation

Lexical transfer

Khanna et al. (2021) - Recent advances in Apertium, a free / open-source rule-based machine translation platform for low-resource languages (§1, §2, §5)

24 Mar

LAB

lab 7 - Lexical transfer

10 29 Mar

Lexical selection

Pedersen (2008) - Empiricism Is Not A Matter of Faith

31 Mar

LAB

lab 8 - Lexical selection

11 5 Apr

Contrastive grammars

Mahelona (2020) - Te reo Māori Speech Recognition: A Story of Community, Trust, and Sovereignty

7 Apr

LAB

lab 9 - Contrastive grammar

12 12 Apr

Structural transfer

Romero (2016) - Bill Gates speaks Kʼicheeʼ! The corporatization of linguistic revitalization in Guatemala

14 Apr

LAB

lab 10 - Structural transfer

13 19 Apr

RBMT evaluation

Bird (2020) - Decolonising Speech and Language Technology

21 Apr

LAB

lab 11 - Polished basic RBMT system

14 26 Apr

TBD

28 Apr

TBD

15 TBD

Final project presentation