601.467/667 Introduction to Human Language Technology

Fall 2024

Coordinator: Philipp Koehn (phi@jhu.edu)
TA: Elina Baral (ebaral1@jhu.edu)
CAs: TBA
Class: Tuesday and Thursday 9:00-10:15am, Hodson 210
Office hours: Coordinator: on request
Office hours: TA: Elina Baral, Thursday 1:00-2:00, Malone 216
CA: Muhan Gao, Monday 1:00-2:00, Malone 216
Shabeg Singh Gill, Tuesday 5:00-6:00, Malone 216
Hirtika Mirghani, Wednesday 11:30-1:00, Malone 216
Huiqi Zou, Friday 1:00-2:00, Malone 216
Gradescope (entry code: 8KX4DX) ☆ Piazza (access code qlnu01yhhel) ☆ Old lecture recordings

Assignments

Note: For Fall 2024, please *do not* proceed before assignments being announced. They are subject to change.
You can confirm whether the homework is released by checking the date listed on each homework.

Late submissions: For each student, we allow a total of 10 days of late submission for all homeworks.
It is counted on a daily basis, for example, if you submit a homework even a few minutes late, you will lose 1 day of your quota.
After you use up all 10 days of late submission, each late day would cost 5% points penalty.
Late submission for teamwork would use 1-day for each teammate.
For each homework, you *are not allowed to submit* after 14 days.

N-gram language modeling, CYK parsing: Due on September 11 (Wednesday)
RNNLMs, word2vec: Due on September 25 (Wednesday)
Seq2seq for pronunciation prediction: Due on October 30 (Wednesday)
Speech recognition with CTC: Due on November 27 (Wednesday)

Exam

There will be two mid-terms and final exam. You are allowed to bring 1 sheet of paper with notes to the exam.

The final exam time takes place Tuesday, December 17, 6-9pm.

First Midterm (Text): 2023 (solutions), 2022, 2021 (solutions), 2020, 2019 example questions.
Second Midterm (Speech): 2023 (solutions), 2022, 2021, 2020 (solutions), 2019.
Final (Applications): 2023 (solutions), 2022, 2021, 2020 (solutions).

Lectures

Date	Topic	Instructor
Tu Aug 27	Introduction	Koehn
Text
Th Aug 29	Words and Language Models	Yarowsky
Tu Sep 3	Morphology	Yarowsky
Th Sep 5	Syntax	Post
Tu Sep 10	Deep learning I	Murray
Th Sep 12	Deep learning II (Python notebook)	Murray
Tu Sep 17	Semantics	Lippincott
Th Sep 19	Distributional Semantics and Large Language Models	Koehn
Tu Sep 24	Machine Translation	Duh
Th Sep 26	Information Extraction	Koehn
Tu Oct 1	Information Retrieval	Yang
Th Oct 3	First Midterm Exam	-
Speech
Tu Oct 8	Speech basics	Moro-Velazquez
Th Oct 10	Classic speech recognition ¹ (additional slides, video 0:12-1:25)	Khudanpur
Tu Oct 15	Speaker recognition	Villalba
Tu Oct 22	End-to-end neural speech recognition	Wiesner
Th Oct 24	Auditory system	Elhilali
Tu Oct 29	Enhancement and Diarization	Maciejewski
Th Oct 31	Hands on: Kaldi (K2, ESPnet)	Wiesner and Maciejewski
Tu Nov 5	Hands on: Kaldi (Transducer-based ASR, CTC ASR from pretrained models)
Th Nov 7	Second Midterm Exam
Applications
Tu Nov 12	Question Answering	Duh
Th Nov 14	NLP for Digital Humanities	Lippincott
Tu Nov 19	(no class)
Th Nov 21	Ethical Problems	Moro-Velazquez
Tu Dec 3	Human-Centered Evaluation of Language Technologies	Ziang Xiao
Th Dec 5	Computational Social Science	Field

¹These slides present an incomplete picture of what will be discussed in class. Attentive listening is recommended for gaining maximal benefit.