Link Search Menu Expand Document

CS662

Advanced Natural Language Processing

Staff

Instructor

Jonathan May

Office Hours: Mondays 12:00-1:00 pm TBD and Wednesdays 9:00–9:50 am TBD, or by appointment (at ISI on other days)

Teaching Assistant

Alexander Spangher

spangher@usc.edu

Office Hours: 12-2pm Monday, RTH 314

Lectures

  • Monday and Wednesday 10:00–11:50 am, DMC 261
  • See schedule for select days where class is canceled

Textbook

Grading

PercentageAssessment Component
10%In class participation
10%Posted questions before each in-class selected paper presentation and possible quizzes
10%In-class selected paper presentation
30%Three Homeworks (10% each)
40%Project, done in small groups, comprising:
 - Proposal (5%)
 - First version of report (5%)
 - In-class presentation (10%)
 - Final report (20%).
  • Written homeworks and project components except for final project report must be submitted on the date listed in the schedule, by 23:59:59 AoE.
  • Final project report is due Monday, December 15, 2025, 10:00 AM PST
  • A deduction of 1/5 of the total possible score will be assessed for each late day. After four late days (i.e. on the fifth), you get a 0 on the assignment (and you should come talk to us because your grade will likely suffer!)
  • You have four extension days, to be applied as you wish, throughout the entire class, for homeworks and project proposal / first report (NOT final report). No deduction will be assessed if an extension day is used. As an example, if an assignment is due November 10, you have two extension days remaining, you submit the assignment on November 12, and your score is 90/100. In this case you lose the extension days but your grade is not reduced; it remains 90/100. If you have one extension day, you lose it, and your grade is 70/100. If you have no extension days, your grade is 50/100.

Contact us

On Slack, or in class/office hours. Please do not email (unless notified otherwise).

Topics

(subject to change per instructor/class whim) (will not necessarily be presented in this order):
Fundamentals
Linguistic Stack (graphemes/phones - words - syntax - semantics - pragmatics - discourse
Corpora, Corpus statistics, Data cleaning, munging, and annotation
Evaluation
Linear and Nonlinear Models
Dense Representations and neural architectures (feed-forward, RNN, Transformer)
Language Models
Pre-training, Fine-tuning, Prompting, Reward Alignment
Ethics
Effective written and oral communication
Applications
Multilingualism and Translation
Syntax
Information Retrieval/Question Answering
Dialogue
Information Extraction
Multimodality
Speech Recognition and Generation
Agent Interaction
Discourse

Week 1

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Week 8

Oct 13
Machine Translation (MT) slides1 slides2
JM13
Oct 15
Multilingual
Oct 17
HW 2 due

Week 9

Week 10

Oct 27

Information Extraction

HW3 out (due 11/21)
JM17.3, 20
Oct 29
Agents (Guest Lecture by Tenghao Huang)
WebArena, ToolLLM, Narrative Discourse, ReAct

Week 11

Nov 3
Multimodal NLP (Guest Lecture by Xuezhe Ma)
Nov 5
Spoken Language Processing (SLP) (Guest Lecture by Sudarsana Reddy Kadiri)
JM 16
Nov 7
Project Report Version 1 due

Week 12

Nov 10
TBD
Nov 12
TBD
-
Nov 14
Late Drop (W, No refund)

Week 13

Nov 17
TBD Syntax
Nov 19
Discourse Slides
Nov 21
HW 3 due

Week 14

Nov 24
TBD
Nov 26
THANKSGIVING BREAK; NO CLASS

Week 15

Dec 1
Project Presentations
(10:00) TBD

Questions by: TBD

(10:22) TBD

Questions by: TBD

(10:44) TBD

Questions by: TBD

(11:06) TBD

Questions by: TBD

(11:28) TBD

Questions by: TBD

Dec 4
Project presentations
(10:00) TBD

Questions by: TBD

(10:22) TBD

Questions by: TBD

(10:44) TBD

Questions by: TBD

(11:06) TBD

Questions by: TBD

(11:28) TBD

Questions by: TBD