Link Search Menu Expand Document

CS662

Advanced Natural Language Processing

Staff

Instructor

Jonathan May

Office Hours: Mondays 12:00-1:00 pm TBD and Wednesdays 9:00–9:50 am TBD, or by appointment (at ISI on other days)

Teaching Assistants

Justin Cho

jcho@isi.edu

Office Hours: 2-4pm Mondays, SAL PhD lounge

Shushan Arakelyan

shushana@usc.edu

Office Hours: 3-5pm Fri, RTH 314

Lectures

Monday and Wednesday 10:00–11:50 am, DMC 261

Textbook

Grading

PercentageAssessment Component
10%In class participation
10%Posted questions before each in-class selected paper presentation and possible quizzes
10%In-class selected paper presentation
30%Three Homeworks (10% each)
40%Project, done in small groups, comprising:
 - Proposal (5%)
 - First version of report (5%)
 - In-class presentation (10%)
 - Final report (20%).
  • Written homeworks and project components except for final project report must be submitted on the date listed in the schedule, by 23:59:59 AoE.
  • Final project report is due Monday, December 11, 2023, 10:00 AM PST
  • A deduction of 1/5 of the total possible score will be assessed for each late day. After four late days (i.e. on the fifth), you get a 0 on the assignment (and you should come talk to us because your grade will likely suffer!)
  • You have four extension days, to be applied as you wish, throughout the entire class, for homeworks and project proposal / first report (NOT final report). No deduction will be assessed if an extension day is used. As an example, if an assignment is due November 10, you have two extension days remaining, you submit the assignment on November 12, and your score is 90/100. In this case you lose the extension days but your grade is not reduced; it remains 90/100. If you have one extension day, you lose it, and your grade is 70/100. If you have no extension days, your grade is 50/100.

Contact us

On Piazza, Slack, or in class/office hours. Please do not email (unless notified otherwise).

Topics

(subject to change per instructor/class whim) (will not necessarily be presented in this order):

Linguistic Stack (graphemes/phones - words - syntax - semantics - pragmatics - discourse)

Tools:
Corpora, Corpus statistics, Data cleaning and munging
Annotation and crowdwork
Evaluation
Models/approaches: rule-based, automata/grammars, perceptron, logistic regression, neural network models
Effective written and oral communication
Components/Tasks/Subtasks:
Language Models
Syntax: POS tags, constituency tree, dependency tree, parsing
Semantics: lexical, formal, inference tasks
Information Extraction: Named Entities, Relations, Events
Generation: Machine Translation, Summarization, Dialogue, Creative Generation

Week 1

Week 2

Week 3

Week 4

Week 5

Week 6

Week 7

Oct 2
POS tags, HMMs, treebanks
E 7.1–7.4, 8.1, JM 8.1–8.5, 17.3
Katie - Affective Knowledge Enhanced Multiple-Graph Fusion Networks for Aspect-based Sentiment Analysis
Questions by: Yavuz
Sean - Reproducibility in Computational Linguistics: Is Source Code Enough?
Questions by: Tina
Oct 4
constituencies, cky
E 10.1–10.4, JM 17 (the rest)
Ian - Finding Skill Neurons in Pre-trained Transformer-based Language Models
Questions by: Deuksin
Darshan - On the Transformation of Latent Space in Fine-Tuned NLP Models
Questions by: Shauryasikt
Oct 6
Drop deadline (no refund, without W)

Week 8

Oct 9
dependencies
E 11, JM 18
Sina - Perturbation Augmentation for Fairer NLP
Questions by: Kian
Tina - Balancing out Bias: Achieving Fairness Through Balanced Training
Questions by: Ajay
Oct 11
semantics: logical/compositional, frames and roles, amr, distributional
E 12.1, 12.2, 13.1, 13.3, 14.1-3, 14.6-8, JM 19, 23, 24
Nuan - Don’t Prompt, Search! Mining-based Zero-Shot Learning with Language Models
Questions by: Daniel P
Yavuz - The Geometry of Multilingual Language Model Representations
Questions by: Weike
Oct 13
HW 2 due

Week 9

Week 10

Week 11

Week 12

Week 13

Week 14

Nov 20
prompts, multi task large language models (guest lecture by Qinyuan Ye)
InstructGPT paper (pp1–20), Large Language Models are Human-Level Prompt Engineers
Zhejian - Maieutic Prompting: Logically Consistent Reasoning with Recursive Explanations
Questions by: Patrick
Yiming - Reasoning Like Program Executors
Questions by: Daniel Y.
Nov 22
THANKSGIVING BREAK; NO CLASS

Week 15

Nov 27
Project presentations
Nov 29
Project presentations