Jonathan May and Nanyun (Violet) Peng
Website | https://www.isi.edu/~jonmay/cs544_fa18_web/ |
Lectures | SAL 101, Wednesdays and Fridays 8:00-9:50 am |
Instructor office hours | Jonathan May or Nanyun (Violet) Peng, RTH 512, Wednesdays and Fridays 10-11am or 11am-12pm with appointment |
Textbook | None required; Speech and Language Processing, 2nd Edition1 is optional but badly out of date. The new version (called ``jm'' in reading selection notes below) is incomplete but preferred2. All required readings from jm and elsewhere will be listed in the syllabus. |
TA Office hours | Xusen Yin, 10:30am-12:30pm Mon., SAL Lab |
Ramesh Manuvinakurike, 11:30am-1:30pm Wed., SAL Lab | |
Sarik Ghazarian, 3pm-5pm Wed., SAL Lab | |
Quizzes (3% of grade) | approximately 25 x 0.12% of grade each. Multiple choice online tests taken mid-class to break up the monotony. Open note, open internet, open discussion. |
Homeworks (48.6% of grade) | 6 x 8.1% of grade each. A mix of programming and written assignments, electronic submission due 11:59 PM local Pacific time per table below. |
Project (8.1% of grade) | Can be done in groups of up to 4. Preliminary part due mid-class, due on last class day. (this may be replaced with a seventh homework) |
Late days | Four (4) cumulative with no more than two (2) per assignment. 50% penalty first day; thereafter, no credit afterward. |
Midterm (15% or 25% of grade, whichever is more advantageous) | Friday, October 5, 8:00-9:45 am. Covers lectures, readings, and homeworks from 8/22-10/3 |
Final exam (15% or 25% of grade, whichever is more advantageous) | Wednesday, December 5, 8:00-10:00 am. Covers lectures, readings, and homeworks from 8/22-11/30 (i.e. the entire class) |
Contact us | On Piazza or in class/office hours. Please do not email (unless notified otherwise). |
A note about the midterm/final grading: whichever you do better on will count as 25% of your grade and the other one will count as 15% of your grade. Together the tests will count as 40% of your grade.
instructor | date | material | reading | HW out | HW due |
JM | 8/22 | intro, applications | jm v2 c1 3, hirschberg & manning 4 | ||
JM | 8/24 | corpora, text processing, words, regular expressions (lec 2 starting code 5) | NLTK ch.2 6, jmv3 c27, nathan schneider's notes 8, Unix for poets9, sculpting text10, | HW 1 (due 9/7) | |
JM | 8/29 | Morphology, Finite-State Automata, FSA relationship to regex, finite state transducers | jm v2 c3 11 | ||
JM | 8/31 | probability | Goldwater probability tutorial 12, | HW 2 (due 9/14) | |
JM | 9/5 | Classifiers, features, naive Bayes, perceptron, logistic regression | jm v3 c613, Eisenstein notes pp. 21-4814 | ||
JM | 9/7 | pos tagging, hmm, search | jm v3 c915, blunsom notes16, collins notes (optional, more detailed)17 | HW1 | |
NP | 9/12 | parsing and syntax 1: treebanks, evaluation, cky, grammar induction | penn treebank 18, jm v3 c11 19 jm v3 c1220, jm v3 c1321 | HW3=project proposal (due 9/26) | |
NP | 9/14 | parsing and syntax 2: pcfgs, restructuring, lexicalization, smoothing, beaming | chiang notes22 | HW2 | |
NP | 9/19 | parsing and syntax 3: dependencies, mst and shift reduce algorithms | jm v3 c1423 | HW 4 (due 10/10) | |
NP | 9/21 | ngram LMs | jm v3 c424 | ||
NP | 9/26 | smoothing, interpolation | (optional) chen and goodman25 | HW 3(pp) | |
NP | 9/28 | feed forward lm, rnn lm | jm v3 c826 | ||
JM | 10/3 | REVIEW | |||
10/5 | MIDTERM | ||||
NP | 10/10 | classical lexical semantics | jm v3 c1727 | HW 4 | |
NP | 10/12 | distributional lexical semantics | jm v3 c1528, jm v3 c1629 | HW 5 (due 10/26) | |
NP | 10/17 | RNNs, LM and semantics | jmv3 c730 c931 | ||
NP | 10/19 | Information Extraction | jmv3 c2132 | HW 6 (due 11/9) | |
NP | 10/24 | Information Extraction: CRF | Collins's Note33 Sutton and McCallum'a Note34 | project rewrite (moved to 10/31) | |
NP | 10/26 | Neural CRF | Lample et. al.35 | HW 5 | |
JM | 10/31 | MT evaluation and alignment | arcturan and centauri36, Bleu37, koehn slides38 | project rewrite | |
JM | 11/2 | Statistical MT | Brown et al. 39 (mathy bits of models 3+ can be skimmed), Knight workbook40, mert41 | ||
JM | 11/7 | Neural MT | neubig tutorial42 or Koehn Chapter (section 13.5) 43 | ||
JM | 11/9 | Summarization | jmv2 c23.3-23.7 (see blackboard), abi see blog post44 | HW 6 | |
JM | 11/14 | Task-Oriented Dialogue | jmv3 c2445, jmv3 c2546 | ||
JM | 11/16 | Chit-Chat Dialogue | ritter et al.47 | ||
11/21 | NO CLASS | ||||
11/23 | NO CLASS | ||||
JM | 11/28 | Leftovers: annotation, common sense, Entailment, generation | overview of kappa: 48, bowman snli paper: 49, csr: davis and marcus50 | ||
JM/NP | 11/30 | Review | project | ||
12/5 | FINAL |