Logistics

Term: 2025 Fall

Time: Tuesday/Thursday, 4:00–5:50 p.m.

Location: SAL 101

Complete videos for this class will be available on Brightspace for enrolled students.

Contact: Students should ask all course-related questions in the Ed forum, where you will also find announcements. You will find the course Ed on the course Brightspace page. For external enquiries, emergencies, or personal matters that you don't wish to put in a private Ed post, email csci544-25f@googlegroups.com with “CSCI544” in your subject line. Please send all emails to this mailing list — do not email the instructors directly. We will try to respond within 48–72 hours.

Jieyu Zhao

Instructor: Jieyu Zhao

Office Hour: 30 mins after the class

Teaching Assistants

  • Photo of Rebecca Dorn Rebecca Dorn — Office Hour: Thursdays 9:30–10:30 AM, Zoom Link
  • Photo of Thomas Reeves Thomas Reeves — Office Hour: Wednesdays 8:30–9:30 AM, Zoom Link
  • Photo of Sahana Ramnath Sahana Ramnath — Office Hour: Mondays 11:00 AM–12:00 noon, Zoom Link
  • Photo of Pengda Xiang Pengda Xiang — Office Hour: Tuesday 10:00–11:00 AM, Zoom Link
  • Photo of Muzi Tao Muzi Tao — Office Hour: Tuesday 3:00–4:00 PM, Zoom Link
  • Photo of Ziyi Liu Ziyi Liu — Office Hour: Friday 2:00–3:00 PM, Zoom Link
  • Photo of Xu Wang Xu Wang — Office Hour: Wednesday 11:00 AM–12:00 PM, Zoom Link

Announcements

[Oct. 25] Each team will prepare slides (via Google slides) and add the link here by 11:59 PM the day before the presentation. Failure to share slides on time will cause a loss of grade.

[Aug. 26] Check announcements on Brightspace.

Summary

This course covers both fundamental and cutting-edge topics in Natural Language Processing (NLP) with a focus on Language Models. Natural language processing (NLP) has been revolutionized by the advancement of large-scale language models, achieving state-of-the-art performance across a wide variety of tasks. This course will cover the fundamentals of language modeling and related topics in natural language processing, deep learning, and machine learning.

Students will gain familiarity with the capabilities of large language models as well as get hands-on experience with building and evaluating small-scale language models. The class will also explore the real-world consequences of deploying language models, such as the ethics and harms associated with them.

Syllabus

Calendar and prespecified syllabus are subject to change. More details, e.g., reading materials and additional resources, will be added as the semester continues. All work (except the project final report) is due on the specified date by 11:59 pm PT.

Open Spreadsheet, Syllabus in new window

WeekDateClass TopicsReadingsWork Due
1Aug. 26Introduction to LMs and Course Overview
Aug. 28n-gram ModelsJ&M, Chap 3
2Sep. 2n-gram Language Models (Smoothing) + Logistic RegressionJ&M, Chap 3
Sep. 4Logistic Regression (contd.)J&M, Chap 5
3Sep. 9Word EmbeddingsJ&M, Chap 6Group Formation Deadline
Sep. 11Word Embeddings (contd.)J&M, Chap 6 Additional: word2vec Explained
4Sep. 16Feedforward Neural NetsJ&M, Chap 7
Sep. 18BackpropagationJ&M, Chap 7HW1 Release
5Sep. 23Recurrent Neural NetworksJ&M, Chap 8Quiz 1
Sep. 25Seq2Seq and AttentionJ&M, Chap 8Project Proposal Due
6Sep. 30Transformers - Building BlocksJ&M, Chap 9
Oct. 2Transformers (contd.)J&M, Chap 9Mid-Semester Evaluation
7Oct. 7Guest Lecture - PyTorch for Transformers (TA)
Oct. 9Fall Recess
8Oct. 14Midterm Exam HW1 Due
Oct. 16Pre-training and Finetuning TransformersJ&M, Chaps 10, 11HW2 Release
9Oct. 21Tokenization and Generating from LMsJ&M, Chaps 2.5, 13
Oct. 23Language GenerationJ&M, Chaps 13
10Oct. 28Large Language Models - EvaluationJ&M, Chaps 10, 12Project Status Report Due
Oct. 30Large Language Models - In-context Learning, Scaling LawJ&M, Chaps 12
11Nov. 4LLMs - Post-Training (By TA Xu Wang)J&M, Chaps 12
Nov. 6Guest Lecture on LLM Agents (Lecturer: Qingyun Wu)
12Nov. 11Guest Lecture on LLMs Applications: LLM + Medical (Lecturer: Ruishan Liu)Quiz 2 Veterans Day Holiday
Nov. 13Guest Lecture on LLMs (Lecturer: Swabha Swayamdipta)
13Nov. 18Guest Lecture on LLMs Applications: LLM + Medical (Lecturer: Ruishan Liu)Quiz 2 HW2 Due
Nov. 20Project Presentations I
14Nov. 25Project Presentations II
Nov. 27Thanksgiving
15Dec. 2Project Presentations III
Dec. 4Project Presentations IV
16Dec. 9-
Dec. 11-
17Dec. 16Project Final Report due by 6:30pm

Assignments & Grading

Grading inquiries and questions about the grading of the homework and the quizzes can be asked (to the TAs) within two weeks from the grading date (the date the grades are released). Grades will be available within 2–2.5 weeks after submission.

All written assignments related to the final project should use the standard *ACL paper submission template.

Project Deliverables

Project proposal (5%).

Student teams should submit a ~1-page proposal (using the *CL paper submission template) for their project. The proposal should:

We highly encourage students to work towards a problem involving predictive models, hence it’s worth thinking about the five key ingredients of supervised learning: data, model, loss function, optimization algorithm and inference / evaluation.

Project progress report (10%).

Student teams should submit a ~3-page progress report (using the *CL paper submission template) for their project. This report should:

While the initial results might be inconclusive, you are expected to have made non-trivial progress by this point. The project proposal may be extended for this report. Please take into consideration the earlier feedback you received, and address those inline (you may highlight these in a different text color if you wish to draw the grader’s attention).

Project final presentation (15%).

Each team will prepare a 5 minute presentation, followed by 1-2 minutes of Q/A.

You can choose a representative for your team to do the presentation. But you need to use one slide to clearly describe in the team, who is responsible for which part.

Points will be deducted if the time limit (only for the 5-min presentation) is violated, so please practice timing your talk. We will be very strict about this.

Each project presentation should describe

All members of the team are expected to identify the central points of the research, and present that research to the class, as well as answer questions from the instructor, TAs and fellow students.

If you are in the audience, you could participate in asking questions - bonus points will be awarded to folks who ask insightful questions (and clearly announce their name before asking a question).

Each team will prepare slides (via Google slides) and add the link here by 11:59 PM the day before the presentation. Failure to share slides on time will cause a loss of grade.

Project final report (20%).

Student teams should submit a ~6-8 page final report (again using the *CL paper submission template) detailing all aspects of their project. The report should be structured like a conference paper (similar to the papers that students read and presented in class), including

Late Days

Students are allowed a maximum of 4 late days total for all assignments (but NOT the quiz sheets). You may use up to 2 late days per assignment. Using one late day for a project assignment involves each of the teammates using a late day each. Partial late days are not permitted. For every extra late day beyond the allowed late days, the student / team will lose 20% of the grade for the assignment.

Note: Please familiarize yourself with the academic policies and read the note about student well-being.

Reference Texts

The following texts are useful, but none are required. All of them can be read free online.