Course Overview


Instructor: Kilian Weinberger

Contact: kqw4@cornell.edu

Faculty Office Hours: Gates 410, Mondays 9-10am

Course Staff:

Credits: 4.0 credits (3.0 for lecture, 1.0 for independent research project)

Prerequisites: CS 4780 or CS 5780 or equivalents or permission of instructor

Time and Location: Tuesdays and Thursdays, 14:55-16:10, for a total of 27 lectures

Course overview: This course covers advanced topics in machine learning, focusing on recent developments in large language models, multimodal models, and their applications. Students will engage with cutting-edge research through presentations, discussions, and a final project.

Course Structure


Weekly Schedule
Class Session Structure (Thursdays)
  1. Quiz on the paper and previous class (10 minutes)
  2. 30-minute presentation by the team
  3. 30-minute group discussion with role presentations
  4. Optional post-class quiz (5 minutes)
Social Reading Groups

Every main presenter will lead a Perusall group for pre-class paper annotation. Groups consist of one Presenter, one Archaeologist, one Academic Researcher, one Industry Expert, one Ethicist, and six other students without specific roles (11 people total).

Course Topics


This course covers the following main topics in advanced machine learning:

  1. LLM Foundations: Tokenization, Positional embeddings, Attention, Causal Language Modeling, In-Context Learning
  2. LLM Extension: Quantization, Adapters, Mixture of experts, Context window, Parallelism, Distillation, Attention free architectures
  3. To be announced

Schedule


The course follows this tentative schedule. Please note that topics and dates may be subject to change.

Date Style Topic
Tuesday, August 27, 2024 Lecture Introduction
Thursday, August 29, 2024 Lecture Introduction
Tuesday, September 3, 2024 Overview LLM Foundations
Thursday, September 5, 2024 Paper LLM Foundations
Tuesday, September 10, 2024 Overview LLM Extension
Thursday, September 12, 2024 Paper LLM Extension

Roles


Students will rotate through various roles throughout the semester:

Each student assigned to a role should prepare 2-3 slides and present for approximately 5-6 minutes, except for the Hacker, who provides a Jupyter Notebook instead of slides.

Weekly Assignment: By 10 PM the day before each Thursday class, all students must submit a review on CMT here.

Role Sign-ups

Students are required to sign up for their roles using this Google Sheets link. Please ensure you sign up for your preferred roles as soon as possible.

Presenter Responsibilities

In addition to creating and delivering the main presentation, students assigned as presenters will be responsible for creating quizzes for the rest of the class. These quizzes will be used to assess the class's understanding of the presented material.

Acknowledgment

The role-playing aspect of this course has evolved from an initial design by Alec and Eitan Grinspun at Columbia University.

Grading


Note: The role-playing aspect of the course is a significant part of the in-class participation and presentations grade. Students are expected to fully engage with their assigned roles, prepare thoroughly for their presentations, and contribute meaningfully to class discussions.

Final Project


The final project aims to engage students in research related to machine learning fields covered in class.

Project Guidelines:

Students are encouraged to demonstrate limitations of related work and suggest improvements by applying methods to public datasets beyond those used in the original papers.

More details about project proposals and suggested topics will be provided later in the course.

Course Policies


Attendance and Participation

Regular attendance and active participation in class discussions and group activities are essential for success in this course.

Academic Integrity

All students are expected to adhere to the Cornell University Code of Academic Integrity. Any work submitted by a student in this course for academic credit will be the student's own work.

Late Work Policy

Given the collaborative nature of many assignments, late work will generally not be accepted without prior approval from the instructor.

Accommodations for Students with Disabilities

If you have a disability and are registered with the Student Disability Services office, please make an appointment with the instructor as soon as possible to discuss any accommodations needed for the course. All such appointments will be kept confidential.

Diversity and Inclusion

We are committed to creating an inclusive learning environment that values diversity and fosters respect for all individuals. We encourage open and respectful dialogue and expect all participants to contribute to a positive and supportive learning atmosphere.