- About
- Events
- Calendar
- Graduation Information
- Cornell Learning Machines Seminar
- Student Colloquium
- BOOM
- Spring 2025 Colloquium
- Conway-Walker Lecture Series
- Salton 2024 Lecture Series
- Seminars / Lectures
- Big Red Hacks
- Cornell University / Cornell Tech - High School Programming Workshop and Contest 2025
- Game Design Initiative
- CSMore: The Rising Sophomore Summer Program in Computer Science
- Explore CS Research
- ACSU Research Night
- Cornell Junior Theorists' Workshop 2024
- People
- Courses
- Research
- Undergraduate
- M Eng
- MS
- PhD
- Admissions
- Current Students
- Computer Science Graduate Office Hours
- Advising Guide for Research Students
- Business Card Policy
- Cornell Tech
- Curricular Practical Training
- A & B Exam Scheduling Guidelines
- Fellowship Opportunities
- Field of Computer Science Ph.D. Student Handbook
- Graduate TA Handbook
- Field A Exam Summary Form
- Graduate School Forms
- Instructor / TA Application
- Ph.D. Requirements
- Ph.D. Student Financial Support
- Special Committee Selection
- Travel Funding Opportunities
- Travel Reimbursement Guide
- The Outside Minor Requirement
- Robotics Ph. D. prgram
- Diversity and Inclusion
- Graduation Information
- CS Graduate Minor
- Outreach Opportunities
- Parental Accommodation Policy
- Special Masters
- Student Spotlights
- Contact PhD Office
Online Non-Parametric Regression for Sales Forecast amid a Pandemic (via Zoom)
Abstract: Motivated by our collaboration with Anheuser-Busch InBev (AB InBev), a consumer packaged goods (CPG) company, we consider the problem of forecasting sales under the coronavirus disease 2019 (COVID-19) pandemic. Our approach combines non-parametric regression, game theory, and pandemic modeling to develop a data-driven competitive online non parametric regression method. Specifically, the method takes the future COVID-19 cases estimates, which can be simulated via the SIR (i.e., Susceptible-Infectious-Removed) epidemic model, as an input, and outputs the level of calibration for the baseline sales forecast generated by AB InBev's machine learning algorithm. In generating the calibration level, we focus on an online learning setting, where our algorithm sequentially predicts the label (i.e., the level of calibration) of a random covariate (i.e., the current number of active cases) given past observations and the generative process (i.e., the SIR epidemic model) of future covariates. To provide robust performance guarantee, we derive our algorithm by minimizing regret, which is the difference between the squared L2-norm associated with labels generated by the algorithm and labels generated by an adversary and the squared L2-norm associated with labels generated by the best isotonic (non-decreasing) function in hindsight and the adversarial labels. We develop a computationally-efficient algorithm that attains the minimax-optimal regret over all possible choices of the labels (possibly non-i.i.d. and even adversarial). We demonstrate the performances of our algorithm on both synthetic and AB InBev’s datasets (from March 2020 to March 2021) of three different markets (each corresponds to a country). The AB InBev’s numerical experiments show that our method is capable of reducing the forecasting error in terms of WMAPE (i.e., weighted mean absolute percentage error) and MSE (i.e., mean squared error) by more than 37% for the company.
The paper is available at https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3670264
Bio: Ruihao Zhu is currently an Assistant Professor at the Cornell University SC Johnson College of Business. Previously, he received his Interdisciplinary Ph.D. in Statistics from the Massachusetts Institute of Technology and his B.Eng. degree in Electrical Engineering and Computer Science from both the Shanghai Jiao Tong University and the University of Michigan.