- About
- Events
- Calendar
- Graduation Information
- Cornell Learning Machines Seminar
- Student Colloquium
- BOOM
- Fall 2024 Colloquium
- Conway-Walker Lecture Series
- Salton 2024 Lecture Series
- Seminars / Lectures
- Big Red Hacks
- Cornell University - High School Programming Contests 2024
- Game Design Initiative
- CSMore: The Rising Sophomore Summer Program in Computer Science
- Explore CS Research
- ACSU Research Night
- Cornell Junior Theorists' Workshop 2024
- People
- Courses
- Research
- Undergraduate
- M Eng
- MS
- PhD
- Admissions
- Current Students
- Computer Science Graduate Office Hours
- Advising Guide for Research Students
- Business Card Policy
- Cornell Tech
- Curricular Practical Training
- A & B Exam Scheduling Guidelines
- Fellowship Opportunities
- Field of Computer Science Ph.D. Student Handbook
- Graduate TA Handbook
- Field A Exam Summary Form
- Graduate School Forms
- Instructor / TA Application
- Ph.D. Requirements
- Ph.D. Student Financial Support
- Special Committee Selection
- Travel Funding Opportunities
- Travel Reimbursement Guide
- The Outside Minor Requirement
- Diversity and Inclusion
- Graduation Information
- CS Graduate Minor
- Outreach Opportunities
- Parental Accommodation Policy
- Special Masters
- Student Spotlights
- Contact PhD Office
Relaxations and Regret Bounds for Online Problems
Abstract: Sequential optimization problems can be represented and solved via Markov Decision Process (MDP) formulations. In addition to the so-called ``curse of dimensionality'', the optimal strategies in finite horizon settings have complicated representations. To overcome these challenges, researchers often look for approximations that are computationally tractable and produce solutions with provable optimality-gap bounds.
A natural relaxation stems from considering an offline controller, or ``prophet'', who can use future information to make decisions. We offer a relaxation framework, coupled with optimality-gap guarantees, that is based on (1) generalizing the Bellman Equations to *Bellman Inequalities * and (2) using these inequalities to obtain a relaxation and an online algorithm.
We apply this approach to resource allocation problems and show constant regret.