Article

Playing games with approximation algorithms

Authors:

Sham M. Kakade,

Adam Tauman Kalai,

Katrina Ligett

STOC '07: Proceedings of the thirty-ninth annual ACM symposium on Theory of computing

Pages 546 - 555

https://doi.org/10.1145/1250790.1250870

Published: 11 June 2007

Abstract

In an online linear optimization problem, in each period t, an online algorithm chooses s_t ∈ S from a fixed (possibly infinite) set S of feasible decisions. Nature (who may be adversarial) chooses a weight vector w_t ∈ ℝ^n, and the algorithm incurs cost c(s_t,w_t), where c is a fixed cost function that is linear in the weight vector. In the full-information setting, the vector w_t is then revealed to the algorithm; in the bandit setting, only the cost experienced, c(s_t,w_t), is revealed. The goal of the online algorithm is to perform nearly as well as the best fixed s ∈ S in hindsight. Many repeated decision-making problems with weights fit naturally into this framework, such as online shortest-path, online TSP, online clustering, and online weighted set cover.
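The interaction described above can be sketched in a few lines of Python. This is an illustration only, under assumptions not taken from the paper: decisions are identified with small vectors so that c(s, w) is a dot product, the decision set is finite, and `follow_the_leader` is a hypothetical placeholder strategy (it is not a no-regret algorithm in general and is not the authors' method). The sketch shows the bookkeeping for "cost of the algorithm" versus "cost of the best fixed s in hindsight".

```python
def cost(s, w):
    """Linear cost c(s, w) = <s, w>; each decision is identified with a vector (assumption)."""
    return sum(si * wi for si, wi in zip(s, w))

def play(decisions, weights, choose):
    """Run the full-information protocol; return (algorithm cost, best fixed cost)."""
    history = []
    total = 0.0
    for w in weights:
        s = choose(decisions, history)  # choose s_t before w_t is seen
        total += cost(s, w)             # incur c(s_t, w_t)
        history.append(w)               # full information: w_t is revealed
    # Best single decision in hindsight, evaluated on the whole sequence.
    best = min(sum(cost(s, w) for w in weights) for s in decisions)
    return total, best

def follow_the_leader(decisions, history):
    """Hypothetical placeholder: play the decision that was cheapest on past weights."""
    return min(decisions, key=lambda s: sum(cost(s, w) for w in history))

if __name__ == "__main__":
    decisions = [(1, 0), (0, 1)]
    weights = [(0.2, 0.8), (0.9, 0.1), (0.3, 0.7), (0.4, 0.6)]
    total, best = play(decisions, weights, follow_the_leader)
    print("algorithm cost:", total)
    print("best fixed cost:", best)
    print("regret:", total - best)
```

In the bandit setting, the line that appends `w` to `history` would instead record only the scalar `cost(s, w)`, which is why additional machinery is needed there.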

Previously, it was shown how to convert any efficient exact offline optimization algorithm for such a problem into an efficient online algorithm in both the full-information and the bandit settings, with average cost nearly as good as that of the best fixed s ∈ S in hindsight. However, in the case where the offline algorithm is an approximation algorithm with ratio α > 1, the previous approach only worked for special types of approximation algorithms. We show how to convert any offline approximation algorithm for a linear optimization problem into a corresponding online approximation algorithm, with a polynomial blowup in runtime. If the offline algorithm has an α-approximation guarantee, then the expected cost of the online algorithm on any sequence is not much larger than α times that of the best s ∈ S, where the best is chosen with the benefit of hindsight. Our main innovation is combining Zinkevich's algorithm for convex optimization with a geometric transformation that can be applied to any approximation algorithm. Standard techniques generalize the above result to the bandit setting, except that a "Barycentric Spanner" for the problem is also (provably) necessary as input. Our algorithm can also be viewed as a method for playing large repeated games, where one can only compute approximate best-responses, rather than best-responses.
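For orientation, the following Python sketch shows projected online gradient descent, the kind of update used in Zinkevich's algorithm, on linear costs. It is a minimal sketch under stated assumptions and does not reproduce the paper's construction: the feasible set is taken to be a Euclidean ball so that exact projection is trivial, the step size 1/sqrt(t) is the schedule from Zinkevich's analysis, and the function names are hypothetical. The geometric transformation that lets an α-approximation algorithm stand in for exact projection is the paper's contribution and is not shown here.

```python
import math

def project_to_ball(x, radius=1.0):
    """Euclidean projection onto the ball of the given radius (assumed feasible set)."""
    norm = math.sqrt(sum(xi * xi for xi in x))
    if norm <= radius:
        return list(x)
    return [radius * xi / norm for xi in x]

def online_gradient_descent(weights, dim, radius=1.0):
    """Play x_t, pay <x_t, w_t>, then take a projected gradient step; return total cost."""
    x = [0.0] * dim
    total = 0.0
    for t, w in enumerate(weights, start=1):
        total += sum(xi * wi for xi, wi in zip(x, w))  # linear cost of the current point
        eta = 1.0 / math.sqrt(t)                       # decreasing step size
        # The gradient of the linear cost <x, w> with respect to x is w itself.
        x = project_to_ball([xi - eta * wi for xi, wi in zip(x, w)], radius)
    return total

def best_fixed_cost(weights, radius=1.0):
    """Cost of the best fixed point in the ball: min_x <x, sum_t w_t> = -radius * ||sum_t w_t||."""
    dim = len(weights[0])
    sums = [sum(w[i] for w in weights) for i in range(dim)]
    return -radius * math.sqrt(sum(v * v for v in sums))

if __name__ == "__main__":
    weights = [(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0), (0.0, -1.0)] * 25
    total = online_gradient_descent(weights, dim=2)
    print("regret:", total - best_fixed_cost(weights))
```

With exact projection, the regret of this update grows on the order of sqrt(T), so the average regret vanishes as T grows. When only an α-approximate oracle is available, the natural target is instead the algorithm's cost minus α times the best fixed cost, which matches the guarantee stated in the abstract.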

References

[1] B. Awerbuch and R. Kleinberg. Adaptive routing with end-to-end feedback: Distributed learning and geometric approaches. In Proceedings of the 36th ACM Symposium on Theory of Computing (STOC), 2004.

[2] M.-F. Balcan and A. Blum. Approximation algorithms and online mechanisms for item pricing. In Proceedings of the 7th ACM Conference on Electronic Commerce (EC), 2006.

[3] R. Carr and S. Vempala. Randomized metarounding. Random Struct. Algorithms, 20(3):343-352, 2002.

[4] D. Chakrabarty, A. Mehta, and V. Vazirani. Design is as easy as optimization. In 33rd International Colloquium on Automata, Languages and Programming (ICALP), 2006.

[5] V. Dani and T.P. Hayes. Robbing the bandit: Less regret in online geometric optimization against an adaptive adversary. In Proceedings of the 17th ACM-SIAM Symposium on Discrete Algorithms (SODA), 2006.

[6] M.X. Goemans and D.P. Williamson. Improved approximation algorithms for maximum cut and satisfiability problems using semidefinite programming. J. ACM, 42(6):1115-1145, 1995.

[7] J. Hannan. Approximation to Bayes risk in repeated play. In M. Dresher, A. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, volume III, pages 97-139. Princeton University Press, 1957.

[8] A. Kalai and S. Vempala. Efficient algorithms for online decision problems. J. Comput. Syst. Sci., 71(3):291-307, 2005.

[9] H. McMahan and A. Blum. Online geometric optimization in the bandit setting against an adaptive adversary. In Proceedings of the 17th Annual Conference on Learning Theory (COLT), 2004.

[10] H. Robbins. Some aspects of the sequential design of experiments. In Bulletin of the American Mathematical Society, volume 55, 1952.

[11] M. Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th International Conference on Machine Learning (ICML), 2003.

Index Terms

Playing games with approximation algorithms
1. Theory of computation
  1. Design and analysis of algorithms

Published In

STOC '07: Proceedings of the thirty-ninth annual ACM symposium on Theory of computing

June 2007

734 pages

ISBN:9781595936318

DOI:10.1145/1250790

General Chair: David Johnson, AT&T Labs - Research

Program Chair: Uriel Feige, Microsoft Research and Weizmann Institute

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

STOC07: Symposium on Theory of Computing

June 11 - 13, 2007

San Diego, California, USA

Acceptance Rates

Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Bibliometrics & Citations

Article Metrics

Total Citations: 23
Total Downloads: 529
Downloads (Last 12 months): 31
Downloads (Last 6 weeks): 4

Reflects downloads up to 14 Feb 2025

Cited By

Golrezaei N, Niazadeh R, Patel K, Susan F, Larson K (2024). Online combinatorial optimization with group fairness constraints. Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (394-402). Online publication date: 3-Aug-2024.
https://dl.acm.org/doi/10.24963/ijcai.2024/44

Golrezaei N, Niazadeh R, Patel K, Susan F (2024). Online Combinatorial Optimization with Group Fairness Constraints. SSRN Electronic Journal. Online publication date: 2024.
https://doi.org/10.2139/ssrn.4824251

Bhatt A, Haghtalab N, Shetty A, Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (2023). Smoothed analysis of sequential probability assignment. Proceedings of the 37th International Conference on Neural Information Processing Systems (79808-79831). Online publication date: 10-Dec-2023.
https://dl.acm.org/doi/10.5555/3666122.3669617

Zuo P, Wang Y, Tang S, Evans R, Shpitser I (2023). Regularized online DR-submodular optimization. Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence (2608-2617). Online publication date: 31-Jul-2023.
https://dl.acm.org/doi/10.5555/3625834.3626077

Swamy G, Wu D, Choudhury S, Bagnell J, Wu Z, Krause A, Brunskill E, Cho K, Engelhardt B, Sabato S, Scarlett J (2023). Inverse reinforcement learning without reinforcement learning. Proceedings of the 40th International Conference on Machine Learning (33299-33318). Online publication date: 23-Jul-2023.
https://dl.acm.org/doi/10.5555/3618408.3619793

Agarwal A, Niazadeh R, Patil P (2023). Misalignment, Learning, and Ranking: Harnessing Users Limited Attention. SSRN Electronic Journal. Online publication date: 2023.
https://doi.org/10.2139/ssrn.4365381

Haghtalab N, Han Y, Shetty A, Yang K, Koyejo S, Mohamed S, Agarwal A, Belgrave D, Cho K, Oh A (2022). Oracle-efficient online learning for smoothed adversaries. Proceedings of the 36th International Conference on Neural Information Processing Systems (4072-4084). Online publication date: 28-Nov-2022.
https://dl.acm.org/doi/10.5555/3600270.3600564

Ren J (2022). Regret Minimization of Extensive Games and Its Application on Game Strategies. Highlights in Science, Engineering and Technology 12 (204-212). Online publication date: 26-Aug-2022.
https://doi.org/10.54097/hset.v12i.1455

Tran-Thanh L, Xia Y, Qin T, Jennings N (2015). Efficient algorithms with performance guarantees for the stochastic multiple-choice Knapsack problem. Proceedings of the 24th International Conference on Artificial Intelligence (403-409). Online publication date: 25-Jul-2015.
https://dl.acm.org/doi/10.5555/2832249.2832305

Garber D, Hazan E (2013). Playing Non-linear Games with Linear Oracles. Proceedings of the 2013 IEEE 54th Annual Symposium on Foundations of Computer Science (420-428). Online publication date: 26-Oct-2013.
https://dl.acm.org/doi/10.1109/FOCS.2013.52
