ABSTRACT
We describe efficient algorithms for projecting a vector onto the l1-ball, and present two methods for projection. The first performs exact projection in O(n) expected time, where n is the dimension of the space. The second works on vectors, k of whose elements are perturbed outside the l1-ball, projecting in O(k log(n)) time. This setting is especially useful for online learning in sparse feature spaces such as text categorization applications. We demonstrate the merits and effectiveness of our algorithms in numerous batch and online learning tasks. We show that variants of stochastic gradient projection methods augmented with our efficient projection procedures outperform interior point methods, which are considered state-of-the-art optimization techniques. We also show that in online settings gradient updates with l1 projections outperform the exponentiated gradient algorithm while obtaining models with high degrees of sparsity.
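To make the projection operation concrete, the following is a minimal sketch of the classical sort-based variant of l1-ball projection: sort the magnitudes, find the largest index whose soft-threshold remains positive, then shrink every coordinate toward zero by that threshold. This is the O(n log n) relative of the paper's O(n) expected-time method, which replaces the sort with randomized pivoting; the function name and list-based interface here are illustrative, not from the paper.

```python
import math

def project_l1_ball(v, z=1.0):
    """Project the vector v (a list of floats) onto the l1-ball of radius z.

    Sort-based O(n log n) sketch; the paper's exact method achieves O(n)
    expected time by replacing the full sort with randomized pivoting.
    """
    if sum(abs(x) for x in v) <= z:
        return list(v)  # already inside the ball; projection is the identity
    # Sort magnitudes in decreasing order and scan for the threshold theta.
    u = sorted((abs(x) for x in v), reverse=True)
    css, theta = 0.0, 0.0
    for j, uj in enumerate(u, start=1):
        css += uj
        if uj - (css - z) / j > 0:
            theta = (css - z) / j  # last positive index determines theta
    # Soft-threshold each coordinate, preserving its sign.
    return [math.copysign(max(abs(x) - theta, 0.0), x) for x in v]
```

For example, projecting [3.0, 1.0] onto the unit l1-ball yields [1.0, 0.0]: the threshold theta = 2 zeroes out the smaller coordinate, illustrating the sparsity-inducing effect of l1 projection that the abstract highlights.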
Index Terms
- Efficient projections onto the l1-ball for learning in high dimensions