skip to main content
10.1145/2897518.2897647acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
research-article
Public Access

Geometric median in nearly linear time

Published: 19 June 2016 Publication History

Abstract

In this paper we provide faster algorithms for solving the geometric median problem: given n points in d compute a point that minimizes the sum of Euclidean distances to the points. This is one of the oldest non-trivial problems in computational geometry yet despite a long history of research the previous fastest running times for computing a (1+є)-approximate geometric median were O(d· n4/3є−8/3) by Chin et. al, Õ(dexpє−4logє−1) by Badoiu et. al, O(nd+poly(d−1)) by Feldman and Langberg, and the polynomial running time of O((nd)O(1)log1/є) by Parrilo and Sturmfels and Xue and Ye.
In this paper we show how to compute such an approximate geometric median in time O(ndlog3n/є) and O(dє−2). While our O(dє−2) is a fairly straightforward application of stochastic subgradient descent, our O(ndlog3n/є) time algorithm is a novel long step interior point method. We start with a simple O((nd)O(1)log1/є) time interior point method and show how to improve it, ultimately building an algorithm that is quite non-standard from the perspective of interior point literature. Our result is one of few cases of outperforming standard interior point theory. Furthermore, it is the only case we know of where interior point methods yield a nearly linear time algorithm for a canonical optimization problem that traditionally requires superlinear time.

References

[1]
M. Badoiu, S. Har-Peled, and P. Indyk. Approximate clustering via core-sets. In Proceedings on 34th Annual ACM Symposium on Theory of Computing, May 19-21, 2002, Montréal, Québec, Canada, pages 250–257, 2002.
[2]
C. Bajaj. The algebraic degree of geometric optimization problems. Discrete & Computational Geometry, 3(2):177–191, 1988.
[3]
E. Balas and C.-S. Yu. A note on the weiszfeld-kuhn algorithm for the general fermat problem. Managme Sci Res Report, (484):1–6, 1982.
[4]
P. Bose, A. Maheshwari, and P. Morin. Fast approximations for sums of distances, clustering and the Fermat-Weber problem. Computational Geometry, 24(3):135 – 146, 2003.
[5]
S. Bubeck. Theory of convex optimization for machine learning. arXiv preprint arXiv:1405.4980, 2014.
[6]
R. Chandrasekaran and A. Tamir. Open questions concerning weiszfeld’s algorithm for the fermat-weber location problem. Mathematical Programming, 44(1-3):293–295, 1989.
[7]
H. H. Chin, A. Madry, G. L. Miller, and R. Peng. Runtime guarantees for regression problems. In ITCS, pages 269–282, 2013.
[8]
L. Cooper and I. Katz. The weber problem revisited. Computers and Mathematics with Applications, 7(3):225 – 234, 1981.
[9]
Z. Drezner, K. Klamroth, A. Sch ˜ A˝ ubel, and G. Wesolowsky. Facility location, chapter The Weber problem, pages 1–36. Springer, 2002.
[10]
D. Feldman and M. Langberg. A unified framework for approximating and clustering data. In Proceedings of the forty-third annual ACM symposium on Theory of computing, pages 569–578. ACM, 2011.
[11]
C. C. Gonzaga. Path-following methods for linear programming. SIAM review, 34(2):167–224, 1992.
[12]
S. Har-Peled and A. Kushal. Smaller coresets for k-median and k-means clustering. In Proceedings of the twenty-first annual symposium on Computational geometry, pages 126–134. ACM, 2005.
[13]
P. Indyk and S. U. C. S. Dept. High-dimensional computational geometry. Stanford University, 2000.
[14]
J. Krarup and S. Vajda. On torricelli’s geometrical solution to a problem of fermat. IMA Journal of Management Mathematics, 8(3):215–224, 1997.
[15]
R. A. Kronmal and A. V. Peterson. The alias and alias-rejection-mixture methods for generating random variables from probability distributions. In Proceedings of the 11th Conference on Winter Simulation - Volume 1, WSC ’79, pages 269–280, Piscataway, NJ, USA, 1979. IEEE Press.
[16]
H. Kuhn. A note on fermat’s problem. Mathematical Programming, 4(1):98–107, 1973.
[17]
Y. T. Lee and A. Sidford. Path-finding methods for linear programming : Solving linear programs in ˜ o(sqrt(rank)) iterations and faster algorithms for maximum flow. In 55th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2014, 18-21 October, 2014, Philadelphia, PA, USA, pages 424–433, 2014.
[18]
H. P. Lopuhaa and P. J. Rousseeuw. Breakdown points of affine equivariant estimators of multivariate location and covariance matrices. Ann. Statist., 19(1):229–248, 03 1991.
[19]
H. P. Lopuhaa and P. J. Rousseeuw. Breakdown points of affine equivariant estimators of multivariate location and covariance matrices. The Annals of Statistics, pages 229–248, 1991.
[20]
A. Madry. Navigating central path with electrical flows: from flows to matchings, and back. In Proceedings of the 54th Annual Symposium on Foundations of Computer Science, 2013.
[21]
Y. Nesterov. Introductory Lectures on Convex Optimization: A Basic Course, volume I. 2003.
[22]
Y. Nesterov and A. S. Nemirovskii. Interior-point polynomial algorithms in convex programming, volume 13. Society for Industrial and Applied Mathematics, 1994.
[23]
L. M. Ostresh. On the convergence of a class of iterative methods for solving the weber location problem. Operations Research, 26(4):597–609, 1978.
[24]
P. A. Parrilo and B. Sturmfels. Minimizing polynomial functions. In DIMACS Workshop on Algorithmic and Quantitative Aspects of Real Algebraic Geometry in Mathematics and Computer Science, March 12-16, 2001, DIMACS Center, Rutgers University, Piscataway, NJ, USA, pages 83–100, 2001.
[25]
F. Plastria and M. Elosmani. On the convergence of the weiszfeld algorithm forăcontinuous single facility location allocation problems. TOP, 16(2):388–406, 2008.
[26]
J. Renegar. A polynomial-time algorithm, based on newton’s method, for linear programming. Mathematical Programming, 40(1-3):59–93, 1988.
[27]
Y. Vardi and C.-H. Zhang. The multivariate l1-median and associated data depth. Proceedings of the National Academy of Sciences, 97(4):1423–1426, 2000.
[28]
V. Viviani. De maximis et minimis geometrica divinatio liber 2. De Maximis et Minimis Geometrica Divinatio, 1659.
[29]
A. Weber. The Theory of the Location of Industries. Chicago University Press, 1909. Aber den I der Industrien.
[30]
E. Weiszfeld. Sur le point pour lequel la somme des distances de n points donnes est minimum. Tohoku Mathematical Journal, pages 355–386, 1937.
[31]
G. Xue and Y. Ye. An efficient algorithm for minimizing a sum of euclidean norms with applications. SIAM Journal on Optimization, 7:1017–1036, 1997.
[32]
Y. Ye. Interior point algorithms: theory and analysis, volume 44. John Wiley & Sons, 2011.

Cited By

View all
  • (2025)Low Complexity Byzantine-Resilient Federated LearningIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.348272720(2051-2066)Online publication date: 2025
  • (2025)Reviewing extensions and solution methods of the planar Weber single facility location problemComputers & Operations Research10.1016/j.cor.2024.106825173(106825)Online publication date: Jan-2025
  • (2025)GuardedLearn: Safeguarding Federated Learning with Robust Defenses and Privacy Preserving MechanismsIntelligent Systems, Blockchain, and Communication Technologies10.1007/978-3-031-82377-0_36(439-453)Online publication date: 5-Mar-2025
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
STOC '16: Proceedings of the forty-eighth annual ACM symposium on Theory of Computing
June 2016
1141 pages
ISBN:9781450341325
DOI:10.1145/2897518
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 June 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. geometric median
  2. interior point methods
  3. stochastic gradient descent

Qualifiers

  • Research-article

Funding Sources

Conference

STOC '16
Sponsor:
STOC '16: Symposium on Theory of Computing
June 19 - 21, 2016
MA, Cambridge, USA

Acceptance Rates

Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Upcoming Conference

STOC '25
57th Annual ACM Symposium on Theory of Computing (STOC 2025)
June 23 - 27, 2025
Prague , Czech Republic

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)349
  • Downloads (Last 6 weeks)41
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Low Complexity Byzantine-Resilient Federated LearningIEEE Transactions on Information Forensics and Security10.1109/TIFS.2024.348272720(2051-2066)Online publication date: 2025
  • (2025)Reviewing extensions and solution methods of the planar Weber single facility location problemComputers & Operations Research10.1016/j.cor.2024.106825173(106825)Online publication date: Jan-2025
  • (2025)GuardedLearn: Safeguarding Federated Learning with Robust Defenses and Privacy Preserving MechanismsIntelligent Systems, Blockchain, and Communication Technologies10.1007/978-3-031-82377-0_36(439-453)Online publication date: 5-Mar-2025
  • (2024)Byzantine resilient and fast federated few-shot learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3693928(45696-45706)Online publication date: 21-Jul-2024
  • (2024)Optimal coresets for low-dimensional geometric medianProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692081(262-270)Online publication date: 21-Jul-2024
  • (2024)Settling Time vs. Accuracy Tradeoffs for Clustering Big DataProceedings of the ACM on Management of Data10.1145/36549762:3(1-25)Online publication date: 30-May-2024
  • (2024)Byzantine Machine Learning: A PrimerACM Computing Surveys10.1145/361653756:7(1-39)Online publication date: 9-Apr-2024
  • (2024)Non‐Euclidean Sliced Optimal Transport SamplingComputer Graphics Forum10.1111/cgf.1502043:2Online publication date: 30-Apr-2024
  • (2024)Resilient Federated Learning Using Trimmed-Clipping Aggregation2024 IEEE 6th International Conference on Trust, Privacy and Security in Intelligent Systems, and Applications (TPS-ISA)10.1109/TPS-ISA62245.2024.00031(192-201)Online publication date: 28-Oct-2024
  • (2024)Equalized Aggregation for Heterogeneous Federated Mobile Edge LearningIEEE Transactions on Mobile Computing10.1109/TMC.2023.3276900(1-18)Online publication date: 2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media