skip to main content
10.1145/1723112.1723129acmconferencesArticle/Chapter ViewAbstractPublication PagesfpgaConference Proceedingsconference-collections
research-article

FPMR: MapReduce framework on FPGA

Published: 21 February 2010 Publication History

Abstract

Machine learning and data mining are gaining increasing attentions of the computing society. FPGA provides a highly parallel, low power, and flexible hardware platform for this domain, while the difficulty of programming FPGA greatly limits its prevalence. MapReduce is a parallel programming framework that could easily utilize inherent parallelism in algorithms. In this paper, we describe FPMR, a MapReduce framework on FPGA, which provides programming abstraction, hardware architecture, and basic building blocks to developers.
An on-chip processor scheduler is implemented to maximize the utilization of computation resources and achieve better load balancing. An efficient data access scheme is carefully designed to maximize data reuse and throughput. Meanwhile, the FPMR framework hides the task control, synchronization, and communication away from designers so that more attention can be paid to the application itself. A case study of RankBoost acceleration based on FPMR demonstrates that FPMR efficiently helps with the development productivity; and the speedup is 31.8x versus CPU-based implementation. This performance is comparable to a fully manually designed version, which achieves 33.5x speedup. Two other applications: SVM, PageRank are also discussed to show the generalization of the framework.

References

[1]
Martin C. Herbordt, Tom VanCourt, Yongfeng Gu, Bharat Sukhwani, Al Conti, Josh Model, Doug DiSabello, Achieving High Performance with FPGA-Based Computing, Computer, Volume 40, Issue 3 (March 2007), Pages 50--57
[2]
FPGAs and Moore's Law, http://www.ciol.com/Semicon/Design-Trends/News-Reports/FPGAs-and-Moores-Law/111108112450/0/
[3]
AutoPilot, AutoESL, www.autoesl.com
[4]
CatapultC, Mentor Graphics, www.mentor.com
[5]
Impulse C, IA Technologies, www.impulseaccelerated.com
[6]
A.DeHon et al., "Design Patterns for Reconfigurable Computing," Proc. 12th Ann. IEEE Symp. Field-Programmable Custom Computing Machines, IEEE CS Press, 2004, pp. 13--23.
[7]
J. Dean and S. Ghemawat. MapReduce: Simplified data processing on large clusters. In Sixth Symposium on Operating System Design and Implementation (OSDI), San Francisco, CA, 2004.
[8]
Apache Hadoop, http://hadoop.apache.org/
[9]
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Y. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In NIPS'07: Proceedings of Twenty-First Annual Conference on Neural Information Processing Systems. Neural Information Processing Systems Foundation, 2007.
[10]
Adam Pisoni. Skynet, Apr. 2008. http://skynet.rubyforge.org.
[11]
Colby Ranger, Ramanan Raghuraman, Arun Penmetsa, Gary Bradski, and Christos Kozyrakis. Evaluating MapReduce for Multi-core and Multiprocessor Systems. Proceedings of the 13th Intl. Symposium on High-Performance Computer Architecture (HPCA), Phoenix, AZ, February 2007.
[12]
Richard M. Yoo, Anthony Romano, and Christos Kozyrakis. Phoenix Rebirth: Scalable MapReduce on a Large-Scale Shared-Memory System. Proceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC), Austin, TX, October 2009.
[13]
Bingsheng He, Wenbin Fang, Qiong Luo, Naga K. Govindaraju, and Tuyong Wang. Mars: A MapReduce Framework on Graphics Processors. PACT 2008.
[14]
M. Mustafa Rafique, Benjamin Rose, Ali R. Butt, D.S.Nikolopoulos. CellMR: A Framework for Supporting MapReduce on Asymmetric Cell-Based Clusters. IPDPS'09
[15]
J. H. Yeung, C. Tsang, K. Tsoi, B. S. Kwan, C. C. Cheung, A. P. Chan, and P. H. Leong. Map-reduce as a programming model for custom computing machines. IEEE Symposium on Field-Programmable Custom Computing Machines, 2008.
[16]
Yoav Freund, Raj Iyer, Robert E. Schapire and Yoram Singer, An efficient boosting algorithm for combining preferences, The Journal of Machine Learning Research, Volume 4, (December 2003), Pages: 933 -- 969
[17]
NY Xu, XF Cai, R Gao, L Zhang, FH Hsu, FPGA Acceleration of RankBoost in Web Search Engines, ACM Transactions on Reconfigurable Technology and Systems (TRETS), Volume 1, Issue 4 (January 2009), Article No. 19
[18]
C. Chu, S. Kim, Y. Lin, Y. Yu, G. Bradski, A. Y. Ng, and K. Olukotun. Map-reduce for machine learning on multicore. In NIPS '07: Proceedings of Twenty-First Annual Conference on Neural Information Processing Systems. Neural Information Processing Systems Foundation, 2007.
[19]
Boser, B., Guyon, I., Vapnik, V. A training algorithm for optimal margin classifiers. Proc. of the Fifth Annual Workshop on Computational Learning Theory, Pittsburgh, ACM, pp, 144--152
[20]
J. Platt. Sequential minimal optimization: A fast algorithm for training support vector machines. Technical Report MSR-TR-98-14, Microsoft Research
[21]
Bryan Catanzaro, Narayanan Sundaram and Kurt Keutzer. Fast support vector machine training and classification on graphics processors, Proceedings of the 25th International Conference on Machine Learning, Helsinki, Finland, 2008.
[22]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, 1998.
[23]
Ningyi Xu, Even Microsoft uses AutoESL's C synthesis to speed up its SW, http://deepchip.com/items/0482-06.html

Cited By

View all
  • (2024)Multi-Access Distributed ComputingIEEE Transactions on Information Theory10.1109/TIT.2024.337312870:5(3385-3398)Online publication date: May-2024
  • (2024)F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00287(3739-3752)Online publication date: 13-May-2024
  • (2024)High-Level Programming of FPGA-Accelerated Systems with Parallel PatternsInternational Journal of Parallel Programming10.1007/s10766-024-00770-352:4(253-273)Online publication date: 27-May-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
FPGA '10: Proceedings of the 18th annual ACM/SIGDA international symposium on Field programmable gate arrays
February 2010
308 pages
ISBN:9781605589114
DOI:10.1145/1723112
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 February 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. FPGA framework
  2. MapReduce
  3. RankBoost

Qualifiers

  • Research-article

Conference

FPGA '10
Sponsor:

Acceptance Rates

Overall Acceptance Rate 125 of 627 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)27
  • Downloads (Last 6 weeks)3
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Multi-Access Distributed ComputingIEEE Transactions on Information Theory10.1109/TIT.2024.337312870:5(3385-3398)Online publication date: May-2024
  • (2024)F-TADOC: FPGA-Based Text Analytics Directly on Compression with HLS2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00287(3739-3752)Online publication date: 13-May-2024
  • (2024)High-Level Programming of FPGA-Accelerated Systems with Parallel PatternsInternational Journal of Parallel Programming10.1007/s10766-024-00770-352:4(253-273)Online publication date: 27-May-2024
  • (2023)Optimistic Data Parallelism for FPGA-Accelerated SketchingProceedings of the VLDB Endowment10.14778/3579075.357908516:5(1113-1125)Online publication date: 1-Jan-2023
  • (2022)The Eye in Extended Reality: A Survey on Gaze Interaction and Eye Tracking in Head-worn Extended RealityACM Computing Surveys10.1145/349120755:3(1-39)Online publication date: 25-Mar-2022
  • (2022)A Comprehensive Exploration of Languages for Parallel ComputingACM Computing Surveys10.1145/348500855:2(1-39)Online publication date: 18-Jan-2022
  • (2022)A Survey on Machine Learning Accelerators and Evolutionary Hardware PlatformsIEEE Design & Test10.1109/MDAT.2022.316112639:3(91-116)Online publication date: Jun-2022
  • (2022)SAIR: significance-aware approach to improve QoR of big data processing in case of budget constraintThe Journal of Supercomputing10.1007/s11227-019-02797-775:9(5760-5781)Online publication date: 10-Mar-2022
  • (2021)Academic Activities Recommendation System for Sustainable Education in the Age of COVID-19Informatics10.3390/informatics80200298:2(29)Online publication date: 20-Apr-2021
  • (2021)Analysis of the State of Learning in University Students with the Use of a Hadoop FrameworkFuture Internet10.3390/fi1306014013:6(140)Online publication date: 24-May-2021
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media