research-article

S2FA: an accelerator automation framework for heterogeneous computing in datacenters

Authors:
Cody Hao Yu

University of California and Falcon Computing Solutions, Inc.

University of California and Falcon Computing Solutions, Inc.
View Profile

,
Peng Wei

University of California

University of California
View Profile

,
Max Grossman

Rice University

Rice University
View Profile

,
Peng Zhang

Falcon Computing Solutions, Inc.

Falcon Computing Solutions, Inc.
View Profile

,
Vivek Sarker

Georgia Institute of Technology

Georgia Institute of Technology
View Profile

,
Jason Cong

University of California

University of California
View Profile

DAC '18: Proceedings of the 55th Annual Design Automation ConferenceJune 2018Article No.: 153Pages 1–6https://doi.org/10.1145/3195970.3196109

Published:24 June 2018Publication History

DAC '18: Proceedings of the 55th Annual Design Automation Conference

Pages 1–6

ABSTRACT

Big data analytics using the JVM-based MapReduce framework has become a popular approach to address the explosive growth of data sizes. Adopting FPGAs in datacenters as accelerators to improve performance and energy efficiency also attracts increasing attention. However, the integration of FPGAs into such JVM-based frameworks raises the challenge of poor programmability. Programmers must not only rewrite Java/Scala programs to C/C++ or OpenCL, but, to achieve high performance, they must also take into consideration the intricacies of FPGAs. To address this challenge, we present S2FA (Spark-to-FPGA-Accelerator), an automation framework that generates FPGA accelerator designs from Apache Spark programs written in Scala. S2FA bridges the semantic gap between object-oriented languages and HLS C while achieving high performance using learning-based design space exploration. Evaluation results show that our generated FPGA designs achieve up to 49.9× performance improvement for several machine learning applications compared to their corresponding implementations on the JVM.

References

Amazon EC2 F1 Instance. https://aws.amazon.com/ec2/instance-types/f1/.Google Scholar
Apache Hadoop. http://hadoop.apache.org/.Google Scholar
Aparapi in amd developer website. https://github.com/aparapi/aparapi.Google Scholar
Falcon Computing Solutions, Inc. http://falcon-computing.com/.Google Scholar
Rose Compiler Infrastructure. http://rosecompiler.org/.Google Scholar
Xilinx SDx. www.xilinx.com/products/design-tools/software-zone/sdaccel.html.Google Scholar
J. Ansel et al. 2014. OpenTuner: An Extensible Framework for Program Autotuning. In PACT. Google ScholarDigital Library
Y.-T. Chen et al. 2016. When Spark Meets FPGAs: A Case Study for Next-Generation DNA Sequencing Acceleration. In HotCloud. Google ScholarDigital Library
J. Cong et al. 2016. Source-to-Source Optimization for HLS. In FPGAs for Software Programmers. Springer International Publishing. Google ScholarDigital Library
J. Cong et al. 2016. Software Infrastructure for Enabling FPGA-Based Accelerations in Data Centers: Invited Paper. In ISLPED. Google ScholarDigital Library
J. Cong et al. 2011. High-Level Synthesis for FPGAs: From Prototyping to Deployment. TCAD. Google ScholarDigital Library
J. Dean et al. 2008. MapReduce: Simplified Data Processing on Large Clusters. OSDI. Google ScholarDigital Library
Á. Fialho et al. 2010. Analyzing bandit-based adaptive operator selection mechanisms. Ann Math Artif Intell. Google ScholarDigital Library
M. Huang et al. 2016. Programming and Runtime Support to Blaze FPGA Accelerator Deployment at Datacenter Scale. In SoCC. Google ScholarDigital Library
D. Koeplinger et al. 2016. Automatic Generation of Efficient Accelerators for Reconfigurable Hardware. In ISCA. Google ScholarDigital Library
H.-Y. Liu et al. 2013. On learning-based methods for design-space exploration with high-level synthesis. In DAC. Google ScholarDigital Library
R. Prabhakar et al. 2016. Generating Configurable Hardware from Parallel Patterns. ASPLOS. Google ScholarDigital Library
A. Putnam et al. 2014. A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services. In ISCA. Google ScholarDigital Library
R. Rodríguez et al. 2012. Image segmentation via an iterative algorithm of the mean shift filtering for different values of the stopping threshold. IJIR (2012).Google Scholar
B. C. Schafer et al. 2012. Machine learning predictive modelling high-level synthesis design space exploration. IET CDT (2012).Google Scholar
O. Segal et al. 2015. SparkCL: A Unified Programming Framework for Accelerators on Heterogeneous Clusters. CoRR.Google Scholar
C. E. Shannon. 2001. A mathematical theory of communication. ACM MC2R. Google ScholarDigital Library
T. F. Smith et al. 1981. Identification of common molecular subsequences. JMB.Google Scholar
Z. Wang et al. 2016. A performance analysis framework for optimizing OpenCL applications on FPGAs. In HPCA.Google Scholar
Z. Wang et al. 2016. Melia: A MapReduce Framework on OpenCL-based FPGAs. TPDS. Google ScholarDigital Library
C. Xu et al. 2017. A Parallel Bandit-Based Approach for Autotuning FPGA Compilation. In FPGA. Google ScholarDigital Library
S. Xydis et al. 2015. SPIRIT: Spectral-Aware pareto iterative refinement optimization for supervised high-level synthesis. TCAD (2015).Google Scholar
M. Zaharia et al. 2010. Spark: Cluster Computing with Working Sets. In HotCloud. Google ScholarDigital Library
G. Zhong et al. 2014. Design space exploration of multiple loops on FPGAs using high level synthesis. In ICCD.Google Scholar
W. Zuo et al. 2013. Improving Polyhedral Code Generation for High-level Synthesis. In CODES+ISSS. Google ScholarDigital Library

Recommendations

S2FA: An Accelerator Automation Framework for Heterogeneous Computing in Datacenters
2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)
Big data analytics using the JVM-based MapReduce framework has become a popular approach to address the explosive growth of data sizes. Adopting FPGAs in datacenters as accelerators to improve performance and energy efficiency also attracts increasing ...
Read More
Synthesizable Standard Cell FPGA Fabrics Targetable by the Verilog-to-Routing CAD Flow
Special Section on Field Programmable Logic and Applications 2015 and Regular Papers

In this article, we consider implementing field-programmable gate arrays (FPGAs) using a standard cell design methodology and present a framework for the automated generation of synthesizable FPGA fabrics. The open-source Verilog-to-Routing (VTR) FPGA ...
Read More
Embedded Design Using Programmable Gate Arrays
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

DAC '18: Proceedings of the 55th Annual Design Automation Conference
June 2018
1089 pages
ISBN:9781450357005
DOI:10.1145/3195970

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 June 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,770of5,499submissions,32%
Upcoming Conference
DAC '24

Sponsor:

sigda

61st ACM/IEEE Design Automation Conference

June 23 - 27, 2024

San Francisco , CA , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 16
  Total Citations
  View Citations
- 459
  Total Downloads
- Downloads (Last 12 months)36
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

S2FA: an accelerator automation framework for heterogeneous computing in datacenters

DAC '18: Proceedings of the 55th Annual Design Automation Conference

ABSTRACT

References

Cited By

Recommendations

S2FA: An Accelerator Automation Framework for Heterogeneous Computing in Datacenters

Synthesizable Standard Cell FPGA Fabrics Targetable by the Verilog-to-Routing CAD Flow

Embedded Design Using Programmable Gate Arrays

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

S2FA: an accelerator automation framework for heterogeneous computing in datacenters

DAC '18: Proceedings of the 55th Annual Design Automation Conference

ABSTRACT

References

Cited By

Recommendations

S2FA: An Accelerator Automation Framework for Heterogeneous Computing in Datacenters

Synthesizable Standard Cell FPGA Fabrics Targetable by the Verilog-to-Routing CAD Flow

Embedded Design Using Programmable Gate Arrays

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media