research-article

Learning to predict indoor illumination from a single image

Authors:

Marc-André Gardner,

Kalyan Sunkavalli,

Emiliano Gambaretto,

Christian Gagné,

Jean-François LalondeAuthors Info & Claims

ACM Transactions on Graphics (TOG), Volume 36, Issue 6

Article No.: 176, Pages 1 - 14

https://doi.org/10.1145/3130800.3130891

Published: 20 November 2017 Publication History

Abstract

We propose an automatic method to infer high dynamic range illumination from a single, limited field-of-view, low dynamic range photograph of an indoor scene. In contrast to previous work that relies on specialized image capture, user input, and/or simple scene models, we train an end-to-end deep neural network that directly regresses a limited field-of-view photo to HDR illumination, without strong assumptions on scene geometry, material properties, or lighting. We show that this can be accomplished in a three step process: 1) we train a robust lighting classifier to automatically annotate the location of light sources in a large dataset of LDR environment maps, 2) we use these annotations to train a deep neural network that predicts the location of lights in a scene from a single limited field-of-view photo, and 3) we fine-tune this network using a small dataset of HDR environment maps to predict light intensities. This allows us to automatically recover high-quality HDR illumination estimates that significantly outperform previous state-of-the-art methods. Consequently, using our illumination estimates for applications like 3D object insertion, produces photo-realistic results that we validate via a perceptual user study.

References

[1]

Aayush Bansal, Bryan Russell, and Abhinav Gupta. 2016. Marr Revisited: 2D-3D Model Alignment via Surface Normal Prediction. IEEE Conference on Computer Vision and Pattern Recognition (2016).

[2]

Francesco Banterle, Marco Callieri, Matteo Dellepiane, Massimiliano Corsini, Fabio Pellacini, and Roberto Scopigno. 2013. EnvyDepth: An interface for recovering local natural illumination from environment maps. Computer Graphics Forum 32, 7 (2013), 411--420.

[3]

Jonathan Barron and Jitendra Malik. 2013a. Intrinsic Scene Properties from a Single RGB-D Image. IEEE Conference on Computer Vision and Pattern Recognition (2013).

Digital Library

[4]

Jonathan Barron and Jitendra Malik. 2013b. Shape, illumination, and reflectance from shading. IEEE Transactions on Pattern Analysis and Machine Intelligence 37, 8 (2013), 1670--1687.

[5]

Sean Bell, Paul Upchurch, Noah Snavely, and Kavita Bala. 2015. Material Recognition in the Wild with the Materials in Context Database. IEEE Conference on Computer Vision and Pattern Recognition (2015).

[6]

Dmitri Bitouk, Neeraj Kumar, Samreen Dhillon, Peter Belhumeur, and Shree K. Nayar. 2008. Face Swapping: Automatically Replacing Faces in Photographs. ACM Transactions on Graphics 27, 3, Article 39 (Aug. 2008), 8 pages.

Digital Library

[7]

Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter. 2016. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs). In International Conference on Learning Representations.

[8]

Navneet Dalal and Bill Triggs. 2005. Histograms of Oriented Gradients for Human Detection. In IEEE Conference on Computer Vision and Pattern Recognition. 886--893.

Digital Library

[9]

Paul Debevec.1997. Recovering High Dynamic Range Radiance Maps from Photographs. In ACM SIGGRAPH 1997. 1--10.

Digital Library

[10]

Paul Debevec. 1998. Rendering Synthetic Objects into Real Scenes : Bridging Traditional and Image-based Graphics with Global Illumination and High Dynamic Range Photography. In Proceedings of ACM SIGGRAPH.

Digital Library

[11]

Paul Debevec, Paul Graham, Jay Busch, and Mark Bolas. 2012. A Single-shot Light Probe. In ACM SIGGRAPH 2012 Talks. ACM, New York, NY, USA, 10:1--10:1.

Digital Library

[12]

David Eigen and Rob Fergus. 2015. Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture. International Conference on Computer Vision (2015).

Digital Library

[13]

Pedro F. Felzenszwalb, Ross B. Girshick, David McAllester, and Deva Ramanan. 2010. Object Detection with Discriminative Trained Part Based Models. IEEE Transactions on Pattern Analysis and Machine Intelligence 32, 9 (2010), 1627--1645.

Digital Library

[14]

Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Luc J. Van Gool, and Tinne Tuytelaars. 2016. DeLight-Net: Decomposing Reflectance Maps into Specular Materials and Natural Illumination. CoRR abs/1603.08240 (2016).

[15]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep Residual Learning for Image Recognition. In IEEE Conference on Computer Vision and Pattern Recognition.

[16]

Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, and Jean-François Lalonde. 2017. Deep Outdoor Illumination Estimation. In IEEE Conference on Computer Vision and Pattern Recognition.

[17]

Lukáš Hošek and Alexander Wilkie. 2012. An analytic model for full spectral sky-dome radiance. ACM Transactions on Graphics 31, 4 (2012), 1--9. http://www.scopus.com/inward/record.url?eid=2-s2.0-84872254894

Digital Library

[18]

Kevin Karsch, Varsha Hedau, David Forsyth, and Derek Hoiem. 2011. Rendering synthetic objects into legacy photographs. ACM Transactions on Graphics 30, 6 (2011), 1.

Digital Library

[19]

Kevin Karsch, Kalyan Sunkavalli, Sunil Hadap, Nathan Carr, Hailin Jin, Rafael Fonte, Michael Sittig, and David Forsyth. 2014. Automatic Scene Inference for 3D Object Compositing. ACM Transactions on Graphics 3 (2014), 32:1--32:15.

Digital Library

[20]

Erum Arif Khan, Erik Reinhard, Roland W. Fleming, and Heinrich H. Bülthoff. 2006. Image-based material editing. ACM Transactions on Graphics 25, 3 (2006), 654.

Digital Library

[21]

Diederik Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv.1412.6980 (2014).

[22]

Philipp Krähenbühl and Vladlen Koltun. 2012. Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials. In Neural Information Processing Systems.

[23]

Jean-François Lalonde, Derek Hoiem, Alexei A. Efros, Carsten Rother, John Winn, and Antonio Criminisi. 2007. Photo Clip Art. ACM Transactions on Graphics 26, 3, Article 3 (July 2007).

Digital Library

[24]

Jean-François Lalonde, Srinivasa G. Narasimhan, and Alexei A. Efros. 2010. What do the sun and the sky tell us about the camera? International Journal on Computer Vision 88, 1 (May 2010), 24--51.

Digital Library

[25]

Stephen Lombardi and Ko Nishino. 2016. Reflectance and Illumination Recovery in the Wild. IEEE Transactions on Pattern Analysis and Machine Intelligence 38, 1 (2016), 129--141.

Digital Library

[26]

Jorge Lopez-Moreno, Sunil Hadap, Erik Reinhard, and Diego Gutierrez. 2010. Compositing images through light source detection. Computers & Graphics 34, 6 (2010), 698--707.

Digital Library

[27]

Ravi Ramamoorthi and Pat Hanrahan. 2001. A Signal-processing Framework for Inverse Rendering. In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '01). 117--128.

Digital Library

[28]

Erik Reinhard, Wolfgang Heidrich, Paul Debevec, Sumanta Pattanaik, Greg Ward, and Karol Myszkowski. 2010. High Dynamic Range Imaging (2 ed.). Morgan Kaufman.

[29]

Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Efstratios Gavves, and Tinne Tuytelaars. 2016. Deep Reflectance Maps. In IEEE Conference on Computer Vision and Pattern Recognition.

[30]

Levi Valgaerts, Chenglei Wu, Andrés Bruhn, Hans-Peter Seidel, and Christian Theobalt. 2012. Lightweight Binocular Facial Performance Capture Under Uncontrolled Lighting. ACM Transactions on Graphics 31, 6, Article 187 (Nov. 2012), 11 pages.

Digital Library

[31]

Chenglei Wu, B. Wilburn, Y. Matsushita, and C. Theobalt. 2011. High-quality Shape from Multi-view Stereo and Shading Under General Illumination. In IEEE Conference on Computer Vision and Pattern Recognition.

Digital Library

[32]

Jianxiong Xiao, Krista A. Ehinger, Aude Oliva, and Antonio Torralba. 2012. Recognizing scene viewpoint using panoramic place representation. In IEEE Conference on Computer Vision and Pattern Recognition.

Digital Library

[33]

Edward Zhang, Michael F. Cohen, and Brian Curless. 2016. Emptying, Refurnishing, and Relighting Indoor Spaces. ACM Transactions on Graphics 35, 6 (2016).

Digital Library

[34]

Tinghui Zhou, Philipp Krähenbühl, and Alexei A. Efros. 2015. Learning Data-driven Reflectance Priors for Intrinsic Image Decomposition. International Conference on Computer Vision (2015).

Digital Library

Cited By

Zhao YFanello SGuo T(2024)Multi-Camera Lighting Estimation for Mobile Augmented RealityGetMobile: Mobile Computing and Communications10.1145/3701701.370171028:3(25-29)Online publication date: 22-Oct-2024
https://dl.acm.org/doi/10.1145/3701701.3701710
Buehler MLi GWood EHelminger LChen XShah TWang DGarbin SOrts-Escolano SHilliges OLagun DRiviere JGotardo PBeeler TMeka ASarkar K(2024)Cafca: High-quality Novel View Synthesis of Expressive Faces from Casual Few-shot CapturesSIGGRAPH Asia 2024 Conference Papers10.1145/3680528.3687580(1-12)Online publication date: 3-Dec-2024
https://dl.acm.org/doi/10.1145/3680528.3687580
Rao PFox GMeka AB R MZhan FWeyrich TBickel BPfister HMatusik WElgharib MTheobalt C(2024)Lite2Relight: 3D-aware Single Image Portrait RelightingACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657470(1-12)Online publication date: 13-Jul-2024
https://dl.acm.org/doi/10.1145/3641519.3657470
Show More Cited By

Index Terms

Learning to predict indoor illumination from a single image
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Scene understanding
      2. Image and video acquisition
        Computational photography
  2. Computer graphics
    1. Image manipulation

Recommendations

Multiresolution image-space rendering for interactive global illumination
Hardware-accelerated global illumination by image space photon mapping
HPG '09: Proceedings of the Conference on High Performance Graphics 2009

We describe an extension to photon mapping that recasts the most expensive steps of the algorithm -- the initial and final photon bounces -- as image-space operations amenable to GPU acceleration. This enables global illumination for real-time ...
Physically-Based Editing of Indoor Scene Lighting from a Single Image
Computer Vision – ECCV 2022
Abstract
We present a method to edit complex indoor lighting from a single image with its predicted depth and light source segmentation masks. This is an extremely challenging problem that requires modeling complex light transport, and disentangling HDR ...

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Graphics

ACM Transactions on Graphics Volume 36, Issue 6

December 2017

973 pages

ISSN:0730-0301

EISSN:1557-7368

DOI:10.1145/3130800

Editor:
Kavita Bala

Issue’s Table of Contents

Copyright © 2017 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 November 2017

Published in TOG Volume 36, Issue 6

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

Natural Sciences and Engineering Research Council of Canada

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

185
Total Citations
View Citations
1,952
Total Downloads

Downloads (Last 12 months)82
Downloads (Last 6 weeks)8

Reflects downloads up to 01 Mar 2025

Other Metrics

View Author Metrics

Citations

Cited By

Zhao YFanello SGuo T(2024)Multi-Camera Lighting Estimation for Mobile Augmented RealityGetMobile: Mobile Computing and Communications10.1145/3701701.370171028:3(25-29)Online publication date: 22-Oct-2024
https://dl.acm.org/doi/10.1145/3701701.3701710
Buehler MLi GWood EHelminger LChen XShah TWang DGarbin SOrts-Escolano SHilliges OLagun DRiviere JGotardo PBeeler TMeka ASarkar K(2024)Cafca: High-quality Novel View Synthesis of Expressive Faces from Casual Few-shot CapturesSIGGRAPH Asia 2024 Conference Papers10.1145/3680528.3687580(1-12)Online publication date: 3-Dec-2024
https://dl.acm.org/doi/10.1145/3680528.3687580
Rao PFox GMeka AB R MZhan FWeyrich TBickel BPfister HMatusik WElgharib MTheobalt C(2024)Lite2Relight: 3D-aware Single Image Portrait RelightingACM SIGGRAPH 2024 Conference Papers10.1145/3641519.3657470(1-12)Online publication date: 13-Jul-2024
https://dl.acm.org/doi/10.1145/3641519.3657470
Vélez Bedoya JGonzález Bedia MOssa LArango‐López JDíaz‐Arancibia J(2024)Exploring deep learning techniques for illuminance estimationExpert Systems10.1111/exsy.1355942:1Online publication date: 6-Feb-2024
https://doi.org/10.1111/exsy.13559
Xu RZhang LZhang F(2024)Intrinsic Omnidirectional Image Decomposition With Illumination Pre-ExtractionIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336634330:7(4416-4428)Online publication date: Jul-2024
https://doi.org/10.1109/TVCG.2024.3366343
Wang MZhou JFeng WJiang YSerrano A(2024)Studying the Effect of Material and Geometry on Perceptual Outdoor IlluminationIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2023.334756030:9(6468-6480)Online publication date: Sep-2024
https://doi.org/10.1109/TVCG.2023.3347560
Fei FCheng YZhu YZheng QLi SPan GShi B(2024)SPLiT: Single Portrait Lighting Estimation via a Tetrad of Face IntrinsicsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2023.332845346:2(1079-1092)Online publication date: 1-Feb-2024
https://dl.acm.org/doi/10.1109/TPAMI.2023.3328453
Zhao JXue BZhang M(2024)SALENet: Structure-Aware Lighting Estimations From a Single Image for Indoor EnvironmentsIEEE Transactions on Image Processing10.1109/TIP.2024.351238133(6806-6820)Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1109/TIP.2024.3512381
Cheng LHu JYan HGladkova MHuang TLiu YCremers DLi H(2024)Physically-Based Photometric Bundle Adjustment in Non-Lambertian Environments2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)10.1109/IROS58592.2024.10802236(10461-10468)Online publication date: 14-Oct-2024
https://doi.org/10.1109/IROS58592.2024.10802236
Xue JHuang HZhou ZXu SChen A(2024)LRGAN: Learnable Weighted Recurrent Generative Adversarial Network for End-to-End Shadow Generation2024 International Joint Conference on Neural Networks (IJCNN)10.1109/IJCNN60899.2024.10650634(1-8)Online publication date: 30-Jun-2024
https://doi.org/10.1109/IJCNN60899.2024.10650634
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Issue’s Table of Contents