skip to main content
10.1145/2671015.2671018acmconferencesArticle/Chapter ViewAbstractPublication PagesvrstConference Proceedingsconference-collections
research-article

Accelerating vision-based 3D indoor localization by distributing image processing over space and time

Published: 11 November 2014 Publication History

Abstract

In a vision-based 3D indoor localization system, conducting localization of user's device at a high frame rate is important to support real-time augment reality applications. However, vision-based 3D localization typically involves 2D keypoint detection and 2D-3D matching processes, which are in general too computationally intensive to be carried out at a high frame rate (e.g., 30 fps) on commodity hardware such as laptops or smartphones. In order to reduce per-frame computation time for 3D localization, we present a new method that distributes required computation over space and time, by splitting a video frame region into multiple sub-blocks, and processing only a sub-block in a rotating sequence at each video frame. The proposed method is general enough that it can be applied to any keypoint detection and 2D-3D matching schemes. We apply the method in a prototype 3D indoor localization system, and evaluate its performance in a 120m long indoor hallway environment using 5,200 video frames of 640x480 (VGA) resolution and a commodity laptop. When SIFT-based keypoint detection is used, our method reduces average and maximum computation time per frame by a factor of 10 and 7 respectively, with a marginal increase of positioning error (e.g., 0.17 m). This improvement enables the frame processing rate to increase from 3.2 fps to 23.3 fps.

References

[1]
Arth, C., Wagner, D., Klopschitz, M., Irschara, A., and Schmalstieg, D. 2009. Wide Area Localization on Mobile Phones. In Proc. IEEE International Symposium on Mixed and Augmented Reality.
[2]
ATAP Project Tango. https://www.google.com/atap/projecttango/. Accessed: 2014-09-02.
[3]
Beauchemin, S. S., and Barron, J. L. 1995. The Computation of Optical Flow. ACM Computing Surveys 27, 3.
[4]
Bouguet, J.-Y. 2000. Pyramidal Implementation of the Lucas Kanade Feature Tracker. Intel Corporation, Microprocessor Research Labs.
[5]
Cai, H., and Zheng, J. Y. 2009. Key Views for Visualizing Large Spaces. Journal of Visual Communication and Image Representation 20, 6.
[6]
Carozza, L., Bosché, F., and Abdel-Wahab, M. 2013. Image-based Localization for an Indoor VR/AR Construction Training System. In Proc. International Conference on Construction Applications of Virtual Reality.
[7]
Castle, R. O., Klein, G., and Murray, D. W. 2011. Wide-area augmented reality using camera tracking and mapping in multiple regions. Computer Vision and Image Understanding 115, 854--867.
[8]
Davison, A. J., Reid, I. D., Molton, N. D., and Stasse, O. 2007. MonoSLAM: Real-Time Single Camera SLAM. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 6.
[9]
Dellaert, F., Seitz, S. M., Thorpe, C. E., and Thrun, S. 2000. Structure from Motion without Correspondence. In Proc. IEEE Conference on Computer Vision and Pattern Recognition.
[10]
Dissanayake, M. W. M. G., Newman, P. M., Clark, S., Durrant-Whyte, H. F., and Csorba, M. 2001. A Solution to the Simultaneous Localisation and Map Building (SLAM) Problem. IEEE Transactions on Robotics and Automation 17, 3.
[11]
Dong, Z., Zhang, G., Jia, J., and Bao, H. 2009. Keyframe-Based Real-Time Camera Tracking. In Proc. ICCV.
[12]
Elias, R., and Elnahas, A. 2009. Fast Localization in Indoor Environments. In Proc. CISDA.
[13]
Faragher, R. M., Sarno, C., and Newman, M. 2012. Opportunistic Radio SLAM for Indoor Navigation using Smartphone Sensors. In Proc. IEEE/ION PLANS.
[14]
Ferris, B., Fox, D., and Lawrence, N. D. 2007. WiFi-SLAM Using Gaussian Process Latent VariableModels. In Proc. IJCAI.
[15]
Fischler, M. A., and Bolles, R. C. 1981. Random Sample Consensus: A Paradigm for Model Fitting with Apphcatlons to Image Analysis and Automated Cartography. Graphics and Image Processing 24, 6.
[16]
Henry, P., Krainin, M., Herbst, E., Ren, X., and Fox, D. 2012. RGB-D Mapping: Using Kinect-style Depth Cameras for Dense 3D Modeling of Indoor Environments. International Journal of Robotics Research 31, 5.
[17]
Irschara, A., Zach, C., Frahm, J.-M., and Bischof, H. 2009. From Structure-from-Motion Point Clouds to Fast Location Recognition. In Proc. CVPR.
[18]
Juan, L., and Gwun, O. 2009. A Comparison of SIFT, PCA-SIFT and SURF. International Journal of Image Processing (IJIP) 3, 4.
[19]
Kummerle, R., Grisetti, G., Strasdat, H., Konolige, K., and Burgard, W. 2011. g2o: A General Framework for Graph Optimization. In Proc. IEEE International Conference on Robotics and Automation (ICRA).
[20]
Lai, S.-H. 2000. Robust Image Matching under Partial Occlusion and Spatially Varying Illumination Change. Computer Vision and Image Understanding 78, 1.
[21]
Li, X., Wang, J., Knight, N., and Ding, W. 2011. Vision-based Positioning with a Single Camera and 3D Maps: Accuracy and Reliability Analysis. Journal of Global Positioning Systems 10, 1.
[22]
Lim, H., Sinha, S. N., Cohen, M. F., and Uyttendaele, M. 2012. Real-time Image-based 6-DOF Localization in Large-Scale Environments. In Proc. CVPR.
[23]
Lowe, D. G. 2004. Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision 60, 2.
[24]
Lu, G., and Kambhamettu, C. 2014. Image-based Indoor Localization System based on 3D SfM Model. In Proc. SPIE, Intelligent Robots and Computer Vision.
[25]
Maesen, S., Goorts, P., and Bekaert, P. 2013. Scalable Optical Tracking for Navigating Large Virtual Environments using Spatially Encoded Markers. In Proc. ACM VRST.
[26]
OpenCV. http://opencv.org/documentation.html. Accessed: 2014-09-02.
[27]
Sattler, T., Leibe, B., and Kobbelt, L. 2011. Fast Image-Based Localization using Direct 2D-to-3D Matching. In Proc. ICCV.
[28]
Snavely, N., Seitz, S. M., and Szeliski, R. 2008. Modeling the World from Internet Photo Collections. In Proc. IJCV.
[29]
SolvePnP. http://docs.opencv.org/modules/calib3d/doc/camera_calibration_and_3d_reconstruction.html. Accessed: 2014-09-02.
[30]
SuperDroid Robots. http://www.superdroidrobots.com/. Accessed: 2014-09-02.
[31]
Werner, M., Kessel, M., and Marouane, C. 2011. Indoor Positioning Using Smartphone Camera. In Proc. IPIN.
[32]
Wolf, J., Burgard, W., and Burkhardt, H. 2005. Robust Vision-based Localization by Combining an Image Retrieval System with Monte Carlo Localization. IEEE Transactions on Robotics 21, 2.
[33]
Xtion PRO LIVE. http://www.asus.com/Multimedia/Xtion_PRO_LIVE/. Accessed: 2014-09-02.
[34]
Yun, D., Chang, H., Mirowski, P., and Lakshman, T. 2013. A Case of Cloud-based 3D Indoor Localization System with Sub-meter Accuracy. Tech. Rep. ITD-13-53900M, Bell Labs Technical Memorandum.
[35]
Yun, D., Mirowski, P., Lee, T., and Kee, C. 2013. Sub-meter Accuracy 3D Indoor Positioning Algorithm byMatching Feature Points of 2D Smartphone Photo. In Proc. the ION 2013 Pacific PNT Meeting.

Cited By

View all
  • (2020)A Comprehensive Survey of Indoor Localization Methods Based on Computer VisionSensors10.3390/s2009264120:9(2641)Online publication date: 6-May-2020
  • (2017)BreakSenseProceedings of the 2017 CHI Conference on Human Factors in Computing Systems10.1145/3025453.3026021(3595-3607)Online publication date: 2-May-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
VRST '14: Proceedings of the 20th ACM Symposium on Virtual Reality Software and Technology
November 2014
238 pages
ISBN:9781450332538
DOI:10.1145/2671015
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 November 2014

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. 3D localization
  2. indoor localization
  3. indoor navigation

Qualifiers

  • Research-article

Conference

VRST '14
Sponsor:

Acceptance Rates

Overall Acceptance Rate 66 of 254 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2020)A Comprehensive Survey of Indoor Localization Methods Based on Computer VisionSensors10.3390/s2009264120:9(2641)Online publication date: 6-May-2020
  • (2017)BreakSenseProceedings of the 2017 CHI Conference on Human Factors in Computing Systems10.1145/3025453.3026021(3595-3607)Online publication date: 2-May-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media