Abstract
In this work, we discover the daily location-driven routines that are contained in a massive real-life human dataset collected by mobile phones. Our goal is the discovery and analysis of human routines that characterize both individual and group behaviors in terms of location patterns. We develop an unsupervised methodology based on two differing probabilistic topic models and apply them to the daily life of 97 mobile phone users over a 16-month period to achieve these goals. Topic models are probabilistic generative models for documents that identify the latent structure that underlies a set of words. Routines dominating the entire group's activities, identified with a methodology based on the Latent Dirichlet Allocation topic model, include “going to work late”, “going home early”, “working nonstop” and “having no reception (phone off)” at different times over varying time-intervals. We also detect routines which are characteristic of users, with a methodology based on the Author-Topic model. With the routines discovered, and the two methods of characterizing days and users, we can then perform various tasks. We use the routines discovered to determine behavioral patterns of users and groups of users. For example, we can find individuals that display specific daily routines, such as “going to work early” or “turning off the mobile (or having no reception) in the evenings”. We are also able to characterize daily patterns by determining the topic structure of days in addition to determining whether certain routines occur dominantly on weekends or weekdays. Furthermore, the routines discovered can be used to rank users or find subgroups of users who display certain routines. We can also characterize users based on their entropy. We compare our method to one based on clustering using K-means. Finally, we analyze an individual's routines over time to determine regions with high variations, which may correspond to specific events.
- Akoush, S. and Sameh, A. 2007. Mobile user movement prediction using bayesian learning for neural networks. In Proceedings of the ACM International Wireless Communications and Mobile Computing Conference (IWCMC). 191--196. Google ScholarDigital Library
- Blei, D. M., Griffiths, T. L., Jordan, M. I., and Tenenbaum, J. B. 2004. Hierarchical topic models and the nested Chinese restaurant process. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS).Google Scholar
- Blei, D. M., Ng, A. Y., and Jordan, M. I. 2003. Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993--1022. Google ScholarCross Ref
- Choudhury, T. and Pentland, A. 2003. Sensing and modeling human networks using the sociometer. In Proceedings of the IEEE International Symposium on Wearable Computers (ISWC). 216. Google ScholarDigital Library
- Choudhury, T., Philipose, M., Wyatt, D., and Lester, J. 2006. Towards activity databases: Using sensors and statistical models to summarize people's lives. IEEE Data Engin. Bull. 49--58.Google Scholar
- Duda, R., Hart, P., and Stork, D. 2000. Pattern Classification 2nd Ed. Wiley-Interscience. Google ScholarDigital Library
- Eagle, N. and Pentland, A. 2009. Eigenbehaviors: Identifying structure in routine. Behav. Ecol. Sociobiol. 63, 7, 1057--1066.Google ScholarCross Ref
- Eagle, N., Pentland, A., and Lazer, D. 2009. Inferring social network structure using mobile phone data. Proc. Nat. Acad. Sci. 106, 36, 15274--15278.Google ScholarCross Ref
- Farrahi, K. and Gatica-Perez, D. 2008a. Discovering human routines from cell phone data with topic models. In Proceedings of the IEEE International Symposium on Wearable Computers (ISWC). Google ScholarDigital Library
- Farrahi, K., and Gatica-Perez, D. 2008b. What did you do today? Discovering daily routines from large-scale mobile data. In Proceedings of the ACM International Conference on Multimedia (ACM MM). Google ScholarDigital Library
- Gonzalez, M. C., Hidalgo, C. A., and Barabasi, A.-L. 2008. Understanding individual human mobility patterns. Nature 453, 7196, 779--782.Google Scholar
- Griffiths, T. L. and Steyvers, M. 2004. Finding scientific topics. Proc. Nat. Acad. Sci. USA 101 Suppl 1, 5228--5235.Google ScholarCross Ref
- Heinrich, G. 2008. Parameter estimation for text analysis,. Tech. rep., University of Leipzig.Google Scholar
- Hightower, J., Consolvo, S., Lamarca, A., Smith, I., and Hughes, J. 2005. Learning and recognizing the places we go. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 159--176. Google ScholarDigital Library
- Huynh, T., Fritz, M., and Schiele, B. 2008. Discovery of activity patterns using topic models. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 10--19. Google ScholarDigital Library
- Intille, S. S., Larson, K., Tapia, E. M., Beaudin, J. S., Kaushik, P., Nawyn, J., and Rockinson, R. 2006. Using a live-in laboratory for ubiquitous computing research. In Proceedings of the Workshop on Pervasive Computing. 349--365. Google ScholarDigital Library
- Kiukkonen, N., Blom, J., Dousse, O., Gatica-Perez, D., and Laurila, J. 2010. Towards rich mobile phone datasets: Lausanne data collection campaign. In Proceedings of the ACM International Conference on Pervasive Services (ICPS).Google Scholar
- Krumm, J. and Horvitz, E. 2006. Predestination: Inferring destinations from partial trajectories. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). Google ScholarDigital Library
- Larson, K. and Intille, S. Placelab website. http://architecture.mit.edu/house_n/data/placelab/placelab.htm.Google Scholar
- Letchner, J., Fox, D., and LaMarca, A. 2005. Large-scale localization from wireless signal strength. In Proceedings of the Natural Conference on Artificial Intelligence (AAAI). 15--20. Google ScholarDigital Library
- Liao, L., Fox, D., and Kautz, H. 2006. Location-based activity recognition. In Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS). 787--794.Google Scholar
- Ling, R. and Donner, J. 2009. Mobile Communication: Digital Media and Society Series. Polity Press.Google Scholar
- Logan, B., Healey, J., Philipose, M., Tapia, E. M., and Intille, S. S. 2007. A long-term evaluation of sensing modalities for activity recognition. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). Lecture Notes in Computer Science, vol. 4717, Springer, 483--500. Google ScholarDigital Library
- MIT Technology Review. 10 emerging technologies 2008. http://www.technologyreview.com/specialreports/specialreport.aspx?id=25.Google Scholar
- Monay, F. and Gatica-Perez, D. 2007. Modeling semantic aspects for cross-media image retrieval. IEEE Trans. Patt. Anal. Mach. Intell. Google ScholarDigital Library
- Niebles, J., Wang, H., and Fei-Fei, L. 2006. Unsupervised learning of human action categories using spatial-temporal words. Int. J. Comput. Vision. Google ScholarDigital Library
- Otsason, V., Varshavsky, A., Lamarca, A., and de Lara, E. 2005. Accurate gsm indoor localization. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). Springer, Berlin, 141--158. Google ScholarDigital Library
- Patterson, D., Liao, L., Fox, D., and Kautz, H. 2003. Inferring high-level behavior from low-level sensors. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 73--89.Google Scholar
- Pritchard, J., Stephens, M., and Donnelly, P. 2000. Inference of population structure using multilocus genotype data. Genetics. 155, 945--959.Google ScholarCross Ref
- Quelhas, P., Monay, F., Odobez, J.-M., Gatica-Perez, D., and Tuytelaars, T. 2007. A thousand words in a scene. IEEE Trans. Patt. Anal. Mach. Intell., 1575--89. Google ScholarDigital Library
- Reddy, S., Burke, J., Estrin, D., Hansen, M., and Strivastava, M. 2008. Using mobile phones to determine transportation mode. In Proceedings of the International Symposium on Wearable Computers (ISWC).Google Scholar
- Rosen-Zvi, M., Griffiths, T., Steyvers, M., and Smyth, P. 2004. The author-topic model for authors and documents. In Proceedings of the Conference on Uncertainty in Artificial Intelligence (UAI). 487--494. Google ScholarDigital Library
- Sohn, T., Varshavsky, A., Lamarca, A., Chen, M., Choudhury, T., Smith, I., Consolvo, S., Hightower, J., Griswold, W., and de Lara, E. 2006. Mobility detection using everyday gsm traces. In Proceedings of the Workshop on Ubiquitous Computing (UbiComp). 212--224. Google ScholarDigital Library
- Tapia, E., Intille, S., Haskell, W., Larson, K., Wright, J., King, A., and Friedman, R. 2007. Real-time recognition of physical activities and their intensities using wireless accelerometers and a heart rate monitor. In Proceedings of the International Symposium on Wearable Computers (ISWC). 37--40. Google ScholarDigital Library
- Tapia, E. M., Intille, S. S., and Larson, K. 2004. Activity recognition in the home setting using simple and ubiquitous sensors. In Proceedings of PERVASIVE. 158--175.Google Scholar
- Wang, Y., Bai, H., Stanton, M., Chen, W.-Y., and Chang, E. Y. 2009. PLDA: Parallel latent dirichlet allocation for large-scale applications. In Proceedings of AAIM. 301--314. Google ScholarDigital Library
- Wren, C., Ivanov, Y., Kaur, I., Leigh, D., Westhues, J., Wren, C. R., Ivanov, Y. A., Kaur, I., Leigh, D., and Westhues, J. 2007. Socialmotion: Measuring the hidden social life of a building. In Proceedings of the International Symposium on Location- and Context-Awareness (LoCA'07). 85--102. Google ScholarDigital Library
Index Terms
- Discovering routines from large-scale human locations using probabilistic topic models
Recommendations
What did you do today?: discovering daily routines from large-scale mobile data
MM '08: Proceedings of the 16th ACM international conference on MultimediaWe present a framework built from two Hierarchical Bayesian topic models to discover human location-driven routines from mobile phones. The framework uses location-driven bag representations of people's daily activities obtained from celltower ...
Discovering human routines from cell phone data with topic models
ISWC '08: Proceedings of the 2008 12th IEEE International Symposium on Wearable ComputersWe present a framework to automatically discover people's routines from information extracted by cell phones. The framework is built from a probabilistic topic model learned on novel bag type representations of activity-related cues (location, proximity ...
Latent topic model-based group activity discovery
Special Issue on ICVGIP 2010Surveillance videos of public places often consist of group activities composed from multiple co-occurring individual activities. However, latent topic models, such as Latent Dirichlet Allocation (LDA), which have been successfully used to discover ...
Comments