Periodicity in User Engagement with a Search Engine and Its Application to Online Controlled Experiments

Authors:
Alexey Drutsa, Yandex, Moscow, Russia
Gleb Gusev, Yandex, Moscow, Russia
Pavel Serdyukov, Yandex, Moscow, Russia

ACM Transactions on the Web, Volume 11, Issue 2, Article No. 9, pp. 1–35. https://doi.org/10.1145/2856822

Published: 14 April 2017

Abstract

Billions of people now use the Web to meet their daily needs. A significant share of these needs consists of search tasks, which are usually addressed by search engines, so daily search needs result in regular user engagement with a search engine. User engagement with web services has been studied from various angles, but little work appears to have been devoted to its regularity and periodicity. In this article, we study the periodicity of user engagement with a popular search engine by applying spectrum analysis to temporal sequences of different engagement metrics. First, we found periodicity patterns of user engagement and revealed classes of users whose periodicity patterns do not change over a long period of time. In addition, we give an exhaustive analysis of the stability and quality of the identified clusters. Second, we used the spectrum series as key metrics to evaluate search quality. We found that the novel periodicity metrics outperform the state-of-the-art quality metrics both in terms of significance level (p-value) and sensitivity on a large set of large-scale A/B experiments conducted on real search engine users.
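
To make the idea of spectrum analysis on an engagement sequence concrete, the following minimal sketch computes the discrete Fourier transform (DFT) amplitudes of one user's daily session counts and reads off the dominant period. It is an illustration only, not the authors' implementation: the function names, the 28-day window, the normalization by series length, and the synthetic weekday-only user are all assumptions made for this example.

```python
import cmath
import math


def amplitude_spectrum(series):
    """Return DFT amplitudes |X_k| / N for k = 0 .. N // 2 of a real-valued series."""
    n = len(series)
    amplitudes = []
    for k in range(n // 2 + 1):
        # Naive O(N^2) DFT; adequate for short per-user series.
        x_k = sum(
            value * cmath.exp(-2j * math.pi * k * t / n)
            for t, value in enumerate(series)
        )
        amplitudes.append(abs(x_k) / n)
    return amplitudes


def dominant_period(series):
    """Return the period (in samples) of the strongest non-constant frequency."""
    amplitudes = amplitude_spectrum(series)
    # Skip k = 0, which only reflects the mean level of the series.
    k_best = max(range(1, len(amplitudes)), key=lambda k: amplitudes[k])
    return len(series) / k_best


# Hypothetical user (assumption): 4 weeks of daily session counts,
# active on five days of each week and absent on the other two.
daily_sessions = [3, 3, 3, 3, 3, 0, 0] * 4

spectrum = amplitude_spectrum(daily_sessions)
period_days = dominant_period(daily_sessions)
print(f"dominant period: {period_days:.1f} days")
```

For this synthetic user the strongest non-constant component sits at a period of 7 days, reflecting the weekly rhythm. In an experiment setting, amplitudes such as `spectrum[k]` at chosen frequencies could then be aggregated per user and compared between control and treatment groups; how the article actually defines and aggregates its periodicity metrics is not stated in the abstract.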

Published in
ACM Transactions on the Web Volume 11, Issue 2
May 2017
199 pages
ISSN: 1559-1131
EISSN: 1559-114X
DOI: 10.1145/3079924
Editors:
Brian D. Davison, Lehigh University, USA
Marianne Winslett, University of Illinois at Urbana-Champaign
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 14 April 2017
- Accepted: 1 December 2016
- Revised: 1 September 2016
- Received: 1 December 2015
Author Tags
A/B test
DFT
OAC
OEC
User engagement
amplitude
discrete Fourier transform
frequency domain
key metric
online controlled experiment
overall acceptance criterion
overall evaluation criterion
periodicity
quality metrics
search engine
spectrum analysis
Qualifiers
- research-article
- Research
- Refereed

Article Metrics
- Total Citations: 17
- Total Downloads: 329
- Downloads (Last 12 months): 26
- Downloads (Last 6 weeks): 6