Look Who's Tracking

An analysis of the 500 websites most-visited by Finnish web users

Keywords: cookies [http://www.yso.fi/onto/yso/p8439], tracking [http://www.yso.fi/onto/yso/p17497], remote monitoring [http://www.yso.fi/onto/yso/p34193], web pages [http://www.yso.fi/onto/yso/p4050], privacy [http://www.yso.fi/onto/yso/p10909]

Abstract

Although the prevalence of online tracking has been studied before, we still know little about who is tracking and profiling Finnish web users. This study examines tracking on the 500 websites most frequently visited by Finnish users and compares the trackers found on Finnish websites with those found on non-Finnish websites. We found trackers on 410 of the 500 websites, and a total of 466 unique trackers from 408 different organizations. As in most previous studies, Google had the greatest tracker coverage, mostly through Google Analytics and Doubleclick, reaching 75% of the websites analyzed. The second most prevalent tracking organization was Facebook, present on 46% of the websites. After Google and Facebook came a number of organizations with fairly similar tracking coverage, followed by a long tail of others. Finnish and non-Finnish websites differed notably in the trackers they used, which suggests some geographical preference in publishers' choices of advertising platforms and analytics tools.

Published
2019-12-20
How to cite
Bailey, J., Laakso, M., & Nyman, L. (2019). Look Who’s Tracking: An analysis of the 500 websites most-visited by Finnish web users. Informaatiotutkimus, 38(3-4). https://doi.org/10.23978/inf.87841
Section
Articles