Methods – WI Methods Lab

New Publication: Extracting smartphone use from Android event log data

May 30, 2025May 30, 2025 Diana Ignatovich

Back in October 2024, the Methods lab shared a preprint of a study by Methods Lab member and data scientist, Roland Toth, and former research fellow, Douglas Parry, exploring how to isolate meaningful measures of smartphone use from Android event log data. We’re now pleased to announce that this work has been peer-reviewed and published in the journal Computational Communication Research.

The article titled “Extracting Meaningful Measures of Smartphone Usage from Android Event Log Data: A Methodological Primer” outlines a practical and reproducible step-by-step guide for deriving objective indicators of human usage from raw mobile data, offering valuable insights for research in social science and related disciplines. It details the extraction of key usage metrics through written explanations, visual aids, and pseudo-code. The paper is a vital resource for researchers seeking to understand patterns of mobile phone engagement and its implications in today’s rapidly evolving digital environment.

Workshop Recap: Introduction to Programming and Data Analysis with R

April 3, 2025April 4, 2025 Diana Ignatovich

A third edition on the Introduction to Programming and Data Analysis with R workshop took place on March 12th and 13th, 2025. Roland Toth with the Methods Lab at the Weizenbaum Institute engaged almost 20 participants with essential methods of data analysis via comprehensive coverage of fundamental R programming concepts and techniques.

Roland asks participants about their former experience with programming

On the first day, Roland guided participants through the basics of R syntax and its integration with Markdown/Quarto in an interactive environment. This included the very basics of programming like functions, objects, and indexing, but also data-related practices like data wrangling, sanity checks, and simple statistical analyses. Among others, participants also gained insight on managing warnings and errors that might stunt the process of coding throughout projects.

On day two, after an introduction to data visualization techniques, participants put their learning into practice: They explored provided survey data and developed a research question, so they could prepare and statistically analyze the data accordingly in R. The result was a reproducible HTML report on the reasoning behind the research question, all data wrangling steps, an exploration of the data set, the analysis, and the results including an interpretation. Attendees also supported each other’s progress whenever possible, while Roland offered personalized guidance.

The workshop alternated between lecture-like and interactive formats

The workshop concluded with a thorough review of useful functions and packages in R. Throughout the event, participants were encouraged to ask questions freely and frequently, and they took the opportunity. The Methods Lab would like to give a great thanks to all guests for their attendance and lively participation!

Workshop: Open Research – Principles, Practices, and Implementation (September 3, 2024)

July 26, 2024July 26, 2024 Methods Lab

We’re excited to announce our upcoming workshop Open Research – Principles, Practices, and Implementation, which will take place on Tuesday, September 3. This workshop will be conducted both at the Weizenbaum Institute and online, and is open to Weizenbaum Institute members as well as external participants (and the QPD).

Led by Tobias Dienlin, Assistant Professor of Interactive Communication at the University of Vienna, this workshop will equip participants with skills in open research by covering principles of transparency, reproducibility, the replication crisis, and practical sessions on sharing research materials, data, and analyses. It will also include preregistrations, registered reports, preprints, postprints, TOP Guidelines, and initiatives like DORA, CORA, and RESQUE. Participants will engage in drafting preregistration plans and discussing the incentives and challenges of open research, aiming to integrate these practices into their work for a more transparent and robust research community.

For further details, visit our program page. We are looking forward to your participation!

Short Project: Ethics of Data Work

July 22, 2024July 22, 2024 Anna Hohwü-Christensen

AI systems rely heavily on workers who face precarious conditions. Data work, clickwork, and crowdwork—essential for validating algorithms and creating datasets to train and refine AI systems—are frequently outsourced by commercial entities and academic institutions. Despite the vast and growing workforce of 435 million data workers enabling machine learning, their working conditions remain largely unaddressed, resulting in exploitative practices. Academic clients, in particular, lack clear guidance on how to outsource data work ethically and responsibly.

To address this issue, Christian Strippel from the Methods Lab is part of the short project “Ethics of Data Work” together with Milagros Miceli and Tianling Yang from the research group “Data, Algorithmic Systems and Ethics“, Bianca Herlo and Corinna Canali from the research “Design, Diversity and New Commons“, and Alexandra Keiner from the research group “Norm Setting and Decision Processes“. Together they aim to create equitable working systems grounded in the real knowledge and experience of data workers. The project will gather valuable insights about the challenges and needs data workers face, with the objective of developing ethical guidelines for researchers to ensure responsible and ethical treatment in the future.

Tutorial: Analyzing Digital Trace Data using Process Mining (September 17, 2024)

May 21, 2024July 15, 2024 Methods Lab

We’re excited to announce our upcoming workshop Analyzing Digital Trace Data using Process Mining, scheduled for Tuesday, September 17th at the Weizenbaum Institute. In this QPD Tutorial, led by Jan Mendling (HU), we will discuss the essentials of analyzing digital trace data using process mining.

For further details about the workshop, please visit our program page.

Workshop Recap: Research Ethics – Principles and Practice in Digitalization Research

May 19, 2024July 15, 2024 Methods Lab

On April 18 2024, the Methods Lab organized the workshop Research Ethics – Principles and Practice in Digitalization Research to meet the increasing relevance and complexity of ethics in digitalization research.

In the first part of the workshop, Christine Normann (WZB) introduced participants to good research practice and research ethics in alignment with the guidelines of the German Research Foundation (DFG). Besides the need to balance the freedom of research and data protection, she informed about important institutions, noted the difficulties of formulating ethics statements for funding applications before study designs are finalized, and provided some practical tips regarding guidance when planning research.

Next, Julian Vuorimäki (WI) guided participants through the handling of research ethics at the Weizenbaum Institute. He focussed on the code of conduct, ombudspersons, guideline for handling research data, and the newly founded review board. The latter is in charge of providing ethics reviews for individual projects and studies, which can be applied for through a questionnaire on the institute website.

*Julian Vuorimäki* presents the principles of good research practice at WI

In the second part of the workshop, three researchers presented practical ethical implications and learnings from research projects. Methods Lab lead Christian Strippel reported on a study where user comments were annotated to allow for the automatic detection of hate speech. He focused on possible misuse for censorship, the confrontation of coders with questionable content, and the challenges of publishing the results and data regarding copyright and framing. Tianling Yang (WI) presented ethical considerations and challenges in qualitative research. The focus lied on consent acquisition, anonymity and confidentiality, power relations, reciprocity (i.e., incentives and support), and the protection of the researchers themselves due to the physical and emotional impact of qualitative field work. Finally, Maximilian Heimstädt (Helmut Schmidt University Hamburg) talked about ambiguous consent in ethnographic research. He gave insights into a study in cooperation with the state criminal police office to predict crime for regional police agencies. Not all individuals in this research could be informed about the research endeavor, especially when the researchers accompanied the police during their shifts, which raised the question of how to find a balance between overt and covert research.

The Methods Labs thanks all presenters and participants for this insightful workshop!

Workshop Recap: Introduction to Programming and Data Analysis with R

April 22, 2024July 24, 2024 Anna Hohwü-Christensen

On April 10th and 11th, The Methods Lab organized the second edition of the workshop Introduction to Programming and Data Analysis with R. Led by Roland Toth from the Methods Lab, the workshop was designed to equip participants with fundamental R programming skills essential for data wrangling and analysis.

Roland Toth introduces participants to data wrangling with R

Across two days, attendees engaged in a comprehensive exploration of R fundamentals, covering topics such as RStudio, Markdown, data wrangling, and practical data analysis. Day one focused on laying the groundwork, covering the main concepts in programming including functions, classes, objects, and vectors. Participants were also familiarized with Markdown and Quarto, enabling them to include analysis results while producing text, and the key steps and techniques of data wrangling.

Participants work on their own research questions during the practical exercise

The first half of the second day was dedicated to showcasing and exploring basic data analysis and various visualization methods. Afterwards, participants had the opportunity to put into practice the knowledge they had gained from the previous day by working with a dataset to formulate and address their own research questions. Roland was on hand to offer assistance and guidance to the participants, addressing any challenges or concerns that arose along the journey.

Christian Strippel presents first results

The workshop fostered a collaborative learning environment, with lively discussions and ample questions from all. We thank all participants for their active involvement!

Workshop Recap: Introduction to Online Surveys

March 4, 2024April 29, 2024 Anna Hohwü-Christensen

The use of online surveys in contemporary social science research has grown rapidly due to their many benefits such as cost-effectiveness and ability to yield insights into attitudes, experiences, and perceptions. Unlike more established methods such as pen-and-paper surveys, they enable complex setups like experimental designs and seamless integration of digital media content. But despite their user-friendliness, even seasoned researchers still face numerous challenges in creating online surveys. To showcase the versatility and common pitfalls of online surveying, Martin Emmer, Christian Strippel, and Roland Toth of the Methods Lab arranged the workshop Introduction to Online Surveys on February 22, 2024.

Martin gave a presentation on the design and logic of online surveys.

In the first segment, Martin Emmer provided a theoretical overview of the design and logic of online surveys. He started by outlining the common challenges and benefits associated with interviewing, with a particular emphasis on social-psychological dynamics. Compared to online surveys, face-to-face interviews offer a more personal, engaging, and interactive experience, enabling interviewers to adjust questions and seek clarification of answers in real time. However, they can be time-consuming and expensive and may introduce biases such as the interviewer effect. On the other hand, the process of conducting online surveys presents its own set of challenges, such as limited control over the interview environment, a low drop-out threshold, and particularities connected with self-administration such as the need for detailed text-based instructions for respondents. Nevertheless, self-administered and computer-administered surveys boast numerous advantages, including cost-effectiveness, rapid data collection, the easy application of visuals and other stimuli, and accessibility to large and geographically dispersed populations. When designing an online survey, Martin stressed the importance of clear question wording, ethical considerations, and robust procedures to ensure voluntary participation and data protection.

Christian shared his insights on survey creation using online access panel providers.

In the second part of the workshop, Christian Strippel delved into the realm of online access panel providers, including the perks and pitfalls associated with utilizing them in survey creation. Panel providers serve as curated pools of potential survey participants managed by institutions, such as Bilendi/Respondi, YouGov, Cint, Civey, and the GESIS Panel. Panel providers oversee the recruitment and management processes, ensuring participants are matched with surveys relevant to their demographics and interests, while also handling survey distribution and data collection. While the use of online panels offers advantages such as accessing a broad participant pool, cost-efficiency, and streamlined sampling of specific sub-groups, they also have their limitations. Online panels are, for example, not entirely representative of the general population as they exclude non-internet users. Moreover, challenges arise from professional respondents such as so-called speeders who rush through surveys, and straight-liners who consistently choose the same response in matrix questions. Strategies to combat these issues include attention checks throughout the questionnaire, systematic exclusion of speeders and straight-liners, and quota-based screening. To conclude, Christian outlined what constitutes a good online panel provider, and shared valuable insights into how to plan a survey using one effectively.

Participants learned how to create their own survey using LimeSurvey during Roland’s live demo.

The third and final segment of the workshop featured a live demonstration by Roland Toth on how to set up an online survey using the open-source software LimeSurvey, which is hosted on the institute’s own servers. During this live demonstration, he created the very evaluation questionnaire administered to the workshop participants at the end of the workshop. Roland began by providing an overview of the general setup and relevant settings for survey creation. Subsequently, he demonstrated various methods of crafting questions with different scales, display conditions, and the incorporation of visual elements such as images. Throughout the demo, Roland addressed issues raised earlier in the first part of the workshop concerning language and phrasing, emphasizing rules for question-wording and why it is important to ask for one piece of information only per question. The live demonstration was wrapped up with a segment on viewing and exporting collected data. After letting the participants complete the evaluation form, the workshop concluded with a Q&A session.

The workshop was held in-person and online.

Workshop: Introduction to Programming and Data Analysis with R (April 10-11, 2024)

February 27, 2024February 28, 2024 Methods Lab

Level: Beginner/Intermediate
Category: Data Analysis

After being well received last year, we’re happy to announce the return of our workshop Programming and Data Analysis with R for its second edition. This two-day intensive workshop led by Roland Toth (WI) will take place on Wednesday, April 10, and Thursday, April 11, at the Weizenbaum Institute.

During the first day, attendees will receive comprehensive training in programming fundamentals, essential data wrangling techniques, and Markdown integration. The second day will center around data analysis, providing participants with the chance to engage directly with a dataset and address a research topic independently. A blend of concepts, coding techniques, and smaller practical tasks will be interspersed throughout both days to reinforce hands-on learning.

For more information, check out the program page!

Workshop: Introduction to Online Surveys

January 22, 2024April 29, 2024 Methods Lab

We are excited to announce the Methods Lab’s first workshop of the year, “Introduction to Online Surveys“, which will take place on Thursday, February 22. This workshop will be conducted both at the Weizenbaum Institute and online, and is open to Weizenbaum Institute members as well as external participants. Led by members of the Methods Lab, Martin Emmer, Christian Strippel, and Roland Toth, the workshop will focus on the use of online surveys in the context of social science research, providing participants with a theoretical foundation as well as a hands-on guide. We will cover aspects such as the logic and design of online surveys, how to work with access panel providers, and demonstrate how to effectively set up an online survey using the versatile survey tool LimeSurvey. Crucial topics such as ethics and data protection will also be discussed.

For detailed information about the workshop, please visit our program page. We look forward to your participation!