Blog – Page 4 – WI Methods Lab

New Publication: Ethics of Data Work

July 9, 2025 Diana Ignatovich

Machine learning is becoming increasingly central to academic research, yet it often depends on data workers in exploitative conditions whose contributions are largely overlooked in ethical guidelines and unacknowledged within the academic community.

Last year, the Methods Lab outlined the aims of a project to target this issue in a short blog post. We’re now excited to announce the resulting published discussion paper: “Ethics of Data Work: Principles for Academic Data Work Requesters.”

This paper builds on the insights of an interdisciplinary group of scholars, practitioners, and data workers, alongside expert workshops held at the Weizenbaum Institute in 2024. It sets out practical principles for engaging more ethically with platform-based data workers, starting with a definition of data work that helps address important gaps in current ethical guidelines. The paper offers concrete recommendations and regulations based on the most pressing challenges faced by these contributors. As the rapid development of AI continues to rely on the insight and labor of real people, it’s crucial to reflect on how research is conducted to ensure those workers receive proper acknowledgment for their role. This discussion paper calls for commitment to fair treatment, transparency, and meaningful support to make ethical data work a consistent part of the machine learning research process.

If you would like to learn more about the experiences and working conditions of these data workers, check out our blog post featuring creative projects from the Data Workers’ Inquiry!

New Publication: Extracting smartphone use from Android event log data

May 30, 2025 Diana Ignatovich

Back in October 2024, the Methods Lab shared a preprint of a study by Methods Lab member and data scientist Roland Toth and former research fellow Douglas Parry, exploring how to isolate meaningful measures of smartphone use from Android event log data. We’re now pleased to announce that this work has been peer-reviewed and published in the journal Computational Communication Research.

The article titled “Extracting Meaningful Measures of Smartphone Usage from Android Event Log Data: A Methodological Primer” outlines a practical and reproducible step-by-step guide for deriving objective indicators of human usage from raw mobile data, offering valuable insights for research in social science and related disciplines. It details the extraction of key usage metrics through written explanations, visual aids, and pseudo-code. The paper is a vital resource for researchers seeking to understand patterns of mobile phone engagement and its implications in today’s rapidly evolving digital environment.
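
To give a rough sense of what deriving a usage indicator from raw event logs can look like, here is a minimal Python sketch. It is not the procedure from the article: the event labels `SCREEN_ON` and `SCREEN_OFF`, the `Event` class, and both function names are hypothetical, and real Android logs contain many more event types and edge cases.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    timestamp: float  # seconds since some reference point
    kind: str         # hypothetical labels: "SCREEN_ON" or "SCREEN_OFF"


def usage_sessions(events):
    """Pair each SCREEN_ON with the next SCREEN_OFF and return (start, end) tuples."""
    sessions = []
    start = None
    for event in sorted(events, key=lambda e: e.timestamp):
        if event.kind == "SCREEN_ON" and start is None:
            start = event.timestamp
        elif event.kind == "SCREEN_OFF" and start is not None:
            sessions.append((start, event.timestamp))
            start = None
        # Any other event (e.g. an unmatched SCREEN_OFF) is ignored.
    return sessions


def total_usage_seconds(events):
    """Sum the duration of all complete sessions."""
    return sum(end - start for start, end in usage_sessions(events))


log = [
    Event(0.0, "SCREEN_ON"),
    Event(90.0, "SCREEN_OFF"),
    Event(300.0, "SCREEN_ON"),
    Event(330.0, "SCREEN_OFF"),
    Event(400.0, "SCREEN_OFF"),  # unmatched, so it is skipped
]

print(usage_sessions(log))
print(total_usage_seconds(log))
```

The sketch only counts complete on/off pairs, which is one of several possible ways to handle incomplete sessions.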

Workshop: Introduction to MAXQDA

May 13, 2025 Methods Lab

Join us for the workshop Introduction to MAXQDA, designed for all researchers, students, and professionals interested in qualitative data analysis. On May 28th, 2025, at the Weizenbaum Institute, certified MAXQDA trainer Dr. phil. Aikokul Maksutova will lead a basic yet comprehensive workshop introducing the software’s core features, aligning with the key stages of digital qualitative research.

This event will offer guidance on MAXQDA’s essential tools for documenting, coding, and analyzing qualitative data. Participants will become familiar with navigating the Code System and a range of additional features, such as functions for exporting data, linking memos, and generating visualizations. Each segment will include hands-on activities using various datasets, enabling participants to confidently apply the skills they’ve learned on their own.

To conclude, special guest and representative of MAXQDA, Ms. Tamara Pataki, will inform participants of the software’s latest innovations and host an open Q&A session.

To learn more, please visit our program page. We hope to see you there!

DeZIM Summer School 2025

May 5, 2025 Diana Ignatovich

For those interested in strengthening their skills in social research methods, we’re pleased to announce that registration is now open for the DeZIM Summer School 2025 (Deutsches Zentrum für Integrations- und Migrationsforschung).

Running from August 12 to 14, the program is free and open to all, offering workshops in both qualitative and quantitative methods. Courses are designed for participants ranging from beginners to advanced, and all are welcome to join multiple sessions. However, space is limited, so we encourage early registration by completing this survey before the deadline on June 30, 2025.

As part of the ongoing collaboration between DeZIM and the Weizenbaum Institute, both institutions share access to each other’s workshops. Through this partnership, we aim to create more opportunities for researchers to develop and strengthen their methodological expertise.

To learn more about additional upcoming workshops, check out our Methods Ticker!

Workshop Recap: Introduction to Programming and Data Analysis with R

April 3, 2025 (updated April 15, 2026) Diana Ignatovich

The third edition of the Introduction to Programming and Data Analysis with R workshop took place on March 12th and 13th, 2025. Roland Toth of the Methods Lab at the Weizenbaum Institute introduced almost 20 participants to essential methods of data analysis, covering fundamental R programming concepts and techniques.

Roland asks participants about their former experience with programming

On the first day, Roland guided participants through the basics of R syntax and its integration with Markdown/Quarto in an interactive environment. This included the very basics of programming, like functions, objects, and indexing, as well as data-related practices like data wrangling, sanity checks, and simple statistical analyses. Participants also learned how to handle the warnings and errors that can stall coding work in a project.

On day two, after an introduction to data visualization techniques, participants put their learning into practice: They explored provided survey data and developed a research question, so they could prepare and statistically analyze the data accordingly in R. The result was a reproducible HTML report on the reasoning behind the research question, all data wrangling steps, an exploration of the data set, the analysis, and the results including an interpretation. Attendees also supported each other’s progress whenever possible, while Roland offered personalized guidance.

The workshop alternated between lecture-like and interactive formats

The workshop concluded with a thorough review of useful functions and packages in R. Throughout the event, participants were encouraged to ask questions freely and frequently, and they took the opportunity. The Methods Lab would like to thank all guests for their attendance and lively participation!

Career Tutorial: LLMs for all Expertise Levels (March 7, 2025)

February 19, 2025 Methods Lab

In a joint effort, the Career Development and the Methods Lab are excited to announce the hybrid “Career Tutorial on LLMs for all Expertise Levels”. In this tutorial, beginning with fundamental concepts of LLMs and in-context learning, we’ll address the “Needle in the Haystack Problem” and compare ultra-long context models with RAG approaches. Through practical demonstrations, participants will gain hands-on experience with RAG’s core functionalities and understand its objectives. The session delves into scaling solutions using vector databases and advanced implementations, including chunking strategies, hybrid RAG, and graph-based RAG architectures. We conclude with an overview of emerging trends, examining agentic RAG and the integration of reasoning models in deep research applications. This comprehensive exploration equips attendees with both theoretical knowledge and practical insights into the latest developments in AI language models.
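
For readers new to RAG, the retrieval step can be sketched in a few lines of Python. This is an illustrative toy and not material from the tutorial: word counts stand in for learned embeddings, a plain list stands in for a vector database, splitting on periods stands in for a real chunking strategy, and no language model is called. The names `tokenize`, `cosine`, `retrieve`, and `build_prompt` are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, chunks, k=1):
    """Return the k chunks most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        chunks, key=lambda c: cosine(q, Counter(tokenize(c))), reverse=True
    )
    return ranked[:k]


def build_prompt(query, chunks, k=1):
    """Place the retrieved chunks in front of the question."""
    context = "\n".join(retrieve(query, chunks, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


document = (
    "Git tracks changes to files. "
    "Vector databases store embeddings for fast similarity search. "
    "MAXQDA supports qualitative coding."
)
# Naive chunking: one chunk per sentence.
chunks = [s.strip() for s in document.split(".") if s.strip()]
query = "How do vector databases search embeddings?"

print(build_prompt(query, chunks))
```

In a real pipeline, the prompt built this way would be passed to a language model, so the model answers from the retrieved passage rather than from the entire document.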

For more information, visit our program page. We are looking forward to your participation!

Workshop Recap: Introduction to Git

February 17, 2025 (updated March 11, 2025) Diana Ignatovich

On February 6th, 2025, LK Seiling facilitated an Introduction to Git workshop, with support from Sascha Kostadinoski and Quentin Bukold. The workshop was co-organized by the Methods Lab and took place at the Weizenbaum Institute. The hybrid event gave about 30 participants a thorough overview of the foundations of Git and its related platforms.

First, Seiling introduced Git and explained why it matters. They explored the qualities of its version control system and the advantages of efficiently managing changes to files. Because the software is free and open source, it is widely used and easily accessible. At its core, Git enables collaborative work by allowing multiple participants to adjust files concurrently, and it offers a system to track the changes made without requiring alterations to the original file.

Next, participants were invited to open the Terminal and guided through some basic commands. To this end, commands for traversing directories, creating, moving, organizing, and deleting files were explained and demonstrated in detail.

LK Seiling explains how to stage and commit changes

In the second hour of the workshop, Seiling encouraged participants to apply these basics by imagining a classic Python project, one that might require collaborative engagement. Here, Python scripts were saved, renamed, and staged, with attention to commit messages and Git configuration. The principal Git practices were emphasized to remind the audience of when and how to commit changes to the previously specified local repository. Furthermore, Seiling prepared guests to make requests when merging work, added description templates for joint projects, and generally taught the features useful for group collaboration.

This was followed by instructions on the key functionality of Git, such as the Git repository, Git commands, branches, and conflict resolution. For instance, the section on branches showed how work can proceed in parallel, separately from the main code base. This is especially beneficial for feature development and also helps streamline the process of reviewing changes before merging. Throughout this instruction, commands for switching branches and merging scripts in the terminal were demonstrated with a quickly constructed example. Seiling also provided necessary information on managing repositories, including visuals of the basic workflow and the link between local and remote repositories, for either individual or collaborative work.
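
The idea behind merging branches and detecting conflicts can be illustrated with a toy model in Python. This is a simplified sketch and not how Git is implemented: it compares whole files stored in dictionaries, whereas Git merges at the level of lines within files. The function name `three_way_merge` and the example file names are hypothetical.

```python
def three_way_merge(base, ours, theirs):
    """Merge two {filename: content} versions against their common ancestor.

    Returns the merged files and a list of file names that conflict.
    """
    merged, conflicts = {}, []
    for name in sorted(set(base) | set(ours) | set(theirs)):
        b, o, t = base.get(name), ours.get(name), theirs.get(name)
        if o == t:        # both branches agree (or neither changed the file)
            result = o
        elif o == b:      # only the other branch changed it
            result = t
        elif t == b:      # only our branch changed it
            result = o
        else:             # both changed it in different ways
            conflicts.append(name)
            continue
        if result is not None:  # None means the file was deleted
            merged[name] = result
    return merged, conflicts


base = {"analysis.py": "v1", "README": "intro"}
main = {"analysis.py": "v1", "README": "intro + setup"}
feature = {"analysis.py": "v2", "README": "intro", "plot.py": "new"}

merged, conflicts = three_way_merge(base, main, feature)
print(merged)
print(conflicts)

# If both branches edit the same file differently, the merge reports a conflict.
clashing = {"analysis.py": "v3", "README": "intro"}
_, clash_conflicts = three_way_merge(base, clashing, feature)
print(clash_conflicts)
```

Changes made on only one branch are taken automatically, and only files changed differently on both branches are left for a person to resolve.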

For those curious about when to use which Git platform, Bukold jumped in to detail the major differences between GitHub, GitLab, and Git.

Terminal commands are used to perform actions with Git

Later, Seiling explored some advantageous elements of the GitLab platform, accessible free of charge to Weizenbaum researchers, by describing the repository graph, issue tracking, and project management tools. The repository graph arranges branches to show merges and commits, making it clear how each participant has contributed or made changes, which is particularly relevant for collaborative code projects. If the software malfunctions, the issue tracking feature shows who is working on which branch and how far along the fix is. Finally, GitLab’s management tool was outlined for assigning work, applying tags to signal when projects are finished, and opening or closing issues.

To close, Kostadinoski briefly summarized the basic elements of Git, along with its implications in data work, such as for software development and research. He simplified key terms and welcomed questions in a Q&A. Seiling joined in, encouraging participants to “learn by doing” and stay connected with each other via Weizenbaum-associated GitHub accounts for future internal coordination.

Throughout this workshop, participants were presented with various tasks and benefited from frequent recaps that highlighted key points, ensuring a solid understanding of the material. Attendees both online and in person freely asked questions and received support from instructors. The Methods Lab would like to say a huge thank you to LK Seiling, Sascha Kostadinoski, and Quentin Bukold for their clear instruction on the foundations of Git and for facilitating such an engaging environment for all participants.

Workshop: Social Science and Language Models (April 3–4, 2025)

February 11, 2025 (updated February 26, 2025) Methods Lab

The Weizenbaum research groups “Digital Economy, Internet Ecosystem, and Internet Policy” (Jan Batzner) and “Data, Algorithmic Systems and Ethics” (Dr. Fatma Elsafoury), supported by Fraunhofer FOKUS and TU Berlin, with contributors Zeerak Talat and Flor Miriam Plaza del Arco, are excited to introduce the workshop “Social Science and Language Models – Methods and theory to responsible research on and with Language Technologies” taking place on April 3–4, 2025 at the Weizenbaum Institute. This hybrid event encourages interdisciplinary collaboration to promote ethically responsible research in the application of natural language technology. As methods based on language models are increasingly applied in contexts ranging from social science and health care to software development, research points to a growing need to monitor potentially biased outcomes of their use. However, the lack of shared understanding between social science researchers and those in Natural Language Processing (NLP) perpetuates discrimination, as biases in the conception and measurement of socio-technical systems often go unrecognized.

We therefore hope to engage a diverse group of researchers working on methodology in the social or economic sciences to address this prejudice in language technologies. Abstract submissions are encouraged to address the measurement and mitigation of bias in NLP, as well as its implications for the social sciences.

This event is open for the Qualification program in digitalization research (Module 2; specialization).

For more information, visit our program page. We are looking forward to your participation!

Recap: Second Networking Event for Digitalization Research in Berlin

February 6, 2025 (updated February 11, 2025) Methods Lab

After the first networking event in 2024, the Methods Lab at the Weizenbaum Institute and the Interdisciplinary Center for Digitality and Digital Methods at Humboldt University Berlin (IZ D2MCM) organized a second networking event on January 24, 2025. As with the initial event, the participants were members of various institutions, institutionalized teams, and centers actively engaged in digital research within the humanities, social sciences, and cultural studies in the Berlin area. While there were some familiar faces, there were also some newcomers.

In the first part of the event, participants discussed their experiences with networking strategies in a speed-dating format. Participants were rotated every few minutes to create different pairings. Each conversation was documented by a member of the organizing team. Participants highlighted the importance of networking within their own institutions, attending regularly organized events to formalize informal connections, pooling resources, and implementing cross-institutional research projects.

Melanie Althage (IZ D2MCM) guides participants through the new calendar system

In the second part of the event, colleagues from IZ D2MCM presented participants with a calendar system they developed. Its purpose is to consolidate events occurring at the network institutions into a single platform, making them accessible to all members. The system was then discussed in two groups. In one group, participants exchanged ideas on the design and admission criteria for events, considering aspects such as content, format, and location. In the other group, participants focused on facilitating the technical implementation, which operates through Git and enables network members to submit event metadata in a structured format.
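
As an illustration of what submitting event metadata in a structured format could involve, here is a minimal Python sketch that checks a record before it is added to a shared calendar. The field names (`title`, `start`, `institution`, `format`) and the function `validate_event` are assumptions made for this example and do not describe the actual system developed by IZ D2MCM.

```python
from datetime import datetime

# Hypothetical required fields; the real calendar schema may differ.
REQUIRED_FIELDS = ("title", "start", "institution", "format")


def validate_event(record):
    """Return a list of problems found in an event record (empty if valid)."""
    problems = [
        f"missing field: {field}"
        for field in REQUIRED_FIELDS
        if not record.get(field)
    ]
    start = record.get("start")
    if start:
        try:
            datetime.fromisoformat(start)
        except ValueError:
            problems.append(f"start is not an ISO 8601 date-time: {start}")
    return problems


good = {
    "title": "Networking Event",
    "start": "2025-01-24T14:00",
    "institution": "Weizenbaum Institute",
    "format": "in person",
}
bad = {"title": "Workshop", "start": "24.01.2025", "format": "hybrid"}

print(validate_event(good))
print(validate_event(bad))
```

A check like this could run automatically whenever a network member submits a new event through Git, so malformed entries are caught before they reach the calendar.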

The Methods Lab would like to thank the IZ D2MCM and all participants for their contributions to this successful event. Stay tuned for the next one!

Workshop: Introduction to Programming and Data Analysis with R (March 12-13, 2025)

January 28, 2025 (updated January 30, 2025) Methods Lab

After another successful run, the Methods Lab is excited to bring back the third annual Programming and Data Analysis with R workshop, led by Roland Toth (WI). This two-day event will be held at the Weizenbaum Institute on Wednesday, March 12th, and Thursday, March 13th.

On the first day, participants can expect a comprehensive introduction to the fundamentals of programming, essential data wrangling techniques, and Markdown integration. The second day emphasizes data analysis and includes hands-on work with datasets, enabling attendees to independently explore a relevant research topic. Throughout both days, participants will be presented with conceptual knowledge, coding techniques, and basic subtasks for a practical and immersive learning experience.

For more information, check out the program page!