What do people do online?
Using data donation to understand digital behavior.
a workshop at the SPP Junior Researcher Meeting
October 22, 2025
The researcher’s perspective.
🤔 What are methodological decisions researchers have to take in data donation studies?
|
|
|
|
|
Survey | Request & Download Data | Extract Data | Inspect Data | Consent |
📢 Task: Try it yourself.
Take a look at your downloaded data. What do you see; anything caught your eye?
Feel free to work in groups of 2-3 people for 5 minutes.
Different degrees in standardization for DDP content (Hase et al. 2024)…
subscriptions.csv (before processing)
def extract_youtube_content_from_zip_folder(zip_file_path, possible_filenames):
"""Extract content from YouTube data export zip file using filenames"""
try:
with zipfile.ZipFile(zip_file_path, "r") as zip_ref:
# Get the list of file names in the zip file
filenames = zip_ref.namelist()
# Look for matching files
for possible_filename in possible_filenames:
for filename in filenames:
if possible_filename in filename:
try:
# Process based on file extension
if filename.endswith(".json"):
with zip_ref.open(filename) as json_file:
json_content = json.loads(json_file.read())
return json_content
elif file_name.endswith(".csv"):
with zip_ref.open(file_name) as csv_file:
csv_content = pd.read_csv(csv_file)
def extract_subscriptions(subscriptions_csv):
"""Extract YouTube channel subscriptions"""
# Define column name
if "Kanaltitel" in subscriptions_csv.columns:
channel_column = "Kanaltitel"
else:
channel_column = "Channel Title"
# Define description
channel_name = "Subscribed Channel"
# Create DataFrame with just the channel names
subscriptions_df = pd.DataFrame({channel_name: subscriptions_csv[channel_column]})
return subscriptions_df
subscriptions.csv (after processing)
🙃 Thank you for participating; happy to talk with you about data donation (and anything else) throughout the next days.
For example …
🤔 What do you think: Which participant characteristics may correlate with non-response or non-compliance?
🤔 Questions?
Data Donation Workshop - Frieder & Sebastian