Skip to content

Code for cleaning, QC, and preprocessing physiological signals (EDA, BVP, TMP) from a large public dataset (OSF). Includes baseline/video handling, ML classification, and analysis scripts. Part of Girgis et al. (in submission), supporting reproducible research with these data.

License

Notifications You must be signed in to change notification settings

BIAPT/LargeScaleDatasetAffectiveScience

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

LargeScaleDatasetAffectiveScience

QC & ML Pipeline for Physiological Signal Dataset

This repository contains code for quality control (QC), preprocessing, and machine learning analysis of a large physiological signal dataset (EDA, BVP, TMP). It includes:

QC & preprocessing scripts: Clean signals, handle baseline/video conditions, normalize, and output processed CSVs.

ML pipeline: Trains Random Forest, XGBoost, SVM, and a VotingClassifier ensemble on pre-extracted features. Performs feature selection (RFE), SMOTE balancing, baseline normalization, train/validation/test splits, and evaluation with classification reports, confusion matrices, and bootstrapped F1 confidence intervals.

A Filepaths.csv is included for easy configuration of input/output directories. Participant_videos.csv also provided to help with cleaning code.

This work is part of Girgis et al. (in submission) and supports reproducible research via OSF.

Please visit https://osf.io/pn9rq/?view_only=25f185f049434b18a7356d85e7f81d5c to access the dataset files.

About

Code for cleaning, QC, and preprocessing physiological signals (EDA, BVP, TMP) from a large public dataset (OSF). Includes baseline/video handling, ML classification, and analysis scripts. Part of Girgis et al. (in submission), supporting reproducible research with these data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages