DigitalBiomarkerDiscoveryPipeline/Exploratory-Data-Analysis

 
 

Repository files navigation

Objectives: Exploratory Data Analysis is a standard process in the early stages of digital biomarker development. EDA allows us to explore relationships between variables in the data, examine trends, analyze missingness of data, and begin the process of understanding the link between the data and the physiological state we are studying.

Input: .csv file with entire dataset.

Output: Figures for EDA (after filtering all the NULL data)

Functions: This repository currently contains the following functions.

FunctionREADME
makehistPlot histograms of all variables in data
makeboxPlot boxplot of all variables in data
makeleafPlot leafplot of all variables in data
makebubblePlot bubble chart of all variables in data
makerunPlot run chart of all variables in data
makemultivariatePlot multivariate chart of all variables in data
makescatterPlot scatterplot of all variables in data

Publications:


  • MissingDataAnalysis/ - a collection of analyses for exploring missingness of data
  • ExploratoryDataAnalysis.ipynb - a general, all purpose EDA notebook for analyzing longitudinal wearable data with outcomes

Sources: we use STEP Data (link: https://physionet.org/content/bigideaslab-step-hr-smartwatch/1.0/) for the EDA analysis

About

Tools for exploratory data analysis of wearables and mHealth data.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages

  • Jupyter Notebook 93.8%
  • R 6.2%