GitHub - onedeeper/EEGLearn: A python package for quickly preprocessing the TD-Braindataset.

Watch the video

https://www.youtube.com/watch?v=EH4AaAEidVw

Installation

Clone the repository

git clone https://github.com/yourusername/eeg-graph-learning.git \

cd eeg-graph-learning

Create and activate the conda environment

conda env create -f environment.yml
conda activate eeg-graph-learning

Install the package in development mode
pip install -e .

Project Structure

The repository is organized as follows:

clean.sh: Shell script to automate the preprocessing pipeline execution
environment.yml: Conda environment configuration
eeglearn/: Main package containing all the code
- preprocess/: EEG preprocessing modules
- features/: Feature extraction modules
- utils/: Utility functions
data/: Directory for storing datasets (not included in the repository)
tests/: Unit tests
notebooks/: Jupyter notebooks for exploration and visualization
experiments/: Experiment configurations and results
scripts/: Utility scripts

Data Setup

This project uses the TD-Brain dataset, which requires a Data Usage Agreement (DUA).

Apply for access to the TD-Brain dataset : https://www.brainclinics.com/resources/tdbrain-dataset
Once approved, download the dataset. You may use the sample dataset (~7Gb) or the full dataset (~100 gb) with this code.
Add the files into the data directory following the structure below:

Place the participant metadata file in the following location:
```
data/TDBRAIN_participants_V2.xlsx
```
This file contains important metadata about participants and is used for various analyses.
You can run the preprocessing pipeline in two ways:

a. Using the clean.sh script (recommended):

For macOS/Linux:
```
# Make the script executable (if not already)
chmod +x clean.sh

# Run the preprocessing script
./clean.sh
```
For Windows:

Option 1 - Using Git Bash or WSL (Windows Subsystem for Linux):
```
# Make the script executable (if not already)
chmod +x clean.sh

# Run the preprocessing script
./clean.sh
```
Option 2 - Using Command Prompt or PowerShell:
```
# Install Git Bash or WSL first, then follow the instructions above
# Alternatively, you can run the Python script directly:
conda activate eeg-graph-learning
python -m eeglearn.preprocess.preprocess_pipeline
```
The script will automatically:
- Detect your operating system
- Check if the conda environment exists and create it if needed
- Activate the conda environment
- Run the preprocessing pipeline
The script is designed to minimize verbose output, showing only essential progress information. If you need to debug issues, you can edit the script and uncomment the set -x line to enable command echoing.

b. Manually:
- Navigate to eeglearn/preprocess
- Run python preprocess_pipeline.py
Parameters like sampling frequency, epoch length, and line noise removal can be configured in the preprocess_pipeline.py file.

Testing

Test Configuration

The test suite can use either synthetic data (generated automatically) or your own EEG data files for testing. By default, it will create synthetic test data, but you can configure it to use your own EEG data files.

Using the Test Environment Setup Script

We provide a script to easily set up all required test environment variables:

# Run the setup script (must be sourced to persist variables)
source setup_test_env.sh

This script sets the following environment variables:

EEG_TEST_FILE_PATH: Path to the raw EEG test file (CSV)
EEG_CLEANED_TEST_FILE: Path to a preprocessed EEG test file (NPY)
EEG_TEST_CLEANED_FOLDER_PATH: Path to the directory containing cleaned test data
EEG_TEST_DERIVATIVES_DIR: Path to the derivatives directory containing raw test data

You should customize the paths in this script to match your local environment:

Open setup_test_env.sh in your editor
Update the paths for each variable to point to your test data
Save the file and run it with source setup_test_env.sh

Manual Environment Variable Setup

If you prefer to set the environment variables manually:

# Bash/Zsh
export EEG_TEST_FILE_PATH="/path/to/your/eeg/test/file.csv"
export EEG_CLEANED_TEST_FILE="/path/to/your/eeg/test/file.npy"
export EEG_TEST_CLEANED_FOLDER_PATH="/path/to/cleaned/"
export EEG_TEST_DERIVATIVES_DIR="/path/to/derivatives/"

Test Data Structure

Place test files in the following locations:

Cleaned .npy file:

tests/test_data/data/cleaned/{subject_id}/ses-1/eeg/your_file.npy

Raw CSV file:

tests/test_data/TDBRAIN-dataset/derivatives/{subject_id}/ses-1/eeg/your_file.csv

The test file should be a CSV file with the following characteristics:

Channels as columns
Time points as rows
33 channels (26 EEG + 7 other)
Sampling frequency of 500 Hz

If the environment variables are not set or the files don't exist, the test suite will automatically generate synthetic test data.

Running Tests

To run the test suite:

# Run all tests
pytest

# Run specific test file
pytest tests/test_preprocessing.py

# Run tests with verbose output
pytest -v

Troubleshooting

Cross-Platform Issues

Windows-specific Issues

Line Ending Problems: If you encounter errors related to line endings when running the shell script on Windows, you may need to convert the line endings from CRLF to LF:
```
# Using Git
git config --global core.autocrlf false
# Then re-clone the repository or run:
dos2unix clean.sh
```
Conda Activation Issues: If conda activation fails in the script:
- Ensure Anaconda/Miniconda is properly installed and added to your PATH
- Try running the commands manually:
```
conda activate eeg-graph-learning
python -m eeglearn.preprocess.preprocess_pipeline
```
WSL Recommendations: For the best experience on Windows, we recommend using Windows Subsystem for Linux (WSL2) with Ubuntu, which provides a more Linux-like environment.

macOS-specific Issues

Permission Issues: If you encounter permission issues:
```
chmod +x clean.sh
```
Conda Command Not Found: If conda is not recognized:
- Ensure you've initialized conda in your shell:
```
conda init zsh  # or bash, depending on your shell
```
- Restart your terminal after initialization

Linux-specific Issues

Missing Dependencies: If you encounter missing system dependencies:

# For Ubuntu/Debian
sudo apt-get update
sudo apt-get install -y build-essential

# For CentOS/RHEL
sudo yum groupinstall "Development Tools"

For any other issues, please open an issue on the GitHub repository.

Enabling Verbose Output for Debugging

If you encounter issues with the preprocessing pipeline and need more detailed output for debugging:

Edit the clean.sh script and add the following line after set -e:
```
# Enable command echoing for debugging
set -x
```

Remove the redirection to /dev/null from the conda commands if you need to see conda's output:

# Change this:
eval "$(conda shell.bash hook)" > /dev/null 2>&1

# To this:
eval "$(conda shell.bash hook)"

Run the script again to see detailed output of each command execution.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
eeglearn		eeglearn
notebooks		notebooks
scripts		scripts
tests		tests
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
clean.sh		clean.sh
data_dir_structure.png		data_dir_structure.png
environment.mac.yml		environment.mac.yml
environment.yml		environment.yml
pyproject.toml		pyproject.toml
setup.py		setup.py
setup_conda_env.sh		setup_conda_env.sh
setup_conda_env_mamba.sh		setup_conda_env_mamba.sh
setup_test_env.sh		setup_test_env.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Watch the video

Installation

Clone the repository

Create and activate the conda environment

Project Structure

Data Setup

Testing

Test Configuration

Using the Test Environment Setup Script

Manual Environment Variable Setup

Test Data Structure

Running Tests

Troubleshooting

Cross-Platform Issues

Windows-specific Issues

macOS-specific Issues

Linux-specific Issues

Enabling Verbose Output for Debugging

About

Uh oh!

Releases

Packages

Languages

onedeeper/EEGLearn

Folders and files

Latest commit

History

Repository files navigation

Watch the video

Installation

Clone the repository

Create and activate the conda environment

Project Structure

Data Setup

Testing

Test Configuration

Using the Test Environment Setup Script

Manual Environment Variable Setup

Test Data Structure

Running Tests

Troubleshooting

Cross-Platform Issues

Windows-specific Issues

macOS-specific Issues

Linux-specific Issues

Enabling Verbose Output for Debugging

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages