Category: [A1]; Team name: MaAnCla; Dataset: Dynamical Activity Complex (DAC) #243
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Checklist
Description
This pull request adds support to the Dynamical Activity Complex (DAC) datasets from the paper [1]. These datasets contain structural brain directed graphs and neuronal activations in response to different stimuli.
Specifically, the PR implements a dataset and a dataloader class, considers 3 different datasets (named
4-325-1,4-325-3,4-325-5, corresponding, respectively, to the datasetsM=1,M=3,M=5in [1]) and loads the predefined 5 splits from [1] (which can be selected via the configuration parametersplit_num).Dataset description
The DAC digraphs are derived from the Blue Brain Project's reconstruction of a microcircuit in the somatosensory cortex of a rat [2]. For a specific graph, each node corresponds to a neuron, directed edges correspond to neuronal connections and node features (of shape$T$ ) contain the activation state of a neuron (0 or 1) over $T$ time bins. Activations are obtained by injecting one of eight distinct synaptic input patterns delivered via thalamocortical afferents, and the dataset task consists in classifying which of the eight stimula the observed activations are associated to. To generate different graphs, subgraphs of different size and the associated activations are subsampled, resulting in the 3 datasets of this PR.
Issue
Neuronal activations are naturally modeled by networks and generate higher-order (directed) patterns informative of brain dynamics, making these datasets of high interest for the TDL community.
References
[1] Lecha, Manuel, et al. "Directed Semi-Simplicial Learning with Applications to Brain Activity Decoding." arXiv preprint arXiv:2505.17939 (2025).
[2] Markram, Henry, et al. "Reconstruction and simulation of neocortical microcircuitry." Cell 163.2 (2015): 456-492.