Category: A2; Team name: IgPa; Dataset: ConjugatedMoleculeDataset #245
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Checklist
Description
This PR adds three conjugated molecule datasets (OCELOTv1, OPV, PCQM4MV2) to TopoBench for hypergraph-based molecular property prediction tasks.
This implementation adapts the work from "Molecular Hypergraph Neural Networks" to the TopoBench framework. It ports the dataset preprocessing and hypergraph construction logic from the original mhnn repository while ensuring full compatibility with TopoBench's standardized data loaders and model interfaces.
Datasets Added
Hypergraph structure
Conjugated bonds are natural higher-order structures that occur in molecules. The idea of representing molecules with conjugated bonds as hypergraphs first appeared, to the knowledge of authors, in Molecular Hypergraph Neural Networks.
In organic materials, conjugated systems determine key properties such as conductivity or reactivity. Higher-order representations capture electron delocalization that pairwise graphs cannot. Therefore this PR enables benchmarking such hypergraphs within the framework of TopoBench. This supports research in drug discovery and materials science.
References