Category: A1; Team name: ProofTruth; Dataset: Metamath #249
Checklist
Description
This pull request adds the Metamath proof-graph dataset for the TAG-DS TopoBench Challenge 2025 (Category A1). The dataset is distributed as a single data.pt file hosted on Hugging Face and contains 20,000 graphs derived from the first 10,000 theorems in the ~45k-theorem Metamath database.
This dataset provides a large-scale, real-world benchmark for graph neural networks on formal reasoning, with deep directed acyclic graph (DAG) structure, long-range dependencies, and a highly non-uniform label distribution reflective of real mathematical theorem usage.
Dataset Structure
Each of the 10k theorems appears twice:
1. Theorem Graph (statement only). A small DAG containing:
2. Proof Graph (full derivation). A full proof DAG containing:
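To make the "deep DAG" property concrete, here is a minimal sketch of how one might verify acyclicity and measure depth (longest path, in edges) for a single graph. It assumes the graph is available as a node count plus a list of directed `(source, target)` pairs; the function name `dag_depth` and the toy edge list are hypothetical and not part of the dataset or its loader.

```python
from collections import defaultdict, deque


def dag_depth(num_nodes, edges):
    """Return the longest path length (in edges) of a DAG.

    Uses Kahn's topological sort; raises ValueError if a cycle is found.
    """
    successors = defaultdict(list)
    indegree = [0] * num_nodes
    for src, dst in edges:
        successors[src].append(dst)
        indegree[dst] += 1

    # Start from all nodes that have no incoming edges.
    queue = deque(n for n in range(num_nodes) if indegree[n] == 0)
    depth = [0] * num_nodes
    visited = 0
    while queue:
        node = queue.popleft()
        visited += 1
        for nxt in successors[node]:
            # Longest path to nxt is at least one edge past the path to node.
            depth[nxt] = max(depth[nxt], depth[node] + 1)
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                queue.append(nxt)

    if visited != num_nodes:
        raise ValueError("graph contains a cycle")
    return max(depth, default=0)


# Toy example (not taken from the dataset): 4 nodes, longest path 0 -> 2 -> 3.
toy_edges = [(0, 2), (1, 2), (2, 3), (1, 3)]
print(dag_depth(4, toy_edges))
```

Running a check like this over every graph would be one way to confirm the DAG claim and to report a depth distribution, assuming the stored edge lists can be converted to plain integer pairs.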
Train/Val/Test Split Design
Task: Node-Level Classification
The model must predict, for each node, the justification label (the theorem used at that proof step).
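As an illustration of how justification labels could be turned into class indices for node-level classification, the sketch below builds a label vocabulary with a catch-all UNK entry. This is an assumption for illustration only: the frequency threshold `min_count`, the helper names `build_label_vocab` and `encode`, and the toy labels are hypothetical, and the way the dataset itself assigns its UNK class may differ.

```python
from collections import Counter

UNK = "UNK"


def build_label_vocab(train_labels, min_count=2):
    """Map each sufficiently frequent label to an index; index 0 is UNK."""
    counts = Counter(train_labels)
    vocab = {UNK: 0}
    # most_common() orders labels from most to least frequent.
    for label, count in counts.most_common():
        if count >= min_count:
            vocab[label] = len(vocab)
    return vocab


def encode(labels, vocab):
    """Convert labels to class indices, sending unseen or rare labels to UNK."""
    return [vocab.get(label, vocab[UNK]) for label in labels]


# Toy training labels (illustrative only, not dataset statistics).
train = ["ax-mp", "ax-mp", "syl", "syl", "syl", "a1i"]
vocab = build_label_vocab(train)
print(encode(["syl", "ax-mp", "a1i", "mpbi"], vocab))  # prints [1, 2, 0, 0]
```

With a highly non-uniform label distribution, a fallback class of this kind keeps the output layer bounded while still letting every node receive a target.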
UNK class)
References:
[1] Metamath Official Site - https://us.metamath.org/index.html
[2] Hugging Face Dataset - https://huggingface.co/datasets/jableable/metamath-proof-graphs