Category: A1; Team name: SPAICOM_CattDiN; Dataset: LargeScaleMultipurposeBenchmarkDatasetsWDN #237
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Checklist
Description
This pull requests integrates eleven datasets for Water Distribution Network (WDN) analysis as described in [1]: they are generated synthetically via numerical simulation given well-known configurations.
The eleven datasets are
Anytown,Balerman,C-Town,D-Town,EXN,KY1,KY6,KY8,KY13,L-Town,Modena.Each of these datasets comprise many different
.csvfiles; however, we restricted the interest to the following files containing time-series generated as described in the reference paper:pressure.csvdemand.csvflowrate.csvvelocity.csvhead.csvhead_loss.csvfriction_factor.csvattrs.jsonEach of these files includes a certain number of scenarios, each of which has a certain temporal resolution in terms of subsquent snapshots: the number of scenarios is stored under the key
gen_batch_sizein theattrs.jsonfiles, while the number of time-stamps is stored under the keyduration. The metadata fileattrs.jsonalso contains the graph in terms of adjacency list under the keyadj_list.Water Distribution Networks (WDNs) can be naturally represented as graphs, which has led to extensive use of graph deep learning for a variety of challenging tasks [2].
These applications span both transductive settings-such as estimating the full network state from partial observations at a single time snapshot-and inductive settings, where models leverage spatiotemporal structure to perform tasks like demand forecasting.
Clearly, the datasets poses many possible regression problems that can be cast at node-level, edge-level, and at a combined level.
However, the physical parameters of WDNs are governed by a variety of structural laws that impose higher-order topological constraints on the system [3].
Thus, we believe that topological deep learning could offer a powerful and principled framework for addressing the engineering problems posed by WDN monitoring and analysis.
Issue
We know that spatio-temporal and cross-domain learning is beyond the scope of the current implemented architectures of TopoBench; this in our opinion does not make such a contribution any less relevant, but rather makes it crucial in view of the implementation of new topological models for time series and for physical-informed topological learning.
Moreover, it serves as the first bridge between the topological deep learning and water distribution network (WDN) communities, establishing TopoBench as a practical tool for this type of analysis.
References
[1] Tello A., et al., "Large-Scale Multipurpose Benchmark Datasets For Assessing Data-Driven Deep Learning Approaches For Water Distribution Networks" (2023)
[2] Vittori, G., et al. “Graph neural networks to model and optimize the operation of Water Distribution Networks: A review.” (2025)
[3] Cattai, T, et al. “Physics-Informed Topological Signal Processing for Water Distribution Network Monitoring” (2025)
Co-authored by @TizianaCattai and @LeoDiNino97