Skip to content

Commit 902ec25

Browse files
committed
add DOI to README and CITATION file
1 parent f273a0b commit 902ec25

File tree

2 files changed

+11
-3
lines changed

2 files changed

+11
-3
lines changed

CITATION.cff

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,12 @@ authors:
1919
family-names: Bock
2020
orcid: 'https://orcid.org/0000-0001-6091-3088'
2121
affiliation: CeMM Research Center for Molecular Medicine
22+
identifiers:
23+
- type: doi
24+
value: 10.5281/zenodo.8405360.
25+
description: >-
26+
This DOI represents all versions, and will always
27+
resolve to the latest one.
2228
repository-code: 'https://github.com/epigen/unsupervised_analysis'
2329
url: 'https://epigen.github.io/unsupervised_analysis/'
2430
abstract: >-

README.md

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,9 +1,11 @@
1+
[![DOI](https://zenodo.org/badge/475465311.svg)](https://zenodo.org/badge/latestdoi/475465311)
2+
13
# Unsupervised Analysis Workflow
24
A general purpose [Snakemake](https://snakemake.readthedocs.io/en/stable/) workflow to perform unsupervised analyses (dimensionality reduction and cluster analysis) and visualizations of high-dimensional data.
35

46
This workflow adheres to the module specifications of [MR.PARETO](https://github.com/epigen/mr.pareto), an effort to augment research by modularizing (biomedical) data science. For more details and modules check out the project's repository.
57

6-
**If you use this workflow in a publication, please don't forget to give credit to the authors by citing it using this DOI [coming soon]().**
8+
**If you use this workflow in a publication, please don't forget to give credit to the authors by citing it using this DOI [10.5281/zenodo.8405360](https://doi.org/10.5281/zenodo.8405360).**
79

810
![Workflow Rulegraph](./workflow/dags/rulegraph.svg)
911

@@ -90,7 +92,7 @@ We validated/analyzed the clustering results by comparing them with all categori
9092
**Cluster Validation - Internal Indices & MCDM using TOPSIS**
9193
We performed internal cluster validation using six complementary indices: Silhouette, Calinski-Harabasz, C-index, Dunn index, Davis-Bouldin Score from the clusterCrit package (ver) [ref], and a weighted Bayesian Information Criterion (BIC) approach as described in [Reichl 2018 - Chapter 4.2.2 - Internal Indices](https://repositum.tuwien.at/handle/20.500.12708/3488). Due to computational cost, PCA results representing 90% of variance explained were used as input, and only a random sample proportion of [sample_proportion] was used. These internal cluster indices are linear, using Euclidean distance metrics. To rank all clustering results and [metadata_of_interest] from best to worst, we applied the Multiple-criteria decision-making (MCDM) method TOPSIS from the the Python package pymcdm (ver) [ref] to the internal cluster indices, as described in [Reichl 2018 - Chapter 4.3.1 - The Favorite Approach](https://repositum.tuwien.at/handle/20.500.12708/3488).
9294

93-
**The analysis and visualizations described here were performed using a publicly available Snakemake [ver] (ref) workflow [DOI]().**
95+
**The analysis and visualizations described here were performed using a publicly available Snakemake [ver] (ref) workflow [10.5281/zenodo.8405360](https://doi.org/10.5281/zenodo.8405360).**
9496

9597

9698
# Features
@@ -225,7 +227,7 @@ Leiden algorithm for clustering
225227
# Links
226228
- [GitHub Repository](https://github.com/epigen/unsupervised_analysis/)
227229
- [GitHub Page](https://epigen.github.io/unsupervised_analysis/)
228-
- [Zenodo Repository]()
230+
- [Zenodo Repository](https://doi.org/10.5281/zenodo.8405360)
229231
- [Snakemake Workflow Catalog Entry](https://snakemake.github.io/snakemake-workflow-catalog?usage=epigen/unsupervised_analysis)
230232

231233
# Resources

0 commit comments

Comments
 (0)