Also available here for better readability / zooming functionalities.

Figure X. Comprehensive overview of the experimental workflow for simulating and evaluating species distribution models using virtual species.
Also available here for better readability / zooming functionalities.

Feature scaling (Min-Max-Normalisierung):
- x is an original value
- x' is the normalized value
- min(x) lower bound of target range (for AUC 0.5)
- max(x) upper bound of target range (for AUC 1)
| Metric | Baseline | Min | Max | Higher Better? |
|---|---|---|---|---|
| AUC | 0.5 | 0 | 1 | Yes |
| COR | - | -1 | 1 | Yes |
| Spec | - | 0 | 1 | Yes |
| Sens | - | 0 | 1 | Yes |
| Kappa | - | -1 | 1 | Yes |
| PCC | - | 0 | 1 | Yes |
| TSS | 0 | -1 | 1 | Yes |
| PRG | 0.5 | 0 | 1 | Yes |
| MAE | - | 0 | 1 | No |
| BIAS | - | -1 | 1 | No |
We plot the results of the evaluation metrics against the Pearson correlation between the suitability raster and the prediction map of virtual species.
If the model (points) are plotted on the diagonal, then the metric is performing well.
Figure: AUCROC, Pearson's correlation, AUCPRG, and Specificity. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis.The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.
Figure: Sensitivity, true skill statistic (TSS), Cohen’s kappa, and percent correctly classified (PCC). In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets. The left column shows results from presence–absence (PA) data, the middle column from presence–background (PBG) data, and the right column from presence-artificial-absence (PAA) data. Rows correspond to different evaluation metrics.
Figure: Symmetric extremal dependence index (SEDI), Smoothed boyce index mean, and omission rate. In each plot, one evaluation metric with rescaled values from 0 to 1 is shown on the x-axis, and Pearson's correlation between the true probability of occurrence and the artificial distribution maps (used as the reference for actual model performance) is shown on the y-axis. SEDI and omission rate are shown on an inversed scale. The dotted pink line depicts the bisector (slope = 1, intercept = 0). Each blue point represents one evaluation metric calculated on one of the 8,335 experimental test datasets.

