Skip to content

DiGIR crawls which always fail not marked as failed crawls #18

@MattBlissett

Description

@MattBlissett

Example: https://www.gbif.org/dataset/844f5238-f762-11e1-a439-00145eb45e9a

Always has Pages Crawled = 0, the endpoint returns 404, and all three page retrieve attempts end with "Got transport exception, will give up this request".

We have some undetected orphans because of this.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions