Pipeline currently mis-maps IEAs around GOlr, likely related to owltools or its environment #251

@kltm

There are 128276 more annotations on release than were on the candidate. Looking at AmiGO, it seems like "evidence used in automatic assertion" (ECO:0000501) has entirely disappeared. Strangely, a very small number of IEA annotations did manage to find their way in:

https://amigo-staging.geneontology.io/amigo/search/annotation?q=IEA*&fq=-evidence_subset_closure_label:%22genetic%20interaction%20evidence%20used%20in%20manual%20assertion%22&fq=-evidence_subset_closure_label:%22biological%20aspect%20of%20ancestor%20evidence%20used%20in%20manual%20assertion%22&sfq=document_category:%22annotation%22

That's weird.
I'm putting a pin in that for now.

That leaves us with two paths for data to be dropped, assuming that the issue is on our side and not upstream: ontobio and owltools.

Looking for a data set to check, let's arbitrarily select the MGI GAF as an example, specifically the IEAs:

| | snapshot | release |
|---|---|---|
| src | 73326 | 73829 |
| valid | 73318 | 73821 |

So, that is only a small change in the src/valid numbers (503 more in the release in both cases). In AmiGO, filtering for just species "mouse" annotations, there are actually /more/ in the snapshot... so, hm.
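For anyone wanting to repeat this kind of per-file check, here is a minimal sketch of counting annotations by evidence code in a GAF. It relies only on the GAF layout (tab-separated, `!` comment lines, evidence code in column 7). The function names and any file paths passed in are hypothetical, and this is not necessarily how the src/valid numbers above were produced.

```python
import gzip
import sys
from collections import Counter


def count_evidence_codes(lines):
    """Count annotations per evidence code in an iterable of GAF lines."""
    counts = Counter()
    for line in lines:
        # Header/comment lines in a GAF start with "!"; skip blanks too.
        if not line.strip() or line.startswith("!"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 7:
            continue  # too short to carry an evidence code; ignore
        counts[cols[6]] += 1  # column 7 (index 6) is the evidence code
    return counts


def count_in_file(path):
    """Count evidence codes in a plain or gzipped GAF at a (hypothetical) path."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as fh:
        return count_evidence_codes(fh)


if __name__ == "__main__":
    # Usage (paths are placeholders): python count_iea.py snapshot.gaf.gz release.gaf.gz
    for gaf_path in sys.argv[1:]:
        print(gaf_path, "IEA:", count_in_file(gaf_path)["IEA"])
```

Running it on the snapshot and release copies of the same GAF gives a quick per-file IEA count to set against what AmiGO reports.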

My guess for the moment is that there is something going on in either owltools or the docker mechanism around it (possibly cached old files) that is causing a problem here. I'll do a bit more digging to try and get at what's going on.
