Skip to content

Conversation

@DSuveges
Copy link
Contributor

Context

This update includes studyStartDate as source of evidenceDate. This change enables dating ChEMBL evidence.

Number of evidence with available studyStartDate:

+------------+------+
|datasourceId| count|
+------------+------+
|      chembl|485114|
+------------+------+

Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR adds studyStartDate as a source for evidenceDate in the ChEMBL evidence processing pipeline. The change enables dating of ChEMBL evidence records that previously may have lacked date information.

  • Added studyStartDate as a new column option in the coalesce function for determining evidence dates
  • Updated the priority comment to reflect the new column ordering for evidence date selection
  • This change affects approximately 485,114 ChEMBL evidence records that have available studyStartDate values

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

Copy link
Contributor

@ireneisdoomed ireneisdoomed left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! Would it make more sense prioritising releaseDate over the literature reference? This is available por PPP data and for ClinGen. I think it would precede a potentially availale publication date

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Update evidence step to capture ChEMBL study start date

2 participants