Data collector #149
Please add tests to confirm that the tracing code matches the original functionality, and add the missing files.
import csv
from pathlib import Path


INPUT_PATH = Path("data/checker_dataset_pistachio.jsonl")
The JSONL file is missing from this PR.
from pathlib import Path


INPUT_PATH = Path("data/checker_dataset_pistachio.jsonl")
OUTPUT_PATH = Path("data/checker_dataset_clean.csv")
The CSV file is missing from this PR.
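If the data files cannot be committed, one option is for the script to fail fast with a clear message when the input is absent. A minimal sketch, assuming the input holds one JSON object per line; `convert_jsonl_to_csv` and its column handling are hypothetical and not taken from this PR:

```python
import csv
import json
from pathlib import Path


def convert_jsonl_to_csv(input_path: Path, output_path: Path) -> int:
    """Hypothetical helper: write one CSV row per JSON object; return the row count."""
    if not input_path.exists():
        raise FileNotFoundError(f"Input dataset not found: {input_path}")

    # Skip blank lines; every other line is assumed to be a JSON object.
    with input_path.open(encoding="utf-8") as src:
        records = [json.loads(line) for line in src if line.strip()]

    # Use the union of keys, in first-seen order, as the CSV header.
    fieldnames = list(dict.fromkeys(key for record in records for key in record))

    output_path.parent.mkdir(parents=True, exist_ok=True)
    with output_path.open("w", newline="", encoding="utf-8") as dst:
        writer = csv.DictWriter(dst, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(records)  # keys missing from a record become empty cells
    return len(records)
```

A small fixture file checked in next to the tests would let this run in CI without the full dataset.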
@@ -0,0 +1,274 @@
"""Offline runner to collect rule-based checker data over a SMILES dataset.
Please add tests for this runner to make sure its functionality matches the original code.
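One way to cover this is a parity test that runs the original checker and the tracing variant over the same SMILES strings and reports any disagreement. A minimal sketch; `original_check` and `traced_check` are hypothetical stand-ins, since the real function names are not shown in this diff:

```python
from typing import Callable, Iterable, List, Tuple


def find_mismatches(
    original: Callable[[str], object],
    traced: Callable[[str], object],
    smiles_list: Iterable[str],
) -> List[Tuple[str, object, object]]:
    """Return (smiles, original_result, traced_result) for every disagreement."""
    mismatches = []
    for smiles in smiles_list:
        expected = original(smiles)
        actual = traced(smiles)
        if expected != actual:
            mismatches.append((smiles, expected, actual))
    return mismatches


# Hypothetical stand-ins for the real checker and its tracing variant.
def original_check(smiles: str) -> bool:
    return "N" in smiles


def traced_check(smiles: str) -> bool:
    trace = [f"checking {smiles}"]  # extra bookkeeping must not change the result
    return "N" in smiles


def test_tracing_matches_original() -> None:
    sample = ["CCO", "CCN", "c1ccccc1", "NC(=O)C"]
    assert find_mismatches(original_check, traced_check, sample) == []
```

Returning the mismatching inputs, rather than a single boolean, makes a failing run show which molecules diverge.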
Description
Fix #(issue)
Type of change
Please check the option that is related to your PR.
Checklist