-
Notifications
You must be signed in to change notification settings - Fork 17
LLM-based example #196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
tectonia
wants to merge
22
commits into
main
Choose a base branch
from
martyna/contribute-llm-example
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
LLM-based example #196
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Trigger pr checks with minor change
Trigger another pr check
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This pull request introduces a new LLM-based example - the invoice processing pipeline, including new configurations, dependencies, workflows, and documentation. Key changes focus on enabling invoice processing experiments, integrating Azure OpenAI services, and enhancing pipeline functionality.
Invoice Processing Pipeline:
invoice_processing_ci_pipeline.ymlto define a continuous integration workflow for invoice processing, supporting pull requests and workflow dispatch triggers.config/config.yamlto include configurations for invoice processing pipelines (invoice_processing_prandinvoice_processing_dev) with specific compute cluster and dataset settings. [1] [2]experiment_config.yamlto define parameters for data preparation, prediction, and scoring in invoice processing experiments.mlops/invoice_processing/components/predict.ymlto define the predict component for the pipeline, integrating Azure OpenAI service inputs and outputs.Dependency Updates:
.github/requirements/build_validation_requirements.txtand.github/requirements/execute_job_requirements.txtto include newer versions ofmlflow,azure-ai-ml, and additional libraries likeazureml-fsspec,Levenshtein, andpython-retry. [1] [2]Workflow Improvements:
build_validation_workflow.ymlto includePYTHONPATHfor improved test execution.execute_shell_code/action.ymlfor better security and reliability.Documentation Updates:
docs/how-to/ConfigureExperiments.mdto provide detailed instructions on configuring experiments, including.envfile setup and pipeline configurations.docs/how-to/PromptsAndExtractionStrategies.mdto document prompt creation and extraction strategies for invoice processing.Data and Config Updates:
config/data_config.jsonto include new datasets for invoice processing (invoice_processing_testandinvoice_processing_test_gt).mlops/common/config_utils.pyto support loadingexperiment_config.yamlalongside the main configuration file.These changes collectively enable robust support for the invoice processing pipeline, streamline workflows, and enhance documentation for easier onboarding and experimentation.