Code for application and training algorithms described in:
Yijun Zhao, Fernando Martinez, Haoran Xue, Gary M. Weiss (2024) "Admissions in the Age of AI: Detecting AI-Generated Application Materials in Higher Education"
This repository is organized as follows:
The src folder contains scripts used for the generation of prompts and the training and analysis of AI models.
LORPromptsMaker.py: Generates prompts for letters of recommendation.SOIPromptsMaker.py: Generates prompts for statements of intent.TrainingAndAnalysis.py: Handles the training and analysis of models.
The app directory encompasses all the necessary components to run the application.
app.py: Main application entry point.custom_models.py: Contains custom transformer-based models.requirements.txt: Lists all dependencies required to run the application.Dockerfile: Dockerfile for building the application container..streamlit: Contains Streamlit configuration files (if applicable).
The models subdirectory within app contains baseline models for machine learning operations.
baseline_model_lr.joblib: Baseline logistic regression model.baseline_model_lr2.joblib: Second logistic regression baseline model.baseline_model_nb.joblib: Baseline Naive Bayes model.baseline_model_nb2.joblib: Second Naive Bayes baseline model.