A fully automated AWS SageMaker Pipeline that ingests a raw “fake news” dataset, cleans & balances it, trains a RoBERTa classifier, evaluates its performance, and—if it meets your quality gates—packages & registers the model for deployment after human approval.
- Data Registration & Understanding
- Pipeline Definition (the stages below are wired together as in the sketch after this list)
- Processing (clean, balance, transform, split)
- Training (fine-tune the RoBERTa classifier on the train and validation splits)
- Evaluation (score the trained model on the held-out test split)
- Conditional Model Registration
- Human approval and SageMaker endpoint deployment
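A minimal sketch of how these stages could be expressed with the SageMaker Python SDK is shown below. The script names (`preprocess.py`, `train.py`, `evaluate.py`), S3 locations, instance types, framework versions, metric path, and the 0.90 accuracy threshold are illustrative assumptions, not the project's actual values.

```python
import sagemaker
from sagemaker.huggingface import HuggingFace
from sagemaker.inputs import TrainingInput
from sagemaker.processing import ProcessingInput, ProcessingOutput
from sagemaker.sklearn.processing import SKLearnProcessor
from sagemaker.workflow.condition_step import ConditionStep
from sagemaker.workflow.conditions import ConditionGreaterThanOrEqualTo
from sagemaker.workflow.functions import JsonGet
from sagemaker.workflow.pipeline import Pipeline
from sagemaker.workflow.properties import PropertyFile
from sagemaker.workflow.step_collections import RegisterModel
from sagemaker.workflow.steps import ProcessingStep, TrainingStep

session = sagemaker.Session()
bucket = session.default_bucket()
role = sagemaker.get_execution_role()  # resolve the role ARN yourself if running outside SageMaker

# 1. Processing: clean, balance, transform, and split the raw dataset.
processor = SKLearnProcessor(framework_version="1.2-1", role=role,
                             instance_type="ml.m5.xlarge", instance_count=1)
process_step = ProcessingStep(
    name="ProcessFakeNewsData",
    processor=processor,
    inputs=[ProcessingInput(source=f"s3://{bucket}/fake-news/raw",  # assumed raw-data location
                            destination="/opt/ml/processing/input")],
    outputs=[ProcessingOutput(output_name=n, source=f"/opt/ml/processing/{n}")
             for n in ("train", "validation", "test")],
    code="preprocess.py",  # placeholder processing script
)

# 2. Training: fine-tune RoBERTa with the Hugging Face estimator ("train.py" is a placeholder).
estimator = HuggingFace(entry_point="train.py", source_dir="scripts", role=role,
                        transformers_version="4.26", pytorch_version="1.13", py_version="py39",
                        instance_type="ml.p3.2xlarge", instance_count=1,
                        hyperparameters={"model_name": "roberta-base", "epochs": 3})
train_step = TrainingStep(
    name="TrainRobertaClassifier",
    estimator=estimator,
    inputs={name: TrainingInput(
                process_step.properties.ProcessingOutputConfig.Outputs[name].S3Output.S3Uri)
            for name in ("train", "validation")},
)

# 3. Evaluation: score the model on the test split and write evaluation.json.
#    In practice this job needs a container with transformers installed (e.g. a HuggingFaceProcessor).
evaluation_report = PropertyFile(name="EvaluationReport", output_name="evaluation",
                                 path="evaluation.json")
eval_step = ProcessingStep(
    name="EvaluateModel",
    processor=processor,
    inputs=[ProcessingInput(source=train_step.properties.ModelArtifacts.S3ModelArtifacts,
                            destination="/opt/ml/processing/model"),
            ProcessingInput(source=process_step.properties.ProcessingOutputConfig
                            .Outputs["test"].S3Output.S3Uri,
                            destination="/opt/ml/processing/test")],
    outputs=[ProcessingOutput(output_name="evaluation", source="/opt/ml/processing/evaluation")],
    code="evaluate.py",  # placeholder evaluation script
    property_files=[evaluation_report],
)

# 4. Conditional registration: register only when the quality gate passes;
#    "PendingManualApproval" keeps the package out of production until a human approves it.
register_step = RegisterModel(
    name="RegisterFakeNewsModel",
    estimator=estimator,
    model_data=train_step.properties.ModelArtifacts.S3ModelArtifacts,
    content_types=["application/json"], response_types=["application/json"],
    inference_instances=["ml.m5.xlarge"], transform_instances=["ml.m5.xlarge"],
    model_package_group_name="fake-news-roberta",
    approval_status="PendingManualApproval",
)
quality_gate = ConditionGreaterThanOrEqualTo(
    left=JsonGet(step_name=eval_step.name, property_file=evaluation_report,
                 json_path="metrics.accuracy.value"),  # assumed metrics layout
    right=0.90,                                        # assumed quality gate
)
condition_step = ConditionStep(name="CheckQualityGate", conditions=[quality_gate],
                               if_steps=[register_step], else_steps=[])

pipeline = Pipeline(name="FakeNewsRobertaPipeline", sagemaker_session=session,
                    steps=[process_step, train_step, eval_step, condition_step])
```

Calling `pipeline.upsert(role_arn=role)` and then `pipeline.start()` creates and runs the pipeline; the `PendingManualApproval` status on the registered package is what gates endpoint deployment behind human review.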
- Python 3.8 or above
- AWS account with permissions for SageMaker, S3, IAM, CloudWatch
- AWS CLI v2 configured
boto3, sagemaker, transformers
```
pip install boto3 sagemaker protobuf transformers pandas
```
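After installing the packages and running `aws configure`, a quick sanity check along these lines confirms the SDKs can reach your account; the role ARN below is a placeholder.

```python
import boto3
import sagemaker

# Uses the credentials and region configured via `aws configure`.
session = sagemaker.Session()
print("Account:", boto3.client("sts").get_caller_identity()["Account"])
print("Region:", session.boto_region_name)
print("Default S3 bucket:", session.default_bucket())

try:
    # Resolves automatically inside SageMaker Studio / notebook instances.
    role = sagemaker.get_execution_role()
except Exception:
    # Elsewhere, supply your SageMaker execution role ARN explicitly (placeholder below).
    role = "arn:aws:iam::<account-id>:role/<sagemaker-execution-role>"
print("Execution role:", role)
```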