# Homework 4 for the course Advanced Data Mining and Language Technologies at La Sapienza University of Rome
The assignment consists of analyzing customer ratings and comments for a set of products and building a language model that classifies a customer's comment as negative or positive.
From this dataset we quantize the four possible ratings into a binary label (positive or negative comment), which we use as the target for supervised models, and we treat `title` and `review_text` as the only informative features for classification.
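A minimal sketch of the label construction described above. The exact rating scale and threshold are not stated in this README, so the 1–4 scale and the "rating ≥ 3 is positive" cutoff used here are assumptions for illustration:

```python
# Map a raw star rating to a binary sentiment label.
# Assumption: ratings are integers 1-4 and >= 3 counts as positive.
def binarize_rating(rating: int) -> int:
    """Return 1 for a positive review, 0 for a negative one."""
    return 1 if rating >= 3 else 0

# Build (text, label) pairs using title + review_text as the features.
records = [
    {"title": "Great", "review_text": "Loved it", "rating": 4},
    {"title": "Awful", "review_text": "Broke in a day", "rating": 1},
]
labeled = [
    (r["title"] + " " + r["review_text"], binarize_rating(r["rating"]))
    for r in records
]
```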
Our preferred model is a neural network that uses a pre-trained BERT model for the embedding step, fine-tuned with a simple feedforward neural network (FNN) on top.
In the first part of the homework we try different combinations of encoding techniques and machine-learning models and compare them. See the notebook for further details about our choices.
| TF-IDF + Complement Naive Bayes | Word2Vec + RandomForest | BERT + XGBoost |
|---|---|---|
| ![]() | ![]() | ![]() |
XGBoost in combination with BERT embeddings appears to slightly outperform the other two methods.
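To make the encoding step concrete, here is a hand-rolled TF-IDF sketch. It is illustrative only; the homework presumably relies on a library implementation such as scikit-learn's `TfidfVectorizer`:

```python
import math
from collections import Counter

def tfidf(corpus):
    """Return one {term: tf-idf weight} dict per document."""
    docs = [doc.lower().split() for doc in corpus]
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n / df[term])
            for term, count in tf.items()
        })
    return vectors

vecs = tfidf(["good product", "bad product"])
# "product" occurs in every document, so its idf (and weight) is 0,
# while the discriminative words "good" and "bad" get positive weight.
```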
In the second part we present our final model, which improves on the best model from the previous study through transfer learning: we combine BERT embeddings with a simple FNN classifier.
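The forward pass of such a classifier head can be sketched as follows. The weights, sizes, and the 3-dimensional "embedding" below are toy values chosen for illustration; in the homework the input would be a 768-dimensional BERT sentence embedding:

```python
import math

def ffnn_forward(x, w1, b1, w2, b2):
    """One ReLU hidden layer followed by a sigmoid output in [0, 1]."""
    hidden = [max(0.0, sum(wi * xi for wi, xi in zip(row, x)) + b)
              for row, b in zip(w1, b1)]
    logit = sum(w * h for w, h in zip(w2, hidden)) + b2
    return 1.0 / (1.0 + math.exp(-logit))  # P(positive review)

# Toy example: 3-dim "embedding", 2 hidden units (all values assumed).
x = [0.5, -1.0, 0.25]
w1 = [[0.1, 0.2, 0.3], [-0.2, 0.1, 0.4]]
b1 = [0.2, 0.1]
w2 = [0.5, -0.5]
b2 = 0.0
p = ffnn_forward(x, w1, b1, w2, b2)  # probability the review is positive
```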
The final model achieves excellent performance compared to the previous ones, demonstrating the relevance of deep learning models in language processing (even though our study relies on simpler models than RNNs or end-to-end fine-tuned transformers).
| Evaluation metrics |
|---|
| ![]() |
| Performance |
|---|
| ![]() |
