Aspect-based sentiment analysis

The problem is framed as a sentence-pair classification task. The model takes a sentence pair (a sentence and a category name) as input and outputs a polarity label. The full pipeline consists of loading and pre-processing the dataset, training the model, and predicting labels.
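
To make the sentence-pair framing concrete, here is a minimal sketch of how a sentence and a category name could be joined into one BERT-style input. Whitespace splitting stands in for the real WordPiece tokenizer, and the function name `build_pair_input` and the example sentence and category are assumptions made for illustration; they are not taken from the project code.

```python
def build_pair_input(sentence: str, category: str) -> dict:
    """Join a sentence and a category name into one BERT-style pair input."""
    # Whitespace splitting is a stand-in for WordPiece tokenization (illustration only).
    first = sentence.lower().split()
    second = category.lower().split()

    # BERT pair layout: [CLS] segment A [SEP] segment B [SEP]
    tokens = ["[CLS]", *first, "[SEP]", *second, "[SEP]"]

    # Segment ids: 0 for [CLS] + sentence + first [SEP], 1 for category + final [SEP].
    token_type_ids = [0] * (len(first) + 2) + [1] * (len(second) + 1)
    return {"tokens": tokens, "token_type_ids": token_type_ids}


# Hypothetical example pair.
example = build_pair_input("The pasta was great", "food")
print(example["tokens"])
print(example["token_type_ids"])
```

The classifier then predicts one polarity label for the whole pair, so the same sentence can receive different labels for different categories.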

Pre-processing

Filter the dataset to keep only the sentence, category, and label columns, and encode the labels ('neutral': 0, 'negative': 1, 'positive': 2). Apply AutoTokenizer to the sentence-category pairs and save the resulting input ids and attention masks. Finally, convert the pre-processed dataset to tensors.
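
The steps above can be sketched roughly as follows. This is not the code in `src`; the column names (`sentence`, `category`, `label`), the function `preprocess`, and the sample rows are assumptions. `toy_tokenizer` is an offline stand-in that only mimics the output keys of a Hugging Face tokenizer so that the sketch runs without downloading a model.

```python
from typing import Callable

# Label encoding as given in this README.
LABEL2ID = {"neutral": 0, "negative": 1, "positive": 2}


def preprocess(rows: list, tokenizer: Callable, max_length: int = 128) -> dict:
    """Filter columns, encode labels, and tokenize sentence-category pairs."""
    # Keep only the three fields the model needs (field names are assumed).
    sentences = [row["sentence"] for row in rows]
    categories = [row["category"] for row in rows]
    labels = [LABEL2ID[row["label"]] for row in rows]

    # Hugging Face tokenizers accept (text, text_pair) plus padding/truncation options.
    encoded = tokenizer(
        sentences, categories,
        padding="max_length", truncation=True, max_length=max_length,
    )
    return {
        "input_ids": encoded["input_ids"],
        "attention_mask": encoded["attention_mask"],
        "labels": labels,
    }


def toy_tokenizer(texts, text_pairs, padding, truncation, max_length):
    """Offline stand-in: fake ids, but real-looking padding and masks."""
    ids, masks = [], []
    for a, b in zip(texts, text_pairs):
        # +3 accounts for [CLS] and the two [SEP] tokens.
        n = min(len(a.split()) + len(b.split()) + 3, max_length)
        ids.append(list(range(1, n + 1)) + [0] * (max_length - n))
        masks.append([1] * n + [0] * (max_length - n))
    return {"input_ids": ids, "attention_mask": masks}


# Hypothetical rows; the extra "id" field is dropped by the filter step.
rows = [
    {"id": 1, "sentence": "The pasta was great", "category": "food", "label": "positive"},
    {"id": 2, "sentence": "Service was slow", "category": "service", "label": "negative"},
]
batch = preprocess(rows, toy_tokenizer, max_length=10)
print(batch["labels"])
```

With the real library, a tokenizer loaded via `AutoTokenizer.from_pretrained(...)` can be passed in place of `toy_tokenizer`; adding `return_tensors="pt"` to the tokenizer call returns PyTorch tensors directly instead of lists.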

Training

Fine-tune a BERT model with different hyperparameter settings and save the best one (based on accuracy score).
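
One way to organise such a search is a plain grid loop that keeps the configuration with the highest accuracy. The sketch below is an assumption about the overall shape, not the project's training code: `select_best`, the grid values, and `toy_run` are hypothetical, and `toy_run` returns a synthetic score in place of an actual fine-tuning and evaluation run.

```python
from itertools import product


def accuracy(predictions, labels):
    """Fraction of predictions that match the gold labels."""
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def select_best(grid: dict, train_and_evaluate) -> dict:
    """Try every hyperparameter combination and keep the most accurate one."""
    best = {"accuracy": -1.0, "params": None}
    keys = list(grid)
    for values in product(*(grid[k] for k in keys)):
        params = dict(zip(keys, values))
        score = train_and_evaluate(**params)
        if score > best["accuracy"]:
            best = {"accuracy": score, "params": params}
            # A real run would save the model here,
            # e.g. with model.save_pretrained(output_dir).
    return best


def toy_run(learning_rate, batch_size):
    """Placeholder for fine-tuning + evaluation; the score is synthetic."""
    return 1.0 - abs(learning_rate - 3e-5) * 1e4


# Hypothetical search space.
grid = {"learning_rate": [2e-5, 3e-5, 5e-5], "batch_size": [16, 32]}
best = select_best(grid, toy_run)
print(best["params"])
```

Because the comparison is strict, ties are resolved in favour of the configuration tried first, which keeps the saved checkpoint stable across equal scores.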

Note: the code was adapted by Olesia Khrapunova from a project for an NLP course (MS in Data Science and Business Analytics, CentraleSupelec). The original code was written by Suiyan Liu, Alain Mikael Alafriz, Milind Bhatnagar and Olesia Khrapunova.

