This project explores the process of extracting keywords from:
- Scientific abstracts
- IMDB reviews
- Rotten Tomatoes reviews
The datasets used can be found in:
- IMDB: https://www.kaggle.com/datasets/lakshmi25npathi/imdb-dataset-of-50k-movie-reviews
- Rotten Tomatoes: https://www.kaggle.com/datasets/andrezaza/clapper-massive-rotten-tomatoes-movies-and-reviews
- Scientific abstracts: Inspec dataset, ArXiv data: https://www.kaggle.com/datasets/sarthakharne/dataset-with-embeddings?select=cs_papers_wo_embeddings.csv