Below you will find pages that utilize the taxonomy term “NLP”
Projects
Project 5: Predicting Vote & Ideology with Demographics and Browsing Data (Thesis)
Extracted features from the Media Exposure and Opinion Formation (MEOF) survey. Processed large quantities (millions of rows) of user trace data from YouGov Utilized Python and R for proper data cleaning Segmented and documented code for modularized implementation Optimized Machine Learning models, neural networks, and utilized natural language processing for topic modeling The model performance for ideology is significantly better. GitHub Repository
Paper
Projects
Project 4: Natural Language Processing 25 Years of EU Climate Policy
This project expands on the work by Sewerin, S., Kaack, L.H., Küttel, J. et al. Towards understanding policy design through text-as-data approaches: The policy design annotations (POLIANNA) dataset. Sci Data 10, 896 (2023). https://doi.org/10.1038/s41597-023-02801-z.
The POLIANNA is a dataset of policy texts from the European Union (EU) that are annotated based on theoretical concepts of policy design, which can be used to develop supervised machine learning approaches for scaling policy analysis.