Below is a list of projects that I have either worked on individually or contributed to in collaboration with others.
Projects
Project 7: Visualizing Global Imports from Major Economic Powers (China, EU, USA) since 2000
Use the IMF API to automatically collect the relevant data sources
Process data as needed
Create metrics to define the majority import source (in this case, whichever of the three economies accounts for the largest share of a country's imports)
Combine the GeoJSON file with the data to produce a proper geospatial file with the necessary information
Create visuals of the dataset
Start-to-finish process in one Jupyter Notebook
GitHub Repository
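The metric and merge steps above can be sketched roughly as follows. This is a minimal illustration and not the code from the notebook: the function names, the `iso_a3` join key, the `dominant_source` property, and the sample values are all hypothetical, and it assumes "majority import" means the partner with the largest import value among the three.

```python
def dominant_source(imports_by_partner):
    """Return the partner with the largest import value.

    imports_by_partner maps a partner name (e.g. "China", "EU", "USA")
    to the value imported from it. Ties resolve to the first partner listed.
    """
    if not imports_by_partner:
        raise ValueError("no import values given")
    return max(imports_by_partner, key=imports_by_partner.get)


def attach_dominant_source(geojson, dominant_by_country, key="iso_a3"):
    """Return a copy of a GeoJSON FeatureCollection with a 'dominant_source'
    property added to each feature.

    'key' is the (assumed) property holding the country code used for the join.
    Features without a match get None.
    """
    features = []
    for feature in geojson["features"]:
        props = dict(feature.get("properties") or {})  # copy, do not mutate input
        props["dominant_source"] = dominant_by_country.get(props.get(key))
        features.append({**feature, "properties": props})
    return {**geojson, "features": features}


# Hypothetical sample values, for illustration only.
imports = {"BRA": {"China": 40.0, "EU": 35.0, "USA": 25.0}}
dominant = {code: dominant_source(values) for code, values in imports.items()}

world = {
    "type": "FeatureCollection",
    "features": [
        {"type": "Feature", "properties": {"iso_a3": "BRA"}, "geometry": None}
    ],
}
merged = attach_dominant_source(world, dominant)
```

The merged collection can then be written out with `json.dump` and passed to a mapping library for the visuals.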
Project 6: Bike Infrastructure Availability & Quality in Berlin vs. Amsterdam
Define metrics:
Availability = Bike Paths (km) / Roads (km)
Quality = (Separated Bike Paths / All Bike Paths) * 100
Use an API to collect data from OpenStreetMap
Acquire appropriate shapefiles from the city authorities
Clean and process data
Create data visualizations and report findings
GitHub Repository
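The two metrics defined above translate directly into code. The sketch below is illustrative only and is not taken from the repository: the function names and the sample lengths are hypothetical, and it assumes all inputs are lengths in kilometres.

```python
def availability(bike_path_km, road_km):
    """Availability = bike paths (km) / roads (km)."""
    if road_km <= 0:
        raise ValueError("road_km must be positive")
    return bike_path_km / road_km


def quality(separated_km, all_bike_path_km):
    """Quality = (separated bike paths / all bike paths) * 100, in percent."""
    if all_bike_path_km <= 0:
        raise ValueError("all_bike_path_km must be positive")
    return separated_km / all_bike_path_km * 100


# Hypothetical lengths, not measured values for either city.
example_availability = availability(bike_path_km=50.0, road_km=200.0)
example_quality = quality(separated_km=30.0, all_bike_path_km=120.0)
```

Because both metrics are ratios, they stay comparable between cities of different size.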
Project 5: Predicting Vote & Ideology with Demographics and Browsing Data (Thesis)
Extracted features from the Media Exposure and Opinion Formation (MEOF) survey
Processed large quantities (millions of rows) of user trace data from YouGov
Used Python and R for data cleaning
Segmented and documented code for a modular implementation
Optimized machine learning models and neural networks, and used natural language processing for topic modeling
Model performance was significantly better for ideology prediction
GitHub Repository
Paper