Projects
Note: my master's program is still in progress, so expect this page to update.
A showcase of work, school, and hobbyist projects that I've worked on. More projects to come soon!
OpenTabulate
Developed during 2018-2020. (project link)
I worked as a data scientist for Statistics Canada for almost two years during my undergrad.
Over that time, I contributed to the development of the Linkable Open Data Environment (LODE), an open source project for data processing, distribution and visualization.
I wrote opentabulate
, a Python package and a command line interface for tabulating data.
The main purpose of OpenTabulate is to programmatically convert regularly updated datasets (with a consistent schema) into CSV format with a user-defined schema.
OpenTabulate is used internally in my section to put together datasets, which are released publicly with an open data license.