Projects


Note: my master's program is still in progress, so expect this page to update.

A showcase of work, school, and hobbyist projects that I've worked on. More projects to come soon!

OpenTabulate

Developed during 2018-2020. (project link)

I worked as a data scientist for Statistics Canada for almost two years during my undergrad. Over that time, I contributed to the development of the Linkable Open Data Environment (LODE), an open source project for data processing, distribution and visualization. I wrote opentabulate, a Python package and a command line interface for tabulating data. The main purpose of OpenTabulate is to programmatically convert regularly updated datasets (with a consistent schema) into CSV format with a user-defined schema. OpenTabulate is used internally in my section to put together datasets, which are released publicly with an open data license.