I always like to understand concepts well before I use them (which is good because it's the right thing to do, but bad because it slows me down a lot!), so it was with great excitement that I came across Matt Brems' article A One-Stop Shop for Principal Component Analysis recently. If you read this... Continue Reading →
Tutorial: BigQuery arrays and structs
The first time I encountered the BigQuery export schema this year my heart sank: arrays and structs were not something covered in my SQL intro course! But having spent a few months extracting data like this I've come to appreciate the logic. These are all the 'notes to self' I wish I'd had at the... Continue Reading →
Data Science for Good Challenge – Kaggle
I picked Center for Policing Equity challenge on Kaggle for three reasons: I love maps and I love the idea that data scientists can significantly improve our world, in addition to improving the bottom lines of big corporates. And this is exactly the type of messy data one would get in the real world so... Continue Reading →