Author: shotlefttodatascience

  • A goldmine of information…

    I’ve just discovered the awesome Brandon Rohrer and his blog while trying to find an intelligible article on Bayesian inference. What a goldmine – this guy is a born educator! Thank you for sharing your knowledge – it is well-appreciated!

  • SQL CheatSheet

    I’ve just worked through Imtiaz Ahmad’s Master SQL for Data Science on Udemy and it was a thoroughly enjoyable, morale-boosting experience! He build on each concept so you never feel left behind or perplexed at how he arrived at a solution, and as promised there are a gazillion exercises so by the time you’re done you feel like…

  • Poisson vs Exponential distributions

    These distributions are related yet different – here’s a comparison that hopefully clears up any confusions! Poisson Exponential Number of events that occur in an interval of time Time taken between 2 events occurring For example… the number of Metrorail trains that arrive at the platform in an hour For example… the time between one…

  • A Scrum fan is born… cheatsheet

    I’ve just finished listening to The Art of Doing Twice the Work in Half the Time on Audible and I feel like a real fan already – I can’t wait to test-drive it in a team situation! As a stalwart of Corporate IT, I’ve only ever worked according to the “waterfall” methodology and I’m unpleasantly…

  • Law of total probability – worked examples

    According to Wikipedia the law of total probability “expresses the total probability of an outcome which can be realized via several distinct events”. We can also think of this as the marginal probability: irrespsective of what road we took to get to this outcome, what is the total likelihood of the outcome occurring? Example 1…

  • Expected value refresher

    The expected value of an event is its most likely outcome. Assign each potential result a probability. The expected value is sum of all the potential results x their respective probabilities: ∑ (potential_result1 x probability1,… potential_resultn x probabilityn) Consider the simplest example possible, the coin flip. You’ll be paid R10 if you pick tails, but…

  • Git cheatsheet

    Even working in splendid isolation there have been times when I managed to lose a prior version of my code that actually worked better, and I wished I could get back to it without having to re-think everything from scratch! Now that I’ll be collaborating with others, version management is going to be a big…

  • Bash: command not found

    This morning I’ve been playing “spot the difference” – a simple enough task: use a bash shell script to get 2 strings from a user (first name and second name) and then evaluate whether the 2 strings are the same or different. I had a template for the script and everything from the edX course,…

  • Why Linux / command line?

    Today I’m ploughing my way through the edX Introduction to Linux and found myself wondering “whyyyy??”. It’s all quite entertaining and nifty, but how will I be using this later? Naturally I googled (using bing!) and found this wonderful online resource www.datascienceatthecommandline.com. After reading the section entitled “A Real-world Use Case” involving data for New York…

  • Getting results vs Understanding

    Alexander Pope is famously quoted as saying: A little learning is a dangerous thing; drink deep, or taste not the Pierian spring: there shallow draughts intoxicate the brain, and drinking largely sobers us again. I’ve been thinking about these words the past few days as I worked on my latest challenge: a text classifier using…