The Set Theory of Data Warehouse Design


I love volleyball as much as I love data. However, as this is a blog about data, I’ll leave the theories I have related to setting the ball at the door and move forward on the topic of data. Believe it or not, you learned the basics of how a data warehouse works way back…

Thinking Inside the Box


When I was 19, I attended a leadership school where one of the sessions focused on trying to sell an otherwise undesirable product. In the workshop, we were given the opportunity to try and come up with reasons to purchase a bucket of excrement. Many of the people concentrated on the practical uses for poop….

Who Does the Best Imitation of You?


I was fortunate enough to grow up in an era when the likes of Rich Little and Dana Carvey could be seen on a regular basis doing impressions of famous people on late night TV. It was always amazing to me how spot on these gentlemen could imitate the facial expressions, mannerisms, and speech patterns…

“Our Work Isn’t an Exact Science”


“Our work isn’t an exact science.” I heard this stated by someone in the data science field last week, and I can think of few things in our work that make me angrier than this – so prepare to hear a rant, as I often find myself on the wrong side of Brandolini’s Law. Simply…

Be the Frank Lloyd Wright of Your Data Warehouse


I studied architecture in middle school (yes, I am that big of a nerd) and fell in love with the patterns of Frank Lloyd Wright designs. Widely regarded as one of the best architects in U.S. history, Wright’s designs are legendary for their attention to the smallest of details. He drew the plans, selected the…

The Truth, the Whole Truth, and Nothing but the Truth…


How you finish that statement tells a great deal about a person. If you finish with “so help me God”, then you’re likely either a Christian, or spend way too much time watching courtroom dramas. I’m kidding – we all know there’s no such thing as too much Matlock or Perry Mason. Alternatively, if you’re…

Regression to the Mean Machine


Old habits die hard. That, in a nutshell, is the concept behind regression to the mean. To understand this concept, let’s first define what mean, means (and forgive me for sounding like Bill Clinton during the Lewinski affair). Mean is the highfalutin way statisticians say “average”. With regression to the mean, the philosophy is that…

Roll Up Your Sleeves, Not Your Counts


Rolling up your shirtsleeves has long been associated with the idea of working hard. I have zero issues with working hard – but you should work smart as well. I was reminded of this recently in a discussion with a colleague regarding a table that stores distinct record counts by day. The idea behind the…