Introductions

Hello and welcome!

Here I’m going to show case data analyses from topics ranging from economics, to public health, to information security. Primarily my analyses are going to be focused around small datasets around <100 rows(ish) of data. These articles are called Data Bytes

What is ‘Data Bytes’?

In a nutshell they are small easily digestible articles centered around an analysis of a tiny dataset.

I want to show how much information can be acquired from a small amount of data. Also I want beginners to be able to follow along and take the plunge into data analysis themselves.

Large datasets have an enormous amount of information. And if other people are anything like me: it can be intimidating to the point of never starting. These articles will help bolster newbies confidence to tackle those larger projects.

Just because the data is tiny, doesn’t mean that the techniques are really going to be any different.

Definition of Tiny Data Set

I am defining tiny data set as a data set with 100 or less rows and 10 or less columns.

Why tiny data sets?

I’m choosing to work with tiny data sets for a couple of reasons.

  1. They are easier for beginners to grasp and understand at a high level.
  2. While small and not exactly representative, you can still glean a lot of information from these datasets.
  3. I can do the analysis by hand which reinforces my knowledge of statistics and double checks my programming.

Will you ever work with larger data sets?

Most definitely! They’ll be under the ‘long form’ tag. But my focus is primarily going to be on tiny data.

Conclusion

Thank you for taking the time to check this out! I hope you stick around.