Semi-Random Data Musings
  • About
Categories
All (16)
arima (1)
classification (2)
data visualization (11)
duckdb (1)
fastai (1)
forecasting (1)
gganimate (1)
ggplot2 (1)
machine learning (4)
model deployment (1)
model evaluation (1)
nlp (2)
plot (3)
poisson (1)
Python (3)
python (3)
R (12)
random forest (1)
recommender systems (1)
sentiment (1)
simulation (1)
text analysis (3)
xgboost (1)

Getting Lost in Library Data

Exploring Seattle Library Data
R
data visualization
machine learning
xgboost
duckdb
One of my favorite public goods has always been the library. Going all the way back to elementary school and the glorious pan-pizza summer reading challenges, I always found…
Nov 21, 2025
Adam

Let’s Get Lucky

Looking at investment income with simulations
simulation
data visualization
There is common adage these days that goes something like this: the wealthy, and billionaires in particular, are not genius investors, they just have more rolls of the dice…
Nov 11, 2025
Adam

Visualizing how often I see my siblings–with Waffle(plot)s

R
data visualization
plot
ggplot2
Several years ago I started using a service called AskMeEvery to track various little aspects of my life– how much I read that day, how much water I drank, etc. There are…
Feb 4, 2022
Adam

Tidy Tuesday: How much does it rain in Australia–with gifs!

R
data visualization
plot
gganimate
A goal of mine in the new year is to take part in tidytuesday as often as I can. It’s a great, supportive community and I think it’s important to deliberately practice your…
Jan 9, 2020
Adam

Analysis of Beer Advocate Dataset: Walkthrough of a data science take-home interview test

R
data visualization
machine learning
recommender systems
Interviewing for data science jobs is hard. Since the job definition and responsibilities vary significantly between companies and roles, you never quite know what areas of…
Mar 11, 2019
Adam
 

How to deploy a Keras model? Who knows? Let’s find out!

Python
nlp
text analysis
python
model deployment
Over the past couple weeks, I’ve been playing around with the Yelp academic dataset. The first thing I did was explore the dataset to see what interesting tidbits jumped…
May 2, 2018
Adam
 

Predicting Yelp Ratings from Review Text

Python
classification
machine learning
python
text analysis
Hello again! If you’ve been following along you’ll know that I’m in the middle of a series of posts digging through Yelp review data. Last time I went through some…
Apr 23, 2018
Adam

The Tables Turn: Reviewing Yelp!

R
nlp
data visualization
sentiment
For those who know me, it’s no secret that I love food. Whether it’s a bowl of ramen, delicious BBQ or a (half-dozen)warm cookie, if there is food to be tried, I want to try…
Apr 11, 2018
Adam
 

A Python Post?

Python
fastai
python
machine learning
random forest
This isn’t a typical post. Though I prefer working in R, I also do quite a bit of work in Python these days. I really like the way Python is setup for machine learning and…
Mar 27, 2018
Adam

Text Analysis of Presidential News Articles

R
data visualization
plot
text analysis
Do you live under a rock? If not, you’ve noticed that Donald Trump became president last year. No matter your feelings on the subject, most people would probably agree that…
Mar 14, 2018
Adam

To Classify or Not to Classify

R
classification
model evaluation
If you’ve been following this blog, you’ll recognize that I’ve been pretty obsessed with crime data recently. Louisville’s open crime data has been a great open source…
Feb 21, 2018
Adam

Violent Crime Revisited

R
data visualization
poisson
Occasionally I like to revisit my old blog posts. Since I try to treat them as a means to enhance my data science toolkit, old posts will often look quite naive when viewed…
Jan 4, 2018
Adam

A Forecasters Folly

R
forecasting
arima
I’ve been itching to do a forecasting post for a while. Time series data is not something I work with often, so I have wanted to practice some basic techniques, but I’ve…
Oct 13, 2017
Adam

Louisville’s Heroin Epidemic

R
data visualization
If you have been paying any attention to the news recently, you will have heard that the United States is experiencing a resurgence of heroin use. Unfortunately, Louisville…
Aug 18, 2017
Adam

Is Theft in Louisville Rising?

R
data visualization
Last time, we dove into Louisville’s open crime data set to explore the city’s violent crime. Today, we examine the vast world of theft/larceny crimes in Louisville. In this…
Aug 2, 2017
Adam

Exploring Violent Crime in Louisville

R
data visualization
Over the past year Louisville’s local news seems like it has been increasingly dominated by crime news. Whether it was riots, teens running rampant or a record number of…
Jul 31, 2017
Adam
No matching items