I am a modern-day translator.
Successfully translating millions of rows of data into actionable information requires the technical precision of an engineer and the creativity of an artist. I transform complex data into information that is useful.
intuitive - elegant - useful
Visualizations optimized for viewing on a desktop or tablet
The Street Trees of Brooklyn & Manhattan
Svalbard's Rising Temperature

Finding Great Value in Iris' NYC Restaurant Reviews
Iris, a New York City native, has spent years surveying the local fare, giving restaurants in the 5 Boroughs review and price ratings, in addition to dishing out some brutally honest reviews. She kindly gave me access to her review data and I decided to explore which restaurants provided the best value. Her restaurant review ★-★★★★★ and cost $-$$$$$ ratings are on a 5-point scale which made the comparison relatively straight forward and is displayed below.
One interesting point brought up by Iris’ brother is that the review and cost comparison is not statistically accurate because the average rating is 3.7, while the average cost is only 2.2 meaning that most restaurants show positive value (rating > cost). I went ahead and normalized both the review and cost ratings for a more accurate comparison. This version can be found here, as it’s a little less straightforward to interpret with the scale showing a normal distribution.
A disclaimer from the master critic: These reviews are the opinion of one woman. Yes, my opinion is superior, but it is just that - an opinion. If you have any comments or suggestions, tough. HAPPY EATING! –IRIS.
I've always been fascinated with sharks. With the recent population explosion of white sharks near my family's home on Cape Cod, I thought it's time to investigate what the data tells us about these majestic predators. Using data collected by the Shark Research Institute, I examined 5661 reported attacks from 1800-2018. Note that the data is inclusive of unsuccessful attacks too (e.g., shark bites an oar while someone is rowing.)
Let's start with the location of reported incidents. Attacks are heavily concentrated in three countries, the United States, Australia, and South Africa, which account for 67% of worldwide attacks. The rest of the countries combined come in at 31%, while attacks at sea accounted for the remaining 2%.
Next let's examine attacks by species with a focus on fatal and non-fatal attacks. Four species are considered extremely dangerous (white, tiger, bull, and oceanic whitetip) to humans, and the data confirms 3/4. Oceanic whitetips register few attacks, although scientists suggest that this number is underreported because they prowl the open ocean and are responsible for many deaths during sea disasters where there are few survivors.
However, shark species are just one half of the equation. How do attacks breakdown against human activities? The chart below shows the total number of attacks by size and their fatality percent by color. Swimming (37%) and sea disasters (60%) have the highest fatality rate across all species, while board sports (surfing, kayaking, stand-up paddle boarding, surf skiing) account for the most attacks (1327), but have a low fatality rate (6%). Interestingly tiger sharks have a higher propensity for board sport attacks (82) than attacks involving swimmers (55), whereas the reverse is true for bull sharks.
Now let’s take a look at how attacks have trended over time. Non-fatal attacks generally increased since 1800 with a large spike in the 1960s and an even larger jump since 2000, while fatalities largely remained constant.
There are many factors driving the increase in recorded attacks, the two most important being the increase in number of people in the water and the increase in reporting/information availability. However, these would suggest a linear increase, which we don’t see in the data.
The jump in attacks during the 50-60’s is partially a factor of increased provoked attacks primarily involving fishermen (e.g., clearing sharks from their nets or falling overboard) and spearfishermen (e.g., spearing or interacting with sharks). So, what gives with the dramatic drop in the 70’s? A couple of factors are at play. First, the international shark attack file was not as active during that time which lead to less records. Second, the infamous JAWS film was released in 1975 which dramatically changed the public’s opinion (and behavior) on the risks of shark attacks. Finally, fish stocks were devastated which saw a corresponding drop in fishing. Attacks grew with a vengeance over the past thirty years driven by an increase in board sports, led by the emergence of surfing.
Examining Violence During the Afghan War
My interest in the Afghan war peaked when my National Guard unit received orders to deploy there to provide air ambulance medevac services throughout 2018-2019. The war had again slipped out of the public consciousness after NATO’s combat mission officially ended in 2014. Questions arose like: Who is winning the war? How intense is the fighting? What is my experience going to be like vis-à-vis other American soldiers who deployed to Afghanistan over the last 17 years?
While I found it difficult to get answers to these questions, I stumbled upon the Global Terrorism Database which provides a treasure trove of free information on terrorism related incidents. Using this data, I mapped terrorism related violence by week since 2001. The data shows that although our involement has waned, the Afghan war is in full swing. It’s also clear that we missed a window of opportunity for peace and reconstruction early in the war while violence was still relatively low.
Although the recent ceasefire and peace negotiations bring hope for the people of Afghanistan, the reality remains grim. The Special Investigator General for Afghanistan Reconstruction reported in May, 2018 that the Afghan government controls or influences 229 of Afghanistan’s 407 districts (56.3 percent), the Taliban controls or influences 59 districts (14.5 percent), and the remaining 119 districts (29.2 percent) are contested...
Regression to the Mean // ESPN's Way Too Early Rankings
The end of the NFL’s 2016 coincided with me reading Daniel Kahneman’s Thinking Fast and Slow which examines human biases that cloud our reasoning and distort our perception of reality. One of the concepts covered in the book is regression to the mean, a statistical phenomenon that states if a variable is extreme on its first measurement, it will tend to be closer to the average on its second measurement.
Luckily for us, the experts at ESPN provide an annual display of this human bias in the form of their Way Too Early NFL Rankings. Shortly after the NFL seasons ends, they predict team rankings for the next season. In the viz below, I compared their predictions with actual final standings. The size of the lines equal the absolute change in ranking from ESPN's Way to Early 2016 Season Ranking to actual standings at the end of the 2016-2017 NFL season. The colors represent whether the change in ranking was positive or negative.
In general, good teams got worse and bad teams got better (regression to the mean). However, the best teams saw the largest drop, finishing on average 8 places lower, while many mediocre teams saw a significant jump. Of course, somethings are sure bets like the cellar-dwellers Browns bringing up the rear and perennial favorites, the Patriots, remaining on top.
The Current State of Our American Nations
American Nations, by historian Colin Woodard, argues that separate nation-states exist within our Republic, where dominant cultures explain our voting behaviors and attitudes toward everything from social issues to the role of government. Here we explore how these nations voted during the 2016 Presidential Election...
My takeaway is that the largest divide amongst Americans is not their prescribed nation, but urban vs. rural. The divide is particularly evident if you fill the map by population instead of area. Nearly all democratic votes came from major population centers, while Trump carried most rural areas.
On the graph below you can explore how the American Nations compare on other demographic indicators. Differences in the American Nations are more evident when examining these indicators than when comparing voting patterns. These results suggest that there may be more than meets the eye when trying to understand the American people and our culture.
What's Poppin' / Gunshots in the District
As part of their open governance initiative, the District of Columbia publishes its data for public use. After my friend Pablo was robbed at gunpoint walking from the Metro to my apartment, I became interested in gun violence in the District.
When viewing gunshots recorded by day, it is easy to see that something is off. My hunch is that MPD is recording "gunshots" using sensors that pick-up noises breaking a set sound level. The spikes in "gunshots" on NYE and July 4th are likely the sensors picking up the noise emitted by fireworks.
What’s fascinating is removing data from New Year’s Eve and July 4th and seeing how it impacts the most dangerous time of day (which you can do using the filter above). Without the data removed, midnight is by far the most dangerous time, likely buoyed by people lighting off fireworks. However, if you remove those holidays the most dangerous time shifts to 10:00-11:00pm, which is exactly when my friend was robbed at gunpoint.
An Age of Champions? // MotoGP
MotoGP is the premiere motorcycle racing league in the world. Recently they opened up their archives of data to the public. As a fan of all-time great, Valentino Rossi, I was interested in exploring the history of the sport in the context of the competition today. Right now seems to be a golden age of MotoGP, with three great world champions, Valentino Rossi, Jorge Lorenzo, and Marc Marquez all dueling it out on the track. The data supports this theory. Explore both tabs below to decide for yourself.