Five data quality lessons from Amazon

2 min read

About a year ago on this site, I penned a post titled “Analytics lessons from Amazon.” In it, I described the analytics lessons that employees and even entire companies can learn from the retail giant.

But there’s so much more that Jeff Bezos et. al can teach us. Today, I’ll focus on the data quality lessons we can glean from the largest Internet retailer.

By way of background, Amazon understands that all data is important whether it is:

As someone who has followed the company since its inception, trust me: this core belief cascades throughout the organization.

Get the AI & data signal, daily.

335k+ subscribers read this every morning. One email, both newsletters. Unsubscribe anytime.

Amazon stocks a ridiculous number of items. One of my favorite stats: its clothing selection is now bigger than 250 Walmart supercenters combined. With that much inventory, from time to time there’s going to be a data issue or two. (I wrote a post about one such issue back in July of 2014.)

Still, Amazon is constantly its improving the data around offerings, even if that means acquiring companies such as IMDb in 1998.

With so many products and different lines of business, a top-down approach to data quality would never fly at Amazon. To this end, the company provides its customers, partners and vendors with tools to manage a great deal of their own data.;

 

Yves Mulkers

Yves Mulkers is the founder of 7wData and a widely followed voice in the data and AI community. He curates the 7wData and AI Beat newsletters, reaching hundreds of thousands of data and AI professionals, and writes on data strategy, analytics, AI, and the evolving data ecosystem.