Is Your Data Good Enough for Your Machine Learning/AI Plans?

Is Your Data Good Enough for Your Machine Learning/AI Plans?

Developments in AI are a high priority for businesses and governments globally. Yet, a fundamental aspect of AI remains neglected: poor data quality.

AI algorithms rely on reliable data to generate optimal results – if the data is biased, incomplete, insufficient, and inaccurate, it leads to devastating consequences.

AI systems that identify patient diseasesare an excellent example of how poor data quality can lead to adverse outcomes. When ingested with insufficient data, these systems produce false diagnoses and inaccurate predictions resulting in misdiagnoses and delayed treatments. For example, a study conducted at the University of Cambridge of over 400 tools used for diagnosing Covid-19 found reports generated by AI entirely unusable, caused by flawed datasets.

In other words, your AI initiatives will have devastating real-world consequences if your data isn’t good enough.

There is quite a debate on what ‘good enough’ data means. Some say good enough data doesn’t exist. Others say the need for good data causes analysis paralysis – while HBR outrightly states your machine learning tools are useless if your information is terrible.

At WinPure, we define good enough data as “complete, accurate, valid data that can be confidently used for business processes with acceptable risks, the level of which is subjected to individual objectives and circumstances of a business.’

Most companies struggle with data quality and governance more than they admit. Add to the tension; they are overwhelmed and under immense pressure to deploy AI initiatives to stay competitive. Sadly, this means problems like dirty data are not even part of boardroom discussions until it causes a project to fail.

Data quality issues arise at the start of the process when the algorithm feeds on training data to learn patterns. For example, if an AI algorithm is provided with unfiltered social media data, it picks up abuses, racist comments, and misogynist remarks, as seen with Microsoft’s AI bot. Recently, AI’s inability to detect dark-skinned persons was also believed as due to partial data.

How is this related to data quality?

The absence of data governance, the lack of data quality awareness, and isolated data views (where such a gender disparity may have been noticed) lead to poor outcomes.

When businesses realize they’ve got a data quality problem, they panic about hiring. Consultants, engineers, and analysts are blindly hired to diagnose, clean up data and resolve issues ASAP. Unfortunately, months pass before any progress is made, and despite spending millions on the workforce, the problems don’t seem to disappear. A knee-jerk approach to a data quality problem is hardly helpful.

Here are three crucial steps to take if you want your AI/ML project to move in the right direction.

Share it:
Share it:

[Social9_Share class=”s9-widget-wrapper”]

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

You Might Be Interested In

Industry 40 demands new solutions for manufacturing

2 Dec, 2017

Life is changing fast for manufacturing companies. Not only is the business of designing, making, shipping, distributing and selling finished …

Read more

IoT boom and GDPR raise the stakes of a cyber security breach

11 Feb, 2017

The burgeoning number of connected devices as part of the Internet of Things will see the global spend on cyber …

Read more

How artificial intelligence can slowly change the healthcare landscape

20 May, 2019

The real test for AI systems will depend on the solution’s ability to integrate with the hospital or doctors’ workflow. …

Read more

Recent Jobs

IT Engineer

Washington D.C., DC, USA

1 May, 2024

Read More

Data Engineer

Washington D.C., DC, USA

1 May, 2024

Read More

Applications Developer

Washington D.C., DC, USA

1 May, 2024

Read More

D365 Business Analyst

South Bend, IN, USA

22 Apr, 2024

Read More

Do You Want to Share Your Story?

Bring your insights on Data, Visualization, Innovation or Business Agility to our community. Let them learn from your experience.

Get the 3 STEPS

To Drive Analytics Adoption
And manage change

3-steps-to-drive-analytics-adoption

Get Access to Event Discounts

Switch your 7wData account from Subscriber to Event Discount Member by clicking the button below and get access to event discounts. Learn & Grow together with us in a more profitable way!

Get Access to Event Discounts

Create a 7wData account and get access to event discounts. Learn & Grow together with us in a more profitable way!

Don't miss Out!

Stay in touch and receive in depth articles, guides, news & commentary of all things data.