Moving from data swamp to smart data

Over the last decade, the increased use of unstructured and alternative data, and the ascendance of the cloud, have posed an overt challenge to the relational database model based on structured inputs and batch transmission. The transition from the classic SQL-on-Hadoop database toward the unstructured, real-time data hub or lake has come with costs, however.

These costs are best understood within the context of a four-stage data lifecycle comprising capture, transformation, extraction and delivery. Challenges related to capture and transformation revolve around inconsistent data quality and the need for manual intervention. Extraction and delivery processes, on the other hand, tend to founder on the expectations of the end user or business unit.

End-user dissatisfaction with existing business intelligence processes and the need for more timely delivery of pricing, operational and analytical insight have spurred a generalized reliance on workarounds, which may include the unauthorized input and removal of data. The knock-on effects of these workarounds have in turn fostered a desire for an end-to-end, all-embracing data solution.

The process of eliminating data marts (and the superstructure repositories that contain them) presupposes massive resource deployment as well as a philosophical about-face. Resource deployment extends beyond data cleanup and the transition from legacy architecture to include the investment required to build and support the data lake structure. The philosophical about-face relates to the fundamental purpose of data within an organization.

In this new world, data management is not a top-down process designed for a single use case. Rather, improved data management should enable the creation of a so-called golden source, in which universally accepted and accessible data is summoned by business partners according to their needs.

Look Before You Leap

A data lake that can store and process data in its native or raw format represents the latest iteration of this vision, but not the end state.
