Essential Guide

Using big data and Hadoop 2: New version enables new applications

A comprehensive collection of articles, videos and more, hand-picked by our editors

unstructured data

Unstructured data is a generic label for describing any data that is not in a database or other type of data structure. 

Unstructured data is a generic label for describing data that is not contained in a database or some other type of data structure .  Unstructured data can be textual or non-textual.  Textual unstructured data is generated in media like email messages, PowerPoint presentations, Word documents, collaboration software and instant messages.  Non-textual unstructured data is generated in media like JPEG images, MP3 audio files and Flash video files.

If left unmanaged, the sheer volume of unstructured data that’s generated each year within an enterprise can be costly in terms of storage. Unmanaged data can also pose a liability if information cannot be located in the event of a compliance audit or lawsuit.  The information contained in unstructured data is not always easy to locate.  It requires that data in both electronic and hard copy documents and other media be scanned so a search application can parse out concepts based on words used in specific contexts. This is called semantic search.  It is also referred to as enterprise search.  

In customer-facing businesses, the information contained in unstructured data can be analyzed to improve customer relationship management and relationship marketing. As social media applications like Twitter and Facebook go mainstream, the growth of unstructured data is expected to far outpace the growth of structured data.  According to the "IDC Enterprise Disk Storage Consumption Model" report released in Fall 2009, while transactional data is projected to grow at a compound annual growth rate (CAGR) of 21.8%, it's far outpaced by a 61.7% CAGR prediction for unstructured data. 

See also:data mining, raw data, social CRM

This was first published in April 2010

Continue Reading About unstructured data


'unstructured data' is part of the:

View All Definitions



Find more PRO+ content and other member only offers, here.

Essential Guide

Managing Hadoop projects: What you need to know to succeed



Forgot Password?

No problem! Submit your e-mail address below. We'll send you an email containing your password.

Your password has been sent to:


File Extensions and File Formats

Powered by: