What is text mining? - Definition from Whatis.com

Text mining is the analysis of data contained in natural language text. Text mining works by transposing words and phrases in unstructured data into numerical values which can then be linked with structured data in a database and analyzed with traditional data mining techniques.

Text mining is especially useful for tasks such as:

• Routing email to the appropriate department.

• Teasing out information about product satisfaction from text located in disparate data stores.

• Analyzing open-ended survey questions.

It’s generally accepted that unstructured data, most of it located in text files, accounts for at least 80% of an organization’s data.  Text mining can be challenging because natural language text is often inconsistent.  It contains ambiguities caused by semantics, slang and syntax. 

See also: full text database, business analytics, predictive analytics

Learn more: 

Text analytics market small but growing amid recession, according to IDC

Calculating text analytics ROI: Start small and focus on customer data

Text mining tools and prediction markets extract insight from unstructured data

 

This was last updated in May 2010
Editorial Director: Margaret Rouse

Email Alerts

Register now to receive SearchBusinessAnalytics.com-related news, tips and more, delivered to your inbox.
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy

Dig Deeper

Do you have something to add to this definition? Let us know.

Send your comments to techterms@whatis.com

Join the conversationComment

Share
Comments

    Results

    Contribute to the conversation

    All fields are required. Comments will appear at the bottom of the article.