Text mining is the analysis of data contained in natural language text. Text mining works by transposing words and phrases in unstructured data into numerical values which can then be linked with structured data in a database and analyzed with traditional data mining techniques.
Text mining is especially useful for tasks such as:
• Routing email to the appropriate department.
• Teasing out information about product satisfaction from text located in disparate data stores.
• Analyzing open-ended survey questions.
It’s generally accepted that unstructured data, most of it located in text files, accounts for at least 80% of an organization’s data. Text mining can be challenging because natural language text is often inconsistent. It contains ambiguities caused by semantics, slang and syntax.
See also: full text database, business analytics, predictive analytics
Learn more:
Text analytics market small but growing amid recession, according to IDC
Calculating text analytics ROI: Start small and focus on customer data
Text mining tools and prediction markets extract insight from unstructured data
Business Intelligence Strategies for the CIO
Join the conversationComment
Share
Comments
Results
Contribute to the conversation