Noisy text is an electronically-stored communication that cannot be categorized properly by a text mining software program. In an electronic document, noisy text is characterized by a discrepancy between the letters and symbols in the HTML code and the author's intended meaning.
By submitting your email address, you agree to receive emails regarding relevant topic offers from TechTarget and its partners. You can withdraw your consent at any time. Contact TechTarget at 275 Grove Street, Newton, MA.
Noisy text does not comply with rules the program uses to identify and categorize words, phrases and clauses in a particular language. Idiomatic expressions, abbreviations, acronyms and business-specific lingo can all cause noisy text. It is particularly prevalent in the unstructured text found in blog posts, chat conversations, discussion threads and SMS text messages. Other potential causes include poor spelling and punctuation, typographical errors and poor translations from optical (OCR) and speech recognition programs.