What are the differences between data mining, data warehousing and data querying?
A data warehouse is a repository of data designed to facilitate information retrieval and analysis. The data contained within a data warehouse is often consolidated from multiple systems making analysis across those systems quicker and easier.
Data mining and data querying represent two methods of retrieval and analysis.
Data querying is the process of asking questions of data in search of a specific answer. Unlike many forms of search (i.e. Google), queries are normally structured and require specific parameters or code, known as SQL (Structured Query Language). A query could be written to answer questions like, "How many items were sold in Region 2 last month?"
Data mining is the process of sorting through large amounts of data to identify patterns and relationships using statistical algorithms. These relationships may help us to understand which factors affected the outcome of something, or they may be used to predict future outcomes. Data mining might be used to answer questions like, "What factors affected sales in Region 2 last month?" Knowing which factors drive sales in the past could help to predict or make estimates about sales in the future.
More about data warehousing
More about data mining
More about data querying
- Data query questions and answers on IT Knowledge Exchange
Dig Deeper on Business intelligence data mining
Related Q&A from William McKnight
Are business intelligence certifications worth it? Get certification advice from our business intelligence expert, William McKnight. Continue Reading
There are loads of business intelligence tools out there on the market today. Our business intelligence expert provides some valuable resources for ... Continue Reading
Get examples of how data mining is used in vertical industries, such as retail, manufacturing, healthcare, financial and telecommunications. Continue Reading