Data Mining: What, Why and Where?

 
I.T.Times, vol. 10, no. 5, page 42, by Mr. J. Anton Ravindran, Executive Director, Genovate.com – May, 2000
 
"Data mining is an iterative process of knowledge discovery from Stored information. It is the process of discovering and extracting information from large databases and identifying meaningful new correlations, patterns and trends."
 
What is data mining? In today's "dot com" world, no industry is an island; neither is any company.What is profitable today may not be tomorrow as a new technology may change the cost structure overnight. The role of data mining is now greater than ever as the Internet frontier is swiftly eliminating borders between markets and countries.
 
The competition a business faces today is now more than twice likely to be foreign than local because many more companies can now successfully compete globally. According to a study by Gartner Group, by the middle of this year, at least half of the Fortune 1000 companies world wide will be using data mining technology. Meta Group says that, "the majority of global 2000 organisations will find data mining to be critical to their business by year 2000".
 
Data mining techniques are rapidly becoming a necessity for business survival. Heralded as the competitive weapon for businesses to succeed in the digital economy, the term "data mining" has become one of the most loosely used terms in IT vocabulary during the past 24 months. But ask a variety of vendors and professionals for their vision of what data mining solution is and how it should be built, and the ambiguity of the term will quickly become apparent.
 
The exponential growth in the amount of data available in organisations has actually made it more difficult for most to get the information they require on a timely basis. A study done by Input in 1999 found that senior executives in Asia South reported that they were not getting the business information they needed in the areas of sales and marketing, production, planning, customer services, financial accounting, and material/inventory management. The same study also found that only a quarter of the organisations surveyed had installed any business intelligence solution. However, studies indicate that this situation is about to change, as two-thirds of the organisations that have not yet installed business intelligence solutions are planning to do so in the next 12 months.
 
Data mining is an iterative process of knowledge discovery from stored information. It is the process of discovering and extracting information from large databases and identifying meaningful new correlations, patterns, and trends. Data mining tools analyse data and discover important relationship within the data that may not be intuitive or obvious to naked eye, thereby providing users a new and more focused base of knowledge that often impacts business competitiveness.
 
Some of the leading vendors and their business intelligence (BI) and data mining solutions at the forefront of this wave include Crystal reports and Holos from Seagate Software, Intelligent Miner from IBM, and PowerPlay and Impromptu from Cognos.
 
Although the market place for data mining currently features a wide range of products, the underlying technology has a rich tradition that goes back at least two decades.The pioneers for statistical analysis, an early name for data mining, were products such as SPSS and SAS which originally contained classical routines such as regression, cross tabulation, correlation and Chi-Square. During the heydays of Artificial Intelligence in the late 1980s, new technologies such as fuzzy logic, heuristic reasoning and neural networks were incorporated into the classical statistical analysis.
 
In the '90s, with the introduction of data warehousing coupled with the expertise to implement the best practice approaches from classical statistical analysis, neural networks, decision trees, market basket analysis and other powerful techniques have resulted in data mining gaining wider acceptance.
 
Most of the end users are confused about the roles of traditional query tools based on Structured Query Language and data mining. Query tools and data mining tools are complementary and it should be noted that a data-mining tool is not an alternative for query tools but provides additional features and functionality to uncover "hidden knowledge" in the repository. Organisations are turning to data mining as a tool to achieve new levels of business intelligence to counter the heightened competition and to increase profits.
 
The major objectives are to provide an understanding of data relationship and to predict the future trends based on historical data. These two objectives are complementary to and interdependent on each other.
 
 
Why do We Need Data Mining?
 
From a competitive analysis, firstly, it helps to avoid surprises of competition and enables businesses to plan and execute strategies to forestall upcoming moves by competition. Secondly, data mining will enable organisations to identify oppurtunities that will result in sustainable advantage. For example, by reducing cost of production or by significantly increasing value to the customers. Thirdly the cost of wrong decision or indecision due to faulty or insufficient data is graver than ever before.
 
A number questions arise. Should we enter this market segment? Should we acquire this company? Should we launch this product/service now? Lastly, the shelf-life of products are increasingly becoming shorter and the cost of obsolescence is high.
 
From a service standpoint, data mining increases customer retention by developing target customer loyalty programs and by passing on the cost saving to customers arising from reduced risk and fraud and improved efficiency and effectiveness.
 
For the company's management, the benefits of data mining tools include speeding up the decision cycle as a result of the shortened data analysis process and by requiring minimal or no technical expertise, as much of the procedure is automated and user friendly, if not "idiot-proof". Furthermore, the ever-increasing frequency of change is driving the need for constant monitoring of the market developments.
 
Like any technology, data mining tools can become a concern if individual privacy is violated and used for unethical objectives. Another problem may arise if the captured data is inaccurate, inconsistent or even unauthorised. Users may enter wrong data intentionally or unintentionally and that may cause an adverse effect.
 
 
Where is Data Mining Going?
 
Most early adopters of data mining have tended to be in the traditional information-intensive sectors such as financial services and direct marketing. A wide range of tools are available today and have been successfully deployed in a wide range of industries ranging from finance, retail, healthcare, telecommunications, manufacturing to defence and logistics. Types of data mining applications include profiling customer habits, profiling customer demographics, time series analysis to study seasonal buying behaviour, retention management of patients, customers and employees, risk forecast and profitability analysis.
 
In addition, data mining technology has played a role in spawning new disiplines such as Customer Relationship Management while playing a pivotal role in Supply Chain Management. Customer profiling is one of the most widely pursued application disciples today. By incorporating data mining techniques for target marketing purposes, businesses are able to improve response rates, better identify the target market segments for promotional activity, and market basket analysis will identify the relationships amongst items purchased by customers. Data mining techniques are being widely used with increasing success in fraud detaction, credit scoring and risk management by banks and credit card companies. In the manufacturing industry, data mining is being used for process control and quality control by studying the physical conditions such as temperature or mechanical specifications rather than financial and demographics attributes. In the field of medicine, host of data mining applications is used to track the effectiveness of experimental drugs.
 
What is the future of data mining and business intelligence? Based on the momentum and the acceptance of this technology, as well as the increasing reliance of business for rapid access to information (hundreds of gigabytes) on market developments, one can envisage the natural progression of business intelligence tools of today. These tools may end up as standard tools amalgamated with the office automation products of tomorrow. With the ever decreasing cost of storage and RAMs, and continued doubling of processing power every 12-18 months, together with the ease of use, the integeration of data mining tools into the DBMS (database management system) will be no different from the integration of grammar and spell checker with the word processor.
 
« Back to Pressroom
   
 
 
 
  © Copyright 2005 Genovate Solutions Pte. Ltd.
All Rights Reserved.