Professional Documents
Culture Documents
Huifangzeng 2010
Huifangzeng 2010
Abstract- Based on the process model CRISP-DM, this The objective of this paper is to remedy the dilemma
paper has solved E-marketing problems like network customer discussed above and to propose the usage of a six- phase
prediction and market segmentation. Each phase of this model knowledge discovery and data mining process
was described firstly, and then its application in E-marketing model-CRISP-DM in previously unexplored area of
was presented, which provides a complete description of all the E-marketing. We present the framework of the process
steps, from understanding the problem domain to using the model firstly, and then describe each step in detail. After all,
discovered knowledge. Finally a KDDM procedure of network we aim for proposing a simple operating and easy
marketing is discussed thoroughly, and a simple operating and understanding process model which should be reliable and
easy understanding process model is provided to marketing repeatable by E-marketing people with little data mining
people who have difficulties to make marketing decisions. background.
Key words - E-marketing/network marketing, e-customer,
data mining process, CRISP-DM II. THE CRISP-DM MODEL AND E-MARKETING
E-marketing is challenging.
CRISP-DM is currently the most popular and broadly
adopted model which has been already acknowledged and
relatively widely used in both research and industrial Fig. 1 The CRISP-DM process model
communities [3]. For instance, CRISP-DM model is used in 1) Business Understanding: involve the project
automotive direct marketing by Gersten et al.[4], processing objectives and requirements, define data mining problems,
small and medium sized enterprises' data [5], for customer and design a preliminary plan to achieve the objectives.
churn prediction[6], and so on.. Nevertheless, little research 2) Data Understanding: start with the initial data
has used it to develop E-marketing problems. In fact, it collection and further gather the description of data, the
almost has become a prescribed standard, thus this paper exploration of data, and the verification of data quality.
argues that specializing CRISP-DM for discovering
knowledge in website is significant.
∗
This work is partially supported by National Natural Science Foundation of China grant #70771044 and #70872020, and Jinan University Fund for
Humanities and Social Science grant #006JSYJ013.
3961
1) Determine the E-marketing objectives: Using data females males
mining techniques to analyze network customer data we can 100%
segment the market at all levels, thus provide reliable basis
for company to locate its target market. 80% 57.0% 56.5% 53.0%
2) E-marketing program development: Usage of data
mining technology analyzes customer consumption data; 60%
digs out the customer's consumption patterns in order to
help vendors develop effective marketing programs, so as to 40%
effectively win customers and create profits.
3) Carry out one to one marketing: Enterprises can use 20% 43.0% 43.5% 47.0%
3962
rules, cluster analysis, classification analysis to complete In the modeling phase, we did not use any specific data
our network marketing purposes. to verify the validity of the model, which only provides a
1) Association Rules: The purpose of association rules process for reference. The results may not be as expected,
is to search the relevance of different items in the same you need to establish a large number of models to compare
event. Such as the relevance of different commodities in a and analyze. If the results are still not satisfied, we must
purchase activity, while the most typical example is the consider looping back to the second or third step, whether
market basket analysis. We can carry out one to one the prepared data is suitable for the tool of choice.
marketing sales and develop E-marketing plans with the Generally, the cost of data collection is huge. If we can use
help of correlation analysis. the KDDM techniques and mining results to improve data
2) Cluster Analysis: Cluster analysis is used to put collection, not only we can save the cost including savings,
together the data items and customers that have similar preprocessing, etc, but also directly affect the ability of
characteristics. In the network marketing, in order to carry knowledge discovery.
out targeted marketing and provide more appropriate and
satisfactory service, using clustering technology to divide D. Evaluation and Deployment
the market into a number of segments on the basis of The keys steps in the evaluation phase are the
available customer data is necessary. evaluation of results, the process review, and the
3) Classification Analysis: Classification was based on determination of next steps. As shown in Figure 6, the
the value of variables for calculation, and then classified Evaluation Chart node offers an easy way to evaluate and
according to the results. It usually involves two kinds of compare predictive models to choose the best model for
statistical methods: logistic regression and discriminated your application. There are five types of evaluation charts:
analysis, however, neural networks and decision trees have Gains Charts, Lift Charts, Response Charts, Profit Charts,
been adopted gradually. By analyzing the existing historical and ROI Charts, each of which emphasizes a different
data, classification can summarize a prediction model that evaluation.
could predict which customers might react to the Here we give an example of gains charts showing
advertising, product catalogs, etc. Then you can determine Figure 6. Gains are defined as the proportion of total hits
the network marketing objectives and provide personalized that occurs in each quantile. Cumulative gains charts always
information services, addressing the characteristics of this start at 0% and end at 100% as you go from left to right. For
type of client. a good model, the gains chart will rise steeply toward 100%
In the research described in this paper, the modeling and then level off. A model that provides no information
was supported by the data mining tool Clementine from will follow the diagonal from lower left to upper right.
SPSS, mainly because its breadth of techniques, its process
support, and its scripting facilities. Using a visual
programming interface, Clementine offers rich facilities for
exploring and manipulating data. It also contains several
modeling techniques and offers standard graphics for
visualization. The single operations are represented by
% Gain
Fig. 5 The process of modeling According to the standard data mining process,
CRISP-DM, one can directly collect data that are essential
3963
and useful for the mining results. It also offers practical to plan e-marketing activities,” Business Horizons, vol. 49, pp. 51-60,
help to those KD researchers both from industry and 2006.
academia. In this paper, a KD procedure and method of [2] I.S.Y. Kwan, J. Fong and H.K. Wong, "An e-customer behavior model
E-marketing is discussed thoroughly, and a simple operating with online analytical mining for internet marketing planning,"
and easy understanding process model is presented to Decision Support Systems, vol. 41, pp. 189–204, 2005.
marketing people with little data mining background. [3] L.A. Kurgan and P. Musilek, "A survey of Knowledge Discovery and
In addition, the traditional market research methods are Data Mining process models," The Knowledge Engineering Review, vol.
highly subjective; it is difficult to support the objective 21, pp. 1-24, 2006.
marketing decisions. While the KDDM process model can [4] W. Gersten, R. Wirth and D. Arndt, "Predictive modeling in automotive
be effective in helping market analyst find the distribution direct marketing: tools, experiences and open issues," In Proceeding of
and propensity of customers, thus to predict customer needs, the 6th ACM SIGKDD International Conference on Knowledge
determine the marketing strategy and ultimately to develop Discovery and Data Mining, pp. 398–406, 2000.
effective marketing plans. [5] Z. Bošnjak, O. Grljević and S. Bošnjak, "CRISP-DM as a Framework
However, dynamic characteristics of the marketing for Discovering Knowledge in Small and Medium Sized Enterprises'
make KDDM difficult to deal with unstructured data, and a Data," 5th International Symposium on Applied Computational
complete dynamic model can not be established to support Intelligence and Informatics, pp. 509-514, 2009.
marketing decisions. In addition, theoretical and applied [6] Z. Mo, S. Zhao, L. Li and A. Liu, "A Predictive Model of Churn in
data mining research on network marketing still at an early Telecommunications Based on Data Mining," 2007 IEEE International
exploratory stage, the dynamic characteristics add various Conference on Control and Automation, pp. 809-813, 2007.
difficulties to the study. We can say that KDDM process [7] C. Shearer, "The CRISP-DM model: the new blueprint for data
model for the academic study of E-marketing have a long mining," Journal of Data Warehousing, vol. 5, pp. 13–19, 2000.
way to go. [8] Data sources: http://research.cnnic.cn/
[9] T.W. Liao and E. Triantaphyllou, "Recent Advances in Data Mining of
REFERENCES Enterprise Data: Algorithms and Applications," Series on Computers
[1] S. Krishnamurthy, “Introducing e-markplan: A practical methodology and Operations Research, vol. 6, 2008.
3964