|Table of Contents|

[1] Zhao Wei, He Jianmin, Wang Chunlin, Chen Jinbo, et al. Application of a cost-sensitive methodfor churn prediction in telecommunication industry [J]. Journal of Southeast University (English Edition), 2007, 23 (1): 135-138. [doi:10.3969/j.issn.1003-7985.2007.01.027]
Copy

Application of a cost-sensitive methodfor churn prediction in telecommunication industry()
一种代价敏感学习方法在电信业流失预测中的应用
Share:

Journal of Southeast University (English Edition)[ISSN:1003-7985/CN:32-1325/N]

Volumn:
23
Issue:
2007 1
Page:
135-138
Research Field:
Economy and Management
Publishing date:
2007-03-30

Info

Title:
Application of a cost-sensitive methodfor churn prediction in telecommunication industry
一种代价敏感学习方法在电信业流失预测中的应用
Author(s):
Zhao Wei He Jianmin Wang Chunlin Chen Jinbo
School of Economics and Management, Southeast University, Nanjing 210096, China
赵巍 何建敏 王纯麟 陈金波
东南大学经济管理学院, 南京 210096
Keywords:
cost-sensitive learning C4.5 telecommunication industry customer churn
代价敏感学习 C4.5 电信业 客户流失
PACS:
F626;TP391
DOI:
10.3969/j.issn.1003-7985.2007.01.027
Abstract:
To deal with the data mining problem of asymmetry misclassification cost, an innovative churn prediction method is proposed based on existing churn prediction research.This method adjusts the misclassification cost based on the C4.5 decision tree as a baseline classifier, which can obtain the prediction model with a minimum error rate based on the assumption that all misclassifications have the same cost, to realize cost-sensitive learning.Results from customer data of a certain Chinese telecommunication company and the fact that the churners and the non-churners have different misclassification costs demonstrate that by altering the sampling ratio of churners and non-churners, this cost-sensitive learning method can considerably reduce the total misclassification cost produced by traditional classification methods.This method can also play an important role in promoting core competence of Chinese telecommunication industry.
根据已有的流失预测方法, 提出新的流失预测方法解决数据挖掘中的非对称错分代价问题.该方法以传统C4.5决策树算法为基准分类器, 融合代价调整方法实现代价敏感学习.相比之下, C4.5决策树算法仅是基于样本错分代价相同假定, 建立了一种错分率最低而非总错分代价最低的预测模型.基于某电信企业的客户数据, 及流失客户和非流失客户代价非对称的实际, 实证研究结果表明, CS-C4.5通过调整流失类和非流失类样本的比例, 大大降低了传统分类算法的样本错分总代价.该方法对于提高电信企业的核心竞争力具有重要的现实意义.

References:

[1] Hung S Y, Yen C D, Wang H Y.Applying data mining to telecom churn management[J].Expert Systems with Applications, 2006, 31(3):515-524.
[2] Ma H M, Yin H B, Guo X.The mining model of defecting customers[J]. Journal of Huazhong University of Science and Technology: Nature Science Edition, 2003, 31(9):28-30.(in Chinese)
[3] Mozer M C.Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry [J].IEEE Trans on Neural Networks, 2000, 11(3):690-696.
[4] Yao M, Shen B, Li M F.A kind of unusual customers recognition system based on multi-criteria neural network and CART in telecom system[J].Systems Engineering — Theory & Practice, 2004, 24(5):78-83.(in Chinese)
[5] Chan P, Stolfo S.Towards scalable learning with non-uniform class and cost distributions:a case study in credit card fraud detection[C]//Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining.New York, USA, 1998:164-168.
[6] Liu Y, Yang Y, Carbonell J.Boosting to correct inductive bias in text classification[C]//Proceedings of the Eleventh International Conference on Information and Knowledge Management.Pittsburgh, USA, 2002:348-355.
[7] Foster D P, Stine R A.Variable selection in data mining:building a predictive model for bankruptcy[J].Journal of the American Statistical Association, 2004, 99(466):303-313.
[8] Zheng E H, Li P, Song Z H.SVM-based cost sensitive mining[J].Information and Control, 2006, 35(3):294-298.(in Chinese)
[9] Quinlan J R.C4.5:Programs for machine learning[M].San Mateo, CA:Morgan Kaufmann Publishers, 1993.
[10] Duda R O, Hart P E, Stork D G.Pattern classification[M].2nd ed.John Wiley and Sons, Inc., 2000.
[11] Viaene S, Dedene G.Cost-sensitive learning and decision making revisited[J].European Journal of Operational Research, 2005, 166(11): 212-220.
[12] Ciraco M, Rogalewski M, Weiss G.Improving classifier utility by altering the misclassification cost ratio[C]//ACM SIGKDD Workshop on Utility-Based Data Mining.Chicago, USA, 2005:46-52.

Memo

Memo:
Biographies: Zhao Wei(1980—), male, graduate;He Jianmin(corresponding author), male, professor, nj.jian@public1.ptt.js.cn.
Last Update: 2007-03-20