Back to Search
Start Over
Behavior Analysis Based SMS Spammer Detection in Mobile Communication Networks
- Source :
- DSC
- Publication Year :
- 2016
- Publisher :
- IEEE, 2016.
-
Abstract
- In a communication network, automatic short message service (SMS) spammer detection is a big challenge for a telecommunication operator nowadays, especially with the development of the rich communication services (RCS). Three main problems exist in the areas of research and real practice. They are (1) the whole-volume content based SMS spam detection techniques cannot be easily used on the side of network due to the issue of user privacy, (2) traditional ways to filter the spam according to the combination of key words and sending frequency can be easily bypassed by adding the interference words, (3) Most of them result in a great deal of manual review after the automatic filtering due to a low precision rate. To make up the aforementioned gaps, we study the user behavior characteristics. A two-dimensional visualized result indicates that any combination of two user behavior attributes cannot distinguish the abnormal users from the whole set by splitting the 2-dimensional space. Thus, the integration of multiple user behavior attributes is exploited to train the classifier in a labeled set by machine learning algorithms, respectively, including decision tree, random forest, supported vector machine (SVM), logistic regression, and self-organized feature mapping (SOM). The performance comparison indicates that random forest is a good choice to balance the tradeoff of the precision rate and the recall rate, and in an acceptable time. The experimental result shows the proposed method without the knowledge of SMS content has a significant improvement in terms of precision rate and recall rate compared with the traditional method using the combination of key words and sending frequency used in most of existing networks.
- Subjects :
- Short Message Service
business.industry
Computer science
Decision tree
02 engineering and technology
computer.software_genre
Machine learning
Telecommunications network
Electronic mail
Random forest
Spamming
Support vector machine
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Mobile telephony
Data mining
Artificial intelligence
business
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2016 IEEE First International Conference on Data Science in Cyberspace (DSC)
- Accession number :
- edsair.doi...........01bd2f82ba143d66d505ef66439821e2