Back to Search
Start Over
Classifier evaluation and attribute selection against active adversaries
- Source :
- Data Mining and Knowledge Discovery. 22:291-335
- Publication Year :
- 2010
- Publisher :
- Springer Science and Business Media LLC, 2010.
-
Abstract
- Many data mining applications, such as spam filtering and intrusion detection, are faced with active adversaries. In all these applications, the future data sets and the training data set are no longer from the same population, due to the transformations employed by the adversaries. Hence a main assumption for the existing classification techniques no longer holds and initially successful classifiers degrade easily. This becomes a game between the adversary and the data miner: The adversary modifies its strategy to avoid being detected by the current classifier; the data miner then updates its classifier based on the new threats. In this paper, we investigate the possibility of an equilibrium in this seemingly never ending game, where neither party has an incentive to change. Modifying the classifier causes too many false positives with too little increase in true positives; changes by the adversary decrease the utility of the false negative items that are not detected. We develop a game theoretic framework where equilibrium behavior of adversarial classification applications can be analyzed, and provide solutions for finding an equilibrium point. A classifier's equilibrium performance indicates its eventual success or failure. The data miner could then select attributes based on their equilibrium performance, and construct an effective classifier. A case study on online lending data demonstrates how to apply the proposed game theoretic framework to a real application.
- Subjects :
- TheoryofComputation_MISCELLANEOUS
Computer Networks and Communications
Computer science
Population
Feature selection
02 engineering and technology
Intrusion detection system
computer.software_genre
Machine learning
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
False positive paradox
education
Equilibrium point
education.field_of_study
business.industry
Adversary
Computer Science Applications
020201 artificial intelligence & image processing
Artificial intelligence
Data mining
business
Classifier (UML)
computer
Game theory
Information Systems
Subjects
Details
- ISSN :
- 1573756X and 13845810
- Volume :
- 22
- Database :
- OpenAIRE
- Journal :
- Data Mining and Knowledge Discovery
- Accession number :
- edsair.doi.dedup.....4d697f71c9e95ed93feb774b261a5d05
- Full Text :
- https://doi.org/10.1007/s10618-010-0197-3