Back to Search Start Over

Unsupervised Labelling of Stolen Handwritten Digit Embeddings with Density Matching

Authors :
Thomas Thebaud
Gaël Le Lan
Anthony Larcher
Orange Labs R&D [Rennes]
France Télécom
Laboratoire d'Informatique de l'Université du Mans (LIUM)
Le Mans Université (UM)
Source :
Lecture Notes in Computer Science ISBN: 9783030616373, ACNS Workshops, International Workshop on Security in Machine Learning and its Applications (SiMLA), International Workshop on Security in Machine Learning and its Applications (SiMLA), Oct 2020, Rome, Italy
Publication Year :
2020
Publisher :
Springer International Publishing, 2020.

Abstract

International audience; Biometrics authentication is now widely deployed, and from that omnipresence comes the necessity to protect private data. Recent studies proved touchscreen handwritten digits to be a reliable biomet-rics. We set a threat model based on that biometrics: in the event of theft of unlabelled embeddings of handwritten digits, we propose a labelling method inspired by recent unsupervised translation algorithms. Provided a set of unlabelled embeddings known to have been produced by a Long Short Term Memory Recurrent Neural Network (LSTM RNN), we demonstrate that inferring their labels is possible. The proposed approach involves label-wise clustering of the embeddings and label identification of each group by matching their distribution to the label-relative classes of a comparison hand-crafted labeled set of embeddings. Cluster labelling is done through a two steps process including a genetic algorithm that finds the N-best matching hypotheses before a fine-tuning of those N-candidates. The proposed method was able to infer the correct labels on 100 randomised runs on different dataset splits.

Details

ISBN :
978-3-030-61637-3
ISBNs :
9783030616373
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783030616373, ACNS Workshops, International Workshop on Security in Machine Learning and its Applications (SiMLA), International Workshop on Security in Machine Learning and its Applications (SiMLA), Oct 2020, Rome, Italy
Accession number :
edsair.doi.dedup.....0482020047d499ef10d8c4a2f6d4affb
Full Text :
https://doi.org/10.1007/978-3-030-61638-0_30