A^4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

Shetty, Rakshith; Schiele, Bernt; Fritz, Mario

Shetty, R., Schiele, B., & Fritz, M. (2017). A^4NT: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation. Retrieved from http://arxiv.org/abs/1711.01921.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-002E-271D-B
Version Permalink: https://hdl.handle.net/21.11116/0000-0000-4357-3

Genre: Paper

Latex : $A^{4}NT$: Author Attribute Anonymity by Adversarial Training of Neural Machine Translation

Other : A4NT

Files

arXiv:1711.01921.pdf (Preprint), 745KB

File Permalink:
-

Name:
arXiv:1711.01921.pdf

Description:
File downloaded from arXiv at 2017-11-09 08:02

OA-Status:

Visibility:
Private

MIME-Type / Checksum:
application/pdf

Technical Metadata:

Copyright Date:
-

Copyright Info:
-

License:
http://arxiv.org/help/license

Creators

Creators:
Shetty, Rakshith¹, Author
Schiele, Bernt¹, Author
Fritz, Mario¹, Author

Affiliations:
¹ Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547

Content

Free keywords: Computer Science, Cryptography and Security, cs.CR; Computer Science, Computation and Language, cs.CL; Computer Science, Computers and Society, cs.CY; cs.SI; Statistics, Machine Learning, stat.ML

Abstract: Text-based analysis methods can reveal privacy-relevant author attributes such as the gender, age, and identity of a text's author. Such methods can compromise the privacy of an anonymous author even when the author tries to remove privacy-sensitive content. In this paper, we propose an automatic method, called Adversarial Author Attribute Anonymity Neural Translation ($A^4NT$), to combat such text-based adversaries. We combine sequence-to-sequence language models used in machine translation and generative adversarial networks to obfuscate author attributes. Unlike machine translation techniques, which need paired data, our method can be trained on unpaired corpora of text from different authors. Importantly, we propose and evaluate techniques to impose constraints on $A^4NT$ that preserve the semantics of the input text. $A^4NT$ learns to make minimal changes to the input text to successfully fool author attribute classifiers, while aiming to maintain the meaning of the input. We show through experiments on two different datasets and three settings that our proposed method is effective in fooling the author attribute classifiers, thereby improving the anonymity of authors.

Details

Language(s): eng - English

Dates: Created: 2017-11-06; Modified: 2017-11-07; Published Online: 2017

Publication Status: Published online

Pages: 15 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 1711.01921
URI: http://arxiv.org/abs/1711.01921
BibTex Citekey: shettyANT17arxiv

Degree: -
