TitleOn the declassification of confidential documents
Publication TypeConference Proceedings
Year of Conference2011
AuthorsAbril D, Navarro-Arribas G, Torra V
Editor, Long J
Conference LocationChangsha, China
Date Published28/07/2011
Keywordsanonymity, Data Privacy, declassification, Information Retrieval, named-entity recognition, pattern classification, privacy preserving information retrieval, semantic

Abstract. We introduce the anonymization of unstructured documents to settle the base of automatic declassification of confidential documents. Departing from known ideas and methods of data privacy, we introduce the main issues of unstructured document anonymization and propose the use of named entity recognition techniques from natural language processing and information extraction to identify the entities of the document that need to be protected.