OPP115 Corpus of Privacy Policy Annotations

Description:

CMU 2021-129: OPP-115 Data Set of Privacy Policy Annotations

Abstract

These annotations distinguish between 10 different types of data collection and use practices (first party collection/use, third party sharing/collection, user choice/control, user access/edit/deletion, data retention, data security, policy change, do not track, international & specific audiences, other). For each data practice, the annotation scheme further specifies a set of relevant attributes along with different possible values for these attributes.

Benefit

This data set can be used to train algorithms and create models

 

Publications

S. Wilson, F. Schaub, A.A. Dara, F. Liu, S.K. Cherivirala, P.G. Leon, M.S. Andersen, S. Zimmeck, K.M. Sathyendra, N.C. Russell, T.B. Norton, E. Hovy, J. Reidenberg, and N. Sadeh. The creation and analysis of a website privacy policy corpus. ACL '16: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, August 2016.










 

Patent Information:
Category(s):
Technology
For Information, Contact:
Fadwa Brady
Manager, Business Development & Licensing
CMU
fbrady@andrew.cmu.edu
Inventors:
Norman Sadeh-Koniecpol
Pedro Najera
Mads Andersen
Aswarth Dara
Sebastian Zimmeck
Florian Schaub
Shomir Wilson
Joel Reidenberg (deceased 4/21/2020)
N. Russell
Keywords: