CMU 2021-129: OPP-115 Data Set of Privacy Policy Annotations
Abstract
These annotations distinguish between 10 different types of data collection and use practices (first party collection/use, third party sharing/collection, user choice/control, user access/edit/deletion, data retention, data security, policy change, do not track, international & specific audiences, other). For each data practice, the annotation scheme further specifies a set of relevant attributes along with different possible values for these attributes.
Benefit
This data set can be used to train algorithms and create models
Publications
S. Wilson, F. Schaub, A.A. Dara, F. Liu, S.K. Cherivirala, P.G. Leon, M.S. Andersen, S. Zimmeck, K.M. Sathyendra, N.C. Russell, T.B. Norton, E. Hovy, J. Reidenberg, and N. Sadeh. The creation and analysis of a website privacy policy corpus. ACL '16: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, Berlin, Germany, August 2016.