Differential Privacy Tools Empowering Transparency in Data Research

Differential Privacy Tools Empowering Transparency in Data Research
Slide Note
Embed
Share

Workshop review on exploring sensitive data with differential privacy tools for transparent publication. OpenDP project aims to make sensitive datasets findable and explorable while ensuring privacy. Infrastructure enhancements for privacy-assured computation and secure storage are highlighted to support journal data repositories in managing sensitive datasets effectively.

  • Privacy
  • SensitiveData
  • DifferentialPrivacy
  • Transparency
  • ResearchData

Uploaded on Feb 27, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Reviewing Sensitive Data with Differential Privacy JEDI Workshop, November 20, 2020. Organized by Data-PASSS Alliance. As Open as Possible, As Closed as Necessary: Empowering Transparency in Publications Based on Sensitive Research Data Merc Crosas, Ph.D., Harvard University scholar.harvard.edu/mercecrosas @mercecrosas

  2. Differential Privacy tools to explore sensitive data A differentially private algorithm introduces a minimum amount of noise to released statistics to mathematically guarantee the privacy of any individual in the dataset OpenDP (https://opendp.org) is a community effort to build a trustworthy and open- source suite of differential privacy tools to explore sensitive data We are currently working on the first release of OpenDP and Dataverse integration What will this mean: Opens up sensitive data to the research community Sensitive datasets findable in Dataverse will be explorable through differentially private statistics, without ever accessing the original dataset Statistics included: mean, histogram, quantile, median, variance, OLS regression, logistic regression, probit regression, difference of means, unbiased privacy

  3. Openly findable data, secure computation and storage, differentially private releases Data owner/depositor Public Repository Sensitive datasets discoverable via repository (only metadata open) Approved Secure Compute Environment to run differential privacy statistics Notary Service Digitally signed attestations confirming DUA compliance Blue Green Yellow Trusted Remote Storage Agents (TRSA) or data enclaves Data Reviewer Differential Privacy release Orange Red Impact: Infrastructure for Privacy-Assured Computation https://cyberimpact.us/

  4. Journals dataverses will be able to support discovery, citation, and review of sensitive data 92 journal dataverses in Harvard Dataverse Replication datasets currently contain non-sensitive data With OpenDP-Dataverse integration, sensitive datasets will be archived in a secure storage Data reviewers and users will have access to differential privacy statistics of the sensitive datasets https://dataverse.harvard.edu

Related


More Related Content