Efficient Data Management and Export Tools for FaceBase Bootcamp


Explore the comprehensive tools and processes for managing, exporting, and sharing data in the FaceBase Bootcamp, including installing DERIVA Client Tools, exporting data efficiently, utilizing BDBag for bulk downloads, and following a streamlined data submission process.

  • Data Management
  • Data Export
  • FaceBase Bootcamp
  • BDBag
  • DERIVA Tools


Presentation Transcript


  1. FaceBase Bootcamp Nov. 18, 2020

  2. Additional FaceBase topics:
       1. Installing DERIVA Client Tools
       2. Exporting Data
       3. Requesting human-subject protected data
       4. Primer on data submission and uploading data

  3. Installing DERIVA Client Tools
       • Suite of reliable tools for batch uploading and downloading data
       • GUI and command-line tools for Mac, Windows and Linux
       • Detailed documentation: https://github.com/informatics-isi-edu/facebase-curation/wiki/Deriva-Clients
       • Mac and Windows: download the bundle from https://github.com/informatics-isi-edu/deriva-client-bundle/releases
       • Linux: install with pip: pip3 install --user deriva-client
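After the pip install on Linux, it can be useful to confirm that the package is visible to the Python interpreter you intend to use. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical and is not part of the DERIVA tools. It assumes Python 3.8 or later.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str = "deriva-client") -> Optional[str]:
    """Return the installed version of a pip distribution, or None if it is absent.

    `installed_version` is an illustrative helper, not a DERIVA function.
    """
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version is None:
        # Same command as on the slide.
        print("deriva-client not found; try: pip3 install --user deriva-client")
    else:
        print(f"deriva-client {version} is installed")
```

Run it with the same `python3` that `pip3` installs into; a `--user` install is only visible to that interpreter.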

  4. Exporting Data from FaceBase
     Individual File Download
       • Click on a (highlighted) file and the browser will prompt you to save the file to your computer
     Bulk Export: CSV
       • Downloads a CSV file of the metadata of the search results
       • Available on both the Results and Detail pages
       • Most useful for downloading the results of a search in tabular form
     Bulk Export: BAG
       • Downloads a BDBag (Big Data Bag) file with metadata and information about the data files (assets)
       • The BDBag file is used by our DERIVA Bag Tool to export all the files to your machine
       • Available on the Details page (Datasets, Experiment)
       • Most useful for downloading all data from a dataset, especially with a large number of files or a large data volume
       • The tool is very robust: it performs validation, the console shows progress and status, and an export can be restarted from where it left off
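Once a CSV export has been saved, the metadata can be processed with any tabular tool. As a minimal sketch, the snippet below tallies rows by one column using Python's standard `csv` module. The column names (`RID`, `Title`, `Species`) and the sample rows are assumptions made up for illustration; a real export will have whatever columns the search results page shows.

```python
import csv
import io
from collections import Counter

# Hypothetical stand-in for a downloaded CSV export; the columns and
# values are invented for this example.
SAMPLE_EXPORT = """RID,Title,Species
1-AAAA,Example dataset A,Mus musculus
1-BBBB,Example dataset B,Homo sapiens
1-CCCC,Example dataset C,Mus musculus
"""


def count_by_column(csv_file, column):
    """Count how many rows share each value of `column`."""
    reader = csv.DictReader(csv_file)
    return Counter(row[column] for row in reader)


counts = count_by_column(io.StringIO(SAMPLE_EXPORT), "Species")
for value, n in counts.most_common():
    print(f"{value}: {n}")
```

For a real file, replace the `io.StringIO(...)` argument with `open("export.csv", newline="", encoding="utf-8")`, where `export.csv` is whatever name the browser saved.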

  5. BDBag Export
     What is a BDBag?
       • BDBag (Big Data Bag) is a standard for reliable sharing of data collections by transferring a "bag" of digital content
       • A BDBag consists of a hierarchical directory containing all data and metadata
       • It provides verification that you have all the files you were trying to export and that they were not corrupted in the process
       • In FaceBase we use them to export in bulk all files from a Dataset (or Experiment, etc.)
     [Figures: Downloaded BDBag file; Materialized BDBag]
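The verification idea can be illustrated with the BagIt layout that BDBag builds on: a bag carries a manifest file listing a checksum for every payload file, so a recipient can recompute the checksums and compare. The sketch below is not the DERIVA Bag Tool or the `bdbag` library, only a standard-library illustration. It assumes a fully materialized bag directory containing a `manifest-sha256.txt`, and it ignores remote files listed in `fetch.txt` and percent-encoded path characters. The function names are hypothetical.

```python
import hashlib
from pathlib import Path
from typing import List


def file_digest(path: Path, algorithm: str) -> str:
    """Hash a file in 1 MiB chunks so large assets do not need to fit in memory."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def failed_payload_files(bag_dir: str, algorithm: str = "sha256") -> List[str]:
    """Return manifest entries that are missing or whose checksum does not match.

    Each manifest line has the form "<checksum> <relative/path>".
    An empty result means every listed file is present and intact.
    """
    bag = Path(bag_dir)
    manifest = bag / f"manifest-{algorithm}.txt"
    failures = []
    for line in manifest.read_text(encoding="utf-8").splitlines():
        if not line.strip():
            continue
        expected, relative_path = line.split(maxsplit=1)
        relative_path = relative_path.strip()
        target = bag / relative_path
        if not target.is_file() or file_digest(target, algorithm) != expected.lower():
            failures.append(relative_path)
    return failures
```

Calling `failed_payload_files("path/to/bag")` on an extracted bag returns the list of problem files. For real exports, the validation built into the DERIVA tools remains the supported route.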

  6. Streamlined Process for Data Submission
     [Flowchart. Steps: Start, Submit Form, Review Form, Setup Project, Submit (Meta)Data, Quality Control, Notification, Released. Decision points (Y/N): Accepted?, Human?, Released?. Side branch: IRB Certification Process. Legend: User Activity, Hub Activity]
     Process Timeline (approximate)
       • T0: Form submitted
       • T+2 weeks: Review decision
       • T+3 weeks: Project setup
       • T+5 weeks: Submit data*
       • T+6 weeks: QC review
       * Based on user averages
     IRB Certification Process
       • Individual-level data is classified as human subjects data
       • Requires USC certification of your IRB decision
       • Tracks are not considered restricted data
       • Timeline varies

  7. Data Submission Resources
       • Starting point: https://www.facebase.org/submit/submitting-data/
       • FaceBase data curation Wiki: https://github.com/informatics-isi-edu/facebase-curation/wiki (detailed step-by-step documentation)
       • Need a tutorial? Contact the Hub (help@facebase.org) to set up a 1-hour walk-through conference call

  8. Requesting Human-Subject Protected Data
     Main Flow:
       1. Fill in the Data Access Request (DAR) and submit the required documentation
            • Documentation depends on the Data Use Limitations of the requested dataset
       2. Once the request is approved by the FaceBase Data Access Committee (DAC), we package and encrypt the data
       3. We direct the requester to download the dataset(s)
     The current process is being revised, and a new streamlined procedure will be in production within a few weeks.

  9. END
