Publicly Available Data Sources Overview
Explore a comprehensive overview of publicly available data sources including Agriculture, Business, Climate, Education, Healthcare, and more. Access diverse datasets in various formats like CSV, XML, PDF, and more from reputable sources such as US Data.gov, Kaggle, UCI Machine Learning Repository, and others. Discover valuable resources for research, analysis, and innovation in a wide range of fields.
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
Publicly Available Data Publicly Available Data Sources (Free) Sources (Free) Spring 2019 MIS 464 Sagar Samtani and Hsinchun Chen with updates from Hongyi Zhu
Publicly Available Data Sources Name of Data Source # Entries Description Data Formats URL Agriculture, Business, climate, consumer, ecosystem, education, energy, finance, health, local government manufacturing, public safety, science and research http://www.data.gov/ US Data.gov > 300,000 HTML, XML, XLSX, CSV, PDF, shapefile, txt, zip http://data.europa.eu /euodp/data/dataset EU OpenData > 15,000 Product, insurance, forum comments, twitter data, images https://www.kaggle.c om/datasets Kaggle 14,072 CSV, XLSX, SQL UC Irvine Machine Learning Repository Research datasets used in past machine learning publications HTML, XML, XLSX, CSV, PDF, txt, zip https://archive.ics.uci. edu/ml 468 Public transportation, satellite images, web pages, genome, ecosystem, etc. Data API (CSV, JSON) https://registry.opend ata.aws/ Amazon Opendata on AWS 90 Biology, engineering, healthcare, physics, math, science and research https://msropendata. com/ Microsoft Research Open Data 53 CSV, TXT, TSV, PDF 2
Publicly Available Data Sources Name of Data Source # Entries Description Data Formats URL Agriculture, Biology, Climate, Data Challenges, Economics, Education, Finance, Government, Healthcare, Machine Learning, NLP, Search Engines, Sports, Transportation Awesome Public Datasets (Github Repo) XLSX, JSON, XML, Zip, CSV, PDF https://github.com/aweso medata/awesome-public- datasets > 600 Data from: Various sciences (Astronomy, biological, environmental, information, etc.), engineering, commerce, management, tourism XLSX, Zip, XML, CSV, PDF Figshare > 50 https://figshare.com/ Data sets designed specifically for data mining tasks JSON, CSV, SQL, XLSX http://www.kdnuggets.co m/datasets/index.html KD Nuggets > 50 VisualData 247 Computer Vision datasets JPG, PNG, https://www.visualdata.io/ ML Vis 48 Repository of scientific datasets for visualization CSV http://www.mlvis.com/ Google Dataset Search Search engine for publicly available datasets https://toolbox.google.com/datasetsearch Enigma Search engine for publicly available datasets https://public.enigma.com/ 3
US Data.gov Metadata and Additional Info Dataset Search Introduction Data Download Browse by Category 4
Kaggle Other users projects using this dataset Metadata and Description Dataset Search Browse with Filters Data Demo and Explore Panel 5
UCI Repository Search and Browse Metadata and Description 6
Amazon OpenData Dataset Search User Project Examples with This Dataset Browsing 7