
- Hadoop Illuminated: Publicly Available Big Data Sets – http://hadoopilluminated.com/hadoop_illuminated/Public_Bigdata_Sets.html
- Amazon Public Datasets, collection of 50+ large datasets – http://aws.amazon.com/datasets?_encoding=UTF8&jiveRedirect=1
- Canada Data Center – http://data.gc.ca/eng
- UK Open Government – 17,000 data series – http://data.gov.uk/data/search
- World Bank datasets – http://econ.worldbank.org/WBSITE/EXTERNAL/EXTDEC/EXTRESEARCH/0,,contentMDK:20388241~menuPK:665266~pagePK:64165401~piPK:64165026~theSitePK:469382,00.html
- Timetric – https://timetric.com/data-platform/
- Freebase – A community-curated database of well-known people, places, and things – http://www.freebase.com/
- US Open Data – 85,000+ datasets – http://www.data.gov/
- San Francisco Data (businesses, crime, case data sets) – https://data.sfgov.org/
- Seattle Data (businesses, crime, case data sets) – https://data.seattle.gov/
- Chicago Data (businesses, crime, case data sets) – https://data.cityofchicago.org/
- Austin, Texas Data (businesses, crime, case data sets) – https://data.austintexas.gov/
- US Government Web Services and XML Data Sources – http://usgovxml.com/
- Quandl – find numerical datasets in a database of 8,000,000 sets – http://www.quandl.com/
- US National Archives – Online Databases – http://www.archives.gov/research/alic/tools/online-databases.html
- Windows Azure Marketplace for data sources – https://datamarket.azure.com/