Datamob

Description:

Datamob aims to show, in a very simple way, how public data sources are being used.

Their listings emphasize the connection between data posted by governments and public institutions and the interfaces people are building to explore that data.

Created almost 2 years ago by Infochimps

Updated almost 2 years ago

AMEE API

Query the AMEE (“Avoiding Mass Extinctions Engine”) database of CO2 data and measure the carbon footprint of anything.
Offsite

Amazon ISBN Similarity Graph

Output of a crawl of Amazon.com’s item similarity API from January 18, 2008 for ISBNs (International Standard Book Numbers). ASCII text and XML. By Aaron Swartz.
Offsite

GOP.gov API

The GOP.gov API provides biographical, contact, and voting information about Republican members of Congress; contact information and member listings for House of Representatives committees; and information on bills and votes in XML. Nearly RESTful; requires an API key which is tied to a GOP Portfolio account.
Offsite

BART API

Developer-friendly real-time estimated time of arrival (ETA) feeds, transit schedules, advisory feeds and trip planning information for Bay Area Rapid Transit (BART), serving the San Francisco Bay Area in California. RESTful API returning results in XML. Licensed under a simple License Agreement with no usage limitations.
Offsite

Baseball Databank

Free historical baseball data, downloadable in comma-delimited text and SQL.
Offsite

BBC Backstage: Feeds & APIs

BBC TV and radio data, programme catalogue information, search API and more. Available for non-commercial use with attribution.
Offsite

Big Huge Thesaurus API

Query a database of 145,000 English language words for synonyms. Returns data in JSON, XML, serialized PHP array or plain text formats. Based on data from the Princeton University WordNet database and the Carnegie Mellon Pronouncing Dictionary. By John Watson.
Offsite

BookMooch API

Query or download the database for BookMooch, a book exchange community. ASCII text and XML formats. Creative Commons Attribution-Noncommercial-Share Alike 3.0 License.
Offsite

Brooklyn Museum API

RESTful API for programmatically searching the Brooklyn Museum’s digitized collection of more than 10,000 individual works. Free for non-commercial use with a limit of 3,000 API calls a day. Review the Brooklyn Museum API Terms of Use.
Offsite

CalorieKing API

The CalorieKing API provides nutritional information from the CalorieKing food database via SOAP or REST interfaces. Free up to 20,000 queries a month.
Offsite

Capitol Words API

The Capitol Words API provides several methods of accessing detailed information from the Capitol Words database of word frequency from the U.S. Congressional Record. Returns results in JSON and XML.
Offsite

Census 2000 Planning Database (for Census 2010)

The Tract Level Planning Database With Census 2000 Data is a database that assembles a range of housing, demographic and socioeconomic variables that are correlated with mail nonresponse. Using data from U.S. Census 2000, a database containing these variables has been developed for all census tracts in the country. The variables included in the Tract Level Planning ...
Offsite

U.S. Census Data

Variety of tools for accessing U.S. Census data, including direct file access.
Offsite

U.S. Census Bureau TIGER/Line Data

Data files for the U.S. Census Bureau’s experiment in web-based mapping: the Topologically Integrated Geographic Encoding and Referencing (TIGER) system. Shapefile format.
Offsite

Center for Economic and Policy Research Data

ceprDATA.org provides consistent, user-friendly versions of the Survey of Income and Program Participation (SIPP), Current Population Survey (CPS), and other datasets used at CEPR.
Offsite

Chronicling America API

Open programmatic access to information about historic American newspapers (1690-present) and select digitized newspaper pages (1880-1922). Returns results in Atom or Linked Data via RDF. Searching newspaper pages is also possible via OpenSearch. No API key or registration needed. Sponsored jointly by the National Endowment for the Humanities and the Library of Congress ...
Offsite

CIA World Factbook

Public-domain information about the countries of the world from the U.S. government, downloadable as a compressed Zip of HTML files.
Offsite

CiteSeer.IST

Scientific literature digital library and search engine with fully downloadable records.
Offsite

CiteULike Dataset

Data on usage of the CiteULike service for organizing academic research papers.
Offsite

New York City Baby Name Data

Two CSV files available for download: all New York City baby names dating back to 1920, and New York City baby names broken down by ethnicity, dating back to 1990. Data supplied by the New York City Department of Health and Mental Hygiene, compiled by Jennifer 8. Lee for the New York Times City Room Blog.
Offsite

Civic Footprint API

Look up the political geography of any address in Cook County, Illinois.
Offsite

The New York Times Congress API

Get biographical information on Congresspeople dating back to 1947 and voting records dating back to 1989 in JSON and XML. Based on information from THOMAS, senate.gov, and house.gov. Read the announcement on Open for more information. See also the New York Times Congress API Ruby Wrapper with Congresh Shell.
Offsite

New York Times Congress API Ruby Wrapper with Cong

An easy Ruby wrapper for the New York Times Congress API. Also provides a command shell called Congresh for interacting with the API directly. Available for download under an MIT License. Submitted by: Patrick Ewing
Offsite

CorpWatch API

The CorpWatch API uses automated parsers to extract the subsidiary relationship information from Exhibit 21 of companies’ 10-K filings with the Securities and Exchange Commission, providing a free, well-structured interface for programs to query and process the data. Although the SEC provides a search interface for locating company filings (EDGAR / IDEA), the ...
Offsite

CrunchBase API

Information on early-stage technology companies, including acquisitions and funding rounds. Available in JSON. Licensing is still being finalized but will be Creative Commons Attribution or similar.
Offsite

Data.gov: Airline On-Time Performance

Data derived from the U.S. Bureau of Transportation Statistics containing on-time arrival data for non-stop domestic flights by major air carriers. Also contains departure and arrival delays, origin and destination airports, flight numbers, scheduled and actual departure and arrival times, cancelled or diverted flights, taxi-out and taxi-in times, air time and non-stop ...
Offsite

Doing Business: Full Data

Data on business regulations and their enforcement across 178 countries and selected cities. A project of the World Bank. Tables downloadable in Microsoft Excel format. See also: World Bank API and World Bank Data & Statistics.
Offsite

DBpedia Dataset

A large multi-domain ontology derived from Wikipedia. GNU Free Documentation License. N3 and CSV formats.
Offsite

Washington, D.C. Citywide Data Warehouse

The Washington, D.C. Citywide Data Warehouse, also known as the DC Data Catalog, is a comprehensive collection of government activity data from the District of Columbia.
Offsite

Dolores Labs' Color Name Dataset

10,000 color/label pairs, based on data collected through Amazon’s Mechanical Turk crowdsourcing marketplace. By Brendan O’Connor.
Offsite

DIMES Project Data

Data from the DIMES Project, a distributed scientific research project aiming to study and map the structure and topology of the internet. CSV format.
Offsite

Discogs API

RESTful API for the Discogs community-built database of music information. Artist, label and release data is made available through a public domain license.
Offsite

New York City Sign Permit Actions

Applications for sign permits in New York City. Published by the New York City Department of Buildings in Microsoft Excel format.
Offsite

Elev.at

Web API that converts legacy or proprietary files, such as XLS, into XML in real time so that they can be consumed by your web, mobile and desktop apps. Created out of the need to consume data from NYC Data Mine, Data.gov and DataSF, which often contain datasets that are updated regularly but published in proprietary formats. Check out some of the government datasets that are known to ...
Offsite

Enron Email Mailbox PST Dataset

This refined version of the CALO Enron Email Dataset is available as 148 PST files, complete with original folder structure, to preserve user information associated with the emails. Licensed as Creative Commons Attribution 3.0. Submitted by: John Wang
Offsite
