Data sets tagged with "metadata"

The Whitburn Project: 120 Years of Music Chart History

For the last ten years, obsessive record collectors on Usenet have been working on the Whitburn Project, a huge undertaking to preserve and share high-quality recordings of every popular song since the 1890s. To assist their efforts, they’ve created a spreadsheet of 37,000 songs and 112 columns of raw data, including each song’s duration, beats-per-minute, ...
Offsite

MusicBrainz

MusicBrainz is a user-maintained open music community that collects, and makes available to the public, music metadata, including information about artists, release groups, releases, tracks, labels and the many relationships between them. The database also contains a full history of all the changes that the MusicBrainz community has made to the music metadata. The music ...
Offsite

Document Metadata Based on a Sample of Web Documents from the Open Directory

DMOZ100k06 is a large research data set about document metadata based on a random sample of 100,000 web documents from the Open Directory combined with data retrieved from the social bookmarking service delicious.com, the content rating system ICRA, and the search engine Google. The data set is freely available for other research. Michael G. Noll
Offsite

Amsterdam Museum Data Set (RDF)

The Amsterdam Museum dataset contains the museum’s descriptions of more than 70,000 cultural heritage objects related to the city of Amsterdam. The metadata was retrieved from an XML Web API of the museum’s Adlib collection database and converted to RDF compliant with the Europeana Data Model (EDM). This makes the Amsterdam Museum data the first of its kind to be ...
Offsite

MIDAS - Heritage project

From the website: What is MIDAS? MIDAS sets out an agreed list of the items or ‘units’ of information that should be included in an inventory or other systematic record of the historic environment. These units of information are grouped together under broad headings or ‘information schemes’. These cover areas such as Monument Character, Events, People and ...
Offsite

Open Media Database

About: “omdb (open media database) is a free database for film media. There is no set editorial staff, but rather a large number of movie addicts and lovers who volunteer their time to provide material and develop the site. Anybody can add or change existing information on omdb once they have done the quick and simple task of signing up for their user login name. ...
Offsite

Biblios.net - the world's largest database of freely-licensed library records

About: The beta test environment for LibLime’s new cataloging service, ‡biblios.net, is now available! ‡biblios.net is a subscription-based, hosted version of the open-source ‡biblios metadata editor that we released earlier this year. In addition to the editor, ‡biblios.net includes some extended community features such as integrated real-time chat, ...
Offsite

Discogs Release

This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Discogs Release. Wikipedia describes Discogs, short for discographies, as a website and database of information about audio recordings, including commercial releases, promotional releases, and bootleg or off-label releases. The Discogs servers, currently hosted under the domain name ...
Free

Audioscrobbler Data

Description: “Much of the data available to view on Last.fm is available in several formats through the Audioscrobbler Web Services API.” Format: Data variously available in Plain, XML, XSPF, iCal and RSS. License: “All web services here are for non-commercial use only under the Creative Commons Attribution-NonCommercial-ShareAlike License. If you want to use ...
Offsite

Lyricsfly Lyrics REST API

An Application Programming Interface (API) is available to anyone who wishes to use our database for their own music project, website or program. If you currently use the web to search out lyrics, or use code tricks to access other lyrics websites to display relevant lyrics text for your content, you can now have a reliable source without the hassle. Example code for PHP: ...
Offsite

Airborne Antarctic Ozone Experiment (AAOE-87)

This data is from the Airborne Antarctic Ozone Experiment (AAOE) which was based in Punta Arenas, Chile during August and September 1987. The data was primarily collected onboard the NASA ER-2 and DC-8 aircraft, along with ozonesonde data collected at four Antarctic stations: Halley Bay, McMurdo, Palmer Station, and the South Pole. The experiment tested the chemical and ...
Offsite

International Music Database Project (IMDBP)

About: IMDBP strives to categorize every single piece of music ever written in a format that is: 1. Flexible, extensible; 2. Thorough, uncompromising detail; 3. Efficient and intuitive to use for the average user, including the elimination of duplicate information entry and other potential inconsistencies.
Offsite

Wikipedia XML Data

This data set contains a complete copy of all Wikimedia wikis, in the form of wikitext source and metadata embedded in XML as provided by the Wikimedia Foundation. The data set will be updated every month and the 3 previous months will always be available for use. We will list previous snapshots in the text of this description.
Offsite

DBpedia

DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.6 million things, including at least 213,000 persons, 328,000 places, 57,000 music albums, 36,000 films, and 20,000 companies. The knowledge base consists of 274 million pieces of ...
Offsite

Wikipedia Extraction (WEX)

The Freebase Wikipedia Extraction (WEX) is a processed dump of the English language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted in tabular form. Freebase WEX is provided as a set of database tables in TSV format ...
Offsite

Infochimps Site Metadata Dump

A full dump of all the metadata in the Infochimps repository. Includes complete information on collections, datasets, sources, licenses, tags, and fields.
Free

OpenCalais API

The OpenCalais Web Service automatically creates rich semantic metadata for the content you submit – in well under a second. Using natural language processing (NLP), machine learning and other methods, Calais analyzes your document and finds the entities within it. But, Calais goes well beyond classic entity identification and returns the facts and events hidden within ...
Offsite

OpenDover API

OpenDover is the leading webservice that lets you tag your documents based on sentiments and emotions found in your documents. The OpenDover API can handle different ways of sentiment tagging, depending on what your needs are, or what the content is that you provide via the API. The OpenDover knowledge base consists of thousands of opinion words, domain-related words and ...
Offsite
