Data sets tagged with "frequency"
Adult Attendance at Sports Events by Frequency: 2006
The Statistical Abstract files are distributed by the US Census Department as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to their rows and columns. A few files had extraneous characters in the title. These were corrected to be consistent. A few files ...
Free
Adult Participation in Selected Leisure Activities, by Frequency: 2006
The Statistical Abstract files are distributed by the US Census Department as Microsoft Excel files. These files have data mixed with notes and references, multiple tables per sheet, and, worst of all, the table headers are not easily matched to their rows and columns. A few files had extraneous characters in the title. These were corrected to be consistent. A few files ...
Free
Given Name Frequency Project
Quite a bit of data is available for download but only individually (not in a single file). According to web page have have: > * GINAP – code to standardize given names and correct common problems in name samples. Such standardization is an important step in analysis of given names. > * Popular given names, US 1801 to 1999 – a collection of sets of standardized ...
Offsite
Word List - 1000 Most Frequent Words from an Internet Corpus
This file consists of the 1,000 most frequently used English words as used on the Internet computer network in 1992.
Free
Word List - 1,000 Most Frequently Used English Words by Frequency (with Definitions, Excel format)
This file consists of the 1,000 most frequently used English words from a wide variety of common texts listed in decreasing order of frequency
$4.00
Frequency of Sex versus Satisfaction Levels (DUREX 2006 Survey)
2006 Survey Sexual frequency and satisfaction
Via Sean Banks, you can view a spreadsheet version here
Offsite
Word Frequencies in Written & Spoken English from British National Corpus (100M-word)
by Geoffrey Leech, Paul Rayson, Andrew Wilson Overview Download word lists Books of English word frequencies have in the past suffered from severe limitations of sample size and breadth. They have also tended to be restricted to word forms alone. Most importantly, almost all have dealt only with written language. This book overcomes these limitations. It is derived from ...
Offsite
Word List - 1,000+ Most Frequent words in King James Bible
1,185 King James Version frequent substrings (KJVfreq.txt) The most frequently occurring 1,185 substrings in the King James Version Bible ranked and counted by order of frequency.
Free
Letter frequency - Substring frequency in an Amy Tan Novel
467 current fiction substrings (fiction.txt) The most frequently occurring 467 character sequences (n-grams) occurring in a best-selling novel by Amy Tan in 1990.
Free
1990 Census Name Files
Three separate datasets obtained from the 1990 cense. One set includes last names, one has first male names, and one has first female names. They contain the following data: the name, frequency in percent, cumulative frequency in percent, and rank.
Offsite
German Female Forenames
List of German Female Forenames
10.000 names ordered by commonness
See Also: List of 30,000 German Male Fornames ordered by commonness
$100.00
Google Books Ngrams
Description Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links will directly download a fragment of the given corpus. For ...
Offsite
Google Labs - Books Ngram Viewer
Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the links below will directly download a fragment of the given corpus. For instance, ...
Offsite
