Files

LADECv1-2019.csv
LADECv1_2019_variables_labels.csv
LADECv1-2019_Stata_dataformat.dta
LADEC_analyses_2019_Stata_do-file.do
Usage
  • 227 views
  • 616 downloads

LaDEC: Large database of English compounds

  • Author(s) / Creator(s)
  • The Large Database of English Compounds (LADEC) consists of over 8000 English words that can be parsed into two constituents that are free morphemes. This file contains compounds formed from 3-10 letter long bases. All items are listed in Wordnet as Nouns. The database contains a number of linguistic and psycholinguistic variables including semantic transparency, family size, bigram frequency, sentiment (valence), and word frequency. The database is available in both Stata and CSV formats. Note: Some compounds can be divided in multiple ways (e.g., bakeshop -> bake/shop and bakes/hop). The correct parse is indicated in the variable "correctParse". When using this database please cite the accompanying journal article (full details available at the journal website). Gagné, CL., Spalding, TL., & Schmidtke, D. (2019). LADEC: Large database of English compounds. Behaviour Research Methods. This paper is Open Access and available at: https://link.springer.com/article/10.3758/s13428-019-01282-6

  • Date created
    2019-06-24
  • Subjects / Keywords
  • Type of Item
    Dataset
  • DOI
    https://doi.org/10.7939/r3-dyqx-9b36
  • License
    Attribution-NonCommercial 4.0 International
  • Language
  • Link to related item
    https://link.springer.com/article/10.3758/s13428-019-01282-6