Benedikt Szmrecsanyi
 
 
home
how to pronounce 'Szmrecsanyi'
research & data collection
publications
presentations
teaching & supervision
workshops
collaborators
Curriculum Vitae
press

Research & Data Collection
 
You can also obtain a reasonable approximation of my research activities on Google Scholar or Microsoft Academic Search.
 
Research interests
  • variationist & probabilistic linguistics
  • dialectology and dialectometry
  • varieties of English world-wide
  • historical linguistics, in particular: syntactic change
  • language/dialect typology

My research interests ultimately all boil down to variationist (socio)linguistics. I view linguistic variation as a window into the hidden structure of human language and the nature of linguistic knowledge. My PhD dissertation project was concerned with a primarily psycholinguistic phenomenon (persistence, a.k.a. priming) and its relevance for describing and understanding variation in interactional discourse. My current research interests lie with (i) large-scale dialectological and geolinguistic patterns of variation, (ii) the interface between linguistic typology and variation studies (key terms: analyticity, syntheticity, and linguistic complexity), and (iii) the probabilistic modeling of (morpho)syntactic variation (in synchrony and diachrony). I usually work quantitatively on the empirical basis of naturalistic corpus data, although I am open-minded about qualitative approaches and other data sources (e.g. survey material).

Current projects:

  • 2008-2013
    Corpus approaches to geolinguistic and genetic patterns of aggregate morphosyntactic variation in varieties of English
    Funded by the FRIAS (RAs: Christoph Wolk & Katharina Ehret)
    details

  • 2009-2013, with Joan Bresnan (Stanford University)
    Predicting syntax in space and time
    Funded by the FRIAS (RA: Christoph Wolk), in conjunction with NSF grant BCS-1025602 ("The development of syntactic alternations", PI: Joan Bresnan)
    details

  • with Lars Hinrichs, UT Austin
    Prescriptivism in Present-Day American English

  • with Marianne Hundt, University of Zurich
    Grammatical variation in Early New Zealand English

  • with Anette Rosenbach, Tanagra Wines
    Genitive and dative variability in Late Modern English

  • with Tom Ruette, Katholieke Universiteit Leuven
    A bottom-up approach to multilectal variation in the lexicon of written Standard English

  • with Jeff Siegel, University of New England
    Measuring analyticity and syntheticity in creoles

  • with Bernhard Wälchli, Stockholm University
    Aggregating dialectology and typology
     

 

Data collection
FRED and FRED-S
The Freiburg Corpus of English Dialects Sampler (FRED-S) is a 1-million word corpus sampling a subset of those texts in the 2.5-million word Freiburg Corpus of English Dialects (FRED) that are not subject to copyright restrictions. FRED-S is  publicly accessible (please contact me if you're interested). Read up about FRED-S here and here.

We are also currently creating a POS-tagged version of FRED-S; an online interface to access this corpus is available at www.meertens.nl/edisyn/searchengine.

The morphosyntactic survey coming with the Handbook of Varieties of English
I'm one of the principal designers of the morphosyntactic survey of the multimedia reference tool (available online here) accompanying the Handbook of Varieties of English (Kortmann et al. 2004). This first-ever comprehensive survey of non-standard English morphosyntax covers 46 native varieties of English, indigenized L2 varieties of English, and English-based pidgin and creole languages with regard to the presence or absence of 76 non-standard morphosyntax features. See Kortmann & Szmrecsanyi (2004) for a discussion of the survey procedure, and Szmrecsanyi & Kortmann (2009) for an exemplary quantitative analysis.