Modeling Global Syntactic Variation in English Using Dialect Classification

[Read Full-Text] This paper evaluates global-scale dialect identification for 14 national varieties of English as a means for studying syntactic variation. The paper makes three main contributions: (i) introducing data-driven language mapping as a method for selecting the inventory of national varieties to include in the task; (ii) producing a large and dynamic set of … More Modeling Global Syntactic Variation in English Using Dialect Classification

Profile-Based Authorship Analysis

[Read Full-Text: Profile-based Authorship Analysis] [Data: https://drive.google.com/open?id=0B6oBPlj4dynZb1IzRnBnWDdPRXM%5D This article presents a profile-based authorship analysis method which first categorizes texts according to social and conceptual characteristics of their author (e.g. Sex and Political Ideology) and then combines these profiles for two authorship analysis tasks: (1) determining shared authorship of pairs of texts without a set of … More Profile-Based Authorship Analysis

Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs

[Read Full-Text: Finding variants for construction-based dialectometry] [Data: https://drive.google.com/open?id=1xcnZ8uNGZTTWbaK_FYCbdtzTWPlxH7xQ%5D [Code: https://github.com/jonathandunn/c2xg%5D This paper develops a construction-based dialectometry capable of identifying previously unknown constructions and measuring the degree to which a given construction is subject to regional variation. The central idea is to learn a grammar of constructions (a CxG) using construction grammar induction and then … More Finding Variants for Construction-Based Dialectometry: A Corpus-Based Approach to Regional CxGs