Mapping Languages and Demographics

Dunn, J. and Adams, B. (2019). “Mapping Languages and Demographics with Georeferenced Corpora.” In Proceedings of Geocomputation 2019. Abstract. This paper evaluates large georeferenced corpora, taken from both web-crawled and social media sources, against ground-truth population and language-census datasets. The goal is to determine (i) which dataset best represents population demographics; (ii) in what parts …

Multi-Unit Directional Measures of Association

Dunn, J. (2018). “Multi-Unit Directional Measures of Association: Moving Beyond Pairs of Words.” International Journal of Corpus Linguistics, 23(2): 183-215. Abstract. This paper formulates and evaluates a series of multi-unit measures of directional association, building on the pairwise ΔP measure, that are able to quantify association in sequences of varying length and type of representation. …