Modeling the Complexity and Descriptive Adequacy of Construction Grammars

Dunn, J. (2018). “Modeling the Complexity and Descriptive Adequacy of Construction Grammars.” In Proceedings of the Society for Computation in Linguistics (SCiL 2018). Stroudsburg, PA: Association for Computational Linguistics. 81-90.

Abstract. This paper uses the Minimum Description Length paradigm to model the complexity of CxGs (operationalized as the encoding size of a grammar) alongside their descriptive adequacy (operationalized as the encoding size of a corpus given a grammar). These two quantities are combined to measure the quality of potential CxGs against unannotated corpora, supporting discovery-device CxGs for English, Spanish, French, German, and Italian. The results show (i) that these grammars provide significant generalizations as measured using compression and (ii) that more complex CxGs with access to multiple levels of representation provide greater generalizations than single-representation CxGs.

[Read Full-Text: Modeling construction grammars]

[Code: https://github.com/jonathandunn/c2xg]