Publications

Schacht, Carmen; Landwehr, Isabell

CoBra: A Compound Branching Resource for Nominal Triconstituent Compounds in English and German

Proceedings of the Ninth Workshop on Universal Dependencies (UDW, LREC 2026), pp. 128-141, Palma de Mallorca, Spain, 2026.

We present CoBra, a resource containing triconstituent nominal compounds in English and German. This addresses an understudied aspect of compound processing, since research and resources in psycholinguistics and NLP have mostly focused on two-constituent compounds. In addition, our resource covers both general and scientific language, allowing for a register-informed perspective on compounds. It provides syntactic and semantic annotation of compound structure, in particular of the branching direction (i.e. the internal embedding structure, the Compound Branching) and the semantic relationship between constituents. Annotations are implemented using extensions of Universal Dependencies (UD) labels. To explore applications of our new resource, we also conduct a pilot study investigating the relationship between semantic transparency and branching direction. Our results indicate that there is indeed a correlation. Overall, our resource contributes to gaining a more detailed understanding of the structure and processing of morphologically complex words within the UD framework.

Back

Successfully