Research software

  • PARSEME utility scripts
    • Description Preprocessing and annotation tools for the PARSEME shared task on the identification of multiword expressions.
    • My role Creator and main developer.
    • Links git repository.
  • MWEtoolkit
  • minimantics
    • Description A fast implementation of a classical model of distributional semantics.
    • My role Maintainer, along with Carlos Ramisch.
    • Links git repository.

Research datasets

  • Multilingual compositionality dataset
    • Description A multilingual dataset of nominal compounds with human judgments of compositionality.
    • My role Creator, along with the other authors of the associated ACL 2016 paper.
    • Links dataset webpage.
  • Portuguese lexical substitutes dataset
    • Description A dataset of Portuguese nominal compounds and human-annotated lexical substitutes.
    • My role Creator, along with the other authors of the associated IWCS 2017 paper.
    • Links dataset webpage.
  • PARSEME 1.0 corpora
    • Description A multilingual corpus of human-annotated occurrences of multiword expressions.
    • My role Portuguese-language annotator and language leader.
    • Links dataset webpage.