udapi.block.ud.es package¶
Submodules¶
udapi.block.ud.es.addmwt module¶
Block ud.es.AddMwt for heuristic detection of Spanish contractions.
According to the UD guidelines, contractions such as “dele” = “de ele” should be annotated using multi-word tokens.
Note that this block should be used only for converting legacy conllu files. Ideally a tokenizer should have already split the MWTs.
-
class
udapi.block.ud.es.addmwt.
AddMwt
(verbpron=False, **kwargs)[source]¶ Bases:
udapi.block.ud.addmwt.AddMwt
Detect and mark MWTs (split them into words and add the words to the tree).