udapi.block.ud.fixmwtspace module

Block ud.FixMwtSpace looks for multiword tokens whose form contains a space, which should be avoided. If found, the block checks whether it can remove the multiword token seamlessly, that is, whether the syntactic words correspond to the space-delimited parts of the multiword token. If possible, the MWT line will be removed.

class udapi.block.ud.fixmwtspace.FixMwtSpace(zones='all', if_empty_tree='process', **kwargs)[source]

Bases: Block

Try to remove multiword tokens with spaces.

process_node(node)[source]

Process a UD node