udapi.block.ud.joinasmwt module

Block ud.JoinAsMwt for creating multi-word tokens

if multiple neighboring words are not separated by a space and the boundaries between the word forms are alphabetical.

class udapi.block.ud.joinasmwt.JoinAsMwt(revert_orig_form=True, **kwargs)[source]

Bases: Block

Create MWTs if words are not separated by a space..

process_node(node)[source]

Process a UD node