udapi.block.ud.joinasmwt module

Block ud.JoinAsMwt for creating multi-word tokens (MWTs) when multiple neighboring words are not separated by a space and the characters at the boundaries between the word forms are alphabetic.
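The joining criterion described above can be illustrated with a small standalone sketch. It does not import udapi, and the function name `boundary_is_alphabetic` and its parameters are hypothetical, chosen only for illustration. It assumes that "alphabetical boundaries" means the last character of the first form and the first character of the next form are both letters.

```python
def boundary_is_alphabetic(form, next_form, no_space_after):
    """Sketch of the criterion: no space between the words, letters on both sides.

    form, next_form: surface forms of two adjacent words.
    no_space_after: True if the first word is not followed by a space.
    (All names here are illustrative assumptions, not udapi API.)
    """
    if not no_space_after:
        return False  # a space separates the words, so they are not joined
    if not form or not next_form:
        return False  # guard against empty forms before indexing
    # Both characters touching the boundary must be alphabetic.
    return form[-1].isalpha() and next_form[0].isalpha()


joined = boundary_is_alphabetic("can", "not", True)          # letters on both sides
punct = boundary_is_alphabetic("word", ",", True)            # comma is not a letter
spaced = boundary_is_alphabetic("two", "words", False)       # separated by a space
```

Under this reading, punctuation attached to a word is not merged into an MWT, because the character on one side of the boundary is not a letter.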

class udapi.block.ud.joinasmwt.JoinAsMwt(revert_orig_form=True, **kwargs)

Bases: Block

Create MWTs if words are not separated by a space.

create_mwt(mwt_nodes)
postprocess_mwt(mwt)
process_node(node)

Process a UD node.

should_join(node, next_node)
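As a rough illustration of how a pairwise test such as should_join(node, next_node) could drive MWT creation, the sketch below greedily collects runs of adjacent words that the test accepts. It is a standalone approximation, not the block's actual implementation: it does not use udapi, words are modeled as hypothetical `(form, no_space_after)` pairs, and `group_joinable` and `alphabetic_boundary` are invented names.

```python
def group_joinable(words, should_join):
    """Greedily collect maximal runs of adjacent words accepted by should_join.

    words: list of (form, no_space_after) pairs, a stand-in for UD nodes.
    should_join: callable taking two adjacent words and returning a bool.
    Returns a list of runs; each run is a list of forms with more than one word.
    """
    runs = []
    i = 0
    while i < len(words):
        run = [words[i]]
        # Extend the run while the next word may be joined to the current one.
        while i + 1 < len(words) and should_join(words[i], words[i + 1]):
            i += 1
            run.append(words[i])
        if len(run) > 1:
            runs.append([form for form, _ in run])
        i += 1
    return runs


def alphabetic_boundary(word, next_word):
    """Assumed criterion: no space in between and letters on both sides."""
    form, no_space_after = word
    next_form, _ = next_word
    return no_space_after and form[-1].isalpha() and next_form[0].isalpha()


sentence = [("I", False), ("can", True), ("not", False), ("go", True), (".", False)]
mwt_runs = group_joinable(sentence, alphabetic_boundary)
# Concatenating the forms in each run gives the surface form of the would-be MWT.
mwt_forms = ["".join(run) for run in mwt_runs]
```

Passing the test in as a callable mirrors the idea that a subclass could supply a different joining rule while keeping the grouping logic unchanged.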