Tagging
Overview
This report covers four sequence labeling pipelines in Underthesea: Word Tokenization, POS Tagging, Chunking, and Dependency Parsing. These pipelines form the core syntactic analysis chain for Vietnamese text.
Vietnamese Text
→ Word Tokenization (CRF)
→ POS Tagging (CRF)
→ Chunking (CRF)
→ Dependency Parsing (Biaffine Neural Parser)