AI RESEARCH

A Multi-head-based architecture for effective morphological tagging in Russian with open dictionary

arXiv cs.CL

arXiv:2604.02926v1. The article proposes a new architecture based on multi-head attention for morphological tagging of Russian. Word vectors are preprocessed by splitting each word into subtokens and then applying a trained procedure that aggregates the subtoken vectors into token vectors. This makes it possible to use an open dictionary and to analyze morphological features while taking parts of words (prefixes, endings, etc.) into account. The open dictionary will make it possible in the future to analyze words that are absent in the.
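
The subtoken step described above can be illustrated with a minimal sketch. The abstract does not specify the splitting or aggregation procedure, so everything here is an assumption made for illustration: the greedy longest-match splitter, the toy vocabulary, the function names (`split_into_subtokens`, `attention_pool`, `multi_head_pool`), and the per-head query vectors, which stand in for parameters that would be learned during training.

```python
import math


def split_into_subtokens(word, vocab):
    """Greedy longest-match split (hypothetical); unknown spans fall back to characters."""
    pieces, i = [], 0
    while i < len(word):
        for j in range(len(word), i, -1):
            # Accept the longest known piece, or a single character as a last resort.
            if word[i:j] in vocab or j == i + 1:
                pieces.append(word[i:j])
                i = j
                break
    return pieces


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(vectors, query):
    """One attention head: weight subtoken vectors by scaled dot product with a query."""
    dim = len(query)
    scores = [sum(q * v for q, v in zip(query, vec)) / math.sqrt(dim) for vec in vectors]
    weights = softmax(scores)
    return [sum(w * vec[k] for w, vec in zip(weights, vectors)) for k in range(dim)]


def multi_head_pool(vectors, queries):
    """Each head pools its own slice of the subtoken vectors; results are concatenated."""
    head_dim = len(queries[0])
    pooled = []
    for h, query in enumerate(queries):
        lo = h * head_dim
        sliced = [vec[lo:lo + head_dim] for vec in vectors]
        pooled.extend(attention_pool(sliced, query))
    return pooled


# Toy vocabulary of word parts (assumed, not taken from the article).
vocab = {"пере", "чит", "ыва", "ть"}
pieces = split_into_subtokens("перечитывать", vocab)

# Placeholder 4-dimensional subtoken vectors and two 2-dimensional head queries.
subtoken_vectors = [[1.0, 0.0, 2.0, 0.0], [0.0, 1.0, 0.0, 2.0]]
token_vector = multi_head_pool(subtoken_vectors, [[0.0, 0.0], [0.0, 0.0]])
```

Because any word can be broken down to known pieces or, failing that, single characters, a word never seen before still receives a token vector, which is the sense in which the dictionary is open. In a trained model the query vectors would not be zero, so different heads could attend to different word parts such as prefixes or endings.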