AI RESEARCH

Can I guess where you are from? Modeling dialectal morphosyntactic similarities in Brazilian Portuguese

arXiv CS.CL

ArXi:2603.20695v1 Announce Type: new This paper investigates morphosyntactic covariation in Brazilian Portuguese (BP) to assess whether dialectal origin can be inferred from the combined behavior of linguistic variables. Focusing on four grammatical phenomena related to pronouns, correlation and clustering methods are applied to model covariation and dialectal distribution. The results indicate that correlation captures only limited pairwise associations, whereas clustering reveals speaker groupings that reflect regional dialectal patterns.