O‘ZBEK TILI MULTIKORPUSI – ZAMONAVIY TA’LIMIY TIZIM

Authors

  • Manzura Abjalova

Keywords:

Uzbek language, multicorpus, corpus linguistics, uzcorpora.uz, NLP, morphological analysis, education system

Abstract

The necessity of natural language processing is relevant for the development of every language. It is the adaptation of the Uzbek language to the language of digital technologies and the processing of its resources that plays an important role in increasing the scope of use of the Uzbek language and its prestige among world languages. Naturally, there are texts in the Uzbek language related to each field. Therefore, combining them into a single platform leads to the creation of a large and networked database. For this purpose, a multicorpus of the Uzbek language was created, which includes 20 types of language corpora. This article analyzes the multicorpus uzcorpora.uz (http://uzcorpora.uz/), its structure, linguistic capabilities, and the scientific basis for its application in the modern education system. The article also highlights the features of statistical analysis of the corpus and shows their practical significance in the fields of linguistics, lexicography, education, and natural language processing (NLP). The role of the multicorpus in the development of the digital infrastructure of the Uzbek language has been revealed.

Downloads

Published

2025-11-10