DATA COLLECTION AND NORMATIVE PRINCIPLES IN CREATING A DIACHRONIC CORPUS
Keywords:
linguistic data, BNC, ICE, IBM, national corpora.Abstract
The corpus is a set of selected, sufficient linguistic texts (oral or written) that can be formed and classified on the basis of strict principles based on the pragmatic purpose of the corpus compiler, meet the criteria of representativeness, combine principles such as processing, tagging, and perfect computer automation, which is a database with a convenient quantity and consistency for systematic search, reference results, and empirical analysis.
References
Gries S. Introduction to S. Gries and A. Stefanowitsch (eds.).Corpora In Cognitive Linguistics: Corpus–Based Approaches To Syntax And Lexis, – New York: Mouton de Gruyter. 2006. – 1-18 рр.
Мамонтова В.В. Особенности перевода сложносоставньгх слов с английского языка на русский (на материале корпуса публицистических текстов)дис. канд. фил.наук. Ставрополь: 2008. – С 54.
Bobojon o’g’li, N. A. Corpus-Based Research On The Language Features Of Corpus Linguistics: In The Example Of ECOCL. Language, 3(2139), 950.
Ataboev, N. B. (2019). Problematic issues of corpus analysis and its shortcomings. ISJ Theoretical & Applied Science, 10(78), 170-173.
Ataboev, N. B. (2019). ICT in Linguistic Studies: Application of Electronic Language Corpus and Corpus-based Analysis. Test engineering and management, 81, 4170-4176.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 GEJournals
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.