The corpus includes valuable material on contemporary Bulgarian an Bulgarian in diachrony.
Institute for Bulgarian Language Prof.L. Andreychin
Constantly enlarged. Consists of a Bulgarian (1.2 billion words and over 240,400 text samples) part and 47 parallel corpora. Reflects state of Bulgarian since 1945 until now.
As all corpuse, BNC includes sub-corpora on subject, author, year, spurce etc. Can be used as training corpora for terminology, idioms, grammar etc.
Resource includes 4 lexicographic tools: BulNet; The Bulgarian Semantically Annotated Corpus is part of the Bulgarian Brown Corpus. It consists of 95119 lexical units, annotated with the most appropriate synonymous set from the Bulgarian wordnet. https://dcl.bas.bg/semcor/; The Bulgarian PoS Annotated Corpus. https://dcl.bas.bg/poscor/
The Bulgarian National Corpus /BNC/ is the largest systematically created and representative corpus for the Bulgarian language. It contains more than 1.2 billion words and includes over 240,000 texts. The materials in the Corpus reflect the state of the Bulgarian language (mostly in its written form) from the middle of the 20th century until present day.
The BNC is a complex language resource used for both research and linguistic training purposes. It can also be used in learning Bulgarian as a foreign language, although it has not been specifically developed for this purpose.
Most widely used during classes with students studying Bulgarian is the search system of the corpus. Through it, one can:
- illustrate the use of a random word in context, with the examples generated being from texts differing in genre and style. The teacher can preselect parts of the results obtained in view of the topic and the specific objectives of the lesson. Students can use the search system on their own to work on tasks set by the teacher.
- obtain information on the meaning and grammatical characteristics of the word.
The BNC is easy and convenient to work with, as it is mainly used as an additional resource by both beginners and advanced students. The innovative character of the Corpus in learning Bulgarian as a foreign language is connected with the possibility to extract linguistic information at different levels simultaneously - semantic, grammatical, etc.
Capacity to match the needs of lecturers and students
The provided tangible improvements
The capacity to motivate students to improve their language skills
Effectiveness in introducing innovative, creative and previously unknown approaches to LSP learning
Measurement of the transferable potential and possibility to be a source of further capitalisation/application for other language projects in different countries
Skills assessment and validation
Availability of appropriate tools for lecturers to monitor students’ progress and for students to assess own progress and to reflect on learning
Flexibility of the contents and possibilities for the LSP lecturers to adapt the contents to their and to students’ need
Assess the technical usability from the point of view of the lecturer and the student
Assess the accessibility from the point of view of the lecturer and the student
BNC is a useful tool in teaching and learning (with initial training) all LSP skills, as it can be used in terminology, grammar and translation classes, in building reading and writing skills. Working with BNC (as with any LC) can bring added value by providing students with a means to check the forms, lexical and grammatical meanings of words, their behaviour in various context, syntagms and short texts.; this will help in extending the students' view on language learnt, beyond bilingual dictionaries. It can motivate the students by the examples from various specialized text in any field. While a constant practice with language students, working with dictionaries might be innovative in the, say, medical B class. It can be adapted to the needs of any learners. It is quite easy to use. Some of it is not accessible. Web page needs some additions to the English version.
Website of the Teaching Source: