The Czech National Corpus, hosted by Charles University in Prague, has recently unveiled a linguistic treasure – OnomOs. This diachronic corpus is the result of collaborative efforts led by researchers from the Department of Czech Language at the Faculty of Arts, University of Ostrava, spearheaded by Jaroslav David and his team.
For those eager to explore the intricacies of OnomOs, detailed information and resources can be found on the Czech National Corpus website: OnomOs – Czech National Corpus. Here, you’ll find insights into the construction process, the onomastic approach to proprial units and more. Thanks to the dedication of the team at the University of Ostrava, this diachronic corpus opens new avenues for exploring the evolution of proper names in the linguistic landscape. As researchers and language enthusiasts, let’s embark on a journey of discovery with OnomOs.
The OnomOs corpus is a linguistically processed database of texts from the periodicals Rudé právo (published 1920–1995) and Právo (1995–present). It always contains one issue from each decade in which (Rudé) Právo was published. The corpus includes texts in which the language component dominates; therefore, not included are, for example, advertisements, cinema, theatre and radio programmes, some types of texts from the sports section (e.g. scoreboards and player rosters), comics or crossword puzzles. The structure of the corpus is presented in more detail in Figure 1. In total, the corpus contains 255 149 tokens.
No comments:
Post a Comment