Language: Czech / Topic: corpora - Searchworks@Jio Institute Digital Library Search Results

1. Jazykové korpusy a možnosti jejich využití při interpretaci práva. [Language corpora and the possibilities of their use in the interpretation of law.]

Author: Glogar, Ondřej
Subjects: CORPORA, CZECH language, TEXTUALISM (Legal interpretation), JUDGE-made law, LEGAL language
Abstract: Copyright of Pravnik is the property of Czech Academy of Sciences, Institute of State & Law and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Published: 2024

2. Here’s hoping v procesu gramatikalizace. [Here’s hoping in the process of grammaticalization.]

Author: Malá, Markéta and Nádraská, Zuzana
Subjects: DEIXIS (Linguistics), CONSTRUCTION grammar, GRAMMATICALIZATION, DATA analysis, CORPORA
Abstract: The aim of the paper is to examine the functional and formal features of the construction here’s hoping. Our analysis is informed by the frameworks of construction grammar (Fried, 2010, 2013), grammaticalization (Fried, 2009; Himmelmann, 2004), subjectification (Company, 2006) and impersonalization (Siewierska, 2008). The construction seems to have developed a novel subjectified function, i.e. to express the speaker’s positive expectation while retaining a degree of tentativeness and distance. The data for the analysis was excerpted from the corpus English Web 2021 (enTenTen21). The internal and external features of the construction (e.g. the (non-)deictic function of here, fixedness, syntactic isolation, initial position) together with the overall expansion of the semantic and pragmatic context and the gradual host-class expansion suggest that the process of grammaticalization/subjectification is currently under way (cf. Fried, 2010; Company, 2006). [ABSTRACT FROM AUTHOR]
Published: 2024

3. Korpus OnomOs: principy a příklady aplikací. [The OnomOs corpus: principles and examples of applications.]

Author: Místecký, Michal, David, Jaroslav, Glogarová, Jana Davidová, and Klemensová, Tereza
Subjects: CZECH language, WESTERN countries, PERSONAL names, SUPPURATION, CORPORA
Abstract: The study introduces OnomOs, a new corpus of Czech texts with annotation of proper names. The corpus was compiled by onomasticians from the Department of Czech Language, Faculty of Arts, University of Ostrava, and made available by the Institute of the Czech National Corpus, Faculty of Arts, Charles University in Prague. The paper briefly discusses the content and structure of the corpus, the selection of texts for inclusion, and the onomastic-geographical classification of the identified names. The text consists chiefly of three preparatory analyses, which focus on the most frequent surnames, collocations found in Western and Eastern countries in the pre-1989 period, and the declension patterns of three types of onyms. In the summary, further possibilities of onomastic corpus research are presented. [ABSTRACT FROM AUTHOR]
Published: 2024

4. Korpus DIA1900: jeho koncepce a vytváření. [The DIA1900 corpus: its design and construction.]

Author: Benešová, Lucie, Kučera, Karel, Najbrtová, Kateřina, Pivoňková, Klára, and Stluka, Martin
Subjects: ENCYCLOPEDIAS & dictionaries, NINETEENTH century, CORPORA, ORTHOGRAPHY & spelling, ANNOTATIONS
Abstract: The objective of the paper is to describe the principles for building the one-million-word DIA1900 Corpus consisting of Czech texts published between 1851 and 1900, designed to be both balanced and representative. There are two main goals determining the methods of corpus building and the decision to develop new tools tailored to the special needs of 19th century Czech: 1) to present the variability of Czech in the 2nd half of the 19th century (including spelling, morphology, word-formation) and 2) to link the detected variants to the appropriate lemmas. The paper presents the phases of the processing of the texts, including transcription, manual pre-annotation, as well as the construction of a large morphological dictionary and the selection of a suitable set of paradigms. Further sections are focused on annotation and morphological tagging and manual disambiguation. The objective was to create a gold standard, intended for use in the automatic annotation both of the DIA1900 corpus and the planned corpus of Czech texts of the years 1800-1850. [ABSTRACT FROM AUTHOR]
Published: 2023

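5. Meryl Streepová a Emma Stone: přechylování cizích příjmení v češtině prizmatem korpusového výzkumu. [Meryl Streepová and Emma Stone: the feminization of foreign surnames in Czech through the prism of corpus research.]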

Author: DAVID, Jaroslav and MÍSTECKÝ, Michal
Subjects: ACTING awards, PERSONAL names, ORTHOGRAPHY & spelling, CORPORA, ACADEMY Awards, ACTRESSES
Abstract: Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu.
Published: 2023

6. Funkce a slovosled partikulí zajisté a pak ve staré a střední češtině. [The functions and word order of the particles zajisté and pak in Old and Middle Czech.]

Author: JEŽOVÁ, Martina
Subjects: WORD order (Grammar), CORPORA, TERMS & phrases, SYNTAX (Grammar)
Abstract: Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu.
Published: 2023

7. Hodnoty slovesných morfologických kategorií v korpusu SYN2020 -- atribut verbtag. [Values of verbal morphological categories in the SYN2020 corpus: the verbtag attribute.]

Author: Jelínek, Tomáš, Petkevič, Vladimír, and Skoumalová, Hana
Subjects: VERBS, MORPHOSYNTAX, DISEASE susceptibility, CORPORA, DETECTIVES, GENE ontology
Abstract: The paper describes the verbtag attribute, which allows a user to search, in the SYN2020 corpus (and also subsequent corpora, SYNv9 and SYNv10) of contemporary Czech, for all values of morphological categories of verbs, i.e., not only those contained in the tag attribute, but also those related mainly to multi-word participial verb predicates, which are prevalent in Czech. The verbtag attribute contains information indicating whether the verb (co-)forming the verbal meaning is auxiliary or autosemantic, as well as information about the verb mood, diathesis, person, number and tense. The annotation applies both to verb predicates expressed in a single word (e.g., the 1st person indicative present tense: Čtu rád detektivní příběhy. 'I like to read detective stories.') and (especially) to verb predicates expressed in multiple words (e.g., the present conditional of the 1st person singular: Pak bych mu s chutí nabídla výhodnou smlouvu. 'Then I would gladly offer him a good deal.'). The authors introduce the motivation and the concept of the verbtag annotation, describe relevant morphological categories and their values in detail, and show, via examples, how various multi-word structures expressing verbal meaning are annotated in the verbtag attribute. They also offer users a guide to the whole issue of verbal morphosyntax manifested in the verbtag attribute and possibilities for efficient search for and retrieval of morphological/morphosyntactic data. The paper shows which multi-word verb complexes are simple in terms of annotation, but also identifies more complex cases (e.g., coordination of participles) which are not easy to annotate automatically, and/or whose annotation is unclear in terms of an adequate theoretical approach. The authors also present the method used for annotating multi-word verbal complexes and its current success rate. [ABSTRACT FROM AUTHOR]
Published: 2022

8. Století cest na Slovensko: na okraj jednoho česko-slovenského dialogu. [A century of journeys to Slovakia: notes on a Czech-Slovak dialogue.]

Author: Pátková, Jana
Subjects: POSTCOLONIALISM, STEREOTYPES, MYTH, CORPORA, CLASSIFICATION, MUTUALISM
Abstract: This paper focuses on the literary representation of Slovakia in selected travelogues by Czech authors. The subject of the research is that of cultural stereotypes, images of the other, and constructions of us and them. The corpus of travelogues covers the period from the 1830s to the end of the 1930s. The methodological framework for analysing travelogues includes several diverse approaches. In addition to literary-history classification, the study employs an imagological approach while also taking into account an approach based on postcolonial theories in the context of the interpretation of cultural stereotypes. The material is divided into four historical periods with regard to the form and changing face of Czech-Slovak dialogue. Through the travelogue material under review we can analyse how the image of Slovakia within the Czech cultural myth of Slovakia has been shaped and transformed over the course of a century. [ABSTRACT FROM AUTHOR]
Published: 2023

9. ZPRACOVÁNÍ IDENTICKÉ KAUZY V RŮZNÝCH ON-LINE DENÍCÍCH. [Coverage of the same case in different online dailies.]

Author: SVOBODOVÁ, JINDŘIŠKA and FALTÝNEK, DAN
Subjects: ELECTRONIC newspapers, PRIME ministers, POISONING, CORPORA, SPHERES, ENVIRONMENTAL disasters
Abstract: The paper deals with the coverage of the poisoning of the river Bečva on the news servers iDnes.cz and Seznam zprávy. The selection of these online newspapers was motivated by their ownership; the aim of the contribution was to find out how much this circumstance is reflected in the way the event is presented. In the analysis, the authors used the method of critical reading based on the quantitative processing of a corpus of reports published in the 13 months after the environmental disaster. They mainly focused on what kind of media image is created for the company DEZA in the news (the prime minister was financially connected to both the iDnes.cz newspaper and this company), what space in the news is given to witnesses “from the people” and to experts from the academic sphere, and how the then Minister of the Environment, Richard Brabec, is presented in the news. [ABSTRACT FROM AUTHOR]
Published: 2023

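10. Vícejazyčnost v současné české poezii. Několik úvodních postřehů z korpusové perspektivy. [Multilingualism in contemporary Czech poetry. Some introductory observations from a corpus perspective.]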

Author: Piorecký, Karel and Škrabal, Michal
Subjects: DISTRIBUTION (Probability theory), MULTILINGUALISM, CORPORA, DATA analysis, QUANTITATIVE research, BIBLIOGRAPHIC databases, LITERARY research
Abstract: The paper attempts a quantitative, corpus-based approach to the subject of multilingualism in contemporary Czech poetry (published both in books and on literary servers). The authors of the paper examine the frequency and distribution of foreign (i.e., non-Czech) lexical units, raising questions about the forms and functions of individual lexemes. Three selected poets (T. Kafka, M. Šanda, M. Torčík) are then analyzed in more depth. The paper is also a report on a database currently under development, The Corpus of Contemporary Czech Poetry, and on the possibilities of using it. It suggests how beneficial quantitative data analysis can be in the first phase of linguistically oriented literary research, pointing to the necessity of interconnecting the quantitative and qualitative approaches. It is only the researcher's interpretative competence that can define the boundaries of the research field and the significance of its elements. When conducting text-centered analyses, language corpora should begin to play a role similar to other scientific infrastructure tools, such as bibliographic databases. [ABSTRACT FROM AUTHOR]
Published: 2020

11. Italská frazeologie a frazeografie na přelomu tisíciletí. [Italian phraseology and phraseography at the turn of the millennium.]

Author: Obstová (Praha), Zora
Subjects: PHRASEOLOGY, ENCYCLOPEDIAS & dictionaries, CORPORA, IDIOMS, PROVERBS, WISDOM
Abstract: The paper gives a brief overview of the most important phraseological studies and dictionaries published in Italy since the 1980s, when Italian phraseology took the first steps as an independent linguistic discipline. It deals with fundamental theoretical contributions and describes major dictionaries of idioms and proverbs published in the last 40 years. It points also to new trends in Italian phraseography and presents some interesting current projects based on new corpus methodologies. [ABSTRACT FROM AUTHOR]
Published: 2022

12. Čtení vlastním tempem: kritické představení metody. [Self-paced reading: a critical introduction to the method.]

Author: Chromý (Praha), Jan and Dotlačil (Utrecht), Jakub
Subjects: CORPORA, ELECTRONIC data processing, RESEARCH & development, DISCOURSE, READING
Abstract: Self-paced reading has been a widely used experimental method for the study of the processing of sentences and texts. In this paper, we introduce the method to the Czech audience. We summarize its advantages and limitations and provide practical suggestions on stimuli construction and data processing. We also present different variants of the method, discuss its ecological validity, and summarize the experimental evidence showing that reaction times collected in self-paced reading can be linked to processing demands people might experience during reading. Finally, we present three examples of the application of the method: an experiment on agreement attraction in Czech, an experiment on garden-path sentences in Czech and an experiment studying the processing of short discourses in English. We also briefly discuss new trends that connect corpus linguistics with psycholinguistic discourse processing research and lead to the development of reading-time corpora. [ABSTRACT FROM AUTHOR]
Published: 2022

13. Spor o občana: kritická analýza diskurzu o Klinice. [The dispute over the citizen: a critical analysis of the discourse on Klinika.]

Author: Hořejší, Michal, Dufek, Ondřej, and Truhlařík, Štěpán
Subjects: CRITICAL discourse analysis, POLICE intervention, ACTIVISTS, CORPORA, OUTGROUPS (Social groups), DISCOURSE, PUBLIC sphere
Abstract: The paper is devoted to a critical analysis of the discourse on Autonomní sociální centrum Klinika (Autonomous Social Centre Klinika) that was of high relevance in the Czech public sphere in 2014–2019. The Klinika centre was founded in Prague by a group of civic activists in a building owned by the Czech state which had long fallen into ruin. In 2014 they entered it without the owner's consent. Both demonstrations in support of the centre and repeated police interventions attracted intense media attention, while control over identities, meanings and relations was disputed until the police clear-out in 2019. We aimed to discover what discourse strategies were chosen by particular social actors and what meaning configurations were created in their texts. Based on a) a set of qualitative analyses of nomination and predication strategies (Wodak, 2001; Wodak & Reisigl, 2009) in five initial discourse phase texts and b) frequency analysis of two large corpora representing Klinika supporters and media mainstream, we carried out a detailed concordance analysis of the relation between the activists' group and the citizen category. Using the methods of corpus-assisted critical discourse analysis (e.g., Baker, 2006; Baker et al., 2008), we focused on the question of to what extent, in which contexts and via which discourse processes Klinika ended up outside or inside the ingroup. The main finding is that, because Klinika kept control over the concept of citizen, its final displacement in January 2019 did not mean its discursive defeat. In fact, activists managed to keep their story about a civic opposition against an incompetent power. State violence terminating the existence of the Klinika centre only confirmed its discursive victory. [ABSTRACT FROM AUTHOR]
Published: 2022

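14. OD PATVARU K SUPERSTAR: FREKVENČNÍ A KOLOKAČNÍ ANALÝZA NÁZVU ČESKO V KORPUSOVÉ PUBLICISTICE Z LET 1990−2018. [From a malformed word to a superstar: a frequency and collocation analysis of the name Česko in corpus journalism from 1990−2018.]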

Author: Drkošová, Veronika and Místecký, Michal
Subjects: GEOGRAPHIC names, COLLOCATION (Linguistics), CULTURAL activities, PERSONAL names, CORPORA, FREEDOM of the press
Abstract: The paper focuses on frequency and collocation analyses of Česko (“Czechia”), the short, geographical name of our country, in the opinion journalism section of version 8 of the SYN corpus, which comprises texts from the period of 1990−2018. Within the scope of the research, the period was divided into several sections delineated by landmark political and cultural events (the Czech Republic entering NATO, the Czech Republic entering the EU, the climax of the first season of the Pop Idol-based contest Czechia Is Looking for a SuperStar, etc.). The frequency analysis is based on relative frequencies in instances per million (i.p.m.); collocational strength is computed with the logDice index, which is easy to interpret linguistically and independent of corpus size. The goal of the study is to capture the basic motivations which led to the popularisation of the name and its expansion in the given discourse (e.g. the influence of other one-word names of states, sports commentaries, popular contests, and generational change). In sum, the name Česko is employed in a variety of contexts, and its usage can be seen as unmarked. [ABSTRACT FROM AUTHOR]
Published: 2022
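
Note on item 14: the abstract names two measures, relative frequency in i.p.m. and the logDice index. The sketch below shows how they are commonly computed. It uses the standard published definition of logDice, not the authors' own code, and the function names and all counts are invented for illustration.

```python
import math


def ipm(freq: int, corpus_size: int) -> float:
    """Relative frequency in instances per million tokens."""
    return freq * 1_000_000 / corpus_size


def log_dice(f_xy: int, f_x: int, f_y: int) -> float:
    """logDice = 14 + log2(2 * f_xy / (f_x + f_y)).

    f_xy: co-occurrence count of node and collocate
    f_x, f_y: individual frequencies of node and collocate
    """
    return 14 + math.log2(2 * f_xy / (f_x + f_y))


# Invented counts, for illustration only (not taken from the SYN corpus).
print(ipm(500, 10_000_000))        # 50.0
print(log_dice(25, 100, 100))      # 12.0
print(log_dice(250, 1000, 1000))   # 12.0, same ratio in a corpus ten times larger
```

The last two calls illustrate the property mentioned in the abstract: the score depends only on the ratio of the counts, so scaling the corpus leaves it unchanged.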

15. CO VID(ÍME) V INTERCORPU? K HLEDÁNÍ EKVIVALENTŮ ČESKÉHO VIDU V PARALELNÍM KORPUSU. [What do we see in InterCorp? On searching for equivalents of Czech aspect in a parallel corpus.]

Author: NOVÁKOVÁ, EVA
Subjects: GRAMMATICAL categories, ENGLISH language, CORPORA, TRANSLATING & interpreting, FUNCTIONAL analysis
Abstract: The paper discusses the use of parallel corpora (InterCorp, a subcorpus of the Czech National Corpus) in translation. It focuses on the advantages and drawbacks of exploiting corpus data for a specific translation task, i.e., presenting the language-specific grammatical category of Czech aspect to L2 Czech learners via English as the auxiliary language. [ABSTRACT FROM AUTHOR]
Published: 2021

16. Jak se píše o inkluzi? Společné vzdělávání pohledem korpusové lingvistiky. [How is inclusion written about? Inclusive education from the perspective of corpus linguistics.]

Author: Velčovský, Václav
Subjects: CORPORA, JOURNALISTS, INCLUSIVE education, CONCEPTS, COMPREHENSION
Abstract: Copyright of E-Pedagogium is the property of Palacky University in Olomouc.
Published: 2020

17. Příznak tantum v morfologii češtiny. [The tantum feature in Czech morphology.]

Author: VONDRÁČEK, Miloslav
Subjects: GRAMMATICAL categories, NOUNS, GRAMMAR, VERBS, CORPORA
Abstract: Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu.
Published: 2020

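18. HOMONYMIE MEZI APELATIVY A PROPRII JAKO PROBLÉM AUTOMATICKÉ MORFOLOGICKÉ ANALÝZY ČEŠTINY. [Homonymy between common nouns and proper names as a problem for the automatic morphological analysis of Czech.]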

Author: Osolsobě, Klára and Žižková, Hana
Subjects: NOUNS, CORPORA, GEOGRAPHIC names, ANNOTATIONS
Abstract: The aim of this paper is to provide a corpus-based analysis of one type of Czech proper nouns (the Zubří type). We will argue that the adequate annotation (lemmatisation and morphological tagging) of proper nouns of the Zubří type depends on several circumstances: 1) the coverage of the dictionary of the automatic analyser; 2) the accurate description of the variability of inflectional forms; 3) the non-trivial disambiguation of numerous homonymous word forms. We believe that while meeting the first two conditions is possible, adequate disambiguation goes beyond the possibilities of automatic morphological analysis. [ABSTRACT FROM AUTHOR]
Published: 2020

19. Konstrukce then v matematickém textu. [Constructions of then in mathematical text.]

Author: Malá, Lucie
Subjects: WORD recognition, CONSTRUCTION grammar, TEXTBOOKS, CONSTRUCTION, CORPORA
Abstract: The paper explores the possibilities for constructional analysis of functions of a word in a specific text type. Five constructions of the word then found in a corpus of mathematical university textbooks are described in detail: logical then, hypothetical conditional then, temporal then, resultative then, and summarising then. While this is not meant to be an exhaustive list of constructions of then, it is apparent from the results of the analysis that the constructional perspective offers more precise information on the use of then in mathematical texts. [ABSTRACT FROM AUTHOR]
Published: 2019

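20. Korpusová kritická analýza diskurzu: povaha, možnosti a limity (na příkladu analýzy jazykových ideologií v českém parlamentním diskurzu). [Corpus-based critical discourse analysis: nature, possibilities and limits (illustrated by an analysis of language ideologies in Czech parliamentary discourse).]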

Author: Dufek, Ondřej
Subjects: CRITICAL discourse analysis, CZECH language, CORPORA, CRITICAL realism
Abstract: The paper provides a thorough review of the corpus-linguistic approach to critical discourse analysis. It briefly presents the core of critical discourse analysis (CDA) and examines the possibilities of applying corpus tools to it. In the next step, critical commentaries on CDA are summarized and at the same time, possible corpus-linguistic solutions are offered. The final part offers an illustrative application of corpus-assisted CDA focusing on language ideologies in the Czech parliamentary discourse. [ABSTRACT FROM AUTHOR]
Published: 2019

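21. MOŽNOSTI UPLATNĚNÍ LINGVISTICKÉ TEORIE V AFAZIOLOGII NA PŘÍKLADU USAGE-BASED LINGVISTIKY. [Possibilities of applying linguistic theory in aphasiology, illustrated by usage-based linguistics.]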

Author: Láznička, Michal
Subjects: APPLIED linguistics, LINGUISTICS education, LINGUISTICS, CORPORA
Abstract: Copyright of Speciální Pedagogika is the property of Charles University Prague, Faculty of Education.
Published: 2019

22. Využití kontrastivní analýzy v kritice překladu na příkladu textů Evropské unie. [The use of contrastive analysis in translation criticism, illustrated by European Union texts.]

Author: Nováková, Eva
Subjects: ENGLISH language, TRANSLATIONS, LINGUISTICS, MATHEMATICAL equivalence, CRITICISM, CORPORA, CULTURAL adaptation
Abstract: The aim of the present paper is to illustrate possible strategies to assess translation quality and functional equivalence with the methods of contrastive analysis. The study is therefore related to relationships between the two autonomous, yet tightly interrelated disciplines of linguistics and translation studies, and it seeks to contribute to the discussion on whether and how translation criticism might profit from the linguistic point of view on source and target texts originating in typologically different language systems. The corpus of data consists of English and Czech versions of the European Union official documents representing specific text types that, due to hybridity of their forms and functions, address a heterogeneous group of recipients, i.e., both EU officials and the European general public. The English texts are analysed with respect to the occurrences of nominal expressions, as these are assumed to be the prominent stylistic markers of the administrative texts as well as an inherent systemic feature of the English language, and the individual instances of nominalizations are contrasted with their authentic Czech equivalents. The analysis based on the text-oriented contrastive approach aims at assessing the (non-)adequacy of possible equivalents regarding the requirements on their functionality within the selected genre. [ABSTRACT FROM AUTHOR]
Published: 2019

23. Vybraná problematika z autorské lexikografie. [Selected issues in authorial lexicography.]

Author: ZMĚLÍK, RICHARD
Subjects: LEXICOGRAPHY, ENCYCLOPEDIAS & dictionaries, ACADEMIC discourse, CORPORA
Abstract: The present paper explores the topic of authorial dictionaries -- a subject rarely discussed in Czech academic discourse. The aim of the study is to present several forms of authorial dictionaries, focusing mainly on current and up-to-date methods. At the beginning, the study explores two viewpoints (J. Mattausch and H. E. Wiegand) of the taxonomy of authorial dictionaries, of which the second is subjected to slightly stronger criticism. The article also looks at the development of this special lexicographical discipline. Finally, the paper mentions future perspectives of modern corpus devices in the sphere of authorial lexicography. [ABSTRACT FROM AUTHOR]
Published: 2014

24. Variabilita českých frazémů v úzu. [The variability of Czech phrasemes in usage.]

Author: Jelínek, Tomáš, Kopřivová, Marie, Petkevič, Vladimír, and Skoumalová, Hana
Abstract: The paper addresses the variability of Czech phrasemes, i.e. semantically non-compositional multiword units, in current use represented by corpora, the variability being the result of linguistic creativity on the part of text authors. It also asks what, in fact, identifies a phraseme. A basic, original phraseme has a certain meaning that cannot be inferred from the meaning of its components, and if it is modified, made more topical and up-to-date, either the original meaning is entirely or partially preserved, or the modified phraseme acquires a totally new meaning. Some phrasemes allow for multiple modifications, while others are more rigid. The article examines different types of lexical/syntactic/morphological/semantic alteration of basic phrasemes. In addition to lexical variations, the focus is mainly on syntactic and morphological changes, and on the question as to whether the chosen syntactic means of expressing semantic shifts have an impact on the potential for a creative treatment of the phraseme. In order to identify the variants of a phraseme with the phraseme itself, we introduce the term phraseme nucleus and outline a partial solution to the phraseme variability problem -- designing a lexical database of multiword units (including phrasemes) containing entries sufficiently flexible to at least partially capture the variability of phrasemes. [ABSTRACT FROM AUTHOR]
Published: 2018

25. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny. [Monocollocability in two typologically different languages: a comparison of Czech and Italian.]

Author: Obstová, Zora
Abstract: The present study deals with the phenomenon of extremely restricted collocability (monocollocability) in Czech and Italian. On the basis of a list of words with the highest degree of monocollocability extracted from corpora with the aid of the so-called Herfindahl-Hirschman Index (HHI), we try to analyse this little-explored phenomenon in both languages. Monocollocable words (MW, often referred to as cranberry words) and the fixed combinations in which they occur are investigated in terms of syntactic and collocation structures, frequency and diachronic development. In the light of corpus evidence, the phenomenon of extremely restricted collocability appears more complex than originally believed. The paper shows that MW in both languages form a very heterogeneous category and that many differences between Czech and Italian monocollocable structures can be explained by typological (e.g. the degree of nominal inflection) or historical factors. [ABSTRACT FROM AUTHOR]
Published: 2017
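
Note on item 25: the abstract says monocollocable words were extracted with the Herfindahl-Hirschman Index. The sketch below assumes the index is applied to the shares of a word's collocates, which is its textbook definition (sum of squared shares). The exact operationalization used in the study is not given in the abstract, so the function name, the input format and the counts here are all hypothetical.

```python
from typing import Mapping


def hhi(collocate_counts: Mapping[str, int]) -> float:
    """Herfindahl-Hirschman Index over a word's collocate distribution.

    Each collocate's share is its count divided by the total; the index is
    the sum of squared shares. It equals 1.0 when a single collocate
    accounts for every occurrence and approaches 0 as usage spreads evenly
    over many collocates.
    """
    total = sum(collocate_counts.values())
    if total == 0:
        raise ValueError("at least one collocate occurrence is required")
    return sum((count / total) ** 2 for count in collocate_counts.values())


# Invented counts, for illustration only.
print(hhi({"collocate_a": 10}))                    # 1.0 (fully monocollocable)
print(hhi({"collocate_a": 5, "collocate_b": 5}))   # 0.5 (two equally likely partners)
```

Under this reading, ranking words by descending index value would put candidates for monocollocability at the top of the list.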

26. Nominalizované struktury se dvěma aktanty ve formě bezpředložkového genitivu. [Nominalized structures with two actants in the form of the prepositionless genitive.]

Author: KOLÁŘOVÁ, VERONIKA
Abstract: Double post-nominal genitives in Czech have thus far been illustrated only by a single type of nominalized structure, e.g., zbavení ženy starostí 'relieving woman-GEN worry-GEN.PL, i.e. relieving the woman of worries'. In this paper, we specify three other types of double post-nominal genitive constructions and search for their frequency in the Prague Dependency Treebank and in the Czech National Corpus. Although the constructions are rare and less acceptable, we try to show that the Czech grammatical system allows them. Special attention is paid to nominalizations of support verb constructions; they can be interpreted as one lexical unit, which enables them to be used within double post-nominal genitive constructions. [ABSTRACT FROM AUTHOR]
Published: 2014

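27. Anglická adjektivní přirovnání: srovnání korpusového vzorku s výběrem ve standardní příručce idiomů. [English adjectival similes: a comparison of a corpus sample with the selection in a standard idiom handbook.]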

Author: Emmer, Jaroslav
Subjects: ENCYCLOPEDIAS & dictionaries, IDIOMS, HOMOGENEITY, CORPORA, TEXTBOOKS, INTUITION
Abstract: An adjectival simile is an established phraseological unit with a standardised form (blind as a bat). It is perhaps for this reason that it has not been attracting much attention, and most studies on similes focus on verbal similes. This study reports a significant diachronic shift in the usage of adjectival similes, identified by comparing a lexicographic sample with one from a corpus. Analysing corpus data, we collected a representative sample of 309 adjectival similes in English, which further served for the compilation of a "simile minimum" of 60 types. Both corpus samples were then contrasted with lexicographic lists from a dictionary of idioms: English Idioms and How to Use Them (Seidl & McMordie, 1978; 1988). The comparison shows that the lexicographic minimum (65 types) overlaps with that from the corpus by just one-third and the whole representative list (166 types) only by one-fifth. The significant disproportion can be explained as a shift in linguistic reality but also as a result of the dictionary authors' idiolects or homogeneity of their data. These findings can serve for textbook and reference manual authors as a warning against relying too much on their own linguistic intuition. [ABSTRACT FROM AUTHOR]
Published: 2024

28. Akademické psaní a frázové banky. [Academic writing and phrasebanks.]

Author: Homoláč, Jiří, Křen, Michal, Kašpárková, Alena, Rosolová, Kamila Etchegoyen, Hoffmannová, Jana, Kopecký, Jakub, Sherman, Tamah, and Vondřička, Pavel
Subjects: ACADEMIC discourse, PERIODICAL articles, WRITING processes, TERMS & phrases, CORPORA, CHARACTER
Abstract: Scholars have previously conceptualized academic writing as both process and product. Academic phrasebanks are a tool in which these two conceptions intertwine, i.e., where the products, existing texts such as journal articles, are broken down into smaller units such as steps and phrases, which are then used in the process of producing new texts. In this article, we examine the possibilities and limits of collecting these smaller units for research and didactic purposes, presenting a newly established phrasebank in this context. First, we consider various scholarly and pedagogical approaches to academic writing. We then provide an overview of existing academic phrasebanks, primarily the seminal University of Manchester Academic Phrasebank created by John Morley, focusing on how its principles and structure have been utilized to create similar tools for other languages. We subsequently describe the design and creation of the Czech Academic Phrasebank, the innovative character of which is its link to the Czech National Corpus, specifically a subcorpus of Czech scholarly articles. We reflect on the processes of conceptualizing the phrasebank, its basic units and functions, excerpting phrases, and linking to the corpus, as well as on the various problems encountered throughout. We conclude by outlining directions for the phrasebank's use in Czech-language genre-based pedagogy. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

29. Slovnědruhová a morfologická homonymie, homofonie a homografie v současné češtině : Part-of-Speech and Morphological Ambiguity, Homophony and Homography in Contemporary Czech

Author: Petkevič, Vladimír
Subjects: part-of-speech and morphological ambiguity, homophony, homography, contemporary Czech, language, corpora, Philology. Linguistics, P1-1091
Abstract: The paper presents a classification of the types of morphological ambiguity and the types of homophony and homography in contemporary Czech occurring in the material of the SYN and SYN2013PUB corpora of the Czech National Corpus. The classification of homonymy and homography constitutes a database for the rule-based automatic morphological disambiguation of written Czech performed at the Institute of Theoretical and Computational Linguistics at the Faculty of Arts, Charles University. As for homophony, the types presented in the paper, and mainly the sets of word forms associated with these types, can be used for the disambiguation of spoken texts.
Published: 2015

30. Pravdepodobne si myslím, že niekedy v roku 2020. Pragmatické markery v slovenčine ako v cudzom jazyku.

Author: Ivanová, Martina
Subjects: DISTRIBUTION (Probability theory), DISCOURSE markers, WORD order (Grammar), ACQUISITION of data, CORPORA, SECOND language acquisition
Abstract: The study deals with pragmatic markers (PMs) and their frequency distribution in written texts of Slovak as a second language (L2). It focuses on the comparison of written texts produced by non-native speakers of Slovak to describe the acquisition aspects of PMs with respect to their preferential realization in the non-native texts, frequency distribution of individual units, maintenance of polyfunctionality of PMs and types of errors occurring in the texts. Individual functional-semantic classes of PMs are delimited within four domains: textual, propositional, illocutionary and interpersonal. On the basis of data from the acquisition corpus of Slovak errkorp-pilot, the central markers of those classes are described and their frequency distribution and functions in the texts are analysed. Corpus data are used to demonstrate i) the tendency towards underrepresentation of PMs in the texts of Slovak as L2, ii) the specificities of the frequency distribution of certain functional and semantic classes of PMs in the texts of Slovak as L2, iii) the tendency towards decreased polyfunctionality of PMs in the texts of Slovak as L2 (polyfunctional PMs are usually limited to realizing one major function in the texts of Slovak as L2). Corpus material also provides evidence of the most frequent errors concerning the usage of PMs, namely lexical ones (falling into the category of substitution with respect to the semantically, formally or pragmatically related PMs, the redundant usage of PMs in the texts, etc.) and syntactic ones (concerning incorrect word order and embedding of PMs). [ABSTRACT FROM AUTHOR]
Published: 2022

31. Teorie relevance a nově vznikající diskurzní částice.

Author: Andersen, Gisle
Subjects: DISCOURSE markers, CORPORA, LANGUAGE contact, DISCOURSE
Abstract: This article explores the relevance-theory view of utterance interpretation (Sperber and Wilson 1986/1995) and illustrates its application in a qualitative investigation of authentic corpus data. The purpose is to show that observations derived from corpora can shed significant light on how constraints on relevance are practised by real speakers in real discourse contexts. The study focuses on discourse markers and argues that there is a need to focus more systematically on emerging discourse markers and their contributions to relevance. It is argued that the corpus-based approach can lead to new knowledge about pragmatic functions and subtle differences between different items, and that this extends beyond what is gained from a strictly theoretical or experimental approach, by far the most common approaches in the previous relevance-theory literature. As a case in point, the article includes an empirical study of the discourse marker as if, based on the large English TenTen corpus (Jakubíček et al., 2013). [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

32. Variabilita českých frazémů v úzu | Variability of Czech phraseme usage

Author: Tomáš Jelínek, Marie Kopřivová, Vladimír Petkevič, and Hana Skoumalová
Subjects: phraseme variability, language creativity, modification, phraseme nucleus, corpora, phraseme lexical database, variabilita frazémů, jazyková kreativita, modifikace, jádro frazému, jazykové korpusy, lexikální databáze frazémů, Philology. Linguistics, P1-1091
Abstract: The paper addresses the variability of Czech phrasemes, i.e. semantically non-compositional multiword units, in current use represented by corpora, the variability being the result of linguistic creativity on the part of text authors. It also asks what, in fact, identifies a phraseme. A basic, original phraseme has a certain meaning that cannot be inferred from the meaning of its components, and if it is modified, made more topical and up-to-date, either the original meaning is entirely or partially preserved, or the modified phraseme acquires a totally new meaning. Some phrasemes allow for multiple modifications, while others are more rigid. The article examines different types of lexical/syntactic/morphological/semantic alteration of basic phrasemes. In addition to lexical variations, the focus is mainly on syntactic and morphological changes, and on the question as to whether the chosen syntactic means of expressing semantic shifts have an impact on the potential for a creative treatment of the phraseme. In order to identify the variants of a phraseme with the phraseme itself, we introduce the term phraseme nucleus and outline a partial solution to the phraseme variability problem — designing a lexical database of multiword units (including phrasemes) containing entries sufficiently flexible to at least partially capture the variability of phrasemes.
Published: 2018

33. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny : Monocollocability in Two Typologically Different Languages: A Comparison of Czech and Italian

Author: Zora Obstová
Subjects: monocollocable words, cranberry words, language typology, Czech, Italian, corpora, monokolokabilní slova, jazyková typologie, čeština, italštiny, korpusy, Philology. Linguistics, P1-1091
Abstract: The present study deals with the phenomenon of extremely restricted collocability (monocollocability) in Czech and Italian. On the basis of a list of words with the highest degree of monocollocability, extracted from corpora with the aid of the so-called Herfindahl-Hirschman Index (HHI), we try to analyse this little-explored phenomenon in both languages. Monocollocable words (MW, often referred to as cranberry words) and the fixed combinations in which they occur are investigated in terms of syntactic and collocation structures, frequency and diachronic development. In the light of corpus evidence, the phenomenon of extremely restricted collocability appears more complex than originally believed. The paper shows that MW in both languages form a very heterogeneous category and that many differences between Czech and Italian monocollocable structures can be explained by typological (e.g. the degree of nominal inflection) or historical factors.
Published: 2017
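
Note: The abstract above names the Herfindahl-Hirschman Index but does not say how it was computed over corpus data. As a rough illustration only, the sketch below uses the standard definition of the HHI (the sum of squared shares) and assumes that the shares are the relative frequencies of a word's collocates. The function name `hhi` and the toy counts are hypothetical and are not taken from the study.

```python
from collections import Counter


def hhi(collocate_counts):
    """Return the Herfindahl-Hirschman Index of a frequency distribution.

    The index is the sum of squared shares. It equals 1.0 when all
    occurrences fall on a single collocate and approaches 1/k when
    occurrences are spread evenly over k collocates.
    """
    total = sum(collocate_counts.values())
    if total == 0:
        raise ValueError("no collocate occurrences to measure")
    return sum((n / total) ** 2 for n in collocate_counts.values())


# Hypothetical toy data (an assumption, not figures from the paper):
# one word that almost always appears with the same collocate,
# and one word whose collocates are evenly spread.
restricted = Counter({"collocate_a": 98, "collocate_b": 2})
free = Counter({"collocate_a": 25, "collocate_b": 25,
                "collocate_c": 25, "collocate_d": 25})

restricted_score = hhi(restricted)  # close to 1: highly restricted
free_score = hhi(free)              # 1/4 for four equally frequent collocates
```

Under this assumption, a value near 1 would mark a candidate monocollocable word, while lower values indicate freer combinability. Any threshold for "highest degree of monocollocability" would have to come from the original paper.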

34. Gestikulace ve sdíleném prostoru jako kooperativní utváření významu.

Author: Lehečková, Eva and Jehlička, Jakub
Subjects: SOCIAL interaction, BUSINESS meetings, HUMAN experimentation, GESTURE, CORPORA, SPACE
Abstract: The present study focuses on strategies which speakers employ when gesturing in a shared articulatory space. Using data from English and Czech multimodal corpora of spontaneous business meetings, we conducted a qualitative analysis of gestural patterns based on two strategies: alignment and elaboration of gestures representing abstract/conceptual objects. We show that speakers make use of both strategies in the context of co-operative meaning formation (with various pragmatic functions) and that the notions of alignment and elaboration provide useful analytic and descriptive tools for the study of human interaction from a multimodal perspective. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF

35 results

1. Jazykové korpusy a možnosti jejich využití při interpretaci práva.

2. Here’s hoping v procesu gramatikalizace.

3. Korpus OnomOs: principy a příklady aplikací.

4. Korpus DIA1900: jeho koncepce a vytváření.

5. Meryl Streepová a Emma Stone: přechylování cizích příjmení v češtině prizmatem korpusového výzkumu.

6. Funkce a slovosled partikulí zajisté a pak ve staré a střední češtině.

7. Hodnoty slovesných morfologických kategorií v korpusu SYN2020 -- atribut verbtag.

8. Století cest na Slovensko: na okraj jednoho česko-slovenského dialogu.

9. ZPRACOVÁNÍ IDENTICKÉ KAUZY V RŮZNÝCH ON-LINE DENÍCÍCH.

10. Vícejazyčnost v současné české poezii. Několik úvodních postřehů z korpusové perspektivy.

11. Italská frazeologie a frazeografie na přelomu tisíciletí.

12. Čtení vlastním tempem: kritické představení metody.

13. Spor o občana: kritická analýza diskurzu o Klinice.

14. OD PATVARU K SUPERSTAR: FREKVENČNÍ A KOLOKAČNÍ ANALÝZA NÁZVU ČESKO V KORPUSOVÉ PUBLICISTICE Z LET 1990−2018.

15. CO VID(ÍME) V INTERCORPU? K HLEDÁNÍ EKVIVALENTŮ ČESKÉHO VIDU V PARALELNÍM KORPUSU.

16. Jak se píše o inkluzi? Společné vzdělávání pohledem korpusové lingvistiky.

17. Příznak tantum v morfologii češtiny.

18. HOMONYMIE MEZI APELATIVY A PROPRII JAKO PROBLÉM AUTOMATICKÉ MORFOLOGICKÉ ANALÝZY ČEŠTINY.

19. Konstrukce then v matematickém textu.

20. Korpusová kritická analýza diskurzu: povaha, možnosti a limity (na příkladu analýzy jazykových ideologií v českém parlamentním diskurzu).

21. MOŽNOSTI UPLATNĚNÍ LINGVISTICKÉ TEORIE V AFAZIOLOGII NA PŘÍKLADU USAGE-BASED LINGVISTIKY.

22. Využití kontrastivní analýzy v kritice překladu na příkladu textů Evropské unie.

23. Vybraná problematika z autorské lexikografie.

24. Variabilita českých frazémů v úzu.

25. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny.

26. Nominalizované struktury se dvěma aktanty ve formě bezpředložkového genitivu.

27. Anglická adjektivní přirovnání: srovnání korpusového vzorku s výběrem ve standardní příručce idiomů.

28. Akademické psaní a frázové banky.

29. Slovnědruhová a morfologická homonymie, homofonie a homografie v současné češtině : Part-of-Speech and Morphological Ambiguity, Homophony and Homography in Contemporary Czech

30. Pravdepodobne si myslím, že niekedy v roku 2020. Pragmatické markery v slovenčine ako v cudzom jazyku.

31. Teorie relevance a nově vznikající diskurzní částice.

32. Variabilita českých frazémů v úzu | Variability of Czech phraseme usage

33. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny : Monocollocability in Two Typologically Different Languages: A Comparison of Czech and Italian

34. Gestikulace ve sdíleném prostoru jako kooperativní utváření významu.

35. Možnosti chybové anotace češtiny nerodilých mluvčích
