35 results
Search Results
2. Here’s hoping v procesu gramatikalizace.
- Author
-
Malá, Markéta and Nádraská, Zuzana
- Subjects
DEIXIS (Linguistics) ,CONSTRUCTION grammar ,GRAMMATICALIZATION ,DATA analysis ,CORPORA - Abstract
The aim of the paper is to examine the functional and formal features of the construction here’s hoping. Our analysis is informed by the frameworks of construction grammar (Fried, 2010, 2013), grammaticalization (Fried, 2009; Himmelmann, 2004), subjectification (Company, 2006) and impersonalization (Siewierska, 2008). The construction seems to have developed a novel subjectified function, i.e. to express the speaker’s positive expectation while retaining a degree of tentativeness and distance. The data for the analysis was excerpted from the corpus English Web 2021 (enTenTen21). The internal and external features of the construction (e.g. the (non-)deictic function of here, fixedness, syntactic isolation, initial position) together with the overall expansion of the semantic and pragmatic context and the gradual host-class expansion suggest that the process of grammaticalization/subjectification is currently under way (cf. Fried, 2010; Company, 2006). [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Korpus OnomOs: principy a příklady aplikací.
- Author
-
Místecký, Michal, David, Jaroslav, Glogarová, Jana Davidová, and Klemensová, Tereza
- Subjects
CZECH language ,WESTERN countries ,PERSONAL names ,SUPPURATION ,CORPORA - Abstract
The study introduces OnomOs, a new corpus of Czech texts with annotation of proper names. The corpus was compiled by onomasticians from the Department of Czech Language, Faculty of Arts, University of Ostrava, and made available by the Institute of the Czech National Corpus, Faculty of Arts, Charles University in Prague. The paper briefly discusses the content and structure of the cor- pus, the selection of texts for inclusion, and the onomastic-geographical classification of the iden- tified names. The text consists chiefly of three preparatory analyses, which focus on the most fre- quent surnames, collocations found in Western and Eastern countries in the pre-1989 period, and the declension patterns of three types of onyms. In the summary, further possibilities of onomastic corpus research are presented. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Korpus DIA1900: jeho koncepce a vytváření.
- Author
-
Benešová, Lucie, Kučera, Karel, Najbrtová, Kateřina, Pivoňková, Klára, and Stluka, Martin
- Subjects
ENCYCLOPEDIAS & dictionaries ,NINETEENTH century ,CORPORA ,ORTHOGRAPHY & spelling ,ANNOTATIONS - Abstract
The objective of the paper is to describe the principles for building the onemillionword DIA1900 Corpus consisting of Czech texts published between 1851 and 1900, designed to be both balanced and representative. There are two main goals determining the methods of corpus building and the decision to develop new tools tailored to the special needs of 19th century Czech: 1) to present the variability of Czech in the 2nd half of the 19th century (including spelling, morphology, wordformation) and 2) to link the detected variants to the appropriate lemmas. The paper presents the phases of the processing of the texts, including transcription, manual pre-annotation, as well as the construction of a large morphological dictionary and the selection of a suitable set of paradigms. Further sections are focused on annotation and morphological tagging and manual disambiguation. The objective was to create a gold standard, intended for use in the automatic annotation both of the DIA1900 corpus and the planned corpus of Czech texts of the years 1800-1850. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
5. Meryl Streepová a Emma Stone: přechylování cizích příjmení v češtině prizmatem korpusového výzkumu.
- Author
-
DAVID, Jaroslav and MÍSTECKÝ, Michal
- Subjects
ACTING awards ,PERSONAL names ,ORTHOGRAPHY & spelling ,CORPORA ,ACADEMY Awards ,ACTRESSES - Abstract
Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2023
- Full Text
- View/download PDF
6. Funkce a slovosled partikulí zajisté a pak ve staré a střední češtině.
- Author
-
JEŽOVÁ, Martina
- Subjects
WORD order (Grammar) ,CORPORA ,TERMS & phrases ,SYNTAX (Grammar) - Abstract
Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2023
- Full Text
- View/download PDF
7. Hodnoty slovesných morfologických kategorií v korpusu SYN2020 -- atribut verbtag.
- Author
-
Jelínek, Tomáš, Petkevič, Vladimír, and Skoumalová, Hana
- Subjects
VERBS ,MORPHOSYNTAX ,DISEASE susceptibility ,CORPORA ,DETECTIVES ,GENE ontology - Abstract
The paper describes the verbtag attribute, which allows a user to search, in the SYN2020 corpus (and also subsequent corpora, SYNv9 and SYNv10) of contemporary Czech, for all values of morphological categories of verbs, i.e., not only those contained in the tag attribute, but also those related mainly to multi-word participial verb predicates, which are prevalent in Czech. The verbtag attribute contains information indicating whether the verb (co-)forming the verbal meaning is either auxiliary or autosemantic, as well as information about the verb mode, diathesis, person, number and tense. The annotation applies both to verb predicates expressed in a single word (e.g., the 1st person indicative present tense: Čtu rád detektivní příběhy. 'I like to read detective stories.') and (especially) to verb predicates expressed in multiple words (e.g., the present conditional of the 1st person singular: Pak bych mu s chutí nabídla výhodnou smlouvu. 'Then I would gladly offer him a good deal.'). The authors introduce the motivation and the concept of the verbtag annotation, describe relevant morphological categories and their values in detail, and show, via examples, how various multiword structures expressing verbal meaning are annotated in the verbtag attribute. They also offer users a guide to the whole issue of verbal morphosyntax manifested in the verbtag attribute and possibilities for efficient search for and retrieval of morphological/morphosyntactic data. The paper shows which multiple verb complexes are simple in terms of annotation, but also identifies more complex cases (e.g., coordination of participles) which are not easy to automatically annotate, and/or whose annotation is unclear in terms of an adequate theoretical approach. The authors also present the method used for annotating multiword verbal complexes and its current success rate. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
8. Století cest na Slovensko: na okraj jednoho česko-slovenského dialogu.
- Author
-
Pátková, Jana
- Subjects
POSTCOLONIALISM ,STEREOTYPES ,MYTH ,CORPORA ,CLASSIFICATION ,MUTUALISM - Abstract
This paper focuses on the literary representation of Slovakia in selected travelogues by Czech authors. The subject of the research is that of cultural stereotypes, images of the other, and constructions of us and them. The corpus of travelogues covers the period from the 1830s to the end of the 1930s. The methodological framework for analysing travelogues includes several diverse approaches. In addition to literaryhistory classification, the study employs an imagological approach while also taking into account an approach based on postcolonial theories in the context of the interpretation of cultural stereotypes. The material is divided into four historical periods with regard to the form and changing face of Czech-Slovak dialogue. Through the travelogue material under review we can analyse how the image of Slovakia within the Czech cultural myth of Slovakia has been shaped and transformed over the course of a century. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. ZPRACOVÁNÍ IDENTICKÉ KAUZY V RŮZNÝCH ON-LINE DENÍCÍCH.
- Author
-
SVOBODOVÁ, JINDŘIŠKA and FALTÝNEK, DAN
- Subjects
ELECTRONIC newspapers ,PRIME ministers ,POISONING ,CORPORA ,SPHERES ,ENVIRONMENTAL disasters - Abstract
The paper deals with the reflection of the poisoning of the river Bečva on the news servers iDnes.cz and Seznam zprávy. The selection of on-line newspapers was motivated by the person of their owner, the aim of the contribution was to find out how much this circumstance will be reflected in the way the event is presented. In the analysis, the authors used the method of critical reading based on the quantitative processing of the corpus of reports that were published in the period of 13 months after the environmental disaster. They mainly focused on what kind of media image is created for the company DEZA in the news (the prime minister was financially connected to both the iDnes.cz newspaper and this company), what space in the news are given to witnesses “from the people” and experts from the academic sphere, and as the then Minister of the Environment Richard Brabec is presented in the news. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Vícejazyčnost v současné české poezii. Několik úvodních postřehů z korpusové perspektivy.
- Author
-
Piorecký, Karel and Škrabal, Michal
- Subjects
DISTRIBUTION (Probability theory) ,MULTILINGUALISM ,CORPORA ,DATA analysis ,QUANTITATIVE research ,BIBLIOGRAPHIC databases ,LITERARY research - Abstract
The paper is an attempt at a quantitative corpus related approach to the subject of multilingualism in contemporary Czech poetry (published both in books and on literary servers). The authors of the paper examine the frequency and distribution of foreign (i.e., non-Czech) lexical units, raising questions about the forms and functions of individual lexemes. Three selected poets (T. Kafka, M. Šanda, M. Torčík) are then analyzed more in-depth. The paper is also a report about a currently developed database - The Corpus of Contemporary Czech Poetry - and possibilities of using it. It suggests how beneficial the quantitative data analysis in the first phase of linguistically oriented literary research can be, pointing to the necessity of interconnecting the quantitative and qualitative approaches. It is only the researcher's interpretative competence that can define the boundaries of the research field and the significance of its elements. When conducting text-centered analyses, language corpora should begin to play a role similar to other scientific infrastructure tools, such as bibliographic databases. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
11. Italská frazeologie a frazeografie na přelomu tisíciletí.
- Author
-
Obstová (Praha), Zora
- Subjects
PHRASEOLOGY ,ENCYCLOPEDIAS & dictionaries ,CORPORA ,IDIOMS ,PROVERBS ,WISDOM - Abstract
The paper gives a brief overview of the most important phraseological studies and dictionaries published in Italy since the 1980s, when Italian phraseology took the first steps as an independent linguistic discipline. It deals with fundamental theoretical contributions and describes major dictionaries of idioms and proverbs published in the last 40 years. It points also to new trends in Italian phraseography and presents some interesting current projects based on new corpus methodologies. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
12. Čtení vlastním tempem: kritické představení metody.
- Author
-
Chromý (Praha), Jan and Dotlačil (Utrecht), Jakub
- Subjects
CORPORA ,ELECTRONIC data processing ,RESEARCH & development ,DISCOURSE ,READING - Abstract
Self-paced reading has been a widely used experimental method for study of the processing of sentences and texts. In this paper, we introduce the method to the Czech audience. We summarize its advantages and limitations and provide practical suggestions on stimuli construction and data processing. We also present different variants of the method, we discuss its ecological validity, and we summarize the experimental evidence showing that reaction times collected in self-paced reading can be linked to processing demands people might experience during reading. Finally, we present three examples of the application of the method: an experiment on agreement attraction in Czech, an experiment on garden-path sentences in Czech and an experiment studying the processing of short discourses in English. We also briefly discuss new trends that connect corpus linguistics with psycholinguistic discourse processing research and lead to the development of reading-time corpora. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
13. Spor o občana: kritická analýza diskurzu o Klinice.
- Author
-
Hořejší, Michal, Dufek, Ondřej, and Truhlařík, Štěpán
- Subjects
CRITICAL discourse analysis ,POLICE intervention ,ACTIVISTS ,CORPORA ,OUTGROUPS (Social groups) ,DISCOURSE ,PUBLIC sphere - Abstract
The paper is devoted to a critical analysis of the discourse on Autonomní sociální centrum Klinika (Autonomous Social Centre Klinika) that was of high relevance in the Czech public sphere in 2014--2019. The Klinika centre was founded in Prague by a group of civic activists in a building owned by the Czech state which had long fallen into ruin. In 2014 they entered it without the owner's consent. Both demonstrations in support of the centre and repeated police interventions attracted intense media attention, while control over identities, meanings and relations was disputed until the police clear-out in 2019. We aimed to discover what discourse strategies were chosen by particular social actors and what meaning configurations were created in their texts. Based on a) a set of qualitative analyses of nomination and predication strategies (Wodak, 2001; Wodak & Reisigl, 2009) in five initial discourse phase texts and b) frequency analysis of two large corpora representing Klinika supporters and media mainstream, we carried out a detailed concordance analysis of the relation between the activists' group and the citizen category. Using the methods of corpus-assisted critical discourse analysis (e.g., Baker, 2006; Baker et al., 2008), we focused on the question of to what extent, in which contexts and via which discourse processes Klinika ended up outside or inside the ingroup. The main finding is that by keeping control over the concept of citizen, Klinika's final displacement in January 2019 did not mean its discursive defeat. In fact, activists managed to keep their story about a civic opposition against an incompetent power. State violence terminating the existence of the Klinika centre only confirmed its discursive victory. [ABSTRACT FROM AUTHOR]
- Published
- 2022
14. OD PATVARU K SUPERSTAR: FREKVENČNÍ A KOLOKAČNÍ ANALÝZA NÁZVU ČESKO V KORPUSOVÉ PUBLICISTICE Z LET 1990−2018.
- Author
-
Drkošová, Veronika and Místecký, Michal
- Subjects
GEOGRAPHIC names ,COLLOCATION (Linguistics) ,CULTURAL activities ,PERSONAL names ,CORPORA ,FREEDOM of the press - Abstract
The paper focuses on the frequency and collocation analyses of Česko (“Czechia”), the short, geographical name of our country, in the opinion journalism section of the eight-version SYN corpus, which comprises texts from the period of 1990−2018. Within the scope of the research, the period was divided into several sections, which are delineated by the breakthrough political and cultural events (the Czech Republic entering NATO, the Czech Republic entering the EU, climax of the first season of the Pop Idol-based contest Czechia Is Looking for a SuperStar, etc.). The frequency analysis is based on the relativization via i.p.m.; the collocability force is counted on the grounds of the logDice index, which is easy to be interpreted linguistically, and independent of the corpus size. The goal of the study is to capture basic motivations which led to the popularisation of the name and its expansion in the given discourse (e.g. the influences of other one-word names of states, sport commentaries, popular contests, and generation change). It is possible to sum up that the Česko name is employed in a variety of contexts, and its usage can be seen as unmarked. [ABSTRACT FROM AUTHOR]
- Published
- 2022
15. CO VID(ÍME) V INTERCORPU? K HLEDÁNÍ EKVIVALENTŮ ČESKÉHO VIDU V PARALELNÍM KORPUSU.
- Author
-
NOVÁKOVÁ, EVA
- Subjects
GRAMMATICAL categories ,ENGLISH language ,CORPORA ,TRANSLATING & interpreting ,FUNCTIONAL analysis - Abstract
The paper discusses the use of parallel corpora (InterCorp, a subcorpus of Czech National Corpus) in translation. It focuses on advantages and drawbacks of exploiting corpus data for a specific translation task, i.e., presenting a language-specific grammatical category of the Czech aspect to L2 Czech learners via English as the auxiliary language. [ABSTRACT FROM AUTHOR]
- Published
- 2021
16. Jak se píše o inkluzi? Společné vzdělávání pohledem korpusové lingvistiky.
- Author
-
Velčovský, Václav
- Subjects
CORPORA ,JOURNALISTS ,INCLUSIVE education ,CONCEPTS ,COMPREHENSION - Abstract
Copyright of E-Pedagogium is the property of Palacky University in Olomouc and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
17. Příznak tantum v morfologii češtiny.
- Author
-
VONDRÁČEK, Miloslav
- Subjects
GRAMMATICAL categories ,NOUNS ,GRAMMAR ,VERBS ,CORPORA - Abstract
Copyright of Bohemistyka is the property of Instytut Filologii Slowianskiej Uniwersytetu im. Adama Mickiewicza w Poznaniu and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2020
- Full Text
- View/download PDF
18. HOMONYMIE MEZI APELATIVY A PROPRII JAKO PROBLÉM AUTOMATICKÉ MORFOLOGICKÉ ANALÝZY ČEŠTINY.
- Author
-
Osolsobě, Klára and Žižková, Hana
- Subjects
NOUNS ,CORPORA ,GEOGRAPHIC names ,ANNOTATIONS - Abstract
The aim of this paper is to provide a corpus-based analysis of one type of Czech proper nouns (type Zubří). We will argue that the adequate annotation (lemmatisation and morphological tagging) of proper nouns type Zubří depends on several circumstances: 1) the coverage of the dictionary of the automatic analyser; 2) the accurate description of the variability of inflexion forms; 3) the non-trivial disambiguation of numerous homonymous word forms. We believe that while meeting the first two conditions is possible, the adequate disambiguation goes beyond the possibilities of automatic morphological analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2020
19. Konstrukce then v matematickém textu.
- Author
-
Malá, Lucie
- Subjects
WORD recognition ,CONSTRUCTION grammar ,TEXTBOOKS ,CONSTRUCTION ,CORPORA - Abstract
The paper explores the possibilities for constructional analysis of functions of a word in a specific text type. Five constructions of the word then found in a corpus of mathematical university textbooks are described in detail: logical then, hypothetical conditional then, temporal then, resultative then, and summarising then. While this is not meant to be an exhaustive list of constructions of then, it is apparent from the results of the analysis that the constructional perspective offers more precise information on the use of then in mathematical texts. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
20. Korpusová kritická analýza diskurzu: povaha, možnosti a limity (na příkladu analýzy jazykových ideologií v českém parlamentním diskurzu).
- Author
-
Dufek, Ondřej
- Subjects
CRITICAL discourse analysis ,CZECH language ,CORPORA ,CRITICAL realism - Abstract
The paper provides a thorough review of the corpus-linguistic approach to critical discourse analysis. It briefly presents the core of critical discourse analysis (CDA) and examines the possibilities of applying corpus tools to it. In the next step, critical commentaries on CDA are summarized and at the same time, possible corpus-linguistic solutions are offered. The final part offers an illustrative application of corpus-assisted CDA focusing on language ideologies in the Czech parliamentary discourse. [ABSTRACT FROM AUTHOR]
- Published
- 2019
21. MOŽNOSTI UPLATNĚNÍ LINGVISTICKÉ TEORIE V AFAZIOLOGII NA PŘÍKLADU USAGE-BASED LINGVISTIKY.
- Author
-
Láznička, Michal
- Subjects
APPLIED linguistics ,LINGUISTICS education ,LINGUISTICS ,CORPORA - Abstract
Copyright of Speciální Pedagogika is the property of Charles University Prague, Faculty of Education and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2019
22. Využití kontrastivní analýzy v kritice překladu na příkladu textů Evropské unie.
- Author
-
Nováková, Eva
- Subjects
ENGLISH language ,TRANSLATIONS ,LINGUISTICS ,MATHEMATICAL equivalence ,CRITICISM ,CORPORA ,CULTURAL adaptation - Abstract
The aim of the present paper is to illustrate possible strategies to assess translation quality and functional equivalence with the methods of contrastive analysis. The study is therefore related to relationships between the two autonomous, yet tightly interrelated disciplines of linguistics and translation studies, and it seeks to contribute to the discussion on whether and how translation criticism might profit from the linguistic point of view on source and target texts originating in typologically different language systems. The corpus of data consists of English and Czech versions of the European Union official documents representing specific text types that, due to hybridity of their forms and functions, address a heterogeneous group of recipients, i.e., both EU officials and the European general public. The English texts are analysed with respect to the occurrences of nominal expressions, as these are assumed to be the prominent stylistic markers of the administrative texts as well as an inherent systemic feature of the English language, and the individual instances of nominalizations are contrasted with their authentic Czech equivalents. The analysis based on the text-oriented contrastive approach aims at assessing the (non-)adequacy of possible equivalents regarding the requirements on their functionality within the selected genre. [ABSTRACT FROM AUTHOR]
- Published
- 2019
23. Vybraná problematika z autorské lexikografie.
- Author
-
ZMĚLÍK, RICHARD
- Subjects
LEXICOGRAPHY ,ENCYCLOPEDIAS & dictionaries ,ACADEMIC discourse ,CORPORA - Abstract
The present paper explores the topic of authorial dictionaries -- a subject rarely discussed in Czech academic discourse. The aim of the study is to present several forms of authorial dictionaries, focusing mainly on current and up-to-date methods. At the beginning, the study explores two viewpoints (J. Mattausch and H. E. Wiegand) of the taxonomy of authorial dictionaries, of which the second is subjected to slightly stronger criticism. The article also looks at the development of this special lexicographical discipline. Finally, the paper mentions future perspectives of modern corpus devices in the sphere of authorial lexicography. [ABSTRACT FROM AUTHOR]
- Published
- 2014
24. Variabilita českých frazémů v úzu.
- Author
-
Jelínek, Tomáš, Kopřivová, Marie, Petkevič, Vladimír, and Skoumalová, Hana
- Abstract
The paper addresses the variability of Czech phrasemes, i.e. semantically non-compositional multiword units, in current use represented by corpora, the variability being the result of linguistic creativity on the part of text authors. It also asks what, in fact, identifies a phraseme. A basic, original phraseme has a certain meaning that cannot be inferred from the meaning of its components, and if it is modified, made more topical and up-to-date, either the original meaning is entirely or partially preserved, or the modified phraseme acquires a totally new meaning. Some phrasemes allow for multiple modifications, while others are more rigid. The article examines different types of lexical/syntactic/morphological/semantic alteration of basic phrasemes. In addition to lexical variations, the focus is mainly on syntactic and morphological changes, and on the question as to whether the chosen syntactic means of expressing semantic shifts have an impact on the potential for a creative treatment of the phraseme. In order to identify the variants of a phraseme with the phraseme itself, we introduce the term phraseme nucleus and outline a partial solution to the phraseme variability problem -- designing a lexical database of multiword units (including phrasemes) containing entries sufficiently flexible to at least partially capture the variability of phrasemes. [ABSTRACT FROM AUTHOR]
- Published
- 2018
25. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny.
- Author
-
Obstová, Zora
- Abstract
The present study deals with the phenomenon of extremely restricted collocability (monocollocability) in Czech and Italian. On the basis of a list of words with the highest degree of monocollocability extracted from corpora with the aid of the so called Herfindahl-Hirschman Index (HHI), we try to analyse this little-explored phenomenon in both languages. Monocollocable words (MW, often referred to as cranberry words) and the fixed combinations in which they occur are investigated in terms of syntactic and collocation structures, frequency and diacronic development. In the light of corpus evidence, the phenomenon of extremely restricted collocability appears more complex than originally believed. The paper shows that MW in both languages form a very heterogeneous category and that many differences between Czech and Italian monocollocable structures can be explained by typological (e.g. the degree of nominal inflection) or historical factors. [ABSTRACT FROM AUTHOR]
- Published
- 2017
26. Nominalizované struktury se dvěma aktanty ve formě bezpředložkového genitivu.
- Author
-
KOLÁŘOVÁ, VERONIKA
- Abstract
Double post-nominal genitives in Czech have thus far been illustrated only by a single type of nominalized structure, e.g., zbavení ženy starostí 'relieving woman-GEN worry-GEN.PL, i.e. relieving the woman of worries'. In this paper, we specify three other types of double post-nominal genitive constructions and search for their frequency in the Prague Dependency Treebank and in the Czech National Corpus. Although the constructions are rare and less acceptable, we try to show that Czech grammar system allows them. Special attention is paid to nominalizations of support verb constructions; they can be interpreted as one lexical unit which enables them to be used within double post-nominal genitive constructions. [ABSTRACT FROM AUTHOR]
- Published
- 2014
27. Anglická adjektivní přirovnání: srovnání korpusového vzorku s výběrem ve standardní příručce idiomů.
- Author
-
Emmer, Jaroslav
- Subjects
ENCYCLOPEDIAS & dictionaries ,IDIOMS ,HOMOGENEITY ,CORPORA ,TEXTBOOKS ,INTUITION - Abstract
An adjectival simile is an established phraseological unit with a standardised form (blind as a bat). It is perhaps for this reason that it has not been attracting much attention, and most studies on similes focus on verbal similes. This study reports a significant diachronic shift in the usage of adjectival similes, identified by comparing a lexicographic sample with one from a corpus. Analysing corpus data, we collected a representative sample of 309 adjectival similes in English, which further served for the compilation of a "simile minimum" of 60 types. Both corpus samples were then contrasted with lexicographic lists from a dictionary of idioms: English Idioms and How to Use Them (Seidl & McMordie, 1978; 1988). The comparison shows that the lexicographic minimum (65 types) overlaps with that from the corpus by just one-third and the whole representative list (166 types) only by one-fifth. The significant disproportion can be explained as a shift in linguistic reality but also as a result of the dictionary authors' idiolects or homogeneity of their data. These findings can serve for textbook and reference manual authors as a warning against relying too much on their own linguistic intuition. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Akademické psaní a frázové banky.
- Author
-
Homoláč, Jiří, Křen, Michal, Kašpárková, Alena, Rosolová, Kamila Etchegoyen, Hoffmannová, Jana, Kopecký, Jakub, Sherman, Tamah, and Vondřička, Pavel
- Subjects
ACADEMIC discourse ,PERIODICAL articles ,WRITING processes ,TERMS & phrases ,CORPORA ,CHARACTER - Abstract
Scholars have previously conceptualized academic writing as both process and product. Academic phrasebanks are a tool in which these two conceptions intertwine, i.e., where the products, existing texts such as journal articles, are broken down into smaller units such as steps and phrases, which are then used in the process of producing new texts. In this article, we examine the possibilities and limits of collecting these smaller units for research and didactic purposes, presenting a newly established phrasebank in this context. First, we consider various scholarly and pedagogical approaches to academic writing. We then provide an overview of existing academic phrasebanks, primarily the seminal University of Manchester Academic Phrasebank created by John Morley, focusing on how its principles and structure have been utilized to create similar tools for other languages. We subsequently describe the design and creation of the Czech Academic Phrasebank, the innovative character of which is its link to the Czech National Corpus, specifically a subcorpus of Czech scholarly articles. The processes of conceptualizing the phrasebank, its basic units and functions, excerpting phrases, linking to the corpus, and the various problems encountered throughout are reflected. We conclude by outlining directions for the phrasebank's use in Czech-language genre-based pedagogy. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. Slovnědruhová a morfologická homonymie, homofonie a homografie v současné češtině : Part-of-Speech and Morphological Ambiguity, Homophony and Homography in Contemporary Czech
- Author
-
Petkevič, Vladimír
- Subjects
part-of-speech and morphological ambiguity ,homophony ,homography ,contemporary Czech ,language ,corpora ,Philology. Linguistics ,P1-1091 - Abstract
The paper presents a classification of the types of morphological ambiguity and the types of homophony and homography in contemporary Czech occurring in the material of the SYN and SYN2013PUB corpora of the Czech National Corpus. The classification of homonymy and homography constitutes a data base for the rule-based automatic morphological disambiguation of written Czech performed in the Institute of Theoretical and Computational Linguistics at the Faculty of Arts, Charles University. As for homophony, the types presented in the paper and mainly the sets of word forms associated with these types, can be used for the disambiguation of spoken texts.
- Published
- 2015
30. Pravdepodobne si myslím, že niekedy v roku 2020. Pragmatické markery v slovenčine ako v cudzom jazyku.
- Author
-
Ivanová, Martina
- Subjects
DISTRIBUTION (Probability theory) ,DISCOURSE markers ,WORD order (Grammar) ,ACQUISITION of data ,CORPORA ,SECOND language acquisition - Abstract
The study deals with pragmatic markers (PMs) and their frequency distribution in the written texts of Slovak as a second language (L2). It focuses on the comparison of written texts produced by nonnative speakers of Slovak to describe the acquisition aspects of PMs with respect to their preferential realization in the non-native texts, frequency distribution of individual units, maintenance of polyfunctionality of PMs and types of errors occurring in the texts. Individual functional-semantic classes of PMs are delimited within four domains: textual, propositional, illocutionary and interpersonal. On the basis of data from the acquisition corpus of Slovak errkorp-pilot the central markers of those classes are described and their frequency distribution and functions in the texts are analysed. Corpus data are used to manifest i) the tendency towards underrepresentation of PMs in the texts of Slovak as L2, ii) the specificities of the frequency distribution of certain functional and semantic classes of PMs in the texts of Slovak as L2, iii) the tendency towards decreased polyfunctionality of PMs in the texts of Slovak as L2 (polyfunctional PMs are usually limited to realizing one major function in the texts of Slovak as L2). Corpus material also brings evidence for the most frequent errors concerning the usage of PMs, namely lexical ones (falling into the category of substitution with respect to the semantically, formally or pragmatically related PMs, the redundant usage of PMs in the texts, etc.) and syntactic ones (concerning incorrect word order and embedding of PMs). [ABSTRACT FROM AUTHOR]
- Published
- 2022
31. Teorie relevance a nově vznikající diskurzní částice.
- Author
-
Andersen, Gisle
- Subjects
DISCOURSE markers ,CORPORA ,LANGUAGE contact ,DISCOURSE - Abstract
This article explores the relevance-theory view of utterance interpretation (Sperber and Wilson 1986/1995) and illustrates its application in a qualitative investigation of authentic corpus data. The purpose is to show that observations derived from corpora can shed significant light on how constraints on relevance are practised by real speakers in real discourse contexts. The study focuses on discourse markers and argues that there is a need to focus more systematically on emerging discourse markers and their contributions to relevance. It is argued that the corpus-based approach can lead to new knowledge about pragmatic functions and subtle differences between different items, and that this extends beyond what is gained from a strictly theoretical or experimental approach, by far the most common approaches in the previous relevance-theory literature. As a case in point, the article includes an empirical study of the discourse marker as if, based on the large English TenTen corpus (Jakubíček et al., 2013). [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
32. Variabilita českých frazémů v úzu | Variability of Czech phraseme usage
- Author
-
Tomáš Jelínek, Marie Kopřivová, Vladimír Petkevič, and Hana Skoumalová
- Subjects
phraseme variability ,language creativity ,modification ,phraseme nucleus ,corpora ,phraseme lexical database ,variabilita frazémů ,jazyková kreativita ,modifikace ,jádro frazému ,jazykové korpusy ,lexikální databáze frazémů ,Philology. Linguistics ,P1-1091 - Abstract
The paper addresses the variability of Czech phrasemes, i.e. semantically non-compositional multiword units, in current use represented by corpora, the variability being the result of linguistic creativity on the part of text authors. It also asks what, in fact, identifies a phraseme. A basic, original phraseme has a certain meaning that cannot be inferred from the meaning of its components, and if it is modified, made more topical and up-to-date, either the original meaning is entirely or partially preserved, or the modified phraseme acquires a totally new meaning. Some phrasemes allow for multiple modifications, while others are more rigid. The article examines different types of lexical/syntactic/morphological/semantic alteration of basic phrasemes. In addition to lexical variations, the focus is mainly on syntactic and morphological changes, and on the question as to whether the chosen syntactic means of expressing semantic shifts have an impact on the potential for a creative treatment of the phraseme. In order to identify the variants of a phraseme with the phraseme itself, we introduce the term phraseme nucleus and outline a partial solution to the phraseme variability problem — designing a lexical database of multiword units (including phrasemes) containing entries sufficiently flexible to at least partially capture the variability of phrasemes.
- Published
- 2018
33. Monokolokabilita ve dvou typologicky odlišných jazycích: srovnání češtiny a italštiny : Monocollocability in Two Typologically Different Languages: A Comparison of Czech and Italian
- Author
-
Zora Obstová
- Subjects
monocollocable words ,cranberry words ,language typology ,Czech ,Italian ,corpora ,monokolokabilní slova ,jazyková typologie ,čeština ,italštiny ,korpusy ,Philology. Linguistics ,P1-1091 - Abstract
The present study deals with the phenomenon of extremely restricted collocability (monocollocability) in Czech and Italian. On the basis of a list of words with the highest degree of monocollocability extracted from corpora with the aid of the so called Herfindahl-Hirschman Index (HHI), we try to analyse this little-explored phenomenon in both languages. Monocollocable words (MW, often referred to as cranberry words) and the fixed combinations in which they occur are investigated in terms of syntactic and collocation structures, frequency and diacronic development. In the light of corpus evidence, the phenomenon of extremely restricted collocability appears more complex than originally believed. The paper shows that MW in both languages form a very heterogeneous category and that many differences between Czech and Italian monocollocable structures can be explained by typological (e.g. the degree of nominal inflection) or historical factors.
- Published
- 2017
34. Gestikulace ve sdíleném prostoru jako kooperativní utváření významu.
- Author
-
Lehečková, Eva and Jehlička, Jakub
- Subjects
SOCIAL interaction ,BUSINESS meetings ,HUMAN experimentation ,GESTURE ,CORPORA ,SPACE - Abstract
The present study focuses on strategies which speakers employ when gesturing in a shared articulatory space. Using data from English and Czech multimodal corpora of spontaneous business meetings, we conducted a qualitative analysis of gestural patterns based on two strategies: alignment and elaboration of gestures representing abstract/conceptual objects. We show that speakers make use of both strategies in the context of co-operative meaning formation (with various pragmatic functions) and that the notions of alignment and elaboration provide useful analytic and descriptive tools for the study of human interaction from a multimodal perspective. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
35. Možnosti chybové anotace češtiny nerodilých mluvčích
- Author
-
Svák, Jiří, Šebesta, Karel, and Pierścieniak, Piotr Paweł
- Subjects
korpus ,žákovský korpus ,multi-level annotation ,corpora ,distanční anotace ,stand-off markup ,chybová anotace ,FALKO ,vícerovinná anotace ,error annotation ,leaner corpora ,CZESL ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) - Abstract
Bachelor thesis "Possibilities of Error Annotation of Non-Native Speakers' Czech" compares annotation systems of selected learners corpora from the perspective of error annotation. For the comparison, two learner corpora were chosen - Czech CZESL and German FALKO. Both corporas use stand-off multi-level annotation model. The paper is divided into two parts: theoretical and practical. In the theoretical part there is an in-depth description of both of selected corporas and their annotation models. The practical part presents annotation of pupil text processed in annotation models of both corporas. The aim of this paper is to highlight possible strengths and weaknesses of selected annotation formats.
- Published
- 2013
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.