8 results on '"Hajič, Jan"'
Search Results
2. Some of Our Best Friends Are Statisticians
- Author
-
Hajič, Jan, Hajičová, Eva, Carbonell, Jaime G., editor, Siekmann, J\'org, editor, Matoušek, Václav, editor, and Mautner, Pavel, editor
- Published
- 2007
- Full Text
- View/download PDF
3. MRP 2020: The Second Shared Task on Cross-Framework and Cross-Lingual Meaning Representation Parsing
- Author
-
Abzianidze, L., Oepen, Stephan, Abend, Omri, Abzianidze, Lasha, Bos, Johan, Hajič, Jan, Hershcovich, Daniel, Li, Bin, O'Gorman, Tim, Xue, Nianwen, Zeman, Daniel, LS Comp.semantiek en kunstm.intelligent., ILS LLI, and ILS Variation
- Subjects
Cross lingual ,Parsing ,business.industry ,Computer science ,Serialization ,Directed graph ,computer.software_genre ,Language acquisition ,Software ,Graph (abstract data type) ,Artificial intelligence ,business ,computer ,Sentence ,Natural language processing - Abstract
The 2020 Shared Task at the Conference for Computational Language Learning (CoNLL) was devoted to Meaning Representation Parsing (MRP) across frameworks and languages. Extending a similar setup from the previous year, five distinct approaches to the representation of sentence meaning in the form of directed graphs were represented in the English training and evaluation data for the task, packaged in a uniform graph abstraction and serialization; for four of these representation frameworks, additional training and evaluation data was provided for one additional language per framework. The task received submissions from eight teams, of which two do not participate in the official ranking because they arrived after the closing deadline or made use of additional training data. All technical information regarding the task, including system submissions, official results, and links to supporting resources and software are available from the task web site at: http://mrp.nlpl.eu
- Published
- 2020
4. DRS at MRP 2020: Dressing up Discourse Representation Structures as Graphs
- Author
-
Abzianidze, Lasha, Bos, Johan, Oepen, S., Oepen, Stephan, Abend, Omri, Hajič, Jan, Hershcovich, Daniel, Li, Bin, O'Gorman, Tim, Xue, Nianwen, and Zeman, Daniel
- Subjects
Structure (mathematical logic) ,FOS: Computer and information sciences ,Discourse representation theory ,Computer Science - Computation and Language ,Interpretation (logic) ,Parsing ,Computer science ,business.industry ,I.2.7 ,68T50 ,02 engineering and technology ,Meaning (non-linguistic) ,Semantics ,computer.software_genre ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Representation (mathematics) ,business ,computer ,Computation and Language (cs.CL) ,Natural language processing ,Natural language - Abstract
Discourse Representation Theory (DRT) is a formal account for representing the meaning of natural language discourse. Meaning in DRT is modeled via a Discourse Representation Structure (DRS), a meaning representation with a model-theoretic interpretation, which is usually depicted as nested boxes. In contrast, a directed labeled graph is a common data structure used to encode semantics of natural language texts. The paper describes the procedure of dressing up DRSs as directed labeled graphs to include DRT as a new framework in the 2020 shared task on Cross-Framework and Cross-Lingual Meaning Representation Parsing. Since one of the goals of the shared task is to encourage unified models for several semantic graph frameworks, the conversion procedure was biased towards making the DRT graph framework somewhat similar to other graph-based meaning representation frameworks., Comment: 10 pages, 4 figures, 4 tables, CoNLL 2020 Shared Task
- Published
- 2020
- Full Text
- View/download PDF
5. Hluboké učení v automatické analýze českého textu.
- Author
-
Straková, Jana, Straka, Milan, Hajič, Jan, and Popel, Martin
- Subjects
NATURAL language processing ,ARTIFICIAL neural networks ,CZECH language ,DEEP learning ,COMPUTATIONAL linguistics ,AUTOMATIC classification - Abstract
The deep learning methods of artificial neural networks have seen a significant uptake in recent years, and have succeeded in overcoming and advancing the success of auto-solving tasks in many fields. The field of computational linguistics and its application offshoot, natural language processing, with classic tasks such as morphological tagging, dependency analysis, named entity recognition and machine translation, are no exception to this. This paper provides an overview of recent advances in these tasks related to the Czech language and presents completely new results in the areas of morphological marking and recognition of named entities in Czech, along with a detailed error analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2019
6. The Strategic Impact of META-NET on the Regional, National and International Level
- Author
-
Rehm, Georg, Uszkoreit, Hans, Ananiadou, Sophia, Bel, Núria, Bielevičienė, Audronė, Borin, Lars, Branco, António, Budin, Gerhard, Calzolari, Nicoletta, Daelemans, Walter, Garabík, Radovan, Grobelnik, Marko, García-Mateo, Carmen, van Genabith, Josef, Hajič, Jan, Hernáez, Inma, Judge, John, Koeva, Svetla, Krek, Simon, Krstev, Cvetana, Lindén, Krister, Magnini, Bernardo, Mariani, Joseph, McNaught, John, Melero, Maite, Monachini, Monica, Moreno, Asunción, Odijk, J.E.J.M., Ogrodniczuk, Maciej, Pęzik, Piotr, Piperidis, Stelios, Przepiórkowski, Adam, Rögnvaldsson, Eiríkur, Rosner, Mike, Sandford Pedersen, Bolette, Skadiņa, Inguna, De Smedt, Koenraad, Tadić, Marko, Thompson, Paul, Tufiş, Dan, Váradi, Tamás, Vasiļjevs, Andrejs, Vider, Kadri, Zabarskaitė, Jolanta, LS OZ Taal en spraaktechnologie, ILS LLI, Calzolari, Nicoletta, Choukri, Khalid, Declerck, Thierry, Loftsson, Hrafn, Maegaard, Bente, Mariani, Joseph, Moreno, Asunción, LS OZ Taal en spraaktechnologie, ILS LLI, University of Helsinki, Department of Modern Languages 2010-2017, Language Technology, Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions, Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla, Moreno, Asuncion, Odijk, Jan, and Piperidis, Stelios
- Subjects
Linguistics and Language ,Strategic impact ,Operations research ,META-NET ,LR National/International Projects ,Infrastructural/Policy Issues ,Multilinguality ,Machine Translation ,English language -- Machine translating ,02 engineering and technology ,Library and Information Sciences ,Language and Linguistics ,Education ,machine translation ,Politics ,Order (exchange) ,Traducció automàtica ,0202 electrical engineering, electronic engineering, information engineering ,Regional science ,Computer Science (miscellaneous) ,6121 Languages ,META-SHARE ,Sociology ,Multilingual technologies ,Anglès -- Traducció automàtica ,natural language processing ,060201 languages & linguistics ,International level ,Computer. Automation ,multilingual technologies ,Linguistics ,06 humanities and the arts ,113 Computer and information sciences ,language resources ,Work (electrical) ,0602 languages and literature ,Enginyeria de la telecomunicació::Processament del senyal::Processament de la parla i del senyal acústic [Àrees temàtiques de la UPC] ,020201 artificial intelligence & image processing ,Informàtica::Intel·ligència artificial [Àrees temàtiques de la UPC] ,multilinguality ,language technology ,Language technology ,Machine translation ,Language resources ,Machine translating - Abstract
This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of fundingfor langauge technology topics. This paper documents the initiative’s work throughout Europe in order to boost progress and innovation in our field.
- Published
- 2014
7. Efficient neural speech synthesis
- Author
-
Vainer, Jan, Dušek, Ondřej, and Hajič, Jan
- Subjects
text-to-speech ,deep learning ,hluboké učení ,speech synthesis ,zpracování přirozeného jazyka ,natural language processing ,syntéza řeči - Abstract
While recent neural sequence-to-sequence models have greatly improved the quality of speech synthesis, there has not been a system capable of fast training, fast inference and high-quality audio synthesis at the same time. In this the- sis, we present a neural speech synthesis system capable of high-quality faster- than-real-time spectrogram synthesis, with low requirements on computational resources and fast training time. Our system consists of a teacher and a student network. The teacher model is used to extract alignment between the text to synthesize and the corresponding spectrogram. The student uses the alignments from the teacher model to synthesize mel-scale spectrograms from a phonemic representation of the input text efficiently. Both systems utilize simple convo- lutional layers. We train both systems on the english LJSpeech dataset. The quality of samples synthesized by our model was rated significantly higher than baseline models. Our model can be efficiently trained on a single GPU and can run in real time even on a CPU. 1
- Published
- 2020
8. Vícejazyčná databáze kolokací
- Author
-
Helcl, Jindřich, Hajič, Jan, and Mareček, David
- Subjects
clustering ,webový přístup ,databases ,statistické metody ,web access ,Collocations ,Czech ,statistical methods ,databáze ,angličtina ,zpracování přirozeného jazyka ,čeština ,natural language processing ,Kolokace ,English - Abstract
Collocations are groups of words which are co-occurring more often than appearing separately. They also include phrases that give a new meaning to a group of unrelated words. This thesis is aimed to find collocations in large data and to create a database that allows their retrieval. The Pointwise Mutual Information, a value based on word frequency, is computed for finding the collocations. Words with the highest value of PMI are considered candidates for good collocations. Chosen collocations are stored in a database in a format that allows searching with Apache Lucene. A part of the thesis is to create a Web user interface as a quick and easy way to search collocations. If this service is fast enough and the collocations are good, translators will be able to use it for finding proper equivalents in the target language. Students of a foreign language will also be able to use it to extend their vocabulary. Such database will be created independently in several languages including Czech and English. Powered by TCPDF (www.tcpdf.org)
- Published
- 2014
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.