Author: "Missen, Malik Muhammad Saad" / Topic: opinion mining - Searchworks@Jio Institute Digital Library Search Results

1. A systematic study on the role of SentiWordNet in opinion mining

Author: Husnain, Mujtaba, Missen, Malik Muhammad Saad, Akhtar, Nadeem, Coustaty, Mickaël, Mumtaz, Shahzad, and Prasath, V. B. Surya
Published: 2021
Full Text: View/download PDF

2. Combining Granularity-based Topic-Dependent and Topic-Independent Evidences for Opinion Detection

Author: Missen, Malik Muhammad Saad, Institut de recherche en informatique de Toulouse (IRIT), Université Toulouse 1 Capitole (UT1), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Université Toulouse - Jean Jaurès (UT2J)-Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Centre National de la Recherche Scientifique (CNRS)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées, Université Paul Sabatier - Toulouse III, and Mohand Boughanem(Bougha@irit.fr)
Subjects: Sentiment Detection, Opinion Detection, TREC blog Track, [INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC], Opinion Mining, Entity Ranking
Abstract: Opinion mining is a sub-discipline within Information Retrieval (IR) and Computational Linguistics. It refers to the computational techniques for extracting, classifying, understanding, and assessing the opinions expressed in various online sources such as news articles, social media comments, and other user-generated content. It is also known by many other terms, such as opinion finding, opinion detection, sentiment analysis, sentiment classification, and polarity detection. Defined more specifically and simply, opinion mining is the task of retrieving opinions on an issue that the user expresses in the form of a query. There are many problems and challenges associated with the field of opinion mining. In this thesis, we focus on some major problems of opinion mining. One of the foremost challenges of opinion mining is to find opinions specifically relevant to the given topic (query). A document can contain information about many topics at a time, and it may contain opinionated text about each of the topics being discussed or about only a few of them. Therefore, it becomes very important to choose topic-relevant document segments with their corresponding opinions. We approach this problem at two granularity levels, sentences and passages. In our first approach, at the sentence level, we use the semantic relations of WordNet to find this opinion-topic association. In our second approach, at the passage level, we use a more robust IR model (i.e., a language model) to address this problem. The basic idea behind both contributions for opinion-topic association is that if a document contains more opinionated topic-relevant textual segments (i.e., sentences or passages), then it is more opinionated than a document with fewer opinionated topic-relevant textual segments. Most machine-learning-based approaches to opinion mining are domain-dependent (i.e., their performance varies from domain to domain). 
On the other hand, a domain- or topic-independent approach is more general and can sustain its effectiveness across different domains. However, topic-independent approaches generally suffer from poor performance. It is a big challenge in the field of opinion mining to develop an approach that is both effective and general at the same time. Our contributions in this thesis include the development of such an approach, which combines simple heuristics-based topic-independent and topic-dependent features to find opinionated documents. Entity-based opinion mining aims at identifying the relevant entities for a given topic and extracting the opinions associated with them from a set of textual documents. However, identifying entities and determining their relevance is itself a big challenge for this task. In this thesis, we address this challenge by proposing an approach that takes into account information from both the current news article and past relevant articles in order to detect the most important entities in the current news. We look at different features at both the local (document) and global (data collection) levels to analyse their importance in assessing the relevance of an entity. Experimentation with a machine learning algorithm shows the effectiveness of our approach by giving significant improvements over the baseline. In addition to this, we also present the idea of a framework for opinion mining related tasks. This framework exploits the content and social evidence of the blogosphere for the tasks of opinion finding, opinion prediction, and multidimensional ranking. This preliminary contribution lays the foundations for our future work. Evaluation of our approaches includes the use of the TREC Blog 2006 data collection and the TREC Novelty track 2004 data collection. 
Most of the evaluations were performed under the framework of the TREC Blog track.; Opinion mining, a sub-discipline of information retrieval (IR) and computational linguistics, refers to the computational techniques for extracting, classifying, understanding, and evaluating the opinions expressed in various online news sources, social media comments, and other user-generated content. It is also known by many other terms, such as opinion finding, opinion detection, sentiment analysis, sentiment classification, and polarity detection. Defined more specifically and simply, opinion mining is the task of retrieving opinions that match an information need expressed by the user in the form of a query. There are many problems and challenges associated with opinion mining. In this thesis, we focus on some problems of opinion analysis. One of the major challenges of opinion mining is to find opinions specifically about the given topic (query). A document can contain information about many topics at once, and it may contain opinionated text about each of the topics or about only a few of them. Therefore, it becomes very important to choose the topic-relevant document segments with their corresponding opinions. We approach this problem at two levels of granularity, sentences and passages. In our first approach, at the sentence level, we use the semantic relations of WordNet to find this association between topic and opinion. In our second approach, at the passage level, we use a more robust IR model (i.e., the language model) to address this problem. 
The basic idea behind both contributions for opinion-topic association is that if a document contains more opinionated, topic-relevant textual segments (sentences or passages), it is more opinionated than a document with fewer opinionated, topic-relevant textual segments. Most machine-learning-based approaches to opinion mining are domain-dependent (i.e., their performance varies from one domain to another). On the other hand, a domain- or topic-independent approach is more general and can maintain its effectiveness across different domains. However, domain-independent approaches generally suffer from poor performance. It is a great challenge in the field of opinion mining to develop an approach that is both effective and general. Our contributions in this thesis include the development of an approach that uses simple heuristic functions to find opinionated documents. Entity-based opinion mining is becoming very popular among researchers in the IR community. It aims to identify the relevant entities for a given topic and to extract the opinions associated with them from a set of textual documents. However, identifying the entities and determining their relevance is already a difficult task. We propose a system that takes into account information from both the current news article and relevant past articles in order to detect the most important entities in the current news. In addition, we also present our framework for opinion analysis and related tasks. This framework is based on the content evidence and social evidence of the blogosphere for the tasks of opinion finding, opinion prediction, and multidimensional ranking. This preliminary contribution lays the foundations for our future work. 
The evaluation of our methods includes the use of the TREC Blog 2006 collection and the TREC Novelty track 2004 collection. Most of the evaluations were carried out within the framework of the TREC Blog track.
Published: 2011

3. OpinionML—Opinion Markup Language for Sentiment Representation.

Author: Missen, Malik Muhammad Saad, Coustaty, Mickaël, Choi, Gyu Sang, Alotaibi, Fahd Saleh, Akhtar, Nadeem, Jhandir, Muhammad Zeeshan, Prasath, V. B. Surya, Salamat, Nadeem, and Husnain, Mujtaba
Subjects: *PRICE markup, *MARL, *SCIENTIFIC community, *POLITICAL science, *DEFINITIONS, *IMAGE segmentation
Abstract: It is the age of the social web, where people express themselves by giving their opinions about various issues, from their personal lives to the world's political issues. This process generates a lot of opinion data on the web that can be processed for valuable information, and therefore, semantic annotation of opinions becomes an important task. Unfortunately, existing opinion annotation schemes have failed to meet annotation challenges and do not even adhere to the basic definition of opinion. Opinion holders, topical features, and temporal expressions are major components of an opinion that remain ignored in existing annotation schemes. In this work, we propose OpinionML, a new markup language that aims to address the issues that existing opinion markup languages fail to resolve. We present a detailed discussion of existing annotation schemes and their associated problems. We argue that OpinionML is more robust, more flexible, and easier to use for annotating opinion data. Its modular approach to implementing a logical model provides a flexible and easier model of annotation. OpinionML can be considered a step towards "information symmetry". It is an effort towards consistent sentiment annotations across the research community. We perform experiments to demonstrate the robustness of the proposed OpinionML, and the results demonstrate its capability of retrieving significant components of opinion segments. We also propose an OpinionML ontology in an effort to make OpinionML more interoperable. The proposed ontology is more complete than existing opinion ontologies such as Marl and Onyx. A comprehensive comparison of the proposed ontology with the existing sentiment ontologies Marl and Onyx proves its worth. [ABSTRACT FROM AUTHOR]
Published: 2019
Full Text: View/download PDF
