337 results on '"sequence mining"'
Search Results
52. Behavioral Constraint Template-Based Sequence Classification
- Author
-
De Smedt, Johannes, Deeva, Galina, De Weerdt, Jochen, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Ceci, Michelangelo, editor, Hollmén, Jaakko, editor, Todorovski, Ljupčo, editor, Vens, Celine, editor, and Džeroski, Sašo, editor
- Published
- 2017
- Full Text
- View/download PDF
53. Association Rule Learning and Frequent Sequence Mining of Cancer Diagnoses in New York State
- Author
-
Wang, Yu, Wang, Fusheng, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Begoli, Edmon, editor, Wang, Fusheng, editor, and Luo, Gang, editor
- Published
- 2017
- Full Text
- View/download PDF
54. Identifying Productive Inquiry in Virtual Labs Using Sequence Mining
- Author
-
Perez, Sarah, Massey-Allard, Jonathan, Butler, Deborah, Ives, Joss, Bonn, Doug, Yee, Nikki, Roll, Ido, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, André, Elisabeth, editor, Baker, Ryan, editor, Hu, Xiangen, editor, Rodrigo, Ma. Mercedes T., editor, and du Boulay, Benedict, editor
- Published
- 2017
- Full Text
- View/download PDF
55. Anomaly Detection and Structural Analysis in Industrial Production Environments
- Author
-
Atzmueller, Martin, Arnu, David, Schmidt, Andreas, Haber, Peter, editor, Lampoltshammer, Thomas, editor, and Mayr, Manfred, editor
- Published
- 2017
- Full Text
- View/download PDF
56. MIMVOGUE: modeling Indian music using a variable order gapped HMM.
- Author
-
Mor, Bhavya, Garhwal, Sunita, and Kumar, Ajay
- Subjects
HIDDEN Markov models ,MUSICAL composition - Abstract
The computer-assisted music composition is an active research area since mid-1900. In this paper, we have applied the VOGUE model for designing musical sequence of bandish notations of raga Bhairav, a classical Indian music. Variable Order and Gapped hidden Markov model for unstructured elements can capture variable length dependencies with variable gaps in sequential data. In most of raga pattern, a particular pattern repeats itself which may be separated by variable length gaps. VOGUE mines the frequent patterns in raga having different length gaps. These mined patterns are used to model VOGUE for Indian music ragas. Furthermore, we analyzed the benefits of VOGUE model over the standard HMM. To the best of author's knowledge, this is the very first attempt to model Indian classical music with variable order gapped HMM. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
57. What’s Next? A Recommendation System for Industrial Training
- Author
-
Rajiv Srivastava, Girish Keshav Palshikar, Saheb Chaurasia, and Arati Dixit
- Subjects
Personalized recommendation ,Sequence matching ,Sequence mining ,Industrial training ,Information technology ,T58.5-58.64 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Abstract Continuous training is crucial for creating and maintaining the right skill-profile for the industrial organization’s workforce. There is a tremendous variety in the available trainings within an organization: technical, project management, quality, leadership, domain-specific, soft-skills, etc. Hence it is important to assist the employee in choosing the best trainings, which perfectly suits her background, project needs and career goals. In this paper, we focus on algorithms for training recommendation in an industrial setting. We formalize the problem of next training recommendation, taking into account the employee’s training and work history. We present several new unsupervised sequence mining algorithms to mine the past trainings data from the organization for arriving at personalized next training recommendation. Using the real-life data about trainings of 118,587 employees over 5019 distinct trainings from a large multi-national IT organization, we show that these algorithms outperform several standard recommendation engine algorithms as well as those based on standard sequence mining algorithms.
- Published
- 2018
- Full Text
- View/download PDF
58. ST Sequence Miner: visualization and mining of spatio-temporal event sequences.
- Author
-
Koseoglu, Baran, Kaya, Erdem, Balcisoy, Selim, and Bozkaya, Burcin
- Subjects
- *
SEQUENTIAL pattern mining , *VISUAL analytics , *SEQUENCE analysis , *DATA visualization , *ALGORITHMS - Abstract
As a promising field of research, event sequence analysis seems to assist in facilitating clear reasoning behind human decisions by mining reality behind the sequential actions. Mining frequent patterns from event sequences has proved to be promising in extracting actionable insights, which plays an important role in many application domains. Much of the related work challenges the problem solely from the temporal perspective omitting the information that could be gained from the spatial part. This could be in part due to the fact that analysis of event sequences with references to both time and space is attributed as a challenging task due to the additional variance in the data introduced by the spatial aspect. We propose a visual analytics approach that incorporates spatio-temporal pattern extraction leveraging an extended sequential pattern mining algorithm and a pattern discovery guidance mechanism operating on geographic query and selection capabilities. As an implementation of our approach, we introduce a visual analytics tool, namely ST Sequence Miner, enabling event pattern exploration in time-location space. We evaluate our approach over a credit card transaction dataset by adopting case study methodology. Our study unveils that patterns mined from event sequences can better explain possible relationships with proper visualization of time-location data. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
59. Efficient Mining of Outlying Sequence Patterns for Analyzing Outlierness of Sequence Data.
- Author
-
TINGTING WANG, LEI DUAN, GUOZHU DONG, and ZHIFENG BAO
- Subjects
SEQUENTIAL pattern mining ,HEURISTIC ,DATA - Abstract
Recently, a lot of research work has been proposed in different domains to detect outliers and analyze the outlierness of outliers for relational data. However, while sequence data is ubiquitous in real life, analyzing the outlierness for sequence data has not received enough attention. In this article, we study the problem of mining outlying sequence patterns in sequence data addressing the question: given a query sequence s in a sequence dataset D, the objective is to discover sequence patterns that will indicate the most unusualness (i.e., outlierness) of s compared against other sequences. Technically, we use the rank defined by the average probabilistic strength (aps) of a sequence pattern in a sequence to measure the outlierness of the sequence. Then a minimal sequence pattern where the query sequence is ranked the highest is defined as an outlying sequence pattern. To address the above problem, we present OSPMiner, a heuristic method that computes aps by incorporating several pruning techniques. Our empirical study using both real and synthetic data demonstrates that OSPMiner is effective and efficient. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
60. Hybridization of population-based ant colony optimization via data mining.
- Author
-
Adak, Zeynep and Demiriz, Ayhan
- Subjects
- *
DATA mining , *QUADRATIC assignment problem , *TRAVELING salesman problem , *ASSOCIATION rule mining , *PATTERNS (Mathematics) - Abstract
We propose a hybrid application of Population Based Ant Colony Optimization that uses a data mining procedure to wisely initialize the pheromone entries. Hybridization of metaheuristics with data mining techniques has been studied by several researchers in recent years. In this line of research, frequent patterns in a number of initial high-quality solutions are extracted to guide the subsequent iterations of an algorithm, which results in an improvement in solution quality and computational time. Our proposal possesses certain differences from and contributions to existing literature. Instead of one single run that incorporates both the main metaheuristic and the data mining module inside, we propose to carry out independent runs and collect elite sets over these trials. Another contribution is the way we use the knowledge gained from the application of the data mining module. The extracted knowledge is used to initialize the memory model in the algorithm rather than to construct new initial solutions. One additional contribution is the use of a path mining algorithm (a specific sequence mining algorithm) rather than Apriori-like association mining algorithms. Computational experiments, conducted both on symmetric Travelling Salesman Problem and symmetric/asymmetric Quadratic Assignment Problem instances, showed that our proposal produces significantly better results, and is more robust than pure applications of population-based ant colony optimization. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
61. Mining Closed Interesting Subspaces to Discover Conducive Living Environment of Migratory Animals
- Author
-
Sirisha, G. N. V. G., Shashi, M., Kacprzyk, Janusz, Series editor, Das, Swagatam, editor, Pal, Tandra, editor, Kar, Samarjit, editor, Satapathy, Suresh Chandra, editor, and Mandal, Jyotsna Kumar, editor
- Published
- 2016
- Full Text
- View/download PDF
62. An Efficient Algorithm for Mining Frequent Sequence with Constraint Programming
- Author
-
Aoga, John O. R., Guns, Tias, Schaus, Pierre, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Frasconi, Paolo, editor, Landwehr, Niels, editor, Manco, Giuseppe, editor, and Vreeken, Jilles, editor
- Published
- 2016
- Full Text
- View/download PDF
63. Automated Setting of Bus Schedule Coverage Using Unsupervised Machine Learning
- Author
-
Khiari, Jihed, Moreira-Matias, Luis, Cerqueira, Vitor, Cats, Oded, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Bailey, James, editor, Khan, Latifur, editor, Washio, Takashi, editor, Dobbie, Gill, editor, Huang, Joshua Zhexue, editor, and Wang, Ruili, editor
- Published
- 2016
- Full Text
- View/download PDF
64. Topic Segmentation of Web Documents with Automatic Cue Phrase Identification and BLSTM-CNN
- Author
-
Wang, Liang, Li, Sujian, Xiao, Xinyan, Lyu, Yajuan, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Lin, Chin-Yew, editor, Xue, Nianwen, editor, Zhao, Dongyan, editor, Huang, Xuanjing, editor, and Feng, Yansong, editor
- Published
- 2016
- Full Text
- View/download PDF
65. Continuous Recipe Selection Model Based on Cooking History
- Author
-
Yamamoto, Shuhei, Kando, Noriko, Satoh, Tetsuji, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Spiro, Emma, editor, and Ahn, Yong-Yeol, editor
- Published
- 2016
- Full Text
- View/download PDF
66. Using Temporal Association Rules for the Synthesis of Embodied Conversational Agents with a Specific Stance
- Author
-
Janssoone, Thomas, Clavel, Chloé, Bailly, Kévin, Richard, Gaël, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Traum, David, editor, Swartout, William, editor, Khooshabeh, Peter, editor, Kopp, Stefan, editor, Scherer, Stefan, editor, and Leuski, Anton, editor
- Published
- 2016
- Full Text
- View/download PDF
67. Phylogenetic review of the comb-tooth blenny genus Hypleurochilus in the northwest Atlantic and Gulf of Mexico.
- Author
-
Carter, Joshua E., Sporre, Megan A., and Eytan, Ron I.
- Subjects
- *
NUMBERS of species , *REEF fishes , *BIOGEOGRAPHY , *BIOMASS , *PHYLOGENY , *DRILLING platforms , *CHLOROPLAST DNA - Abstract
[Display omitted] • Hypleurochilus delimited into two clades and one lineage. • The phylogenetic relationships within Hypleurochilus reflect biogeographic breaks. • Single-locus delimitation fails to resolve recently diverged species. • First report of sister relationship between H. caudovittatus and H. multifilis. • Mined sequence data supports the hypothesis of a range expansion of H. aequipinnis. As some of the smallest vertebrates, yet largest producers of consumed reef biomass, cryptobenthic reef fishes serve a disproportionate role in reef ecosystems and are one of the most poorly understood groups of fish. The blenny genera Hypleurochilus and Parablennius are currently considered paraphyletic and the interrelationships of Parablennius have been the focus of recent phylogenetic studies. However, the interrelationships of Hypleurochilus remain understudied. This genus is transatlantically distributed and comprises 11 species with a convoluted taxonomic history. In this study, relationships for ten Hypleurochilus species are resolved using multi-locus nuclear and mtDNA sequence data, morphological data, and mined COI barcode data. Mitochondrial and nuclear sequence data from 61 individuals collected from the western Atlantic and northern Gulf of Mexico (N. GoM) delimit seven species into a temperate clade, a tropical clade, and a third distinct lineage. This lineage, herein referred to as H. cf. aequipinnis, may represent a species of Hypleurochilus whose range has expanded into the N. GoM. Inclusion of publicly available COI sequence for an additional three species provides further phylogenetic resolution. H. bananensis forms a new eastern Atlantic clade with H. cf. aequipinnis , providing further evidence for a western Atlantic range expansion. Single marker COI delimitation was unable to elucidate the relationships between H. springeri/H. pseudoaequipinnis and between H. multifilis/H. caudovittatus due to incomplete lineage sorting. Mitochondrial data are also unable to accurately resolve the placement of H. bermudensis. However, a comprehensive approach using multi-locus phylogenetic and species delimitation methods was able to resolve these relationships. While mining publicly available sequence data allowed for the inclusion of an increased number of species in the analysis and a more comprehensive phylogeny, it was not without drawbacks, as a handful of sequences are potentially mis-identified. Overall, we find that the recent divergence of some species within this genus and potential introgression events confound the results of single locus delimitation methods, yet a combination of single and multi-locus analyses has allowed for insights into the biogeography of this genus and uncovered a potential transatlantic range expansion. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
68. Pattern Mining and Machine Learning for Demographic Sequences
- Author
-
Ignatov, Dmitry I., Mitrofanova, Ekaterina, Muratova, Anna, Gizdatullin, Danil, Diniz Junqueira Barbosa, Simone, Series editor, Chen, Phoebe, Series editor, Du, Xiaoyong, Series editor, Filipe, Joaquim, Series editor, Kara, Orhun, Series editor, Liu, Ting, Series editor, Kotenko, Igor, Series editor, Sivalingam, Krishna M., Series editor, Washio, Takashi, Series editor, Klinov, Pavel, editor, and Mouromtsev, Dmitry, editor
- Published
- 2015
- Full Text
- View/download PDF
69. Ramex: A Sequence Mining Algorithm Using Poly-trees
- Author
-
Cavique, Luís, Kacprzyk, Janusz, Series editor, Rocha, Alvaro, editor, Correia, Ana Maria, editor, Costanzo, Sandra, editor, and Reis, Luis Paulo, editor
- Published
- 2015
- Full Text
- View/download PDF
70. Mining Chinese Polarity Shifters
- Author
-
Xu, Ge, Huang, Churen, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Lu, Qin, editor, and Gao, Helena Hong, editor
- Published
- 2015
- Full Text
- View/download PDF
71. Improving Marketing Interactions by Mining Sequences
- Author
-
Sinha, Ritwik, Mehta, Sanket, Bohra, Tapan, Krishnan, Adit, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Wang, Jianyong, editor, Cellary, Wojciech, editor, Wang, Dingding, editor, Wang, Hua, editor, Chen, Shu-Ching, editor, Li, Tao, editor, and Zhang, Yanchun, editor
- Published
- 2015
- Full Text
- View/download PDF
72. Constraint-Based Sequence Mining Using Constraint Programming
- Author
-
Negrevergne, Benjamin, Guns, Tias, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, and Michel, Laurent, editor
- Published
- 2015
- Full Text
- View/download PDF
73. microTaboo: a general and practical solution to the k-disjoint problem
- Author
-
Mohammed Al-Jaff, Eric Sandström, and Manfred Grabherr
- Subjects
k-disjoint problem ,Software ,Sequence mining ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background A common challenge in bioinformatics is to identify short sub-sequences that are unique in a set of genomes or reference sequences, which can efficiently be achieved by k-mer (k consecutive nucleotides) counting. However, there are several areas that would benefit from a more stringent definition of “unique”, requiring that these sub-sequences of length W differ by more than k mismatches (i.e. a Hamming distance greater than k) from any other sub-sequence, which we term the k-disjoint problem. Examples include finding sequences unique to a pathogen for probe-based infection diagnostics; reducing off-target hits for re-sequencing or genome editing; detecting sequence (e.g. phage or viral) insertions; and multiple substitution mutations. Since both sensitivity and specificity are critical, an exhaustive, yet efficient solution is desirable. Results We present microTaboo, a method that allows for efficient and extensive sequence mining of unique (k-disjoint) sequences of up to 100 nucleotides in length. On a number of simulated and real data sets ranging from microbe- to mammalian-size genomes, we show that microTaboo is able to efficiently find all sub-sequences of a specified length W that do not occur within a threshold of k mismatches in any other sub-sequence. We exemplify that microTaboo has many practical applications, including point substitution detection, sequence insertion detection, padlock probe target search, and candidate CRISPR target mining. Conclusions microTaboo implements a solution to the k-disjoint problem in an alignment- and assembly free manner. microTaboo is available for Windows, Mac OS X, and Linux, running Java 7 and higher, under the GNU GPLv3 license, at: https://MohammedAlJaff.github.io/microTaboo
- Published
- 2017
- Full Text
- View/download PDF
74. A pattern growth-based sequential pattern mining algorithm called prefixSuffixSpan
- Author
-
Kenmogne Edith Belise, Tadmon Calvin, and Nkambou Roger
- Subjects
sequence mining ,sequential pattern ,pattern-growth direction ,pattern-growth ordering ,search space ,pruning ,partitioning ,Management information systems ,T58.6-58.62 - Abstract
Sequential pattern mining is an important data mining problem widely addressed by the data mining community, with a very large field of applications. The sequence pattern mining aims at extracting a set of attributes, shared across time among a large number of objects in a given database. The work presented in this paper is directed towards the general theoretical foundations of the pattern-growth approach. It helps indepth understanding of the pattern-growth approach, current status of provided solutions, and direction of research in this area. In this paper, this study is carried out on a particular class of pattern-growth algorithms for which patterns are grown by making grow either the current pattern prefix or the current pattern suffix from the same position at each growth-step. This study leads to a new algorithm called prefixSuffixSpan. Its correctness is proven and experimentations are performed.
- Published
- 2017
- Full Text
- View/download PDF
75. Efficient model selection for predictive pattern mining model by safe pattern pruning.
- Author
-
Yoshida T, Hanada H, Nakagawa K, Taji K, Tsuda K, and Takeuchi I
- Abstract
Predictive pattern mining is an approach used to construct prediction models when the input is represented by structured data, such as sets, graphs, and sequences. The main idea behind predictive pattern mining is to build a prediction model by considering unified inconsistent notation sub-structures, such as subsets, subgraphs, and subsequences (referred to as patterns), present in the structured data as features of the model. The primary challenge in predictive pattern mining lies in the exponential growth of the number of patterns with the complexity of the structured data. In this study, we propose the safe pattern pruning method to address the explosion of pattern numbers in predictive pattern mining. We also discuss how it can be effectively employed throughout the entire model building process in practical data analysis. To demonstrate the effectiveness of the proposed method, we conduct numerical experiments on regression and classification problems involving sets, graphs, and sequences., Competing Interests: The authors declare no competing interests., (© 2023 The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
76. Improving Customer Behaviour Prediction with the Item2Item model in Recommender Systems
- Author
-
T. Nguyen and P. Do
- Subjects
recommender systems ,sequence mining ,item2item ,Computer engineering. Computer hardware ,TK7885-7895 ,Systems engineering ,TA168 - Abstract
Recommender Systems are the most well-known applications in E-commerce sites. However, the trade-off between runtime and the accuracy in making recommendations is a big challenge. This work combines several traditional techniques to reduce the limitation of each single technique and exploits the Item2Item model to improve the prediction accuracy. As a case study, this paper focuses on user behaviour prediction in restaurant recommender systems and uses a public dataset including restaurant information and user sessions. Within this dataset, user behaviour can be discovered for the collaborative filtering, and restaurant information is extracted for the content-based filtering. The idea of the pre-trained word embedding in Natural Language Processing is utilized in the item-based collaborative filtering to find the similarity between restaurants based on user sessions. Experimental results have shown that the combination of these techniques makes valuable recommendations.
- Published
- 2018
- Full Text
- View/download PDF
77. DETECTING STRUCTURE IN CHAOS: A CUSTOMER PROCESS ANALYSIS METHOD.
- Author
-
Palvölgyi, Elisabeth Z.
- Subjects
CUSTOMER services ,PROCESS mining ,SERVICE design ,CLUSTER analysis (Statistics) ,BUSINESS process management - Abstract
Detecting typical patterns in customer processes is the precondition for gaining an understanding about customer issues and needs in the course of performing their processes. Such insights can be translated into customer-centric service offerings that provide added value by enabling customers to reach their process objectives more effectively and rapidly, and with less effort. However, customer processes performed in less restrictive environments are extremely heterogeneous, which makes them difficult to analyse. Current approaches deal with this issue by considering customer processes in large scope and low detail, or vice versa. However, both views are required to understand customer processes comprehensively. Therefore, we present a novel customer process analysis method capable of detecting the hidden activity-cluster structure of customer processes. Consequently, both the detailed level of process activities and the aggregated cluster level are available for customer process analysis, which increases the chances of detecting patterns in these heterogeneous processes. We apply the method to two datasets and evaluate the results' validity and utility. Moreover, we demonstrate that the method outperforms alternative solution technologies. Finally, we provide new insights into customer process theory. [ABSTRACT FROM AUTHOR]
- Published
- 2018
78. An Exploratory Approach for Understanding Customer Behavior Processes Based on Clustering and Sequence Mining
- Author
-
Seret, Alex, vanden Broucke, Seppe K. L. M., Baesens, Bart, Vanthienen, Jan, van der Aalst, Wil, Series editor, Mylopoulos, John, Series editor, Rosemann, Michael, Series editor, Shaw, Michael J., Series editor, Szyperski, Clemens, Series editor, Lohmann, Niels, editor, Song, Minseok, editor, and Wohed, Petia, editor
- Published
- 2014
- Full Text
- View/download PDF
79. Performance Analysis of Asynchronous Periodic Pattern Mining Algorithms
- Author
-
Sirisha, G. N. V. G., Mogalla, Shashi, Raju, G. V. Padma, Kacprzyk, Janusz, Series editor, Satapathy, Suresh Chandra, editor, Avadhani, P. S., editor, Udgata, Siba K., editor, and Lakshminarayana, Sadasivuni, editor
- Published
- 2014
- Full Text
- View/download PDF
80. A Differential Approach for Identifying Important Student Learning Behavior Patterns with Evolving Usage over Time
- Author
-
Kinnebrew, John S., Mack, Daniel L. C., Biswas, Gautam, Chang, Chih-Kai, Goebel, Randy, Series editor, Tanaka, Yuzuru, Series editor, Wahlster, Wolfgang, Series editor, Peng, Wen-Chih, editor, Wang, Haixun, editor, Bailey, James, editor, Tseng, Vincent S., editor, Ho, Tu Bao, editor, Zhou, Zhi-Hua, editor, and Chen, Arbee L.P., editor
- Published
- 2014
- Full Text
- View/download PDF
81. Educational Data Mining for Analysis of Students’ Solutions
- Author
-
Vaculík, Karel, Nezvalová, Leona, Popelínský, Luboš, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, Series editor, Tanaka, Yuzuru, Series editor, Wahlster, Wolfgang, Series editor, Siekmann, Jörg, Series editor, Agre, Gennady, editor, Hitzler, Pascal, editor, Krisnadhi, Adila A., editor, and Kuznetsov, Sergei O., editor
- Published
- 2014
- Full Text
- View/download PDF
82. From Non-verbal Signals Sequence Mining to Bayesian Networks for Interpersonal Attitudes Expression
- Author
-
Chollet, Mathieu, Ochs, Magalie, Pelachaud, Catherine, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Kobsa, Alfred, Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, Series editor, Tanaka, Yuzuru, Series editor, Wahlster, Wolfgang, Series editor, Siekmann, Jörg, Series editor, Bickmore, Timothy, editor, Marsella, Stacy, editor, and Sidner, Candace, editor
- Published
- 2014
- Full Text
- View/download PDF
83. Hybrid ASP-based Approach to Pattern Mining.
- Author
-
PARAMONOV, SERGEY, STEPANOVA, DARIA, and MIETTINEN, PAULI
- Subjects
LOGIC programming ,DATA mining ,AUTOMATIC extracting (Information science) ,RULE-based programming ,SEQUENTIAL pattern mining - Abstract
Detecting small sets of relevant patterns from a given data set is a central challenge in data mining. The relevance of a pattern is based on user-provided criteria; typically, all patterns that satisfy certain criteria are considered relevant. Rule-based languages like answer set programming (ASP) seem well suited for specifying such criteria in a form of constraints. Although progress has been made, on the one hand, on solving individual mining problems and, on the other hand, developing generic mining systems, the existing methods focus either on scalability or on generality. In this paper, we make steps toward combining local (frequency, size, and cost) and global (various condensed representations like maximal, closed, and skyline) constraints in a generic and efficient way. We present a hybrid approach for itemset, sequence, and graph mining which exploits dedicated highly optimized mining systems to detect frequent patterns and then filters the results using declarative ASP. To further demonstrate the generic nature of our hybrid framework, we apply it to a problem of approximately tiling a database. Experiments on real-world data sets show the effectiveness of the proposed method and computational gains for itemset, sequence, and graph mining, as well as approximate tiling. Under consideration in Theory and Practice of Logic Programming. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
84. An efficient pixel clustering-based method for mining spatial sequential patterns from serial remote sensing images.
- Author
-
Wu, Xiaozhu and Zhang, Ximei
- Subjects
- *
REMOTE sensing , *PIXELS , *SEQUENTIAL analysis , *URBAN growth , *RUN-length encoding - Abstract
Abstract The accumulation of serial remote sensing images provides plentiful data for discovering sequential spatial patterns in various fields such as agricultural monitoring, urban development, and vegetation cover. Otherwise, traditional sequential pattern-mining algorithms cannot be directly or efficiently applied to remote sensing images. In this study, we propose a pixel clustering-based method to improve the efficiency of mining spatial sequential patterns from raster serial remote sensing images (SRSI). Firstly, the images are compressed by using the Run-Length coding schema. Then, pixels with identical sequences are clustered by means of the Run-length code-based spatial overlay operation. Finally, a pruning strategy is proposed, to extend the prefixSpan algorithm to skip unnecessary database scanning when mining from pixel groups. The experimental results indicate that the method presented in this paper could extract spatial sequential patterns from SRSI efficiently. Although accurate support rates for the patterns may not be obtained, our method could ensure that all patterns are extracted with a lower time cost. Highlights •A Run-Length Coding based pixels clustering method is proposed. •A pruning strategy to extend prefixSpan algorithm is proposed. •The proposed method is efficient for mining spatial sequential patterns. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
85. Estimating the selectivity of LIKE queries using pattern-based histograms.
- Author
-
AYTİMUR, Mehmet and ÇAKMAK, Ali
- Subjects
- *
HISTOGRAMS , *COST estimates , *DATABASE management , *SQL , *DATA mining - Abstract
Accurate cost and time estimation of a query is one of the major success indicators for database management systems. SQL allows the expression of flexible queries on text-formatted data. The LIKE operator is used to search for a specified pattern (e.g., LIKE "luck%") in a string database. It is vital to estimate the selectivity of such flexible predicates for the query optimizer to choose an efficient execution plan. In this paper, we study the problem of estimating the selectivity of a LIKE query predicate over a bag of strings. We propose a new type of pattern-based histogram structure to summarize the data distribution in a particular column. More specifically, we first mine sequential patterns over a given string database and then construct a special histogram out of the mined patterns. During query optimization time, pattern-based histograms are exploited to estimate the selectivity of a LIKE predicate. The experimental results on a real dataset from DBLP show that the proposed technique outperforms the state of the art for generic LIKE queries like %s1%s2%...%sn% where si represents one or more characters. What is more, the proposed histogram structure requires more than two orders of magnitude smaller memory space, and the estimation time is almost an order of magnitude less in comparison to the state of the art. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
86. Applying Learning Analytics to Detect Sequences of Actions and Common Errors in a Geometry Game
- Author
-
Manuel J. Gomez, José A. Ruipérez-Valiente, Pedro A. Martínez, and Yoon Jeon Kim
- Subjects
educational games ,learning analytics ,game-based assessment ,sequence mining ,visualization dashboard ,Chemical technology ,TP1-1185 - Abstract
Games have become one of the most popular activities across cultures and ages. There is ample evidence that supports the benefits of using games for learning and assessment. However, incorporating game activities as part of the curriculum in schools remains limited. Some of the barriers for broader adoption in classrooms is the lack of actionable assessment data, the fact that teachers often do not have a clear sense of how students are interacting with the game, and it is unclear if the gameplay is leading to productive learning. To address this gap, we seek to provide sequence and process mining metrics to teachers that are easily interpretable and actionable. More specifically, we build our work on top of Shadowspect, a three-dimensional geometry game that has been developed to measure geometry skills as well other cognitive and noncognitive skills. We use data from its implementation across schools in the U.S. to implement two sequence and process mining metrics in an interactive dashboard for teachers. The final objective is to facilitate that teachers can understand the sequence of actions and common errors of students using Shadowspect so they can better understand the process, make proper assessment, and conduct personalized interventions when appropriate.
- Published
- 2021
- Full Text
- View/download PDF
87. EMOSS: An Efficient Algorithm to Hide Sequential Patterns
- Author
-
Olya Sadat Behbahani, Mir Mohsen Pedram, Kambiz Badie, and Babak Rahbarinia
- Subjects
data mining ,sequence mining ,knowledge hiding ,sequential pattern ,Information technology ,T58.5-58.64 ,Telecommunication ,TK5101-6720 ,Electronic computers. Computer science ,QA75.5-76.95 - Abstract
Nowadays data mining is the way of extracting hidden knowledge from raw data whereas sequence mining aims to find sequential patterns that are frequent in the database, so publishing these data may lead to the disclosure of private information about organizations or individuals. Knowledge hiding is the process of hiding sensitive knowledge extracted previously from the database, to ensure that no abuse will be caused. This paper addresses the problem of sequential pattern hiding and proposes an efficient algorithm which uses a multi-objective approach to overcome the problem of sequence hiding as well as maintaining database fidelity as much as possible. It also shows that the proposed algorithm outperforms existing methods in terms of both speed and memory usage.
- Published
- 2015
88. Discovering Human Activities from Binary Data in Smart Homes
- Author
-
Mohamed Eldib, Wilfried Philips, and Hamid Aghajan
- Subjects
human activity discovery ,smart homes ,health monitoring ,clustering ,unsupervised learning ,sequence mining ,Chemical technology ,TP1-1185 - Abstract
With the rapid development in sensing technology, data mining, and machine learning fields for human health monitoring, it became possible to enable monitoring of personal motion and vital signs in a manner that minimizes the disruption of an individual’s daily routine and assist individuals with difficulties to live independently at home. A primary difficulty that researchers confront is acquiring an adequate amount of labeled data for model training and validation purposes. Therefore, activity discovery handles the problem that activity labels are not available using approaches based on sequence mining and clustering. In this paper, we introduce an unsupervised method for discovering activities from a network of motion detectors in a smart home setting. First, we present an intra-day clustering algorithm to find frequent sequential patterns within a day. As a second step, we present an inter-day clustering algorithm to find the common frequent patterns between days. Furthermore, we refine the patterns to have more compressed and defined cluster characterizations. Finally, we track the occurrences of various regular routines to monitor the functional health in an individual’s patterns and lifestyle. We evaluate our methods on two public data sets captured in real-life settings from two apartments during seven-month and three-month periods.
- Published
- 2020
- Full Text
- View/download PDF
89. Smart Open-Ended Learning Environments That Support Learners Cognitive and Metacognitive Processes
- Author
-
Biswas, Gautam, Segedy, James R., Kinnebrew, John S., Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Holzinger, Andreas, editor, and Pasi, Gabriella, editor
- Published
- 2013
- Full Text
- View/download PDF
90. Visual Analytics Focusing on Space
- Author
-
Andrienko, Gennady, Andrienko, Natalia, Bak, Peter, Keim, Daniel, Wrobel, Stefan, Andrienko, Gennady, Andrienko, Natalia, Bak, Peter, Keim, Daniel, and Wrobel, Stefan
- Published
- 2013
- Full Text
- View/download PDF
91. Mining General Fuzzy Sequences Based on Fuzzy Ontology
- Author
-
Gholizadeh, Mehdi, Pedram, Mir Mohsen, Shanbezadeh, Jamshid, Ao, Sio Iong, editor, Castillo, Oscar, editor, and Huang, Xu, editor
- Published
- 2012
- Full Text
- View/download PDF
92. Bus Bunching Detection by Mining Sequences of Headway Deviations
- Author
-
Moreira-Matias, Luís, Ferreira, Carlos, Gama, João, Mendes-Moreira, João, de Sousa, Jorge Freire, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, and Perner, Petra, editor
- Published
- 2012
- Full Text
- View/download PDF
93. An Experiment with Asymmetric Algorithm: CPU Vs. GPU
- Author
-
Upadhyaya, Sujatha R., Toth, David, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Lee, Sang-goo, editor, Peng, Zhiyong, editor, Zhou, Xiaofang, editor, Moon, Yang-Sae, editor, Unland, Rainer, editor, and Yoo, Jaesoo, editor
- Published
- 2012
- Full Text
- View/download PDF
94. The relational, co-temporal, contemporaneous, and longitudinal dynamics of self-regulation for academic writing
- Author
-
Mohammed Saqr, Olga Viberg, and Ward Peeters
- Subjects
Social Psychology ,Higher education ,Academic writing ,Foreign language ,Learning analytics ,Context (language use) ,Information technology ,Education ,Epistemic network analysis ,Social network analysis ,Self-regulated learning ,Management of Technology and Innovation ,Media Technology ,Mathematics education ,Process mining ,Educational sciences ,Computer. Automation ,business.industry ,Learning environment ,Writing process ,Educational technology ,Linguistics ,Temporal networks ,T58.5-58.64 ,Sequence mining ,Psychology ,business - Abstract
Writing in an academic context often requires students in higher education to acquire a new set of skills while familiarising themselves with the goals, objectives and requirements of the new learning environment. Students’ ability to continuously self-regulate their writing process, therefore, is seen as a determining factor in their learning success. In order to study students’ self-regulated learning (SRL) behaviour, research has increasingly been tapping into learning analytics (LA) methods in recent years, making use of multimodal trace data that can be obtained from students writing and working online. Nevertheless, little is still known about the ways students apply and govern SRL processes for academic writing online, and about how their SRL behaviour might change over time. To provide new perspectives on the use of LA approaches to examine SRL, this study applied a range of methods to investigate what they could tell us about the evolution of SRL tactics and strategies on a relational, co-temporal, contemporaneous and longitudinal level. The data originates from a case study in which a private Facebook group served as an online collaboration space in a first-year academic writing course for foreign language majors of English. The findings show that learners use a range of SRL tactics to manage their writing tasks and that different tactic can take up key positions in this process over time. Several shifts could be observed in students’ behaviour, from mainly addressing content-specific topics to more form-specific and social ones. Our results have also demonstrated that different methods can be used to study the relational, co-temporal, contemporaneous, and longitudinal dynamics of self-regulation in this regard, demonstrating the wealth of insights LA methods can bring to the table.
- Published
- 2021
95. System Decomposition for Temporal Concept Analysis
- Author
-
Luper, David, Kazanci, Caner, Schramski, John, Arabnia, Hamid R., Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Andrews, Simon, editor, Polovina, Simon, editor, Hill, Richard, editor, and Akhgar, Babak, editor
- Published
- 2011
- Full Text
- View/download PDF
96. Extension of TMG Framework for Mining Frequent Subsequences
- Author
-
Hadzic, Fedja, Tan, Henry, Dillon, Tharam S., Kacprzyk, Janusz, editor, Hadzic, Fedja, Tan, Henry, and Dillon, Tharam S.
- Published
- 2010
- Full Text
- View/download PDF
97. A Novel Approach to Mining Travel Sequences Using Collections of Geotagged Photos
- Author
-
Kisilevich, Slava, Keim, Daniel, Rokach, Lior, Painho, Marco, editor, Santos, Maribel Yasmina, editor, and Pundt, Hardy, editor
- Published
- 2010
- Full Text
- View/download PDF
98. Mining Class-Correlated Patterns for Sequence Labeling
- Author
-
Hopf, Thomas, Kramer, Stefan, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Pfahringer, Bernhard, editor, Holmes, Geoff, editor, and Hoffmann, Achim, editor
- Published
- 2010
- Full Text
- View/download PDF
99. Research on the Method of Recomposing Learning Objects and Tools in Adaptive Learning Platform
- Author
-
Xie, Pan, Ye, Longmei, Huang, Yueming, Chen, Youwei, Lin, Liwu, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Zhang, Xiaopeng, editor, Zhong, Shaochun, editor, Pan, Zhigeng, editor, Wong, Kevin, editor, and Yun, Ruwei, editor
- Published
- 2010
- Full Text
- View/download PDF
100. An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns
- Author
-
Zheng, Zhigang, Zhao, Yanchang, Zuo, Ziye, Cao, Longbing, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Goebel, Randy, editor, Siekmann, Jörg, editor, Wahlster, Wolfgang, editor, Zaki, Mohammed J., editor, Yu, Jeffrey Xu, editor, Ravindran, B., editor, and Pudi, Vikram, editor
- Published
- 2010
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.