74 results on '"Sequence prediction"'
Search Results
2. LSTM-Based Framework for the Synthesis of Original Soundtracks
- Author
-
Yuanzhi Huo, Mengjie Jin, and Sicong You
- Subjects
Deep learning ,LSTM ,machine learning ,music synthesis ,RNN ,sequence prediction ,Electrical engineering. Electronics. Nuclear engineering ,TK1-9971 - Abstract
Recently, significant developments have been made in Long Short-Term Memory (LSTM) networks within the realm of synthesis music. Notwithstanding these advancements, several challenges persist warranting further research. Primarily, there exists an absence of dedicated research on the application of LSTM networks for the synthesis of Original Sound Tracks (OST). Secondly, in general, people can only judge whether the synthesized music meets their expectations based on the model output. However, due to the time-consuming of training the model may need to try multiple times to obtain successful training results. Moreover, the subjective of music quality evaluation relying on human perception, not only the result of model training. To address these multifaceted challenges, this paper concentrates specifically on OST and proposes a framework termed the OST Synthesis Framework (OSTSF) utilizing LSTM. This framework accepts various OST types as input, processed through LSTM to yield innovative OST. Additionally, a novel preprocessing algorithm is proposed to screen input OST elements such as notes and chords, enabling control over music type and quality before the training phase. This algorithm serves to mitigate training uncertainties and reduce situations that require repeated training. Besides, a postprocessing approach, leveraging mathematical formulations facilitates the evaluation of synthesis OST also proposed. This approach aims to quantify subjective evaluations, providing a more intuitive representation through scoring metrics. Experiment results reveal that the OSTSF synthesized OST received favorable rate among a cohort of 100 surveyed respondents attaining 78.8%, demonstrating the efficacy of the proposed framework in the realm of music synthesis utilizing LSTM.
- Published
- 2024
- Full Text
- View/download PDF
3. SeqPredNN: a neural network that generates protein sequences that fold into specified tertiary structures.
- Author
-
Lategan, F. Adriaan, Schreiber, Caroline, and Patterton, Hugh G.
- Subjects
- *
LIFE sciences , *PROTEIN structure prediction , *TERTIARY structure , *PROTEIN structure , *PROTEIN folding - Abstract
Background: The relationship between the sequence of a protein, its structure, and the resulting connection between its structure and function, is a foundational principle in biological science. Only recently has the computational prediction of protein structure based only on protein sequence been addressed effectively by AlphaFold, a neural network approach that can predict the majority of protein structures with X-ray crystallographic accuracy. A question that is now of acute relevance is the "inverse protein folding problem": predicting the sequence of a protein that folds into a specified structure. This will be of immense value in protein engineering and biotechnology, and will allow the design and expression of recombinant proteins that can, for instance, fold into specified structures as a scaffold for the attachment of recombinant antigens, or enzymes with modified or novel catalytic activities. Here we describe the development of SeqPredNN, a feed-forward neural network trained with X-ray crystallographic structures from the RCSB Protein Data Bank to predict the identity of amino acids in a protein structure using only the relative positions, orientations, and backbone dihedral angles of nearby residues. Results: We predict the sequence of a protein expected to fold into a specified structure and assess the accuracy of the prediction using both AlphaFold and RoseTTAFold to computationally generate the fold of the derived sequence. We show that the sequences predicted by SeqPredNN fold into a structure with a median TM-score of 0.638 when compared to the crystal structure according to AlphaFold predictions, yet these sequences are unique and only 28.4% identical to the sequence of the crystallized protein. Conclusions: We propose that SeqPredNN will be a valuable tool to generate proteins of defined structure for the design of novel biomaterials, pharmaceuticals, catalysts, and reporter systems. The low sequence identity of its predictions compared to the native sequence could prove useful for developing proteins with modified physical properties, such as water solubility and thermal stability. The speed and ease of use of SeqPredNN offers a significant advantage over physics-based protein design methods. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. Reconciling spatiotemporal conjunction with digital twin for sequential travel time prediction and intelligent routing
- Author
-
Chen, Claire Y. T., Sun, Edward W., and Lin, Yi-Bing
- Published
- 2024
- Full Text
- View/download PDF
5. Rainfall Prediction Using Machine Learning Models: Literature Survey
- Author
-
Hussein, Eslam A., Ghaziasgar, Mehrdad, Thron, Christopher, Vaccari, Mattia, Jafta, Yahlieel, Kacprzyk, Janusz, Series Editor, Alloghani, Mohamed, editor, Thron, Christopher, editor, and Subair, Saad, editor
- Published
- 2022
- Full Text
- View/download PDF
6. A Theory-based Deep-Learning Approach to Detecting Disinformation in Financial Social Media.
- Author
-
Chung, Wingyan, Zhang, Yinqiang, and Pan, Jia
- Subjects
DISINFORMATION ,HYACINTHOIDES ,KNOWLEDGE management ,MACHINE learning ,RECURRENT neural networks ,SOCIAL media - Abstract
The spreading of disinformation in social media threatens cybersecurity and undermines market efficiency. Detecting disinformation is challenging due to large volumes of social media content and a rapidly changing environment. This research developed and validated a theory-based, novel deep-learning approach (called TRNN) to disinformation detection. Grounded in social and psychological theories, TRNN uses deep-learning and data-centric augmentation to enhance disinformation detection in financial social media. Temporal and contextual information is encoded as specific knowledge about human-validated disinformation, which was identified from our unique collection of 745,139 financial social media messages about four U.S. high-tech company stocks and their fine-grained trading data. TRNN uses multiple series of long short-term memory (LSTM) recurrent neurons to learn dynamic and hidden patterns to support disinformation detection. Our experimental findings show that TRNN significantly outperformed widely-used machine learning techniques in terms of precision, recall, F-score and accuracy, achieving consistently better classification performance in disinformation detection. A case study of Apple Inc.'s stock price movement demonstrates the potential usability of TRNN for secure knowledge management. The research contributes to developing novel approach and model, producing new information systems artifacts and dataset, and providing empirical findings of detecting online disinformation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
7. Discovery of Real World Context Event Patterns for Smartphone Devices Using Conditional Random Fields
- Author
-
Piparia, Shraddha, Khan, Md Khorrom, Bryce, Renée, Kacprzyk, Janusz, Series Editor, Pal, Nikhil R., Advisory Editor, Bello Perez, Rafael, Advisory Editor, Corchado, Emilio S., Advisory Editor, Hagras, Hani, Advisory Editor, Kóczy, László T., Advisory Editor, Kreinovich, Vladik, Advisory Editor, Lin, Chin-Teng, Advisory Editor, Lu, Jie, Advisory Editor, Melin, Patricia, Advisory Editor, Nedjah, Nadia, Advisory Editor, Nguyen, Ngoc Thanh, Advisory Editor, Wang, Jun, Advisory Editor, and Latifi, Shahram, editor
- Published
- 2021
- Full Text
- View/download PDF
8. An AI-Driven Predictive Modelling Framework to Analyze and Visualize Blood Product Transactional Data for Reducing Blood Products’ Discards
- Author
-
Rad, Jaber, Cheng, Calvino, Quinn, Jason G., Abidi, Samina, Liwski, Robert, Abidi, Syed Sibte Raza, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Woeginger, Gerhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Michalowski, Martin, editor, and Moskovitch, Robert, editor
- Published
- 2020
- Full Text
- View/download PDF
9. Using Deep Learning to Predict User Behavior in the Online Discussion
- Author
-
Nikolay, Karpov, Alexander, Demidovskij, Barbosa, Simone Diniz Junqueira, Editorial Board Member, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Kotenko, Igor, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Alexandrov, Daniel A., editor, Boukhanovsky, Alexander V., editor, Chugunov, Andrei V., editor, Kabanov, Yury, editor, Koltsova, Olessia, editor, and Musabirov, Ilya, editor
- Published
- 2019
- Full Text
- View/download PDF
10. Anomalies Detecting in Medical Metrics Using Machine Learning Tools.
- Author
-
Melnykova, Nataliia, Kulievych, Roman, Vycluk, Yaroslav, Melnykova, Kateryna, and Melnykov, Volodymyr
- Subjects
MACHINE learning ,TIME series analysis ,INFORMATION storage & retrieval systems ,ANOMALY detection (Computer security) - Abstract
The article analyses the research related to the issue of detecting anomalies in current medical data. The work aims to develop an information system for detecting data anomalies in the format of time series, such as metrics, with the ability to visualize the results for expert evaluation. The process of detecting anomalies by machine learning tools for detecting anomalies in the flow data is investigated. A system for detecting anomalies in metrics using the HTM model is built. According to the model results, obtained satisfactory accuracy for detecting anomalies at standard network parameters, the estimation of anomalies differed clearly for different aggregation intervals. The OPF Client functionality was used to build the HTM model, which allowed to achieve speed and simplicity in its construction. The obtained results allow the expert to choose the best of the size of the intervals as well as the parameters of the models. This choice is necessary because the presence of the anomaly depends on expert knowledge in a particular field. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
11. 기계학습을 이용한 동영상 서비스의 검색 편의성 향상.
- Author
-
임연섭
- Subjects
STREAMING video & television ,INFORMATION services ,MACHINE learning ,ALGORITHMS ,VIDEOS - Abstract
Information search in video streaming services such as YouTube is replacing traditional information search services. To find desired detailed information in such a video, users should repeatedly navigate several points in the video, resulting in a waste of time and network traffic. In this paper, we propose a method to assist users in searching for information in a video by using DBSCAN clustering and LSTM. Our LSTM model is trained with a dataset that consists of user search sequences and their final target points categorized by DBSCAN clustering algorithm. Then, our proposed method utilizes the trained model to suggest an expected category for the user's desired target point based on a partial search sequence that can be collected at the beginning of the search. Our experiment results show that the proposed method successfully finds user destination points with 98% accuracy and 7s of the time difference by average. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
12. An interdisciplinary comparison of sequence modeling methods for next-element prediction.
- Author
-
Tax, Niek, Teinemaa, Irene, and van Zelst, Sebastiaan J.
- Subjects
- *
FORECASTING , *PROCESS mining , *ARTIFICIAL neural networks , *MARKOV processes , *NUCLEOTIDE sequence , *RECURRENT neural networks - Abstract
Data of sequential nature arise in many application domains in the form of, e.g., textual data, DNA sequences, and software execution traces. Different research disciplines have developed methods to learn sequence models from such datasets: (i) In the machine learning field methods such as (hidden) Markov models and recurrent neural networks have been developed and successfully applied to a wide range of tasks, (ii) in process mining process discovery methods aim to generate human-interpretable descriptive models, and (iii) in the grammar inference field the focus is on finding descriptive models in the form of formal grammars. Despite their different focuses, these fields share a common goal: learning a model that accurately captures the sequential behavior in the underlying data. Those sequence models are generative, i.e., they are able to predict what elements are likely to occur after a given incomplete sequence. So far, these fields have developed mainly in isolation from each other and no comparison exists. This paper presents an interdisciplinary experimental evaluation that compares sequence modeling methods on the task of next-element prediction on four real-life sequence datasets. The results indicate that machine learning methods, which generally do not aim at model interpretability, tend to outperform methods from the process mining and grammar inference fields in terms of accuracy. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
13. Latent and sequential prediction of the novel coronavirus epidemiological spread
- Author
-
Dario Aragona, Paola Velardi, Bardh Prenkaj, and Luca Podo
- Subjects
trajectory prediction ,Multivariate statistics ,2019-20 coronavirus outbreak ,Exploit ,Coronavirus disease 2019 (COVID-19) ,Computer science ,business.industry ,pandemic ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,COVID 19 ,sequential deep machine learning ,Ocean Engineering ,Machine learning ,computer.software_genre ,medicine.disease_cause ,Sequence prediction ,medicine ,Artificial intelligence ,business ,computer ,Coronavirus - Abstract
In this paper we present C o R o NN a a deep sequential framework for epidemic prediction that leverages a flexible combination of sequential and convolutional components to analyse the transmission of COVID-19 and, perhaps, other undiscovered viruses. Importantly, our methodology is generic and may be tailored to specific analysis goals. We exploit C o R o NN a to analyse the impact of various mobility containment policies on the pandemic using cumulative viral dissemination statistics with local demographic and movement data from several nations. Our experiments show that data on mobility has a significant, but delayed, impact on viral propagation. When compared to alternative frameworks that combine multivariate lagged predictors and basic LSTM models, C o R o NN a outperforms them. On the contrary, no technique based solely on lagged viral dissemination statistics can forecast daily cases.
- Published
- 2021
- Full Text
- View/download PDF
14. Ensemble human movement sequence prediction model with Apriori based Probability Tree Classifier (APTC) and Bagged J48 on Machine learning
- Author
-
S. Sridhar Raj and M. Nandhini
- Subjects
General Computer Science ,Computer science ,Trajectory analysis ,Human movement sequence prediction ,Disabled people ,02 engineering and technology ,Machine learning ,computer.software_genre ,C4.5 algorithm ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Spatial-temporal-social data ,Cluster analysis ,Data mining ,Ensemble forecasting ,business.industry ,020206 networking & telecommunications ,QA75.5-76.95 ,Tree diagram ,Electronic computers. Computer science ,A priori and a posteriori ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Classifier (UML) ,computer - Abstract
The accurate prediction of human movement trajectory has a variety of benefits for many applications such as optimizing nurse’s trajectory in a hospital, optimizing movements of old or disabled people to minimize their routine efforts, etc. To perform human movement prediction, large amount of historical positioning data from sensors has to be collected and mined. We analyzed different human sequential movement prediction approaches and their limitations. In this work, we propose a new classifier named Apriori based Probability Tree Classifier (APTC) which predicts the human movement sequence patterns in indoor environment. The APTC classifier is integrated into Bagged J48 Machine learning algorithm which results in an ensemble model to predict the future human movement patterns. The patterns are mined based on spatial, temporal and social data which add more accuracy to our prediction. Our model also performs clustering mechanism to detect the abnormal patterns.
- Published
- 2021
- Full Text
- View/download PDF
15. What is Next when Sequential Prediction Meets Implicitly Hard Interaction?
- Author
-
Kaixi Hu, Jianquan Liu, Xiaohui Tao, Lin Li, and Qing Xie
- Subjects
FOS: Computer and information sciences ,Focus (computing) ,Computer Science - Machine Learning ,Computer science ,Process (engineering) ,Generalization ,business.industry ,Interference (wave propagation) ,Base (topology) ,Machine learning ,computer.software_genre ,Machine Learning (cs.LG) ,Sequence prediction ,Sequence learning ,Artificial intelligence ,business ,computer ,Pattern learning - Abstract
Hard interaction learning between source sequences and their next targets is challenging, which exists in a myriad of sequential prediction tasks. During the training process, most existing methods focus on explicitly hard interactions caused by wrong responses. However, a model might conduct correct responses by capturing a subset of learnable patterns, which results in implicitly hard interactions with some unlearned patterns. As such, its generalization performance is weakened. The problem gets more serious in sequential prediction due to the interference of substantial similar candidate targets. To this end, we propose a Hardness Aware Interaction Learning framework (HAIL) that mainly consists of two base sequential learning networks and mutual exclusivity distillation (MED). The base networks are initialized differently to learn distinctive view patterns, thus gaining different training experiences. The experiences in the form of the unlikelihood of correct responses are drawn from each other by MED, which provides mutual exclusivity knowledge to figure out implicitly hard interactions. Moreover, we deduce that the unlikelihood essentially introduces additional gradients to push the pattern learning of correct responses. Our framework can be easily extended to more peer base networks. Evaluation is conducted on four datasets covering cyber and physical spaces. The experimental results demonstrate that our framework outperforms several state-of-the-art methods in terms of top-k based metrics., 10 pages; 4 figures; Proceedings of the 30th ACM International Conference on Information and Knowledge Management (CIKM '21), November 1--5, 2021, Virtual Event, QLD, Australia
- Published
- 2022
16. A hybrid Bayesian-frequentist predictive design for monitoring multi-stage clinical trials
- Author
-
Hayet Merabet and Zohra Djeridi
- Subjects
Statistics and Probability ,Index (economics) ,business.industry ,Bayesian probability ,Stopping rule ,020206 networking & telecommunications ,02 engineering and technology ,Machine learning ,computer.software_genre ,Interim analysis ,01 natural sciences ,Clinical trial ,Multi stage ,010104 statistics & probability ,Frequentist inference ,Sequence prediction ,Modeling and Simulation ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,0101 mathematics ,business ,computer ,Mathematics - Abstract
In this article, we propose a hybrid-Bayesian frequentist approach using a Bayesian sequential prediction of the index of satisfaction. For interim analysis that addresses prediction hypoth...
- Published
- 2019
- Full Text
- View/download PDF
17. Click Sequence Prediction in Android Mobile Applications
- Author
-
Hojung Cha, Rhan Ha, and Seokjun Lee
- Subjects
Training set ,Computer Networks and Communications ,Computer science ,business.industry ,Human Factors and Ergonomics ,Tracing ,Machine learning ,computer.software_genre ,Computer Science Applications ,Data modeling ,Human-Computer Interaction ,Pathfinder ,User experience design ,Artificial Intelligence ,Control and Systems Engineering ,Sequence prediction ,Data_GENERAL ,Signal Processing ,Artificial intelligence ,Android (operating system) ,business ,computer ,Humanoid robot - Abstract
Predicting a click sequence in mobile applications improves the user experience in various ways. By predicting which button will be clicked next, one can predict how the application will work and how the device will operate. However, predicting the click sequence is difficult because of the problems involved in collecting click sequences in real application usage. More importantly, accurate predictions are extremely challenging. In this paper, we address these issues. We propose PathFinder, a scheme for collecting click events and based on them predicting the next click in the application. The clicks are collected with the Android Accessibility Service and the next click is predicted via long short-term memory (LSTM). For the prediction, the base click sequence model is first generated from all users’ data; then, a personalized model is trained with an individual click sequence. As training data considerably influences the performance of LSTM, several techniques are developed to enhance the quality of the training data. The experimental results for 100 popular applications showed that the coverage and accuracy of click sequence tracing were 95% and 96%, respectively. Furthermore, PathFinder predicted the top three buttons that would be clicked next with a 0.76 F -measure for 1 775 043 real click data.
- Published
- 2019
- Full Text
- View/download PDF
18. Environment Agnostic Invariant Risk Minimization for Classification of Sequential Datasets
- Author
-
Praveen Venkateswaran, Vinod Muthusamy, Nalini Venkatasubramanian, and Vatche Isahagian
- Subjects
business.industry ,Computer science ,Generalization ,Machine learning ,computer.software_genre ,Standard Risk ,Sequence prediction ,Segmentation ,Artificial intelligence ,Minification ,business ,Focus (optics) ,Spurious relationship ,computer ,Invariant (computer science) - Abstract
The generalization of predictive models that follow the standard risk minimization paradigm of machine learning can be hindered by the presence of spurious correlations in the data. Identifying invariant predictors while training on data from multiple environments can influence models to focus on features that have an invariant causal relationship with the target, while reducing the effect of spurious features. Such invariant risk minimization approaches heavily rely on clearly defined environments and data being perfectly segmented into these environments for training. However, in real-world settings, perfect segmentation is challenging to achieve and these environment-aware approaches prove to be sensitive to segmentation errors. In this work, we present an environment-agnostic approach to develop generalizable models for classification tasks in sequential datasets without needing prior knowledge of environments. We show that our approach results in models that can generalize to out-of-distribution data and are not influenced by spurious correlations. We evaluate our approach on real-world sequential datasets from various domains.
- Published
- 2021
- Full Text
- View/download PDF
19. Remaining Activity Sequence Prediction for ongoing process instances
- Author
-
Xiaoxiao Sun
- Subjects
Computer science ,Sequence prediction ,business.industry ,Process (computing) ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,computer - Published
- 2021
- Full Text
- View/download PDF
20. A Time Series Based Sequence Prediction Algorithm to Detect Activities of Daily Living in Smart Home.
- Author
-
Marufuzzaman, M., Reaz, M. B. I., Ali, M. A. M., and Rahman, L. F.
- Subjects
ACTIVITIES of daily living scales ,TIME series analysis ,PREDICTION models ,HOME automation ,MEDICAL decision making - Abstract
Objectives: The goal of smart homes is to create an intelligent environment adapting the inhabitants need and assisting the person who needs special care and safety in their daily life. This can be reached by collecting the ADL (activities of daily living) data and further analysis within existing computing elements. In this research, a very recent algorithm named sequence prediction via enhanced episode discovery (SPEED) is modified and in order to improve accuracy time component is included. Methods: The modified SPEED or M-SPEED is a sequence prediction algorithm, which modified the previous SPEED algorithm by using time duration of appliance's ON-OFF states to decide the next state. M-SPEED discovered periodic episodes of inhabitant behavior, trained it with learned episodes, and made decisions based on the obtained knowledge. Results: The results showed that M-SPEED achieves 96.8% prediction accuracy, which is better than other time prediction algorithms like PUBS, ALZ with temporal rules and the previous SPEED. Conclusions: Since human behavior shows natural temporal patterns, duration times can be used to predict future events more accurately. This inhabitant activity prediction system will certainly improve the smart homes by ensuring safety and better care for elderly and handicapped people. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
21. Word Level LSTM and Recurrent Neural Network for Automatic Text Generation
- Author
-
Pvs. Manogna, Harsha Vardhana Krishna Sai Buddana, P S Shijin Kumar, and Surampudi Sai Kaushik
- Subjects
business.industry ,Computer science ,Machine learning ,computer.software_genre ,Long short term memory ,Generative model ,Recurrent neural network ,Sequence prediction ,Text generation ,Artificial intelligence ,Gradient descent ,business ,Inefficiency ,computer ,Word (computer architecture) - Abstract
Sequence prediction problems have been a major problem for a long time. Recurrent Neural Network (RNN) has been a good solution for sequential prediction problems. This work aims to create a generative model for text. Even though, RNN has its own limitations such as vanishing and exploding gradient descent problems, and inefficiency to keep track of long-term dependencies. To overcome these drawbacks, Long Short Term Memory (LSTM) has been a path-breaking solution to deal with sequential data and text data in particular. This paper delineates the design and working of text generation using word-level LSTM-RNN.
- Published
- 2021
- Full Text
- View/download PDF
22. Neural Clinical Event Sequence Prediction Through Personalized Online Adaptive Learning
- Author
-
Jeong Min Lee and Milos Hauskrecht
- Subjects
Online model ,0303 health sciences ,education.field_of_study ,Event (computing) ,business.industry ,Computer science ,Clinical events ,Population ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Article ,3. Good health ,03 medical and health sciences ,Sequence prediction ,Artificial intelligence ,Adaptive learning ,business ,education ,computer ,Predictive modelling ,030304 developmental biology ,0105 earth and related environmental sciences ,Sequence (medicine) - Abstract
Clinical event sequences consist of thousands of clinical events that represent records of patient care in time. Developing accurate prediction models for such sequences is of a great importance for defining representations of a patient state and for improving patient care. One important challenge of learning a good predictive model of clinical sequences is patient-specific variability. Based on underlying clinical complications, each patient’s sequence may consist of different sets of clinical events. However, population-based models learned from such sequences may not accurately predict patient-specific dynamics of event sequences. To address the problem, we develop a new adaptive event sequence prediction framework that learns to adjust its prediction for individual patients through an online model update.
- Published
- 2021
- Full Text
- View/download PDF
23. Improving Input Prediction in Online Fighting Games
- Author
-
Ehlert, Anton
- Subjects
Computer and Information Sciences ,machine learning ,maskininlärning ,spekulativ exekvering ,Fightingspel ,sekvensförutsägelse ,Data- och informationsvetenskap ,Fighting games ,sequence prediction ,speculative execution - Abstract
Many online fighting games use rollback netcode in order to compensate for network delay. Rollback netcode allows players to experience the game as having reduced delay. A drawback of this is that players will sometimes see the game quickly ”jump” to a different state to adjust for the the remote player’s actions. Rollback netcode implementations require a method for predicting the remote player’s next button inputs. Current implementations use a naive repeatlastframe policy for such prediction. There is a possibility that alternative methods may lead to improved user experience. This project examines the problem of improving input prediction in fighting games. It details the development of a new prediction model based on recurrent neural networks. The model was trained and evaluated using a dataset of several thousand recorded player input sequences. The results show that the new model slightly outperforms the naive method in prediction accuracy, with the difference being greater for longer predictions. However, it has far higher requirements both in terms of memory and computation cost. It seems unlikely that the model would significantly improve on current rollback netcode implementations. However, there may be ways to improve predictions further, and the effects on user experience remains unknown. Många online fightingspel använder rollback netcode för att kompensera för nätverksfördröjning. Rollback netcode låter spelare uppleva spelet med mindre fördröjning. En nackdel av detta är att spelare ibland ser spelet snabbt ”hoppa” till ett annat tillstånd för att justera för motspelarens handlingar. Rollback netcode implementationer behöver en policy för att förutsäga motspelarens nästa knapptryckningar. Nuvarande implementationer använder en naiv repetera-senaste-frame policy för förutsägelser. Det finns en möjlighet att alternativa metoder kan leda till förbättrad användarupplevelse. Det här projektet undersöker problemet att förbättra förutsägelser av knapptryckningar i fightingspel. Det beskriver utvecklingen av en ny förutsägelsemodell baserad på rekursiva neuronnät. Modellen tränades och evaluerades med ett dataset av flera tusen inspelade knappsekvenser. Resultaten visar att den nya modellen överträffar den naiva metoden i noggrannhet, med större skillnad för längre förutsägelser. Dock har den mycket högre krav i både minne och beräkningskostad. Det verkar osannolikt att modellen skulle avsevärt förbättra nuvarande rollback netcode implementationer. Men det kan finnas sätt att förbättra förutsägelser ytterligare, och påverkan på användarupplevelsen förblir okänd.
- Published
- 2021
24. Analysis of Modality-Based Presentation Skills Using Sequential Models
- Author
-
Hung-Hsuan Huang, Su Shwe Yi Tun, Shogo Okada, and Chee Wee Leong
- Subjects
Modality (human–computer interaction) ,business.industry ,Computer science ,media_common.quotation_subject ,Machine learning ,computer.software_genre ,Presentation ,Sequence modeling ,Sequence prediction ,Performance prediction ,Artificial intelligence ,business ,computer ,Relevant information ,media_common - Abstract
This paper presents an analysis of informative presentations using sequential multimodal modeling for automatic assessment of presentation performance. For this purpose, we transform a single video into multiple time-series segments that are provided as inputs to sequential models, such as Long Short-Term Memory (LSTM). This sequence modeling approach enables us to capture the time-series change of multimodal behaviors during the presentation. We proposed variants of sequential models that improve the accuracy of performance prediction over non-sequential models. Moreover, we performed segment analysis on the sequential models to analyze how relevant information from various segments can lead to better performance in sequential prediction models.
- Published
- 2021
- Full Text
- View/download PDF
25. A Hybrid Mode of Sequence Prediction Based on Generative Adversarial Network
- Author
-
WenXuan Huang, Heng Luo, TingFei Zhang, and Hang Liu
- Subjects
Computer science ,Test data generation ,business.industry ,020209 energy ,Mode (statistics) ,02 engineering and technology ,Energy consumption ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Data modeling ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,business ,Generative adversarial network ,computer ,0105 earth and related environmental sciences - Abstract
Human beings nowadays spend more than 90% of the lifetime indoors, leading to the dramatic increase of energy consumption in various buildings. Therefore, research regarding the environment friendly building becomes much more popular recently in which the prediction of energy consumption is a promised method. Nevertheless, the accuracy of prediction is not sound due to insufficient samples. A novel data generation model, termed HMSP, based on the generative adversarial networks, is proposed in this paper to generate much more data robustly, depending on a small number of samples available. The prediction CV-RMSE results, adopting data from the hybrid model, reach 3.03% at best and 7.99% at worst respectively compared to the samples recorded.
- Published
- 2020
- Full Text
- View/download PDF
26. Sequential Prediction of Glycosylated Hemoglobin Based on Long Short-Term Memory with Self-Attention Mechanism
- Author
-
Keyu Zhu, Wenqing Gong, Yuxiang Guan, Lushi Yao, Shanshan Zhang, Xiaojia Wang, and Weiqun Xu
- Subjects
General Computer Science ,endocrine system diseases ,Computer science ,Mechanism (biology) ,Self attention ,Self-attention mechanism ,nutritional and metabolic diseases ,T2DM ,Network ,Glycosylated hemoglobin ,lcsh:QA75.5-76.95 ,Computational Mathematics ,Long short term memory ,Sequence prediction ,Machine learning ,Hemoglobin ,lcsh:Electronic computers. Computer science ,Neuroscience - Abstract
Type 2 diabetes mellitus (T2DM) has been identified as one of the most challenging chronic diseases to manage. In recent years, the incidence of T2DM has increased, which has seriously endangered people’s health and life quality. Glycosylated hemoglobin (HbA1c) is the gold standard clinical indicator of the progression of T2DM. An accurate prediction of HbA1c levels not only helps medical workers improve the accuracy of clinical decision-making but also helps patients to better understand the clinical progression of T2DM and conduct self-management to achieve the goal of controlling the progression of T2DM. Therefore, we introduced the long short-term memory (LSTM) neural network to predict patients’ HbA1c levels using time sequential data from electronic medical records (EMRs). We added the self-attention mechanism based on the traditional LSTM to capture the long-term interdependence of feature elements and which ensure that the memory was more profound and effective, and used the gradient search technology to minimize the mean square error of the predicted value of the network and the real value. LSTM with the self-attention mechanism performed better than the traditional deep learning sequence prediction method. Our research provides a good reference for the application of deep learning in the field of medical health management.
- Published
- 2020
27. SPADE: Activity Prediction in Smart Homes Using Prefix Tree Based Context Generation
- Author
-
Mamun Bin Ibne Reaz, Norhana Arsad, and Araf Farayez
- Subjects
General Computer Science ,smart home ,Computer science ,02 engineering and technology ,Prediction by partial matching ,Machine learning ,computer.software_genre ,Data modeling ,Trie ,0202 electrical engineering, electronic engineering, information engineering ,General Materials Science ,data compression ,prefix tree ,Activity prediction ,Lossless compression ,business.industry ,General Engineering ,020206 networking & telecommunications ,prediction by partial matching ,sequence prediction ,Memory management ,Data model ,020201 artificial intelligence & image processing ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,Artificial intelligence ,business ,lcsh:TK1-9971 ,computer ,Data compression - Abstract
An automated smart home system is a key contributor to user assistance technology in modern civilization. Crucial merit of such a system is its ability to train itself through recorded data and recognize patterns in resident behaviors. Lack of sufficient prediction accuracy, exponential memory consumption, and extensive runtime prevent many of the current activity prediction approaches from being seamlessly integrated into consumer residences. This research introduces a sequence prediction algorithm which uses a prefix tree-based data model in order to learn and predict user actions. The algorithm applies episode discovery to detect correlated sensor events and learns the activities using a lossless data compression technique. This process assigns a probability of occurrence to sensor events and uses these probabilities to detect patterns in resident behavior. A complexity analysis of the algorithm is done to prove its efficiency in terms of memory usage and runtime. Using the presented technique, predictions are performed on popular datasets and contrasted with existing algorithms. The proposed algorithm achieves an 8.22% improvement in prediction accuracy over its predecessors, along with 66.69% better memory efficiency and 37% faster runtime.
- Published
- 2019
- Full Text
- View/download PDF
28. Residual Recurrent Highway Networks for Learning Deep Sequence Prediction Models
- Author
-
Tehseen Zia and Saad Razzaq
- Subjects
Structure (mathematical logic) ,020203 distributed computing ,Information propagation ,Perplexity ,Computer Networks and Communications ,Computer science ,business.industry ,Treebank ,020206 networking & telecommunications ,02 engineering and technology ,Machine learning ,computer.software_genre ,Residual ,Recurrent neural network ,Hardware and Architecture ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,Baseline (configuration management) ,business ,computer ,Software ,Information Systems - Abstract
A contemporary approach for acquiring the computational gains of depth in recurrent neural networks (RNNs) is to hierarchically stack multiple recurrent layers. However, such performance gains come with the cost of challenging optimization of hierarchal RNNs (HRNNs) which are deep both hierarchically and temporally. The researchers have exclusively highlighted the significance of using shortcuts for learning deep hierarchical representations and deep temporal dependencies. However, no significant efforts are made to unify these finding into a single framework for learning deep HRNNs. We propose residual recurrent highway network (R2HN) that contains highways within temporal structure of the network for unimpeded information propagation, thus alleviating gradient vanishing problem. The hierarchical structure learning is posed as residual learning framework to prevent performance degradation problem. The proposed R2HN contain significantly reduced data-dependent parameters as compared to related methods. The experiments on language modeling (LM) tasks have demonstrated that the proposed architecture leads to design effective models. On LM experiments with Penn TreeBank, the model achieved 60.3 perplexity and outperformed baseline and related models that we tested.
- Published
- 2018
- Full Text
- View/download PDF
29. Learning Theory Analysis for Association Rules and Sequential Event Prediction.
- Author
-
Rudin, Cynthia, Letham, Benjamin, and Madigan, David
- Subjects
- *
MACHINE learning , *ASSOCIATION rule mining , *SEQUENTIAL analysis , *PREDICTION models , *STABILITY theory - Abstract
We present a theoretical analysis for prediction algorithms based on association rules. As part of this analysis, we introduce a problem for which rules are particularly natural, called "sequential event prediction." In sequential event prediction, events in a sequence are revealed one by one, and the goal is to determine which event will next be revealed. The training set is a collection of past sequences of events. An example application is to predict which item will next be placed into a customer's online shopping cart, given his/her past purchases. In the context of this problem, algorithms based on association rules have distinct advantages over classical statistical and machine learning methods: they look at correlations based on subsets of co-occurring past events (items a and b imply item c), they can be applied to the sequential event prediction problem in a natural way, they can potentially handle the "cold start" problem where the training set is small, and they yield interpretable predictions. In this work, we present two algorithms that incorporate association rules. These algorithms can be used both for sequential event prediction and for supervised classification, and they are simple enough that they can possibly be understood by users, customers, patients, managers, etc. We provide generalization guarantees on these algorithms based on algorithmic stability analysis from statistical learning theory. We include a discussion of the strict minimum support threshold often used in association rule mining, and introduce an "adjusted confidence" measure that provides a weaker minimum support condition that has advantages over the strict minimum support. The paper brings together ideas from statistical learning theory, association rule mining and Bayesian analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2013
30. Time and activity sequence prediction of business process instances
- Author
-
Andrea Burattin, Mirko Polato, Massimiliano de Leoni, Alessandro Sperduti, and Process Science
- Subjects
FOS: Computer and information sciences ,Computer Science - Artificial Intelligence ,Computer science ,Process (engineering) ,Business process ,Remaining time ,Process mining ,02 engineering and technology ,Machine learning ,computer.software_genre ,Theoretical Computer Science ,Service-level agreement ,Order (exchange) ,Sequence prediction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Numerical Analysis ,Sequence ,business.industry ,Work (physics) ,Computer Science Applications ,Computational Mathematics ,Artificial Intelligence (cs.AI) ,Computational Theory and Mathematics ,020201 artificial intelligence & image processing ,Artificial intelligence ,Prediction ,business ,computer ,Software - Abstract
The ability to know in advance the trend of running process instances, with respect to different features, such as the expected completion time, would allow business managers to timely counteract to undesired situations, in order to prevent losses. Therefore, the ability to accurately predict future features of running business process instances would be a very helpful aid when managing processes, especially under service level agreement constraints. However, making such accurate forecasts is not easy: many factors may influence the predicted features. Many approaches have been proposed to cope with this problem but, generally, they assume that the underlying process is stationary. However, in real cases this assumption is not always true. In this work we present new methods for predicting the remaining time of running cases. In particular we propose a method, assuming process stationarity, which achieves state-of-the-art performances and two other methods which are able to make predictions even with non-stationary processes. We also describe an approach able to predict the full sequence of activities that a running case is going to take. All these methods are extensively evaluated on different real case studies.
- Published
- 2018
- Full Text
- View/download PDF
31. On Finding Predictors for Arbitrary Families of Processes.
- Author
-
Ryabko, Daniil
- Subjects
- *
MATHEMATICAL sequences , *LOGICAL prediction , *BAYESIAN analysis , *MATHEMATICAL statistics , *MACHINE learning - Abstract
The problem is sequence prediction in the following setting. A sequence x1, . . . ,xn, . . . of discrete-valued observations is generated according to some unknown probabilistic law (measure) µ. After observing each outcome, it is required to give the conditional probabilities of the next observation. The measure µ belongs to an arbitrary but known class C of stochastic process measures. We are interested in predictors r whose conditional probabilities converge (in some sense) to the "true" µ-conditional probabilities, if any µ ε C is chosen to generate the sequence. The contribution of this work is in characterizing the families C for which such predictors exist, and in providing a specific and simple form in which to look for a solution. We show that if any predictor works, then there exists a Bayesian predictor, whose prior is discrete, and which works too. We also find several sufficient and necessary conditions for the existence of a predictor, in terms of topological characterizations of the family C, as well as in terms of local behaviour of the measures in C, which in some cases lead to procedures for constructing such predictors. It should be emphasized that the framework is completely general: the stochastic processes considered are not required to be i.i.d., stationary, or to belong to any parametric or countable family. [ABSTRACT FROM AUTHOR]
- Published
- 2010
32. Forecasting occurrences of activities
- Author
-
Bryan Minor and Diane J. Cook
- Subjects
Ubiquitous computing ,Computer Networks and Communications ,Computer science ,business.industry ,Decision tree ,02 engineering and technology ,computer.software_genre ,Machine learning ,Article ,Computer Science Applications ,Activity recognition ,Hardware and Architecture ,Home automation ,Sequence prediction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,business ,computer ,Classifier (UML) ,Software ,Information Systems - Abstract
While activity recognition has been shown to be valuable for pervasive computing applications, less work has focused on techniques for forecasting the future occurrence of activities. We present an activity forecasting method to predict the time that will elapse until a target activity occurs. This method generates an activity forecast using a regression tree classifier and offers an advantage over sequence prediction methods in that it can predict expected time until an activity occurs. We evaluate this algorithm on real-world smart home datasets and provide evidence that our proposed approach is most effective at predicting activity timings.
- Published
- 2017
- Full Text
- View/download PDF
33. Automatic VMAT Planning via MLC Dynamic Sequence Prediction (AVP-DSP): A Novel Deep-Learning Method for Real-Time Prostate Treatment Planning
- Author
-
Chunhao Wang, Yang Sheng, Xinyi Li, Y. Ni, Jie Zhang, and Qiulian Wu
- Subjects
Cancer Research ,Radiation ,business.industry ,Deep learning ,Machine learning ,computer.software_genre ,medicine.anatomical_structure ,Oncology ,Prostate ,Sequence prediction ,medicine ,Radiology, Nuclear Medicine and imaging ,Artificial intelligence ,business ,Radiation treatment planning ,computer ,Digital signal processing - Published
- 2020
- Full Text
- View/download PDF
34. Sekvensprediktering för identifiering av användarutrustningsmönster i mobila nätverk
- Author
-
Charitidis, Theoharis
- Subjects
Maskininlärning ,Matematik ,mobile networks ,markovkedjor ,Markov chain ,statistik ,mobila nätverk ,user equipment ,stochastic process ,sequence prediction ,machine learning ,statistics ,sekvensprediktering ,stokastisk process ,all k order markov ,Mathematics - Abstract
With an increasing demand for bandwidth and lower latency in mobile communication networks it becomes gradually more important to improve current mobile network management solutions using available network data. To improve the network management it can for instance be of interest to infer future available bandwidth to the end user of the network. This can be done by utilizing the current knowledge of real-time user equipment (UE) behaviour in the network. In the scope of this thesis interest lies in, given a set of visited radio access points (cells), to predict what the next one is going to be. For this reason the aim is to investigate the prediction performance when utilizing the All-K-Order Markov (AKOM) model, with some added variations, on collected data generated from train trajectories. Moreover a method for testing the suitability of modeling the sequence of cells as a time-homogeneous Markov chain is proposed, in order to determine the goodness-of- t with the available data. Lastly, the elapsed time in each cell is attempted to be predicted using linear regression given the prior history window of previous cell and elapsed times pairs. The results show that moderate to good prediction accuracy on the upcoming cell can be achieved with AKOM and associated variations. For predicting the upcoming sojourn time in future cells the results reveal that linear regression does not yield satisfactory results and possibly another regression model should be utilized. Med en ökande efterfrågan på banbredd och kortare latens i mobila nätverk har det gradvis blivit viktigare att förbättra nuvarande lösningar för hantering av nätverk genom att använda tillgänglig nätverksdata. Specifikt är det av intresse att kunna dra slutsatser kring vad framtida bandbredsförhållanden kommer vara, samt övriga parametrar av intresse genom att använda tillgänglig information om aktuell mobil användarutrustnings (UE) beteende i det mobila nätverket. Inom ramen av detta masterarbete ligger fokus på att, givet tidigare besökta radio accesspunkter (celler), kunna förutspå vilken nästkommande besökta cell kommer att vara. Av denna anledning är målet att undersöka vilken prestanda som kan uppnås när All-$K$-Order Markov (AKOM) modellen, med associerade varianter av denna, används på samlad data från tågfärder. Dessutom ges det förslag på test som avgör hur lämpligt det är att modelera observerade sekvenser av celler som en homogen Markovkedja med tillgänglig data. Slutligen undersöks även om besökstiden i en framtida cell kan förutspås med linjär regression givet ett historiskt fönster av tidigare cell och besökstids par. Erhållna resultat visar att måttlig till bra prestanda kan uppnås när kommande celler förutspås med AKOM modellen och associerade variationer. För prediktering av besökstid i kommande cell med linjär regression erhålles det däremot inte tillfredsställande resultat, vilket tyder på att en alternativ regressionsmetod antagligen är bättre lämpad för denna data.
- Published
- 2020
35. Predicting user routines with masked dilated convolutions
- Author
-
Dragomir Yankov, Renzhong Wang, Siddhartha Cingh Arora, Senthil Palanisamy, Michael R. Evans, and Wei Wu
- Subjects
Coffee shop ,business.industry ,Computer science ,02 engineering and technology ,Recommender system ,Machine learning ,computer.software_genre ,Masking (Electronic Health Record) ,Sequence prediction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Day to day ,business ,computer - Abstract
Predicting users daily location visits - when and where they will go, and how long they will stay - is key for making effective location-based recommendations. Knowledge of an upcoming day allows the suggestion of relevant alternatives (e.g., a new coffee shop on the way to work) in advance, prior to a visit. This helps users make informed decisions and plan accordingly. People's visit routines, or just routines, can vary significantly from day to day, and visits from earlier in the day, week, or month may affect subsequent choices. Traditionally, routine prediction has been modeled with sequence methods, such as HMMs or more recently with RNN-based architectures. However, the problem with such architectures is that their predictive performance degrades when increasing the number of historical observations in the routine sequence. In this paper, we propose Masked-TCN (MTCN), a novel method based on time-dilated convolutional networks. The method implements custom dilations and masking which can process effectively long routine sequences, identifying recurring patterns at different resolution - hourly, daily, weekly, monthly. We demonstrate that MTCN achieves 8% improvement in accuracy over current state-of-the-art solutions on a large data set of visit routines.
- Published
- 2019
- Full Text
- View/download PDF
36. Multi-modal Behavioral Information-Aware Recommendation with Recurrent Neural Networks
- Author
-
Weidong Gu, Nannan Chen, Jiao Pan, and Guoyong Cai
- Subjects
Sequence ,Computer science ,business.industry ,Recommender system ,Machine learning ,computer.software_genre ,Matrix decomposition ,Recurrent neural network ,Modal ,Sequence prediction ,Contextual information ,Artificial intelligence ,business ,computer - Abstract
Data sparsity is one of the most challenging problems in recommendation systems. In this paper, we tackle this problem by proposing a novel multi-modal behavioral information-aware recommendation method named MIAR which is based on recurrent neural networks and matrix factorization. First, an interaction context-aware sequential prediction model is designed to capture user-item interaction contextual information and behavioral sequence information. Second, an attributed context-aware rating prediction model is proposed to capture attribution contextual information and rating information. Finally, three fusion methods are developed to combine two sub-models. As a result, the MIAR method has several distinguished advantages in terms of mitigating the data sparsity problem. The method can well perceive diverse influences of interaction and attribution contextual information. Meanwhile, a large number of behavioral sequence and rating information can be utilized by the MIAR approach. The proposed algorithm is evaluated on real-world datasets and the experimental results show that MIAR can significantly improve recommendation performance compared to the existing state-of-art recommendation algorithms.
- Published
- 2019
- Full Text
- View/download PDF
37. Multi-year long-term load forecast for area distribution feeders based on selective sequence learning
- Author
-
Ming Dong, Jian Shi, and Qingxin Shi
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Mean squared error ,Computer science ,020209 energy ,Distribution (economics) ,Systems and Control (eess.SY) ,02 engineering and technology ,Machine learning ,computer.software_genre ,Electrical Engineering and Systems Science - Systems and Control ,Industrial and Manufacturing Engineering ,Machine Learning (cs.LG) ,Distribution system ,020401 chemical engineering ,Sequence prediction ,FOS: Electrical engineering, electronic engineering, information engineering ,0202 electrical engineering, electronic engineering, information engineering ,0204 chemical engineering ,Electrical and Electronic Engineering ,Civil and Structural Engineering ,Sequence ,business.industry ,Mechanical Engineering ,Building and Construction ,Pollution ,Term (time) ,General Energy ,Unsupervised learning ,Artificial intelligence ,Sequence learning ,business ,computer - Abstract
Long-term load forecast (LTLF) for area distribution feeders is one of the most critical tasks frequently performed in electric distribution utility companies. For a specific planning area, cost-effective system upgrades can only be planned out based on accurate feeder LTLF results. In our previous research, we established a unique sequence prediction method which has the tremendous advantage of combining area top-down, feeder bottom-up and multi-year historical data all together for forecast and achieved a superior performance over various traditional methods by real-world tests. However, the previous method only focused on the forecast of the next one-year. In our current work, we significantly improved this method: the forecast can now be extended to a multi-year forecast window in the future; unsupervised learning techniques are used to group feeders by their load composition features to improve accuracy; we also propose a novel selective sequence learning mechanism which uses Gated Recurrent Unit network to not only learn how to predict sequence values but also learn to select the best-performing sequential configuration for each individual feeder. The proposed method was tested on an actual urban distribution system in West Canada. It was compared with traditional methods and our previous sequence prediction method. It demonstrates the best forecasting performance as well as the possibility of using sequence prediction models for multi-year component-level load forecast., Comment: 22 pages, 9 figures
- Published
- 2020
- Full Text
- View/download PDF
38. Human activity prediction in smart home environments with LSTM neural networks
- Author
-
Niek Tax and Process Science
- Subjects
Activity prediction ,Artificial neural network ,business.industry ,Computer science ,Event (computing) ,Process mining ,02 engineering and technology ,Machine learning ,computer.software_genre ,Identification (information) ,Recurrent neural network ,Home automation ,Smart home environments ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,Sequence prediction ,020201 artificial intelligence & image processing ,Artificial intelligence ,Timestamp ,business ,computer ,Neural networks - Abstract
In this paper, we investigate the performance of several sequence prediction techniques on the prediction of future events of human behavior in a smart home, as well as the timestamps of those next events. Prediction techniques in smart home environments have several use cases, such as the real-time identification of abnormal behavior, identifying coachable moments for e-coaching, and a plethora of applications in the area of home automation. We give an overview of several sequence prediction techniques, including techniques that originate from the areas of data mining, process mining, and data compression, and we evaluate the predictive accuracy of those techniques on a collection of publicly available real-life datasets from the smart home environments domain. This contrast our work with existing work on prediction in smart homes, which often evaluate their techniques on a single smart home instead of a larger collection of logs. We found that LSTM neural networks outperform the other prediction methods on the task of predicting the next activity as well as on the task of predicting the timestamp of the next event. However, surprisingly, we found that it is very dependent on the dataset which technique works best for the task of predicting a window of multiple next activities.
- Published
- 2018
39. Target Binding and Sequence Prediction With LSTMs
- Author
-
Pavlovic M, StClair R, Teti M, Elan Barenholtz, and William Edward Hahn
- Subjects
business.industry ,Computer science ,Python (programming language) ,Machine learning ,computer.software_genre ,Conserved sequence ,Text mining ,Recurrent neural network ,Sequence prediction ,medicine ,Homeobox ,Artificial intelligence ,UniProt ,Artemisinin ,Binding site ,business ,Primary sequence ,Target binding ,computer ,medicine.drug ,computer.programming_language - Abstract
Deep recurrent neural networks (DRNNs) have recently demonstrated strong performance in sequential data analysis, such as natural language processing. These capabilities make them a promising tool for inferential analysis of sequentially structured bioinformatics data as well. Here, we assessed the ability of Long Short-Term Memory (LSTM) networks, a class of DRNNs, to predict properties of proteins based on their primary structures. The proposed architecture is trained and tested on two different datasets to predict whether a given sequence falls into a certain class or not. The first dataset, directly imported from Uniprot, was used to train the network on whether a given protein contained or did not contain a conserved sequence (homeodomain), and the second dataset, derived by literature mining, was used to train a network on whether a given protein binds or doesn't bind to Artemisinin, a drug typically used to treat malaria. In each case, the model was able to differentiate between the two different classes of sequences it was given with high accuracy, illustrating successful learning and generalization. Upon completion of training, an ROC curve was created using the homeodomain and artemisinin validation datasets. The AUC of these datasets was 0.80 and 0.87 respectively, further indicating the models' effectiveness. Furthermore, using these trained models, it was possible to derive a protocol for sequence detection of homeodomain and binding motif, which are well-documented in literature, and a known Artemisinin binding site, respectively [1-3]. Along with these contributions, we developed a python API to directly connect to Uniprot data sourcing, train deep neural networks on this primary sequence data using TensorFlow, and uniquely visualize the results of this analysis. Such an approach has the potential to drastically increase accuracy and reduce computational time and, current major limitations in informatics, from inquiry to discovery in protein function research
- Published
- 2018
- Full Text
- View/download PDF
40. Music Sequence Prediction with Mixture Hidden Markov Models
- Author
-
Tao Li, Kaiming Fu, Lei Lin, and Minsoo Choi
- Subjects
FOS: Computer and information sciences ,business.industry ,Computer science ,Deep learning ,020207 software engineering ,02 engineering and technology ,Recommender system ,Mixture model ,Machine learning ,computer.software_genre ,Multimedia (cs.MM) ,Computer Science - Information Retrieval ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Collaborative filtering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Hidden Markov model ,computer ,Information Retrieval (cs.IR) ,Computer Science - Multimedia - Abstract
Recommendation systems that automatically generate personalized music playlists for users have attracted tremendous attention in recent years. Nowadays, most music recommendation systems rely on item-based or user-based collaborative filtering or content-based approaches. In this paper, we propose a novel mixture hidden Markov model (HMM) for music play sequence prediction. We compare the mixture model with state-of-the-art methods and evaluate the predictions quantitatively and qualitatively on a large-scale real-world dataset in a Kaggle competition. Results show that our model significantly outperforms traditional methods as well as other competitors. We conclude by envisioning a next-generation music recommendation system that integrates our model with recent advances in deep learning, computer vision, and speech techniques, and has promising potential in both academia and industry., Accepted to the 4th International Conference on Artificial Intelligence and Applications (AI 2018)
- Published
- 2018
41. Sequential prediction of quantitative health risk assessment for the fine particulate matter in an underground facility using deep recurrent neural networks
- Author
-
Qian Li, Paulina Vilela, Jorge Loy-Benitez, and ChangKyoo Yoo
- Subjects
Mean squared error ,Fine particulate ,Computer science ,Health, Toxicology and Mutagenesis ,0211 other engineering and technologies ,02 engineering and technology ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Risk Assessment ,Indoor air quality ,Sequence prediction ,Republic of Korea ,Humans ,Railroads ,0105 earth and related environmental sciences ,021110 strategic, defence & security studies ,Air Pollutants ,Warning system ,Health risk assessment ,business.industry ,Public Health, Environmental and Occupational Health ,General Medicine ,Pollution ,Recurrent neural network ,Air Pollution, Indoor ,Particulate Matter ,Artificial intelligence ,Neural Networks, Computer ,business ,computer ,Performance metric ,Environmental Monitoring ,Forecasting - Abstract
Particulate matter with aerodynamic diameter less than 2.5 µm (PM2.5) in indoor public spaces such as subway stations, has represented a major public health concern; however, forecasting future sequences of quantitative health risk is an effective method for protecting commuters’ health, and an important tool for developing early warning systems. Despite the existence of several predicting methods, some tend to fail to forecast long-term dependencies in an effective way. This paper aims to implement a multiple sequences prediction of a comprehensive indoor air quality index (CIAI) traced by indoor PM2.5, utilizing different structures of recurrent neural networks (RNN). A standard RNN (SRNN), long short-term memory (LSTM) and a gated recurrent unit (GRU) structures were implemented due to their capability of managing sequential, and time-dependent data. Hourly indoor PM2.5 concentration data collected in the D-subway station, South Korea, were utilized for the validation of the proposed method. For the selection of the most suitable predictive model (i.e. SRNN, LSTM, GRU), a point-by-point prediction on the PM2.5 was conducted, demonstrating that the GRU structure outperforms the other RNN structures (RMSE = 21.04 µg/m3, MAPE = 32.92%, R2 = 0.65). Then, this model is utilized to sequentially predict the concentration and quantify the health risk (i.e. CIAI) at different time lags. For a 6-h time lag, the proposed model exhibited the best performance metric (RMSE = 29.73 µg/m3, MAPE = 29.52%). Additionally, for the rest of the time lags including 12, 18 and 24 h, achieved an acceptable performance (MAPE = 29–37%).
- Published
- 2018
42. RNN and LSTM
- Author
-
Navin Kumar Manaswi
- Subjects
Recurrent neural network ,Computer science ,Sequence prediction ,business.industry ,Artificial intelligence ,Time series ,Machine learning ,computer.software_genre ,business ,computer - Abstract
This chapter will discuss the concepts of recurrent neural networks (RNNs) and their modified version, long short-term memory (LSTM). LSTM is mainly used for sequence prediction. You will learn about the varieties of sequence prediction and then learn how to do time-series forecasting with the help of the LSTM model.
- Published
- 2018
- Full Text
- View/download PDF
43. Generative Bridging Network for Neural Sequence Prediction
- Author
-
Zhirui Zhang, Guanlin Li, Mu Li, Shujie Liu, Ming Zhou, Wenhu Chen, and Shuo Ren
- Subjects
Machine translation ,Computer science ,business.industry ,Maximum likelihood ,05 social sciences ,010501 environmental sciences ,Overfitting ,computer.software_genre ,Machine learning ,01 natural sciences ,Automatic summarization ,Bridging (programming) ,Sequence prediction ,0502 economics and business ,Artificial intelligence ,050207 economics ,business ,computer ,Generative grammar ,0105 earth and related environmental sciences - Abstract
In order to alleviate data sparsity and overfitting problems in maximum likelihood estimation (MLE) for sequence prediction tasks, we propose the Generative Bridging Network (GBN), in which a novel bridge module is introduced to assist the training of the sequence prediction model (the generator network). Unlike MLE directly maximizing the conditional likelihood, the bridge extends the point-wise ground truth to a bridge distribution conditioned on it, and the generator is optimized to minimize their KL-divergence. Three different GBNs, namely uniform GBN, language-model GBN and coaching GBN, are proposed to penalize confidence, enhance language smoothness and relieve learning burden. Experiments conducted on two recognized sequence prediction tasks (machine translation and abstractive text summarization) show that our proposed GBNs can yield significant improvements over strong baselines. Furthermore, by analyzing samples drawn from different bridges, expected influences on the generator are verified.
- Published
- 2018
- Full Text
- View/download PDF
44. Sequence Prediction with Unlabeled Data by Reward Function Learning
- Author
-
Jianhuang Lai, Li Zhao, Tao Qin, Tie-Yan Liu, and Lijun Wu
- Subjects
Machine translation ,Computer science ,business.industry ,02 engineering and technology ,Semi-supervised learning ,010501 environmental sciences ,computer.software_genre ,Machine learning ,01 natural sciences ,Automatic summarization ,ComputingMethodologies_PATTERNRECOGNITION ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Reinforcement learning ,Unsupervised learning ,Leverage (statistics) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,0105 earth and related environmental sciences - Abstract
Reinforcement learning (RL), which has been successfully applied to sequence prediction, introduces \textit{reward} as sequence-level supervision signal to evaluate the quality of a generated sequence. Existing RL approaches use the ground-truth sequence to define reward, which limits the application of RL techniques to labeled data. Since labeled data is usually scarce and/or costly to collect, it is desirable to leverage large-scale unlabeled data. In this paper, we extend existing RL methods for sequence prediction to exploit unlabeled data. We propose to learn the reward function from labeled data and use the predicted reward as \textit{pseudo reward} for unlabeled data so that we can learn from unlabeled data using the pseudo reward. To get good pseudo reward on unlabeled data, we propose a RNN-based reward network with attention mechanism, trained with purposely biased data distribution. Experiments show that the pseudo reward can provide good supervision and guide the learning process on unlabeled data. We observe significant improvements on both neural machine translation and text summarization.
- Published
- 2017
- Full Text
- View/download PDF
45. Long and Short-Term Recommendations with Recurrent Neural Networks
- Author
-
Hugues Bersini and Robin Devooght
- Subjects
Computer science ,business.industry ,02 engineering and technology ,Recommender system ,computer.software_genre ,Machine learning ,Session (web analytics) ,Term (time) ,Recurrent neural network ,Sequence prediction ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Collaborative filtering ,020201 artificial intelligence & image processing ,Data mining ,Artificial intelligence ,business ,computer - Abstract
Recurrent neural networks have recently been successfully applied to the session-based recommendation problem, and is part of a growing interest for collaborative filtering based on sequence prediction. This new approach to recommendations reveals an aspect that was previously overlooked: the difference between short-term and long-term recommendations. In this work we characterize the full short-term/long-term profile of many collaborative filtering methods, and we show how recurrent neural networks can be steered towards better short or long-term predictions. We also show that RNNs are not only adapted to session-based collaborative filtering, but are perfectly suited for collaborative filtering on dense datasets where it outperforms traditional item recommendation algorithms.
- Published
- 2017
- Full Text
- View/download PDF
46. Predictive business process monitoring with LSTM neural networks
- Author
-
Tax, N., Verenich, I., La Rosa, M., Dumas, M., Pohl, Klaus, Dubois, Eric, and Process Science
- Subjects
FOS: Computer and information sciences ,Computer science ,Process (engineering) ,Predictive monitoring ,Machine Learning (stat.ML) ,02 engineering and technology ,Machine learning ,computer.software_genre ,Statistics - Applications ,Machine Learning (cs.LG) ,Computer Science - Databases ,Statistics - Machine Learning ,Business process management ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Applications (stat.AP) ,Process mining ,Neural and Evolutionary Computing (cs.NE) ,080600 INFORMATION SYSTEMS ,Artificial neural network ,business.industry ,Event (computing) ,Computer Science - Neural and Evolutionary Computing ,Databases (cs.DB) ,Computer Science - Learning ,Task (computing) ,Range (mathematics) ,Sequence prediction ,020201 artificial intelligence & image processing ,Artificial intelligence ,LSTM ,business ,computer - Abstract
Predictive business process monitoring methods exploit logs of completed cases of a process in order to make predictions about running cases thereof. Existing methods in this space are tailor-made for specific prediction tasks. Moreover, their relative accuracy is highly sensitive to the dataset at hand, thus requiring users to engage in trial-and-error and tuning when applying them in a specific setting. This paper investigates Long Short-Term Memory (LSTM) neural networks as an approach to build consistently accurate models for a wide range of predictive process monitoring tasks. First, we show that LSTMs outperform existing techniques to predict the next event of a running case and its timestamp. Next, we show how to use models for predicting the next task in order to predict the full continuation of a running case. Finally, we apply the same approach to predict the remaining time, and show that this approach outperforms existing tailor-made methods., Accepted at the International Conference on Advanced Information Systems Engineering (CAiSE) 2017
- Published
- 2017
- Full Text
- View/download PDF
47. Distributed sequence prediction: A consensus+innovations approach
- Author
-
Anit Kumar Sahu and Soummya Kar
- Subjects
Computer science ,business.industry ,Online learning ,020206 networking & telecommunications ,Context (language use) ,Regret ,02 engineering and technology ,Machine learning ,computer.software_genre ,ComputingMethodologies_ARTIFICIALINTELLIGENCE ,Prediction algorithms ,Sequence prediction ,0202 electrical engineering, electronic engineering, information engineering ,Symmetric matrix ,Artificial intelligence ,business ,computer - Abstract
This paper focuses on the problem of distributed sequence prediction in a network of sparsely interconnected agents, where agents collaborate to achieve provably reasonable predictive performance. An expert assisted online learning algorithm in a distributed setup of the consensus+innovations form is proposed, in which the agents update their weights for the experts' predictions by simultaneously processing the latest network losses (innovations) and the cumulative losses obtained from neighboring agents (consensus). This paper characterizes the regret of the agents' prediction in lieu of the proposed distributed online learning algorithm and establishes the sub-linear regret of the agents' predictions with respect to the best forecasting expert.
- Published
- 2016
- Full Text
- View/download PDF
48. Forecasting electricity consumption by aggregating specialized experts
- Author
-
Marie Devaine, Gilles Stoltz, Pierre Gaillard, and Yannig Goude
- Subjects
Consumption (economics) ,Mean squared error ,Computer science ,business.industry ,02 engineering and technology ,Machine learning ,computer.software_genre ,01 natural sciences ,010104 statistics & probability ,Empirical research ,Artificial Intelligence ,Sequence prediction ,Theory of computing ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Electricity ,Artificial intelligence ,0101 mathematics ,business ,Adaptation (computer science) ,computer ,Software - Abstract
We consider the setting of sequential prediction of arbitrary sequences based on specialized experts. We first provide a review of the relevant literature and present two theoretical contributions: a general analysis of the specialist aggregation rule of Freund et al. (Proceedings of the Twenty-Ninth Annual ACM Symposium on the Theory of Computing (STOC), pp. 334---343, 1997) and an adaptation of fixed-share rules of Herbster and Warmuth (Mach. Learn. 32:151---178, 1998) in this setting. We then apply these rules to the sequential short-term (one-day-ahead) forecasting of electricity consumption; to do so, we consider two data sets, a Slovakian one and a French one, respectively concerned with hourly and half-hourly predictions. We follow a general methodology to perform the stated empirical studies and detail in particular tuning issues of the learning parameters. The introduced aggregation rules demonstrate an improved accuracy on the data sets at hand; the improvements lie in a reduced mean squared error but also in a more robust behavior with respect to large occasional errors.
- Published
- 2012
- Full Text
- View/download PDF
49. A text-based decision support system for financial sequence prediction
- Author
-
Samuel W. K. Chan and James Franklin
- Subjects
Finance ,Decision support system ,Information Systems and Management ,business.industry ,Computer science ,Intelligent decision support system ,Statistical model ,Machine learning ,computer.software_genre ,Management Information Systems ,Information extraction ,Arts and Humanities (miscellaneous) ,Sequence prediction ,Developmental and Educational Psychology ,Artificial intelligence ,Data mining ,Inference engine ,business ,computer ,Classifier (UML) ,Information Systems - Abstract
Although most quantitative financial data are analyzed using traditional statistical, artificial intelligence or data mining techniques, the abundance of online electronic financial news articles has opened up new possibilities for intelligent systems that can extract and organize relevant knowledge automatically in a usable format. Most information extraction systems require a hand-built dictionary of templates and thus need continual modification to accommodate new patterns that are observed in the text. In this research, we propose a novel text-based decision support system (DSS) that (i) extracts event sequences from shallow text patterns, and (ii) predicts the likelihood of the occurrence of events using a classifier-based inference engine. The prediction relies on two major, but complementary, feature sets: adjacent events and a set of information-theoretic functions. In contrast to other approaches, the proposed text-based DSS gives explanatory hypotheses about its predictions from a coalition of intimations learned from the inference engine, while preserving robustness and without indulging in formalism. We investigate more than 2000 financial reports with 28,000 sentences. Experiments show that the prediction accuracy of our model outperforms similar statistical models by 7% for the seen data while significantly improving the prediction accuracy for the unseen data. Further comparisons substantiate the experimental findings.
- Published
- 2011
- Full Text
- View/download PDF
50. Application of Large-Scale Database-Based Online Modeling to Plant State Long-Term Estimation
- Author
-
Ogawa Masatoshi and Harutoshi Ogai
- Subjects
Estimation ,Database ,Computer science ,business.industry ,Machine learning ,computer.software_genre ,Term (time) ,Plant state ,Sequence prediction ,Data mining ,State (computer science) ,Artificial intelligence ,Electrical and Electronic Engineering ,Long-term prediction ,Scale (map) ,Quantization (image processing) ,business ,computer - Abstract
Recently, attention has been drawn to the local modeling techniques of a new idea called “Just-In-Time (JIT) modeling”. To apply “JIT modeling” to a large amount of database online, “Large-scale database-based Online Modeling (LOM)” has been proposed. LOM is a technique that makes the retrieval of neighboring data more efficient by using both “stepwise selection” and quantization. In order to predict the long-term state of the plant without using future data of manipulated variables, an Extended Sequential Prediction method of LOM (ESP-LOM) has been proposed. In this paper, the LOM and the ESP-LOM are introduced.
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.