46 results on '"Shao-Lun Huang"'
Search Results
2. Triboelectric nanogenerators enabled internet of things: A survey
- Author
-
Shao-Lun Huang, Wenbo Ding, Jiarong Li, Zihan Wang, Haixu Shen, Changsheng Wu, Xiaoyue Ni, and Ishara Dharmasena
- Subjects
Wireless transmission ,business.industry ,Software deployment ,Scientific development ,Computer science ,Information technology ,Context (language use) ,business ,Telecommunications ,Internet of Things ,Triboelectric effect - Abstract
As pioneering information technology, the Internet of Things (loT) targets at building an infrastructure of embedded devices and networks of connected objects, to offer omnipresent ecosystem and interaction across billions of smart devices, sensors, and actuators. The deployment of IoT calls for decentralized power supplies, self-powered sensors, and wireless transmission technologies, which have brought both opportunities and challenges to the existing solutions, especially when the network scales up. The Triboelectric Nanogenerators (TENGs), recently developed for mechanical energy harvesting and mechanical-to-electrical signal conversion, have the natural properties of energy and information, which have demonstrated high potentials in various applications of IoT. This context provides a comprehensive review of TENG enabled IoT and discusses the most popular and significant divisions. Firstly, the basic principle of TENG is re-examined in this article. Subsequently, a comprehensive and detailed review is given to the concept of IoT, followed by the scientific development of the TENG enabled IoT. Finally, the future of this evolving area is addressed.
- Published
- 2020
3. Less Is Better: Unweighted Data Subsampling via Influence Function
- Author
-
Zhenhua Dong, Zifeng Wang, Shao-Lun Huang, Xiuqiang He, and Hong Zhu
- Subjects
Scheme (programming language) ,Computer Science - Machine Learning ,Contextual image classification ,Computer science ,business.industry ,Volume (computing) ,Sampling (statistics) ,General Medicine ,Machine learning ,computer.software_genre ,Probabilistic sampling ,Empirical distribution function ,Statistics - Machine Learning ,Test set ,Artificial intelligence ,business ,computer ,computer.programming_language - Abstract
In the time of Big Data, training complex models on large-scale data sets is challenging, making it appealing to reduce data volume for saving computation resources by subsampling. Most previous works in subsampling are weighted methods designed to help the performance of subset-model approach the full-set-model, hence the weighted methods have no chance to acquire a subset-model that is better than the full-set-model. However, we question that how can we achieve better model with less data? In this work, we propose a novel Unweighted Influence Data Subsampling (UIDS) method, and prove that the subset-model acquired through our method can outperform the full-set-model. Besides, we show that overly confident on a given test set for sampling is common in Influence-based subsampling methods, which can eventually cause our subset-model's failure in out-of-sample test. To mitigate it, we develop a probabilistic sampling scheme to control the worst-case risk over all distributions close to the empirical distribution. The experiment results demonstrate our methods superiority over existed subsampling methods in diverse tasks, such as text classification, image classification, click-through prediction, etc., Comment: AAAI 2020
- Published
- 2020
4. Maximal Correlation Regression
- Author
-
Shao-Lun Huang and Xiangxiang Xu
- Subjects
General Computer Science ,linear discriminant analysis ,Computer science ,Feature selection ,02 engineering and technology ,regression analysis ,Statistics::Machine Learning ,machine learning algorithms ,0202 electrical engineering, electronic engineering, information engineering ,General Materials Science ,Artificial neural network ,Artificial neural networks ,business.industry ,Deep learning ,Supervised learning ,General Engineering ,020206 networking & telecommunications ,Regression analysis ,Pattern recognition ,HGR maximal correlation ,021001 nanoscience & nanotechnology ,Linear discriminant analysis ,softmax regression ,Regression ,Softmax function ,Artificial intelligence ,lcsh:Electrical engineering. Electronics. Nuclear engineering ,0210 nano-technology ,business ,lcsh:TK1-9971 - Abstract
In this paper, we propose a novel regression analysis approach, called maximal correlation regression, by exploiting the ideas from the Hirschfeld-Gebelein-Rényi (HGR) maximal correlation. We show that in supervised learning problems, the optimal weights in maximal correlation regression can be expressed analytically with the relationships to the HGR maximal correlation functions, which reveals theoretical insights for our approach. In addition, we apply the maximal correlation regression to deep learning, in which efficient training algorithms are proposed for learning the weights in hidden layers. Furthermore, we illustrate that the maximal correlation regression is deeply connected to several existing approaches in information theory and machine learning, including the universal feature selection problem, linear discriminant analysis, and the softmax regression. Finally, experiments on real datasets demonstrate that our approach can obtain performance comparable to the widely used softmax regression based-method.
- Published
- 2020
5. On the Optimal Tradeoff Between Computational Efficiency and Generalizability of Oja’s Algorithm
- Author
-
Shao-Lun Huang and Xiangxiang Xu
- Subjects
Optimal design ,Noise ,General Computer Science ,Rate of convergence ,Sampling (signal processing) ,Computer science ,General Engineering ,General Materials Science ,Generalizability theory ,Variation (game tree) ,Algorithm ,MNIST database ,Eigenvalues and eigenvectors - Abstract
The Oja's algorithm is widely applied for computing principal eigenvectors in real problems, and it is practically useful to understand the theoretical relationships between the learning rate, convergence rate, and generalization error of this algorithm for noisy samples. In this paper, we show that under mild assumptions of sampling noise, both the generalization error and the convergence rate reveal linear relationships with the learning rate in the large sample size and small learning rate regime. In addition, when the algorithm nearly converges, we provide a refined characterization of the generalization error, which suggests the optimal design for the learning rate for data with noise. Moreover, we investigate the minibatch variation of Oja's algorithm and demonstrate that the learning rate of minibatch training is decayed by a factor characterized by the batch size, which provides theoretical insights and guidance for designing the learning rate in minibatch training algorithms. Finally, our theoretical results are validated by experiments on both synthesized data and the MNIST dataset.
- Published
- 2020
6. An Information-Theoretic Approach to Transferability in Task Transfer Learning
- Author
-
Leonidas J. Guibas, Amir Roshan Zamir, Lin Zhang, Lizhong Zheng, Yang Li, Yajie Bao, and Shao-Lun Huang
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,02 engineering and technology ,010501 environmental sciences ,Evaluation function ,Machine learning ,computer.software_genre ,01 natural sciences ,Task (project management) ,Machine Learning (cs.LG) ,Metric (mathematics) ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,Selection (linguistics) ,020201 artificial intelligence & image processing ,Artificial intelligence ,Transfer of learning ,business ,Curriculum ,computer ,0105 earth and related environmental sciences - Abstract
Task transfer learning is a popular technique in image processing applications that uses pre-trained models to reduce the supervision cost of related tasks. An important question is to determine task transferability, i.e. given a common input domain, estimating to what extent representations learned from a source task can help in learning a target task. Typically, transferability is either measured experimentally or inferred through task relatedness, which is often defined without a clear operational meaning. In this paper, we present a novel metric, H-score, an easily-computable evaluation function that estimates the performance of transferred representations from one task to another in classification problems using statistical and information theoretic principles. Experiments on real image data show that our metric is not only consistent with the empirical transferability measurement, but also useful to practitioners in applications such as source model selection and task transfer curriculum learning.
- Published
- 2022
- Full Text
- View/download PDF
7. OTCMR: Bridging Heterogeneity Gap with Optimal Transport for Cross-modal Retrieval
- Author
-
Shao-Lun Huang, Lin Zhang, and Mingyang Li
- Subjects
Task (computing) ,Modal ,Similarity (geometry) ,Bridging (networking) ,Computer science ,Feature (computer vision) ,ComputerApplications_MISCELLANEOUS ,Feature vector ,Embedding ,Pairwise comparison ,Data mining ,computer.software_genre ,computer - Abstract
Cross-modal retrieval is a classic task in the multimedia community, which aims to search for semantically similar results from different modalities. The core of cross-modal retrieval is to learn the most correlated features in a common feature space for the multi-modal data so that the similarity can be directly measured. In this paper, we propose a novel model using optimal transport for bridging the heterogeneity gap in cross-modal retrieval tasks. Specifically, we calculate the optimal transport plans between feature distributions of different modalities and then minimize the transport cost by optimizing the feature embedding functions. In this way, the feature distributions of multi-modal data can be well aligned in the common feature space. In addition, our model combines the complementary losses in different levels: 1) semantic level, 2) distributional level, and 3) pairwise level for improving cross-modal retrieval performance. In extensive experiments, our method outperforms many other cross-modal retrieval methods, which proves the efficacy of using optimal transport in cross-modal retrieval tasks.
- Published
- 2021
8. On Distributed Hypothesis Testing with Constant-Bit Communication Constraints
- Author
-
Shao-Lun Huang and Xiangxiang Xu
- Subjects
Bit (horse) ,Computer science ,Arithmetic ,Constant (mathematics) ,Statistical hypothesis testing - Published
- 2021
9. Exact Recovery in the Balanced Stochastic Block Model with Side Information
- Author
-
Shao-Lun Huang, Feng Zhao, and Jin Sima
- Subjects
Mathematical optimization ,Computer science ,Stochastic block model ,Side information - Published
- 2021
10. TriboGait: A deep learning enabled triboelectric gait sensor system for human activity recognition and individual identification
- Author
-
Shao-Lun Huang, Jihong Yin, Jiyu Wang, Yuchao Jin, Jiarong Li, Zihan Wang, and Zihao Zhao
- Subjects
Activity recognition ,Identification (information) ,Gait (human) ,Computer science ,business.industry ,Deep learning ,Real-time computing ,Wearable computer ,Artificial intelligence ,Residual ,business ,Automation ,Triboelectric effect - Abstract
Accurately and continuously measuring and collecting data on human gait is critical for human activity recognition and individual identification, enabling various applications in smart homes/buildings, including security authentication, personal healthcare, and intelligent automation. Many sensing technologies have been investigated by researchers recently, such as camera-based, laser-based, and mobile approaches, which have limitations in particular sensing situations, such as environments with fewer privacy concerns, line-of-sight, and the use of wearables, etc. On the other hand, the floor with the embedded sensor is stable and robust to different circumstances, enabling non-intrusive gait recognition and human identification. Therefore, a triboelectric nanogenerator (TENG)-based gait sensor system installed on the floor is proposed in this paper. Our approach has many advantages in comparison to the existing gait recognition systems, including low cost, simple fabrication, lightweight, and high durability. The TENG-based sensors can be simply embedded into a smart carpet to discern mechanical motions through electrical signals. Furthermore, a deep learning model, deep residual bidirectional long short-term memory network with dense layers (Residual Dense-BiLSTM), is proposed for multichannel floor-based gait recognition. By utilizing this model to analyze the electrical outputs, our system can accurately detect various human activities and distinguish different individuals’ walking patterns, with a recognition rate over 98% and 97%, respectively. We conclude that the proposed deep learning enabled triboelectric gait sensor system has broad applications in security and healthcare.
- Published
- 2021
11. An Information Theoretic Framework for Distributed Learning Algorithms
- Author
-
Xiangxiang Xu and Shao-Lun Huang
- Subjects
Set (abstract data type) ,Statistical classification ,Computer science ,Node (networking) ,Distance education ,Feature (machine learning) ,Function (mathematics) ,Information theory ,Algorithm ,Statistical hypothesis testing - Abstract
Distributed learning is recently an important research topic, while the information theoretic optimality of the distributed learning algorithms is often not sufficiently addressed. This paper studies the distributed learning problems such that each node observes i.i.d. samples and sends a feature function of observed samples to the central machine for decision making. Both the binary hypothesis testing in information theory and the classification problems in machine learning are considered, and the optimal error exponent and the set of optimal features are characterized. By exploiting an information theoretic framework, we show that these two problems share the same set of optimal features, from which the information theoretic optimality of some machine learning algorithms can be established. Finally, we generalize our analyses to $M$ -ary distributed hypothesis testing and classification problems. A full version of this paper is accessible at: https://xiangxiangxu.com/media/documents/isit2021.pdf
- Published
- 2021
12. An Efficient Approach for Audio-Visual Emotion Recognition With Missing Labels And Missing Modalities
- Author
-
Shao-Lun Huang, Fei Ma, and Lin Zhang
- Subjects
Training set ,Modalities ,Computer science ,business.industry ,Deep learning ,computer.software_genre ,Visualization ,Multimodal learning ,Correlation ,ComputerApplications_MISCELLANEOUS ,Key (cryptography) ,Emotion recognition ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
Audio-visual emotion recognition is important for human-machine interaction systems by combining the information of audio and visual modalities. Although great progress has been made by previous works using multimodal learning compared with unimodal learning, they still cannot effectively deal with two key challenges. Firstly, it is difficult or expensive to acquire labeled emotional data, which results in a large amount of data with missing labels. Secondly, emotional data often has missing modalities. To address these problems, we propose a unified deep learning framework to efficiently handle missing labels and missing modalities for audio-visual emotion recognition through correlation analysis. Specifically, we consider four types of emotional data during the training stage: complete, label missing, visual missing, and audio missing. We propose a correlation loss based on Hirschfeld-Gebelein-Ŕenyi (HGR) maximal correlation to effectively capture the common information in different types of training data for emotion prediction. Experiments on the eNTERFACE’05 and RAVDESS datasets show that our deep learning approach has high effectiveness for audio-visual emotion recognition.
- Published
- 2021
13. Semi-Supervised Multimodal Image Translation for Missing Modality Imputation
- Author
-
Lin Zhang, Yang Li, Shiguang Ni, Fei Ma, Wangbin Sun, and Shao-Lun Huang
- Subjects
Modalities ,Modality (human–computer interaction) ,Computer science ,business.industry ,Missing data ,Machine learning ,computer.software_genre ,Autoencoder ,Multimodal learning ,Leverage (statistics) ,Image translation ,Imputation (statistics) ,Artificial intelligence ,business ,computer - Abstract
Missing data is a common problem in multimodal and multi-view learning. It raises a critical challenge for most multimodal algorithms, which are unable to deal with incomplete datasets. Rather than discarding entries with missing modalities, this paper aims to reconstruct the complete image-based multimodal data by imputing missing modalities. We solve the imputation problem as an image translation task, which transforms images in one domain to other domains. Existing image translation techniques either can not fully utilize the information contained in partially complete entries or are limited to the bimodal situation. We propose a semi-supervised algorithm for multimodal learning with missing data, namely Cyclic Autoencoder (CycAE). Specifically, a novel cyclical structure, as well as the correlation among modalities, is integrated to leverage infoπnation from complete entries to incomplete ones. Experiments on two multimodal datasets show that our model outperforms state-of-the-art models. Downstream tasks can also benefit from the completed datasets.
- Published
- 2021
14. Mining Regional Mobility Patterns for Urban Dynamic Analytics
- Author
-
Yang Li, Weixi Gu, Shao-Lun Huang, Jing Lian, and Lin Zhang
- Subjects
Optimization problem ,Computer Networks and Communications ,Computer science ,business.industry ,020206 networking & telecommunications ,02 engineering and technology ,computer.software_genre ,Partition (database) ,City region ,Beijing ,Hardware and Architecture ,Analytics ,Urbanization ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Data mining ,City management ,business ,Computer communication networks ,computer ,Software ,Information Systems - Abstract
City management plays an important role in the era of urbanization. Understanding city regions and urban mobility patterns are two vital aspects of city management. Numerous studies have been conducted on these two aspects respectively. However, few work has considered combining city region partition and mobility pattern mining together while these two problems are closely related. In this paper, we propose region-aware mobility pattern mining framework, which jointly finds the precise origin and destination region partitions while extracting mobility patterns. We formulate it as an optimization problem of maximizing OD’s correlations with spatial constraints. Kernelized ACE, is proposed to solve the problem by learning feature representations that guarantee both objectives. Evaluation results using Beijing’s taxi data show that the extracted features are appropriate for this problem and our approach outperforms all the other methods with ∼ 0.3% spatial overlap and 86.43% OD correlation. Our case studies on New York City’s urban dynamics and Beijing’s three-year consecutive analysis also yield insightful findings that reveal city-scale mobility patterns and propose potential improvement for city management.
- Published
- 2019
15. Live Gradient Compensation for Evading Stragglers in Distributed Learning
- Author
-
Linqi Song, Tian Lan, Jian Xu, and Shao-Lun Huang
- Subjects
Optimization problem ,Contextual image classification ,Computer science ,Encoding (memory) ,Work (physics) ,Convergence (routing) ,Process (computing) ,Distributed learning ,Algorithm ,Compensation (engineering) - Abstract
The training efficiency of distributed learning systems is vulnerable to stragglers, namely, those slow worker nodes. A naive strategy is performing the distributed learning by incor-porating the fastest K workers and ignoring these stragglers, which may induce high deviation for non-IID data. To tackle this, we develop a Live Gradient Compensation (LGC) strategy to incorporate the one-step delayed gradients from stragglers, aiming to accelerate learning process and utilize the stragglers simultaneously. In LGC framework, mini-batch data are divided into smaller blocks and processed separately, which makes the gradient computed based on partial work accessible. In addition, we provide theoretical convergence analysis of our algorithm for non-convex optimization problem under non-IID training data to show that LGC-SGD has almost the same convergence error as full synchronous SGD. The theoretical results also allow us to quantify a novel tradeoff in minimizing training time and error by selecting the optimal straggler threshold. Finally, extensive simulation experiments of image classification on CIFAR-10 dataset are conducted, and the numerical results demonstrate the effectiveness of our proposed strategy.
- Published
- 2021
16. Person Recognition with HGR Maximal Correlation on Multimodal Data
- Author
-
Yihua Liang, Shao-Lun Huang, Yang Li, and Fei Ma
- Subjects
Modalities ,business.industry ,Computer science ,Speech recognition ,Feature extraction ,02 engineering and technology ,computer.software_genre ,01 natural sciences ,Visualization ,Discriminative model ,Robustness (computer science) ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Identity (object-oriented programming) ,020201 artificial intelligence & image processing ,Noise (video) ,Artificial intelligence ,010306 general physics ,business ,computer ,Data integration - Abstract
Multimodal person recognition is a common task in video analysis and public surveillance, where information from multiple modalities, such as images and audio extracted from videos, are used to jointly determine the identity of a person. Previous person recognition techniques either use only uni-modal data or only consider shared representations between different input modalities, while leaving the extraction of their relationship with identity information to downstream tasks. Furthermore, real-world data often contain noise, which makes recognition more challenging practical situations. In our work, we propose a novel correlation-based multimodal person recognition framework that is relatively simple but can efficaciously learn supervised information in multimodal data fusion and resist noise. Specifically, our framework learns a discriminative embeddings of persons by joint learning visual features and audio features while maximizing HGR maximal correlation among multimodal input and persons' identities. Experiments are done on a subset of Voxceleb2. Compared with state-of-the-art methods, the proposed method demonstrates an improvement of accuracy and robustness to noise.
- Published
- 2021
17. OTCE: A Transferability Metric for Cross-Domain Cross-Task Representations
- Author
-
Shao-Lun Huang, Yang Tan, and Yang Li
- Subjects
Conditional entropy ,FOS: Computer and information sciences ,Computer Science - Machine Learning ,Artificial neural network ,business.industry ,Computer science ,Computer Science - Artificial Intelligence ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Task (project management) ,Domain (software engineering) ,Machine Learning (cs.LG) ,Artificial Intelligence (cs.AI) ,Metric (mathematics) ,Feature (machine learning) ,Domain knowledge ,Artificial intelligence ,business ,Transfer of learning ,Algorithm - Abstract
Transfer learning across heterogeneous data distributions (a.k.a. domains) and distinct tasks is a more general and challenging problem than conventional transfer learning, where either domains or tasks are assumed to be the same. While neural network based feature transfer is widely used in transfer learning applications, finding the optimal transfer strategy still requires time-consuming experiments and domain knowledge. We propose a transferability metric called Optimal Transport based Conditional Entropy (OTCE), to analytically predict the transfer performance for supervised classification tasks in such cross-domain and cross-task feature transfer settings. Our OTCE score characterizes transferability as a combination of domain difference and task difference, and explicitly evaluates them from data in a unified framework. Specifically, we use optimal transport to estimate domain difference and the optimal coupling between source and target distributions, which is then used to derive the conditional entropy of the target task (task difference). Experiments on the largest cross-domain dataset DomainNet and Office31 demonstrate that OTCE shows an average of 21% gain in the correlation with the ground truth transfer accuracy compared to state-of-the-art methods. We also investigate two applications of the OTCE score including source model selection and multi-source feature fusion., Comment: 13 pages, accepted by CVPR2021
- Published
- 2021
- Full Text
- View/download PDF
18. Lifelong Learning Based Disease Diagnosis on Clinical Notes
- Author
-
Zifeng Wang, Yefeng Zheng, Xi Chen, Shao-Lun Huang, Yifan Yang, and Rui Wen
- Subjects
Forgetting ,Computer science ,business.industry ,Deep learning ,Lifelong learning ,Context (language use) ,02 engineering and technology ,Disease ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Code (cryptography) ,Benchmark (computing) ,Artificial intelligence ,business ,Episodic memory ,computer ,0105 earth and related environmental sciences - Abstract
Current deep learning based disease diagnosis systems usually fall short in catastrophic forgetting, i.e., directly fine-tuning the disease diagnosis model on new tasks usually leads to abrupt decay of performance on previous tasks. What is worse, the trained diagnosis system would be fixed once deployed but collecting training data that covers enough diseases is infeasible, which inspires us to develop a lifelong learning diagnosis system. In this work, we propose to adopt attention to combine medical entities and context, embedding episodic memory and consolidation to retain knowledge, such that the learned model is capable of adapting to sequential disease-diagnosis tasks. Moreover, we establish a new benchmark, named Jarvis-40, which contains clinical notes collected from various hospitals. Experiments show that the proposed method can achieve state-of-the-art performance on the proposed benchmark. Code is available at https://github.com/yifyang/LifelongLearningDiseaseDiagnosis.
- Published
- 2021
19. Learning Better Representations for Audio-Visual Emotion Recognition with Common Information
- Author
-
Shao-Lun Huang, Wei Zhang, Lin Zhang, Fei Ma, and Yang Li
- Subjects
semi-supervised learning ,Computer science ,Speech recognition ,Stability (learning theory) ,02 engineering and technology ,Semi-supervised learning ,lcsh:Technology ,lcsh:Chemistry ,Discriminative model ,ComputerApplications_MISCELLANEOUS ,common information ,0202 electrical engineering, electronic engineering, information engineering ,Feature (machine learning) ,General Materials Science ,Instrumentation ,lcsh:QH301-705.5 ,Fluid Flow and Transfer Processes ,Modality (human–computer interaction) ,Artificial neural network ,business.industry ,lcsh:T ,Process Chemistry and Technology ,Deep learning ,General Engineering ,020206 networking & telecommunications ,HGR maximal correlation ,lcsh:QC1-999 ,Computer Science Applications ,lcsh:Biology (General) ,lcsh:QD1-999 ,lcsh:TA1-2040 ,020201 artificial intelligence & image processing ,Artificial intelligence ,audio-visual emotion recognition ,business ,lcsh:Engineering (General). Civil engineering (General) ,Feature learning ,lcsh:Physics - Abstract
Audio-visual emotion recognition aims to distinguish human emotional states by integrating the audio and visual data acquired in the expression of emotions. It is crucial for facilitating the affect-related human-machine interaction system by enabling machines to intelligently respond to human emotions. One challenge of this problem is how to efficiently extract feature representations from audio and visual modalities. Although progresses have been made by previous works, most of them ignore common information between audio and visual data during the feature learning process, which may limit the performance since these two modalities are highly correlated in terms of their emotional information. To address this issue, we propose a deep learning approach in order to efficiently utilize common information for audio-visual emotion recognition by correlation analysis. Specifically, we design an audio network and a visual network to extract the feature representations from audio and visual data respectively, and then employ a fusion network to combine the extracted features for emotion prediction. These neural networks are trained by a joint loss, combining: (i) the correlation loss based on Hirschfeld-Gebelein-Re´, nyi (HGR) maximal correlation, which extracts common information between audio data, visual data, and the corresponding emotion labels, and (ii) the classification loss, which extracts discriminative information from each modality for emotion prediction. We further generalize our architecture to the semi-supervised learning scenario. The experimental results on the eNTERFACE&rsquo, 05 dataset, BAUM-1s dataset, and RAVDESS dataset show that common information can significantly enhance the stability of features learned from different modalities, and improve the emotion recognition performance.
- Published
- 2020
20. Semantically Supervised Maximal Correlation For Cross-Modal Retrieval
- Author
-
Lin Zhang, Yang Li, Shao-Lun Huang, and Mingyang Li
- Subjects
Computer science ,business.industry ,Feature vector ,Feature extraction ,Pattern recognition ,02 engineering and technology ,Space (commercial competition) ,Semantics ,Task (project management) ,Modal ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Artificial intelligence ,business - Abstract
With the rapid growth of multimedia data, the cross-modal retrieval problem has attracted a lot of interest in both research and industry in recent years. However, the inconsistency of data distribution from different modalities makes such task challenging. In this paper, we propose Semantically Supervised Maximal Correlation (S2MC) method for cross-modal retrieval by incorporating semantic label information into the traditional maximal correlation framework. Combining with maximal correlation based method for extracting unsupervised pairing information, our method effectively exploits supervised semantic information on both common feature space and label space. Extensive experiments show that our method outperforms other current state-of-the-art methods on cross-modal retrieval tasks on three widely used datasets.
- Published
- 2020
21. Online Disease Self-diagnosis with Inductive Heterogeneous Graph Convolutional Networks
- Author
-
Shao-Lun Huang, Buyue Qian, Zifeng Wang, Rui Wen, Shilei Cao, Yefeng Zheng, and Xi Chen
- Subjects
FOS: Computer and information sciences ,Service (systems architecture) ,Computer Science - Machine Learning ,Information retrieval ,Computer science ,Graph neural networks ,Computer Science - Artificial Intelligence ,02 engineering and technology ,Disease ,010501 environmental sciences ,Tracing ,Semantics ,01 natural sciences ,Machine Learning (cs.LG) ,Computer Science - Information Retrieval ,Set (abstract data type) ,Artificial Intelligence (cs.AI) ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Graph (abstract data type) ,Information Retrieval (cs.IR) ,0105 earth and related environmental sciences - Abstract
We propose a Healthcare Graph Convolutional Network (HealGCN) to offer disease self-diagnosis service for online users based on Electronic Healthcare Records (EHRs). Two main challenges are focused in this paper for online disease diagnosis: (1) serving cold-start users via graph convolutional networks and (2) handling scarce clinical description via a symptom retrieval system. To this end, we first organize the EHR data into a heterogeneous graph that is capable of modeling complex interactions among users, symptoms and diseases, and tailor the graph representation learning towards disease diagnosis with an inductive learning paradigm. Then, we build a disease self-diagnosis system with a corresponding EHR Graph-based Symptom Retrieval System (GraphRet) that can search and provide a list of relevant alternative symptoms by tracing the predefined meta-paths. GraphRet helps enrich the seed symptom set through the EHR graph when confronting users with scarce descriptions, hence yield better diagnosis accuracy. At last, we validate the superiority of our model on a large-scale EHR dataset.
- Published
- 2020
22. A Local Characterization for Wyner Common Information
- Author
-
Lizhong Zheng, Shao-Lun Huang, Xiangxiang Xu, Gregory W. Wornell, and Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
- Subjects
business.industry ,Computer science ,Information processing ,Pattern recognition ,010103 numerical & computational mathematics ,Characterization (mathematics) ,01 natural sciences ,010305 fluids & plasmas ,Feature (computer vision) ,0103 physical sciences ,Common knowledge ,Artificial intelligence ,0101 mathematics ,business ,Random variable ,MNIST database - Abstract
While the Hirschfeld-Gebelein-Renyi (HGR) maximal correlation and the Wyner common information share similar information processing purposes of extracting common knowledge structures between random variables, the relationships between these approaches are generally unclear. In this paper, we demonstrate such relationships by considering the Wyner common information in the weakly dependent regime, called ϵ-common information. We show that the HGR maximal correlation functions coincide with the relative likelihood functions of estimating the auxiliary random variables in ϵ-common information, which establishes the fundamental connections these approaches. Moreover, we extend the ϵ-common information to multiple random variables, and derive a novel algorithm for extracting feature functions of data variables regarding their common information. Our approach is validated by the MNIST problem, and can potentially be useful in multi-modal data analyses.
- Published
- 2020
23. An Information-theoretic Approach to Unsupervised Feature Selection for High-Dimensional Data
- Author
-
Shao-Lun Huang
- Subjects
Clustering high-dimensional data ,business.industry ,Computer science ,Pattern recognition ,Feature selection ,Artificial intelligence ,business - Published
- 2019
24. Anomaly detection in surface mount technology process using multi-modal data
- Author
-
Yue Zhang, Hanling Wang, Mingyang Li, Shao-Lun Huang, and Lin Zhang
- Subjects
Surface-mount technology ,Computer science ,business.industry ,05 social sciences ,Multi modal data ,Process (computing) ,Solder paste ,Pattern recognition ,010501 environmental sciences ,01 natural sciences ,Constant false alarm rate ,0502 economics and business ,Surface mounting ,Factory (object-oriented programming) ,Anomaly detection ,Artificial intelligence ,050207 economics ,business ,0105 earth and related environmental sciences - Abstract
Anomaly detection is an important area for both research and real-world applications. In the surface mounting technology (SMT) process, the defectives of solder paste printing need to be detected immediately or it may cause great effort for recycling and slow down the whole process. In this paper, we propose a novel model, MM-DNN, for anomaly detection with multi-modal data. We collect a multi-modal dataset from different sensors in the factory. Our method efficiently extracts both predictive features for classification and correlative features between multi-modal data to achieve a higher detection rate. As shown in the experiment, our method can further reduce 77% false alarm rate of the detection result in the factory while keeping 95% of real defectives be correctly detected.
- Published
- 2019
25. Maximal Correlation Embedding Network for Multilabel Learning with Missing Labels
- Author
-
Shao-Lun Huang, Xiangxiang Xu, Yang Li, Lin Zhang, and Lu Li
- Subjects
Similarity (geometry) ,Computer science ,business.industry ,Pattern recognition ,02 engineering and technology ,010501 environmental sciences ,Space (commercial competition) ,01 natural sciences ,Regularization (mathematics) ,Matrix decomposition ,Set (abstract data type) ,ComputingMethodologies_PATTERNRECOGNITION ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,Artificial intelligence ,business ,0105 earth and related environmental sciences - Abstract
Multilabel learning, the problem of mapping each data instance to a subset of labels, appears frequently in many real-world applications. However, obtaining complete label annotation for every instance requires tremendous efforts, especially when the label set is large. As a result, multilabel learning with missing labels remains as a common challenge. Existing works either cannot handle missing labels or lack nonlinear expressiveness and scalability to large label set. In this paper, we present a novel end-to-end solution for multilabel learning with missing labels. Our algorithm, Maximal Correlation Embedding Network learns a low dimensional label embedding using an encoder-decoder architecture. It exploits label similarity through a maximal correlation regularization in the embedded label space to reduce the classification bias due to missing labels. A series of experiments on popular multilabel datasets demonstrate that our approach outperforms state of the art, both in complete data and partially observed data.
- Published
- 2019
26. An Information Theoretic Interpretation to Deep Neural Networks
- Author
-
Shao-Lun Huang, Xiangxiang Xu, Lizhong Zheng, and Gregory W. Wornell
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Optimization problem ,deep neural network ,information theory ,local information geometry ,feature extraction ,Computer science ,Science ,QC1-999 ,Computer Science - Information Theory ,Computation ,Computer Science::Neural and Evolutionary Computation ,General Physics and Astronomy ,Inference ,Feature selection ,02 engineering and technology ,Astrophysics ,01 natural sciences ,Article ,Machine Learning (cs.LG) ,0202 electrical engineering, electronic engineering, information engineering ,0101 mathematics ,business.industry ,Physics ,Information Theory (cs.IT) ,QB460-466 ,010101 applied mathematics ,Deep neural networks ,020201 artificial intelligence & image processing ,Artificial intelligence ,business - Abstract
It is commonly believed that the hidden layers of deep neural networks (DNNs) attempt to extract informative features for learning tasks. In this paper, we formalize this intuition by showing that the features extracted by DNN coincide with the result of an optimization problem, which we call the `universal feature selection' problem, in a local analysis regime. We interpret the weights training in DNN as the projection of feature functions between feature spaces, specified by the network structure. Our formulation has direct operational meaning in terms of the performance for inference tasks, and gives interpretations to the internal computation results of DNNs. Results of numerical experiments are provided to support the analysis., Accepted to ISIT 2019
- Published
- 2019
27. An End-to-End Learning Approach for Multimodal Emotion Recognition: Extracting Common and Private Information
- Author
-
Shao-Lun Huang, Wei Zhang, Fei Ma, Lin Zhang, and Yang Li
- Subjects
Modality (human–computer interaction) ,Computer science ,business.industry ,Feature extraction ,020207 software engineering ,02 engineering and technology ,Machine learning ,computer.software_genre ,Visualization ,End-to-end principle ,0202 electrical engineering, electronic engineering, information engineering ,Task analysis ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Private information retrieval - Abstract
Multimodal emotion recognition is important for facilitating efficient interaction between humans and machines. To better detect emotional states from multimodal data, we need to effectively extract both the common information that captures dependencies among different modalities, and the private information that characterizes variations in each modality. However, existing works are mostly designed to pursue either one of these objectives but not both. In our work, we propose an end-to-end learning approach to simultaneously extract the common and private information for multimodal emotion recognition. Specifically, we use a correlation loss based on Hirschfeld-Gebelein-Renyi (HGR) maximal correlation and a reconstruction loss based on autoencoders to preserve the common and private information, respectively. Experimental results on eNTERFACE'05 database and RML database demonstrate the effectiveness of our proposed approach.
- Published
- 2019
28. Unsupervised anomaly detection via generative adversarial networks
- Author
-
Fei Ma, Shao-Lun Huang, Hanling Wang, Mingyang Li, and Lin Zhang
- Subjects
Clustering high-dimensional data ,Discriminator ,Computer science ,business.industry ,010401 analytical chemistry ,Feature extraction ,Pattern recognition ,010501 environmental sciences ,01 natural sciences ,Class (biology) ,0104 chemical sciences ,Visualization ,Anomaly detection ,Artificial intelligence ,Transfer of learning ,business ,0105 earth and related environmental sciences ,Test data - Abstract
Unsupervised anomaly detection is a fundamental problem in various research areas and application domains, namely the discrimination of abnormal samples from normal samples where training data are only composed of one class (normal) while testing data contains both among which the majority are normal samples. However, previous works can not effectively fit the distribution of high dimensional data and suffers from low AUC scores which measures the classification performance of imbalanced data. To solve these problems, we propose an unsupervised anomaly detection model based on GAN, i.e., UAD-GAN. Specifically, we adopt transfer learning to extract visual features with pre-trained Inception-v3 model and use the discriminator to detect anomalies. UAD-GAN can fit the data distribution and detect anomalies efficiently. Extensive experiments show that UAD-GAN achieves state-of-the-art performance compared to other approaches.
- Published
- 2019
29. Info-Detection: An Information-Theoretic Approach to Detect Outlier
- Author
-
Shao-Lun Huang, Feng Zhao, Yang Li, Lin Zhang, and Fei Ma
- Subjects
Sequence ,Computer science ,Principal (computer security) ,0102 computer and information sciences ,02 engineering and technology ,computer.software_genre ,01 natural sciences ,Partition (database) ,ComputingMethodologies_PATTERNRECOGNITION ,010201 computation theory & mathematics ,Outlier ,0202 electrical engineering, electronic engineering, information engineering ,Unsupervised learning ,020201 artificial intelligence & image processing ,Anomaly detection ,Data mining ,Cluster analysis ,computer - Abstract
Outlier detection is one of major task in unsupervised learning. We propose a cluster analysis based outlier detection method called Info-Detection. Info-Detection determines the number of outliers automatically and captures the global property of the provided data. To implement Info-Detection and overcome the global computational complexity, we use principal sequence of partition, which we improve one order of magnitude faster than the original version. Experiments show that compared with other outlier detection methods, Info-Detection achieves better accuracy with an affordable time overhead.
- Published
- 2019
30. Reproducing Scientific Experiment with Cloud DevOps
- Author
-
Lin Zhang, Shao-Lun Huang, Xingzhi Niu, and Feng Zhao
- Subjects
FOS: Computer and information sciences ,Focus (computing) ,Computer science ,business.industry ,05 social sciences ,050301 education ,Scientific experiment ,Cloud computing ,02 engineering and technology ,Data science ,Computer Science - Distributed, Parallel, and Cluster Computing ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Distributed, Parallel, and Cluster Computing (cs.DC) ,DevOps ,business ,0503 education - Abstract
The reproducibility of scientific experiment is vital for the advancement of disciplines based on previous work. To achieve this goal, many researchers focus on complex methodology and self-invented tools which have difficulty in practical usage. In this article, we introduce the Cloud DevOps infrastructure from software engineering community and shows how it can be used effectively for heterogeneous agents to reproduce experiments for computer science related disciplines. DevOps can be enabled using freely available cloud computing machines for medium-sized experiment and self-hosted computing engines for large-scale computing, thus powering researchers to share their experiment result with others in a more reliable way.
- Published
- 2019
- Full Text
- View/download PDF
31. Euclidean Information Theory of Networks
- Author
-
Shao-Lun Huang, Changho Suh, and Lizhong Zheng
- Subjects
FOS: Computer and information sciences ,Theoretical computer science ,Computer science ,business.industry ,Information Theory (cs.IT) ,Computer Science - Information Theory ,Distributed computing ,Throughput ,Construct (python library) ,Library and Information Sciences ,Information theory ,Telecommunications network ,Computer Science Applications ,Variety (cybernetics) ,Spread spectrum ,Transmission (telecommunications) ,Euclidean geometry ,business ,Linear optimization problem ,Throughput (business) ,Communication channel ,Computer network ,Information Systems ,Mathematics - Abstract
In this paper, we extend the information theoretic framework that was developed in earlier work to multi-hop network settings. For a given network, we construct a novel deterministic model that quantifies the ability of the network in transmitting private and common messages across users. Based on this model, we formulate a linear optimization problem that explores the throughput of a multi-layer network, thereby offering the optimal strategy as to what kind of common messages should be generated in the network to maximize the throughput. With this deterministic model, we also investigate the role of feedback for multi-layer networks, from which we identify a variety of scenarios in which feedback can improve transmission efficiency. Our results provide fundamental guidelines as to how to coordinate cooperation between users to enable efficient information exchanges across them., to appear in the IEEE Transactions on Information Theory
- Published
- 2015
32. Joint Mobility Pattern Mining with Urban Region Partitions
- Author
-
Weixi Gu, Lin Zhang, Shao-Lun Huang, Jing Lian, and Yang Li
- Subjects
Urban region ,050210 logistics & transportation ,business.industry ,Computer science ,05 social sciences ,020206 networking & telecommunications ,02 engineering and technology ,Destinations ,computer.software_genre ,Partition (database) ,Biclustering ,Beijing ,Joint mobility ,Urban planning ,Public transport ,0502 economics and business ,0202 electrical engineering, electronic engineering, information engineering ,Data mining ,business ,computer - Abstract
Mobility pattern mining answers the fundamental question of where people are likely to go from a given location. It plays an important role in city planning, public transport management and location-based mobile applications. Among these applications, many concern the mobility pattern over contiguous spatial regions as a whole. Traditional ways of mobility pattern mining either result in trip clusters with overlapped origin and destination regions, or require an extra step to partition the city into discrete regions, which may not be optimal for mobility pattern extraction. In this paper, we present a region-aware mobility pattern mining framework to jointly extract trip clusters while maintaining non-overlapping partitions of trip origins and destinations. We developed kernelized ACE, a novel extension to a classic algorithm in statistics to compute the optimal mobility clusters under spatial constraints. Experimental results using Beijing taxi trip data show that our approach outperforms other methods with only ~ 0.3% spatial overlap and 86.43% origin-destination correlation. Our case studies on New York City's and Beijing's taxi datasets also yield insightful findings that reveal city-scale mobility patterns and propose potential improvement for public transportation.
- Published
- 2018
33. Speech Emotion Recognition via Attention-based DNN from Multi-Task Learning
- Author
-
Shao-Lun Huang, Wei Zhang, Fei Ma, Weixi Gu, Shiguang Ni, and Lin Zhang
- Subjects
Dependency (UML) ,Computer science ,media_common.quotation_subject ,Speech recognition ,Multi-task learning ,020206 networking & telecommunications ,02 engineering and technology ,Anger ,Sadness ,0202 electrical engineering, electronic engineering, information engineering ,Happiness ,020201 artificial intelligence & image processing ,Emotion recognition ,Construct (philosophy) ,media_common - Abstract
Speech unlocks the huge potentials in emotion recognition. High accurate and real-time understanding of human emotion via speech assists Human-Computer Interaction. Previous works are often limited in either coarse-grained emotion learning tasks or the low precisions on the emotion recognition. To solve these problems, we construct a real-world large-scale corpus composed of 4 common emotions (i.e., anger, happiness, neutral and sadness). We also propose a multi-task attention-based DNN model (i.e., MT-A-DNN) on the emotion learning. MT-A-DNN efficiently learns the high-order dependency and non-linear correlations underlying in the audio data. Extensive experiments show that MT-A-DNN outperforms conventional methods on the emotion recognition. It could take one step further on the real-time acoustic emotion recognition in many smart audio-devices.
- Published
- 2018
34. Multimodal Emotion Recognition by extracting common and modality-specific information
- Author
-
Shiguang Ni, Weixi Gu, Fei Ma, Shao-Lun Huang, Wei Zhang, and Lin Zhang
- Subjects
Modality (human–computer interaction) ,business.industry ,Computer science ,Specific-information ,SIGNAL (programming language) ,Inference ,020206 networking & telecommunications ,02 engineering and technology ,computer.software_genre ,Data set ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Emotion recognition ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
Emotion recognition technologies have been widely used in numerous areas including advertising, healthcare and online education. Previous works usually recognize the emotion from either the acoustic or the visual signal, yielding unsatisfied performances and limited applications. To improve the inference capability, we present a multimodal emotion recognition model, EMOdal. Apart from learning the audio and visual data respectively, EMOdal efficiently learns the common and modality-specific information underlying the two kinds of signals, and therefore improves the inference ability. The model has been evaluated on our large-scale emotional data set. The comprehensive evaluations demonstrate that our model outperforms traditional approaches.
- Published
- 2018
35. Attention-based LSTM-CNNs For Time-series Classification
- Author
-
Lin Zhang, Weixi Gu, Qianjin Du, and Shao-Lun Huang
- Subjects
Time series classification ,Dependency (UML) ,Series (mathematics) ,Computer science ,business.industry ,020206 networking & telecommunications ,Pattern recognition ,02 engineering and technology ,Field (computer science) ,Convolution ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,business - Abstract
Time series classification is a critical problem in the machine learning field, which spawns numerous research works on it. In this work, we propose AttLSTM-CNNs, an attention-based LSTM network and convolution network that jointly extracts the underlying pattern among the time-series for the classification. The attention-based LSTM automatically captures the long-term temporal dependency among the series, and the CNN describes the spatial sparsity and heterogeneity in the data. The extensive experiments show that the proposed model outperforms the other methods for time-series classification.
- Published
- 2018
36. The Geometric Structure of Generalized Softmax Learning
- Author
-
Lin Zhang, Xiangxiang Xu, Shao-Lun Huang, and Lizhong Zheng
- Subjects
Computer Science::Machine Learning ,Statistics::Machine Learning ,Computer simulation ,Artificial neural network ,Computer science ,Softmax function ,Symmetric extension ,Equivalence (formal languages) ,Algorithm ,Random variable ,Regression problems ,Data modeling - Abstract
In this paper, we formulate the generalized softmax learning (GSL) problem, as a symmetric extension of the softmax regression problem. We further study the geometric structure of GSL and demonstrate the equivalence of GSL and the original softmax regression problem. Besides, this geometric structure indicates the symmetry between a neural network and its reverse network, and the symmetric roles of the weights and feature in a neural network. Finally, we present a numerical simulation to verify these symmetry properties in neural networks.
- Published
- 2018
37. A Spectral Reconstruction Algorithm of Miniature Spectrometer Based on Sparse Optimization and Dictionary Learning
- Author
-
Shang Zhang, Lin Zhang, Yuhan Dong, Shao-Lun Huang, and Hongyan Fu
- Subjects
Computer science ,filter-based miniature spectrometer ,02 engineering and technology ,System of linear equations ,lcsh:Chemical technology ,01 natural sciences ,Biochemistry ,Spectral line ,Article ,spectral reconstruction ,Analytical Chemistry ,010309 optics ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,sparse optimization ,dictionary learning ,lcsh:TP1-1185 ,Electrical and Electronic Engineering ,Instrumentation ,Spectrometer ,020206 networking & telecommunications ,Reconstruction algorithm ,Atomic and Molecular Physics, and Optics ,Transmission (telecommunications) ,Filter (video) ,Algorithm - Abstract
The miniaturization of spectrometer can broaden the application area of spectrometry, which has huge academic and industrial value. Among various miniaturization approaches, filter-based miniaturization is a promising implementation by utilizing broadband filters with distinct transmission functions. Mathematically, filter-based spectral reconstruction can be modeled as solving a system of linear equations. In this paper, we propose an algorithm of spectral reconstruction based on sparse optimization and dictionary learning. To verify the feasibility of the reconstruction algorithm, we design and implement a simple prototype of a filter-based miniature spectrometer. The experimental results demonstrate that sparse optimization is well applicable to spectral reconstruction whether the spectra are directly sparse or not. As for the non-directly sparse spectra, their sparsity can be enhanced by dictionary learning. In conclusion, the proposed approach has a bright application prospect in fabricating a practical miniature spectrometer.
- Published
- 2018
38. Communication Theoretic Data Analytics
- Author
-
Lizhong Zheng, Kwang-Cheng Chen, H. Vincent Poor, and Shao-Lun Huang
- Subjects
FOS: Computer and information sciences ,Information transfer ,Social network ,Computer Networks and Communications ,business.industry ,Computer science ,Information Theory (cs.IT) ,Computer Science - Information Theory ,Big data ,computer.software_genre ,Information theory ,Data analysis ,The Internet ,Data mining ,Electrical and Electronic Engineering ,business ,computer - Abstract
Widespread use of the Internet and social networks invokes the generation of big data, which is proving to be useful in a number of applications. To deal with explosively growing amounts of data, data analytics has emerged as a critical technology related to computing, signal processing, and information networking. In this paper, a formalism is considered in which data is modeled as a generalized social network and communication theory and information theory are thereby extended to data analytics. First, the creation of an equalizer to optimize information transfer between two data variables is considered, and financial data is used to demonstrate the advantages. Then, an information coupling approach based on information geometry is applied for dimensionality reduction, with a pattern recognition example to illustrate the effectiveness. These initial trials suggest the potential of communication theoretic data analytics for a wide range of applications., Published in IEEE Journal on Selected Areas in Communications, Jan. 2015
- Published
- 2015
39. An information-theoretic approach to unsupervised feature selection for high-dimensional data
- Author
-
Shao-Lun Huang, Lin Zhang, Lizhong Zheng, and Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
- Subjects
Clustering high-dimensional data ,FOS: Computer and information sciences ,Computer Science - Machine Learning ,Generalization ,Computer science ,Computer Science - Information Theory ,Feature selection ,02 engineering and technology ,010501 environmental sciences ,Conditional expectation ,Information theory ,01 natural sciences ,Machine Learning (cs.LG) ,Joint probability distribution ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science::Databases ,0105 earth and related environmental sciences ,Artificial neural network ,business.industry ,Information Theory (cs.IT) ,020206 networking & telecommunications ,Pattern recognition ,Feature (computer vision) ,Unsupervised learning ,Total correlation ,Artificial intelligence ,business ,Random variable - Abstract
In this paper, we propose an information-theoretic approach to design the functional representations to extract the hidden common structure shared by a set of random variables. The main idea is to measure the common information between the random variables by Watanabe's total correlation, and then find the hidden attributes of these random variables such that the common information is reduced the most given these attributes. We show that these attributes can be characterized by an exponential family specified by the eigen-decomposition of some pairwise joint distribution matrix. Then, we adopt the log-likelihood functions for estimating these attributes as the desired functional representations of the random variables, and show that such representations are informative to describe the common structure. Moreover, we design both the multivariate alternating conditional expectation (MACE) algorithm to compute the proposed functional representations for discrete data, and a novel neural network training approach for continuous or high-dimensional data. Furthermore, we show that our approach has deep connections to existing techniques, such as Hirschfeld-Gebelein-R\'{e}nyi (HGR) maximal correlation, linear principal component analysis (PCA), and consistent functional map, which establishes insightful connections between information theory and machine learning. Finally, the performances of our algorithms are validated by numerical simulations., Comment: 35 pages; Submitted to IEEE Journal on Selected Areas in Information Theory (JSAIT)
- Published
- 2017
40. An information-theoretic approach to universal feature selection in high-dimensional inference
- Author
-
Shao-Lun Huang, Anuran Makur, Lizhong Zheng, and Gregory W. Wornell
- Subjects
Theoretical computer science ,Training set ,Computer science ,Feature extraction ,Universality (philosophy) ,Inference ,020206 networking & telecommunications ,Feature selection ,02 engineering and technology ,010501 environmental sciences ,Conditional expectation ,Information theory ,01 natural sciences ,Universality (dynamical systems) ,0202 electrical engineering, electronic engineering, information engineering ,Information geometry ,0105 earth and related environmental sciences - Abstract
We develop an information theoretic framework for addressing feature selection in applications where the inference task is not specified in advance and the data is from a large alphabet. We introduce a natural notion of universality for such problems, and show that locally optimal solutions are straight forward to obtain, admit natural interpretations via information geometry, have computationally efficient implementations, and represent a practically useful learning methodology. Our development also reveals the key role of Hirschfeld-Gebelein-Renyi maximal correlation and the alternating conditional expectations (ACE) algorithm in such problems.
- Published
- 2017
41. Analysis and evaluation of driving behavior recognition based on a 3-axis accelerometer using a random forest approach
- Author
-
Kai Zhang, Xin Lin, Yuhan Dong, Wangjing Cao, Lin Zhang, and Shao-Lun Huang
- Subjects
Exploit ,Computer science ,business.industry ,010401 analytical chemistry ,0206 medical engineering ,02 engineering and technology ,Accelerometer ,Behavior recognition ,020601 biomedical engineering ,01 natural sciences ,0104 chemical sciences ,Random forest ,Acceleration ,Robustness (computer science) ,Computer vision ,Artificial intelligence ,business ,Motion sensors - Abstract
Understanding human drivers' behavior is critical for the self-driving cars, and has been intensively studied in the past decade. We exploit the widely available camera and motion sensor data from car recorders, and propose a hybrid method of recognizing driving events based on the random forest approach. The classification results are analyzed by comparing different features, classifiers and filters. A high accuracy of 98.1% on driving behavior classification is obtained and the robustness is verified on a dataset including 2400 driving events.
- Published
- 2017
42. Zoning by mobility: evaluating city administrative regions by taxi data
- Author
-
Lin Zhang, Shao-Lun Huang, and Liandong Zhou
- Subjects
050210 logistics & transportation ,Exploit ,Computer science ,Urbanization ,0502 economics and business ,05 social sciences ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Context (language use) ,02 engineering and technology ,Zoning ,Cluster analysis ,Environmental planning - Abstract
The accelerating urbanization procedure is putting increasing pressure on the management of cities. The administrative zones by which a city is managed are setup based on historical or political reasons, while the dynamics of people is hardly considered in the context. We exploit the widely available mobility data to divide the urban areas into zones by the joint K-mean clustering in origin and destination spaces. The method is evaluated with the New York City and Shenzhen taxi data, and the created zones are compared with the current static zoning plans of the city to evaluate the effectiveness.
- Published
- 2017
43. Communication theoretic inference on heterogeneous data
- Author
-
Shao-Lun Huang, Baturalp Mankir, H. Vincent Poor, Lizhong Zheng, and Kwang-Cheng Chen
- Subjects
Training set ,Computer science ,business.industry ,Algorithmic learning theory ,Model selection ,05 social sciences ,Big data ,Inference ,050801 communication & media studies ,020206 networking & telecommunications ,02 engineering and technology ,Machine learning ,computer.software_genre ,Communication theory ,0508 media and communications ,Knowledge extraction ,Statistical learning theory ,0202 electrical engineering, electronic engineering, information engineering ,Data analysis ,Data mining ,Artificial intelligence ,Statistical theory ,business ,computer - Abstract
Statistical learning has attracted considerable recent research interest due to the wide-ranging demands of big data analytics. The recent introduction of communication theory and information coupling theory into this area suggests a new perspective on statistical learning and inference for data analytics. This paper investigates inference of one data variable from heterogeneous data variables, a problem that plays an increasingly important role in the emerging applications of big data analytics. To generalize the existing conceptual approach, information coupling filtering under hidden data structure or unknown knowledge of interactions among data variables is developed. A least-mean-squares (LMS) filtering approach for non-stationary data similar to an equalizer is suggested, while the training data gives the depth of the filter analogously to model selection in learning theory. The information combining in diversity communication is extended to fuse more data variables for even greater precision of inference. Extending from multiuser detection, an algorithm based on Multiple Signal Classification (MUSIC) is demonstrated to identify useful data variables for inference, as a novel solution to knowledge discovery. A series of examples illustrate the effectiveness of this framework, suggesting that statistical communication theory and statistical signal processing can substantially contribute to statistical learning theory.
- Published
- 2016
44. A spectrum decomposition to the feature spaces and the application to big data analytics
- Author
-
Shao-Lun Huang and Lizhong Zheng
- Subjects
Sequence ,business.industry ,Computer science ,Node (networking) ,Feature vector ,Feature extraction ,Pattern recognition ,Feature selection ,computer.software_genre ,Feature (computer vision) ,Principal component analysis ,Artificial intelligence ,Data mining ,business ,Hidden Markov model ,computer - Abstract
In this paper, we investigate how to efficiently extract informative features of high-dimensional data through noisy channels. Specifically, we decompose the feature space of the data into a sequence of score functions with decreasing information volumes, such that different scores are uncorrelated. From this decomposition, the features of the data become a sequence of score functions such that the most informative lowdimensional feature can be selected as the first few scores. This greatly simplifies the feature selection problem. In addition, we apply this spectrum decomposition to data with high-dimensional structures, i.e., the hidden Markov model (HMM). We show that in HMM, it is desirable to consider a particular class of score functions called as the node scores, which allows us to efficiently extract informative features of the hidden variables by applying the spectrum decomposition approach. Finally, we develop efficient algorithms to extract such features from node scores, and present an example to illustrate the performance of the node scores.
- Published
- 2015
45. Information cascades in social networks via dynamic system analyses
- Author
-
Kwang-Cheng Chen and Shao-Lun Huang
- Subjects
Structure (mathematical logic) ,Dynamic network analysis ,Social network ,business.industry ,Computer science ,Bayesian probability ,Decision rule ,Machine learning ,computer.software_genre ,Network formation ,Influence diagram ,Artificial intelligence ,Information cascade ,business ,computer - Abstract
Systematically analyzing the dynamic behaviors of social networks is one of the central topic in understanding the structure of large networks. In particular, the information cascade [1] introduced by Banerjee provides great insights in characterizing the opinion exchanging between network agents. Traditionally studies of information cascades focus on the Bayesian models, which are often difficult to model real world situations. In this paper, we attempt to study the information cascades from a non-Bayesian point of view. In particular, we consider a sequential decision model but with an arbitrary decision rule. We show that the fraction of agents in a network making any specific decision will converge. Thus, the agents in the network reach a sort of consensus with high probability, which allows us to predict the herd behaviors. In addition, we also apply our non-Bayesian model to different network structures, such as ER model and network with communities, in which the affect of information cascades are quantified. Finally, we simulate the decision process for multiple communities, which justifies our proposed model to comprehend real world complex user behaviors and dynamics.
- Published
- 2015
46. On Locally Decodable Source Coding
- Author
-
Shao-Lun Huang, Muriel Medard, Yury Polyanskiy, and Ali Makhdoumi
- Subjects
FOS: Computer and information sciences ,Lossless compression ,Discrete mathematics ,Source code ,Computer science ,Information Theory (cs.IT) ,Computer Science - Information Theory ,media_common.quotation_subject ,Concatenation ,Data compression ratio ,Data_CODINGANDINFORMATIONTHEORY ,Lossy compression ,Block error ,Distortion ,Code (cryptography) ,Encoder ,Algorithm ,Computer Science::Information Theory ,media_common - Abstract
With the boom of big data, traditional source coding techniques face the common obstacle to decode only a small portion of information efficiently. In this paper, we aim to resolve this difficulty by introducing a specific type of source coding scheme called locally decodable source coding (LDSC). Rigorously, LDSC is capable of recovering an arbitrary bit of the unencoded message from its encoded version, by only feeding a small number of the encoded message to the decoder, and we call the decoder t-local if only t encoded symbols are required.We consider both almost lossless (block error) and lossy (bit error) cases for LDSC. First, we show that using linear encoder and a decoder with bounded locality, the reliable compress rate can not be less than one. More importantly, we show that even with a general encoder and 2-local decoders (t = 2), the rate of LDSC is still one. On the contrary, the achievability bounds for almost lossless and lossy compressions with excess distortion suggest that optimal compression rate is achievable when O(log n) encoded symbols is queried by the decoder with block-length n. We also show that, rate distortion is achievable when the number of queries is scaled over n with a bound on the rate in finite-length regime. Although the achievability bounds are simply based on the concatenation of code blocks, they outperform the existing bounds in succinct data structures literature.
- Published
- 2013
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.