11 results on '"Shao-Lun Huang"'
Search Results
2. AC-SGD: Adaptively Compressed SGD for Communication-Efficient Distributed Learning
- Author
-
Guangfeng Yan, Tan Li, Shao-Lun Huang, Tian Lan, and Linqi Song
- Subjects
Computer Networks and Communications ,Electrical and Electronic Engineering - Published
- 2022
3. On Distributed Learning With Constant Communication Bits
- Author
-
Shao-Lun Huang and Xiangxiang Xu
- Subjects
FOS: Computer and information sciences ,Statistics - Machine Learning ,Information Theory (cs.IT) ,Computer Science - Information Theory ,Machine Learning (stat.ML) - Abstract
In this paper, we study a distributed learning problem constrained by constant communication bits. Specifically, we consider the distributed hypothesis testing (DHT) problem where two distributed nodes are constrained to transmit a constant number of bits to a central decoder. In such cases, we show that in order to achieve the optimal error exponents, it suffices to consider the empirical distributions of observed data sequences and encode them to the transmission bits. With such a coding strategy, we develop a geometric approach in the distribution spaces and establish an inner bound of error exponent regions. In particular, we show the optimal achievable error exponents and coding schemes for the following cases: (i) both nodes can transmit $\log_23$ bits; (ii) one of the nodes can transmit $1$ bit, and the other node is not constrained; (iii) the joint distribution of the nodes are conditionally independent given one hypothesis. Furthermore, we provide several numerical examples for illustrating the theoretical results. Our results provide theoretical guidance for designing practical distributed learning rules, and the developed approach also reveals new potentials for establishing error exponents for DHT with more general communication constraints., Submitted to JSAIT
- Published
- 2022
4. Triboelectric nanogenerators enabled internet of things: A survey
- Author
-
Shao-Lun Huang, Wenbo Ding, Jiarong Li, Zihan Wang, Haixu Shen, Changsheng Wu, Xiaoyue Ni, and Ishara Dharmasena
- Subjects
Wireless transmission ,business.industry ,Software deployment ,Scientific development ,Computer science ,Information technology ,Context (language use) ,business ,Telecommunications ,Internet of Things ,Triboelectric effect - Abstract
As pioneering information technology, the Internet of Things (loT) targets at building an infrastructure of embedded devices and networks of connected objects, to offer omnipresent ecosystem and interaction across billions of smart devices, sensors, and actuators. The deployment of IoT calls for decentralized power supplies, self-powered sensors, and wireless transmission technologies, which have brought both opportunities and challenges to the existing solutions, especially when the network scales up. The Triboelectric Nanogenerators (TENGs), recently developed for mechanical energy harvesting and mechanical-to-electrical signal conversion, have the natural properties of energy and information, which have demonstrated high potentials in various applications of IoT. This context provides a comprehensive review of TENG enabled IoT and discusses the most popular and significant divisions. Firstly, the basic principle of TENG is re-examined in this article. Subsequently, a comprehensive and detailed review is given to the concept of IoT, followed by the scientific development of the TENG enabled IoT. Finally, the future of this evolving area is addressed.
- Published
- 2020
5. On the Optimal Tradeoff Between Computational Efficiency and Generalizability of Oja’s Algorithm
- Author
-
Shao-Lun Huang and Xiangxiang Xu
- Subjects
Optimal design ,Noise ,General Computer Science ,Rate of convergence ,Sampling (signal processing) ,Computer science ,General Engineering ,General Materials Science ,Generalizability theory ,Variation (game tree) ,Algorithm ,MNIST database ,Eigenvalues and eigenvectors - Abstract
The Oja's algorithm is widely applied for computing principal eigenvectors in real problems, and it is practically useful to understand the theoretical relationships between the learning rate, convergence rate, and generalization error of this algorithm for noisy samples. In this paper, we show that under mild assumptions of sampling noise, both the generalization error and the convergence rate reveal linear relationships with the learning rate in the large sample size and small learning rate regime. In addition, when the algorithm nearly converges, we provide a refined characterization of the generalization error, which suggests the optimal design for the learning rate for data with noise. Moreover, we investigate the minibatch variation of Oja's algorithm and demonstrate that the learning rate of minibatch training is decayed by a factor characterized by the batch size, which provides theoretical insights and guidance for designing the learning rate in minibatch training algorithms. Finally, our theoretical results are validated by experiments on both synthesized data and the MNIST dataset.
- Published
- 2020
6. A Local Characterization for Wyner Common Information
- Author
-
Lizhong Zheng, Shao-Lun Huang, Xiangxiang Xu, Gregory W. Wornell, and Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
- Subjects
business.industry ,Computer science ,Information processing ,Pattern recognition ,010103 numerical & computational mathematics ,Characterization (mathematics) ,01 natural sciences ,010305 fluids & plasmas ,Feature (computer vision) ,0103 physical sciences ,Common knowledge ,Artificial intelligence ,0101 mathematics ,business ,Random variable ,MNIST database - Abstract
While the Hirschfeld-Gebelein-Renyi (HGR) maximal correlation and the Wyner common information share similar information processing purposes of extracting common knowledge structures between random variables, the relationships between these approaches are generally unclear. In this paper, we demonstrate such relationships by considering the Wyner common information in the weakly dependent regime, called ϵ-common information. We show that the HGR maximal correlation functions coincide with the relative likelihood functions of estimating the auxiliary random variables in ϵ-common information, which establishes the fundamental connections these approaches. Moreover, we extend the ϵ-common information to multiple random variables, and derive a novel algorithm for extracting feature functions of data variables regarding their common information. Our approach is validated by the MNIST problem, and can potentially be useful in multi-modal data analyses.
- Published
- 2020
7. An Information Theoretic Interpretation to Deep Neural Networks
- Author
-
Shao-Lun Huang, Xiangxiang Xu, Lizhong Zheng, and Gregory W. Wornell
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Optimization problem ,deep neural network ,information theory ,local information geometry ,feature extraction ,Computer science ,Science ,QC1-999 ,Computer Science - Information Theory ,Computation ,Computer Science::Neural and Evolutionary Computation ,General Physics and Astronomy ,Inference ,Feature selection ,02 engineering and technology ,Astrophysics ,01 natural sciences ,Article ,Machine Learning (cs.LG) ,0202 electrical engineering, electronic engineering, information engineering ,0101 mathematics ,business.industry ,Physics ,Information Theory (cs.IT) ,QB460-466 ,010101 applied mathematics ,Deep neural networks ,020201 artificial intelligence & image processing ,Artificial intelligence ,business - Abstract
It is commonly believed that the hidden layers of deep neural networks (DNNs) attempt to extract informative features for learning tasks. In this paper, we formalize this intuition by showing that the features extracted by DNN coincide with the result of an optimization problem, which we call the `universal feature selection' problem, in a local analysis regime. We interpret the weights training in DNN as the projection of feature functions between feature spaces, specified by the network structure. Our formulation has direct operational meaning in terms of the performance for inference tasks, and gives interpretations to the internal computation results of DNNs. Results of numerical experiments are provided to support the analysis., Accepted to ISIT 2019
- Published
- 2019
8. Euclidean Information Theory of Networks
- Author
-
Shao-Lun Huang, Changho Suh, and Lizhong Zheng
- Subjects
FOS: Computer and information sciences ,Theoretical computer science ,Computer science ,business.industry ,Information Theory (cs.IT) ,Computer Science - Information Theory ,Distributed computing ,Throughput ,Construct (python library) ,Library and Information Sciences ,Information theory ,Telecommunications network ,Computer Science Applications ,Variety (cybernetics) ,Spread spectrum ,Transmission (telecommunications) ,Euclidean geometry ,business ,Linear optimization problem ,Throughput (business) ,Communication channel ,Computer network ,Information Systems ,Mathematics - Abstract
In this paper, we extend the information theoretic framework that was developed in earlier work to multi-hop network settings. For a given network, we construct a novel deterministic model that quantifies the ability of the network in transmitting private and common messages across users. Based on this model, we formulate a linear optimization problem that explores the throughput of a multi-layer network, thereby offering the optimal strategy as to what kind of common messages should be generated in the network to maximize the throughput. With this deterministic model, we also investigate the role of feedback for multi-layer networks, from which we identify a variety of scenarios in which feedback can improve transmission efficiency. Our results provide fundamental guidelines as to how to coordinate cooperation between users to enable efficient information exchanges across them., to appear in the IEEE Transactions on Information Theory
- Published
- 2015
9. Communication Theoretic Data Analytics
- Author
-
Lizhong Zheng, Kwang-Cheng Chen, H. Vincent Poor, and Shao-Lun Huang
- Subjects
FOS: Computer and information sciences ,Information transfer ,Social network ,Computer Networks and Communications ,business.industry ,Computer science ,Information Theory (cs.IT) ,Computer Science - Information Theory ,Big data ,computer.software_genre ,Information theory ,Data analysis ,The Internet ,Data mining ,Electrical and Electronic Engineering ,business ,computer - Abstract
Widespread use of the Internet and social networks invokes the generation of big data, which is proving to be useful in a number of applications. To deal with explosively growing amounts of data, data analytics has emerged as a critical technology related to computing, signal processing, and information networking. In this paper, a formalism is considered in which data is modeled as a generalized social network and communication theory and information theory are thereby extended to data analytics. First, the creation of an equalizer to optimize information transfer between two data variables is considered, and financial data is used to demonstrate the advantages. Then, an information coupling approach based on information geometry is applied for dimensionality reduction, with a pattern recognition example to illustrate the effectiveness. These initial trials suggest the potential of communication theoretic data analytics for a wide range of applications., Published in IEEE Journal on Selected Areas in Communications, Jan. 2015
- Published
- 2015
10. An information-theoretic approach to unsupervised feature selection for high-dimensional data
- Author
-
Shao-Lun Huang, Lin Zhang, Lizhong Zheng, and Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
- Subjects
Clustering high-dimensional data ,FOS: Computer and information sciences ,Computer Science - Machine Learning ,Generalization ,Computer science ,Computer Science - Information Theory ,Feature selection ,02 engineering and technology ,010501 environmental sciences ,Conditional expectation ,Information theory ,01 natural sciences ,Machine Learning (cs.LG) ,Joint probability distribution ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science::Databases ,0105 earth and related environmental sciences ,Artificial neural network ,business.industry ,Information Theory (cs.IT) ,020206 networking & telecommunications ,Pattern recognition ,Feature (computer vision) ,Unsupervised learning ,Total correlation ,Artificial intelligence ,business ,Random variable - Abstract
In this paper, we propose an information-theoretic approach to design the functional representations to extract the hidden common structure shared by a set of random variables. The main idea is to measure the common information between the random variables by Watanabe's total correlation, and then find the hidden attributes of these random variables such that the common information is reduced the most given these attributes. We show that these attributes can be characterized by an exponential family specified by the eigen-decomposition of some pairwise joint distribution matrix. Then, we adopt the log-likelihood functions for estimating these attributes as the desired functional representations of the random variables, and show that such representations are informative to describe the common structure. Moreover, we design both the multivariate alternating conditional expectation (MACE) algorithm to compute the proposed functional representations for discrete data, and a novel neural network training approach for continuous or high-dimensional data. Furthermore, we show that our approach has deep connections to existing techniques, such as Hirschfeld-Gebelein-R\'{e}nyi (HGR) maximal correlation, linear principal component analysis (PCA), and consistent functional map, which establishes insightful connections between information theory and machine learning. Finally, the performances of our algorithms are validated by numerical simulations., Comment: 35 pages; Submitted to IEEE Journal on Selected Areas in Information Theory (JSAIT)
- Published
- 2017
11. Match and Replace: A Functional ECO Engine for Multierror Circuit Rectification
- Author
-
Chung-Yang (Ric) Huang, Wei-Hsun Lin, Shao-Lun Huang, and Po-Kai Huang
- Subjects
Combinational logic ,Engineering change order ,Heuristic (computer science) ,Formal equivalence checking ,Hardware_PERFORMANCEANDRELIABILITY ,Computer Graphics and Computer-Aided Design ,Hardware_INTEGRATEDCIRCUITS ,Benchmark (computing) ,Equivalent circuit ,Electrical and Electronic Engineering ,Algorithm ,Software ,Hardware_LOGICDESIGN ,Interpolation ,Mathematics ,Electronic circuit - Abstract
Functional engineering change order (ECO) is a popular technique for rectifying design errors after synthesis and placement stages. We present a new approach to generating the patch circuits for multierror circuit rectification. In this paper, we propose a two-phase approach of: 1) discovering the functional matches in two circuits followed by 2) determining the final patch circuits from the matches. The ECO engine in this paper discovers functional and structural matches in two circuits by coordinating the SAT-sweeping and the cut-matching algorithms. Then, the patch selection is conducted by the combinational equivalence checking technique and a linear-time selection heuristic. The experimental results on public benchmark and industrial circuits demonstrate that this ECO engine outperforms state-of-the-art interpolation-based engines.
- Published
- 2013
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.