617 results on '"Chain rule (probability)"'
Search Results
2. A Precise and Reliable Multivariable Chain Rule
- Author
-
Raymond T. Boute
- Subjects
function ,Mathematical optimization ,Chain rule (probability) ,Computer science ,Applied Mathematics ,Multivariable calculus ,dimensional analysis ,telegrapher's equation ,Expression (mathematics) ,Theoretical Computer Science ,partial derivative ,Computational Mathematics ,Mathematics and Statistics ,composition ,transport equation ,expression ,derivative ,chain rule - Abstract
The multivariable chain rule is often challenging to students because it is usually presented with ambiguities and other defects that hamper systematic and reliable application. A very simple formulation combines the derivation operators for functions and for expressions in a manner not found elsewhere due to common confusion between them. Some issues are rooted more deeply than others and are discussed in a broader perspective, starting with the function concept. The approach is illustrated using various applications including the transport equation, partial derivatives of a definite integral, and the distortionless (but not lossless) transmission line. This note is suitable for a lecture in any first-year course covering partial derivatives, as a complement to the other course material.
- Published
- 2021
3. Infinity Learning: Learning Markov Chains from Aggregate Steady-State Observations
- Author
-
Sonia Fahmy, Jianfei Gao, Mohamed Zahran, Bruno Ribeiro, and Amit Sheoran
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Chain rule (probability) ,Steady state (electronics) ,Training set ,Markov chain ,Computer science ,Estimator ,Machine Learning (stat.ML) ,020206 networking & telecommunications ,02 engineering and technology ,General Medicine ,Machine Learning (cs.LG) ,Continuous-time Markov chain ,Stochastic gradient descent ,Statistics - Machine Learning ,Parametric model ,0202 electrical engineering, electronic engineering, information engineering ,Applied mathematics ,Parametric statistics - Abstract
We consider the task of learning a parametric Continuous Time Markov Chain (CTMC) sequence model without examples of sequences, where the training data consists entirely of aggregate steady-state statistics. Making the problem harder, we assume that the states we wish to predict are unobserved in the training data. Specifically, given a parametric model over the transition rates of a CTMC and some known transition rates, we wish to extrapolate its steady state distribution to states that are unobserved. A technical roadblock to learn a CTMC from its steady state has been that the chain rule to compute gradients will not work over the arbitrarily long sequences necessary to reach steady state —from where the aggregate statistics are sampled. To overcome this optimization challenge, we propose ∞-SGD, a principled stochastic gradient descent method that uses randomly-stopped estimators to avoid infinite sums required by the steady state computation, while learning even when only a subset of the CTMC states can be observed. We apply ∞-SGD to a real-world testbed and synthetic experiments showcasing its accuracy, ability to extrapolate the steady state distribution to unobserved states under unobserved conditions (heavy loads, when training under light loads), and succeeding in difficult scenarios where even a tailor-made extension of existing methods fails.
- Published
- 2020
4. Multi-Step Predictions for Adaptive Sampling using Proximal ADMM
- Author
-
Linh Nguyen, Truong X. Nghiem, and Viet-Anh Le
- Subjects
Conditional entropy ,symbols.namesake ,Mathematical optimization ,Adaptive sampling ,Chain rule (probability) ,Optimization problem ,Optimality criterion ,Computer science ,symbols ,Sampling (statistics) ,Gaussian process ,Wireless sensor network - Abstract
The paper presents a novel approach by using multi- step predictions to address the adaptive sampling problem in a resources and obstacles constrained mobile robotic sensor network to efficiently monitor environmental spatial phenomena. It is first proposed to employ the Gaussian process (GP) to represent the spatial field, which can then be used to predict the field at unmeasured locations. The adaptive sampling problem aims to drive the mobile sensors to optimally navigate the environment where the sensors adaptively take measurements of the spatial phenomena at each sampling step. To this end, a conditional entropy based optimality criterion is proposed, which aims to minimize prediction uncertainty of the GP model. By predicting possible measurements the mobile sensors potentially take in a horizon of multiple sampling steps ahead and exploiting the chain rule of the conditional entropy, a multi-step predictions based adaptive sampling optimization problem is formulated. The objective of the optimization problem is to find the optimal sampling paths for the mobile sensors in multiple sampling steps ahead, which then provides their benefits in terms of better navigation, deployment and data collection with more informative sensor readings. However, the optimization problem is nonconvex, complex, constrained and mixed-integer. Therefore, it is proposed to employ the proximal alternating direction method of multipliers algorithm to efficiently solve the problem. More importantly, the solution obtained by the proposed approach is theoretically guaranteed to be converged to a stationary value. Effectiveness of the proposed algorithm was extensively validated by the real- world dataset, where the obtained results are highly promising.
- Published
- 2021
5. A Compressed Hidden Naive Bayesian Classifier
- Author
-
Yulin He, Joshua Zhexue Huang, and Guiliang Ou
- Subjects
Naive Bayes classifier ,Chain rule (probability) ,Discretization ,Computer science ,business.industry ,Joint probability distribution ,Classifier (linguistics) ,Bayesian network ,Pattern recognition ,Artificial intelligence ,Mutual information ,business ,Maximal information coefficient - Abstract
This paper proposes a compressed hidden naive Bayesian (C-HNB) classifier which is an improved version of hidden naive Bayesian (HNB) by compressing the Bayesian network structure and calculating the attribute correlation with maximal information coefficient (MIC). In C-HNB, we remodel the Bayesian network structure based on the chain rule of joint probability distribution so that the number of hidden parent nodes in C-HNB is smaller than the number of hidden parent nodes in HNB, which reduces the training complexity of Bayesian network. In addition, the attribute correlation in C-HNB is calculated with MIC rather than the mutual information, which makes the Bayesian network more stable because the calculation of mutual information is severely influenced by the discretization of continuous attribute. On the selected KEEL benchmark data sets, we compare the classification performances of C-HNB with HNB. The comparative results show that C-HNB can obtain the better prediction accuracy with the less training time in comparison with HNB and thus the experimental results demonstrate the effectiveness of C-HNB.
- Published
- 2021
6. Random effects models for estimation of the probability and time to progression of a continuous biomarker
- Author
-
Liesbeth Bruckers, Geert Molenberghs, Marc Aerts, Edmund Njeru Njagi, Geert Willem H. Schurink, Tarylee Reddy, Vascular Surgery, MUMC+: MA Vaatchirurgie CVC (3), RS: CARIM - R3.08 - Regenerative and reconstructive medicine for vascular disease, and RS: Carim - V03 Regenerative and reconstructive medicine vascular disease
- Subjects
Statistics and Probability ,Mixed model ,Time Factors ,HIV Infections ,Residual ,01 natural sciences ,INTERVALS ,Cohort Studies ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Statistics ,INFECTION ,threshold ,Humans ,Pharmacology (medical) ,030212 general & internal medicine ,0101 mathematics ,Probability ,Mathematics ,Pharmacology ,Models, Statistical ,Chain rule (probability) ,MORTALITY ,Autocorrelation ,DISEASE PROGRESSION ,Conditional probability ,serial correlation ,prediction ,Random effects model ,persistence criteria ,Markov Chains ,Confidence interval ,CD4 Lymphocyte Count ,LYMPHOCYTE COUNT ,Markov property ,Biomarkers ,Aortic Aneurysm, Abdominal - Abstract
Biomarkers play a key role in the monitoring of disease progression. The time taken for an individual to reach a biomarker exceeding or lower than a meaningful threshold is often of interest. Due to the inherent variability of biomarkers, persistence criteria are sometimes included in the definitions of progression, such that only two consecutive measurements above or below the relevant threshold signal that "true" progression has occurred. In previous work, a novel approach was developed, which allowed estimation of the time to threshold using the parameters from a linear mixed model where the residual variance was assumed to be pure measurement error. In this paper, we extend this methodology so that serial correlation can be accommodated. Assuming that the Markov property holds and applying the chain rule of probabilities, we found that the probability of progression at each timepoint can be expressed simply as the product of conditional probabilities. The methodology is applied to a cohort of HIV positive individuals, where the time to reach a CD4 count threshold is estimated. The second application we present is based on a study on abdominal aortic aneurysms, where the time taken for an individual to reach a diameter exceeding 55 mm is studied. We observed that erroneously ignoring the residual correlation when it is strong may result in substantial overestimation of the time to threshold. The estimated probability of the biomarker reaching a threshold of interest, expected time to threshold, and confidence intervals are presented for selected patients in both applications.
- Published
- 2019
7. Hierarchical estimation of parameters in Bayesian networks
- Author
-
Marco Zaffalon, Laura Azzimonti, and Giorgio Corani
- Subjects
Statistics and Probability ,Chain rule (probability) ,Estimation theory ,business.industry ,Applied Mathematics ,05 social sciences ,Posterior probability ,Bayesian network ,Pattern recognition ,Statistical model ,Conditional probability distribution ,01 natural sciences ,Hierarchical database model ,010104 statistics & probability ,Computational Mathematics ,Computational Theory and Mathematics ,0502 economics and business ,Artificial intelligence ,0101 mathematics ,business ,Conditional variance ,Algorithm ,050205 econometrics ,Mathematics - Abstract
A novel approach for parameter estimation in Bayesian networks is presented. The main idea is to introduce a hyper-prior in the Multinomial–Dirichletmodel, traditionally used for conditional distribution estimation in Bayesian networks. The resulting hierarchical model jointly estimates different conditional distributions belonging to the same conditional probability table, thus borrowing statistical strength from each other. An analytical study of the dependence structure a priori induced by the hierarchical model is performed and an ad hoc variational algorithm for fast and accurate inference is derived. The proposed hierarchical model yields a major performance improvement in classification with Bayesian networks compared to traditional models. The proposed variational algorithm reduces by two orders of magnitude the computational time, with the same accuracy in parameter estimation, compared to traditional MCMC methods. Moreover, motivated by a real case study, the hierarchical model is applied to the estimation of Bayesian networks parameters by borrowing strength from related domains.
- Published
- 2019
8. End-to-end Learned Image Compression with Conditional Latent Space Modeling for Entropy Coding
- Author
-
Fatih Kamisli and Aziz Berkay Yesilyurt
- Subjects
Chain rule (probability) ,Artificial neural network ,Computer science ,Markov process ,020206 networking & telecommunications ,02 engineering and technology ,Latent variable ,symbols.namesake ,Joint probability distribution ,0202 electrical engineering, electronic engineering, information engineering ,symbols ,020201 artificial intelligence & image processing ,Markov property ,Entropy encoding ,Algorithm ,Image compression - Abstract
The use of neural networks in image compression enables transforms and probability models for entropy coding which can process images based on much more complex models than the simple Gauss-Markov models in traditional compression methods. All at the expense of higher computational complexity. In the neural-network based image compression literature, various methods to model the dependencies in the transform domain/latent space are proposed. This work uses an alternative method to exploit the dependencies of the latent representation. The joint density of the latent representation is modeled as a product of conditional densities, which are learned using neural networks. However, each latent variable is not conditioned on all previous latent variables as in the chain rule of factoring joint distributions, but only on a few previous variables, in particular the left, upper and upperleft spatial neighbor variables based on a Markov property assumption for a simpler model and algorthm. The compression performance is comparable with the state- of-the-art compression models, while the conditional densities require a much simpler network and training time due to their simplicity and less number of parameters then its counterparts.
- Published
- 2021
9. Entropy Under Disintegrations
- Author
-
Juan Pablo Vigneaux
- Subjects
Differential entropy ,Pure mathematics ,Entropy (classical thermodynamics) ,Chain rule (probability) ,Asymptotic equipartition property ,Absolute continuity ,Measure (mathematics) ,Probability measure ,Mathematics ,Haar measure - Abstract
We consider the differential entropy of probability measures absolutely continuous with respect to a given \(\sigma \)-finite “reference” measure on an arbitrary measure space. We state the asymptotic equipartition property in this general case; the result is part of the folklore but our presentation is to some extent novel. Then we study a general framework under which such entropies satisfy a chain rule: disintegrations of measures. We give an asymptotic interpretation for conditional entropies in this case. Finally, we apply our result to Haar measures in canonical relation.
- Published
- 2021
10. Lower and Upper Bounds for ‘Useful’ Renyi Information Rate
- Author
-
Pankaj Prasad Dwivedi and D. K. Sharma
- Subjects
Discrete mathematics ,Chain rule (probability) ,Current (mathematics) ,Markov chain ,Stochastic process ,Entropy (information theory) ,Code rate ,Information measure ,Entropy rate ,Mathematics - Abstract
Choosing the stochastic process with the entropy is attributable to increase the least amount of information to the problem under consideration. As a result, the entropy rate for stochastic processes must be determined. In the current correspondence, we will describe the rate of ‘useful’ conditional Renyi information measure, as well as, show that rate for ‘useful’ Renyi Information, the chain rule holds. Therefore, for the rate of ‘useful’ conditional Renyi information and also for an ergodic Markov chain, we will present a relationship and use it to extract the rate of ‘useful’ Renyi information. At last, we will show that Shannon's information rate is the rate for ‘useful’ Renyi information.
- Published
- 2021
11. DeepComp: Which Competing Event Will Hit the Patient First?
- Author
-
Yingxue Li, Tiange Chen, Wenxiao Jia, Jianzeng Dong, Fei Wang, Guotong Xie, Changsheng Ma, Xiang Li, Xin Du, and Yashu Kang
- Subjects
Chain rule (probability) ,Computer science ,business.industry ,Deep learning ,Conditional probability ,030204 cardiovascular system & hematology ,Machine learning ,computer.software_genre ,01 natural sciences ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Censoring (clinical trials) ,Artificial intelligence ,0101 mathematics ,Time point ,Set (psychology) ,business ,computer ,Survival analysis ,Event (probability theory) - Abstract
When taking care of complex patients with multiple morbidities, accurately predicting the occurrence of each cause-specific event is critical for designing optimal treatment plans. However, standard survival analysis cannot deal with the multiple (usually competing) adverse events and views those competing events as censored. This will result in biased estimation of the incidence rate. In this paper, we propose a deep learning based survival analysis algorithm called DeepComp to jointly predict the progress of the competing events, which can thus inform the doctors which event is more likely to hit the patient first. DeepComp constructs a multi-task recurrent neural network (RNN) and views the conditional probability of each competing event at each time point as the output of each RNN cell. Then the probability chain rule is utilized to combine them together. In this way, the survival probability and the risk for each competing event over the time space are obtained. The multitask structure not only prevents the model from unreasonable censoring but also aids the model in capturing the complex hidden association among the competing events. A novel penalty is added to the loss function to better discriminate the competing risks for each particular patient, which could benefit treatment decision-making. We conduct comprehensive experiments on two real-world clinical data sets and one synthetic data set. The proposed DeepComp method achieves significant performance improvement compared to the state-of-the-art baseline methods.
- Published
- 2020
12. Max-Variance Convolutional Neural Network Model Compression
- Author
-
Tanya Boone-Sifuentes, Antonio Robles-Kelly, and Asef Nazari
- Subjects
Chain rule (probability) ,Computer science ,020206 networking & telecommunications ,02 engineering and technology ,Filter (signal processing) ,Variance (accounting) ,010501 environmental sciences ,01 natural sciences ,Facial recognition system ,Convolutional neural network ,Constraint (information theory) ,Uncompressed video ,Compression (functional analysis) ,0202 electrical engineering, electronic engineering, information engineering ,Algorithm ,0105 earth and related environmental sciences - Abstract
In this paper, we present a method for convolutional neural network model compression which is based on the removal of filter banks that correspond to unimportant weights. To do this, we depart from the relationship between consecutive layers so as to obtain a factor that can be used to assess the degree upon which each pair of filters are coupled to each other. This allows us to use the unit-response of the coupling between two layers so as to remove pathways int he network that are negligible. Moreover, since the back-propagation gradients tend to diminish as the chain rule is applied from the output to the input layer, here we maximise the variance on the coupling factors while enforcing a monotonicity constraint that assures the most relevant pathways are preserved. We show results on widely used networks employing classification and facial expression recognition datasets. In our experiments, our approach delivers a very competitive trade-off between compression rates and performance as compared to both, the uncompressed models and alternatives elsewhere in the literature. pages = 271–279
- Published
- 2020
13. A Deep Recurrent Survival Model for Unbiased Ranking
- Author
-
Yuchen Fang, Kan Ren, Jun Wang, Xiaoqiang Zhu, Guorui Zhou, Jiarui Jin, Kun Gai, Yong Yu, Jian Xu, and Weinan Zhang
- Subjects
FOS: Computer and information sciences ,Chain rule (probability) ,Computer science ,business.industry ,02 engineering and technology ,Machine learning ,computer.software_genre ,Weighting ,Ranking (information retrieval) ,Computer Science - Information Retrieval ,Recurrent neural network ,Ranking ,Position (vector) ,Joint probability distribution ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Information Retrieval (cs.IR) - Abstract
Position bias is a critical problem in information retrieval when dealing with implicit yet biased user feedback data. Unbiased ranking methods typically rely on causality models and debias the user feedback through inverse propensity weighting. While practical, these methods still suffer from two major problems. First, when inferring a user click, the impact of the contextual information, such as documents that have been examined, is often ignored. Second, only the position bias is considered but other issues resulted from user browsing behaviors are overlooked. In this paper, we propose an end-to-end Deep Recurrent Survival Ranking (DRSR), a unified framework to jointly model user's various behaviors, to (i) consider the rich contextual information in the ranking list; and (ii) address the hidden issues underlying user behaviors, i.e., to mine observe pattern in queries without any click (non-click queries), and to model tracking logs which cannot truly reflect the user browsing intents (untrusted observation). Specifically, we adopt a recurrent neural network to model the contextual information and estimates the conditional likelihood of user feedback at each position. We then incorporate survival analysis techniques with the probability chain rule to mathematically recover the unbiased joint probability of one user's various behaviors. DRSR can be easily incorporated with both point-wise and pair-wise learning objectives. The extensive experiments over two large-scale industrial datasets demonstrate the significant performance gains of our model comparing with the state-of-the-arts., SIGIR 2020. arXiv admin note: text overlap with arXiv:1809.05818 by other authors
- Published
- 2020
14. Asymptotics of conditional probabilities of succesful allocation of random number of particles into cells
- Author
-
Aleksey N. Chuprunov, Aleksandra Igorevna Afonina, and Ilgiz Rifatovich Kayumov
- Subjects
Combinatorics ,Discrete mathematics ,010104 statistics & probability ,Chain rule (probability) ,Particle number ,Applied Mathematics ,010102 general mathematics ,Law of total probability ,Discrete Mathematics and Combinatorics ,Conditional probability ,0101 mathematics ,01 natural sciences ,Mathematics - Abstract
The article is devoted to the memory of Valentin Fedorovich Kolchin. Let ζ, ζi (i ∈ N) be independent identically distributed nonnegative integer-valued random variables, (η i1,…, ηiN ) be the fillings of cells in the generalized scheme of allocation of ζi particles into N cells, 1 ≤ i ≤ n, for fixed Zn = (ζ 1, …, ζn ) these allocation schemes are independent. We consider the conditional probabilities P(A n,N | Zn ) of the event A n,N = {each cell in each of n allocation schemes contains no more than r particles}, where r is some fixed number. The sufficient conditions for the convergence of the sequence P(A n,N | Zn ) to a nonrandom limit with probability 1 are given. It is shown that the random variable ln P(A n,N | Zn ) is asymptotically normal. Applications of the obtained results to the noise-proof encoding are discussed.
- Published
- 2017
15. A Soft-Output MIMO Detector With Achievable Information Rate based Partial Marginalization
- Author
-
Sha Hu and Fredrik Rusek
- Subjects
Signal processing ,Chain rule (probability) ,Computational complexity theory ,MIMO ,Detector ,Latency (audio) ,020206 networking & telecommunications ,02 engineering and technology ,Code rate ,Signal ,020202 computer hardware & architecture ,Control theory ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,Electrical and Electronic Engineering ,Algorithm ,Mathematics - Abstract
In this paper, we propose a soft-output detector for multiple-input multiple-output (MIMO) channels that utilizes achievable information rate (AIR) based partial marginalization (PM). The proposed AIR based PM (AIR-PM) detector has superior performance compared to previously proposed PM designs and other soft-output detectors such as K-best, while at the same time yielding lower computational complexity, a detection latency that is independent of the number of transmit layers, and straightforward inclusion of soft input information. Using a tree representation of the MIMO signal, the key property of the AIRPM is that the connections among all child layers are broken. Therefore, least-square (LS) estimates used for marginalization are obtained independently and in parallel, which have better quality than the zero-forcing decision feedback (ZF-DF) estimates used in previous PM designs. Such a property of the AIRPM detector is designed via a mismatched detection model that maximizes the AIR. Furthermore, we show that the chain rule holds for the AIR calculation, which facilitates an information theoretic characterization of the AIR-PM detector. (Less)
- Published
- 2017
16. Representations for Continuous Time Processes
- Author
-
Amarjit Budhiraja and Paul Dupuis
- Subjects
Algebra ,Chain rule (probability) ,Discrete time and continuous time ,Computer science - Abstract
In previous chapters we developed and applied representations for the large deviation analysis of discrete time processes. The derivation of useful representations in this setting follows from a straightforward application of the chain rule. The only significant issue is to decide on the ordering used for the underlying “driving noises” when the chain rule is applied, since controls are allowed to depend on the “past,” which is determined by this ordering.
- Published
- 2019
17. A chain rule for the quantum relative entropy
- Author
-
Omar Fawzi, Kun Fang, David Sutter, Renato Renner, Department of Applied Mathematics and Theoretical Physics [Cambridge] (DAMTP), Faculty of mathematics Centre for Mathematical Sciences [Cambridge] (CMS), University of Cambridge [UK] (CAM)-University of Cambridge [UK] (CAM), Laboratoire de l'Informatique du Parallélisme (LIP), École normale supérieure de Lyon (ENS de Lyon)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS), Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich] (ETH Zürich), ANR-18-CE47-0011,ACOM,Une Théorie Algorithmique de la Communcation(2018), Centre National de la Recherche Scientifique (CNRS)-Université de Lyon-Institut National de Recherche en Informatique et en Automatique (Inria)-Université Claude Bernard Lyon 1 (UCBL), Université de Lyon-École normale supérieure - Lyon (ENS Lyon), and École normale supérieure - Lyon (ENS Lyon)-Université Claude Bernard Lyon 1 (UCBL)
- Subjects
FOS: Computer and information sciences ,Quantum Physics ,Chain rule (probability) ,Kullback–Leibler divergence ,Computer Science - Information Theory ,Information Theory (cs.IT) ,General Physics and Astronomy ,TheoryofComputation_GENERAL ,FOS: Physical sciences ,Context (language use) ,Conditional probability distribution ,Quantum channel ,Data_CODINGANDINFORMATIONTHEORY ,01 natural sciences ,Quantum relative entropy ,Multipartite ,[PHYS.QPHY]Physics [physics]/Quantum Physics [quant-ph] ,0103 physical sciences ,Probability distribution ,Statistical physics ,010306 general physics ,Quantum Physics (quant-ph) ,Mathematics - Abstract
The chain rule for the classical relative entropy ensures that the relative entropy between probability distributions on multipartite systems can be decomposed into a sum of relative entropies of suitably chosen conditional distributions on the individual systems. Here, we prove a similar chain rule inequality for the quantum relative entropy in terms of channel relative entropies. The new chain rule allows us to solve an open problem in the context of asymptotic quantum channel discrimination: surprisingly, adaptive protocols cannot improve the error rate for asymmetric channel discrimination compared to non-adaptive strategies. In addition, we give examples of quantum channels showing that the channel relative entropy is not additive under the tensor product., Comment: 12 pages
- Published
- 2019
- Full Text
- View/download PDF
18. Deep Landscape Forecasting for Real-time Bidding Advertising
- Author
-
Kan Ren, Lei Zheng, Jiarui Qin, Yong Yu, Zhengyu Yang, and Weinan Zhang
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Chain rule (probability) ,Market competition ,Computer science ,Heuristic ,02 engineering and technology ,Real-time bidding ,Computer Science - Information Retrieval ,Machine Learning (cs.LG) ,Order (exchange) ,Computer Science - Computer Science and Game Theory ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Market price ,Econometrics ,Probability distribution ,020201 artificial intelligence & image processing ,Bid price ,Information Retrieval (cs.IR) ,Computer Science and Game Theory (cs.GT) - Abstract
The emergence of real-time auction in online advertising has drawn huge attention of modeling the market competition, i.e., bid landscape forecasting. The problem is formulated as to forecast the probability distribution of market price for each ad auction. With the consideration of the censorship issue which is caused by the second-price auction mechanism, many researchers have devoted their efforts on bid landscape forecasting by incorporating survival analysis from medical research field. However, most existing solutions mainly focus on either counting-based statistics of the segmented sample clusters, or learning a parameterized model based on some heuristic assumptions of distribution forms. Moreover, they neither consider the sequential patterns of the feature over the price space. In order to capture more sophisticated yet flexible patterns at fine-grained level of the data, we propose a Deep Landscape Forecasting (DLF) model which combines deep learning for probability distribution forecasting and survival analysis for censorship handling. Specifically, we utilize a recurrent neural network to flexibly model the conditional winning probability w.r.t. each bid price. Then we conduct the bid landscape forecasting through probability chain rule with strict mathematical derivations. And, in an end-to-end manner, we optimize the model by minimizing two negative likelihood losses with comprehensive motivations. Without any specific assumption for the distribution form of bid landscape, our model shows great advantages over previous works on fitting various sophisticated market price distributions. In the experiments over two large-scale real-world datasets, our model significantly outperforms the state-of-the-art solutions under various metrics., Comment: KDD 2019. The reproducible code and dataset link is https://github.com/rk2900/DLF
- Published
- 2019
- Full Text
- View/download PDF
19. A two-parameter entropy and its fundamental properties
- Author
-
Supriyo Dutta, Shigeru Furuichi, and Partha Guha
- Subjects
Chain rule (probability) ,Two parameter ,Kullback–Leibler divergence ,010308 nuclear & particles physics ,Tsallis entropy ,Statistical and Nonlinear Physics ,01 natural sciences ,Entropy (classical thermodynamics) ,0103 physical sciences ,Information geometry ,Statistical physics ,010306 general physics ,Mathematical Physics ,Mathematics - Abstract
This article proposes a new two-parameter generalized entropy, which can be reduced to the Tsallis and Shannon entropies for specific values of its parameters. We develop a number of information-theoretic properties of this generalized entropy and divergence, for instance, the sub-additive property, strong sub-additive property, joint convexity, and information monotonicity. This article presents an exposit investigation on the information-theoretic and information-geometric characteristics of the new generalized entropy and compare them with the properties of the Tsallis and Shannon entropies.
- Published
- 2020
20. The algebra of conditional sets and the concepts of conditional topology and compactness
- Author
-
Asgar Jamneshan, Samuel Drapeau, Michael Kupper, and Martin Karliczek
- Subjects
Ultrafilter ,0211 other engineering and technologies ,Mathematics::General Topology ,02 engineering and technology ,Conditional expectation ,Topology ,01 natural sciences ,Regular conditional probability ,Conditional event algebra ,Conditional quantum entropy ,FOS: Mathematics ,0101 mathematics ,Mathematics ,Discrete mathematics ,Conditional entropy ,Mathematics::Functional Analysis ,021103 operations research ,Chain rule (probability) ,Applied Mathematics ,Conditional mutual information ,010102 general mathematics ,Mathematics - Logic ,Algebra ,03E70 ,Logic (math.LO) ,Analysis - Abstract
The concepts of a conditional set, a conditional inclusion relation and a conditional Cartesian product are introduced. The resulting conditional set theory is sufficiently rich in order to construct a conditional topology, a conditional real and functional analysis indicating the possibility of a mathematical discourse based on conditional sets. It is proved that the conditional power set is a complete Boolean algebra, and a conditional version of the axiom of choice, the ultrafilter lemma, Tychonoff's theorem, the Borel-Lebesgue theorem, the Hahn-Banach theorem, the Banach-Alaoglu theorem and the Krein-\v{S}mulian theorem are shown., Comment: 29 pages
- Published
- 2016
21. Halo detection via large-scale Bayesian inference
- Author
-
Matthew Colless, Filipe B. Abdalla, Ofer Lahav, Alex Merson, Benjamin D. Wandelt, Jens Jasche, and D. Heath Jones
- Subjects
Physics ,Field galaxy ,Cosmology and Nongalactic Astrophysics (astro-ph.CO) ,Chain rule (probability) ,010308 nuclear & particles physics ,Dark matter ,Bayesian probability ,FOS: Physical sciences ,Estimator ,Astronomy and Astrophysics ,Astrophysics::Cosmology and Extragalactic Astrophysics ,Astrophysics ,Bayesian inference ,01 natural sciences ,Galaxy ,Space and Planetary Science ,0103 physical sciences ,Statistical physics ,Halo ,Astrophysics - Instrumentation and Methods for Astrophysics ,Instrumentation and Methods for Astrophysics (astro-ph.IM) ,010303 astronomy & astrophysics ,Astrophysics::Galaxy Astrophysics ,Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
We present a proof-of-concept of a novel and fully Bayesian methodology designed to detect halos of different masses in cosmological observations subject to noise and systematic uncertainties. Our methodology combines the previously published Bayesian large-scale structure inference algorithm, HADES, and a Bayesian chain rule (the Blackwell-Rao Estimator), which we use to connect the inferred density field to the properties of dark matter halos. To demonstrate the capability of our approach we construct a realistic galaxy mock catalogue emulating the wide-area 6-degree Field Galaxy Survey, which has a median redshift of approximately 0.05. Application of HADES to the catalogue provides us with accurately inferred three-dimensional density fields and corresponding quantification of uncertainties inherent to any cosmological observation. We then use a cosmological simulation to relate the amplitude of the density field to the probability of detecting a halo with mass above a specified threshold. With this information we can sum over the HADES density field realisations to construct maps of detection probabilities and demonstrate the validity of this approach within our mock scenario. We find that the probability of successful of detection of halos in the mock catalogue increases as a function of the signal-to-noise of the local galaxy observations. Our proposed methodology can easily be extended to account for more complex scientific questions and is a promising novel tool to analyse the cosmic large-scale structure in observations., Comment: 17 pages, 13 figures. Accepted for publication in MNRAS following moderate corrections
- Published
- 2016
22. Conditional belief functions as lower envelopes of conditional probabilities in a finite setting
- Author
-
Barbara Vantaggi, Davide Petturiti, and Giulianella Coletti
- Subjects
Information Systems and Management ,Posterior probability ,Conditional probability ,Bayesian updating ,02 engineering and technology ,01 natural sciences ,Extensions ,Theoretical Computer Science ,010104 statistics & probability ,Regular conditional probability ,Lower envelope ,Artificial Intelligence ,Conditional event algebra ,0202 electrical engineering, electronic engineering, information engineering ,0101 mathematics ,Mathematics ,Bayesian updating, Belief function, Conditional probability, Extensions, Lower envelope ,Discrete mathematics ,Chain rule (probability) ,Law of total probability ,Conditional probability distribution ,Belief function ,Computer Science Applications ,Control and Systems Engineering ,020201 artificial intelligence & image processing ,Conditional variance ,Software - Abstract
The aim is to provide a characterization of full conditional measures on a finite Boolean algebra, obtained as lower envelope of the extensions of a full conditional probability defined on another finite Boolean algebra. Such conditional measures are conditional belief functions defined by means of a generalized Bayesian conditioning rule relying on a linearly ordered class of belief functions. This notion of Bayesian conditioning for belief functions is compared with other well-known conditioning rules by looking for those conditional measures that can be seen as lower conditional probabilities.
- Published
- 2016
23. Alternative approaches to conditional specification of bivariate distributions
- Author
-
Ramesh C. Gupta and Barry C. Arnold
- Subjects
Statistics and Probability ,Conditional entropy ,Regular conditional probability ,Chain rule (probability) ,Discriminative model ,Joint probability distribution ,Econometrics ,Conditional probability distribution ,Conditional expectation ,Conditional variance ,Mathematics - Abstract
In this paper, we discuss approaches to the determination of a bivariate distribution by specifying the two conditional density functions, the two conditional survival functions, the two hazard components and the two conditional hazard functions of one variable given the value of the other. In addition, the case in which one considers determination of the joint distribution by specifying the conditional distribution of each variable given that the other variable is bounded below is discussed. Examples are provided in each case.
- Published
- 2016
24. A new nonautonomous chain rule in BV
- Author
-
Virginia De Cicco
- Subjects
010101 applied mathematics ,Discrete mathematics ,BV functions ,Chain rule ,Lower semicontinuity ,Mathematics (all) ,Chain rule (probability) ,General Mathematics ,010102 general mathematics ,0101 mathematics ,01 natural sciences ,Mathematics - Published
- 2016
25. Unsupervised Bayesian Hypothesis Testing**The work was supported by RFBR (13-08-00744a)
- Author
-
A.V. Dobrovidov and V.O. Vasilyev
- Subjects
Chain rule (probability) ,business.industry ,Bayesian probability ,Contrast (statistics) ,020206 networking & telecommunications ,02 engineering and technology ,Decision rule ,Conditional probability distribution ,Machine learning ,computer.software_genre ,Bayesian statistics ,Control and Systems Engineering ,Kernel (statistics) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Mathematics ,Statistical hypothesis testing - Abstract
The problem of two hypothesis testing using Bayesian criterion is considered. In contrast to the standard problem with known conditional distributions of classes and their prior probabilities, the adaptive (or unsupervised) version of the problem is studied where one of the conditional distributions is totally unknown and prior probabilities of hypotheses are unknown also. Observations are available from a mixed distribution only. A decision rule with empirical risk tending to the optimal risk calculated from the complete statistical information is proposed. This decision rule is constructed using methods of kernel non-parametric statistics. The simulation results are given.
- Published
- 2016
26. The DOTI Approach: An Alternative Way of Teaching and Learning the Chain Rule
- Author
-
Steve Anglin
- Subjects
Chain rule (probability) ,Computer science ,business.industry ,Artificial intelligence ,business - Published
- 2018
27. Joint, Conditional, and Total Probabilities
- Author
-
Philipp Kornreich
- Subjects
Chain rule (probability) ,Regular conditional probability ,Statistics ,Law of total probability ,Conditioning ,Conditional probability distribution ,Joint (geology) ,Conditional variance ,Mathematics - Published
- 2018
28. Directional Malliavin Derivatives: A Characterisation of Independence and a Generalised Chain Rule
- Author
-
Stefan Koch
- Subjects
Statistics and Probability ,Chain rule (probability) ,Independence (mathematical logic) ,Mathematical economics ,Mathematics - Published
- 2018
29. Interactive Leakage Chain Rule for Quantum Min-entropy
- Author
-
Ching-Yi Lai and Kai-Min Chung
- Subjects
Quantum Physics ,Chain rule (probability) ,Computer science ,Min entropy ,FOS: Physical sciences ,TheoryofComputation_GENERAL ,020206 networking & telecommunications ,02 engineering and technology ,Topology ,01 natural sciences ,0103 physical sciences ,0202 electrical engineering, electronic engineering, information engineering ,Information source (mathematics) ,State (computer science) ,010306 general physics ,Quantum information science ,Quantum Physics (quant-ph) ,Protocol (object-oriented programming) ,Quantum ,Leakage (electronics) ,Computer Science::Cryptography and Security - Abstract
The leakage chain rule for quantum min-entropy quantifies the change of min-entropy when one party gets additional leakage about the information source. Herein we provide an interactive version that quantifies the change of min-entropy between two parties, who share an initial classical-quantum state and are allowed to run a two-party protocol. As an application, we prove new versions of lower bounds on the complexity of quantum communication of classical information., A few terminology mistakes were corrected in this version
- Published
- 2018
30. Models for plant self‐thinning
- Author
-
R. K. Wade
- Subjects
0106 biological sciences ,Chain rule (probability) ,Thinning ,Ecology ,predicted self‐thinning slope ,exponential growth ,010603 evolutionary biology ,01 natural sciences ,sigmoid growth ,Exponential growth ,exponential density decrease ,Applied mathematics ,chain rule ,self‐thinning ,Ecology, Evolution, Behavior and Systematics ,QH540-549.5 ,010606 plant biology & botany ,Mathematics - Abstract
Plant self‐thinning, which is density‐dependent mortality, has several observed characteristics, including a certain mathematical relationship between growth and density. The original equation that describes self‐thinning is logw¯=C−(3/2)×logdensity, w¯ = mean weight. The basic equation is supported by data from ecology and forestry, but there have been a number of reported slopes that differ from −3/2. This study proposed that change in plant density over time decreases exponentially and that plant growth (weight or volume) increases over time according to one of two models: either exponential growth or sigmoid growth. Exponential growth with a finite time limit, in conjunction with exponential decrease in density, led to the equation log w = C + α/γ × log density, where α/γ
- Published
- 2018
31. Generalized Beta Regression to Elicit Conditional Distributions of Medical Variables
- Author
-
Alessandro Magrini, Federico M. Stefanini, and Davide Luciani
- Subjects
Statistics and Probability ,Chain rule (probability) ,Computer science ,business.industry ,Applied Mathematics ,Statistics ,Univariate ,Conditional probability distribution ,Machine learning ,computer.software_genre ,Bayesian elicitation ,degree of belief ,informative prior distribution ,logistic function ,rescaling procedure ,QA273-280 ,HA1-4737 ,Discriminative model ,Joint probability distribution ,Prior probability ,Sample space ,Artificial intelligence ,Statistics, Probability and Uncertainty ,business ,Categorical variable ,computer ,Probabilities. Mathematical statistics - Abstract
Univariate conditional models are of core importance in supporting medical reasoning, as they allow to decompose a joint probability distribution using the chain rule. Although several methods are available for the elicitation of the joint prior distribution of parameters when the response is a medical categorical variable, the case of a medical continuous response is typically difficult to address, because its sample space is often bounded to an interval and its relationship with explanatory variables may be not linear. In these situations, the elicitation of an informative prior distribution on parameters of a univariate conditional model is challenging, because some level of statistical training is required to a medical expert for interpreting parameters and for retrieving appropriate quantitative information about them. The task can be eased and made efficient by recognizing that physicians typically distinguish among values involving medically normal and pathological patient conditions on the grounds of their personal clinical experience. In this paper, we propose a Generalized Beta regression where parameter elicitation is performed by establishing a correspondence among measured values expressed as relative positions within intervals with a clinical interpretation, regardless the original scales of variables. Software implementing the elicitation procedure is freely available.
- Published
- 2018
32. Student responses to chain rule problems in thermodynamics
- Author
-
Corinne A. Manogue, Paul J. Emigh, and Ian W. Founds
- Subjects
Chain rule (probability) ,Mathematical economics ,Mathematics - Published
- 2018
33. The Chain Rule
- Author
-
Vitali Milman and Hermann König
- Subjects
Combinatorics ,Chain rule (probability) ,Mathematics - Published
- 2018
34. The Second-Order Chain Rule
- Author
-
Vitali Milman and Hermann König
- Subjects
Chain rule (probability) ,Order (business) ,Applied mathematics ,Mathematics - Published
- 2018
35. Analysis of the Dependence of the Approximation Properties of a Nonparametric Estimate of a Probability Density on the Sampling Method for the Domain of Definition
- Author
-
A. V. Lapko and V. A. Lapko
- Subjects
Freedman–Diaconis rule ,Chain rule (probability) ,Applied Mathematics ,Statistics ,Range (statistics) ,Nonparametric statistics ,Sampling (statistics) ,Applied mathematics ,Probability density function ,Instrumentation ,Root-mean-square deviation ,Regression ,Mathematics - Abstract
A method is proposed for comparing the efficiencies of procedures for sampling the range of values of a random quantity when estimating the probability density. The criterion of efficiency is taken to be an asymptotic expression for the mean square deviation of a regression fit for the probability density.
- Published
- 2015
36. An alternative teaching method of conditional probabilities and Bayes' rule: an application of the truth table
- Author
-
Amy Vashlishan Murray and Eiki Satake
- Subjects
Statistics and Probability ,Bayes' rule ,Chain rule (probability) ,business.industry ,Posterior probability ,Conditional probability ,Conditional probability table ,Education ,Bayes' theorem ,Naive Bayes classifier ,Generative model ,Artificial intelligence ,business ,Algorithm ,Mathematics - Abstract
Summary This paper presents a comparison of three approaches to the teaching of probability to demonstrate how the truth table of elementary mathematical logic can be used to teach the calculations of conditional probabilities. Students are typically introduced to the topic of conditional probabilities—especially the ones that involve Bayes' rule—with the help of such traditional approaches as formula use or conversion to natural frequencies. The truth table approach is an alternative method for explaining the concept and calculation procedure of conditional probability and Bayes' rule.
- Published
- 2015
37. An Efficient Method to Calculate the Failure Rate of Dynamic Systems with Random Parameters Using the Total Probability Theorem
- Author
-
Monica Majcher, Igor Baseski, Vasileios Geroulas, Amandeep Singh, and Zissimos P. Mourelatos
- Subjects
Mathematical optimization ,Chain rule (probability) ,Posterior probability ,Law of total probability ,Conditional probability ,General Medicine ,Conditional probability distribution ,Conditional expectation ,symbols.namesake ,Regular conditional probability ,symbols ,Applied mathematics ,Gaussian process ,Mathematics - Abstract
Using the total probability theorem, we propose a method to calculate the failure rate of a linear vibratory system with random parameters excited by stationary Gaussian processes. The response of such a system is non-stationary because of the randomness of the input parameters. A space-filling design, such as optimal symmetric Latin hypercube sampling or maximin, is first used to sample the input parameter space. For each design point, the output process is stationary and Gaussian. We present two approaches to calculate the corresponding conditional probability of failure. A Kriging metamodel is then created between the input parameters and the output conditional probabilities allowing us to estimate the conditional probabilities for any set of input parameters. The total probability theorem is finally applied to calculate the time-dependent probability of failure and the failure rate of the dynamic system. The proposed method is demonstrated using a vibratory system. Our approach can be easily extended to non-stationary Gaussian input processes.
- Published
- 2015
38. EXTENSIONS OF SEVERAL CLASSICAL RESULTS FOR INDEPENDENT AND IDENTICALLY DISTRIBUTED RANDOM VARIABLES TO CONDITIONAL CASES
- Author
-
Shun-Jing Li and De-Mei Yuan
- Subjects
Independent and identically distributed random variables ,Discrete mathematics ,Mathematics::Functional Analysis ,Regular conditional probability ,Chain rule (probability) ,Conditional independence ,General Mathematics ,Mathematics::Classical Analysis and ODEs ,Law of total probability ,Conditional probability distribution ,Conditional expectation ,Conditional variance ,Mathematics - Abstract
Extensions of the Kolmogorov convergence criterion and the Marcinkiewicz-Zygmund inequalities from independent random variables to conditional independent ones are derived. As their applications, a conditional version of the Marcinkiewicz-Zygmund strong law of large numbers and a result on convergence in Lp for conditionally independent and conditionally identically distributed random variables are established, respectively.
- Published
- 2015
39. Maximum Likelihood Social Choice Rule
- Author
-
Yuta Nakamura
- Subjects
Economics and Econometrics ,Chain rule (probability) ,Information aggregation ,Maximum likelihood ,Econometrics ,Economics ,Neutrality ,Social choice theory - Abstract
This study is related to a Condorcetian problem of information aggregation that finds a “true” social ordering using individual orderings, that are supposed to partly contain the “truth”. In this problem, we introduce a new maximum likelihood rule and analyse its performance. This rule selects an alternative that maximizes the probability of realizing individual orderings, conditional on the alternative being the top according to a true social ordering. We show that under a neutrality condition of alternatives, the probability that our rule selects the true top alternative is higher than that of any other rule.
- Published
- 2015
40. From Conditional Independence to Conditionally Negative Association: Some Preliminary Results
- Author
-
Demei Yuan and Shunjing Li
- Subjects
Statistics and Probability ,Conditional entropy ,Regular conditional probability ,Chain rule (probability) ,Conditional independence ,Statistics ,Conditional probability distribution ,Conditional expectation ,Conditional variance ,Random variable ,Mathematics - Abstract
Conditional moment estimates on the cumulative sum of conditionally independent random variables are derived, conditional prophet inequalities for conditionally independent random variables are established, a comparison theorem on conditional moment inequalities between conditionally independent and conditionally negatively associated random variables is established. As applications of these results, a conditional Rosenthal type inequality and two conditional Kolmogorov exponential inequalities for conditionally negatively associated random variables are obtained.
- Published
- 2015
41. Fuzzy probability calculation with confidence intervals in Bayesian networks
- Author
-
Duygu İçen and Derya Ersel
- Subjects
0209 industrial biotechnology ,Chain rule (probability) ,business.industry ,Bayesian network ,Computational intelligence ,02 engineering and technology ,Machine learning ,computer.software_genre ,Fuzzy logic ,Confidence interval ,Theoretical Computer Science ,Bayesian statistics ,020901 industrial engineering & automation ,0202 electrical engineering, electronic engineering, information engineering ,Credible interval ,Fuzzy number ,020201 artificial intelligence & image processing ,Geometry and Topology ,Artificial intelligence ,business ,computer ,Software ,Mathematics - Abstract
In this study, we propose to use Buckley's confidence interval approach which has not been used before in the literature to calculate marginal and conditional fuzzy probabilities in Bayesian networks. We apply this approach to a real life problem and show that Buckley's confidence interval approach provides to indicate uncertainty better and represents knowledge more explicitly than determining fuzzy probabilities based only on the expert opinion in Bayesian networks.
- Published
- 2014
42. Targeting: Logistic Regression, Special Cases and Extensions
- Author
-
Helmut Schaeben
- Subjects
Chain rule (probability) ,conditional independence ,Geography, Planning and Development ,lcsh:G1-922 ,imbalanced datasets ,Conditional probability distribution ,naive Bayes model ,balancing ,Logistic regression ,weights of evidence ,artificial neural nets ,Regular conditional probability ,Conditional independence ,Bayes factors ,potential modeling ,Statistics ,Earth and Planetary Sciences (miscellaneous) ,Econometrics ,Computers in Earth Sciences ,Logistic function ,prospectivity modeling ,Conditional variance ,lcsh:Geography (General) ,Multinomial logistic regression ,Mathematics - Abstract
Logistic regression is a classical linear model for logit-transformed conditional probabilities of a binary target variable. It recovers the true conditional probabilities if the joint distribution of predictors and the target is of log-linear form. Weights-of-evidence is an ordinary logistic regression with parameters equal to the differences of the weights of evidence if all predictor variables are discrete and conditionally independent given the target variable. The hypothesis of conditional independence can be tested in terms of log-linear models. If the assumption of conditional independence is violated, the application of weights-of-evidence does not only corrupt the predicted conditional probabilities, but also their rank transform. Logistic regression models, including the interaction terms, can account for the lack of conditional independence, appropriate interaction terms compensate exactly for violations of conditional independence. Multilayer artificial neural nets may be seen as nested regression-like models, with some sigmoidal activation function. Most often, the logistic function is used as the activation function. If the net topology, i.e., its control, is sufficiently versatile to mimic interaction terms, artificial neural nets are able to account for violations of conditional independence and yield very similar results. Weights-of-evidence cannot reasonably include interaction terms, subsequent modifications of the weights, as often suggested, cannot emulate the effect of interaction terms.
- Published
- 2014
43. Multidimensional and abstract probability
- Author
-
V. M. Maximov
- Subjects
Discrete mathematics ,symbols.namesake ,Mathematics (miscellaneous) ,Chain rule (probability) ,Law of large numbers ,symbols ,Law of total probability ,Cartesian product ,Marginal distribution ,Random variable ,Real number ,Mathematics ,Semiring - Abstract
probabilities are introduced as semiring algebraic structures that retain several properties of classical probabilities taking values in the real number interval [0, 1]. Compact probabilities and random variables with such probabilities are mainly studied. Analogs of the Borel-Cantelli lemma and of the law of large numbers are considered. New notions of superposition of probability spaces and superposition of random variables arise on the basis of the Cartesian product of abstract probabilities.
- Published
- 2014
44. Weighted Closed Form Expressions Based on Escort Distributions for Rényi Entropy Rates of Markov Chains
- Author
-
Loïck Lhote, Valérie Girardin, Philippe Regnault, Laboratoire de Mathématiques de Reims (LMR), Université de Reims Champagne-Ardenne (URCA)-Centre National de la Recherche Scientifique (CNRS), Laboratoire de Mathématiques Nicolas Oresme (LMNO), Centre National de la Recherche Scientifique (CNRS)-Université de Caen Normandie (UNICAEN), Normandie Université (NU)-Normandie Université (NU), Equipe AMACC - Laboratoire GREYC - UMR6072, Groupe de Recherche en Informatique, Image et Instrumentation de Caen (GREYC), Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Ingénieurs de Caen (ENSICAEN), Normandie Université (NU)-Normandie Université (NU)-Université de Caen Normandie (UNICAEN), Normandie Université (NU)-Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Ingénieurs de Caen (ENSICAEN), Normandie Université (NU), and GSI 2017
- Subjects
Stationary distribution ,Chain rule (probability) ,Markov chain ,05 social sciences ,01 natural sciences ,Expression (mathematics) ,Combinatorics ,Rényi entropy ,[MATH.MATH-PR]Mathematics [math]/Probability [math.PR] ,010104 statistics & probability ,Chain (algebraic topology) ,0502 economics and business ,Limit (mathematics) ,0101 mathematics ,Marginal distribution ,ComputingMilieux_MISCELLANEOUS ,050205 econometrics ,Mathematics - Abstract
For Markov chains with transition probabilities \(p_{ij}\), the Shannon entropy rate is well-known to be equal to the sum of the \(-\sum _j p_{ij}\log p_{ij}\) weighted by the stationary distribution. This expression derives from the chain rule specific to Shannon entropy. For an ergodic Markov chain, the stationary distribution is the limit of the marginal distributions of the chain.
- Published
- 2017
45. Scan Statistics for Integer-Valued Random Variables: Conditional Case
- Author
-
Jie Chen and Joseph Glaz
- Subjects
Independent and identically distributed random variables ,Regular conditional probability ,Chain rule (probability) ,Multivariate random variable ,Sum of normally distributed random variables ,Statistics ,Conditional probability distribution ,Marginal distribution ,Conditional variance ,Mathematics - Published
- 2017
46. Converting Graphic Relationships into Conditional Probabilities in Bayesian Network
- Author
-
Loc Nguyen
- Subjects
Bayesian statistics ,Chain rule (probability) ,Computer science ,business.industry ,Bayesian network ,Conditional probability ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,computer ,GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries) - Published
- 2017
47. The Genetic Chain Rule for Probabilistic Kinship Estimation
- Author
-
Brian S. Helfer, Philip Fremont-Smith, and Darrell O. Ricke
- Subjects
Genetics ,Chain rule (probability) ,Probabilistic logic ,Inheritance (genetic algorithm) ,Kinship ,Microsatellite ,Single-nucleotide polymorphism ,Statistical model ,Computational biology ,Biology ,DNA sequencing - Abstract
Accurate kinship predictions using DNA forensic samples has utility for investigative leads, remains identification, identifying relationships between individuals of interest, etc. High throughput sequencing (HTS) of STRs and single nucleotide polymorphisms (SNPs) is enabling the characterization of larger numbers of loci. Large panels of SNP loci have been proposed for improved mixture analysis of forensic samples. While multiple kinship prediction approaches have been established, we present an approach focusing on these large HTS SNP panels for predicting degree of kinship predictions. Formulas for first degree relatives can be multiplied (chained) together to model extended kinship relationships. Predictions are made using these formulations by calculating log likelihood ratios and selecting the maximum likelihood across the possible relationships. With a panel of 30,000 SNPs evaluated on an in silico dataset, this method can resolve parents from siblings and distinguish 1st, 2nd, and 3rd degree relatives from each other and unrelated individuals.
- Published
- 2017
48. The chain rule
- Author
-
Roger B. Nelsen
- Subjects
Combinatorics ,Chain rule (probability) ,Mathematics - Published
- 2017
49. Optimal Detection under the Restricted Bayesian Criterion
- Author
-
Shujun Liu, Ting Yang, and Hongqing Liu
- Subjects
Bayes estimator ,Mathematical optimization ,Admissible decision rule ,Chain rule (probability) ,Bayes risk ,restricted Bayesian ,General Physics and Astronomy ,020206 networking & telecommunications ,Bayes factor ,lcsh:Astrophysics ,02 engineering and technology ,Decision rule ,lcsh:QC1-999 ,Bayesian statistics ,lcsh:QB460-466 ,0202 electrical engineering, electronic engineering, information engineering ,hypothesis-testing ,020201 artificial intelligence & image processing ,lcsh:Q ,Bayesian linear regression ,lcsh:Science ,lcsh:Physics ,Mathematics ,Optimal decision - Abstract
This paper aims to find a suitable decision rule for a binary composite hypothesis-testing problem with a partial or coarse prior distribution. To alleviate the negative impact of the information uncertainty, a constraint is considered that the maximum conditional risk cannot be greater than a predefined value. Therefore, the objective of this paper becomes to find the optimal decision rule to minimize the Bayes risk under the constraint. By applying the Lagrange duality, the constrained optimization problem is transformed to an unconstrained optimization problem. In doing so, the restricted Bayesian decision rule is obtained as a classical Bayesian decision rule corresponding to a modified prior distribution. Based on this transformation, the optimal restricted Bayesian decision rule is analyzed and the corresponding algorithm is developed. Furthermore, the relation between the Bayes risk and the predefined value of the constraint is also discussed. The Bayes risk obtained via the restricted Bayesian decision rule is a strictly decreasing and convex function of the constraint on the maximum conditional risk. Finally, the numerical results including a detection example are presented and agree with the theoretical results.
- Published
- 2017
50. Modeling log-linear conditional probabilities for estimation in surveys
- Author
-
Yves Thibaudeau, Eric V. Slud, and Alfred Gottschalck
- Subjects
Log-linear model ,Statistics and Probability ,model calibration ,Chain rule (probability) ,05 social sciences ,Estimator ,Horvitz–Thompson estimator ,Conditional probability distribution ,01 natural sciences ,010104 statistics & probability ,Modeling and Simulation ,0502 economics and business ,Statistics ,Econometrics ,Survey data collection ,conditional probability ,0101 mathematics ,Statistics, Probability and Uncertainty ,Survey of Income and Program Participation ,Conditional variance ,050205 econometrics ,Mathematics - Abstract
The Survey of Income and Program Participation (SIPP) is a survey with a longitudinal structure and complex nonignorable design, for which correct estimation requires using the weights. The longitudinal setting also suggests conditional-independence relations between survey variables and early- versus late-wave employment classifications. We state original assumptions justifying an extension of the partially model-based approach of Pfeffermann, Skinner and Humphreys [J. Roy. Statist. Soc. Ser. A 161 (1998) 13–32], accounting for the design of SIPP and similar longitudinal surveys. Our assumptions support the use of log-linear models of longitudinal survey data. We highlight the potential they offer for simultaneous bias-control and reduction of sampling error relative to direct methods when applied to small subdomains and cells. Our assumptions allow us to innovate by showing how to rigorously use only a longitudinal survey to estimate a complex log-linear longitudinal association structure and embed it in cross-sectional totals to construct estimators that can be more efficient than direct estimators for small cells.
- Published
- 2017
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.