30 results on '"Torr, Philip H.S."'
Search Results
2. Efficient minimization of higher order submodular functions using monotonic Boolean functions
- Author
-
Ramalingam, Srikumar, Russell, Chris, Ladický, L’ubor, and Torr, Philip H.S.
- Published
- 2017
- Full Text
- View/download PDF
3. Exploiting projective geometry for view-invariant monocular human motion analysis in man-made environments
- Author
-
Rogez, Grégory, Orrite, Carlos, Guerrero, J.J., and Torr, Philip H.S.
- Published
- 2014
- Full Text
- View/download PDF
4. Learning Discriminative Space–Time Action Parts from Weakly Labelled Videos
- Author
-
Sapienza, Michael, Cuzzolin, Fabio, and Torr, Philip H.S.
- Published
- 2014
- Full Text
- View/download PDF
5. Measuring uncertainty in graph cut solutions
- Author
-
Kohli, Pushmeet and Torr, Philip H.S.
- Published
- 2008
- Full Text
- View/download PDF
6. [P.sup.3] & beyond: move making algorithms for solving higher order functions
- Author
-
Kohli, Pushmeet, Kumar, M. Pawan, and Torr, Philip H.S.
- Subjects
Algorithm ,Algorithms -- Analysis ,Polynomials -- Analysis - Published
- 2009
7. Multiview stereo via volumetric graph-cuts and occlusion robust photo-consistency
- Author
-
Vogiatzis, George, Hernandez, Carlos, Torr, Philip H.S., and Cipolla, Roberto
- Subjects
Algorithm ,Algorithms -- Methods ,Three-dimensional graphics -- Properties ,Mathematical optimization -- Methods - Abstract
This paper presents a volumetric formulation for the multiview stereo problem which is amenable to a computationally tractable global optimization using Graph-cuts. Our approach is to seek the optimal partitioning of 3D space into two regions labeled as "object" and "empty" under a cost functional consisting of the following two terms: 1) A term that forces the boundary between the two regions to pass through photo-consistent locations and 2) a ballooning term that inflates the "object" region. To take account of the effect of occlusion on the first term, we use an occlusion robust photo-consistency metric based on Normalized Cross Correlation, which does not assume any geometric knowledge about the reconstructed object. The globally optimal 3D partitioning can be obtained as the minimum cut solution of a weighted graph. Index Terms--3D/stereo scene analysis, shape, graph algorithms, global optimization.
- Published
- 2007
8. Dynamic graph cuts for efficient inference in Markov random fields
- Author
-
Kohli, Pushmeet and Torr, Philip H.S.
- Subjects
Algorithm ,Algorithms -- Methods ,Markov processes -- Observations ,Image processing -- Methods - Abstract
In this paper, we present a fast new fully dynamic algorithm for the st-mincut/max-flow problem. We show how this algorithm can be used to efficiently compute MAP solutions for certain dynamically changing MRF models in computer vision such as image segmentation. Specifically, given the solution of the max-flow problem on a graph, the dynamic algorithm efficiently computes the maximum flow in a modified version of the graph. The time taken by it is roughly proportional to the total amount of change in the edge weights of the graph. Our experiments show that, when the number of changes in the graph is small, the dynamic algorithm is significantly faster than the best known static graph cut algorithm. We test the performance of our algorithm on one particular problem: the object-background segmentation problem for video. It should be noted that the application of our algorithm is not limited to the above problem, the algorithm is generic and can be used to yield similar improvements in many other cases that involve dynamic change. Index Terms--Energy minimization, Markov random fields, dynamic graph cuts, maximum flow, st-mincut, video segmentation.
- Published
- 2007
9. Model-based hand tracking using a hierarchical Bayesian filter
- Author
-
Stenger, Bjorn, Thayananthan, Arasanathan, Torr, Philip H.S., and Cipolla, Roberto
- Subjects
Algorithm ,Technology application ,Algorithms -- Research ,Algorithms -- Technology application ,Object recognition (Computers) -- Research ,Pattern recognition -- Research - Abstract
This paper sets out a tracking framework, which is applied to the recovery of three-dimensional hand motion from an image sequence. The method handles the issues of initialization, tracking, and recovery in a unified way. In a single input image with no prior information of the hand pose, the algorithm is equivalent to a hierarchical detection scheme, where unlikely pose candidates are rapidly discarded. In image sequences, a dynamic model is used to guide the search and approximate the optimal filtering equations. A dynamic model is given by transition probabilities between regions in parameter space and is learned from training data obtained by capturing articulated motion. The algorithm is evaluated on a number of image sequences, which include hand motion with self-occlusion in front of a cluttered background. Index Terms--Probabilistic algorithms, video analysis, tracking.
- Published
- 2006
10. The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences
- Author
-
Torr, Philip H.S., Fitzgibbon, Andrew W., and Zisserman, Andrew
- Published
- 1999
- Full Text
- View/download PDF
11. Learning disentangled representations with semi-supervised deep generative models
- Author
-
Siddharth, N., Paige, Brooks, van de Meent, Jan-Willem, Desmaison, Alban, Goodman, Noah, Kohli, Pushmeet, Wood, Frank, Torr, Philip H.S., Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R.
- Subjects
FOS: Computer and information sciences ,Computer Science - Learning ,Artificial Intelligence (cs.AI) ,Statistics - Machine Learning ,Computer Science - Artificial Intelligence ,0202 electrical engineering, electronic engineering, information engineering ,Machine Learning (stat.ML) ,020206 networking & telecommunications ,020201 artificial intelligence & image processing ,02 engineering and technology ,Machine Learning (cs.LG) - Abstract
Variational autoencoders (VAEs) learn representations of data by jointly training a probabilistic encoder and decoder network. Typically these models encode all features of the data into a single variable. Here we are interested in learning disentangled representations that encode distinct aspects of the data into separate variables. We propose to learn such representations using model architectures that generalise from standard VAEs, employing a general graphical model structure in the encoder and decoder. This allows us to train partially-specified models that make relatively strong assumptions about a subset of interpretable variables and rely on the flexibility of neural networks to learn representations for the remaining variables. We further define a general objective for semi-supervised learning in this model class, which can be approximated using an importance sampling procedure. We evaluate our framework's ability to learn disentangled representations, both by qualitative exploration of its generative capacity, and quantitative evaluation of its discriminative ability on a variety of models and datasets., Accepted for publication at NIPS 2017
- Published
- 2019
- Full Text
- View/download PDF
12. Adversarial Metric Attack and Defense for Person Re-Identification.
- Author
-
Bai, Song, Li, Yingwei, Zhou, Yuyin, Li, Qizhu, and Torr, Philip H.S.
- Subjects
IDENTIFICATION ,VIDEO surveillance - Abstract
Person re-identification (re-ID) has attracted much attention recently due to its great importance in video surveillance. In general, distance metrics used to identify two person images are expected to be robust under various appearance changes. However, our work observes the extreme vulnerability of existing distance metrics to adversarial examples, generated by simply adding human-imperceptible perturbations to person images. Hence, the security danger is dramatically increased when deploying commercial re-ID systems in video surveillance. Although adversarial examples have been extensively applied for classification analysis, it is rarely studied in metric analysis like person re-identification. The most likely reason is the natural gap between the training and testing of re-ID networks, that is, the predictions of a re-ID network cannot be directly used during testing without an effective metric. In this work, we bridge the gap by proposing Adversarial Metric Attack, a parallel methodology to adversarial classification attacks. Comprehensive experiments clearly reveal the adversarial effects in re-ID systems. Meanwhile, we also present an early attempt of training a metric-preserving network, thereby defending the metric against adversarial attacks. At last, by benchmarking various adversarial settings, we expect that our work can facilitate the development of adversarial attack and defense in metric-based applications. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
13. Learning Regional Attraction for Line Segment Detection.
- Author
-
Xue, Nan, Bai, Song, Wang, Fu-Dong, Xia, Gui-Song, Wu, Tianfu, Zhang, Liangpei, and Torr, Philip H.S.
- Subjects
DEEP learning ,IMAGE segmentation ,PIXELS - Abstract
This paper presents regional attraction of line segment maps, and hereby poses the problem of line segment detection (LSD) as a problem of region coloring. Given a line segment map, the proposed regional attraction first establishes the relationship between line segments and regions in the image lattice. Based on this, the line segment map is equivalently transformed to an attraction field map (AFM), which can be remapped to a set of line segments without loss of information. Accordingly, we develop an end-to-end framework to learn attraction field maps for raw input images, followed by a squeeze module to detect line segments. Apart from existing works, the proposed detector properly handles the local ambiguity and does not rely on the accurate identification of edge pixels. Comprehensive experiments on the Wireframe dataset and the YorkUrban dataset demonstrate the superiority of our method. In particular, we achieve an F-measure of 0.831 on the Wireframe dataset, advancing the state-of-the-art performance by 10.3 percent. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
14. Sequential Optimization for Efficient High-Quality Object Proposal Generation.
- Author
-
Zhang, Ziming, Liu, Yun, Chen, Xi, Zhu, Yanjun, Cheng, Ming-Ming, Saligrama, Venkatesh, and Torr, Philip H.S.
- Subjects
IMAGE segmentation ,OBJECT recognition (Computer vision) ,EDGE detection (Image processing) ,COMPUTER algorithms ,MACHINE learning - Abstract
We are motivated by the need for a generic object proposal generation algorithm which achieves good balance between object detection recall, proposal localization quality and computational efficiency. We propose a novel object proposal algorithm, BING++, which inherits the virtue of good computational efficiency of BING
[1] but significantly improves its proposal localization quality. At high level we formulate the problem of object proposal generation from a novel probabilistic perspective, based on which our BING++ manages to improve the localization quality by employing edges and segments to estimate object boundaries and update the proposals sequentially. We propose learning the parameters efficiently by searching for approximate solutions in a quantized parameter space for complexity reduction. We demonstrate the generalization of BING++ with the same fixed parameters across different object classes and datasets. Empirically our BING++ can run at half speed of BING on CPU, but significantly improve the localization quality by 18.5 and 16.7 percent on both VOC2007 and Microhsoft COCO datasets, respectively. Compared with other state-of-the-art approaches, BING++ can achieve comparable performance, but run significantly faster. [ABSTRACT FROM AUTHOR]- Published
- 2018
- Full Text
- View/download PDF
15. CODE: Coherence Based Decision Boundaries for Feature Correspondence.
- Author
-
Lin, Wen-Yan, Wang, Fan, Cheng, Ming-Ming, Yeung, Sai-Kit, Torr, Philip H.S., Do, Minh N., and Lu, Jiangbo
- Subjects
CLASSIFICATION algorithms ,IMAGE processing ,PATTERN recognition systems ,CODING theory ,ALGORITHMS - Abstract
A key challenge in feature correspondence is the difficulty in differentiating true and false matches at a local descriptor level. This forces adoption of strict similarity thresholds that discard many true matches. However, if analyzed at a global level, false matches are usually randomly scattered while true matches tend to be coherent (clustered around a few dominant motions), thus creating a coherence based separability constraint. This paper proposes a non-linear regression technique that can discover such a coherence based separability constraint from highly noisy matches and embed it into a correspondence likelihood model. Once computed, the model can filter the entire set of nearest neighbor matches (which typically contains over 90 percent false matches) for true matches. We integrate our technique into a full feature correspondence system which reliably generates large numbers of good quality correspondences over wide baselines where previous techniques provide few or no matches. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
16. Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation: Combining Probabilistic Graphical Models with Deep Learning for Structured Prediction.
- Author
-
Arnab, Anurag, Zheng, Shuai, Jayasumana, Sadeep, Romera-Paredes, Bernardino, Larsson, Mans, Kirillov, Alexander, Savchynskyy, Bogdan, Rother, Carsten, Kahl, Fredrik, and Torr, Philip H.S.
- Abstract
Semantic segmentation is the task of labeling every pixel in an image with a predefined object category. It has numerous applications in scenarios where the detailed understanding of an image is required, such as in autonomous vehicles and medical diagnosis. This problem has traditionally been solved with probabilistic models known as conditional random fields (CRFs) due to their ability to model the relationships between the pixels being predicted. However, deep neural networks (DNNs) recently have been shown to excel at a wide range of computer vision problems due to their ability to automatically learn rich feature representations from data, as opposed to traditional handcrafted features. The idea of combining CRFs and DNNs have achieved state-of-the-art results in a number of domains. We review the literature on combining the modeling power of CRFs with the representation-learning ability of DNNs, ranging from early work that combines these two techniques as independent stages of a common pipeline to recent approaches that embed inference of probabilistic models directly in the neural network itself. Finally, we summarize future research directions. [ABSTRACT FROM PUBLISHER]
- Published
- 2018
- Full Text
- View/download PDF
17. Struck: Structured Output Tracking with Kernels.
- Author
-
Hare, Sam, Golodetz, Stuart, Saffari, Amir, Vineet, Vibhav, Cheng, Ming-Ming, Hicks, Stephen L., and Torr, Philip H.S.
- Subjects
TRACKING & trailing ,COMPUTER vision ,PATTERN recognition systems ,POSE estimation (Computer vision) ,OBJECT tracking (Computer vision) - Abstract
Adaptive tracking-by-detection methods are widely used in computer vision for tracking arbitrary objects. Current approaches treat the tracking problem as a classification task and use online learning techniques to update the object model. However, for these updates to happen one needs to convert the estimated object position into a set of labelled training examples, and it is not clear how best to perform this intermediate step. Furthermore, the objective for the classifier (label prediction) is not explicitly coupled to the objective for the tracker (estimation of object position). In this paper, we present a framework for adaptive visual object tracking based on structured output prediction. By explicitly allowing the output space to express the needs of the tracker, we avoid the need for an intermediate classification step. Our method uses a kernelised structured output support vector machine (SVM), which is learned online to provide adaptive tracking. To allow our tracker to run at high frame rates, we (a) introduce a budgeting mechanism that prevents the unbounded growth in the number of support vectors that would otherwise occur during tracking, and (b) show how to implement tracking on the GPU. Experimentally, we show that our algorithm is able to outperform state-of-the-art trackers on various benchmark videos. Additionally, we show that we can easily incorporate additional features and kernels into our framework, which results in increased tracking performance. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
18. Object Proposal Generation Using Two-Stage Cascade SVMs.
- Author
-
Zhang, Ziming and Torr, Philip H.S.
- Subjects
- *
OBJECT recognition (Computer vision) , *SUPPORT vector machines , *COMPUTER algorithms , *CALIBRATION , *EDGE detection (Image processing) - Abstract
Object proposal algorithms have shown great promise as a first step for object recognition and detection. Good object proposal generation algorithms require high object detection recall rate as well as low computational cost, because generating object proposals is usually utilized as a preprocessing step. The problem of how to accelerate the object proposal generation and evaluation process without decreasing recall is thus of great interest. In this paper, we propose a new object proposal generation method using two-stage cascade support vector machines (SVMs), where in the first stage linear filters are learned for predefined quantized scales/aspect-ratios independently, and in the second stage a global linear classifier is learned across all the quantized scales/aspect-ratios for calibration, so that all the windows from the first stage can be compared properly. The windows with highest scores from the second stage are kept as inputs to our new efficient proposal calibration algorithm to improve their localization quality significantly, resulting in our final object proposals. We explain our scale/aspect-ratio quantization scheme, and investigate the effects of combinations of $\ell _1$
regularizers in cascade SVMs with/without ranking constraints in learning. Comprehensive experiments on VOC2007 dataset are conducted, and our method is comparable with the current state-of-the-art methods with much better computational efficiency. [ABSTRACT FROM PUBLISHER]- Published
- 2016
- Full Text
- View/download PDF
19. Pose estimation and tracking using multivariate regression
- Author
-
Thayananthan, Arasanathan, Navaratnam, Ramanan, Stenger, Björn, Torr, Philip H.S., and Cipolla, Roberto
- Published
- 2008
- Full Text
- View/download PDF
20. Dense Semantic Image Segmentation with Objects and Attributes.
- Author
-
Zheng, Shuai, Cheng, Ming-Ming, Warrell, Jonathan, Sturgess, Paul, Vineet, Vibhav, Rother, Carsten, and Torr, Philip H.S.
- Published
- 2014
- Full Text
- View/download PDF
21. Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation.
- Author
-
Ladicky, Lubor, Torr, Philip H.S., and Zisserman, Andrew
- Published
- 2013
- Full Text
- View/download PDF
22. Mesh Based Semantic Modelling for Indoor and Outdoor Scenes.
- Author
-
Valentin, Julien P.C., Sengupta, Sunando, Warrell, Jonathan, Shahrokni, Ali, and Torr, Philip H.S.
- Published
- 2013
- Full Text
- View/download PDF
23. A tiered move-making algorithm for general pairwise MRFs.
- Author
-
Vineet, Vibhav, Warrell, Jonathan, and Torr, Philip H.S.
- Abstract
A large number of problems in computer vision can be modeled as energy minimization problems in a markov random field (MRF) framework. Many methods have been developed over the years for efficient inference, especially in pairwise MRFs. In general there is a trade-off between the complexity/efficiency of the algorithm and its convergence properties, with certain problems requiring more complex inference to handle general pairwise potentials. Graphcuts based α-expansion performs well on certain classes of energies, and sequential tree reweighted message passing (TRWS) and loopy belief propagation (LBP) can be used for non-submodular cases. These methods though suffer from poor convergence and often oscillate between solutions. In this paper, we propose a tiered move making algorithm which is an iterative method. Each move to the next configuration is based on the current labeling and an optimal tiered move, where each tiered move requires one application of the dynamic programming based tiered labeling method introduced in Felzenszwalb et. al. [2]. The algorithm converges to a local minimum for any general pairwise potential, and we give a theoretical analysis of the properties of the algorithm, characterizing the situations in which we can expect good performance. We evaluate the algorithm on many benchmark labeling problems such as stereo, image segmentation, image stitching and image denoising, as well as random energy minimization. Our method consistently gets better energy values than α-expansion, LBP, quadratic pseudo-boolean optimization (QPBO), and is competitive with TRWS. [ABSTRACT FROM PUBLISHER]
- Published
- 2012
- Full Text
- View/download PDF
24. PoseCut: Simultaneous Segmentation and 3D Pose Estimation of Humans Using Dynamic Graph-Cuts.
- Author
-
Leonardis, Aleš, Bischof, Horst, Pinz, Axel, Bray, Matthieu, Kohli, Pushmeet, and Torr, Philip H.S.
- Abstract
We present a novel algorithm for performing integrated segmentation and 3D pose estimation of a human body from multiple views. Unlike other related state of the art techniques which focus on either segmentation or pose estimation individually, our approach tackles these two tasks together. Normally, when optimizing for pose, it is traditional to use some fixed set of features, e.g. edges or chamfer maps. In contrast, our novel approach consists of optimizing a cost function based on a Markov Random Field (MRF). This has the advantage that we can use all the information in the image: edges, background and foreground appearances, as well as the prior information on the shape and pose of the subject and combine them in a Bayesian framework. Previously, optimizing such a cost function would have been computationally infeasible. However, our recent research in dynamic graph cuts allows this to be done much more efficiently than before. We demonstrate the efficacy of our approach on challenging motion sequences. Note that although we target the human pose inference problem in the paper, our method is completely generic and can be used to segment and infer the pose of any specified rigid, deformable or articulated object. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
25. Measuring Uncertainty in Graph Cut Solutions - Efficiently Computing Min-marginal Energies Using Dynamic Graph Cuts.
- Author
-
Leonardis, Aleš, Bischof, Horst, Pinz, Axel, Kohli, Pushmeet, and Torr, Philip H.S.
- Abstract
In recent years the use of graph-cuts has become quite popular in computer vision. However, researchers have repeatedly asked the question whether it might be possible to compute a measure of uncertainty associated with the graph-cut solutions. In this paper we answer this particular question by showing how the min-marginals associated with the label assignments in a MRF can be efficiently computed using a new algorithm based on dynamic graph cuts. We start by reporting the discovery of a novel relationship between the min-marginal energy corresponding to a latent variable label assignment, and the flow potentials of the node representing that variable in the graph used in the energy minimization procedure. We then proceed to show how the min-marginal energy can be computed by minimizing a projection of the energy function defined by the MRF. We propose a fast and novel algorithm based on dynamic graph cuts to efficiently minimize these energy projections. The min-marginal energies obtained by our proposed algorithm are exact, as opposed to the ones obtained from other inference algorithms like loopy belief propagation and generalized belief propagation. We conclude by showing how min-marginals can be used to compute a confidence measure for label assignments in labelling problems such as image segmentation. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
26. Reconstructing relief surfaces
- Author
-
Vogiatzis, George, Torr, Philip H.S., Seitz, Steven M., and Cipolla, Roberto
- Subjects
- *
STEREOSCOPIC views , *VOLUMETRIC analysis , *RELIEF models , *TOPOLOGY , *MARKOV random fields , *GEOGRAPHY - Abstract
Abstract: This paper generalizes Markov Random Field (MRF) stereo methods to the generation of surface relief (height) fields rather than disparity or depth maps. This generalization enables the reconstruction of complete object models using the same algorithms that have been previously used to compute depth maps in binocular stereo. In contrast to traditional dense stereo where the parametrization is image based, here we advocate a parametrization by a height field over any base surface. In practice, the base surface is a coarse approximation to the true geometry, e.g., a bounding box, visual hull or triangulation of sparse correspondences, and is assigned or computed using other means. A dense set of sample points is defined on the base surface, each with a fixed normal direction and unknown height value. The estimation of heights for the sample points is achieved by a belief propagation technique. Our method provides a viewpoint independent smoothness constraint, a more compact parametrization and explicit handling of occlusions. We present experimental results on real scenes as well as a quantitative evaluation on an artificial scene. [Copyright &y& Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
27. IMPSAC: Synthesis of Importance Sampling and Random Sample Consensus.
- Author
-
Torr, Philip H.S. and Davidson, Colin
- Subjects
- *
IMAGE analysis , *STATISTICAL sampling , *GEOMETRY - Abstract
This paper proposes a new method for recovery of epipolar geometry and feature correspondence between images which have undergone a significant deformation, either due to large rotation or wide baseline of the cameras. The method also encodes the uncertainty by providing an arbitrarily close approximation to the posterior distribution of the two view relation. The method operates on a pyramid from coarse to fine resolution, thus raising the problem of how to propagate information from one level to another in a statistically consistent way. The distribution of the parameters at each resolution is encoded nonparametrically as a set of particles. At the coarsest level a RANSAC-MCMC estimator is used to initialize this set of particles, the posterior can then be approximated as a mixture of Gaussians fitted to these particles. The distribution at a coarser level influences the distribution at a finer level using the technique of sampling importance resampling (SIR) and MCMC, which allows for asymptotically correct approximations of the posterior distribution. The estimate of the posterior distribution at the level above is being used as the importance sampling function to generate a new set of particles, which can be further improved by MCMC. It is shown that the method is superior to previous single resolution RANSAC-style feature matchers. [ABSTRACT FROM AUTHOR]
- Published
- 2003
- Full Text
- View/download PDF
28. An Integrated Bayesian Approach to Layer Extraction from Image Sequences.
- Author
-
Torr, Philip H.S. and Szeliski, Richard
- Subjects
- *
BAYESIAN analysis , *THREE-dimensional display systems , *COMPUTER vision , *ALGORITHMS - Abstract
Provides information on a study which described a Bayesian approach for modeling three dimensional scenes in computer vision analysis. Comparison between the algorithmic and Bayenesian approach in creating still images; Sample computation of the parametric formulation in computer imaging; Methodology and discussion.
- Published
- 2001
- Full Text
- View/download PDF
29. Lessons from reinforcement learning for biological representations of space.
- Author
-
Muryy, Alex, Siddharth, N., Nardelli, Nantas, Glennerster, Andrew, and Torr, Philip H.S.
- Subjects
- *
REINFORCEMENT learning , *NEUROSCIENTISTS , *SPACE perception , *NAVIGATION , *COGNITIVE maps (Psychology) , *BRAIN , *RESEARCH , *RESEARCH methodology , *MEDICAL cooperation , *EVALUATION research , *LEARNING , *COMPARATIVE studies , *REWARD (Psychology) - Abstract
Neuroscientists postulate 3D representations in the brain in a variety of different coordinate frames (e.g. 'head-centred', 'hand-centred' and 'world-based'). Recent advances in reinforcement learning demonstrate a quite different approach that may provide a more promising model for biological representations underlying spatial perception and navigation. In this paper, we focus on reinforcement learning methods that reward an agent for arriving at a target image without any attempt to build up a 3D 'map'. We test the ability of this type of representation to support geometrically consistent spatial tasks such as interpolating between learned locations using decoding of feature vectors. We introduce a hand-crafted representation that has, by design, a high degree of geometric consistency and demonstrate that, in this case, information about the persistence of features as the camera translates (e.g. distant features persist) can improve performance on the geometric tasks. These examples avoid Cartesian (in this case, 2D) representations of space. Non-Cartesian, learned representations provide an important stimulus in neuroscience to the search for alternatives to a 'cognitive map'. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
30. Deep learning for predicting COVID-19 malignant progression.
- Author
-
Fang, Cong, Bai, Song, Chen, Qianlan, Zhou, Yu, Xia, Liming, Qin, Lixin, Gong, Shi, Xie, Xudong, Zhou, Chunhua, Tu, Dandan, Zhang, Changzheng, Liu, Xiaowu, Chen, Weiwei, Bai, Xiang, and Torr, Philip H.S.
- Subjects
- *
COVID-19 , *DEEP learning , *COMPUTED tomography , *BRAIN natriuretic factor , *C-reactive protein , *ASPARTATE aminotransferase - Abstract
• The first approach leverages both sequential CT scans and clinical data to predict COVID-19 malignant progression. • Our method achieves an AUC of 0.920 in the single-center study and an average AUC of 0.874 in the multicenter study. • The proposed domain adaptation can improve the generalization power of our model in the multicenter study. • Our model automatically identifies crucial indicators that contribute to the malignant progression. [Display omitted] As COVID-19 is highly infectious, many patients can simultaneously flood into hospitals for diagnosis and treatment, which has greatly challenged public medical systems. Treatment priority is often determined by the symptom severity based on first assessment. However, clinical observation suggests that some patients with mild symptoms may quickly deteriorate. Hence, it is crucial to identify patient early deterioration to optimize treatment strategy. To this end, we develop an early-warning system with deep learning techniques to predict COVID-19 malignant progression. Our method leverages CT scans and the clinical data of outpatients and achieves an AUC of 0.920 in the single-center study. We also propose a domain adaptation approach to improve the generalization of our model and achieve an average AUC of 0.874 in the multicenter study. Moreover, our model automatically identifies crucial indicators that contribute to the malignant progression, including Troponin, Brain natriuretic peptide, White cell count, Aspartate aminotransferase, Creatinine, and Hypersensitive C-reactive protein. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.