200 results on '"Hailin Jin"'
Search Results
152. Compositional Sketch Search
- Author
-
Alexander Black, Hailin Jin, Long Mai, Tu Bui, and John Collomosse
- Subjects
Visual search ,FOS: Computer and information sciences ,Similarity (geometry) ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Pattern recognition ,Object (computer science) ,Convolutional neural network ,Sketch ,Metric (mathematics) ,Artificial intelligence ,business ,Representation (mathematics) ,Image retrieval - Abstract
We present an algorithm for searching image collections using free-hand sketches that describe the appearance and relative positions of multiple objects. Sketch based image retrieval (SBIR) methods predominantly match queries containing a single, dominant object invariant to its position within an image. Our work exploits drawings as a concise and intuitive representation for specifying entire scene compositions. We train a convolutional neural network (CNN) to encode masked visual features from sketched objects, pooling these into a spatial descriptor encoding the spatial relationships and appearances of objects in the composition. Training the CNN backbone as a Siamese network under triplet loss yields a metric search embedding for measuring compositional similarity which may be efficiently leveraged for visual search by applying product quantization., Comment: ICIP 2021 camera-ready version
- Published
- 2021
- Full Text
- View/download PDF
153. Black-Box Diagnosis and Calibration on GAN Intra-Mode Collapse: A Pilot Study
- Author
-
Zhenyu Wu, Zhaowen Wang, Ye Yuan, Jianming Zhang, Zhangyang Wang, and Hailin Jin
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Computer Networks and Communications ,Hardware and Architecture ,Machine Learning (cs.LG) - Abstract
Generative adversarial networks (GANs) nowadays are capable of producing images of incredible realism. One concern raised is whether the state-of-the-art GAN's learned distribution still suffers from mode collapse, and what to do if so. Existing diversity tests of samples from GANs are usually conducted qualitatively on a small scale, and/or depends on the access to original training data as well as the trained model parameters. This paper explores to diagnose GAN intra-mode collapse and calibrate that, in a novel black-box setting: no access to training data, nor the trained model parameters, is assumed. The new setting is practically demanded, yet rarely explored and significantly more challenging. As a first stab, we devise a set of statistical tools based on sampling, that can visualize, quantify, and rectify intra-mode collapse. We demonstrate the effectiveness of our proposed diagnosis and calibration techniques, via extensive simulations and experiments, on unconditional GAN image generation (e.g., face and vehicle). Our study reveals that the intra-mode collapse is still a prevailing problem in state-of-the-art GANs and the mode collapse is diagnosable and calibratable in black-box settings. Our codes are available at: https://github.com/VITA-Group/BlackBoxGANCollapse., Comment: This paper has been accepted by Transactions on Multimedia Computing Communications and Applications (TOMM) for publication in 2021
- Published
- 2021
- Full Text
- View/download PDF
154. Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset
- Author
-
Haotao Wang, Hailin Jin, Zhenyu Wu, Zhangyang Wang, and Zhaowen Wang
- Subjects
FOS: Computer and information sciences ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,Datasets as Topic ,Skin Pigmentation ,02 engineering and technology ,Machine learning ,computer.software_genre ,Task (project management) ,Adversarial system ,Artificial Intelligence ,Task Performance and Analysis ,0202 electrical engineering, electronic engineering, information engineering ,Private information retrieval ,Protocol (object-oriented programming) ,business.industry ,Applied Mathematics ,Deep learning ,Computational Theory and Mathematics ,Action (philosophy) ,Privacy ,Action recognition ,020201 artificial intelligence & image processing ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,computer ,Software ,Algorithms - Abstract
We investigate privacy-preserving, video-based action recognition in deep learning, a problem with growing importance in smart camera applications. A novel adversarial training framework is formulated to learn an anonymization transform for input videos such that the trade-off between target utility task performance and the associated privacy budgets is explicitly optimized on the anonymized videos. Notably, the privacy budget, often defined and measured in task-driven contexts, cannot be reliably indicated using any single model performance because strong protection of privacy should sustain against any malicious model that tries to steal private information. To tackle this problem, we propose two new optimization strategies of model restarting and model ensemble to achieve stronger universal privacy protection against any attacker models. Extensive experiments have been carried out and analyzed. On the other hand, given few public datasets available with both utility and privacy labels, the data-driven (supervised) learning cannot exert its full power on this task. We first discuss an innovative heuristic of cross-dataset training and evaluation, enabling the use of multiple single-task datasets (one with target task labels and the other with privacy labels) in our problem. To further address this dataset challenge, we have constructed a new dataset, termed PA-HMDB51, with both target task labels (action) and selected privacy attributes (skin color, face, gender, nudity, and relationship) annotated on a per-frame basis. This first-of-its-kind video dataset and evaluation protocol can greatly facilitate visual privacy research and open up other opportunities. Our codes, models, and the PA-HMDB51 dataset are available at https://github.com/VITA-Group/PA-HMDB51., Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). arXiv admin note: text overlap with arXiv:1807.08379
- Published
- 2020
155. Critical Chords of Convex Bodies of Constant Width
- Author
-
Xinyue Zhou and Hailin Jin
- Subjects
Physics ,Multidisciplinary ,media_common.quotation_subject ,010102 general mathematics ,Regular polygon ,02 engineering and technology ,01 natural sciences ,Asymmetry ,Measure (mathematics) ,Combinatorics ,Minkowski space ,0202 electrical engineering, electronic engineering, information engineering ,Convex body ,020201 artificial intelligence & image processing ,0101 mathematics ,Constant (mathematics) ,media_common - Abstract
In this paper, we show that when Minkowski measure of asymmetry of convex body K of constant width is bigger than α (n-1), K has at least n+1 critical chords, where $$\alpha (n) = \frac{{n + \sqrt {2n(n + 1)} }}{{n + 2}}$$ .
- Published
- 2018
- Full Text
- View/download PDF
156. Real-Time 3-D Motion and Structure of Point-Features: A Front-End for Vision-Based Control and Interaction.
- Author
-
Hailin Jin, Paolo Favaro, and Stefano Soatto
- Published
- 2000
- Full Text
- View/download PDF
157. The log-Minkowski measure of asymmetry for convex bodies
- Author
-
HaiLin Jin
- Subjects
Mixed volume ,Hyperbolic geometry ,media_common.quotation_subject ,010102 general mathematics ,Regular polygon ,02 engineering and technology ,Computer Science::Computational Geometry ,01 natural sciences ,Measure (mathematics) ,Asymmetry ,Combinatorics ,Differential geometry ,Minkowski space ,0202 electrical engineering, electronic engineering, information engineering ,High Energy Physics::Experiment ,020201 artificial intelligence & image processing ,Geometry and Topology ,0101 mathematics ,Projective geometry ,Mathematics ,media_common - Abstract
In this paper, we introduce a new measure of asymmetry, called log-Minkowski measure of asymmetry for planar convex bodies in terms of the $$L_0$$ -mixed volume, and show that triangles are the most asymmetric planar convex bodies in the sense of this measure of asymmetry.
- Published
- 2017
- Full Text
- View/download PDF
158. Superpixel Segmentation with Fully Convolutional Networks
- Author
-
Qian Sun, Zihan Zhou, Fengting Yang, and Hailin Jin
- Subjects
FOS: Computer and information sciences ,Artificial neural network ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Feature extraction ,Computer Science - Computer Vision and Pattern Recognition ,Initialization ,020207 software engineering ,Pattern recognition ,02 engineering and technology ,Image segmentation ,Upsampling ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business - Abstract
In computer vision, superpixels have been widely used as an effective way to reduce the number of image primitives for subsequent processing. But only a few attempts have been made to incorporate them into deep neural networks. One main reason is that the standard convolution operation is defined on regular grids and becomes inefficient when applied to superpixels. Inspired by an initialization strategy commonly adopted by traditional superpixel algorithms, we present a novel method that employs a simple fully convolutional network to predict superpixels on a regular image grid. Experimental results on benchmark datasets show that our method achieves state-of-the-art superpixel segmentation performance while running at about 50fps. Based on the predicted superpixels, we further develop a downsampling/upsampling scheme for deep networks with the goal of generating high-resolution outputs for dense prediction tasks. Specifically, we modify a popular network architecture for stereo matching to simultaneously predict superpixels and disparities. We show that improved disparity estimation accuracy can be obtained on public datasets., 16 pages, 15 figures, to be published in CVPR'20
- Published
- 2020
159. Steering Self-Supervised Feature Learning Beyond Local Pixel Statistics
- Author
-
Hailin Jin, Simon Jenni, and Paolo Favaro
- Subjects
FOS: Computer and information sciences ,Pixel ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,05 social sciences ,Supervised learning ,Feature extraction ,Computer Science - Computer Vision and Pattern Recognition ,Inpainting ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,010501 environmental sciences ,01 natural sciences ,510 Mathematics ,Transformation (function) ,Feature (computer vision) ,Computer Science::Computer Vision and Pattern Recognition ,0502 economics and business ,Statistics ,050207 economics ,Image warping ,Feature learning ,000 Computer science, knowledge & systems ,0105 earth and related environmental sciences - Abstract
We introduce a novel principle for self-supervised feature learning based on the discrimination of specific transformations of an image. We argue that the generalization capability of learned features depends on what image neighborhood size is sufficient to discriminate different image transformations: The larger the required neighborhood size and the more global the image statistics that the feature can describe. An accurate description of global image statistics allows to better represent the shape and configuration of objects and their context, which ultimately generalizes better to new tasks such as object classification and detection. This suggests a criterion to choose and design image transformations. Based on this criterion, we introduce a novel image transformation that we call limited context inpainting (LCI). This transformation inpaints an image patch conditioned only on a small rectangular pixel boundary (the limited context). Because of the limited boundary information, the inpainter can learn to match local pixel statistics, but is unlikely to match the global statistics of the image. We claim that the same principle can be used to justify the performance of transformations such as image rotations and warping. Indeed, we demonstrate experimentally that learning to discriminate transformations such as LCI, image warping and rotations, yields features with state of the art generalization capabilities on several datasets such as Pascal VOC, STL-10, CelebA, and ImageNet. Remarkably, our trained features achieve a performance on Places on par with features trained through supervised learning with ImageNet labels., Comment: CVPR 2020 (oral)
- Published
- 2020
- Full Text
- View/download PDF
160. Learning Video Representations from Correspondence Proposals
- Author
-
Hailin Jin, Joon-Young Lee, and Xingyu Liu
- Subjects
FOS: Computer and information sciences ,Computer Science - Machine Learning ,Artificial neural network ,Computer science ,business.industry ,Deep learning ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,020207 software engineering ,02 engineering and technology ,Dynamic web page ,010501 environmental sciences ,01 natural sciences ,Machine Learning (cs.LG) ,Robustness (computer science) ,0202 electrical engineering, electronic engineering, information engineering ,Artificial intelligence ,business ,Feature learning ,0105 earth and related environmental sciences - Abstract
Correspondences between frames encode rich information about dynamic content in videos. However, it is challenging to effectively capture and learn those due to their irregular structure and complex dynamics. In this paper, we propose a novel neural network that learns video representations by aggregating information from potential correspondences. This network, named $CPNet$, can learn evolving 2D fields with temporal consistency. In particular, it can effectively learn representations for videos by mixing appearance and long-range motion with an RGB-only input. We provide extensive ablation experiments to validate our model. CPNet shows stronger performance than existing methods on Kinetics and achieves the state-of-the-art performance on Something-Something and Jester. We provide analysis towards the behavior of our model and show its robustness to errors in proposals., CVPR 2019 (Oral)
- Published
- 2019
161. Real-time Virtual Object Insertion.
- Author
-
Paolo Favaro, Hailin Jin, and Stefano Soatto
- Published
- 2001
- Full Text
- View/download PDF
162. Asymmetry of Reuleaux polygons
- Author
-
HaiLin Jin
- Subjects
Algebra and Number Theory ,media_common.quotation_subject ,010102 general mathematics ,Regular polygon ,0102 computer and information sciences ,Algebraic geometry ,Computer Science::Computational Geometry ,01 natural sciences ,Measure (mathematics) ,Asymmetry ,Combinatorics ,Reuleaux triangle ,Planar ,010201 computation theory & mathematics ,Geometry and Topology ,0101 mathematics ,Constant (mathematics) ,media_common ,Curve of constant width ,Mathematics - Abstract
In this paper, we consider a measure of asymmetry for Reuleaux polygons, and show that the n-th (\(n \ge 3, n \;\text {odd}\)) regular Reuleaux polygons are the most symmetric ones among all n-th Reuleaux polygons. As a byproduct, we show that the Reuleaux triangles are the most asymmetric planar convex bodies of constant width.
- Published
- 2016
- Full Text
- View/download PDF
163. Content-preserving warps for 3D video stabilization.
- Author
-
Feng Liu 0015, Michael Gleicher, Hailin Jin, and Aseem Agarwala
- Published
- 2009
- Full Text
- View/download PDF
164. Large-scale Tag-based Font Retrieval with Generative Feature Learning
- Author
-
Jiebo Luo, Zhaowen Wang, Ning Xu, Tianlang Chen, and Hailin Jin
- Subjects
FOS: Computer and information sciences ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,02 engineering and technology ,010501 environmental sciences ,computer.software_genre ,01 natural sciences ,Font ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Feature learning ,Natural language processing ,Generative grammar ,0105 earth and related environmental sciences - Abstract
Font selection is one of the most important steps in a design workflow. Traditional methods rely on ordered lists which require significant domain knowledge and are often difficult to use even for trained professionals. In this paper, we address the problem of large-scale tag-based font retrieval which aims to bring semantics to the font selection process and enable people without expert knowledge to use fonts effectively. We collect a large-scale font tagging dataset of high-quality professional fonts. The dataset contains nearly 20,000 fonts, 2,000 tags, and hundreds of thousands of font-tag relations. We propose a novel generative feature learning algorithm that leverages the unique characteristics of fonts. The key idea is that font images are synthetic and can therefore be controlled by the learning algorithm. We design an integrated rendering and learning process so that the visual feature from one image can be used to reconstruct another image with different text. The resulting feature captures important font design details while is robust to nuisance factors such as text. We propose a novel attention mechanism to re-weight the visual feature for joint visual-text modeling. We combine the feature and the attention mechanism in a novel recognition-retrieval model. Experimental results show that our method significantly outperforms the state-of-the-art for the important problem of large-scale tag-based font retrieval., Comment: accepted by ICCV 2019
- Published
- 2019
- Full Text
- View/download PDF
165. LiveSketch: Query Perturbations for Guided Sketch-based Visual Search
- Author
-
Hailin Jin, John Collomosse, and Tu Bui
- Subjects
Visual search ,FOS: Computer and information sciences ,Information retrieval ,Computer Science - Artificial Intelligence ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,020207 software engineering ,02 engineering and technology ,Autoencoder ,Sketch ,Computer Science - Information Retrieval ,Image (mathematics) ,Artificial Intelligence (cs.AI) ,Categorization ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,020201 artificial intelligence & image processing ,Artificial intelligence ,Cluster analysis ,business ,Information Retrieval (cs.IR) - Abstract
LiveSketch is a novel algorithm for searching large image collections using hand-sketched queries. LiveSketch tackles the inherent ambiguity of sketch search by creating visual suggestions that augment the query as it is drawn, making query specification an iterative rather than one-shot process that helps disambiguate users' search intent. Our technical contributions are: a triplet convnet architecture that incorporates an RNN based variational autoencoder to search for images using vector (stroke-based) queries; real-time clustering to identify likely search intents (and so, targets within the search embedding); and the use of backpropagation from those targets to perturb the input stroke sequence, so suggesting alterations to the query in order to guide the search. We show improvements in accuracy and time-to-task over contemporary baselines using a 67M image corpus., Comment: Accepted to CVPR 2019
- Published
- 2019
- Full Text
- View/download PDF
166. Synthetically Supervised Feature Learning for Scene Text Recognition
- Author
-
Yang Liu, Hailin Jin, Zhaowen Wang, and Ian J. Wassell
- Subjects
Synthetic data ,Artificial neural network ,business.industry ,Computer science ,Deep learning ,Scene text recognition ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Feature learning ,Multi-task learning ,020207 software engineering ,02 engineering and technology ,Feature (computer vision) ,Distortion ,0202 electrical engineering, electronic engineering, information engineering ,Leverage (statistics) ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,business ,Neural networks - Abstract
We address the problem of image feature learning for scene text recognition. The image features in the state-of-the-art methods are learned from large-scale synthetic image datasets. However, most meth- ods only rely on outputs of the synthetic data generation process, namely realistically looking images, and completely ignore the rest of the process. We propose to leverage the parameters that lead to the output images to improve image feature learning. Specifically, for every image out of the data generation process, we obtain the associated parameters and render another “clean” image that is free of select distortion factors that are ap- plied to the output image. Because of the absence of distortion factors, the clean image tends to be easier to recognize than the original image which can serve as supervision. We design a multi-task network with an encoder-discriminator-generator architecture to guide the feature of the original image toward that of the clean image. The experiments show that our method significantly outperforms the state-of-the-art methods on standard scene text recognition benchmarks in the lexicon-free cate- gory. Furthermore, we show that without explicit handling, our method works on challenging cases where input images contain severe geometric distortion, such as text on a curved path.
- Published
- 2018
- Full Text
- View/download PDF
167. Contrast-enhanced harmonic endoscopic ultrasonography for the differential diagnosis of pancreatic masses: A systematic review and meta-analysis
- Author
-
Hailin Jin, Min Xu, Bo Qian, Shutang Han, Yeifei Zhang, Yang Li, and Dan Liao
- Subjects
Endoscopic ultrasound ,Cancer Research ,medicine.medical_specialty ,Receiver operating characteristic ,medicine.diagnostic_test ,business.industry ,Publication bias ,Articles ,medicine.disease ,Likelihood ratios in diagnostic testing ,03 medical and health sciences ,0302 clinical medicine ,Oncology ,030220 oncology & carcinogenesis ,Meta-analysis ,Pancreatic mass ,medicine ,Diagnostic odds ratio ,030211 gastroenterology & hepatology ,Radiology ,Differential diagnosis ,business - Abstract
Understanding the difference between malignant and benign pancreatic masses is critical in terms of diagnosis, although this is difficult to determine in clinical practice. The contrast-enhanced harmonic endoscopic ultrasound (CH-EUS) technique was introduced in 2010, although, to the best of the authors' knowledge, there has been no systematic review or meta-analysis to date evaluating its diagnostic performance for the differentiation of pancreatic masses. The aim of the present study was to systematically evaluate the diagnostic performance of CH-EUS for the differentiation of pancreatic masses. Search key words and inclusion and exclusion criteria were initially presented. Two independent authors read and extracted the relevant information from the included studies. Disagreements were resolved through discussion with another two experienced authors. Metadisc and Stata software were used for the meta-analysis and the evaluation of bias. A total of 16 studies comprising 1,325 patients were included in this meta-analysis. The pooled sensitivity, specificity, positive likelihood ratio, negative likelihood ratio and diagnostic odds ratio of CH-EUS were used to distinguish between malignant and benign tumors, and the values obtained were 93% [95% confidence interval (CI): 91-94%], 84% (95% CI: 80-87%), 5.58 (95% CI: 3.90-7.97), 0.09 (95% CI: 0.07-0.11) and 72.56 (95% CI: 48.93-107.60), respectively. The area under the summary receiver operating characteristic curve was determined to be 0.96. No publication bias was identified in this meta-analysis. Taken together, these results confirm that CH-EUS has a high accuracy rate for distinguishing between benign and malignant pancreatic space-occupying lesions, and it may therefore be used as an effective diagnostic tool for pancreatic masses.
- Published
- 2018
168. Deep Learning for Font Recognition and Retrieval
- Author
-
Thomas S. Huang, Jianchao Yang, Eli Shechtman, Jonathan Brandt, Aseem Agarwala, Zhangyang Wang, Zhaowen Wang, and Hailin Jin
- Subjects
Space (punctuation) ,business.industry ,Computer science ,Deep learning ,Graphic design ,computer.software_genre ,Identification (information) ,Typography ,Font ,Selection (linguistics) ,Preprocessor ,Artificial intelligence ,business ,computer ,Natural language processing - Abstract
Typography is a fundamental graphic design component and also a ubiquitous art form that affects our understanding and perception of what we read. Thousands of different font faces have been created with enormous variations in the characters. Graphic designers have the desire to identify the fonts they encounter in daily life for later use. While they might take a photo of the text of a particularly interesting font and seek out an expert to identify the font, the manual identification process is extremely tedious and error prone. Several websites allow users to search and recognize fonts by font similarity, including Identifont, MyFonts, WhatTheFont, and Fontspring. All of them rely on tedious humans interactions and high-quality manual preprocessing of images, and the accuracies are still unsatisfactory. On the other hand, the majority of font selection interfaces in existing software applications are simple linear lists, while exhaustively exploring the entire space of fonts using an alphabetical listing is unrealistic for most users.
- Published
- 2018
- Full Text
- View/download PDF
169. On the dual Orlicz mixed volumes
- Author
-
Gangsong Leng, Hailin Jin, and Shufeng Yuan
- Subjects
Mathematics::Functional Analysis ,Pure mathematics ,Mixed volume ,Applied Mathematics ,General Mathematics ,Mathematics::Classical Analysis and ODEs ,Calculus ,Mathematics::Metric Geometry ,Convex body ,Harmonic (mathematics) ,Star (graph theory) ,Dual (category theory) ,Mathematics - Abstract
In this paper, the authors define a harmonic Orlicz combination and a dual Orlicz mixed volume of star bodies, and then establish the dual Orlicz-Minkowski mixedvolume inequality and the dual Orlicz-Brunn-Minkowksi inequality.
- Published
- 2015
- Full Text
- View/download PDF
170. Asymmetry for convex body of revolution
- Author
-
Hailin Jin
- Subjects
Multidisciplinary ,media_common.quotation_subject ,Mathematical analysis ,Regular polygon ,Geometry ,Asymmetry ,Measure (mathematics) ,Physics::History of Physics ,Computer Science::Robotics ,Minkowski space ,Isosceles triangle ,Convex body ,Constant (mathematics) ,Body of revolution ,Mathematics ,media_common - Abstract
In this paper, we study the Minkowski measure of asymmetry for n-dimensional convex bodies of revolution ( n⩾3 ). We show that among all n-dimensional convex bodies of revolution, the bodies which generated by isosceles triangles are the most asymmetric ones. Also, we study the asymmetry for n-dimensional constant width bodies of revolution.
- Published
- 2015
- Full Text
- View/download PDF
171. Visual Font Pairing
- Author
-
Aaron Hertzmann, Zhaowen Wang, Yun Fu, Shuhui Jiang, and Hailin Jin
- Subjects
FOS: Computer and information sciences ,Similarity (geometry) ,business.industry ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Computer Science - Computer Vision and Pattern Recognition ,02 engineering and technology ,computer.software_genre ,Computer Science Applications ,Pairing ,Signal Processing ,Metric (mathematics) ,Font ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,Selection (linguistics) ,020201 artificial intelligence & image processing ,Artificial intelligence ,Electrical and Electronic Engineering ,business ,computer ,Natural language processing - Abstract
This paper introduces the problem of automatic font pairing . Font pairing is an important design task that is difficult for novices. Given a font selection for one part of a document (e.g., header), our goal is to recommend a font to be used in another part (e.g., body) such that the two fonts used together look visually pleasing. There are three main challenges in font pairing. First, this is a fine-grained problem, in which the subtle distinctions between fonts may be important. Second, rules and conventions of font pairing given by human experts are difficult to formalize. Third, font pairing is an asymmetric problem in that the roles played by header and body fonts are not interchangeable. To address these challenges, we propose automatic font pairing through learning visual relationships from large-scale human-generated font pairs. We introduce a new database for font pairing constructed from millions of PDF documents available on the Internet. We propose two font pairing algorithms: dual-space $k$ -NN and asymmetric similarity metric learning (ASML). These two methods automatically learn fine-grained relationships from large-scale data. We also investigate several baseline methods based on the rules from professional designers. Experiments and user studies demonstrate the effectiveness of our proposed dataset and methods.
- Published
- 2018
- Full Text
- View/download PDF
172. Orlicz geominimal surface areas
- Author
-
Hailin Jin, Shufeng Yuan, and Gangsong Leng
- Subjects
Mathematics Subject Classification ,Differential geometry ,Mixed volume ,Applied Mathematics ,General Mathematics ,Mathematical analysis ,Affine differential geometry ,Isoperimetric inequality ,Surface (topology) ,Mathematics - Abstract
In 1996, E. Lutwak extended the important concept of geominimal surface area to Lp version, which serves as a bridge connecting a number of areas of geometry: affine differential geometry, relative differential geometry, and Minkowskian geometry. In this paper, by using the concept of Orlicz mixed volume, we extend geominimal surface area to the Orlicz version and give some properties and an isoperimetric inequalities for the Orlicz geominimal surface areas. Mathematics subject classification (2010): 52A39, 52A40.
- Published
- 2015
- Full Text
- View/download PDF
173. Rethinking Skip-thought: A Neighborhood based Approach
- Author
-
Chen Fang, Shuai Tang, Hailin Jin, Virginia R. de Sa, and Zhaowen Wang
- Subjects
FOS: Computer and information sciences ,Computer Science - Computation and Language ,Computer science ,business.industry ,Computer Science - Artificial Intelligence ,Computer Science - Neural and Evolutionary Computing ,02 engineering and technology ,010501 environmental sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Autoencoder ,Paraphrase ,Artificial Intelligence (cs.AI) ,Semantic similarity ,Path (graph theory) ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,Neural and Evolutionary Computing (cs.NE) ,business ,computer ,Computation and Language (cs.CL) ,0105 earth and related environmental sciences - Abstract
We study the skip-thought model with neighborhood information as weak supervision. More specifically, we propose a skip-thought neighbor model to consider the adjacent sentences as a neighborhood. We train our skip-thought neighbor model on a large corpus with continuous sentences, and then evaluate the trained model on 7 tasks, which include semantic relatedness, paraphrase detection, and classification benchmarks. Both quantitative comparison and qualitative investigation are conducted. We empirically show that, our skip-thought neighbor model performs as well as the skip-thought model on evaluation tasks. In addition, we found that, incorporating an autoencoder path in our model didn’t aid our model to perform better, while it hurts the performance of the skip-thought model.
- Published
- 2017
174. A sharp Rogers–Shephard type inequality for Orlicz-difference body of planar convex bodies
- Author
-
Hailin Jin and Shufeng Yuan
- Subjects
Pure mathematics ,Planar ,General Mathematics ,Mathematical analysis ,Regular polygon ,Type inequality ,Mathematics - Abstract
In this paper, we prove a sharp Rogers–Shephard type inequality for the Orlicz-difference body of planar convex bodies, which extend the works of Bianchini and Colesanti (Proc. Amer. Math. Soc. 138(7) (2008) 2575–2582).
- Published
- 2014
- Full Text
- View/download PDF
175. The Orlicz Brunn–Minkowski inequality
- Author
-
Hailin Jin, Gangsong Leng, and Dongmeng Xi
- Subjects
Mathematics::Functional Analysis ,Pure mathematics ,Inequality ,Mixed volume ,General Mathematics ,media_common.quotation_subject ,Mathematical analysis ,Mathematics::Classical Analysis and ODEs ,Regular polygon ,Mathematics::General Topology ,Minkowski inequality ,General Relativity and Quantum Cosmology ,Minkowski space ,Mathematics::Metric Geometry ,media_common ,Mathematics - Abstract
The Orlicz Brunn–Minkowski theory originated with the work of Lutwak, Yang, and Zhang in 2010. In this paper, we first introduce the Orlicz addition of convex bodies containing the origin in their interiors, and then extend the L p Brunn–Minkowski inequality to the Orlicz Brunn–Minkowski inequality. Furthermore, we extend the L p Minkowski mixed volume inequality to the Orlicz mixed volume inequality by using the Orlicz Brunn–Minkowski inequality.
- Published
- 2014
- Full Text
- View/download PDF
176. Orlicz metrics for convex bodies
- Author
-
Gangsong Leng, Qi Guo, and HaiLin Jin
- Subjects
Convex analysis ,Mathematics::Functional Analysis ,General Mathematics ,Injective metric space ,Mathematical analysis ,Mathematics::Classical Analysis and ODEs ,Convex set ,Subderivative ,Equivalence of metrics ,Convex metric space ,Combinatorics ,Hausdorff distance ,Convex combination ,Mathematics - Abstract
The $$L_p$$ metrics, extensions of the Hausdorff metric for convex bodies, were investigated by Vitale in 1985. In this paper, we extend $$L_p$$ metrics to Orlicz metrics, and show that these Orlicz metrics generate the same topology as the Hausdorff one in the space of all convex bodies (i.e., non-empty compact convex subsets), consequently, the space of all convex bodies with the Orlicz metric is a complete, separable metric space. Furthermore, we also show that the space of all convex bodies with the Orlicz metric is a metric segment space.
- Published
- 2014
- Full Text
- View/download PDF
177. On the 1-measure of asymmetry for convex bodies of constant width
- Author
-
Hailin Jin
- Subjects
Physics ,Algebra and Number Theory ,media_common.quotation_subject ,Regular polygon ,Geometry ,Algebraic geometry ,Asymmetry ,Measure (mathematics) ,High Energy Physics::Experiment ,Geometry and Topology ,Algebra over a field ,Constant (mathematics) ,Surface of constant width ,media_common - Abstract
In this paper we study the $$1$$ -measure of asymmetry for convex bodies of constant width.
- Published
- 2013
- Full Text
- View/download PDF
178. Stability for the Minkowski measure of convex domains of constant width
- Author
-
HaiLin Jin, Qi Guo, and Gangsong Leng
- Subjects
Convex analysis ,Convex hull ,Minkowski's theorem ,Mathematical analysis ,Convex set ,Danskin's theorem ,Geometry and Topology ,Minkowski addition ,Surface of constant width ,Mathematics ,Curve of constant width - Abstract
In 1988, H. Groemer gave a stability theorem for the area of convex domains of constant width. In this paper, we obtain a stability theorem for the well-known Minkowski measure of asymmetry for convex domains of constant width.
- Published
- 2013
- Full Text
- View/download PDF
179. Multiple Instance Visual-Semantic Embedding
- Author
-
Chen Fang, Zhou Ren, Alan L. Yuille, Zhe Lin, and Hailin Jin
- Subjects
Contextual image classification ,Computer science ,business.industry ,Open problem ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pattern recognition ,02 engineering and technology ,010501 environmental sciences ,Space (commercial competition) ,01 natural sciences ,Image (mathematics) ,Automatic image annotation ,0202 electrical engineering, electronic engineering, information engineering ,Embedding ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,0105 earth and related environmental sciences - Abstract
Visual-semantic embedding models have been recently proposed and shown to be effective for image classification and zero-shot learning, by mapping images into a continuous semantic label space. Although several approaches have been proposed for single-label embedding tasks, handling images with multiple labels (which is a more general setting) still remains an open problem, mainly due to the complex underlying corresponding relationship between image and its labels. In this work, we present Multi-Instance visual-semantic Embedding model (MIE) for embedding images associated with either single or multiple labels. Our model discovers and maps semantically-meaningful image subregions to their corresponding labels. And we demonstrate the superiority of our method over the state-of-the-art on two tasks, including multi-label image annotation and zero-shot learning.
- Published
- 2017
- Full Text
- View/download PDF
180. KALMANSAC: Robust filtering by consensus
- Author
-
Stefano Soatto, Paolo Favaro, Hailin Jin, and Andrea Vedaldi
- Subjects
business.industry ,Sampling (statistics) ,Filter (signal processing) ,Machine learning ,computer.software_genre ,510 Mathematics ,Causal inference ,Motion estimation ,Outlier ,Statistical inference ,Structure from motion ,Artificial intelligence ,business ,computer ,Algorithm ,Combinatorial explosion ,000 Computer science, knowledge & systems ,Mathematics - Abstract
We propose an algorithm to perform causal inference of the state of a dynamical model when the measurements are corrupted by outliers. While the optimal (maximum-likelihood) solution has doubly exponential complexity due to the combinatorial explosion of possible choices of inliers, we exploit the structure of the problem to design a sampling-based algorithm that has constant complexity. We derive our algorithm from the equations of the optimal filter, which makes our approximation explicit. Our work is motivated by real-time tracking and the estimation of structure from motion (SFM). We test our algorithm for on-line outlier rejection both for tracking and for SFM. We show that our approach can tolerate a large proportion of outliers, whereas previous causal robust statistical inference methods failed with less than half as many. Our work can be thought of as the extension of random sample consensus algorithms to dynamic data, or as the implementation of pseudo-Bayesian filtering algorithms in a sampling framework. © 2005 IEEE.
- Published
- 2016
181. Interactive Inverse 3D Modeling
- Author
-
Carlo H. Séquin, Hailin Jin, and James Andrews
- Subjects
Engineering drawing ,Theoretical computer science ,business.industry ,Computer science ,Computational Mechanics ,Point cloud ,Inverse ,Parameterized complexity ,Type (model theory) ,3D modeling ,Computer Graphics and Computer-Aided Design ,Cursor (databases) ,Computational Mathematics ,Boundary representation ,Polygon mesh ,business - Abstract
“Interactive Inverse 3D Modeling” is a user-guided approach to shape construction and redesign that extracts well-structured, parameterized, procedural descriptions from unstructured, hierarchically flat input data, such as point clouds, boundary representation meshes, or even multiple pictorial views of a given inspirational prototype. This approach combines traditional “forward” 3D modeling tools with a system of user-guided extraction modules and optimization routines. With a few cursor strokes users can express their preferences of the type of modeling primitives to be used in a particular area of the given prototype to be approximated, and they can also select the degree of parameterization associated with each modeling routine. The results are then pliable, structured descriptions that are well suited to implement the particular design modifications intended by the user.
- Published
- 2012
- Full Text
- View/download PDF
182. On a measure of asymmetry for Reuleaux polygons
- Author
-
Qi Guo and HaiLin Jin
- Subjects
Reuleaux triangle ,Combinatorics ,media_common.quotation_subject ,Minkowski space ,Regular polygon ,Order (ring theory) ,Geometry and Topology ,Measure (mathematics) ,Asymmetry ,Mathematics ,media_common - Abstract
In a previous paper, we showed that for regular Reuleaux polygons Rn the equality \({{\rm as}_\infty(R_n) = 1/(2\cos \frac\pi{2n} -1)}\) holds, where \({{\rm as}_\infty(\cdot)}\) denotes the Minkowski measure of asymmetry for convex bodies, and \({{\rm as}_\infty(K)\leq \frac 12(\sqrt{3}+1)}\) for all convex domains K of constant width, with equality holds iff K is a Reuleaux triangle. In this paper, we investigate the Minkowski measures of asymmetry among all Reuleaux polygons of order n and show that regular Reuleaux polygons of order n (n ≥ 3 and odd) have the minimal Minkowski measure of asymmetry.
- Published
- 2011
- Full Text
- View/download PDF
183. A note on the extremal bodies of constant width for the Minkowski measure
- Author
-
Qi Guo and Hailin Jin
- Subjects
Combinatorics ,Simplex ,Differential geometry ,Hyperbolic geometry ,Mathematical analysis ,Minkowski space ,Regular polygon ,Geometry and Topology ,Algebraic geometry ,Constant (mathematics) ,Measure (mathematics) ,Mathematics - Abstract
In a previous paper, we showed that for all convex bodies K of constant width in \({\mathbb{R}^n, 1 \leq {\rm as}_\infty(K) \leq \frac{n+\sqrt{2n(n+1)}}{n+2}}\) , where as∞(·) denotes the Minkowski measure of asymmetry, with the equality holding on the right-hand side if K is a completion of a regular simplex, and asked whether or not the completions of regular simplices are the only bodies for the equality. A positive answer is given in this short note.
- Published
- 2012
- Full Text
- View/download PDF
184. Collaborative Feature Learning from Social Media
- Author
-
Zhe Lin, Chen Fang, Hailin Jin, and Jianchao Yang
- Subjects
FOS: Computer and information sciences ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Supervised learning ,Feature extraction ,Computer Science - Computer Vision and Pattern Recognition ,Semi-supervised learning ,Machine learning ,computer.software_genre ,Image (mathematics) ,Automatic image annotation ,Feature (computer vision) ,Feature (machine learning) ,Artificial intelligence ,Representation (mathematics) ,business ,Feature learning ,computer - Abstract
Image feature representation plays an essential role in image recognition and related tasks. The current state-of-the-art feature learning paradigm is supervised learning from labeled data. However, this paradigm requires large-scale category labels, which limits its applicability to domains where labels are hard to obtain. In this paper, we propose a new data-driven feature learning paradigm which does not rely on category labels. Instead, we learn from user behavior data collected on social media. Concretely, we use the image relationship discovered in the latent space from the user behavior data to guide the image feature learning. We collect a large-scale image and user behavior dataset from Behance.net. The dataset consists of 1.9 million images and over 300 million view records from 1.9 million users. We validate our feature learning paradigm on this dataset and find that the learned feature significantly outperforms the state-of-the-art image features in learning better image similarities. We also show that the learned feature performs competitively on various recognition benchmarks.
- Published
- 2015
185. DeepFont: Identify Your Font from An Image
- Author
-
Aseem Agarwala, Thomas S. Huang, Zhangyang Wang, Eli Shechtman, Jonathan Brandt, Hailin Jin, and Jianchao Yang
- Subjects
FOS: Computer and information sciences ,Identification (information) ,Computer science ,business.industry ,Computer Vision and Pattern Recognition (cs.CV) ,Font ,Computer Science - Computer Vision and Pattern Recognition ,Pattern recognition ,Computer vision ,Artificial intelligence ,Similarity measure ,business ,Convolutional neural network - Abstract
As font is one of the core design concepts, automatic font identification and similar font suggestion from an image or photo has been on the wish list of many designers. We study the Visual Font Recognition (VFR) problem, and advance the state-of-the-art remarkably by developing the DeepFont system. First of all, we build up the first available large-scale VFR dataset, named AdobeVFR, consisting of both labeled synthetic data and partially labeled real-world data. Next, to combat the domain mismatch between available training and testing data, we introduce a Convolutional Neural Network (CNN) decomposition approach, using a domain adaptation technique based on a Stacked Convolutional Auto-Encoder (SCAE) that exploits a large corpus of unlabeled real-world text images combined with synthetic data preprocessed in a specific way. Moreover, we study a novel learning-based model compression approach, in order to reduce the DeepFont model size without sacrificing its performance. The DeepFont system achieves an accuracy of higher than 80% (top-5) on our collected dataset, and also produces a good font similarity measure for font selection and suggestion. We also achieve around 6 times compression of the model without any visible loss of recognition accuracy., Comment: To Appear in ACM Multimedia as a full paper
- Published
- 2015
- Full Text
- View/download PDF
186. THE MEAN MINKOWSKI MEASURES FOR CONVEX BODIES OF CONSTANT WIDTH
- Author
-
Qi Guo and HaiLin Jin
- Subjects
convex body of constant width ,measure of asymmetry ,Mixed volume ,General Mathematics ,Minkowski's theorem ,Mathematical analysis ,52A20 ,Support function ,Minkowski addition ,52A39 ,Meissner's bodies ,Minkowski space ,reuleaux triangle ,Mathematics::Metric Geometry ,completion ,Constant (mathematics) ,mean Minkowski measure ,Surface of constant width ,Mathematics ,Curve of constant width - Abstract
In this paper, we study the so-called mean Minkowski measures, proposed and studied by Toth in a series of papers, for convex bodies of constant width. We show that, with respect to the mean Minkowski measure, the completions of regular simplices are, as well as for many other measures, the most asymmetric ones among all convex bodies of constant width.
- Published
- 2014
- Full Text
- View/download PDF
187. Enabling new LED designs through advanced cooling technology
- Author
-
Brandon Noska, Che Cheung, Hailin Jin, and Raghav Mahalingam
- Subjects
Incandescent light bulb ,Natural convection ,business.industry ,Computer science ,Heat sink ,Cooling capacity ,law.invention ,Forced convection ,Light quality ,Solid-state lighting ,law ,Electronic engineering ,Process engineering ,business ,Light-emitting diode - Abstract
Light Emitting Diodes (LEDs) are increasingly being designed into lighting products for general illumination and are now gaining traction in the market. Several LED products are already available, but have generally been accepted only for low light level applications because of the low lumen output and poor quality of light for many of the products. Much of the reason for the low lumen output is the limited cooling capacity of natural convection which is most commonly used. Unlike incandescent, halogen based, or other traditional lighting technologies, LEDs need proper thermal management to increase the lumen output, maintain quality of light and ensure high reliability and useful life. This paper investigates synthetic jet technology as a viable option for forced convection cooling as an alternative for natural convection in LED products to enable higher lumen designs without compromising form factor, quality of light and reliability.
- Published
- 2010
- Full Text
- View/download PDF
188. Mumford-Shah for Segmentation and Stereo
- Author
-
Alan S. Willsky, Anthony Yezzi, Andy Tsai, Hailin Jin, and Stefano Soatto
- Subjects
Active contour model ,business.industry ,Computer science ,Scale-space segmentation ,Image segmentation ,Ambient space ,Computer Science::Computer Vision and Pattern Recognition ,Piecewise ,Segmentation ,Computer vision ,Artificial intelligence ,business ,Computer stereo vision ,Energy functional - Abstract
We begin by presenting an active contour model which utilizes the Mumford-Shah energy functional for the purpose of piecewise smooth image segmentation. We then show how the use of simultaneous piecewise smooth image segmentation on a set of calibrated 2D images of a common 3D scene may be utilized for reconstructing the unknown shapes and radiances of scene objects. To do so, we must lift the the support of the unknown smooth functions in the traditional Mumford-Shah framework, which will now represent the unknown radiance of scene objects in this application, onto a manifold which will represent the unknown shape of scene objects. This constitutes a significant mathematical departure from the traditional Mumford-Shah model since the unknown functions now live directly on the unknown geometric surfaces rather than their surrounding ambient space. The intution, however, follows from the original model in that the estimates must closely match the observed data while maintaining a high degree of smoothness both in the estimated functions as well as the estimated geometry.
- Published
- 2006
- Full Text
- View/download PDF
189. The Impacts on Spinach Growth and Yield by Biological Organic Fertilizer.
- Author
-
Hongdou LIU, Hailin JIN, Nan LI, Xinhe LIU, Xue LI, Fanteng CONG, Renzhe PIAO, and Hongyan ZHAO
- Abstract
To decrease fertilization amount of chemical fertilizer and improve the quality of vegetable crops, spinach was taken as the test material, and the impact of different fertilizer on spinach growth and yield was studied via the manners of biological organic fertilizer and organic fertilizer + chemical fertilizer. Experimental results showed that in the formula of organic fertilizer + chemical fertilizer, chlorophyll and nitrogen contents in spinach leaves obviously increased; in the formula of only adding organic fertilizer, spinach leaf temperature, leaf width, root length, plant height and fresh weight were all better than those in the formula of organic fertilizer + chemical fertilizer, and better formulas were A5, E5, F3 and I5, in which spinach plant height in E5 was 5.63 times higher than G5, root length in E5 was 2.67 times higher than G5, and fresh weight in G5 was 32.6 times higher than G5. By comprehensive anaysis, the most suitable formula for spinach production was E5, and the research could provide theoretic basis for fertilization amount of organic fertilizer required by spinach growth and development. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
190. Real-time feature tracking and outlier rejection with changes in illumination
- Author
-
Stefano Soatto, Hailin Jin, and Paolo Favaro
- Subjects
Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Photometry (optics) ,510 Mathematics ,Personal computer ,Outlier ,Feature tracking ,Computer vision ,Artificial intelligence ,Affine transformation ,business ,000 Computer science, knowledge & systems ,ComputingMethodologies_COMPUTERGRAPHICS ,Statistical hypothesis testing - Abstract
We develop an efficient algorithm to track point features supported by image patches undergoing affine deformations and changes in illumination. The algorithm is based on a combined model of geometry and photometry, that is used to track features as well as to detect outliers in a hypothesis testing framework. The algorithm runs in real time on a personal computer; and is available to the public.
- Published
- 2002
- Full Text
- View/download PDF
191. Real-time virtual object insertion
- Author
-
Stefano Soatto, Hailin Jin, and Paolo Favaro
- Subjects
Computer science ,business.industry ,Orientation (computer vision) ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Pentium ,Real image ,Software ,510 Mathematics ,Virtual image ,Frame grabber ,Motion estimation ,Computer graphics (images) ,Computer vision ,Artificial intelligence ,MATLAB ,business ,computer ,000 Computer science, knowledge & systems ,computer.programming_language - Abstract
We present a system to insert virtual objects into real image sequences in real time. The system consists of offthe- shelf hardware (a camera connected to a Pentium PC) and software to (a) automatically select and track region features despite changes in illumination, (b) estimate threedimensional position and orientation of surface patches relative to an inertial reference frame despite individual pointfeatures appearing and disappearing, (c) insert a texturemapped virtual object into the scene so as to make it appear to be part of the scene and moving with it. This is all done in real time. The multi-thread C++ code, which is readily interfaced with a frame grabber as well as Matlab for development, will be made available to the public at the demonstration.
- Published
- 2001
- Full Text
- View/download PDF
192. Multiple view image denoising.
- Author
-
Li Zhang, Vaddadi, S., Hailin Jin, and Nayar, S.K.
- Published
- 2009
- Full Text
- View/download PDF
193. Stereo matching with nonparametric smoothness priors in feature space.
- Author
-
Smith, B.M., Li Zhang, and Hailin Jin
- Published
- 2009
- Full Text
- View/download PDF
194. GroupSAC: Efficient consensus in the presence of groupings.
- Author
-
Kai Ni, Hailin Jin, and Dellaert, F.
- Published
- 2009
- Full Text
- View/download PDF
195. Stereoscopic inpainting: Joint color and depth completion from stereo images.
- Author
-
Liang Wang, Hailin Jin, Ruigang Yang, and Minglun Gong
- Published
- 2008
- Full Text
- View/download PDF
196. A semi-direct approach to structure from motion.
- Author
-
Favaro, P., Hailin Jin, and Soatto, S.
- Published
- 2001
- Full Text
- View/download PDF
197. Automatic Scene Inference for 3D Object Compositing.
- Author
-
KARSCH, KEVIN, SUNKAVALLI, KALYAN, HADAP, SUNIL, CARR, NATHAN, HAILIN JIN, FONTE, RAFAEL, SITTIG, MICHAEL, and FORSYTH, DAVID
- Subjects
DIGITAL image editing ,THREE-dimensional imaging ,DEPTH of field ,ALGORITHMS ,GEOMETRY - Abstract
We present a user-friendly image editing system that supports a drag-and-drop object insertion (where the user merely drags objects into the image, and the system automatically places them in 3D and relights them appropriately), postprocess illumination editing, and depth-of-field manipulation. Underlying our system is a fully automatic technique for recovering a comprehensive 3D scene model (geometry, illumination, diffuse albedo, and camera parameters) from a single, low dynamic range photograph. This is made possible by two novel contributions: an illumination inference algorithm that recovers a full lighting model of the scene (including light sources that are not directly visible in the photograph), and a depth estimation algorithm that combines data-driven depth transfer with geometric reasoning about the scene layout. A user study shows that our system produces perceptually convincing results, and achieves the same level of realism as techniques that require significant user interaction. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
198. Mumford-Shah on the Move: Region-Based Segmentation on Deforming Manifolds with Application to 3-D Reconstruction of Shape and Appearance from Multi-View Images.
- Author
-
Hailin Jin, Anthony Yezzi, and Stefano Soatto
- Abstract
Abstract We address the problem of estimating the shape and appearance of a scene made of smooth Lambertian surfaces with piecewise smooth albedo. We allow the scene to have self-occlusions and multiple connected components. This class of surfaces is often used as an approximation of scenes populated by man-made objects. We assume we are given a number of images taken from different vantage points. Mathematically this problem can be posed as an extension of Mumford and Shah’s approach to static image segmentation to the segmentation of a function defined on a deforming surface. We propose an iterative procedure to minimize a global cost functional that combines geometric priors on both the shape of the scene and the boundary between smooth albedo regions. We carry out the numerical implementation in the level set framework. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
199. KALMANSAC: robust filtering by consensus.
- Author
-
Vedaldi, A., Hailin Jin, Favaro, P., and Soatto, S.
- Published
- 2005
- Full Text
- View/download PDF
200. Tales of shape and radiance in multiview stereo.
- Author
-
Soatto, S., Yezzi, A.J., and Hailin Jin
- Published
- 2003
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.