Author: "Kaiyue Pang" / Topic: 0202 electrical engineering, electronic engineering, information engineering - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kaiyue Pang"' showing total 10 results

Start Over Author "Kaiyue Pang" Topic 0202 electrical engineering, electronic engineering, information engineering

10 results on '"Kaiyue Pang"'

1. Solving Mixed-Modal Jigsaw Puzzle for Fine-Grained Sketch-Based Image Retrieval

Author: Yongxin Yang, Kaiyue Pang, Timothy M. Hospedales, Yi-Zhe Song, and Tao Xiang
Subjects: business.industry, Computer science, Feature extraction, Inference, 02 engineering and technology, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Sketch, Jigsaw, Modal, 0202 electrical engineering, electronic engineering, information engineering, Task analysis, 020201 artificial intelligence & image processing, Artificial intelligence, business, Image retrieval, computer, 0105 earth and related environmental sciences
Abstract: ImageNet pre-training has long been considered crucial by the fine-grained sketch-based image retrieval (FG-SBIR) community due to the lack of large sketch-photo paired datasets for FG-SBIR training. In this paper, we propose a self-supervised alternative for representation pre-training. Specifically, we consider the jigsaw puzzle game of recomposing images from shuffled parts. We identify two key facets of jigsaw task design that are required for effective FG-SBIR pre-training. The first is formulating the puzzle in a mixed-modality fashion. Second we show that framing the optimisation as permutation matrix inference via Sinkhorn iterations is more effective than the common classifier formulation of Jigsaw self-supervision. Experiments show that this self-supervised pre-training strategy significantly outperforms the standard ImageNet-based pipeline across all four product-level FG-SBIR benchmarks. Interestingly it also leads to improved cross-category generalisation across both pre-train/fine-tune and fine-tune/testing stages.
Published: 2020
Full Text: View/download PDF

2. Generalising Fine-Grained Sketch-Based Image Retrieval

Author: Honggang Zhang, Tao Xiang, Ke Li, Timothy M. Hospedales, Kaiyue Pang, Yi-Zhe Song, and Yongxin Yang
Subjects: Matching (graph theory), business.industry, Computer science, Deep learning, 02 engineering and technology, 010501 environmental sciences, computer.software_genre, 01 natural sciences, Manifold, Sketch, Categorization, 0202 electrical engineering, electronic engineering, information engineering, Embedding, Unsupervised learning, 020201 artificial intelligence & image processing, Artificial intelligence, Representation (mathematics), business, computer, Image retrieval, Natural language processing, 0105 earth and related environmental sciences
Abstract: Fine-grained sketch-based image retrieval (FG-SBIR) addresses matching specific photo instance using free-handsketch as a query modality. Existing models aim to learnan embedding space in which sketch and photo can be directly compared. While successful, they require instance-level pairing within each coarse-grained category as annotated training data. Since the learned embedding space is domain-specific, these models do not generalise well across categories. This limits the practical applicability of FGSBIR. In this paper, we identify cross-category generalisation for FG-SBIR as a domain generalisation problem, and propose the first solution. Our key contribution is a novel unsupervised learning approach to model a universal manifold of prototypical visual sketch traits. This manifold can then be used to paramaterise the learning of a sketch/photo representation. Model adaptation to novel categories then becomes automatic via embedding the novel sketch in the manifold and updating the representation and retrieval function accordingly. Experiments on the two largest FG-SBIR datasets, Sketchy and QMUL-Shoe-V2, demonstrate the efficacy of our approach in enabling crosscategory generalisation of FG-SBIR.
Published: 2019
Full Text: View/download PDF

3. Towards Deep Universal Sketch Perceptual Grouper

Author: Ke Li, Yi-Zhe Song, Kaiyue Pang, Honggang Zhang, Tao Xiang, and Timothy M. Hospedales
Subjects: Computer science, media_common.quotation_subject, 02 engineering and technology, Semantics, computer.software_genre, Deep grouping model, Universal grouper, Discriminative model, Perception, 0202 electrical engineering, electronic engineering, information engineering, Training, Segmentation, Analytical models, Image retrieval, media_common, Visualization, Image segmentation, business.industry, Data models, Sketch Perceptual Grouping, Object (computer science), Computer Graphics and Computer-Aided Design, Sketch, Task analysis, Gestalt psychology, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Software, Natural language processing, Dataset
Abstract: Human free-hand sketches provide useful data for studying human perceptual grouping, where the grouping principles such as the Gestalt laws of grouping are naturally in play during both the perception and sketching stages. In this work, we make the first attempt to develop a universal sketch perceptual grouper. That is, a grouper that can be applied to sketches of any category created with any drawing style and ability, to group constituent strokes/segments into semantically meaningful object parts. The first obstacle to achieving this goal is the lack of largescale datasets with grouping annotation. To overcome this, we contribute the largest sketch perceptual grouping (SPG) dataset to date, consisting of 20; 000 unique sketches evenly distributed over 25 object categories. Furthermore, we propose a novel deep perceptual grouping model learned with both generative and discriminative losses. The generative loss improves the generalisation ability of the model, while the discriminative loss guarantees both local and global grouping consistency. Extensive experiments demonstrate that the proposed grouper significantly outperforms the state-of-the-art competitors. Additionally, we show that our grouper is useful for a number of sketch analysis tasks including sketch semantic segmentation, synthesis and finegrained sketch-based image retrieval (FG-SBIR).
Published: 2019
Full Text: View/download PDF

4. Learning to Sketch with Shortcut Cycle Consistency

Author: Kaiyue Pang, Tao Xiang, Jifei Song, Yi-Zhe Song, and Timothy M. Hospedales
Subjects: FOS: Computer and information sciences, business.industry, Machine vision, Computer science, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, 010501 environmental sciences, Object (computer science), 01 natural sciences, Sketch, Visualization, Consistency (database systems), 0202 electrical engineering, electronic engineering, information engineering, Unsupervised learning, 020201 artificial intelligence & image processing, Artificial intelligence, business, Image retrieval, Encoder, 0105 earth and related environmental sciences, Abstraction (linguistics)
Abstract: To see is to sketch -- free-hand sketching naturally builds ties between human and machine vision. In this paper, we present a novel approach for translating an object photo to a sketch, mimicking the human sketching process. This is an extremely challenging task because the photo and sketch domains differ significantly. Furthermore, human sketches exhibit various levels of sophistication and abstraction even when depicting the same object instance in a reference photo. This means that even if photo-sketch pairs are available, they only provide weak supervision signal to learn a translation model. Compared with existing supervised approaches that solve the problem of D(E(photo)) -> sketch, where E($\cdot$) and D($\cdot$) denote encoder and decoder respectively, we take advantage of the inverse problem (e.g., D(E(sketch)) -> photo), and combine with the unsupervised learning tasks of within-domain reconstruction, all within a multi-task learning framework. Compared with existing unsupervised approaches based on cycle consistency (i.e., D(E(D(E(photo)))) -> photo), we introduce a shortcut consistency enforced at the encoder bottleneck (e.g., D(E(photo)) -> photo) to exploit the additional self-supervision. Both qualitative and quantitative results show that the proposed model is superior to a number of state-of-the-art alternatives. We also show that the synthetic sketches can be used to train a better fine-grained sketch-based image retrieval (FG-SBIR) model, effectively alleviating the problem of sketch data scarcity., To appear in CVPR2018
Published: 2018
Full Text: View/download PDF

5. SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

Author: Tongtong Yuan, Kaiyue Pang, Yongye Huang, Tao Xiang, Zhanyu Ma, Timothy M. Hospedales, Yi-Zhe Song, Jun Guo, and Peng Xu
Subjects: FOS: Computer and information sciences, Computer science, Sketch recognition, business.industry, Computer Vision and Pattern Recognition (cs.CV), Hash function, Computer Science - Computer Vision and Pattern Recognition, 02 engineering and technology, 010501 environmental sciences, Machine learning, computer.software_genre, 01 natural sciences, Sketch, Visualization, 0202 electrical engineering, electronic engineering, information engineering, Task analysis, Embedding, 020201 artificial intelligence & image processing, Binary code, Artificial intelligence, business, computer, Feature learning, 0105 earth and related environmental sciences
Abstract: We propose a deep hashing framework for sketch retrieval that, for the first time, works on a multi-million scale human sketch dataset. Leveraging on this large dataset, we explore a few sketch-specific traits that were otherwise under-studied in prior literature. Instead of following the conventional sketch recognition task, we introduce the novel problem of sketch hashing retrieval which is not only more challenging, but also offers a better testbed for large-scale sketch analysis, since: (i) more fine-grained sketch feature learning is required to accommodate the large variations in style and abstraction, and (ii) a compact binary code needs to be learned at the same time to enable efficient retrieval. Key to our network design is the embedding of unique characteristics of human sketch, where (i) a two-branch CNN-RNN architecture is adapted to explore the temporal ordering of strokes, and (ii) a novel hashing loss is specifically designed to accommodate both the temporal and abstract traits of sketches. By working with a 3.8M sketch dataset, we show that state-of-the-art hashing models specifically engineered for static images fail to perform well on temporal sketch data. Our network on the other hand not only offers the best retrieval performance on various code sizes, but also yields the best generalization performance under a zero-shot setting and when re-purposed for sketch recognition. Such superior performances effectively demonstrate the benefit of our sketch-specific design., Accepted by CVPR2018
Published: 2018
Full Text: View/download PDF

6. Universal Sketch Perceptual Grouping

Author: Jifei Song, Timothy M. Hospedales, Kaiyue Pang, Ke Li, Honggang Zhang, Yi-Zhe Song, and Tao Xiang
Subjects: Computer science, business.industry, 020207 software engineering, 02 engineering and technology, Object (computer science), computer.software_genre, Sketch, Domain (software engineering), Annotation, Consistency (database systems), Discriminative model, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, Image retrieval, computer, Natural language processing, Generative grammar
Abstract: In this work we aim to develop a universal sketch grouper. That is, a grouper that can be applied to sketches of any category in any domain to group constituent strokes/segments into semantically meaningful object parts. The first obstacle to this goal is the lack of large-scale datasets with grouping annotation. To overcome this, we contribute the largest sketch perceptual grouping (SPG) dataset to date, consisting of 20, 000 unique sketches evenly distributed over 25 object categories. Furthermore, we propose a novel deep universal perceptual grouping model. The model is learned with both generative and discriminative losses. The generative losses improve the generalisation ability of the model to unseen object categories and datasets. The discriminative losses include a local grouping loss and a novel global grouping loss to enforce global grouping consistency. We show that the proposed model significantly outperforms the state-of-the-art groupers. Further, we show that our grouper is useful for a number of sketch analysis tasks including sketch synthesis and fine-grained sketch-based image retrieval (FG-SBIR).
Published: 2018
Full Text: View/download PDF

7. Deep Factorised Inverse-Sketching

Author: Da Li, Kaiyue Pang, Yi-Zhe Song, Jifei Song, Tao Xiang, and Timothy M. Hospedales
Subjects: Computer science, business.industry, Inverse, 020207 software engineering, 02 engineering and technology, 010501 environmental sciences, 01 natural sciences, Sketch, Rendering (computer graphics), Discriminative model, Salient, 0202 electrical engineering, electronic engineering, information engineering, Embedding, Computer vision, Artificial intelligence, business, Image retrieval, 0105 earth and related environmental sciences
Abstract: Modelling human free-hand sketches has become topical recently, driven by practical applications such as fine-grained sketch based image retrieval (FG-SBIR). Sketches are clearly related to photo edge-maps, but a human free-hand sketch of a photo is not simply a clean rendering of that photo’s edge map. Instead there is a fundamental process of abstraction and iconic rendering, where overall geometry is warped and salient details are selectively included. In this paper we study this sketching process and attempt to invert it. We model this inversion by translating iconic free-hand sketches to contours that resemble more geometrically realistic projections of object boundaries, and separately factorise out the salient added details. This factorised re-representation makes it easier to match a free-hand sketch to a photo instance of an object. Specifically, we propose a novel unsupervised image style transfer model based on enforcing a cyclic embedding consistency constraint. A deep FG-SBIR model is then formulated to accommodate complementary discriminative detail from each factorised sketch for better matching with the corresponding photo. Our method is evaluated both qualitatively and quantitatively to demonstrate its superiority over a number of state-of-the-art alternatives for style transfer and FG-SBIR.
Published: 2018
Full Text: View/download PDF

8. Synergistic Instance-Level Subspace Alignment for Fine-Grained Sketch-Based Image Retrieval

Author: Ke Li, Kaiyue Pang, Yi-Zhe Song, Honggang Zhang, Tao Xiang, and Timothy M. Hospedales
Subjects: Computer science, Feature extraction, Subspace alignment, 02 engineering and technology, Machine learning, computer.software_genre, Semantics, Footwear, Domain (software engineering), Cross-modal, 0202 electrical engineering, electronic engineering, information engineering, Visual Word, Image retrieval, Visualization, Structure (mathematical logic), Information retrieval, business.industry, 020206 networking & telecommunications, Instance-level, Computer Graphics and Computer-Aided Design, Bridges, Sketch, Sketch-based Image Retrieval, Fine-grained, 020201 artificial intelligence & image processing, Deformable models, Artificial intelligence, business, computer, Software, Dataset
Abstract: We study the problem of fine-grained sketch-based image retrieval. By performing instance-level (rather than category-level) retrieval, it embodies a timely and practical application, particularly with the ubiquitous availability of touchscreens. Three factors contribute to the challenging nature of the problem: 1) free-hand sketches are inherently abstract and iconic, making visual comparisons with photos difficult; 2) sketches and photos are in two different visual domains, i.e., black and white lines versus color pixels; and 3) fine-grained distinctions are especially challenging when executed across domain and abstraction-level. To address these challenges, we propose to bridge the image-sketch gap both at the high level via parts and attributes, as well as at the low level via introducing a new domain alignment method. More specifically, first, we contribute a data set with 304 photos and 912 sketches, where each sketch and image is annotated with its semantic parts and associated part-level attributes. With the help of this data set, second, we investigate how strongly supervised deformable part-based models can be learned that subsequently enable automatic detection of part-level attributes, and provide pose-aligned sketch-image comparisons. To reduce the sketch-image gap when comparing low-level features, third, we also propose a novel method for instance-level domain-alignment that exploits both subspace and instance-level cues to better align the domains. Finally, fourth, these are combined in a matching framework integrating aligned low-level features, mid-level geometric structure, and high-level semantic attributes. Extensive experiments conducted on our new data set demonstrate effectiveness of the proposed method.
Published: 2017
Full Text: View/download PDF

9. Cross-domain Generative Learning for Fine-Grained Sketch-Based Image Retrieval

Author: Yi-Zhe Song, Tony Xiang, Kaiyue Pang, and Timothy M. Hospedales
Subjects: business.industry, Computer science, 020206 networking & telecommunications, 02 engineering and technology, computer.software_genre, Sketch, Domain (software engineering), Generative model, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Artificial intelligence, business, computer, Image retrieval, Natural language processing
Abstract: The key challenge for learning a fine-grained sketch-based image retrieval (FG-SBIR) model is to bridge the domain gap between photo and sketch. Existing models learn a deep joint embedding space with discriminative losses where a photo and a sketch can be compared. In this paper, we propose a novel discriminative-generative hybrid model by introducing a generative task of cross-domain image synthesis. This task enforces the learned embedding space to preserve all the domain invariant information that is useful for cross-domain reconstruction, thus explicitly reducing the domain gap as opposed to existing models. Extensive experiments on the largest FG-SBIR dataset Sketchy [19] show that the proposed model significantly outperforms state-of-the-art discriminative FG-SBIR models.
Published: 2017
Full Text: View/download PDF

10. Fine-grained sketch-based image retrieval: The role of part-aware attributes

Author: Yichuan Hu, Honggang Zhang, Kaiyue Pang, Ke Li, Yi-Zhe Song, and Timothy M. Hospedales
Subjects: Structure (mathematical logic), Information retrieval, Pixel, Computer science, Feature extraction, 020207 software engineering, 02 engineering and technology, Semantics, Sketch, Visualization, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Visual Word, Image retrieval
Abstract: We study the problem of fine-grained sketch-based image retrieval. By performing instance-level (rather than category-level) retrieval, it embodies a timely and practical application, particularly with the ubiquitous availability of touchscreens. Three factors contribute to the challenging nature of the problem: (i) free-hand sketches are inherently abstract and iconic, making visual comparisons with photos more difficult, (ii) sketches and photos are in two different visual domains, i.e. black and white lines vs. color pixels, and (iii) fine-grained distinctions are especially challenging when executed across domain and abstraction-level. To address this, we propose to detect visual attributes at part-level, in order to build a new representation that not only captures fine-grained characteristics but also traverses across visual domains. More specifically, (i) we propose a dataset with 304 photos and 912 sketches, where each sketch and photo is annotated with its semantic parts and associated part-level attributes, and with the help of this dataset, we investigate (ii) how strongly-supervised deformable part-based models can be learned that subsequently enable automatic detection of part-level attributes, and (iii) a novel matching framework that synergistically integrates low-level features, mid-level geometric structure and high-level semantic attributes to boost retrieval performance. Extensive experiments conducted on our new dataset demonstrate value of the proposed method.
Published: 2016
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Kaiyue Pang"'

1. Solving Mixed-Modal Jigsaw Puzzle for Fine-Grained Sketch-Based Image Retrieval

2. Generalising Fine-Grained Sketch-Based Image Retrieval

3. Towards Deep Universal Sketch Perceptual Grouper

4. Learning to Sketch with Shortcut Cycle Consistency

5. SketchMate: Deep Hashing for Million-Scale Human Sketch Retrieval

6. Universal Sketch Perceptual Grouping

7. Deep Factorised Inverse-Sketching

8. Synergistic Instance-Level Subspace Alignment for Fine-Grained Sketch-Based Image Retrieval

9. Cross-domain Generative Learning for Fine-Grained Sketch-Based Image Retrieval

10. Fine-grained sketch-based image retrieval: The role of part-aware attributes

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

10 results on '"Kaiyue Pang"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources