Author: "Miyazaki, Tomo" / Journal: sensors (14248220) - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Miyazaki, Tomo"' showing total 3 results

Start Over Author "Miyazaki, Tomo" Journal sensors (14248220)

3 results on '"Miyazaki, Tomo"'

1. TAMC: Textual Alignment and Masked Consistency for Open-Vocabulary 3D Scene Understanding.

Author: Wang, Juan, Wang, Zhijie, Miyazaki, Tomo, Fan, Yaohou, and Omachi, Shinichiro
Subjects: AUTOMATIC summarization, GEOGRAPHICAL perception, POINT cloud, FEATURE extraction, VIRTUAL reality
Abstract: Three-dimensional (3D) Scene Understanding achieves environmental perception by extracting and analyzing point cloud data with wide applications including virtual reality, robotics, etc. Previous methods align the 2D image feature from a pre-trained CLIP model and the 3D point cloud feature for the open vocabulary scene understanding ability. We believe that existing methods have the following two deficiencies: (1) the 3D feature extraction process ignores the challenges of real scenarios, i.e., point cloud data are very sparse and even incomplete; (2) the training stage lacks direct text supervision, leading to inconsistency with the inference stage. To address the first issue, we employ a Masked Consistency training policy. Specifically, during the alignment of 3D and 2D features, we mask some 3D features to force the model to understand the entire scene using only partial 3D features. For the second issue, we generate pseudo-text labels and align them with the 3D features during the training process. In particular, we first generate a description for each 2D image belonging to the same 3D scene and then use a summarization model to fuse these descriptions into a single description of the scene. Subsequently, we align 2D-3D features and 3D-text features simultaneously during training. Massive experiments demonstrate the effectiveness of our method, outperforming state-of-the-art approaches. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

2. JPEG Image Enhancement with Pre-Processing of Color Reduction and Smoothing.

Author: Shoda, Akane, Miyazaki, Tomo, and Omachi, Shinichiro
Subjects: *DEEP learning, *JPEG (Image coding standard), *IMAGE intensifiers, *DECODING algorithms, *COMPUTATIONAL complexity, *SIGNAL processing
Abstract: JPEG is the international standard for still image encoding and is the most widely used compression algorithm because of its simple encoding process and low computational complexity. Recently, many methods have been developed to improve the quality of JPEG images by using deep learning. However, these methods require the use of high-performance devices since they need to perform neural network computation for decoding images. In this paper, we propose a method to generate high-quality images using deep learning without changing the decoding algorithm. The key idea is to reduce and smooth colors and gradient regions in the original images before JPEG compression. The reduction and smoothing can suppress red block noise and pseudo-contour in the compressed images. Furthermore, high-performance devices are unnecessary for decoding. The proposed method consists of two components: a color transformation network using deep learning and a pseudo-contour suppression model using signal processing. The experimental results showed that the proposed method outperforms standard JPEG in quality measurements correlated with human perception. [ABSTRACT FROM AUTHOR]
Published: 2023
Full Text: View/download PDF

3. Text Detection Using Multi-Stage Region Proposal Network Sensitive to Text Scale †.

Author: Nagaoka, Yoshito, Miyazaki, Tomo, Sugaya, Yoshihiro, Omachi, Shinichiro, and Kyamakya, Kyandoghere
Subjects: *CONVOLUTIONAL neural networks, *INTELLIGENT sensors, *PROBLEM solving
Abstract: Recently, attention has surged concerning intelligent sensors using text detection. However, there are challenges in detecting small texts. To solve this problem, we propose a novel text detection CNN (convolutional neural network) architecture sensitive to text scale. We extract multi-resolution feature maps in multi-stage convolution layers that have been employed to prevent losing information and maintain the feature size. In addition, we developed the CNN considering the receptive field size to generate proposal stages. The experimental results show the importance of the receptive field size. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Miyazaki, Tomo"'

1. TAMC: Textual Alignment and Masked Consistency for Open-Vocabulary 3D Scene Understanding.

2. JPEG Image Enhancement with Pre-Processing of Color Reduction and Smoothing.

3. Text Detection Using Multi-Stage Region Proposal Network Sensitive to Text Scale †.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

3 results on '"Miyazaki, Tomo"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources