29 results on '"Image gradient"'
Search Results
2. A Method to Extract Image Features and Lineaments Based on a Multi-hillshade Continuous Wavelet Transform.
- Author
-
Song, Man Hyok, Ho, Jin Gyong, Kim, Chol, Chol, Yong O., and Lyu, Song
- Subjects
- *
DIGITAL elevation models , *LIGHTING , *VALLEYS - Abstract
This paper presents a new method for extracting the image features and lineaments related to local extrema of an image or a digital elevation model (DEM) such as ridges and valleys based on the continuous wavelet transform (CWT) of a set of variously illuminated hillshades. The method originates from the principle that a hillshade can exactly reflect the lineaments nearly perpendicular to the illumination direction of the hillshade, but not other ones. The method consists of four steps: (1) preparation of a set of differently illuminated hillshades of the input data, (2) detection of directional edges nearly perpendicular to the illumination direction from each hillshade based on the CWT, (3) a combination of multidirectional edges into an omnidirectional feature image, and (4) identification of lineaments through linkage and linearization of image feature lines. CWT coefficients of each hillshade are used to calculate the gradient and its direction of the hillshade. For each hillshade, directional edge pixels where the gradient direction is parallel to the illumination direction are selectively detected to form accurate and solitary image feature lines related to local extrema of the input data. Directional edges of each hillshade are easily classified into positive and negative edges using the hillshade gradient. As they have similar directions, they are easily linked to form longer line raster objects, which are converted into vector objects, that is, directional lineaments. The multidirectional edges and lineaments given from all the hillshades are combined to form an omnidirectional feature image and a group of omnidirectional lineaments. Its application to real DEMs shows the demonstrated advantages of the proposed method over other methods and the similarity between detected lineaments and fault lines in the study area. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Multi-scale gradient wavelet-based image quality assessment
- Author
-
Mobini, Mobina and Faraji, Mohammad Reza
- Published
- 2024
- Full Text
- View/download PDF
4. A DWT-Based Approach with Gradient Analysis for Robust and Blind Medical Image Watermarking.
- Author
-
Hebbache, Khaled, Khaldi, Belal, Aiadi, Oussama, and Benziane, Ali
- Subjects
DIGITAL image watermarking ,DISCRETE wavelet transforms ,MEDICAL imaging systems ,BLOCK codes ,IMAGE transmission - Abstract
Featured Application: The future applications of the present study are to secure patient information and diagnostic details within medical images, ensuring data integrity and authenticity. The method enhances the security of medical records during storage, transmission, and especially in telemedicine, where images are frequently shared over digital networks. The growing adoption of telemedicine necessitates robust security measures for medical images during transmission. This paper proposes a novel blind watermarking system for medical images that utilizes both image gradients and the Discrete Wavelet Transform (DWT). Image gradients, acting as spatial derivatives, provide a "topological map" of the image, aiding in the identification of areas susceptible to disruption. The DWT, with its multi-resolution analysis, offers a favorable balance between robustness and imperceptibility. The proposed method embeds the watermark within the low–low band (LL) of the DWT-decomposed image, specifically in 3 × 3 block regions selected based on gradient information. The mathematical relationships between the gradient's direction and magnitude are employed to extract the corresponding blocks and their codes adequately. These codes are then XORed with the watermark and embedded into the chosen blocks using the least significant bit (LSB) technique. Extensive experimentation on a medical image dataset evaluates the system's performance against some attacks like filtering, noise, and scaling. The results demonstrate the efficacy of the proposed approach in hiding information while ensuring the security and integrity of watermarked medical images. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Satellite Ortho Image Mosaic Process Quality Verification
- Author
-
Narasimharao, Jonnadula, Priyanka Chowdary, P., Reddy, Avala Raji, Swathi, G., Deepak Kumar, B. P., Batchu, Sree Saranya, Howlett, Robert J., Series Editor, Jain, Lakhmi C., Series Editor, Bhateja, Vikrant, editor, Yang, Xin-She, editor, Ferreira, Marta Campos, editor, Sengar, Sandeep Singh, editor, and Travieso-Gonzalez, Carlos M., editor
- Published
- 2023
- Full Text
- View/download PDF
6. On Applying Gradient Based Thresholding on the Canny Edge Detection Results to Improve the Effectiveness of Fuzzy Hough Transform for Colonoscopy Polyp Detection Purposes
- Author
-
Ismail, Raneem, Prukner, Péter, Nagy, Szilvia, Howlett, Robert J., Series Editor, Jain, Lakhmi C., Series Editor, Kountchev, Roumen, editor, Mironov, Rumen, editor, and Nakamatsu, Kazumi, editor
- Published
- 2023
- Full Text
- View/download PDF
7. A DWT-Based Approach with Gradient Analysis for Robust and Blind Medical Image Watermarking
- Author
-
Khaled Hebbache, Belal Khaldi, Oussama Aiadi, and Ali Benziane
- Subjects
medical image watermarking ,image gradient ,discrete wavelet transform ,image security ,telemedicine ,watermarking attacks ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
The growing adoption of telemedicine necessitates robust security measures for medical images during transmission. This paper proposes a novel blind watermarking system for medical images that utilizes both image gradients and the Discrete Wavelet Transform (DWT). Image gradients, acting as spatial derivatives, provide a “topological map” of the image, aiding in the identification of areas susceptible to disruption. The DWT, with its multi-resolution analysis, offers a favorable balance between robustness and imperceptibility. The proposed method embeds the watermark within the low–low band (LL) of the DWT-decomposed image, specifically in 3 × 3 block regions selected based on gradient information. The mathematical relationships between the gradient’s direction and magnitude are employed to extract the corresponding blocks and their codes adequately. These codes are then XORed with the watermark and embedded into the chosen blocks using the least significant bit (LSB) technique. Extensive experimentation on a medical image dataset evaluates the system’s performance against some attacks like filtering, noise, and scaling. The results demonstrate the efficacy of the proposed approach in hiding information while ensuring the security and integrity of watermarked medical images.
- Published
- 2024
- Full Text
- View/download PDF
8. Facial depth forgery detection based on image gradient.
- Author
-
Xu, Kun, Yang, Gaoming, Fang, Xianjin, and Zhang, Ji
- Subjects
HOUGH transforms ,FORGERY ,DEEP learning ,STREAMING video & television - Abstract
With the widespread application of deep learning, many artificially generated fake images and videos appear on the Internet. However, it is difficult for people to distinguish the real from the fake ones, making the research on detecting and recognizing fake images or videos receive significant attention. Since new forgery techniques can reduce the effectiveness of specific detection methods or even make them ineffective, research on detecting facial depth forgery needs to be continuously developed. To defend against the onslaught of new facial depth forgery methods, we proposed an image gradient-based approach to transform the facial depth forgery detection problem into the recognition and analysis of video frames. Specifically, there are two key components in this approach: (1) we capture images from videos and crop the face section, which dramatically reduces the amount of data; (2) we use the image gradient operator to process the face image that extracts image features for detection and recognition. After these, we have conducted extensive experiments on different facial depth forgery datasets. Experimental results demonstrated that using our image gradient approach could effectively detect facial depth forgery and achieve excellent detection and identification performance. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
9. A fast face recognition system based on annealing algorithm to optimize operator parameters.
- Author
-
Yan, Lijuan, Zhang, Yanhu, and Zhang, Yanjun
- Subjects
- *
SIMULATED annealing , *HUMAN facial recognition software , *COMPUTATIONAL complexity - Abstract
In order to improve the recognition rate and operational efficiency of the system, a method in which the image features compensation coefficients are optimized by using an improved simulated annealing algorithm is proposed. Firstly, eight computational factors with low computational complexity are given, which can be used to compensate image features. Secondly, the design flow of face recognition algorithm is presented. Thirdly, an improved simulated annealing algorithm is designed to solve the optimal combination of feature compensation coefficients in the face recognition system. Fourthly, the results of the feature compensation coefficients recommended by the improved simulated annealing algorithm are applied to the Efficient Face Recognition Algorithm (EFRA) in this paper, and verified on the simulation platform. Experiments show that the recognition rate can reach 100% when the training images are 6 in ORL. The proposed algorithm also performs well in MU_PIE dataset. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
10. Multi-region Based Radial GCN Algorithm for Human Action Recognition
- Author
-
Jang, Han-Byul, Lee, Chil-Woo, Filipe, Joaquim, Editorial Board Member, Ghosh, Ashish, Editorial Board Member, Prates, Raquel Oliveira, Editorial Board Member, Zhou, Lizhu, Editorial Board Member, Sumi, Kazuhiko, editor, Na, In Seop, editor, and Kaneko, Naoshi, editor
- Published
- 2022
- Full Text
- View/download PDF
11. Enhancing LSPIV accuracy in low-speed flows and heterogeneous seeding conditions using image gradient.
- Author
-
Massó, Leandro, Patalano, Antoine, García, Carlos M., Ochoa García, Santiago A., and Rodríguez, Andrés
- Subjects
- *
WATER management , *PARTICLE image velocimetry , *STANDARD deviations , *FLOW measurement , *FLOW velocity - Abstract
Flow measurement in rivers and channels is crucial for water resource management and infrastructure planning, especially under the context of climate change. However, traditional methods like mechanical current meters and hydroacoustic instruments face limitations in terms of cost, intrusiveness, and accessibility. In recent years, image-based velocimetry techniques have emerged as promising alternatives due to their non-contact nature and cost-effectiveness. Nevertheless, persistent challenges remain, particularly concerning the uniform distribution of surface tracers necessary for precise measurements. These challenges are particularly pronounced in cases involving artificial seeding, where ensuring uniform distribution poses a significant obstacle. To address this issue, this study presents a novel methodology for filtering Large Scale Particle Image Velocimetry (LSPIV) data based on indicators of pixel intensity gradients. The methodology was evaluated across various field measurements under low flow conditions, encompassing a wide range of seeding characteristics. The evaluations demonstrated improvements in mean surface velocity profile estimation, showing reductions of up to 70 % in normalized root mean square error compared to not using filters. Additionally, the results were compared with filters typically employed by experienced LSPIV users, such as background subtraction and cross-correlation coefficient thresholds, showing improvements with the proposed filter. Implementation of the proposed strategy reduces the subjectivity in LSPIV implementation, particularly for users with limited knowledge of the technique, but also require minimal post-processing efforts. The methodology is anticipated to be integrated into existing software tools, thereby enhancing the accessibility of LSPIV for individuals with limited expertise in image velocimetry. Overall, this methodology facilitates cost-effective expansion of hydrological information availability, particularly in resource-constrained regions. • New filtering method for image velocimetry in rivers handles heterogeneous seeding. • The method enhances surface velocity estimations for more accurate flow measurement. • The methodology was validated in a wide range of environmental and seeding conditions. • This approach minimizes the need for subjective choices during data analysis. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
12. Video indexing through human face images using LGFA and window technique.
- Author
-
Ghatak, Sanjoy and Battacharjee, Debotosh
- Subjects
VIDEO monitors ,FACE ,VIDEOS ,TELEVISION series ,HUMAN beings - Abstract
Adaptive video monitoring settings have been extensively deployed in recent years. Smart video monitoring technology enables the acquisition and analysis of movies from various devices, as well as automatic analysis based on knowledge gathering. However, the storage capacity is restricted, and the important frames from the movie cannot be saved. Though, if the movie employs face as keyframes, it creates space and time complexity. To address this issue, the Viola-Jones Algorithm was used to detect faces from extracted keyframes in Video Indexing through Human Face Images using LGFA and the sliding window technique. The image gradient for brightness is created by integrating the sliding windowing method with LGFA, and scanning the input image horizontally takes up 70% of the facial image. As a result, Barcode as an index using the sequence table of the EAN 8 approach converts a video's human face into an EAN-8 linear video indexing barcode and thereby reducing bandwidth, storage space, and time complexity. Regular TV series video datasets, datasets of YouTube faces, and data sets of Hollywood clips were used to evaluate the proposed technique, and shown to be effective for indexing videos based on human faces. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
13. Reversible Data Hiding in Encrypted Images Based on Adaptive Predictionerror Label Map.
- Author
-
Yu Ren, Jiaohua Qin, Yun Tan, and Xiong, Neal N.
- Subjects
PREDICTION models ,PIXELS - Abstract
In the field of reversible data hiding in encrypted images (RDH-EI), predict an image effectively and embed a message into the image with lower distortion are two crucial aspects. However, due to the linear regression prediction being sensitive to outliers, it is a challenge to improve the accuracy of predictions. To address this problem, this paper proposes an RDH-EI scheme based on adaptive prediction-error label map. In the prediction stage, an adaptive threshold estimation algorithm based on local complexity is proposed. Then, the pixels selection method based on gradient of image is designed to train the parameters of the prediction model. In the embedding stage, reserve enough space to embed auxiliary information and secret data embedding by flipping the least significant bits (LSBs) to encrypt the original image. In the receiver, based on the prediction- error map, error-free image recovery can be achieved. Extensive experimental results have shown that the proposed method can achieve effective pixel prediction results and get a higher embedding rate (ER). [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
14. Image Interpolation with Regional Gradient Estimation.
- Author
-
Jia, Zuhang and Huang, Qingjiu
- Subjects
INTERPOLATION ,COMPUTATIONAL complexity ,NONLINEAR equations ,PROBLEM solving ,PIXELS - Abstract
This paper proposes an image interpolation method with regional gradient estimation (GEI) to solve the problem of the nonlinear interpolation method not sufficiently considering non-edge pixels. First, the approach presented in this paper expanded on the edge diffusion idea used in CGI and proposed a regional gradient estimation strategy to improve the problem of gradient calculation in the CGI method. Next, the gradient value was used to determine whether a pixel was an edge pixel. Then, a 1D directional filter was employed to process edge pixels while interpolating non-edge pixels using a 2D directionless filter. Finally, we experimented with various representative interpolation methods for grayscale and color images, including the one presented in this paper, and compared them in terms of subjective results, objective criteria, and computational complexity. The experimental results showed that GEI performed better than the other methods in an experiment concerning the visual effect, objective criteria, and computational complexity. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
15. A fast face recognition based on image gradient compensation for feature description.
- Author
-
Zhang, Yanhu and Yan, Lijuan
- Subjects
FACE perception ,HUMAN facial recognition software ,IMAGE recognition (Computer vision) ,PRINCIPAL components analysis ,SUPPORT vector machines - Abstract
To improve the efficiency of traditional face recognition techniques, this paper proposes a novel face recognition algorithm called Image Gradient Feature Compensation (IGFC). Based on the gradients along four directions in an image, a fusion algorithm and a compensation method are implemented to obtain features of the original image. In this study, gradient magnitude maps of a face image are calculated along four directions. Fusion gradients and differential fusion gradients are produced by fusing the four gradient magnitude maps of a face image in multiple ways, and they are used as compensation variables to compensate the appropriate coefficients on the original image and build IGFC feature maps of the original face image. Subsequently, IGFC feature maps are divided into several blocks to calculate the concatenated histogram over all blocks, which is in turn utilized as the feature descriptor for face recognition. Principal component analysis (PCA) is used to cut down the number of dimensions in high-dimensional features, which are recognized by the Support Vector Machine (SVM) classifier. Finally, the proposed IGFC method is superior to traditional methods as suggested by verification studies on YALE, ORL, CMU_PIE, and FERET face databases. When the LibSVM parameter was set to '-s 0 -t 2 -c 16 -g 0.0009765625', the algorithm achieved 100% recognition on Yale and ORL data sets, 92.16% on CMU_PIE data sets, and 74.3% on FERET data sets. The average time for simultaneous completion of the data sets examined was 1.93 s, and the algorithm demonstrated a 70.71% higher running efficiency than the CLBP algorithm. Therefore, the proposed algorithm is highly efficient in feature recognition with excellent computational efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
16. Adaptive Retinex Algorithm Based on Detail Selection Used in Underwater Image Enhancement.
- Author
-
XU Li, LU Guiming, and QIU Zhenguang
- Subjects
IMAGE intensifiers - Abstract
Aiming at the contradiction of color distortion and image detail enhancement when Retinex algorithm is applied to underwater image enhancement, the adaptive multi-scale Retinex underwater image enhancement algorithm combined with detail information is proposed in this paper. The selection requirements of the scale of convolution function in the enhancement of Retinex algorithm for underwater images containing different detail information are analyzed. Image gradient is used as the adjustment factor to adaptively adjust the weight of multi-scale Retinex operator, which is used to meet the requirements of underwater image with different detail information for contrast enhancement, and effectively alleviate the contradiction between color distortion and detail contrast enhancement in underwater image enhancement. Several groups of experiments show that the algorithm is superior to the traditional multi-scale (MSR) and multi-scale Retinex with color restoration (MSRCR) in removing the blue-green background of underwater image, avoiding color distortion, eliminating non-uniform illumination and image detail enhancement. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. Image Watermarking Scheme Using LSB and Image Gradient.
- Author
-
Faheem, Zaid Bin, Ali, Mubashir, Raza, Muhammad Ahsan, Arslan, Farrukh, Ali, Jehad, Masud, Mehedi, and Shorfuzzaman, Mohammad
- Subjects
DIGITAL image watermarking ,WATERMARKS ,DIGITAL communications ,DIGITAL watermarking ,LYAPUNOV exponents ,LINEAR operators ,IMAGE processing - Abstract
In the modern age, watermarking techniques are mandatory to secure digital communication over the internet. For an optimal technique, a high signal-to-noise ratio and normalized correctional is required. In this paper, a digital watermarking technique is proposed on the basis of the least significant bit through an image gradient and chaotic map. The image is segmented into noncorrelated blocks, and the gradient of each block is calculated. The gradient of the image expresses the rapid changes in an image. A chaotic substitution box (S-Box) is used to scramble the watermark according to a piecewise linear chaotic map (PWLCM). PWLCM has a positive Lyapunov exponent and better balance property as compared to other chaotic maps. This S-Box technique is capable of producing a disperse sequence with high nonlinearity in the generated sequence. Least significant bit is a simple technique for embedding but it has a high payload capacity and direct pixel manipulation. The embedding payload introduces a tradeoff between robustness and imperceptibility; hence, the image gradient is a technique to identify the best-suited place to embed a watermark and avoid image degradation. By modifying the least significant bits of the original image, the watermark signal is embedded according to the image gradient. In the image gradient, the direction and magnitude decide how much embeding can be done. In comparison with other methods, the experimental results show satisfactory progress in robustness against several image processing and geometrical attacks while maintaining the imperceptibility of the watermark signal. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
18. Specular highlight removal of light field image combining dichromatic reflection with exemplar patch filling.
- Author
-
Feng, Wei, Sun, Jichen, Liu, Qianqian, Li, Xingang, Liu, Da, and Zhai, Zhongsheng
- Subjects
- *
LIGHT-field cameras , *GAUSSIAN mixture models , *SUM of squares , *K-means clustering , *MARKOV random fields , *ENTROPY (Information theory) - Abstract
• Our method detects and removes specular highlight effectively. • Gaussian mixture model clustering is more accurate than k-means clustering. • Confidence strategy makes the calculated unsaturated pixels more realistic. • New exemplar patch matching criterion finds the best exemplar patch quickly. In this paper, a new method based on dichromatic reflection model (DRM) and exemplar patch is proposed to remove highlight pixels for light field images. Firstly, a Gaussian mixture model clustering method with strong generalization performance combined with depth information is used to classify saturated highlight pixels and unsaturated highlight pixels. A confidence strategy based on DRM is proposed to remove unsaturated highlight pixels, and an exemplar patch matching method based on gradient combined with the sum of square of color difference is designed to remove saturated highlight pixels. Meanwhile, a method named SSIME based on information entropy with structural similarity index measure is designed to quantitatively evaluate the effectiveness of proposed method. Experiments show that our proposed method not only effectively detects and removes specular highlights, but also applies to objects with a large range of highlights, which breakthroughs the limitation that large-region specular highlight cannot be removed due to the short baseline of the light field camera. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Image Interpolation with Regional Gradient Estimation
- Author
-
Zuhang Jia and Qingjiu Huang
- Subjects
image interpolation ,image enhancement ,bicubic interpolation ,nonlinear interpolation ,image gradient ,edge diffusion ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
This paper proposes an image interpolation method with regional gradient estimation (GEI) to solve the problem of the nonlinear interpolation method not sufficiently considering non-edge pixels. First, the approach presented in this paper expanded on the edge diffusion idea used in CGI and proposed a regional gradient estimation strategy to improve the problem of gradient calculation in the CGI method. Next, the gradient value was used to determine whether a pixel was an edge pixel. Then, a 1D directional filter was employed to process edge pixels while interpolating non-edge pixels using a 2D directionless filter. Finally, we experimented with various representative interpolation methods for grayscale and color images, including the one presented in this paper, and compared them in terms of subjective results, objective criteria, and computational complexity. The experimental results showed that GEI performed better than the other methods in an experiment concerning the visual effect, objective criteria, and computational complexity.
- Published
- 2022
- Full Text
- View/download PDF
20. Image Watermarking Scheme Using LSB and Image Gradient
- Author
-
Zaid Bin Faheem, Mubashir Ali, Muhammad Ahsan Raza, Farrukh Arslan, Jehad Ali, Mehedi Masud, and Mohammad Shorfuzzaman
- Subjects
substitution box ,chaotic map ,piecewise linear chaotic map ,least significant bit ,image gradient ,Technology ,Engineering (General). Civil engineering (General) ,TA1-2040 ,Biology (General) ,QH301-705.5 ,Physics ,QC1-999 ,Chemistry ,QD1-999 - Abstract
In the modern age, watermarking techniques are mandatory to secure digital communication over the internet. For an optimal technique, a high signal-to-noise ratio and normalized correctional is required. In this paper, a digital watermarking technique is proposed on the basis of the least significant bit through an image gradient and chaotic map. The image is segmented into noncorrelated blocks, and the gradient of each block is calculated. The gradient of the image expresses the rapid changes in an image. A chaotic substitution box (S-Box) is used to scramble the watermark according to a piecewise linear chaotic map (PWLCM). PWLCM has a positive Lyapunov exponent and better balance property as compared to other chaotic maps. This S-Box technique is capable of producing a disperse sequence with high nonlinearity in the generated sequence. Least significant bit is a simple technique for embedding but it has a high payload capacity and direct pixel manipulation. The embedding payload introduces a tradeoff between robustness and imperceptibility; hence, the image gradient is a technique to identify the best-suited place to embed a watermark and avoid image degradation. By modifying the least significant bits of the original image, the watermark signal is embedded according to the image gradient. In the image gradient, the direction and magnitude decide how much embeding can be done. In comparison with other methods, the experimental results show satisfactory progress in robustness against several image processing and geometrical attacks while maintaining the imperceptibility of the watermark signal.
- Published
- 2022
- Full Text
- View/download PDF
21. Complex Gradient Function Based Image Descriptor
- Author
-
Shekar, B. H., Shetty, P. Rathnakara, Bhat, Sharada S., and Mestetsky, Leonid
- Published
- 2023
- Full Text
- View/download PDF
22. BARNet: Boundary Aware Refinement Network for Crack Detection
- Author
-
Jing-Ming Guo, Jiann-Der Lee, and Herleeyandi Markoni
- Subjects
Computer science ,business.industry ,Mechanical Engineering ,Deep learning ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Boundary (topology) ,Convolutional neural network ,Computer Science Applications ,Image (mathematics) ,Feature (computer vision) ,Automotive Engineering ,Shadow ,ComputingMethodologies_GENERAL ,Enhanced Data Rates for GSM Evolution ,Artificial intelligence ,business ,Algorithm ,Image gradient - Abstract
Road crack is one of the prominent problems that can frequently occur in highways and main roads. The manual road crack evaluation is laborious, time-consuming, inaccurate, and it has several implementation issues. Conversely, the computer vision-based solution is very challenging due to the complex ambient conditions, including illumination, shadow, dust, and crack shape. Most of the cracks exist as irregular edge patterns and are the most important features for detection purpose. Recent advances in deep learning adopt a convolutional neural network as the base model to detect and localize crack with a single RGB image. Yet, this approach has an inaccurate boundary for crack localization, resulting in thicker and blurry edges. To overcome this problem, the study proposes a novel and robust road crack detection based on deep learning which also considers the original edge of the image as the additional feature. The main contribution of this work is adapting the original image gradient with the coarse crack detection result and refining it to produce more precise crack boundaries. Extensive experimental results have shown that the proposed method outperforms the former state-of-the-art methods in terms of the detection accuracy.
- Published
- 2022
- Full Text
- View/download PDF
23. Occupancy Map Guided Fast Video-Based Dynamic Point Cloud Coding
- Author
-
Miaohui Wang, Jian Xiong, Weisi Lin, Hongliang Li, and Hao Gao
- Subjects
Pixel ,Computational complexity theory ,Computer science ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Point cloud ,02 engineering and technology ,Frame rate ,0202 electrical engineering, electronic engineering, information engineering ,Media Technology ,020201 artificial intelligence & image processing ,Electrical and Electronic Engineering ,Projection (set theory) ,Algorithm ,Image gradient ,Data compression ,Block (data storage) - Abstract
In video-based dynamic point cloud compression (V-PCC), 3D point clouds are projected into patches, and then the patches are padded into 2D images suitable for the video compression framework. However, the patch projection-based method produces a large number of empty pixels; the far and near components are projected to generate different 2D images (video frames), respectively. As a result, the generated video is with high resolutions and double frame rates, so the V-PCC has huge computational complexity. This paper proposes an occupancy map guided fast V-PCC method. Firstly, the relationship between the prediction coding and block complexity is studied based on a local linear image gradient model. Secondly, according to the V-PCC strategies of patch projection and block generation, we investigate the differences of rate-distortion characteristics between different types of blocks, and the temporal correlations between the far and near layers. Finally, by taking advantage of the fact that occupancy maps can explicitly indicate the block types, we propose an occupancy map guided fast coding method, in which coding is performed on the different types of blocks. Experiments have tested typical dynamic point clouds, and shown that the proposed method achieves an average 43.66% time-saving at the cost of only 0.27% and 0.16% Bjontegaard Delta (BD) rate increment under the geometry Point-to-Point (D1) error and attribute Luma Peak-Signal-Noise-Ratio (PSNR), respectively.
- Published
- 2022
- Full Text
- View/download PDF
24. SLIC Superpixel Segmentation for Polarimetric SAR Images
- Author
-
Jian Yang, Liangjiang Zhou, Tao Wang, Junjun Yin, Yanlei Du, and Xiyun Liu
- Subjects
Similarity (geometry) ,Computer science ,business.industry ,Polarimetry ,Boundary (topology) ,Initialization ,Pattern recognition ,Feature (computer vision) ,General Earth and Planetary Sciences ,Segmentation ,Artificial intelligence ,Electrical and Electronic Engineering ,Cluster analysis ,business ,Image gradient - Abstract
Superpixel segmentation approaches for polarimetric synthetic aperture radar (SAR) images have only been studied in recent years. Simple linear iterative clustering (SLIC) is a simple and efficient superpixel segmentation method, first proposed for optical images. It basically includes three implementation steps, i.e., initialization, local k-means clustering, and postprocessing. The challenge of applying SLIC to polarimetric SAR images lies in constructing the effective spatial and feature similarity and proposing the efficient segmentation procedure. In this study, to address both issues, we modify the SLIC clustering function to adapt the characteristics of polarimetric statistical measures. A new initialization method is proposed, which exploits the image gradient information to produce robust cluster centers. Furthermore, in an effort to give a comprehensive comparison and provide a fair assessment of the feature similarities for polarimetric SAR imagery, four classic statistical distances, among which two were not studied along with the SLIC previously, are embedded in the modified clustering function. The proposed method is validated by comparing with state-of-the-art SLIC-based algorithms and also the Ncut and TurboPixel algorithms. Experiments on extensive polarimetric SAR data sets show that the proposed method can significantly improve the segmentation results, with producing better boundary adherence and compact as well as uniform superpixels. We also obtain distinct conclusions that are different from the existing studies when investigating the performances of the statistical measures.
- Published
- 2022
- Full Text
- View/download PDF
25. Automatic Registration of Images with Inconsistent Content Through Line-Support Region Segmentation and Geometrical Outlier Removal
- Author
-
Bowen An, Yongpeng Wu, Ming Zhao, Andre Kaup, Pan Shengda, and Fan Zhou
- Subjects
FOS: Computer and information sciences ,Synthetic aperture radar ,Computer science ,Computer Vision and Pattern Recognition (cs.CV) ,Multispectral image ,Feature extraction ,0211 other engineering and technologies ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Computer Science - Computer Vision and Pattern Recognition ,Image registration ,Scale-invariant feature transform ,02 engineering and technology ,Line segment ,FOS: Electrical engineering, electronic engineering, information engineering ,0202 electrical engineering, electronic engineering, information engineering ,Segmentation ,Image gradient ,021101 geological & geomatics engineering ,business.industry ,Image and Video Processing (eess.IV) ,Pattern recognition ,Image segmentation ,Electrical Engineering and Systems Science - Image and Video Processing ,Computer Graphics and Computer-Aided Design ,Computer Science::Computer Vision and Pattern Recognition ,Outlier ,020201 artificial intelligence & image processing ,Affine transformation ,Artificial intelligence ,business ,Software - Abstract
The implementation of automatic image registration is still difficult in various applications. In this paper, an automatic image registration approach through line-support region segmentation and geometrical outlier removal is proposed. This new approach is designed to address the problems associated with the registration of images with affine deformations and inconsistent content, such as remote sensing images with different spectral content or noise interference, or map images with inconsistent annotations. To begin with, line-support regions, namely a straight region whose points share roughly the same image gradient angle, are extracted to address the issues of inconsistent content existing in images. To alleviate the incompleteness of line segments, an iterative strategy with multi-resolution is employed to preserve global structures that are masked at full resolution by image details or noise. Then, geometrical outlier removal is developed to provide reliable feature point matching, which is based on affine-invariant geometrical classifications for corresponding matches initialized by scale invariant feature transform. The candidate outliers are selected by comparing the disparity of accumulated classifications among all matches, instead of conventional methods which only rely on local geometrical relations. Various image sets have been considered in this paper for the evaluation of the proposed approach, including aerial images with simulated affine deformations, remote sensing optical and synthetic aperture radar images taken at different situations (multispectral, multisensor, and multitemporal), and map images with inconsistent annotations. Experimental results demonstrate the superior performance of the proposed method over the existing approaches for the whole data set.
- Published
- 2022
26. Classification of High Frequency Oscillations in intracranial EEG signals based on coupled time-frequency and image-related features
- Author
-
Fatma Krikid, Ahmad Karfoul, Sahbi Chaibi, Amar Kachenoura, Anca Nica, Abdennaceur Kachouri, Régine Le Bouquin Jeannès, Laboratoire Traitement du Signal et de l'Image (LTSI), Université de Rennes (UR)-Institut National de la Santé et de la Recherche Médicale (INSERM), Université de Sfax - University of Sfax, 19G1411, Providence Health Care, PHC: 41711PK, Jonchère, Laurent, Institut National de la Santé et de la Recherche Médicale (INSERM)-Université de Rennes 1 (UR1), and Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES)
- Subjects
[SDV.IB] Life Sciences [q-bio]/Bioengineering ,Epilepsy ,SVM ,Biomedical Engineering ,020206 networking & telecommunications ,Health Informatics ,02 engineering and technology ,03 medical and health sciences ,0302 clinical medicine ,Image gradient ,Signal Processing ,0202 electrical engineering, electronic engineering, information engineering ,Time-Frequency representation ,[SDV.IB]Life Sciences [q-bio]/Bioengineering ,Sparsity ,030217 neurology & neurosurgery ,High Frequency Oscillations ,Multi-classification - Abstract
International audience; High Frequency Oscillations (HFOs) occurring in the range of [30–500 Hz] in epileptic intracranial ElectroEncephaloGraphic (iEEG) signals have recently proven to be good biomarkers for localizing the epileptogenic zone. Identifying these particular cerebral events and their discrimination from other transient events like interictal epileptic spikes is traditionally performed by experts through a visual inspection. However, this is laborious, very time-consuming and subjective. In this paper, a new classification approach of HFOs is proposed. This approach mainly relies on the combination of raw time frequency (TF) features, computed from a TF representation of HFOs using S-transform, with relevant image-based ones derived from a binarization of the corresponding TF grayscale image. The obtained feature vector is then used to learn a multi-class Radial Basis Function (RBF) based Support Vector Machine (SVM) classifier. The efficiency of the proposed approach, compared to conventional classification schemes based only on time, frequency or energy-based features, is confirmed, using both simulated and real iEEG signals. The proposed classification system has achieved, using simulated data and a signal to noise ratio (SNR) of 15 dB, a sensitivity, specificity, accuracy, area under the curve and F1-score around 0.990, 0.996, 0.995, 0.993 and 0.990 respectively. Besides, for real data, our proposed approach has attained the scores of 0.765, 0.941, 0.906, 0.929 and 0.768 for sensitivity, specificity, accuracy, area under the curve and F1-score respectively. These results confirm the relevance of coupling TF and image-related features, in the way proposed in this paper, for higher HFOs classification quality compared to already existing approaches. © 2021 Elsevier Ltd
- Published
- 2022
- Full Text
- View/download PDF
27. Edge-Conditioned Feature Transform Network for Hyperspectral and Multispectral Image Fusion
- Author
-
Yuxuan Zheng, Wu Xianyun, Jocelyn Chanussot, Jiaojiao Li, Yanzi Shi, Jie Guo, Yunsong Li, Xidian University, GIPSA - Signal Images Physique (GIPSA-SIGMAPHY), GIPSA Pôle Sciences des Données (GIPSA-PSD), Grenoble Images Parole Signal Automatique (GIPSA-lab), Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), Université Grenoble Alpes (UGA)-Grenoble Images Parole Signal Automatique (GIPSA-lab), Université Grenoble Alpes (UGA), Apprentissage de modèles à partir de données massives (Thoth), Inria Grenoble - Rhône-Alpes, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Laboratoire Jean Kuntzmann (LJK), Institut National de Recherche en Informatique et en Automatique (Inria)-Centre National de la Recherche Scientifique (CNRS)-Université Grenoble Alpes (UGA)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP ), and ANR-19-P3IA-0003,MIAI,MIAI @ Grenoble Alpes(2019)
- Subjects
multispectral image (MSI) ,Computer science ,business.industry ,Feature extraction ,Multispectral image ,Hyperspectral imaging ,Pattern recognition ,Sobel operator ,Iterative reconstruction ,image fusion ,Transformation (function) ,Feature (computer vision) ,hyperspectral image (HSI) ,transfer learning (TL) ,General Earth and Planetary Sciences ,Artificial intelligence ,Electrical and Electronic Engineering ,Edge prior ,business ,[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing ,Image gradient ,feature transform network - Abstract
International audience; Despite recent advances achieved by deep learning techniques in the fusion of low-spatial-resolution hyperspectral image (LR-HSI) and high-spatial-resolution multispectral image (HR-MSI), it remains a challenge to reconstruct the high-spatial-resolution HSI (HR-HSI) with more accurate spatial details and less spectral distortions, since the low-level structure information such as sharp edges tends to be weakened or lost as the network depth grows. To tackle this issue, we creatively propose an edge-conditioned feature transform network (EC-FTN) in this article, which is mainly composed of three parts, namely, feature extraction network (FEN), feature fusion and transformation network (FFTN), and image reconstruction network (IRN). First, two computationally efficient FENs with 3-D convolutions and reshaping layers are employed to extract the joint spectral-spatial features of input images. Then, the FFTN conditioned on the edge map prior can fuse and transform the features adaptively, in which a fusion node and several cascaded feature modulation modules (FMMs) equipped with feature-wise modulation layers are constructed. Specifically, the edge map is generated via transfer learning, i.e., by applying the Sobel operator to feature maps of the red-green-blue (RGB) version of HR-MSI resulting from the pretrained VGG16 model without extra training. Finally, the desired HR-HSI is recovered from the transformed features through IRN. Furthermore, we elaborately design a weighted combinatorial loss function consisting of mean absolute error, image gradient difference, and spectral angle terms to guide the training. Experiments on both ground-based and remotely sensed datasets demonstrate that our EC-FTN outperforms state-of-the-art methods in visual and quantitive evaluations, as well as in fine details reconstruction.
- Published
- 2022
- Full Text
- View/download PDF
28. High dynamic range imaging via gradient-aware context aggregation network
- Author
-
Yu Zhu, Qingsen Yan, Anton van den Hengel, Javen Shi, Yanning Zhang, Jinqiu Sun, and Dong Gong
- Subjects
Ground truth ,Pixel ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Context (language use) ,Domain (software engineering) ,Artificial Intelligence ,High-dynamic-range imaging ,Signal Processing ,Computer vision ,Computer Vision and Pattern Recognition ,Artificial intelligence ,business ,Ghosting ,Image resolution ,Software ,Image gradient - Abstract
Obtaining a high dynamic range (HDR) image from multiple low dynamic range images with different exposures is an important step in various computer vision tasks. One of the ongoing challenges in the field is to generate HDR images without ghosting artifacts. Motivated by an observation that such artifacts are particularly noticeable in the gradient domain, in this paper, we propose an HDR imaging approach that aggregates the information from multiple LDR images with guidance from image gradient domain. The proposed method generates artifact-free images by integrating the image gradient information and the image context information in the pixel domain. The context information in a large area helps to reconstruct the contents contaminated by saturation and misalignments. Specifically, an additional gradient stream and the supervision in the gradient domain are applied to incorporate the gradient information in HDR imaging. To use the context information captured from a large area while preserving spatial resolution, we adopt dilated convolutions to extract multi-scale features with rich context information. Moreover, we build a new dataset containing 40 groups of real-world images from diverse scenes with ground truth to validate the proposed model. The samples in the proposed dataset include more challenging moving objects inducing misalignments. Extensive experimental results demonstrate that our proposed model outperforms previous methods on different datasets in terms of both quantitative measure and visual perception quality.
- Published
- 2022
- Full Text
- View/download PDF
29. Classification of High Frequency Oscillations in intracranial EEG signals based on coupled time-frequency and image-related features.
- Author
-
Krikid, Fatma, Karfoul, Ahmad, Chaibi, Sahbi, Kachenoura, Amar, Nica, Anca, Kachouri, Abdennaceur, and Le Bouquin Jeannès, Régine
- Subjects
ELECTROENCEPHALOGRAPHY ,BIOMEDICAL signal processing ,RADIAL basis functions ,SIGNAL-to-noise ratio ,SUPPORT vector machines ,INSPECTION & review ,CLASSIFICATION - Abstract
• Proposal of a multi-classification approach for HFOs in intracranial EEG signals. • Coupled time-frequency and image-related features towards efficient HFOs multi-classification. • High performance on simulated and real iEEG signals. High Frequency Oscillations (HFOs) occurring in the range of [30–500 Hz] in epileptic intracranial ElectroEncephaloGraphic (iEEG) signals have recently proven to be good biomarkers for localizing the epileptogenic zone. Identifying these particular cerebral events and their discrimination from other transient events like interictal epileptic spikes is traditionally performed by experts through a visual inspection. However, this is laborious, very time-consuming and subjective. In this paper, a new classification approach of HFOs is proposed. This approach mainly relies on the combination of raw time frequency (TF) features, computed from a TF representation of HFOs using S-transform, with relevant image-based ones derived from a binarization of the corresponding TF grayscale image. The obtained feature vector is then used to learn a multi-class Radial Basis Function (RBF) based Support Vector Machine (SVM) classifier. The efficiency of the proposed approach, compared to conventional classification schemes based only on time, frequency or energy-based features, is confirmed, using both simulated and real iEEG signals. The proposed classification system has achieved, using simulated data and a signal to noise ratio (SNR) of 15 dB, a sensitivity, specificity, accuracy, area under the curve and F1-score around 0.990, 0.996, 0.995, 0.993 and 0.990 respectively. Besides, for real data, our proposed approach has attained the scores of 0.765, 0.941, 0.906, 0.929 and 0.768 for sensitivity, specificity, accuracy, area under the curve and F1-score respectively. These results confirm the relevance of coupling TF and image-related features, in the way proposed in this paper, for higher HFOs classification quality compared to already existing approaches. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.