Author: "Mattoccia, Stefano" / Language: english / Publication Type: Academic Journals - Searchworks@Jio Institute Digital Library Search Results

1. RGB Guided ToF Imaging System: A Survey of Deep Learning-Based Methods

Author: Qiao, Xin, Poggi, Matteo, Deng, Pengchao, Wei, Hao, Ge, Chenyang, and Mattoccia, Stefano
Published: 2024
Full Text: View/download PDF

2. Guest Editorial: Special Issue on Traditional Computer Vision in the Age of Deep Learning

Author: Poggi, Matteo, Arrigoni, Federica, Fusiello, Andrea, Mattoccia, Stefano, Bartoli, Adrien, Sattler, Torsten, and Pajdla, Tomas
Published: 2024
Full Text: View/download PDF

3. Depth super-resolution from explicit and implicit high-frequency features

Author: Qiao, Xin, Ge, Chenyang, Zhang, Youmin, Zhou, Yanhui, Tosi, Fabio, Poggi, Matteo, and Mattoccia, Stefano
Published: 2023
Full Text: View/download PDF

4. Self-supervised depth super-resolution with contrastive multiview pre-training

Author: Qiao, Xin, Ge, Chenyang, Zhao, Chaoqiang, Tosi, Fabio, Poggi, Matteo, and Mattoccia, Stefano
Published: 2023
Full Text: View/download PDF

5. A computer vision approach based on deep learning for the detection of dairy cows in free stall barn

Author: Tassinari, Patrizia, Bovo, Marco, Benni, Stefano, Franzoni, Simone, Poggi, Matteo, Mammi, Ludovica Maria Eugenia, Mattoccia, Stefano, Di Stefano, Luigi, Bonora, Filippo, Barbaresi, Alberto, Santolini, Enrica, and Torreggiani, Daniele
Published: 2021
Full Text: View/download PDF

6. Learning a confidence measure in the disparity domain from O(1) features

Author: Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
Published: 2020
Full Text: View/download PDF

7. Guest Editorial: Special Issue on Embedded Computer Vision

Author: Mattoccia, Stefano, Kisačanin, Branislav, Gelautz, Margrit, Chai, Sek, Belbachir, Ahmed Nabil, Dedeoglu, Goksel, and Stein, Fridtjof
Published: 2018
Full Text: View/download PDF

8. Real-Time Self-Supervised Monocular Depth Estimation Without GPU.

Author: Poggi, Matteo, Tosi, Fabio, Aleotti, Filippo, and Mattoccia, Stefano
Abstract: Single-image depth estimation represents a longstanding challenge in computer vision and although it is an ill-posed problem, deep learning enabled astonishing results leveraging both supervised and self-supervised training paradigms. State-of-the-art solutions achieve remarkably accurate depth estimation from a single image deploying huge deep architectures, requiring powerful dedicated hardware to run in a reasonable amount of time. This overly demanding complexity makes them unsuited for a broad category of applications requiring devices with constrained resources or memory consumption. To tackle this issue, in this paper a family of compact, yet effective CNNs for monocular depth estimation is proposed, by leveraging self-supervision from a binocular stereo rig. Our lightweight architectures, namely PyD-Net and PyD-Net2, compared to complex state-of-the-art trade a small drop in accuracy to drastically reduce the runtime and memory requirements by a factor ranging from $2\times $ to $100\times $. Moreover, our networks can run real-time monocular depth estimation on a broad set of embedded or consumer devices, even not equipped with a GPU, by early stopping the inference with negligible (or no) loss in accuracy, making it ideally suited for real applications with strict constraints on hardware resources or power consumption. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

9. Continual Adaptation for Deep Stereo.

Author: Poggi, Matteo, Tonioni, Alessio, Tosi, Fabio, Mattoccia, Stefano, and Stefano, Luigi Di
Subjects: CONVOLUTIONAL neural networks, STEREO vision (Computer science), STEREOPHONIC sound systems, STEREO image, PHYSIOLOGICAL adaptation, BINOCULAR vision, DATA distribution
Abstract: Depth estimation from stereo images is carried out with unmatched results by convolutional neural networks trained end-to-end to regress dense disparities. Like for most tasks, this is possible if large amounts of labelled samples are available for training, possibly covering the whole data distribution encountered at deployment time. Being such an assumption systematically unmet in real applications, the capacity of adapting to any unseen setting becomes of paramount importance. Purposely, we propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments. We design a lightweight and modular architecture, Modularly ADaptive Network (MADNet), and formulate Modular ADaptation algorithms (MAD, MAD++) which permit efficient optimization of independent sub-portions of the entire network. In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms. With both sources, no other data than the input images being gathered at deployment time are needed. Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

10. On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.

Author: Poggi, Matteo, Tosi, Fabio, Batsos, Konstantinos, Mordohai, Philippos, and Mattoccia, Stefano
Subjects: MACHINE learning, DEEP learning, COMPUTER vision, OPTICAL radar, MONOCULARS
Abstract: Stereo matching is one of the longest-standing problems in computer vision with close to 40 years of studies and research. Throughout the years the paradigm has shifted from local, pixel-level decision to various forms of discrete and continuous optimization to data-driven, learning-based methods. Recently, the rise of machine learning and the rapid proliferation of deep learning enhanced stereo matching with new exciting trends and applications unthinkable until a few years ago. Interestingly, the relationship between these two worlds is two-way. While machine, and especially deep, learning advanced the state-of-the-art in stereo matching, stereo itself enabled new ground-breaking methodologies such as self-supervised monocular depth estimation based on deep networks. In this paper, we review recent research in the field of learning-based depth estimation from single and binocular images highlighting the synergies, the successes achieved so far and the open challenges the community is going to face in the immediate future. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

11. On the Confidence of Stereo Matching in a Deep-Learning Era: A Quantitative Evaluation.

Author: Poggi, Matteo, Kim, Seungryong, Tosi, Fabio, Kim, Sunok, Aleotti, Filippo, Min, Dongbo, Sohn, Kwanghoon, and Mattoccia, Stefano
Subjects: STEREO vision (Computer science), CONFIDENCE, DEEP learning, ESTIMATION theory, SCIENTIFIC community, VOLUME measurements
Abstract: Stereo matching is one of the most popular techniques to estimate dense depth maps by finding the disparity between matching pixels on two, synchronized and rectified images. Alongside with the development of more accurate algorithms, the research community focused on finding good strategies to estimate the reliability, i.e., the confidence, of estimated disparity maps. This information proves to be a powerful cue to naively find wrong matches as well as to improve the overall effectiveness of a variety of stereo algorithms according to different strategies. In this paper, we review more than ten years of developments in the field of confidence estimation for stereo matching. We extensively discuss and evaluate existing confidence measures and their variants, from hand-crafted ones to the most recent, state-of-the-art learning based methods. We study the different behaviors of each measure when applied to a pool of different stereo algorithms and, for the first time in literature, when paired with a state-of-the-art deep stereo network. Our experiments, carried out on five different standard datasets, provide a comprehensive overview of the field, highlighting in particular both strengths and limitations of learning-based strategies. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

12. Full-Search-Equivalent pattern matching with incremental dissimilarity approximations

Author: Tombari, Federico, Mattoccia, Stefano, and Di Stefano, Luigi
Subjects: Object recognition (Computers) -- Methods, Object recognition (Computers) -- Comparative analysis, Pattern recognition -- Methods, Pattern recognition -- Comparative analysis, Approximation theory -- Usage
Abstract: This paper proposes a novel method for fast pattern matching based on dissimilarity functions derived from the [L.sub.p] norm, such as the Sum of Squared Differences (SSD) and the Sum of Absolute Differences (SAD). The proposed method is a full-search equivalent, i.e., it yields the same results as the Full Search (FS) algorithm. In order to pursue computational savings, the method deploys a succession of increasingly tighter lower bounds of the adopted [L.sub.p] norm-based dissimilarity function. Such bounding functions allow for establishing a hierarchy of pruning conditions aimed at rapidly skipping those candidates that cannot satisfy the matching criterion. The paper includes an experimental comparison between the proposed method and other FS-equivalent approaches known in the literature, which proves the remarkable computational efficiency of our proposal. Index Terms--Pattern matching, IDA, SSD, SAD, efficient, full-search equivalent.
Published: 2009

13. Monocular Depth Perception on Microcontrollers for Edge Applications.

Author: Peluso, Valentino, Cipolletta, Antonio, Calimera, Andrea, Poggi, Matteo, Tosi, Fabio, Aleotti, Filippo, and Mattoccia, Stefano
Subjects: DEPTH perception, MONOCULARS, COMPUTER vision, MICROCONTROLLERS, CONVOLUTIONAL neural networks, DISTRIBUTED sensors, SENSOR networks
Abstract: Depth estimation is crucial in several computer vision applications, and a recent trend in this field aims at inferring such a cue from a single camera. Unfortunately, despite the compelling results achieved, state-of-the-art monocular depth estimation methods are computationally demanding, thus precluding their practical deployment in several application contexts characterized by low-power constraints. Therefore, in this paper, we propose a lightweight Convolutional Neural Network based on a shallow pyramidal architecture, referred to as $\mu $ PyD-Net, enabling monocular depth estimation on microcontrollers. The network is trained in a peculiar self-supervised manner leveraging proxy labels obtained through a traditional stereo algorithm. Moreover, we propose optimization strategies aimed at performing computations with quantized 8-bit data and map the high-level description of the network to low-level layers optimized for the target microcontroller architecture. Exhaustive experimental results on standard datasets and an in-depth evaluation with a device belonging to the popular Arm Cortex-M family confirm that obtaining sufficiently accurate monocular depth estimation on microcontrollers is feasible. To the best of our knowledge, our proposal is the first one enabling such remarkable achievement, paving the way for the deployment of monocular depth cues onto the tiny end-nodes of distributed sensor networks. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

14. Fast full-search equivalent template matching by enhanced bounded correlation

Author: Mattoccia, Stefano, Tombari, Federico, and Di Stefano, Luigi
Subjects: Correlation (Statistics) -- Usage, Fourier transformations -- Evaluation, Image processing -- Research, Business, Computers, Electronics, Electronics and electrical industries
Abstract: The article presents a new technique called as enhanced bounded correlation (EBC), which greatly minimizes the calculations needed to perform template matching based on normalized cross correlation (NCC). The empirical outcomes demonstrate that the presented method can substantially speed up a full-search equivalent template matching process and surpass the high-tech techniques.
Published: 2008

15. Fast template matching using bounded partial correlation

Author: Di Stefano, Luigi and Mattoccia, Stefano
Published: 2003
Full Text: View/download PDF

16. On the Deployment of Out-of-the-Box Embedded Devices for Self-Powered River Surface Flow Velocity Monitoring at the Edge.

Author: Livoroi, Arsal-Hanif, Conti, Andrea, Foianesi, Luca, Tosi, Fabio, Aleotti, Filippo, Poggi, Matteo, Tauro, Flavia, Toth, Elena, Grimaldi, Salvatore, and Mattoccia, Stefano
Subjects: FLOW velocity, STREAMFLOW, ALGORITHMS, VELOCIMETRY, EDGES (Geometry), TRACKING algorithms
Abstract: As reported in the recent image velocimetry literature, tracking the motion of sparse feature points floating on the river surface as done by the Optical Tracking Velocimetry (OTV) algorithm is a promising strategy to address surface flow monitoring. Moreover, the lightweight nature of OTV coupled with computational optimizations makes it suited even for its deployment in situ to perform measurements at the edge with cheap embedded devices without the need to perform offload processing. Despite these notable achievements, the actual practical deployment of OTV in remote environments would require cheap and self-powered systems enabling continuous measurements without the need for cumbersome and expensive infrastructures rarely found in situ. Purposely, in this paper, we propose an additional simplification to the OTV algorithm to reduce even further its computational requirements, and we analyze self-powered off-the-shelf setups for in situ deployment. We assess the performance of such set-ups from different perspectives to determine the optimal solution to design a cost-effective self-powered measurement node. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

17. Real-Time Stereo within the VIDET Project

Author: Di Stefano, Luigi and Mattoccia, Stefano
Published: 2002
Full Text: View/download PDF

18. Good Cues to Learn From Scratch a Confidence Measure for Passive Depth Sensors.

Author: Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
Abstract: As reported in the stereo literature, confidence estimation represents a powerful cue to detect outliers as well as to improve depth accuracy. Purposely, we proposed a strategy enabling us to achieve state-of-the-art results by learning a confidence measure in the disparity domain only with a CNN. Since this method does not require the cost volume, it is very appealing because potentially suited for any depth-sensing technologies, including, for instance, those based on deep networks. By following this intuition, in this paper, we deeply investigate the performance of confidence estimation methods, known in the literature and new ones proposed in this paper, neglecting the use of the cost volume. Specifically, we estimate from scratch confidence measures feeding deep networks with raw depth estimates and optionally images and assess their performance deploying three datasets and three stereo algorithms. We also investigate, for the first time, their performance with disparity maps inferred by deep stereo end-to-end architectures. Moreover, we move beyond the stereo matching context, estimating confidence from depth maps generated by a monocular network. Our extensive experiments with different architectures highlight that inferring confidence prediction from the raw reference disparity only, as proposed in our previous work, is not only the most versatile solution but also the most effective one in most cases. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

19. Confidence Estimation for ToF and Stereo Sensors and Its Application to Depth Data Fusion.

Author: Poggi, Matteo, Agresti, Gianluca, Tosi, Fabio, Zanuttigh, Pietro, and Mattoccia, Stefano
Abstract: Time-of-Flight (ToF) sensors and stereo vision systems are two widely used technologies for depth estimation. Due to their rather complementary strengths and limitations, the two sensors are often combined to infer more accurate depth maps. A key research issue in this field is how to estimate the reliability of the sensed depth data. While this problem has been widely studied for stereo systems, it has been seldom considered for ToF sensors. Therefore, starting from the work done for stereo data, in this paper, we firstly introduce novel confidence estimation techniques for ToF data. Moreover, we also show how by using learning-based confidence metrics jointly trained on the two sensors yields better performance. Finally, deploying different fusion frameworks, we show how confidence estimation can be exploited in order to guide the fusion of depth data from the two sensors. Experimental results show how accurate confidence cues allow outperforming state-of-the-art data fusion schemes even with the simplest fusion strategies known in the literature. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

20. Efficient template matching for multi-channel images

Author: Mattoccia, Stefano, Tombari, Federico, and Di Stefano, Luigi
Published: 2011
Full Text: View/download PDF

21. ZNCC-based template matching using bounded partial correlation

Author: Di Stefano, Luigi, Mattoccia, Stefano, and Tombari, Federico
Published: 2005
Full Text: View/download PDF

22. Modeling and Simulation of Very High Spatial Resolution UXOs and Landmines in a Hyperspectral Scene for UAV Survey.

Author: Bajić Jr., Milan, Bajić, Milan, Kaniewski, Piotr, Pasternak, Mateusz, and Mattoccia, Stefano
Subjects: LAND mines, SPECTRAL imaging, SIMULATION methods & models, CONFIDENCE intervals
Abstract: This paper presents methods for the modeling and simulation of explosive target placement in terrain spectral images (i.e., real hyperspectral 90-channel VNIR data), considering unexploded ordnances, landmines, and improvised explosive devices. The models used for landmine detection operate at sub-pixel levels. The presented research uses very fine spatial resolutions, 0.945 × 0.945 mm for targets and 1.868 × 1.868 cm for the scene, where the number of target pixels ranges from 52 to 116. While previous research has used the mean spectral value of the target, it is omitted in this paper. The model considers the probability of detection and its confidence intervals, which are derived and used in the analysis of the considered explosive targets. The detection results are better when decreased target endmembers are used to match the scene resolution, rather than using endmembers at the full resolution of the target. Unmanned aerial vehicles, as carriers of snapshot hyperspectral cameras, enable flexible target resolution selection and good area coverage. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

23. The Analysis and Modelling of the Quality of Information Acquired from Weather Station Sensors.

Author: Stawowy, Marek, Olchowik, Wiktor, Rosiński, Adam, Dąbrowski, Tadeusz, and Mattoccia, Stefano
Subjects: METEOROLOGICAL stations, INFORMATION modeling, DETECTORS, MISSING data (Statistics), DATA analysis
Abstract: This article explores the quality of information acquired from weather station sensors. A review of literature in this field concludes that most publications concern the analysis of data acquired from weather station sensors and their characteristic properties, estimating the missing values from the data, and assessing the quality of weather information. Despite the large collection of studies devoted to these issues, there is no comprehensive approach that would consider the modelling of information uncertainty. Therefore, the article presents a proprietary method of analysing and modelling the uncertainty of the weather station sensors' information quality. For this purpose, the structure of a real meteorological station and the measurement data obtained from it were analysed. Next, an information quality model was developed using the certainty factor (CF) of hypothesis calculation. The developed method was verified on an exemplary real meteorological station. It was found that this method enables the improvement of the quality of information obtained and processed in a multi-sensor system. This becomes practical when the influence of individual measurement system elements on the information quality reaching the recipient is determined. An example is furnished by a demonstration of the usage of two sensors to improve the information quality. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

24. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.

Author: Aleotti, Filippo, Zaccaroni, Giulio, Bartolomei, Luca, Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
Subjects: AUGMENTED reality, MONOCULARS, IMAGE, DEEP learning
Abstract: Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications. For the latter, depth estimation from a single image would represent the most versatile solution since a standard camera is available on almost any handheld device. Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve real-time performance, often not compatible with low-power embedded systems. Therefore, in this paper, we deeply investigate all these issues, showing how they are both addressable by adopting appropriate network design and training strategies. Moreover, we also outline how to map the resulting networks on handheld devices to achieve real-time performance. Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications. Indeed, to further support this evidence, we report experimental results concerning real-time, depth-aware augmented reality and image blurring with smartphones in the wild. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

25. Enabling Image-Based Streamflow Monitoring at the Edge.

Author: Tosi, Fabio, Rocca, Matteo, Aleotti, Filippo, Poggi, Matteo, Mattoccia, Stefano, Tauro, Flavia, Toth, Elena, and Grimaldi, Salvatore
Subjects: STREAMFLOW velocity, INTERNET access, IMAGE processing, BODIES of water, INDUSTRIAL engineering, STREAMFLOW
Abstract: Monitoring streamflow velocity is of paramount importance for water resources management and in engineering practice. To this aim, image-based approaches have proved to be reliable systems to non-intrusively monitor water bodies in remote places at variable flow regimes. Nonetheless, to tackle their computational and energy requirements, offload processing and high-speed internet connections in the monitored environments, which are often difficult to access, is mandatory hence limiting the effective deployment of such techniques in several relevant circumstances. In this paper, we advance and simplify streamflow velocity monitoring by directly processing the image stream in situ with a low-power embedded system. By leveraging its standard parallel processing capability and exploiting functional simplifications, we achieve an accuracy comparable to state-of-the-art algorithms that typically require expensive computing devices and infrastructures. The advantage of monitoring streamflow velocity in situ with a lightweight and cost-effective embedded processing device is threefold. First, it circumvents the need for wideband internet connections, which are expensive and impractical in remote environments. Second, it massively reduces the overall energy consumption, bandwidth and deployment cost. Third, when monitoring more than one river section, processing "at the very edge" of the system efficiency improves scalability by a large margin, compared to offload solutions based on remote or cloud processing. Therefore, enabling streamflow velocity monitoring in situ with low-cost embedded devices would foster the widespread diffusion of gauge cameras even in developing countries where appropriate infrastructure might be not available or too expensive. [ABSTRACT FROM AUTHOR]
Published: 2020
Full Text: View/download PDF

26. Optical Tracking Velocimetry (OTV): Leveraging Optical Flow and Trajectory-Based Filtering for Surface Streamflow Observations.

Author: Tauro, Flavia, Tosi, Fabio, Mattoccia, Stefano, Toth, Elena, Piscopia, Rodolfo, and Grimaldi, Salvatore
Subjects: VELOCIMETRY, HYDROLOGY, COMPUTER algorithms, REMOTE sensing, KALMAN filtering
Abstract: Nonintrusive image-based methods have the potential to advance hydrological streamflow observations by providing spatially distributed data at high temporal resolution. Due to their simplicity, correlation-based approaches have until recent been preferred to alternative image-based approaches, such as optical flow, for camera-based surface flow velocity estimate. In this work, we introduce a novel optical flow scheme, optical tracking velocimetry (OTV), that entails automated feature detection, tracking through the differential sparse Lucas-Kanade algorithm, and then a posteriori filtering to retain only realistic trajectories that pertain to the transit of actual objects in the field of view. The method requires minimal input on the flow direction and camera orientation. Tested on two image data sets collected in diverse natural conditions, the approach proved suitable for rapid and accurate surface flow velocity estimations. Five different feature detectors were compared and the features from accelerated segment test (FAST) resulted in the best balance between the number of features identified and successfully tracked as well as computational efficiency. OTV was relatively insensitive to reduced image resolution but was impacted by acquisition frequencies lower than 7–8 Hz. Compared to traditional correlation-based techniques, OTV was less affected by noise and surface seeding. In addition, the scheme is foreseen to be applicable to real-time gauge-cam implementations. [ABSTRACT FROM AUTHOR]
Published: 2018
Full Text: View/download PDF

27. Introduction to the Issue on Emerging Techniques in 3-D.

Author: Alatan, A. Aydın, Ostermann, Joern, Onural, Levent, AlRegib, Ghassan, Mattoccia, Stefano, and Yuan, Chunrong
Abstract: The fifteen papers in this special section that focus on three dimensional content (3D), with particular emphasis on the fusion of conventional camera outputs with those captured by other modalities, such as active sensors, multi-spectral data or dynamic range images as well as applications that support the measurement and improvement of 3-D content. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

28. Performance Evaluation of Full Search Equivalent Pattern Matching Algorithms.

Author: Ouyang, Wanli, Tombari, Federico, Mattoccia, Stefano, Di Stefano, Luigi, and Cham, W.K.
Subjects: PATTERN recognition systems, PERFORMANCE evaluation, TEMPLATE matching (Digital image processing), ALGORITHMS, SIGNAL processing, MATCHING theory, PIXELS
Abstract: Pattern matching is widely used in signal processing, computer vision, and image and video processing. Full search equivalent algorithms accelerate the pattern matching process and, in the meantime, yield exactly the same result as the full search. This paper proposes an analysis and comparison of state-of-the-art algorithms for full search equivalent pattern matching. Our intention is that the data sets and tests used in our evaluation will be a benchmark for testing future pattern matching algorithms, and that the analysis concerning state-of-the-art algorithms could inspire new fast algorithms. We also propose extensions of the evaluated algorithms and show that they outperform the original formulations. [ABSTRACT FROM AUTHOR]
Published: 2012
Full Text: View/download PDF

29. A fast area-based stereo matching algorithm

Author: Stefano, Luigi Di, Marchionni, Massimiliano, and Mattoccia, Stefano
Subjects: *ALGORITHMS, *MATHEMATICAL optimization, *MATHEMATICAL analysis, *STATISTICAL matching
Abstract: This paper proposes an area-based stereo algorithm suitable to real time applications. The core of the algorithm relies on the uniqueness constraint and on a matching process that rejects previous matches as soon as more reliable ones are found. The proposed approach is also compared with bidirectional matching (BM), since the latter is the basic method for detecting unreliable matches in most area-based stereo algorithms. We describe the algorithm''s matching core, the additional constraints introduced to improve the reliability and the computational optimizations carried out to achieve a very fast implementation. We provide a large set of experimental results, obtained on a standard set of images with ground-truth as well as on stereo sequences, and computation time measurements. These data are used to evaluate the proposed algorithm and compare it with a well-known algorithm based on BM. [Copyright &y& Elsevier]
Published: 2004
Full Text: View/download PDF

30. Neural Disparity Refinement.

Author: Tosi F, Aleotti F, Ramirez PZ, Poggi M, Salti S, Mattoccia S, and Stefano LD
Abstract: We propose a framework that combines traditional, hand-crafted algorithms and recent advances in deep learning to obtain high-quality, high-resolution disparity maps from stereo images. By casting the refinement process as a continuous feature sampling strategy, our neural disparity refinement network can estimate an enhanced disparity map at any output resolution. Our solution can process any disparity map produced by classical stereo algorithms, as well as those predicted by modern stereo networks or even different depth-from-images approaches, such as the COLMAP structure-from-motion pipeline. Nonetheless, when deployed in the former configuration, our framework performs at its best in terms of zero-shot generalization from synthetic to real images. Moreover, its continuous formulation allows for easily handling the unbalanced stereo setup very diffused in mobile phones.
Published: 2024
Full Text: View/download PDF

31. Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces.

Author: Ramirez PZ, Costanzino A, Tosi F, Poggi M, Salti S, Mattoccia S, and Stefano LD
Abstract: Estimating depth from images nowadays yields outstanding results, both in terms of in-domain accuracy and generalization. However, we identify two main challenges that remain open in this field: dealing with non-Lambertian materials and effectively processing high-resolution images. Purposely, we propose a novel dataset that includes accurate and dense ground-truth labels at high resolution, featuring scenes containing several specular and transparent surfaces. Our acquisition pipeline leverages a novel deep space-time stereo framework, enabling easy and accurate labeling with sub-pixel precision. The dataset is composed of 606 samples collected in 85 different scenes, each sample includes both a high-resolution pair (12 Mpx) as well as an unbalanced stereo pair (Left: 12 Mpx, Right: 1.1 Mpx), typical of modern mobile devices that mount sensors with different resolutions. Additionally, we provide manually annotated material segmentation masks and 15 K unlabeled samples. The dataset is composed of a train set and two test sets, the latter devoted to the evaluation of stereo and monocular depth estimation networks. Our experiments highlight the open challenges and future research directions in this field.
Published: 2024
Full Text: View/download PDF

32. Depth Restoration in Under-Display Time-of-Flight Imaging.

Author: Qiao X, Ge C, Deng P, Wei H, Poggi M, and Mattoccia S
Abstract: Under-display imaging has recently received considerable attention in both academia and industry. As a variation of this technique, under-display ToF (UD-ToF) cameras enable depth sensing for full-screen devices. However, it also brings problems of image blurring, signal-to-noise ratio and ranging accuracy reduction. To address these issues, we propose a cascaded deep network to improve the quality of UD-ToF depth maps. The network comprises two subnets, with the first using a complex-valued network in raw domain to perform denoising, deblurring and raw measurements enhancement jointly, while the second refining depth maps in depth domain based on the proposed multi-scale depth enhancement block (MSDEB). To enable training, we establish a data acquisition device and construct a real UD-ToF dataset by collecting real paired ToF raw data. Besides, we also build a large-scale synthetic UD-ToF dataset through noise analysis. The quantitative and qualitative evaluation results on public datasets and ours demonstrate that the presented network outperforms state-of-the-art algorithms and can further promote full-screen devices in practical applications.
Published: 2023
Full Text: View/download PDF

33. Continual Adaptation for Deep Stereo.

Author: Poggi M, Tonioni A, Tosi F, Mattoccia S, and Di Stefano L
Abstract: Depth estimation from stereo images is carried out with unmatched results by convolutional neural networks trained end-to-end to regress dense disparities. Like for most tasks, this is possible if large amounts of labelled samples are available for training, possibly covering the whole data distribution encountered at deployment time. Being such an assumption systematically unmet in real applications, the capacity of adapting to any unseen setting becomes of paramount importance. Purposely, we propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments. We design a lightweight and modular architecture, Modularly ADaptive Network (MADNet), and formulate Modular ADaptation algorithms (MAD, MAD++) which permit efficient optimization of independent sub-portions of the entire network. In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms. With both sources, no other data than the input images being gathered at deployment time are needed. Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression.
Published: 2021
Full Text: View/download PDF

34. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.

Author: Aleotti F, Zaccaroni G, Bartolomei L, Poggi M, Tosi F, and Mattoccia S
Abstract: Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications. For the latter, depth estimation from a single image would represent the most versatile solution since a standard camera is available on almost any handheld device. Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve real-time performance, often not compatible with low-power embedded systems. Therefore, in this paper, we deeply investigate all these issues, showing how they are both addressable by adopting appropriate network design and training strategies. Moreover, we also outline how to map the resulting networks on handheld devices to achieve real-time performance. Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications. Indeed, to further support this evidence, we report experimental results concerning real-time, depth-aware augmented reality and image blurring with smartphones in the wild.
Published: 2020
Full Text: View/download PDF

35. Unsupervised Domain Adaptation for Depth Prediction from Images.

Author: Tonioni A, Poggi M, Mattoccia S, and Stefano LD
Abstract: State-of-the-art approaches to infer dense depth measurements from images rely on CNNs trained end-to-end on a vast amount of data. However, these approaches suffer a drastic drop in accuracy when dealing with environments much different in appearance and/or context from those observed at training time. This domain shift issue is usually addressed by fine-tuning on smaller sets of images from the target domain annotated with depth labels. Unfortunately, relying on such supervised labeling is seldom feasible in most practical settings. Therefore, we propose an unsupervised domain adaptation technique which does not require groundtruth labels. Our method relies only on image pairs and leverages on classical stereo algorithms to produce disparity measurements alongside with confidence estimators to assess upon their reliability. We propose to fine-tune both depth-from-stereo as well as depth-from-mono architectures by a novel confidence-guided loss function that handles the measured disparities as noisy labels weighted according to the estimated confidence. Extensive experimental results based on standard datasets and evaluation protocols prove that our technique can address effectively the domain shift issue with both stereo and monocular depth prediction architectures and outperforms other state-of-the-art unsupervised loss functions that may be alternatively deployed to pursue domain adaptation.
Published: 2020
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

35 results on '"Mattoccia, Stefano"'

1. RGB Guided ToF Imaging System: A Survey of Deep Learning-Based Methods

2. Guest Editorial: Special Issue on Traditional Computer Vision in the Age of Deep Learning

3. Depth super-resolution from explicit and implicit high-frequency features

4. Self-supervised depth super-resolution with contrastive multiview pre-training

5. A computer vision approach based on deep learning for the detection of dairy cows in free stall barn

6. Learning a confidence measure in the disparity domain from O(1) features

7. Guest Editorial: Special Issue on Embedded Computer Vision

8. Real-Time Self-Supervised Monocular Depth Estimation Without GPU.

9. Continual Adaptation for Deep Stereo.

10. On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.

11. On the Confidence of Stereo Matching in a Deep-Learning Era: A Quantitative Evaluation.

12. Full-Search-Equivalent pattern matching with incremental dissimilarity approximations

13. Monocular Depth Perception on Microcontrollers for Edge Applications.

14. Fast full-search equivalent template matching by enhanced bounded correlation

15. Fast template matching using bounded partial correlation

16. On the Deployment of Out-of-the-Box Embedded Devices for Self-Powered River Surface Flow Velocity Monitoring at the Edge.

17. Real-Time Stereo within the VIDET Project

18. Good Cues to Learn From Scratch a Confidence Measure for Passive Depth Sensors.

19. Confidence Estimation for ToF and Stereo Sensors and Its Application to Depth Data Fusion.

20. Efficient template matching for multi-channel images

21. ZNCC-based template matching using bounded partial correlation

22. Modeling and Simulation of Very High Spatial Resolution UXOs and Landmines in a Hyperspectral Scene for UAV Survey.

23. The Analysis and Modelling of the Quality of Information Acquired from Weather Station Sensors.

24. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.

25. Enabling Image-Based Streamflow Monitoring at the Edge.

26. Optical Tracking Velocimetry (OTV): Leveraging Optical Flow and Trajectory-Based Filtering for Surface Streamflow Observations.

27. Introduction to the Issue on Emerging Techniques in 3-D.

28. Performance Evaluation of Full Search Equivalent Pattern Matching Algorithms.

29. A fast area-based stereo matching algorithm

30. Neural Disparity Refinement.

31. Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces.

32. Depth Restoration in Under-Display Time-of-Flight Imaging.

33. Continual Adaptation for Deep Stereo.

34. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.

35. Unsupervised Domain Adaptation for Depth Prediction from Images.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Publication Type

Journal

Database

Publisher

35 results on '"Mattoccia, Stefano"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources