35 results on '"Mattoccia, Stefano"'
Search Results
2. Guest Editorial: Special Issue on Traditional Computer Vision in the Age of Deep Learning
- Author
-
Poggi, Matteo, Arrigoni, Federica, Fusiello, Andrea, Mattoccia, Stefano, Bartoli, Adrien, Sattler, Torsten, and Pajdla, Tomas
- Published
- 2024
- Full Text
- View/download PDF
3. Depth super-resolution from explicit and implicit high-frequency features
- Author
-
Qiao, Xin, Ge, Chenyang, Zhang, Youmin, Zhou, Yanhui, Tosi, Fabio, Poggi, Matteo, and Mattoccia, Stefano
- Published
- 2023
- Full Text
- View/download PDF
4. Self-supervised depth super-resolution with contrastive multiview pre-training
- Author
-
Qiao, Xin, Ge, Chenyang, Zhao, Chaoqiang, Tosi, Fabio, Poggi, Matteo, and Mattoccia, Stefano
- Published
- 2023
- Full Text
- View/download PDF
5. A computer vision approach based on deep learning for the detection of dairy cows in free stall barn
- Author
-
Tassinari, Patrizia, Bovo, Marco, Benni, Stefano, Franzoni, Simone, Poggi, Matteo, Mammi, Ludovica Maria Eugenia, Mattoccia, Stefano, Di Stefano, Luigi, Bonora, Filippo, Barbaresi, Alberto, Santolini, Enrica, and Torreggiani, Daniele
- Published
- 2021
- Full Text
- View/download PDF
6. Learning a confidence measure in the disparity domain from O(1) features
- Author
-
Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
- Published
- 2020
- Full Text
- View/download PDF
7. Guest Editorial: Special Issue on Embedded Computer Vision
- Author
-
Mattoccia, Stefano, Kisačanin, Branislav, Gelautz, Margrit, Chai, Sek, Belbachir, Ahmed Nabil, Dedeoglu, Goksel, and Stein, Fridtjof
- Published
- 2018
- Full Text
- View/download PDF
8. Real-Time Self-Supervised Monocular Depth Estimation Without GPU.
- Author
-
Poggi, Matteo, Tosi, Fabio, Aleotti, Filippo, and Mattoccia, Stefano
- Abstract
Single-image depth estimation represents a longstanding challenge in computer vision and although it is an ill-posed problem, deep learning enabled astonishing results leveraging both supervised and self-supervised training paradigms. State-of-the-art solutions achieve remarkably accurate depth estimation from a single image deploying huge deep architectures, requiring powerful dedicated hardware to run in a reasonable amount of time. This overly demanding complexity makes them unsuited for a broad category of applications requiring devices with constrained resources or memory consumption. To tackle this issue, in this paper a family of compact, yet effective CNNs for monocular depth estimation is proposed, by leveraging self-supervision from a binocular stereo rig. Our lightweight architectures, namely PyD-Net and PyD-Net2, compared to complex state-of-the-art trade a small drop in accuracy to drastically reduce the runtime and memory requirements by a factor ranging from $2\times $ to $100\times $. Moreover, our networks can run real-time monocular depth estimation on a broad set of embedded or consumer devices, even not equipped with a GPU, by early stopping the inference with negligible (or no) loss in accuracy, making it ideally suited for real applications with strict constraints on hardware resources or power consumption. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
9. Continual Adaptation for Deep Stereo.
- Author
-
Poggi, Matteo, Tonioni, Alessio, Tosi, Fabio, Mattoccia, Stefano, and Stefano, Luigi Di
- Subjects
CONVOLUTIONAL neural networks ,STEREO vision (Computer science) ,STEREOPHONIC sound systems ,STEREO image ,PHYSIOLOGICAL adaptation ,BINOCULAR vision ,DATA distribution - Abstract
Depth estimation from stereo images is carried out with unmatched results by convolutional neural networks trained end-to-end to regress dense disparities. Like for most tasks, this is possible if large amounts of labelled samples are available for training, possibly covering the whole data distribution encountered at deployment time. Being such an assumption systematically unmet in real applications, the capacity of adapting to any unseen setting becomes of paramount importance. Purposely, we propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments. We design a lightweight and modular architecture, Modularly ADaptive Network (MADNet), and formulate Modular ADaptation algorithms (MAD, MAD++) which permit efficient optimization of independent sub-portions of the entire network. In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms. With both sources, no other data than the input images being gathered at deployment time are needed. Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
10. On the Synergies Between Machine Learning and Binocular Stereo for Depth Estimation From Images: A Survey.
- Author
-
Poggi, Matteo, Tosi, Fabio, Batsos, Konstantinos, Mordohai, Philippos, and Mattoccia, Stefano
- Subjects
MACHINE learning ,DEEP learning ,COMPUTER vision ,OPTICAL radar ,MONOCULARS - Abstract
Stereo matching is one of the longest-standing problems in computer vision with close to 40 years of studies and research. Throughout the years the paradigm has shifted from local, pixel-level decision to various forms of discrete and continuous optimization to data-driven, learning-based methods. Recently, the rise of machine learning and the rapid proliferation of deep learning enhanced stereo matching with new exciting trends and applications unthinkable until a few years ago. Interestingly, the relationship between these two worlds is two-way. While machine, and especially deep, learning advanced the state-of-the-art in stereo matching, stereo itself enabled new ground-breaking methodologies such as self-supervised monocular depth estimation based on deep networks. In this paper, we review recent research in the field of learning-based depth estimation from single and binocular images highlighting the synergies, the successes achieved so far and the open challenges the community is going to face in the immediate future. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
11. On the Confidence of Stereo Matching in a Deep-Learning Era: A Quantitative Evaluation.
- Author
-
Poggi, Matteo, Kim, Seungryong, Tosi, Fabio, Kim, Sunok, Aleotti, Filippo, Min, Dongbo, Sohn, Kwanghoon, and Mattoccia, Stefano
- Subjects
STEREO vision (Computer science) ,CONFIDENCE ,DEEP learning ,ESTIMATION theory ,SCIENTIFIC community ,VOLUME measurements - Abstract
Stereo matching is one of the most popular techniques to estimate dense depth maps by finding the disparity between matching pixels on two, synchronized and rectified images. Alongside with the development of more accurate algorithms, the research community focused on finding good strategies to estimate the reliability, i.e., the confidence, of estimated disparity maps. This information proves to be a powerful cue to naively find wrong matches as well as to improve the overall effectiveness of a variety of stereo algorithms according to different strategies. In this paper, we review more than ten years of developments in the field of confidence estimation for stereo matching. We extensively discuss and evaluate existing confidence measures and their variants, from hand-crafted ones to the most recent, state-of-the-art learning based methods. We study the different behaviors of each measure when applied to a pool of different stereo algorithms and, for the first time in literature, when paired with a state-of-the-art deep stereo network. Our experiments, carried out on five different standard datasets, provide a comprehensive overview of the field, highlighting in particular both strengths and limitations of learning-based strategies. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
12. Full-Search-Equivalent pattern matching with incremental dissimilarity approximations
- Author
-
Tombari, Federico, Mattoccia, Stefano, and Di Stefano, Luigi
- Subjects
Object recognition (Computers) -- Methods ,Object recognition (Computers) -- Comparative analysis ,Pattern recognition -- Methods ,Pattern recognition -- Comparative analysis ,Approximation theory -- Usage - Abstract
This paper proposes a novel method for fast pattern matching based on dissimilarity functions derived from the [L.sub.p] norm, such as the Sum of Squared Differences (SSD) and the Sum of Absolute Differences (SAD). The proposed method is a full-search equivalent, i.e., it yields the same results as the Full Search (FS) algorithm. In order to pursue computational savings, the method deploys a succession of increasingly tighter lower bounds of the adopted [L.sub.p] norm-based dissimilarity function. Such bounding functions allow for establishing a hierarchy of pruning conditions aimed at rapidly skipping those candidates that cannot satisfy the matching criterion. The paper includes an experimental comparison between the proposed method and other FS-equivalent approaches known in the literature, which proves the remarkable computational efficiency of our proposal. Index Terms--Pattern matching, IDA, SSD, SAD, efficient, full-search equivalent.
- Published
- 2009
13. Monocular Depth Perception on Microcontrollers for Edge Applications.
- Author
-
Peluso, Valentino, Cipolletta, Antonio, Calimera, Andrea, Poggi, Matteo, Tosi, Fabio, Aleotti, Filippo, and Mattoccia, Stefano
- Subjects
DEPTH perception ,MONOCULARS ,COMPUTER vision ,MICROCONTROLLERS ,CONVOLUTIONAL neural networks ,DISTRIBUTED sensors ,SENSOR networks - Abstract
Depth estimation is crucial in several computer vision applications, and a recent trend in this field aims at inferring such a cue from a single camera. Unfortunately, despite the compelling results achieved, state-of-the-art monocular depth estimation methods are computationally demanding, thus precluding their practical deployment in several application contexts characterized by low-power constraints. Therefore, in this paper, we propose a lightweight Convolutional Neural Network based on a shallow pyramidal architecture, referred to as $\mu $ PyD-Net, enabling monocular depth estimation on microcontrollers. The network is trained in a peculiar self-supervised manner leveraging proxy labels obtained through a traditional stereo algorithm. Moreover, we propose optimization strategies aimed at performing computations with quantized 8-bit data and map the high-level description of the network to low-level layers optimized for the target microcontroller architecture. Exhaustive experimental results on standard datasets and an in-depth evaluation with a device belonging to the popular Arm Cortex-M family confirm that obtaining sufficiently accurate monocular depth estimation on microcontrollers is feasible. To the best of our knowledge, our proposal is the first one enabling such remarkable achievement, paving the way for the deployment of monocular depth cues onto the tiny end-nodes of distributed sensor networks. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
14. Fast full-search equivalent template matching by enhanced bounded correlation
- Author
-
Mattoccia, Stefano, Tombari, Federico, and Di Stefano, Luigi
- Subjects
Correlation (Statistics) -- Usage ,Fourier transformations -- Evaluation ,Image processing -- Research ,Business ,Computers ,Electronics ,Electronics and electrical industries - Abstract
The article presents a new technique called as enhanced bounded correlation (EBC), which greatly minimizes the calculations needed to perform template matching based on normalized cross correlation (NCC). The empirical outcomes demonstrate that the presented method can substantially speed up a full-search equivalent template matching process and surpass the high-tech techniques.
- Published
- 2008
15. Fast template matching using bounded partial correlation
- Author
-
Di Stefano, Luigi and Mattoccia, Stefano
- Published
- 2003
- Full Text
- View/download PDF
16. On the Deployment of Out-of-the-Box Embedded Devices for Self-Powered River Surface Flow Velocity Monitoring at the Edge.
- Author
-
Livoroi, Arsal-Hanif, Conti, Andrea, Foianesi, Luca, Tosi, Fabio, Aleotti, Filippo, Poggi, Matteo, Tauro, Flavia, Toth, Elena, Grimaldi, Salvatore, and Mattoccia, Stefano
- Subjects
FLOW velocity ,STREAMFLOW ,ALGORITHMS ,VELOCIMETRY ,EDGES (Geometry) ,TRACKING algorithms - Abstract
As reported in the recent image velocimetry literature, tracking the motion of sparse feature points floating on the river surface as done by the Optical Tracking Velocimetry (OTV) algorithm is a promising strategy to address surface flow monitoring. Moreover, the lightweight nature of OTV coupled with computational optimizations makes it suited even for its deployment in situ to perform measurements at the edge with cheap embedded devices without the need to perform offload processing. Despite these notable achievements, the actual practical deployment of OTV in remote environments would require cheap and self-powered systems enabling continuous measurements without the need for cumbersome and expensive infrastructures rarely found in situ. Purposely, in this paper, we propose an additional simplification to the OTV algorithm to reduce even further its computational requirements, and we analyze self-powered off-the-shelf setups for in situ deployment. We assess the performance of such set-ups from different perspectives to determine the optimal solution to design a cost-effective self-powered measurement node. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
17. Real-Time Stereo within the VIDET Project
- Author
-
Di Stefano, Luigi and Mattoccia, Stefano
- Published
- 2002
- Full Text
- View/download PDF
18. Good Cues to Learn From Scratch a Confidence Measure for Passive Depth Sensors.
- Author
-
Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
- Abstract
As reported in the stereo literature, confidence estimation represents a powerful cue to detect outliers as well as to improve depth accuracy. Purposely, we proposed a strategy enabling us to achieve state-of-the-art results by learning a confidence measure in the disparity domain only with a CNN. Since this method does not require the cost volume, it is very appealing because potentially suited for any depth-sensing technologies, including, for instance, those based on deep networks. By following this intuition, in this paper, we deeply investigate the performance of confidence estimation methods, known in the literature and new ones proposed in this paper, neglecting the use of the cost volume. Specifically, we estimate from scratch confidence measures feeding deep networks with raw depth estimates and optionally images and assess their performance deploying three datasets and three stereo algorithms. We also investigate, for the first time, their performance with disparity maps inferred by deep stereo end-to-end architectures. Moreover, we move beyond the stereo matching context, estimating confidence from depth maps generated by a monocular network. Our extensive experiments with different architectures highlight that inferring confidence prediction from the raw reference disparity only, as proposed in our previous work, is not only the most versatile solution but also the most effective one in most cases. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
19. Confidence Estimation for ToF and Stereo Sensors and Its Application to Depth Data Fusion.
- Author
-
Poggi, Matteo, Agresti, Gianluca, Tosi, Fabio, Zanuttigh, Pietro, and Mattoccia, Stefano
- Abstract
Time-of-Flight (ToF) sensors and stereo vision systems are two widely used technologies for depth estimation. Due to their rather complementary strengths and limitations, the two sensors are often combined to infer more accurate depth maps. A key research issue in this field is how to estimate the reliability of the sensed depth data. While this problem has been widely studied for stereo systems, it has been seldom considered for ToF sensors. Therefore, starting from the work done for stereo data, in this paper, we firstly introduce novel confidence estimation techniques for ToF data. Moreover, we also show how by using learning-based confidence metrics jointly trained on the two sensors yields better performance. Finally, deploying different fusion frameworks, we show how confidence estimation can be exploited in order to guide the fusion of depth data from the two sensors. Experimental results show how accurate confidence cues allow outperforming state-of-the-art data fusion schemes even with the simplest fusion strategies known in the literature. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
20. Efficient template matching for multi-channel images
- Author
-
Mattoccia, Stefano, Tombari, Federico, and Di Stefano, Luigi
- Published
- 2011
- Full Text
- View/download PDF
21. ZNCC-based template matching using bounded partial correlation
- Author
-
Di Stefano, Luigi, Mattoccia, Stefano, and Tombari, Federico
- Published
- 2005
- Full Text
- View/download PDF
22. Modeling and Simulation of Very High Spatial Resolution UXOs and Landmines in a Hyperspectral Scene for UAV Survey.
- Author
-
Bajić Jr., Milan, Bajić, Milan, Kaniewski, Piotr, Pasternak, Mateusz, and Mattoccia, Stefano
- Subjects
LAND mines ,SPECTRAL imaging ,SIMULATION methods & models ,CONFIDENCE intervals - Abstract
This paper presents methods for the modeling and simulation of explosive target placement in terrain spectral images (i.e., real hyperspectral 90-channel VNIR data), considering unexploded ordnances, landmines, and improvised explosive devices. The models used for landmine detection operate at sub-pixel levels. The presented research uses very fine spatial resolutions, 0.945 × 0.945 mm for targets and 1.868 × 1.868 cm for the scene, where the number of target pixels ranges from 52 to 116. While previous research has used the mean spectral value of the target, it is omitted in this paper. The model considers the probability of detection and its confidence intervals, which are derived and used in the analysis of the considered explosive targets. The detection results are better when decreased target endmembers are used to match the scene resolution, rather than using endmembers at the full resolution of the target. Unmanned aerial vehicles, as carriers of snapshot hyperspectral cameras, enable flexible target resolution selection and good area coverage. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
23. The Analysis and Modelling of the Quality of Information Acquired from Weather Station Sensors.
- Author
-
Stawowy, Marek, Olchowik, Wiktor, Rosiński, Adam, Dąbrowski, Tadeusz, and Mattoccia, Stefano
- Subjects
METEOROLOGICAL stations ,INFORMATION modeling ,DETECTORS ,MISSING data (Statistics) ,DATA analysis - Abstract
This article explores the quality of information acquired from weather station sensors. A review of literature in this field concludes that most publications concern the analysis of data acquired from weather station sensors and their characteristic properties, estimating the missing values from the data, and assessing the quality of weather information. Despite the large collection of studies devoted to these issues, there is no comprehensive approach that would consider the modelling of information uncertainty. Therefore, the article presents a proprietary method of analysing and modelling the uncertainty of the weather station sensors' information quality. For this purpose, the structure of a real meteorological station and the measurement data obtained from it were analysed. Next, an information quality model was developed using the certainty factor (CF) of hypothesis calculation. The developed method was verified on an exemplary real meteorological station. It was found that this method enables the improvement of the quality of information obtained and processed in a multi-sensor system. This becomes practical when the influence of individual measurement system elements on the information quality reaching the recipient is determined. An example is furnished by a demonstration of the usage of two sensors to improve the information quality. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
24. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.
- Author
-
Aleotti, Filippo, Zaccaroni, Giulio, Bartolomei, Luca, Poggi, Matteo, Tosi, Fabio, and Mattoccia, Stefano
- Subjects
AUGMENTED reality ,MONOCULARS ,IMAGE ,DEEP learning - Abstract
Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications. For the latter, depth estimation from a single image would represent the most versatile solution since a standard camera is available on almost any handheld device. Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve real-time performance, often not compatible with low-power embedded systems. Therefore, in this paper, we deeply investigate all these issues, showing how they are both addressable by adopting appropriate network design and training strategies. Moreover, we also outline how to map the resulting networks on handheld devices to achieve real-time performance. Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications. Indeed, to further support this evidence, we report experimental results concerning real-time, depth-aware augmented reality and image blurring with smartphones in the wild. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
25. Enabling Image-Based Streamflow Monitoring at the Edge.
- Author
-
Tosi, Fabio, Rocca, Matteo, Aleotti, Filippo, Poggi, Matteo, Mattoccia, Stefano, Tauro, Flavia, Toth, Elena, and Grimaldi, Salvatore
- Subjects
STREAMFLOW velocity ,INTERNET access ,IMAGE processing ,BODIES of water ,INDUSTRIAL engineering ,STREAMFLOW - Abstract
Monitoring streamflow velocity is of paramount importance for water resources management and in engineering practice. To this aim, image-based approaches have proved to be reliable systems to non-intrusively monitor water bodies in remote places at variable flow regimes. Nonetheless, to tackle their computational and energy requirements, offload processing and high-speed internet connections in the monitored environments, which are often difficult to access, is mandatory hence limiting the effective deployment of such techniques in several relevant circumstances. In this paper, we advance and simplify streamflow velocity monitoring by directly processing the image stream in situ with a low-power embedded system. By leveraging its standard parallel processing capability and exploiting functional simplifications, we achieve an accuracy comparable to state-of-the-art algorithms that typically require expensive computing devices and infrastructures. The advantage of monitoring streamflow velocity in situ with a lightweight and cost-effective embedded processing device is threefold. First, it circumvents the need for wideband internet connections, which are expensive and impractical in remote environments. Second, it massively reduces the overall energy consumption, bandwidth and deployment cost. Third, when monitoring more than one river section, processing "at the very edge" of the system efficiency improves scalability by a large margin, compared to offload solutions based on remote or cloud processing. Therefore, enabling streamflow velocity monitoring in situ with low-cost embedded devices would foster the widespread diffusion of gauge cameras even in developing countries where appropriate infrastructure might be not available or too expensive. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
26. Optical Tracking Velocimetry (OTV): Leveraging Optical Flow and Trajectory-Based Filtering for Surface Streamflow Observations.
- Author
-
Tauro, Flavia, Tosi, Fabio, Mattoccia, Stefano, Toth, Elena, Piscopia, Rodolfo, and Grimaldi, Salvatore
- Subjects
VELOCIMETRY ,HYDROLOGY ,COMPUTER algorithms ,REMOTE sensing ,KALMAN filtering - Abstract
Nonintrusive image-based methods have the potential to advance hydrological streamflow observations by providing spatially distributed data at high temporal resolution. Due to their simplicity, correlation-based approaches have until recent been preferred to alternative image-based approaches, such as optical flow, for camera-based surface flow velocity estimate. In this work, we introduce a novel optical flow scheme, optical tracking velocimetry (OTV), that entails automated feature detection, tracking through the differential sparse Lucas-Kanade algorithm, and then a posteriori filtering to retain only realistic trajectories that pertain to the transit of actual objects in the field of view. The method requires minimal input on the flow direction and camera orientation. Tested on two image data sets collected in diverse natural conditions, the approach proved suitable for rapid and accurate surface flow velocity estimations. Five different feature detectors were compared and the features from accelerated segment test (FAST) resulted in the best balance between the number of features identified and successfully tracked as well as computational efficiency. OTV was relatively insensitive to reduced image resolution but was impacted by acquisition frequencies lower than 7–8 Hz. Compared to traditional correlation-based techniques, OTV was less affected by noise and surface seeding. In addition, the scheme is foreseen to be applicable to real-time gauge-cam implementations. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
27. Introduction to the Issue on Emerging Techniques in 3-D.
- Author
-
Alatan, A. Aydın, Ostermann, Joern, Onural, Levent, AlRegib, Ghassan, Mattoccia, Stefano, and Yuan, Chunrong
- Abstract
The fifteen papers in this special section that focus on three dimensional content (3D), with particular emphasis on the fusion of conventional camera outputs with those captured by other modalities, such as active sensors, multi-spectral data or dynamic range images as well as applications that support the measurement and improvement of 3-D content. [ABSTRACT FROM PUBLISHER]
- Published
- 2012
- Full Text
- View/download PDF
28. Performance Evaluation of Full Search Equivalent Pattern Matching Algorithms.
- Author
-
Ouyang, Wanli, Tombari, Federico, Mattoccia, Stefano, Di Stefano, Luigi, and Cham, W.K.
- Subjects
PATTERN recognition systems ,PERFORMANCE evaluation ,TEMPLATE matching (Digital image processing) ,ALGORITHMS ,SIGNAL processing ,MATCHING theory ,PIXELS - Abstract
Pattern matching is widely used in signal processing, computer vision, and image and video processing. Full search equivalent algorithms accelerate the pattern matching process and, in the meantime, yield exactly the same result as the full search. This paper proposes an analysis and comparison of state-of-the-art algorithms for full search equivalent pattern matching. Our intention is that the data sets and tests used in our evaluation will be a benchmark for testing future pattern matching algorithms, and that the analysis concerning state-of-the-art algorithms could inspire new fast algorithms. We also propose extensions of the evaluated algorithms and show that they outperform the original formulations. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
29. A fast area-based stereo matching algorithm
- Author
-
Stefano, Luigi Di, Marchionni, Massimiliano, and Mattoccia, Stefano
- Subjects
- *
ALGORITHMS , *MATHEMATICAL optimization , *MATHEMATICAL analysis , *STATISTICAL matching - Abstract
This paper proposes an area-based stereo algorithm suitable to real time applications. The core of the algorithm relies on the uniqueness constraint and on a matching process that rejects previous matches as soon as more reliable ones are found. The proposed approach is also compared with bidirectional matching (BM), since the latter is the basic method for detecting unreliable matches in most area-based stereo algorithms. We describe the algorithm''s matching core, the additional constraints introduced to improve the reliability and the computational optimizations carried out to achieve a very fast implementation. We provide a large set of experimental results, obtained on a standard set of images with ground-truth as well as on stereo sequences, and computation time measurements. These data are used to evaluate the proposed algorithm and compare it with a well-known algorithm based on BM. [Copyright &y& Elsevier]
- Published
- 2004
- Full Text
- View/download PDF
30. Neural Disparity Refinement.
- Author
-
Tosi F, Aleotti F, Ramirez PZ, Poggi M, Salti S, Mattoccia S, and Stefano LD
- Abstract
We propose a framework that combines traditional, hand-crafted algorithms and recent advances in deep learning to obtain high-quality, high-resolution disparity maps from stereo images. By casting the refinement process as a continuous feature sampling strategy, our neural disparity refinement network can estimate an enhanced disparity map at any output resolution. Our solution can process any disparity map produced by classical stereo algorithms, as well as those predicted by modern stereo networks or even different depth-from-images approaches, such as the COLMAP structure-from-motion pipeline. Nonetheless, when deployed in the former configuration, our framework performs at its best in terms of zero-shot generalization from synthetic to real images. Moreover, its continuous formulation allows for easily handling the unbalanced stereo setup very diffused in mobile phones.
- Published
- 2024
- Full Text
- View/download PDF
31. Booster: A Benchmark for Depth From Images of Specular and Transparent Surfaces.
- Author
-
Ramirez PZ, Costanzino A, Tosi F, Poggi M, Salti S, Mattoccia S, and Stefano LD
- Abstract
Estimating depth from images nowadays yields outstanding results, both in terms of in-domain accuracy and generalization. However, we identify two main challenges that remain open in this field: dealing with non-Lambertian materials and effectively processing high-resolution images. Purposely, we propose a novel dataset that includes accurate and dense ground-truth labels at high resolution, featuring scenes containing several specular and transparent surfaces. Our acquisition pipeline leverages a novel deep space-time stereo framework, enabling easy and accurate labeling with sub-pixel precision. The dataset is composed of 606 samples collected in 85 different scenes, each sample includes both a high-resolution pair (12 Mpx) as well as an unbalanced stereo pair (Left: 12 Mpx, Right: 1.1 Mpx), typical of modern mobile devices that mount sensors with different resolutions. Additionally, we provide manually annotated material segmentation masks and 15 K unlabeled samples. The dataset is composed of a train set and two test sets, the latter devoted to the evaluation of stereo and monocular depth estimation networks. Our experiments highlight the open challenges and future research directions in this field.
- Published
- 2024
- Full Text
- View/download PDF
32. Depth Restoration in Under-Display Time-of-Flight Imaging.
- Author
-
Qiao X, Ge C, Deng P, Wei H, Poggi M, and Mattoccia S
- Abstract
Under-display imaging has recently received considerable attention in both academia and industry. As a variation of this technique, under-display ToF (UD-ToF) cameras enable depth sensing for full-screen devices. However, it also brings problems of image blurring, signal-to-noise ratio and ranging accuracy reduction. To address these issues, we propose a cascaded deep network to improve the quality of UD-ToF depth maps. The network comprises two subnets, with the first using a complex-valued network in raw domain to perform denoising, deblurring and raw measurements enhancement jointly, while the second refining depth maps in depth domain based on the proposed multi-scale depth enhancement block (MSDEB). To enable training, we establish a data acquisition device and construct a real UD-ToF dataset by collecting real paired ToF raw data. Besides, we also build a large-scale synthetic UD-ToF dataset through noise analysis. The quantitative and qualitative evaluation results on public datasets and ours demonstrate that the presented network outperforms state-of-the-art algorithms and can further promote full-screen devices in practical applications.
- Published
- 2023
- Full Text
- View/download PDF
33. Continual Adaptation for Deep Stereo.
- Author
-
Poggi M, Tonioni A, Tosi F, Mattoccia S, and Di Stefano L
- Abstract
Depth estimation from stereo images is carried out with unmatched results by convolutional neural networks trained end-to-end to regress dense disparities. Like for most tasks, this is possible if large amounts of labelled samples are available for training, possibly covering the whole data distribution encountered at deployment time. Being such an assumption systematically unmet in real applications, the capacity of adapting to any unseen setting becomes of paramount importance. Purposely, we propose a continual adaptation paradigm for deep stereo networks designed to deal with challenging and ever-changing environments. We design a lightweight and modular architecture, Modularly ADaptive Network (MADNet), and formulate Modular ADaptation algorithms (MAD, MAD++) which permit efficient optimization of independent sub-portions of the entire network. In our paradigm, the learning signals needed to continuously adapt models online can be sourced from self-supervision via right-to-left image warping or from traditional stereo algorithms. With both sources, no other data than the input images being gathered at deployment time are needed. Thus, our network architecture and adaptation algorithms realize the first real-time self-adaptive deep stereo system and pave the way for a new paradigm that can facilitate practical deployment of end-to-end architectures for dense disparity regression.
- Published
- 2021
- Full Text
- View/download PDF
34. Real-Time Single Image Depth Perception in the Wild with Handheld Devices.
- Author
-
Aleotti F, Zaccaroni G, Bartolomei L, Poggi M, Tosi F, and Mattoccia S
- Abstract
Depth perception is paramount for tackling real-world problems, ranging from autonomous driving to consumer applications. For the latter, depth estimation from a single image would represent the most versatile solution since a standard camera is available on almost any handheld device. Nonetheless, two main issues limit the practical deployment of monocular depth estimation methods on such devices: (i) the low reliability when deployed in the wild and (ii) the resources needed to achieve real-time performance, often not compatible with low-power embedded systems. Therefore, in this paper, we deeply investigate all these issues, showing how they are both addressable by adopting appropriate network design and training strategies. Moreover, we also outline how to map the resulting networks on handheld devices to achieve real-time performance. Our thorough evaluation highlights the ability of such fast networks to generalize well to new environments, a crucial feature required to tackle the extremely varied contexts faced in real applications. Indeed, to further support this evidence, we report experimental results concerning real-time, depth-aware augmented reality and image blurring with smartphones in the wild.
- Published
- 2020
- Full Text
- View/download PDF
35. Unsupervised Domain Adaptation for Depth Prediction from Images.
- Author
-
Tonioni A, Poggi M, Mattoccia S, and Stefano LD
- Abstract
State-of-the-art approaches to infer dense depth measurements from images rely on CNNs trained end-to-end on a vast amount of data. However, these approaches suffer a drastic drop in accuracy when dealing with environments much different in appearance and/or context from those observed at training time. This domain shift issue is usually addressed by fine-tuning on smaller sets of images from the target domain annotated with depth labels. Unfortunately, relying on such supervised labeling is seldom feasible in most practical settings. Therefore, we propose an unsupervised domain adaptation technique which does not require groundtruth labels. Our method relies only on image pairs and leverages on classical stereo algorithms to produce disparity measurements alongside with confidence estimators to assess upon their reliability. We propose to fine-tune both depth-from-stereo as well as depth-from-mono architectures by a novel confidence-guided loss function that handles the measured disparities as noisy labels weighted according to the estimated confidence. Extensive experimental results based on standard datasets and evaluation protocols prove that our technique can address effectively the domain shift issue with both stereo and monocular depth prediction architectures and outperforms other state-of-the-art unsupervised loss functions that may be alternatively deployed to pursue domain adaptation.
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.