37 results on '"Camps-Valls, Gustau"'
Search Results
2. Role of locality, fidelity and symmetry regularization in learning explainable representations
- Author
-
Ronco, Michele and Camps-Valls, Gustau
- Published
- 2023
- Full Text
- View/download PDF
3. Discovering causal relations and equations from data
- Author
-
Camps-Valls, Gustau, Gerhardus, Andreas, Ninad, Urmi, Varando, Gherardo, Martius, Georg, Balaguer-Ballester, Emili, Vinuesa, Ricardo, Diaz, Emiliano, Zanna, Laure, and Runge, Jakob
- Published
- 2023
- Full Text
- View/download PDF
4. Learning main drivers of crop progress and failure in Europe with interpretable machine learning
- Author
-
Mateo-Sanchis, Anna, Piles, Maria, Amorós-López, Julia, Muñoz-Marí, Jordi, Adsuara, Jose E., Moreno-Martínez, Álvaro, and Camps-Valls, Gustau
- Published
- 2021
- Full Text
- View/download PDF
5. Retrieval of aboveground crop nitrogen content with a hybrid machine learning method
- Author
-
Berger, Katja, Verrelst, Jochem, Féret, Jean-Baptiste, Hank, Tobias, Wocher, Matthias, Mauser, Wolfram, and Camps-Valls, Gustau
- Published
- 2020
- Full Text
- View/download PDF
6. Gaussian processes retrieval of LAI from Sentinel-2 top-of-atmosphere radiance data
- Author
-
Estévez, José, Vicent, Jorge, Rivera-Caicedo, Juan Pablo, Morcillo-Pallarés, Pablo, Vuolo, Francesco, Sabater, Neus, Camps-Valls, Gustau, Moreno, José, and Verrelst, Jochem
- Published
- 2020
- Full Text
- View/download PDF
7. Deep Gaussian processes for biogeophysical parameter retrieval and model inversion
- Author
-
Svendsen, Daniel Heestermans, Morales-Álvarez, Pablo, Ruescas, Ana Belen, Molina, Rafael, and Camps-Valls, Gustau
- Published
- 2020
- Full Text
- View/download PDF
8. A global canopy water content product from AVHRR/Metop
- Author
-
García-Haro, Francisco Javier, Campos-Taberner, Manuel, Moreno, Álvaro, Tagesson, Håkan Torbern, Camacho, Fernando, Martínez, Beatriz, Sánchez, Sergio, Piles, María, Camps-Valls, Gustau, Yebra, Marta, and Gilabert, María Amparo
- Published
- 2020
- Full Text
- View/download PDF
9. Statistical retrieval of atmospheric profiles with deep convolutional neural networks
- Author
-
Malmgren-Hansen, David, Laparra, Valero, Aasbjerg Nielsen, Allan, and Camps-Valls, Gustau
- Published
- 2019
- Full Text
- View/download PDF
10. Sensitivity maps of the Hilbert–Schmidt independence criterion
- Author
-
Pérez-Suay, Adrián and Camps-Valls, Gustau
- Published
- 2018
- Full Text
- View/download PDF
11. Physics-aware Gaussian processes in remote sensing
- Author
-
Camps-Valls, Gustau, Martino, Luca, Svendsen, Daniel H., Campos-Taberner, Manuel, Muñoz-Marí, Jordi, Laparra, Valero, Luengo, David, and García-Haro, Francisco Javier
- Published
- 2018
- Full Text
- View/download PDF
12. Derivation of global vegetation biophysical parameters from EUMETSAT Polar System
- Author
-
García-Haro, Francisco Javier, Campos-Taberner, Manuel, Muñoz-Marí, Jordi, Laparra, Valero, Camacho, Fernando, Sánchez-Zapero, Jorge, and Camps-Valls, Gustau
- Published
- 2018
- Full Text
- View/download PDF
13. Assessing the relationship between microwave vegetation optical depth and gross primary production
- Author
-
Teubner, Irene E., Forkel, Matthias, Jung, Martin, Liu, Yi Y., Miralles, Diego G., Parinussa, Robert, van der Schalie, Robin, Vreugdenhil, Mariette, Schwalm, Christopher R., Tramontana, Gianluca, Camps-Valls, Gustau, and Dorigo, Wouter A.
- Published
- 2018
- Full Text
- View/download PDF
14. Hyperspectral dimensionality reduction for biophysical variable statistical retrieval
- Author
-
Rivera-Caicedo, Juan Pablo, Verrelst, Jochem, Muñoz-Marí, Jordi, Camps-Valls, Gustau, and Moreno, José
- Published
- 2017
- Full Text
- View/download PDF
15. Spectral band selection for vegetation properties retrieval using Gaussian processes regression
- Author
-
Verrelst, Jochem, Rivera, Juan Pablo, Gitelson, Anatoly, Delegido, Jesus, Moreno, José, and Camps-Valls, Gustau
- Published
- 2016
- Full Text
- View/download PDF
16. Multi-temporal and multi-source remote sensing image classification by nonlinear relative normalization
- Author
-
Tuia, Devis, Marcos, Diego, and Camps-Valls, Gustau
- Published
- 2016
- Full Text
- View/download PDF
17. Optical remote sensing and the retrieval of terrestrial vegetation bio-geophysical properties – A review
- Author
-
Verrelst, Jochem, Camps-Valls, Gustau, Muñoz-Marí, Jordi, Rivera, Juan Pablo, Veroustraete, Frank, Clevers, Jan G.P.W., and Moreno, José
- Published
- 2015
- Full Text
- View/download PDF
18. Experimental Sentinel-2 LAI estimation using parametric, non-parametric and physical retrieval methods – A comparison
- Author
-
Verrelst, Jochem, Rivera, Juan Pablo, Veroustraete, Frank, Muñoz-Marí, Jordi, Clevers, Jan G.P.W., Camps-Valls, Gustau, and Moreno, José
- Published
- 2015
- Full Text
- View/download PDF
19. Spectral alignment of multi-temporal cross-sensor images with automated kernel canonical correlation analysis
- Author
-
Volpi, Michele, Camps-Valls, Gustau, and Tuia, Devis
- Published
- 2015
- Full Text
- View/download PDF
20. Signal-to-noise ratio in reproducing kernel Hilbert spaces
- Author
-
Gómez-Chova, Luis, Santos-Rodríguez, Raúl, and Camps-Valls, Gustau
- Published
- 2018
- Full Text
- View/download PDF
21. Estimating crop primary productivity with Sentinel-2 and Landsat 8 using machine learning methods trained with radiative transfer simulations.
- Author
-
Wolanin, Aleksandra, Camps-Valls, Gustau, Gómez-Chova, Luis, Mateo-García, Gonzalo, van der Tol, Christiaan, Zhang, Yongguang, and Guanter, Luis
- Subjects
- *
RADIATIVE transfer , *AGRICULTURAL estimating & reporting , *MACHINE learning , *OPTICAL remote sensing , *ARTIFICIAL neural networks , *BIOPHYSICS - Abstract
Satellite remote sensing has been widely used in the last decades for agricultural applications, both for assessing vegetation condition and for subsequent yield prediction. Existing remote sensing-based methods to estimate gross primary productivity (GPP), which is an important variable to indicate crop photosynthetic function and stress, typically rely on empirical or semi-empirical approaches, which tend to over-simplify photosynthetic mechanisms. In this work, we take advantage of all parallel developments in mechanistic photosynthesis modeling and satellite data availability for an advanced monitoring of crop productivity. In particular, we combine process-based modeling with the soil-canopy energy balance radiative transfer model (SCOPE) with Sentinel-2 and Landsat 8 optical remote sensing data and machine learning methods in order to estimate crop GPP. With this approach, we by-pass the need for an intermediate step to retrieve the set of vegetation biophysical parameters needed to accurately model photosynthesis, while still accounting for the complex processes of the original physically-based model. Several implementations of the machine learning models are tested and validated using simulated and flux tower-based GPP data. Our final neural network model is able to estimate GPP at the tested flux tower sites with r 2 of 0.92 and RMSE of 1.38 gC d−1 m−2, which outperforms empirical models based on vegetation indices. The first test of applicability of this model to Landsat 8 data showed good results (r 2 of 0.82 and RMSE of 1.97 gC d−1 m−2), which suggests that our approach can be further applied to other sensors. Modeling and testing is restricted to C3 crops in this study, but can be extended to C4 crops by producing a new training dataset with SCOPE that accounts for the different photosynthetic pathways. Our model successfully estimates GPP across a variety of C3 crop types and environmental conditions even though it does not use any local information from the corresponding sites. This highlights its potential to map crop productivity from new satellite sensors at a global scale with the help of current Earth observation cloud computing platforms. • Sentinel-2 offers a great potential for the monitoring of agricultural productivity. • Machine learning applied to a process-based model to estimate GPP from satellite data. • Our hybrid approach accurately estimates GPP without any local information. • The use of red edge bands of Sentinel-2 improves GPP modeling. • Global application to multiple satellites using on the same model [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
22. A methodology to derive global maps of leaf traits using remote sensing and climate data.
- Author
-
Moreno-Martínez, Álvaro, Camps-Valls, Gustau, Kattge, Jens, Robinson, Nathaniel, Reichstein, Markus, van Bodegom, Peter, Kramer, Koen, Cornelissen, J. Hans C., Reich, Peter, Bahn, Michael, Niinemets, Ülo, Peñuelas, Josep, Craine, Joseph M., Cerabolini, Bruno E.L., Minden, Vanessa, Laughlin, Daniel C., Sack, Lawren, Allred, Brady, Baraloto, Christopher, and Byun, Chaeho
- Subjects
- *
REMOTE sensing , *LANDSAT satellites , *AERIAL photogrammetry , *REMOTE-sensing images , *AERIAL photography , *AERIAL surveillance , *LEAF physiology - Abstract
Abstract This paper introduces a modular processing chain to derive global high-resolution maps of leaf traits. In particular, we present global maps at 500 m resolution of specific leaf area, leaf dry matter content, leaf nitrogen and phosphorus content per dry mass, and leaf nitrogen/phosphorus ratio. The processing chain exploits machine learning techniques along with optical remote sensing data (MODIS/Landsat) and climate data for gap filling and up-scaling of in-situ measured leaf traits. The chain first uses random forests regression with surrogates to fill gaps in the database (> 45 % of missing entries) and maximizes the global representativeness of the trait dataset. Plant species are then aggregated to Plant Functional Types (PFTs). Next, the spatial abundance of PFTs at MODIS resolution (500 m) is calculated using Landsat data (30 m). Based on these PFT abundances, representative trait values are calculated for MODIS pixels with nearby trait data. Finally, different regression algorithms are applied to globally predict trait estimates from these MODIS pixels using remote sensing and climate data. The methods were compared in terms of precision, robustness and efficiency. The best model (random forests regression) shows good precision (normalized RMSE≤ 20 %) and goodness of fit (averaged Pearson's correlation R = 0.78) in any considered trait. Along with the estimated global maps of leaf traits, we provide associated uncertainty estimates derived from the regression models. The process chain is modular, and can easily accommodate new traits, data streams (traits databases and remote sensing data), and methods. The machine learning techniques applied allow attribution of information gain to data input and thus provide the opportunity to understand trait-environment relationships at the plant and ecosystem scales. The new data products – the gap-filled trait matrix, a global map of PFT abundance per MODIS gridcells and the high-resolution global leaf trait maps – are complementary to existing large-scale observations of the land surface and we therefore anticipate substantial contributions to advances in quantifying, understanding and prediction of the Earth system. Highlights • Presented a modular process chain for plant trait mapping including local effects • High-resolution global maps of leaf traits by fusing measured trait data, LANDSAT and MODIS • Scope for testing and parameterizing trait-enabled Earth System models • Implications for land management and Earth system science applications [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
23. Spatial homogeneity from temporal stability: Exploiting the combined hyper-frequent revisit of Terra and Aqua to guide Earth System Science.
- Author
-
Duveiller, Gregory, Camps-Valls, Gustau, Ceccherini, Guido, and Cescatti, Alessandro
- Subjects
- *
EARTH system science , *HOMOGENEITY , *TIME series analysis , *DATA recorders & recording , *CLOUD computing - Abstract
The terrestrial component of the Earth system has witnessed considerable changes in the past decades due to anthropogenic action. Throughout this period, the NASA Terra mission has been constantly monitoring the surface with the Moderate Resolution Imaging Spectroradiometer (MODIS) instrument. When combined with the MODIS instrument on-board of the Aqua platform, we obtain a hyper-frequent revisit capability providing sub-daily observations globally at a moderate resolution of ~250 m, but with a strong multi-angular variation of the observation footprint. Here we propose to exploit this particular configuration provided by the combined Terra + Aqua constellation to infer spatial homogeneity from the temporal stability of the observation time series. We thus designed a Temporal Coherence Index (TCI) based on the concept of information entropy. We test it on both synthetic and real landscapes showing its capacity to isolate which time series correspond to homogeneous targets based only on the MODIS measurements themselves. The index is calculated annually for the entirety of emerged surfaces for the period 2003–2020 by leveraging on the scaling capacity of the cloud computing Google Earth Engine (GEE). We foresee multiple practical applications for this index as a novel information layer that should foster the integration of surface data with satellite records, a practice that is increasingly necessary in Earth System Science. • We propose a new Temporal Coherence Index (TCI) from MODIS • TCI is an effective proxy from spatial homogeneity directly from MODIS observations • TCI should foster finer integration of surface data with satellite records [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
24. Domain knowledge-driven variational recurrent networks for drought monitoring.
- Author
-
Zhang, Mengxue, Fernández-Torres, Miguel-Ángel, and Camps-Valls, Gustau
- Subjects
- *
DROUGHT management , *RECURRENT neural networks , *DROUGHTS , *ATMOSPHERIC temperature , *DROUGHT forecasting , *GOODNESS-of-fit tests - Abstract
In the context of climate change, droughts, increasingly frequent and severe, necessitate effective monitoring. Existing methods, such as drought indices and data-driven models, face important limitations. Drought indices are built on prior expert knowledge but lack calibration based on actual drought events, while data-driven models prioritize goodness of fit over real event identification, undermining their credibility and generalization, and also struggling to generalize from regional to large-scale contexts. To address these challenges, here we introduce a hybrid machine learning framework for time series that combines domain knowledge and observational data in a variational recurrent neural network. The network models the joint distribution of total precipitation, air temperature, and real drought events, providing accurate predictions and uncertainty estimates. Extensive experiments focusing on a wide range of European drought events from 2011 to 2018 consistently show that our hybrid model surpasses both drought indices and data-driven models in terms of accuracy in drought detection, underlining its effectiveness, robustness, and stability. Our model achieves the best ROC-AUC (%) results in Afghanistan (79.7 ± 0.5), Italy (84.3 ± 0.6), Russia (89.4 ± 0.2), Europe-0 (84.3 ± 0.1), and Europe-1 (82.8 ± 0.4), effectively capturing the starting and ending times of drought events with lower uncertainty, and also generalizing better for unseen locations. • We propose DK-VRN, a novel hybrid framework for drought monitoring. • DK-VRN takes ECVs as input and integrates multi-scalar SPEI as domain knowledge prior. • DK-VRN is end-to-end supervised by historical drought events. • DK-VRN can detect droughts over diverse timeframes and across continental scales. • Experiments on European events from 2011 to 2018 show the effectiveness of DK-VRN. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
25. Explainable deep learning for automatic rock classification.
- Author
-
Zheng, Dongyu, Zhong, Hanting, Camps-Valls, Gustau, Cao, Zhisong, Ma, Xiaogang, Mills, Benjamin, Hu, Xiumian, Hou, Mingcai, and Ma, Chao
- Subjects
- *
AUTOMATIC classification , *DEEP learning , *SEDIMENTARY rocks , *FEATURE extraction , *SILTSTONE , *EARTH sciences - Abstract
As deep learning (DL) gains popularity for its ability to make accurate predictions in various fields, its applications in geosciences are also on the rise. Many studies focus on achieving high accuracy in DL models by selecting models, developing more complex architectures, and tuning hyperparameters. However, the interpretability of these models, or the ability to understand how they make their predictions, is less frequently discussed. To address the challenge of high accuracy but low interpretability of DL models in geosciences, we study rock classification from thin-section photomicrographs of six types of sedimentary rocks, including quartz arenite, feldspathic arenite, lithic arenite, siltstone, oolitic packstone, and dolomite. These rocks' characteristic framework grains and grain textures are their distinguishing features, such as the rounded or oval ooids in oolitic packstone. We first train regular DL models, such as ResNet-50, on these photomicrographs and achieve an accuracy of over 0.94. However, these models make classifications based on features like cracks, cements, and scale bars, which are irrelevant for distinguishing sedimentary rocks in real-world applications. We then propose an attention-based dual network incorporating both global (overall photomicrograph) and local (distinguishing framework grains) features to address this issue. Our proposed model achieves not only high accuracy (0.99) but also provides interpretable feature extractions. Our study highlights the need to consider interpretability and geological knowledge in developing DL models, in addition to aiming for high accuracy. [Display omitted] • Proposed dual network achieves high accuracy (0.99) and interpretable feature extractions for sedimentary rock classification. • Regular DL models achieved high accuracy (>0.94), but relied on irrelevant features for classification. • This study emphasizes importance of interpretability and geological knowledge in developing DL models for geosciences. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. A carbon sink-driven approach to estimate gross primary production from microwave satellite observations.
- Author
-
Teubner, Irene E., Forkel, Matthias, Camps-Valls, Gustau, Jung, Martin, Miralles, Diego G., Tramontana, Gianluca, van der Schalie, Robin, Vreugdenhil, Mariette, Mösinger, Leander, and Dorigo, Wouter A.
- Subjects
- *
MODIS (Spectroradiometer) , *ATMOSPHERIC carbon dioxide , *OPTICAL remote sensing , *THROUGHFALL , *GRID cells , *MICROWAVE remote sensing - Abstract
Global estimation of Gross Primary Production (GPP) - the uptake of atmospheric carbon dioxide by plants through photosynthesis - is commonly based on optical satellite remote sensing data. This presents a source-driven approach since it uses the amount of absorbed light, the main driver of photosynthesis, as a proxy for GPP. Vegetation Optical Depth (VOD) estimates obtained from microwave sensors provide an alternative and independent data source to estimate GPP on a global scale, which may complement existing GPP products. Recent studies have shown that VOD is related to aboveground biomass, and that both VOD and temporal changes in VOD relate to GPP. In this study, we build upon this concept and propose a model for estimating GPP from VOD. Since the model is driven by vegetation biomass, as observed through VOD, it presents a carbon sink-driven approach to quantify GPP and, therefore, is conceptually different from common source-driven approaches. The model developed in this study uses single frequencies from active or passive microwave VOD retrievals from C-, X- and Ku-band (Advanced Scatterometer (ASCAT) and Advanced Microwave Scanning Radiometer for Earth Observation (AMSR-E)) to estimate GPP at the global scale. We assessed the ability for temporal and spatial extrapolation of the model using global GPP from FLUXCOM and in situ GPP from FLUXNET. We further performed upscaling of in situ GPP based on different VOD data sets and compared these estimates with the FLUXCOM and MODerate-resolution Imaging Spectroradiometer (MODIS) GPP products. Our results show that the model developed for individual grid cells using VOD and change in VOD as input performs well in predicting temporal patterns in GPP for all VOD data sets. For spatial extrapolation of the model, however, additional input variables are needed to represent the spatial variability of the VOD-GPP relationship due to differences in vegetation type. As additional input variable, we included the grid cell median VOD (as a proxy for vegetation cover), which increased the model performance during cross validation. Mean annual GPP obtained for AMSR-E X-band data tends to overestimate mean annual GPP for FLUXCOM and MODIS but shows comparable latitudinal patterns. Overall, our findings demonstrate the potential of VOD for estimating GPP. The sink-driven approach provides additional information about GPP independent of optical data, which may contribute to our knowledge about the carbon source-sink balance in different ecosystems. • Estimating Gross Primary Production (GPP) based on Vegetation Optical Depth (VOD) • Relationship between VOD and GPP can be described through a differential equation. • Further, we use Generalized Additive Models for estimating GPP from VOD. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
27. Multitemporal and multiresolution leaf area index retrieval for operational local rice crop monitoring.
- Author
-
Campos-Taberner, Manuel, García-Haro, Francisco Javier, Camps-Valls, Gustau, Grau-Muedra, Gonçal, Nutini, Francesco, Crema, Alberto, and Boschetti, Mirco
- Subjects
- *
RICE , *LEAF area index , *HIGH resolution imaging , *GAUSSIAN processes , *RADIATIVE transfer , *PLANTS - Abstract
This paper presents an operational chain for high-resolution leaf area index (LAI) retrieval from multiresolution satellite data specifically developed for Mediterranean rice areas. The proposed methodology is based on the inversion of the PROSAIL radiative transfer model through the state-of-the-art nonlinear Gaussian process regression (GPR) method. Landsat and SPOT5 data were used for multitemporal LAI retrievals at high-resolution. LAI estimates were validated using time series of in situ LAI measurements collected during the rice season in Spain and Italy. Ground LAI data were collected with smartphones using PocketLAI, a specific phone application for LAI estimation. Temporal evolution of the LAI estimates using Landsat and SPOT5 data followed consistently the temporal evolution of the in situ LAI measurements acquired on several Mediterranean rice varieties. The estimates had a root-mean-square-error (RMSE) of 0.39 and 0.51 m 2 /m 2 in Spain and 0.38 and 0.47 m 2 /m 2 in Italy for Landsat and SPOT5 respectively, with a strong correlation (R 2 > 0.92) for both cases. Spatial-temporal assessment of the estimated LAI from Landsat and SPOT5 data confirmed the robustness and consistency of the retrieval chain. This paper demonstrates the importance of an adequate characterization of the underlying rice background in order to address changes in background condition related to water management. Results highlight the potential of the proposed chain for deriving multitemporal near real-time decametric LAI maps fundamental for operational rice crop monitoring, and demonstrate the readiness of the proposed method for the processing of data such as the recently launched Sentinel-2. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
28. Land use classification over smallholding areas in the European Common Agricultural Policy framework.
- Author
-
Campos-Taberner, Manuel, Javier García-Haro, Francisco, Martínez, Beatriz, Sánchez-Ruiz, Sergio, Moreno-Martínez, Álvaro, Camps-Valls, Gustau, and Amparo Gilabert, María
- Subjects
- *
ZONING , *DEEP learning , *AGRICULTURAL policy , *MACHINE learning , *RECURRENT neural networks , *FRUIT trees , *ARABLE land - Abstract
Land use (LU) monitoring and mapping from remote sensing (RS) data are relevant topics in Earth observation (EO). LU monitoring requires powerful/robust classification approaches able to provide reliable identifications in decision-making applications, even over challenging areas. The European Common Agricultural Policy (CAP) promotes the use of remote sensing for LU monitoring in Europe useful in the compliance control of the subsidy payments. In this context, the present study proposes a novel classification chain to identify ten land uses, namely: forest (FOR), pasture with trees (TRE), pasture with shrubs (SHR), pastureland (PAS), vineyard (VIN), arable land (ARL), fruit trees (FRU), nut trees (NUT), citrus (CIT), and olive grove (OLI), over three areas in the Valencian Autonomous Region (Spain) where smallholding farming predominates. The approach exploits the multitemporal information provided by Sentinel-2 data using a novel spatial strategy specifically designed to deal with heterogeneous agricultural areas. More precisely, we implemented a deep learning algorithm based on bidirectional recurrent neural networks to account for complex temporal dynamics. The classification results over the three areas showed accuracies higher than 95.5% over validation sets composed of in-situ checks never used during the training process. The proposed methodology outperformed standard approaches that not consider the spatial variability of the training samples, and revealed very good agreement concerning the Land Parcel Identification System (LPIS) of Spain. In addition, the developed chain proposes the novelty of using the kernel normalized difference vegetation index (kNDVI) as a predictor in a LU classification processing chain. Including the kNDVI instead of the traditional NDVI outperformed the classification accuracy in all metrics and classes. Ultimately, the obtained classifications were used for assessing the 2020 farmers' declarations in the three study areas. The declarations showed a level of agreement concerning the proposed approach near 99%. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
29. Kernel dependence regularizers and Gaussian processes with applications to algorithmic fairness.
- Author
-
Li, Zhu, Pérez-Suay, Adrián, Camps-Valls, Gustau, and Sejdinovic, Dino
- Subjects
- *
GAUSSIAN processes , *FAIRNESS , *HILBERT space , *REAL income , *RACE , *ADOPTED children - Abstract
• A general framework of empirical risk minimization with fairness regularizers and an analysis of its risk and fairness statistical consistency results are presented. • A Gaussian Process (GP) formulation of the fairness regularization framework is derived, which allows uncertainty quantification and principled hyperparameter selection. • A normalized version of the fairness regularizer which makes it less sensitive to the choice of kernel parameters is derived. Current adoption of machine learning in industrial, societal and economical activities has raised concerns about the fairness, equity and ethics of automated decisions. Predictive models are often developed using biased datasets and thus retain or even exacerbate biases in their decisions and recommendations. Removing the sensitive covariates, such as gender or race, is insufficient to remedy this issue since the biases may be retained due to other related covariates. We present a regularization approach to this problem that trades off predictive accuracy of the learned models (with respect to biased labels) for the fairness in terms of statistical parity, i.e. independence of the decisions from the sensitive covariates. In particular, we consider a general framework of regularized empirical risk minimization over reproducing kernel Hilbert spaces and impose an additional regularizer of dependence between predictors and sensitive covariates using kernel-based measures of dependence, namely the Hilbert-Schmidt Independence Criterion (HSIC) and its normalized version. This approach leads to a closed-form solution in the case of squared loss, i.e. ridge regression. We also provide statistical consistency results for both risk and fairness bound for our approach. Moreover, we show that the dependence regularizer has an interpretation as modifying the corresponding Gaussian process (GP) prior. As a consequence, a GP model with a prior that encourages fairness to sensitive variables can be derived, allowing principled hyperparameter selection and studying of the relative relevance of covariates under fairness constraints. Experimental results in synthetic examples and in real problems of income and crime prediction illustrate the potential of the approach to improve fairness of automated decisions. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
30. Estimation of vegetation traits with kernel NDVI.
- Author
-
Wang, Qiang, Moreno-Martínez, Álvaro, Muñoz-Marí, Jordi, Campos-Taberner, Manuel, and Camps-Valls, Gustau
- Subjects
- *
NORMALIZED difference vegetation index , *LEAF area index , *LATENT heat - Abstract
Vegetation indices computed from spectral signatures are vastly used for monitoring the terrestrial biosphere. Indices are convenient proxies for canopy structure, and leaf pigment content, and consequently to estimate the photosynthetic activity of vegetation. Owing to its simplicity, the celebrated Normalized Difference Vegetation Index (NDVI) has been used as a proxy for greenness and canopy structure. Unfortunately, NDVI can only capture linear relationships of the near infrared (NIR) - red difference with the parameter of interest. To account for higher-order relations between the spectral channels, kernel NDVI (kNDVI) was proposed in (Camps-Valls et al., 2021). In this work, we give useful prescriptions for its proper use and show its good performance in a wider set of applications. We discuss the good characteristics of the index like boundedness, low error propagation. Furthermore, we give empirical evidence of performance in estimating in-situ vegetation parameters (leaf area index (LAI), gross primary productivity (GPP), leaf, and canopy chlorophyll content, green and total LAI and fraction of absorbed photosynthetically active radiation (fAPAR)) as well as the estimation of latent heat at flux tower level. We confirm the generally good performance of the index (correlation coefficient of kNDVI and canopy chlorophyll content is 0.919 and 0.933 for maize over two sites, as well as the correlation coefficient between kNDVI and carotenoid, is 0.816, 0.520 and 0.579 for three forest sites) and highlight its convenience in monitoring terrestrial ecosystems. To foster wider adoption of the new family of the index, we provide source code in 6 programming languages as well as efficient implementations in the Google Earth Engine (GEE) platform at https://github.com/IPL-UV/kNDVI. • The nonlinear kNDVI generalizes standard vegetation indices. • kNDVI propagates lower uncertainty and addresses saturation issues. • Prescriptions for the correct use of the index are given. • kNDVI yields improved performance to estimate in-situ vegetation parameters. • kNDVI improves estimation of latent heat at flux tower level. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
31. Uncertainty analysis of gross primary production upscaling using Random Forests, remote sensing and eddy covariance data.
- Author
-
Tramontana, Gianluca, Ichii, Kazuito, Camps-Valls, Gustau, Tomelleri, Enrico, and Papale, Dario
- Subjects
- *
PRIMARY productivity (Biology) , *RANDOM forest algorithms , *REMOTE sensing , *ANALYSIS of covariance , *CLIMATE change - Abstract
The accurate quantification of carbon fluxes at continental spatial scale is important for future policy decisions in the context of global climate change. However, many elements contribute to the uncertainty of such estimate. In this study, the uncertainties of eight days gross primary production (GPP) predicted by Random Forest (RF) machine learning models were analysed at the site, ecosystem and European spatial scales. At the site level, the uncertainties caused by the missing of key drivers were evaluated. The most accurate predictions of eight days GPP were obtained when all available drivers were used (Pearson's correlation coefficient, ρ ~ 0.84; Root Mean Square Error (RMSE) ~ 1.8 g C m −2 d −1 ). However, when predictions were based on only remotely sensed data the accuracy was close to the optimum (ρ ~ 0.8; RMSE ~ 1.9 g C m −2 d −1 ) and to a commonly used light use efficiency model (MOD17) with parameters optimised for the applied study sites (the MOD17 +, ρ ~ 0.79; RMSE ~ 2.04 g C m −2 d −1 ). Remotely sensed data were key drivers for the accurate prediction of GPP in ecosystems with high variability of green biomass over the phenological cycle (e.g., deciduous broad-leaved forests) or highly affected by the human management (e.g. croplands). In contrast, in the ecosystems with low variability of greenness (e.g., evergreen broad-leaved forests), the predictions were poor when meteorological information were not used. At a European spatial scale, when modelled grids of meteorological, land cover and fPAR data were used as inputs, the propagation of their uncertainty, not accounted in the models training, had significant effects on the uncertainty of the mean annual GPP. At this scale, the effects of meteorological uncertainty were higher than the misclassification error. These findings suggested that a strategy based on satellite-measured data could be a favourable improvement for the spatial upscaling of GPP, because avoiding the propagation of the uncertainties of the modelled grids. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
32. Active emulation of computer codes with Gaussian processes – Application to remote sensing.
- Author
-
Svendsen, Daniel Heestermans, Martino, Luca, and Camps-Valls, Gustau
- Subjects
- *
GAUSSIAN processes , *REMOTE sensing , *COMPUTER programming , *EMULATION software , *MACHINE learning , *HEURISTIC - Abstract
• Introduction of methodology for active multi-output Gaussian process emulators. • Gaussian processes allow the construction of accurate, compact and interpretable emulators. • Adaptive sequential construction of both the emulator and look-up-table. • New acquisition function combines function geometry and predictive uncertainty. • Application to complex codes used on remote sensing leads to considerable computational savings. Many fields of science and engineering rely on running simulations with complex and computationally expensive models to understand the involved processes in the system of interest. Nevertheless, the high cost involved hamper reliable and exhaustive simulations. Very often such codes incorporate heuristics that ironically make them less tractable and transparent. This paper introduces an active learning methodology for adaptively constructing surrogate models, i.e. emulators , of such costly computer codes in a multi-output setting. The proposed technique is sequential and adaptive, and is based on the optimization of a suitable acquisition function. It aims to achieve accurate approximations, model tractability, as well as compact and expressive simulated datasets. In order to achieve this, the proposed Active Multi-Output Gaussian Process Emulator (AMOGAPE) combines the predictive capacity of Gaussian Processes (GPs) with the design of an acquisition function that favors sampling in low density and fluctuating regions of the approximation functions. Comparing different acquisition functions, we illustrate the promising performance of the method for the construction of emulators with toy examples, as well as for a widely used remote sensing transfer code. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
33. Quantifying uncertainty in high resolution biophysical variable retrieval with machine learning.
- Author
-
Martínez-Ferrer, Laura, Moreno-Martínez, Álvaro, Campos-Taberner, Manuel, García-Haro, Francisco Javier, Muñoz-Marí, Jordi, Running, Steven W., Kimball, John, Clinton, Nicholas, and Camps-Valls, Gustau
- Subjects
- *
MODIS (Spectroradiometer) , *LEAF area index , *MACHINE learning , *LANDSAT satellites , *REMOTE sensing , *ARTIFICIAL neural networks - Abstract
The estimation of biophysical variables is at the core of remote sensing science, allowing a close monitoring of crops and forests. Deriving temporally resolved and spatially explicit maps of parameters of interest has been the subject of intense research. However, deriving products from optical sensors is typically hampered by cloud contamination and the trade-off between spatial and temporal resolutions. In this work we rely on the HIghly Scalable Temporal Adaptive Reflectance Fusion Model (HISTARFM) algorithm to generate long gap-free time series of Landsat surface reflectance data by fusing MODIS and Landsat reflectances. An artificial neural network is trained on PROSAIL inversion to predict monthly biophysical variables at 30 m spatial resolution with associated, realistic uncertainty bars. We emphasize the need for a more thorough analysis of uncertainty, and propose a general and scalable approach to combine both epistemic and aleatoric uncertainties by exploiting Monte Carlo (MC) dropout techniques from the trained artificial network and the propagation of HISTARFM uncertainties through the model, respectively. A model recalibration was performed in order to provide reliable uncertainties. We provide new high resolution products of several key variables to quantify the terrestrial biosphere: Leaf Area Index (LAI), Fraction of Absorbed Photosynthetically Active Radiation (FAPAR), Canopy Water Content (CWC) and Fractional Vegetation Cover (FVC) are at 30 m Landsat spatial resolution and over large continental areas. Two study areas are considered: the large heterogeneous but moderately cloud covered contiguous United States, and the homogeneous but largely cloud covered Amazonia. The produced vegetation products largely agree with the test dataset (R = 0.90, RMSE = 0.80 m 2/ m 2 and ME = 0.12 m 2/ m 2 for LAI, and R = 0.98, RMSE = 0.07 and ME = 0.01 for FAPAR) providing low error and high accuracy. Additionally, the validation considers a thorough comparison with operational and largely validated medium resolution products, such as the Moderate-Resolution Imaging Spectroradiometer (MODIS) and Copernicus Global Land Service. Our products presented a good agreement and consistency with both MODIS (R = 0.84 and R = 0.85 for LAI and FAPAR, respectively) and Copernicus (R = 0.92 and R = 0.91 for LAI and FAPAR, respectively). To foster a wider adoption and reproducibility of the methodology we provide an application in GEE and source code at: https://github.com/IPL-UV/ee_BioNet/ • Proposed a methodology based on RTM inversion with NN for biophysical parameter estimation. • Uncertainty quantification considering model and data uncertainties. • Biophysical parameters and realistic uncertainties gap-free maps at high resolution (30 m). • Implications for high resolution carbon fluxes and phenologies applications. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
34. A hybrid generative adversarial network for weakly-supervised cloud detection in multispectral images.
- Author
-
Li, Jun, Wu, Zhaocong, Sheng, Qinghong, Wang, Bo, Hu, Zhongwen, Zheng, Shaobo, Camps-Valls, Gustau, and Molinier, Matthieu
- Subjects
- *
GENERATIVE adversarial networks , *OPTICAL remote sensing , *LANDSAT satellites , *CONVOLUTIONAL neural networks , *MULTISPECTRAL imaging , *DEEP learning , *HYBRID cloud computing , *OPTICAL sensors - Abstract
Cloud detection is a crucial step in the optical satellite image processing pipeline for Earth observation. Clouds in optical remote sensing images seriously affect the visibility of the background and greatly reduce the usability of images for land applications. Traditional methods based on thresholding, multi-temporal or multi-spectral information are often specific to a particular satellite sensor. Convolutional Neural Networks for cloud detection often require labeled cloud masks for training that are very time-consuming and expensive to obtain. To overcome these challenges, this paper presents a hybrid cloud detection method based on the synergistic combination of generative adversarial networks (GAN) and a physics-based cloud distortion model (CDM). The proposed weakly-supervised GAN-CDM method (available online https://github.com/Neooolee/GANCDM) only requires patch-level labels for training, and can produce cloud masks at pixel-level in both training and testing stages. GAN-CDM is trained on a new globally distributed Landsat 8 dataset (WHUL8-CDb, available online doi: https://doi.org/10.5281/zenodo.6420027) including image blocks and corresponding block-level labels. Experimental results show that the proposed GAN-CDM method trained on Landsat 8 image blocks achieves much higher cloud detection accuracy than baseline deep learning-based methods, not only in Landsat 8 images (L8 Biome dataset, 90.20% versus 72.09%) but also in Sentinel-2 images ("S2 Cloud Mask Catalogue" dataset, 92.54% versus 77.00%). This suggests that the proposed method provides accurate cloud detection in Landsat images, has good transferability to Sentinel-2 images, and can quickly be adapted for different optical satellite sensors. • A hybrid weakly-supervised cloud detection method is presented. • Generative adversarial network is injected to cloud distortion model. • A novel model finetune strategy is presented. • The method has good transfer ability. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
35. Multispectral high resolution sensor fusion for smoothing and gap-filling in the cloud.
- Author
-
Moreno-Martínez, Álvaro, Izquierdo-Verdiguier, Emma, Maneta, Marco P., Camps-Valls, Gustau, Robinson, Nathaniel, Muñoz-Marí, Jordi, Sedano, Fernando, Clinton, Nicholas, and Running, Steven W.
- Subjects
- *
OPTICAL remote sensing , *KALMAN filtering , *ALGORITHMS , *OCEAN color , *ERROR analysis in mathematics , *MULTISPECTRAL imaging , *ARTIFICIAL satellite attitude control systems - Abstract
Remote sensing optical sensors onboard operational satellites cannot have high spectral, spatial and temporal resolutions simultaneously. In addition, clouds and aerosols can adversely affect the signal contaminating the land surface observations. We present a HIghly Scalable Temporal Adaptive Reflectance Fusion Model (HISTARFM) algorithm to combine multispectral images of different sensors to reduce noise and produce monthly gap free high resolution (30 m) observations over land. Our approach uses images from the Landsat (30 m spatial resolution and 16 day revisit cycle) and the MODIS missions, both from Terra and Aqua platforms (500 m spatial resolution and daily revisit cycle). We implement a bias-aware Kalman filter method in the Google Earth Engine (GEE) platform to obtain fused images at the Landsat spatial-resolution. The added bias correction in the Kalman filter estimates accounts for the fact that both model and observation errors are temporally auto-correlated and may have a non-zero mean. This approach also enables reliable estimation of the uncertainty associated with the final reflectance estimates, allowing for error propagation analyses in higher level remote sensing products. Quantitative and qualitative evaluations of the generated products through comparison with other state-of-the-art methods confirm the validity of the approach, and open the door to operational applications at enhanced spatio-temporal resolutions at broad continental scales. • Presented a new fusion algorithm to produce gap free Landsat reflectance datasets. • The algorithm is highly scalable and runs optimally in cloud computing environments. • The algorithm also provides the uncertainty associated with the final estimates. • Quantitative and qualitative evaluation of the algorithm obtained good results. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
36. Fusing optical and SAR time series for LAI gap filling with multioutput Gaussian processes.
- Author
-
Pipia, Luca, Muñoz-Marí, Jordi, Amin, Eatidal, Belda, Santiago, Camps-Valls, Gustau, and Verrelst, Jochem
- Subjects
- *
GAUSSIAN processes , *TIME series analysis , *LEAF area index , *POTATOES , *SYNTHETIC aperture radar , *GROUND vegetation cover , *OATS , *BEETS - Abstract
The availability of satellite optical information is often hampered by the natural presence of clouds, which can be problematic for many applications. Persistent clouds over agricultural fields can mask key stages of crop growth, leading to unreliable yield predictions. Synthetic Aperture Radar (SAR) provides all-weather imagery which can potentially overcome this limitation, but given its high and distinct sensitivity to different surface properties, the fusion of SAR and optical data still remains an open challenge. In this work, we propose the use of Multi-Output Gaussian Process (MOGP) regression, a machine learning technique that learns automatically the statistical relationships among multisensor time series, to detect vegetated areas over which the synergy between SAR-optical imageries is profitable. For this purpose, we use the Sentinel-1 Radar Vegetation Index (RVI) and Sentinel-2 Leaf Area Index (LAI) time series over a study area in north west of the Iberian peninsula. Through a physical interpretation of MOGP trained models, we show its ability to provide estimations of LAI even over cloudy periods using the information shared with RVI, which guarantees the solution keeps always tied to real measurements. Results demonstrate the advantage of MOGP especially for long data gaps, where optical-based methods notoriously fail. The leave-one-image-out assessment technique applied to the whole vegetation cover shows MOGP predictions improve standard GP estimations over short-time gaps (R2 of 74% vs 68%, RMSE of 0.4 vs 0.44 [ m 2 m − 2 ]) and especially over long-time gaps (R2 of 33% vs 12%, RMSE of 0.5 vs 1.09 [ m 2 m − 2 ]). A second assessment is focused on crop-specific regions, clustering pixels fulfilling specific model conditions where the synergy is profitable. Results reveal the MOGP performance is crop type and crop stage dependent. For long time gaps, best R2 are obtained over maize, ranging from 0.1 (tillering) to 0.36 (development) up to 0.81 (maturity); for moderate time gap, R2 = 0.93 (maturity) is obtained. Crops such as wheat, oats, rye and barley, can profit from the LAI-RVI synergy, with R2 varying between 0.4 and 0.6. For beet or potatoes, MOGP provides poorer results, but alternative descriptors to RVI should be tested for these specific crops in the future before discarding synergy real benefits. In conclusion, active-passive sensor fusion with MOGP represents a novel and promising approach to cope with crop monitoring over cloud-dominated areas. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
37. Synergistic integration of optical and microwave satellite data for crop yield estimation.
- Author
-
Mateo-Sanchis, Anna, Piles, Maria, Muñoz-Marí, Jordi, Adsuara, Jose E., Pérez-Suay, Adrián, and Camps-Valls, Gustau
- Subjects
- *
CROP yields , *NONLINEAR regression , *AGRICULTURAL statistics , *MICROWAVES , *OPTICAL depth (Astrophysics) , *CORN yields - Abstract
Developing accurate models of crop stress, phenology and productivity is of paramount importance, given the increasing need of food. Earth observation (EO) remote sensing data provides a unique source of information to monitor crops in a temporally resolved and spatially explicit way. In this study, we propose the combination of multisensor (optical and microwave) remote sensing data for crop yield estimation and forecasting using two novel approaches. We first propose the lag between Enhanced Vegetation Index (EVI) derived from MODIS and Vegetation Optical Depth (VOD) derived from SMAP as a new joint metric combining the information from the two satellite sensors in a unique feature or descriptor. Our second approach avoids summarizing statistics and uses machine learning to combine full time series of EVI and VOD. This study considers two statistical methods, a regularized linear regression and its nonlinear extension called kernel ridge regression to directly estimate the county-level surveyed total production, as well as individual yields of the major crops grown in the region: corn, soybean and wheat. The study area includes the US Corn Belt, and we use agricultural survey data from the National Agricultural Statistics Service (USDA-NASS) for year 2015 for quantitative assessment. Results show that (1) the proposed EVI-VOD lag metric correlates well with crop yield and outperforms common single-sensor metrics for crop yield estimation; (2) the statistical (machine learning) models working directly with the time series largely improve results compared to previously reported estimations; (3) the combined exploitation of information from the optical and microwave data leads to improved predictions over the use of single sensor approaches with coefficient of determination R ≥ 2 0.76 ; (4) when models are used for within-season forecasting with limited time information, crop yield prediction is feasible up to four months before harvest (models reach a plateau in accuracy); and (5) the robustness of the approach is confirmed in a multi-year setting, reaching similar performances than when using single-year data. In conclusion, results confirm the value of using both EVI and VOD at the same time, and the advantage of using automatic machine learning models for crop yield/production estimation. • 1. Optical and passive microwave data are complementary and useful to estimate crop yield. • 2. The time lag between EVI and VOD is proposed as a new metric for crop assessment. • 3. The EVI/VOD lag outperforms common single-sensor metrics for crop yield estimation. • 4. Machine learning further improves results by directly blending multisensor time series. • 5. Crop yield prediction is feasible up to four months before harvest. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.