Elsevier

Remote Sensing of Environment

Volume 216, October 2018, Pages 482-496
Remote Sensing of Environment

Generalizing machine learning regression models using multi-site spectral libraries for mapping vegetation-impervious-soil fractions across multiple cities

https://doi.org/10.1016/j.rse.2018.07.011Get rights and content

Highlights

  • SVR was used to map VIS fractions across multiple cities.

  • Synthetic mixtures from spectral libraries were used for SVR model training.

  • A multi-site library allowed for mapping all cities with one generalized model

  • The generalized model achieved similar quality to the local models.

  • The generalized model showed a higher transferability to unknown sites.

Abstract

Forthcoming spaceborne imaging spectrometers will provide novel opportunities for mapping urban composition globally. To move from case studies for single cities towards comparative and more operational analyses, generalized models that may be transferred throughout space are desired. In this study, we investigated how single regression models can be spatially generalized for vegetation-impervious-soil (VIS) mapping across multiple cities. The combination of support vector regression (SVR) with synthetically mixed training data generated from spectral libraries was used for fraction mapping. We developed three local models based on separate spectral libraries from Berlin (Germany), Brussels (Belgium), and Santa Barbara (U.S.), and a generalized model based on a combined multi-site spectral library. To examine the performance and transferability of the generalized model compared to local models, we first applied all model variants to simulated Environmental Mapping and Analysis Program (EnMAP) data from the three cities that were represented in the models, i.e., known sites. Next, we transferred the models to two unknown sites not represented in the models, San Francisco Bay Area (U.S.) and Munich (Germany). In the first mapping constellation, results demonstrated that the generalized model was capable of accurately mapping VIS fractions across all three known sites. Average mean absolute errors (AV-MAEs) were 8.5, 12.2, and 11.0% for Berlin, Brussels, and Santa Barbara. The performance of the generalized model was very similar to the local models, with ∆AV-MAEs falling within a range of ±0.7%. A detailed assessment of fraction maps and class-wise accuracies confirmed that modeling errors related to remaining limitations of urban mapping based on optical remote sensing data rather than to the choice between a local or generalized model. For the second mapping constellation, the generalized model proved to be useful for mapping vegetation and impervious fractions in the unknown sites. MAEs for both cover types were 5.4 and 10.9% for the San Francisco Bay Area, and 6.3 and 15.4% for Munich. In contrast, the three local models were only found to have similar accuracies as the generalized model for one of the two sites or for individual VIS categories. Despite the enhanced transferability of the generalized model to the unknown sites, deficiencies remained for accurate soil mapping. MAEs were 22.4 and 12.3%, and high over - and underestimations were observed at the low and high end of the fraction range. These shortcomings indicated possible limitations of the spectral libraries to account for the spectral characteristics of soils in the unknown sites. Overall, we conclude that the combination of SVR and synthetically mixed training data generated from multi-site libraries constitutes a flexible modeling approach for generalized urban mapping across multiple cities.

Introduction

Optical satellite remote sensing has great potential for characterizing urban environments. Particularly the mapping of urban composition according to the vegetation-impervious-soil (VIS) framework (Ridd, 1995) has received considerable attention. This framework represents urban areas in a continuous ternary mixing diagram of vegetation, impervious, and soil cover fractions, and was proposed as a standardized mapping scheme to support urban ecosystem analysis, to monitor urban change processes, and for linking urban composition to urban morphology (Ridd, 1995). Numerous studies have exploited multispectral satellite data for mapping individual or all VIS components for different cities around the world (Phinn et al., 2002; Powell et al., 2007; Pu et al., 2008; Rashed et al., 2003). To match the quantitative nature of the VIS framework and to account for mixed pixels typical of urban environments, sub-pixel cover fraction mapping techniques based on linear spectral mixture analysis and its variants (Deng and Wu, 2013; Rashed et al., 2003; Small and Lu, 2006), or based on quantitative empirical modeling with regression algorithms or neural networks (Okujeni et al., 2015; Pu et al., 2008; Walton, 2008) have been widely adopted.

With the ongoing development in spaceborne imaging spectrometry, novel opportunities for a variety of environmental research fields arise. Several hyperspectral satellite missions are currently in preparation, including the Environmental Mapping and Analysis Program (EnMAP, Guanter et al., 2015). EnMAP will frequently collect high spectral resolution images over large geographic areas at a 30 m resolution, which will be passed through standardized processing chains to obtain consistent surface reflectance products. The availability of such imagery will open up new opportunities for global comparative analyses of urban composition and compositional changes. The hyperspectral information will most likely enhance the spectral separability of urban construction materials and vegetation types (Gamba and Dell'Acqua, 2006; Herold et al., 2003), which can lead to more accurate VIS maps. In addition, the spectral benefits will most likely enable the extension of VIS mapping into thematically more detailed impervious and vegetation sub-categories (Okujeni et al., 2015; Roberts et al., 2012; Wetherley et al., 2017). By offering the opportunity to derive more accurate and detailed descriptions of urban composition worldwide, spaceborne imaging spectrometry has considerable potential to introduce a new quality to urban remote sensing.

With regard to satellite imagery with large spatial and high temporal coverage, more generalized mapping and monitoring approaches that can be transferred throughout space and/or time are desired. Generalizing methods in remote sensing were originally introduced as “signature extension” (Botkin et al., 1984). Signature extension aims to move beyond image-by-image approaches by applying a model, which was trained with signatures from one image, to another image from a different geographic location (spatial generalization), a different acquisition time (temporal generalization), a different sensor (across-sensor generalization), or a combination thereof (Foody et al., 2003; Pax-Lenney et al., 2001; Woodcock et al., 2001). In the urban context, globally applicable models were mainly developed for mapping urban extent from MODIS data (Schneider et al., 2010) or different built-up cover types using Landsat data in combination with a digital surface model (Pesaresi et al., 2016). Several studies have explored the use of generalized approaches for mapping urban composition from Landsat data only. Amongst others, Small (2005) demonstrated the use of a global spectral mixture model to map high albedo substrate, vegetation, and dark surface fractions for 28 cities worldwide. Kaspersen et al. (2015) mapped impervious fractions for eight European cities using a regionally generalized regression model. Sexton et al. (2013) made use of a temporally generalized regression model to produce annual maps of impervious surface cover in the Washington-Baltimore metropolitan region between 1984 and 2010. Wang et al. (2014) examined the temporal generalization capability of neural networks, random forest classification, and regression trees for characterizing vegetation fraction dynamics in Zhongwei City between 1990 and 2010.

The availability of stable and comparable surface reflectance units between images is a crucial factor for successfully extending training signatures across scenes (Woodcock et al., 2001). Differences in acquisition conditions and dates, sensor characteristics, reflectance retrieval algorithms, etc., can lead to substantial radiometric inconsistencies between images, which can limit the model transfer across scenes (Olthof et al., 2005; Pax-Lenney et al., 2001). Studies therefore have focused on adjusting images by improved radiometric and atmospheric pre-processing (Olthof et al., 2005; Pax-Lenney et al., 2001), or on adapting the training signatures to individual images (Gray and Song, 2013).

The availability of comprehensive training information is another important factor influencing model generalization. Training signatures may not represent the full diversity and variability of cover types present in new regions or at different periods of time, which limits the model transferability across space and time. The use of multi-source training information representative for multiple sites or dates constitutes a suitable means to overcome these limitations. In the urban context, for example, the aforementioned studies by Kaspersen et al. (2015) and Sexton et al. (2013) demonstrated the use of specific multi-site and multi-annual training areas with impervious reference fractions calculated from high resolution land cover information for spatial and temporal regression model generalization. While this approach is straightforward and produces reliable results, the regression model training fully depends on the availability of accurately co-registered, high resolution reference data. In this regard, multi-source spectral libraries constitute an alternative solution that is independent from the availability of representative training areas and therefore marks a step forward towards more generalized mapping. For example, Michishita et al. (2012) demonstrated the use of a combined multi-annual library for mapping changes in urban composition for five time steps between 1987 and 2009. Similarly, Dudley et al. (2015) demonstrated that a multi-seasonal library can improve vegetation mapping and may support assessments regardless of the seasonality of the input image. All of these studies share the conclusion that multi-source training information provides a suitable means to enhance the spatial and temporal applicability and transferability of mapping models, and consequently their generalization capabilities.

With regard to the unprecedented opportunities of forthcoming spaceborne imaging spectrometers for assessing urban composition globally, the overarching goal of this study was to investigate how single regression models can be spatially generalized for VIS fraction mapping across multiple cities. We selected the combination of support vector regression (SVR) with synthetically mixed training data for fraction mapping (Okujeni et al., 2013). This approach proved to effectively handle urban cover types with high within-class variability, between-class similarity, and spectral mixing from airborne to spaceborne scales (Okujeni et al., 2015; Rosentreter et al., 2017). Moreover, this library-based approach is independent from the spatial context of the work with training areas and appears better suited for more generalized regression model training. Based on spectral libraries for the cities of Berlin (Germany), Brussels (Belgium), and Santa Barbara (U.S.), we created three separate models from the single libraries (henceforth referred to as “local models”), and one model from a combined multi-site library (henceforth referred to as “generalized model”). VIS mapping was subsequently conducted on simulated EnMAP data covering five cities. These included the three cities that were each represented in one local and the generalized model (henceforth referred to as “known sites”), and the two cities San Francisco Bay Area (U.S.) and Munich (Germany) that were not represented in the models (henceforth referred to as “unknown sites”). With this setup, we considered both spectrally more similar and dissimilar sites, known and unknown mapping constellations, as well as the challenges related to the use of different hyperspectral sensors underlying the EnMAP simulation. To demonstrate the good performance and improved transferability of a generalized regression model based on libraries from multiple cities for VIS fraction mapping, we defined two research questions:

  • (1)

    How does the performance of a generalized model compare to local models in mapping their respective sites?

  • (2)

    Is a generalized model more transferable than local models when transferred to unknown sites?

Section snippets

Study sites

The spatial extents of our five study sites correspond to the outline of the available hyperspectral images or a subset extracted thereof (Fig. 1). The Berlin site is located in northeastern Germany and covers a subset of the city's urban-rural gradient. The Brussels site is located in central Belgium and covers the city center and a cross-section through the city's urban-rural gradient. The Santa Barbara site is located in California and comprises the cities and surroundings of Santa Barbara

Development of local and generalized regression models

The schematic workflow for developing local and generalized regression models for fraction mapping is illustrated in Fig. 3. For regression model training, synthetically mixed data from spectral libraries are used (Okujeni et al., 2013) (Fig. 4). These are sets of pure library spectra that are structured according to the land cover categories of interest, their multiple artificial spectral mixtures, together with related mixing fractions per category. The synthetic spectra are then used to

Performances of the generalized and the local models in Berlin, Brussels, and Santa Barbara

The overall accuracy of the generalized model (BE + BR + SB) compared to the local models (BE, BR, and SB) in mapping their respective sites is illustrated in Fig. 5. The generalized model produced AV-MAEs of 8.5% for Berlin, 12.2% for Brussels, and 11.0% for Santa Barbara. Differences in AV-MAEs compared to the local model applications were small and within a range of ±0.7%. However, the transfer of the local models to the two other cities led to a considerable decrease in map accuracy, i.e.,

Application of the generalized model and the local models to known sites

We compared the performances of the generalized model and the local models when applied to the known sites Berlin, Brussels, and Santa Barbara. This comparison examined whether model generalization using a multi-site spectral library was possible without a relevant loss in mapping quality. Overall accuracies (Fig. 5), class-wise accuracies (Fig. 7), and fraction maps (Fig. 6) confirmed that the generalized model was well suited for accurate VIS mapping across the three cities. The performance

Conclusions

The overarching goal of this study was to examine how single regression models can be spatially generalized for VIS fraction mapping across multiple cities. With regard to the opportunity for global assessments of urban composition by means of forthcoming spaceborne imaging spectrometers, our analyses targeted the mapping of VIS fractions from simulated EnMAP data covering the study sites Berlin, Brussels, Santa Barbara, San Francisco Bay Area, and Munich. Based on our findings, we conclude

Acknowledgments

This research was funded by the Belgian Federal Science Policy (Belspo) as part of the UrbanEARS project (SR/00/307). The Berlin HyMap image acquisition and processing was funded by the Federal Ministry of Research and Education (BMBF, FKZ01LK0901A). The APEX image acquisition and processing was funded by Belspo (BELAIR SONIA 2015, SR/03/333). The AVIRIS images were supplied by the NASA Jet Propulsion Laboratory (JPL). Further image pre-processing and spectral library development for the Santa

References (66)

  • R. Michishita et al.

    Monitoring two decades of urbanization in the Poyang Lake area, China through spectral unmixing

    Remote Sens. Environ.

    (2012)
  • A. Okujeni et al.

    Support vector regression and synthetically mixed training data for quantifying urban land cover

    Remote Sens. Environ.

    (2013)
  • A. Okujeni et al.

    Extending the vegetation–impervious–soil model using simulated EnMAP data and machine learning

    Remote Sens. Environ.

    (2015)
  • I. Olthof et al.

    Signature extension through space for northern landcover classification: a comparison of radiometric correction methods

    Remote Sens. Environ.

    (2005)
  • M. Pax-Lenney et al.

    Forest mapping with a generalized classifier and Landsat TM data

    Remote Sens. Environ.

    (2001)
  • R.L. Powell et al.

    Sub-pixel mapping of urban land cover using multiple endmember spectral mixture analysis: Manaus, Brazil

    Remote Sens. Environ.

    (2007)
  • R. Pu et al.

    Spectral mixture analysis for mapping abundance of urban surface components from the Terra/ASTER data

    Remote Sens. Environ.

    (2008)
  • D.A. Roberts et al.

    Mapping chaparral in the Santa Monica Mountains using multiple endmember spectral mixture models

    Remote Sens. Environ.

    (1998)
  • D.A. Roberts et al.

    Synergies between VSWIR and TIR data for the urban environment: an evaluation of the potential for the Hyperspectral Infrared Imager (HyspIRI) Decadal Survey mission

    Remote Sens. Environ.

    (2012)
  • D.A. Roberts et al.

    Relationships between dominant plant species, fractional cover and land surface temperature in a Mediterranean ecosystem

    Remote Sens. Environ.

    (2015)
  • A. Schneider et al.

    Mapping global urban areas using MODIS 500-m data: new methods and datasets based on ‘urban ecoregions’

    Remote Sens. Environ.

    (2010)
  • J.O. Sexton et al.

    Urban growth of the Washington, D.C.–Baltimore, MD metropolitan region from 1984 to 2010 by annual, Landsat-based estimates of impervious cover

    Remote Sens. Environ.

    (2013)
  • C. Small et al.

    Estimation and vicarious validation of urban vegetation abundance by spectral mixture analysis

    Remote Sens. Environ.

    (2006)
  • S.V. Stehman et al.

    Pixels, blocks of pixels, and polygons: choosing a spatial unit for thematic accuracy assessment

    Remote Sens. Environ.

    (2011)
  • D.R. Thompson et al.

    Atmospheric correction for global mapping spectroscopy: ATREM advances for the HyspIRI preparatory campaign

    Remote Sens. Environ.

    (2015)
  • E.B. Wetherley et al.

    Mapping spectrally similar urban materials at sub-pixel scales

    Remote Sens. Environ.

    (2017)
  • C.E. Woodcock et al.

    Monitoring large areas for forest change using Landsat: generalization across space, time and Landsat sensors

    Remote Sens. Environ.

    (2001)
  • D.B. Botkin et al.

    Studying the Earth's vegetation from space

    Bioscience

    (1984)
  • L. Breiman

    Bagging predictors

    Mach. Learn.

    (1996)
  • A. Coates et al.

    Monitoring the impacts of severe drought on Southern California Chaparral species using hyperspectral and thermal infrared imagery

    Remote Sens.

    (2015)
  • J. Degerickx et al.

    A novel spectral library pruning technique for spectral unmixing of urban land cover

    Remote Sens.

    (2017)
  • S. van der Linden et al.

    The EnMAP-Box—a toolbox and application programming interface for EnMAP data processing

    Remote Sens.

    (2015)
  • P. Gamba et al.

    Spectral resolution in the context of very high resolution urban remote sensing

  • Cited by (34)

    • Arctic shrub expansion revealed by Landsat-derived multitemporal vegetation cover fractions in the Western Canadian Arctic

      2022, Remote Sensing of Environment
      Citation Excerpt :

      A single run of synthmix was applied for each class and model to generate 2500 synthetic training data points. Overall, the parameter setup of synthmix followed the proven strategy of previous studies (Okujeni et al., 2017, 2018; Schug et al., 2020), in which each synthetic training instance was generated as follows: First, we specified the possible number of unique endmember spectra that make up the mixed signal, each associated with a unique probability (p) of occurrence. This encompassed binary (p = 0.70), ternary (p = 0.25) and quaternary (p = 0.05) mixing ratios.

    • Pan-European urban green space dynamics: A view from space between 1990 and 2015

      2022, Landscape and Urban Planning
      Citation Excerpt :

      To also account for small-scale urban green spaces, we estimated the subpixel urban green area fraction for each Landsat pixel using a SVR model trained in 50 cites across Europe (900,000 training samples). Validation of our maps for 2000 and 2015, across 10 validation cities, showed that we obtained high accuracies for all locations and years (i.e., RMSE ranging between 0.09 and 0.16 for both 2000 and 2015, Table 1) which is consistent with other studies on the accuracies and robustness of SVR over urban areas (e.g., Okujeni et al., 2018). A comparison with CORINE and Urban Atlas for a region in Brussels, illustrated that our maps better account for sparsely distributed vegetation.

    View all citing articles on Scopus
    View full text