Stochastic estimation of biogeochemical parameters from Globcolour ocean colour satellite data in a North Atlantic 3D ocean coupled physical–biogeochemical model

https://doi.org/10.1016/j.jmarsys.2013.02.007Get rights and content

Abstract

Biogeochemical parameters remain a major source of uncertainty in coupled physical–biogeochemical models of the ocean. In a previous study (Doron et al., 2011), a stochastic estimation method was developed to estimate a subset of biogeochemical model parameters from surface phytoplankton observations. The concept was tested in the context of idealised twin experiments performed with a 1/4° resolution model of the North Atlantic ocean. The method was based on ensemble simulations describing the model response to parameter uncertainty. The statistical estimation process relies on nonlinear transformations of the estimated space to cope with the non-Gaussian behaviour of the resulting joint probability distribution of the model state variables and parameters. In the present study, the same method is applied to real ocean colour observations, as delivered by the sensors SeaWiFS, MERIS and MODIS embarked on the satellites OrbView-2, Envisat and Aqua respectively. The main outcome of the present experiments is a set of regionalised biogeochemical parameters. The benefit is quantitatively assessed with an objective norm of the misfits, which automatically adapts to the different ecological regions. The chlorophyll concentration simulated by the model with this set of optimally derived parameters is closer to the observations than the reference simulation using uniform values of the parameters. In addition, the interannual and seasonal robustness of the estimated parameters is tested by repeating the same analysis using ocean colour observations from several months and several years. The results show the overall consistency of the ensemble of estimated parameters, which are also compared to the results of an independent study.

Highlights

► Three biological rates assumed stochastic in a 3D ocean physical–biogeochemical model. ► Nonlinear analysis to estimate parameters in Kalman filter framework (anamorphosis). ► Observed data is phytoplankton from ocean colour satellites: SeaWiFS, MODIS, MERIS. ► Phytoplankton growth, mortality and grazing rates regionalised on Longhurst regions. ► Resulting biogeochemical rates compare well to those obtained by an independent study.

Introduction

Coupled physical–biogeochemical models (CPBM) of the ocean have been the subject of major developments during the past years with the goal of delivering hindcasts of the biogeochemical state of the ocean (e.g. Brasseur et al., 2009, Ford et al., 2012), short-term forecasts or longer-term simulations in relation with climate change (IPCC, 2007). However, the capacity of these CPBMs to realistically represent the observed variability of biogeochemical marine properties is still limited by the various sources of errors that occur in the simulations as a result of imperfect descriptions of the physical environment that drives the biology and empirical parameterisations of the biogeochemical interactions. From the seminal work of Fasham et al. (1990) to present day models, (e.g. Aumont et al., 2003), a common questioning about the development of biogeochemical models is the choice of state variables (each pool representing an organic or inorganic quantity) and the deterministic relationships that govern the fluxes between these pools. There is still debate about the minimal number of variables required to represent a given process and the optimal level of complexity of the model chosen to realistically simulate basic features such as primary production (Friedrichs et al., 2007 or Kriest et al., 2010).

A number of studies performed with CPBM involve the comparison with real-world observations. A special issue of the Journal of Marine Systems (volume 76: Skill assessment for coupled biological/physical models of marine systems, 2009) was devoted to this question, describing a number of normalised tests to objectively assess the results of realistic simulations. Related to this question are the availability and quantity of data which can be used to validate model outputs. Ocean biogeochemistry can be characterised by different quantities, such as the concentration of chlorophyll a, micro and macro-nutrients, or particulate/dissolved organic/inorganic carbon/nitrogen. The main assets of in situ measurements are their accuracy and the possibility to sample the sub-surface water column at appropriate depths. However, such measurements are very sparse in space and time. Worldwide, very few sites in the open ocean are routinely monitored (e.g., on a monthly basis), whereas automatic sampling performed from floats or gliders is still a long-term prospective for biogeochemical quantities (Claustre et al., 2010). On the other hand, achieving a global coverage, repetitivity on a few days cycle and availability of the archive (more than a decade) are the main advantages of ocean colour satellite data. The main drawback is that the measurements from space are representative of surface quantities only.

The optimal combination of model results and observations is the field of data assimilation. A comprehensive review of data assimilation in CPBM is provided by Gregg (2008). A collection of studies has been performed quite recently using ocean colour data, which demonstrate the value of the assimilation concepts inherited from optimal control or estimation theory (Armstrong et al., 1995, Fontana et al., 2009, Fontana et al., 2010, Fontana et al., 2012, Ford et al., 2012, Friedrichs, 2002, Garcia-Gorriz et al., 2003, Gregg, 2008, Hemmings et al., 2003, Hemmings et al., 2004, Hu et al., 2012, Ishizaka, 1990, Korres et al., 2012, Losa et al., 2004, Mattern et al., 2012, Natvik and Evensen, 2003a, Natvik and Evensen, 2003b, Nerger and Gregg, 2007, Nerger and Gregg, 2008, Roy et al., 2012).

Very often, variational data assimilation methods are implemented with the purpose of parameter estimation. Model simulations performed after optimisation are expected to yield better agreement with the measured data. Other studies that rely on sequential methods such as the Kalman filter, aim at a better control of the model trajectory and/or biogeochemical parameters.

In a recent paper, Doron et al. (2011) proposed an ensemble approach to objectively estimate a subset of biogeochemical parameters using ocean colour observations. In that paper, three key biogeochemical model parameters were identified to have the strongest impact on the chlorophyll a concentrations. Assuming independent parameter uncertainties for the different ecological provinces of the North Atlantic basin, Monte Carlo experiments were carried out to explore the model sensitivity to the uncertain model parameters. More precisely, an ensemble of 200 one-month simulations was performed during the spring bloom, to describe the joint probability distribution of the model state and model parameters. This probability distribution was observed to non-Gaussian, so that the linear observational update formulas of the Kalman filter were not directly applicable. For this reason, a nonlinear transformation (anamorphosis) was introduced to restore the Gaussianity of the marginal probability distribution for every model state variable and every uncertain parameter. With this upgraded description of the prior probability distribution, parameter corrections were computed from synthetic observations of the phytoplankton concentration (in the context of twin experiments). The nonlinear transformation has proved to be a key ingredient to obtain reliable estimates of the uncertain parameters in the various ecological provinces of the North Atlantic (each one with a specific non-Gaussian behaviour).

The twin experiments of Doron et al. (2011) provided an idealised framework where the only error source lies in the parameter values while all other elements are perfectly controlled. However, the error sources in real-world simulations may have different origins: the model initial conditions, the probability distribution function of the parameters, the physical forcings, or even observation errors being possibly the largest. Therefore, in a realistic case where real world observations are assimilated, the assumptions that were necessary to develop the method are not strictly valid anymore. For all these reasons, a real challenge in using a biogeochemical parameter estimation method with real ocean colour observations is to reduce the uncertainty in the biogeochemical parameters of CPBM and to objectively quantify the spread of the posterior distribution.

In this context, the objective of the present paper is to investigate to what extent the estimation method described in Doron et al. (2011) can be implemented to obtain realistic corrections of a few biogeochemical parameters using real ocean colour observations. The present paper is structured as follows: Sections 2 and 3 present the coupled model, the ocean colour data and the parameter estimation method. Section 4 describes the results of the experiments and provides an assessment of the estimated parameters. In particular, the optimised parameters are compared to a regional set of biogeochemical parameters obtained in the frame of an independent approach (Losa et al., 2004). A summary, conclusions and future research lines are proposed in Section 5.

Section snippets

The coupled physical–biogeochemical model

The physical component of the coupled model is the primitive equation ocean circulation model OPA-NEMO (Madec et al. (1998), http://www.nemo-ocean.eu/), implemented in the North Atlantic basin between 20°S and 80°N, and between 98°W and 23°E with a horizontal resolution of 1/4° (i.e., the Drakkar NATL4 model configuration as described in Barnier et al., 2006). The vertical coordinate is geopotential with 46 prescribed levels enabling a resolution from 6 to 15 m in the upper 100 m. The physical

Parameter estimation

Doron et al. (2011) proposed a stochastic approach to perform parameter estimation using ocean colour data together with ensemble simulations. The method is derived from the Kalman filter and includes an augmented state vector approach to perform a joint analysis of the state variables and model parameters. The analysis step is performed after anamorphic transformation of the prior ensemble into a Gaussian distribution to maximise the information extracted from the data (Brankart et al. (2012)

Results

In this section are shown the results obtained with the implementation of the stochastic parameter estimation using real satellite ocean colour data. The results are organised in the following order: first, the analysis results, both for the parameters and [chl a], are considered when the observation data is from May 1998. Second, the method is extended to ocean colour observations captured for other time periods and the impact on parameters is assessed. Finally, the estimates of the three key

Summary and conclusions

In the present study, regional values of key parameters of a 3D biogeochemical ocean model are estimated through the assimilation of satellite measurements of ocean colour distributed by the Globcolour project. The method implemented in this study was previously developed in the framework of twin experiments (Doron et al., 2011). Using the augmented state vector formalism, the stochastic estimation method relies on multivariate correlations between biogeochemical parameters and model variables,

Acknowledgements

We are thankful to two anonymous reviewers for the many helpful comments on the manuscript. This work was supported by the ESA-funded ASSOCO project (ESA/ESRIN contract network 22408/09/I-EC) in the framework of the Changing Earth Science Network. Partial support of CNES and the European Commission under grant agreement FP7-SPACE-2011-1-CT-283367-MYOCEAN2 is gratefully acknowledged. Calculations were performed using HPC resources from GENCI-IDRIS (grant 2010-011279).

References (44)

  • S. Losa et al.

    Weak constraint parameter estimation for a simple ocean ecosystem model: what can we learn about the model and data?

    J. Mar. Syst.

    (2004)
  • S. Maritorena et al.

    Consistent merging of satellite ocean color data sets using a bio-optical model

    Remote. Sens. Environ.

    (2005)
  • J.P. Mattern et al.

    Estimating time-dependent parameters for a biological ocean model using an emulator approach

    J. Mar. Syst.

    (2012)
  • L. Natvik et al.

    Assimilation of ocean colour data into a biochemical model of the North Atlantic — part 1. Data assimilation experiments

    J. Mar. Syst.

    (2003)
  • L. Natvik et al.

    Assimilation of ocean colour data into a biochemical model of the North Atlantic — part 2. Statistical analysis

    J. Mar. Syst.

    (2003)
  • L. Nerger et al.

    Assimilation of SeaWiFS data into a global ocean-biogeochemical model using a local SEIK filter

    J. Mar. Syst.

    (2007)
  • L. Nerger et al.

    Improving assimilation of SeaWiFS data by the application of bias correction with a local SEIK filter

    J. Mar. Syst.

    (2008)
  • Y. Ourmières et al.

    On the key role of nutrient data to constrain a coupled physical–biogeochemical assimilative model of the North Atlantic Ocean

    J. Mar. Syst.

    (2009)
  • S. Roy et al.

    Sequential variations of phytoplankton growth and mortality in an NPZ model: a remote-sensing-based assessment

    J. Mar. Syst.

    (2012)
  • E. Simon et al.

    Gaussian anamorphosis extension of the DENKF for combined state parameter estimation: application to a 1D ocean ecosystem model

    J. Mar. Syst.

    (2012)
  • R. Armstrong et al.

    Ecological time series

  • O. Aumont et al.

    An ecosystem model of the global ocean including Fe, Si, P colimitations

    Global Biogeochem. Cycles

    (2003)
  • Cited by (31)

    • Information content of in situ and remotely sensed chlorophyll-a: Learning from size-structured phytoplankton model

      2018, Journal of Marine Systems
      Citation Excerpt :

      Another application of DA in biogeochemical modelling is related to optimization of parameters. In this case the model parameters are modified to fit the observations (e.g. Doron et al., 2013; Laiolo et al., 2016; Gharamti et al., 2017). The parameter optimization involves usually three main steps: (1) running the model forward in time, (2) comparing simulation results with the observations through a cost function to quantify the differences between the two datasets, and (3) modifying the parameter values accordingly until the best fit to the data is obtained (i.e. minimize the cost function value).

    • Data assimilation in a coupled physical-biogeochemical model of the California Current System using an incremental lognormal 4-dimensional variational approach: Part 2—Joint physical and biological data assimilation twin experiments

      2016, Ocean Modelling
      Citation Excerpt :

      Developments in data assimilation in physical oceanography for over two decades now provide many estimates of the physical ocean state at global (Behringer et al., 1998; Bell et al., 2000; Chassignet et al., 2007; Köhl et al., 2007; Balmaseda et al., 2008; Carton and Giese, 2008) and regional (Oke et al., 2008; Cummings et al., 2009; Broquet et al., 2009; Shulman et al., 2009; Kurapov et al., 2011; Matthews et al., 2012; Sakov et al., 2012) scales. Data assimilation techniques have been developed to a lesser extent in biological oceanography, though their application has been used for both the determination of poorly known model parameters (e.g., Matear, 1995; Spitz et al., 1998; Losa et al., 2004; Mattern et al., 2012; Roy et al., 2012; Doron et al., 2013; Simon et al., 2015) as well as quantitative improvement in modeled biological fields (see Gregg, 2008 and Edwards et al., 2015 for reviews). Data assimilation in physical-biological coupled systems has focused generally on either physical or biological data assimilation in isolation.

    View all citing articles on Scopus
    View full text