Discovering divergence in the thermal physiology of intertidal crabs along latitudinal gradients using an integrated approach with machine learning
Introduction
Physiological patterns that characterize different populations are strongly defined by environmental conditions (Hoffmann and Parsons, 1989, Somero, 2002, Khaliq et al., 2014) that, among others, determine distribution ranges, tolerance capacities, and ultimately organismal fitness. One of the main abiotic factors that affects physiological changes in ectotherm organisms is temperature (Johnston and Bennett, 2008, Castañeda et al., 2005, Mora and Maya, 2006, Angilletta, 2009, Lardies et al., 2011). Specifically, temperature has been shown to influence basic organismal functions, biochemical rates, locomotion, growth and reproduction (Kingsolver and Huey, 2008, Somero, 2010, Gaitán-Espitia et al., 2013a, Gaitán-Espitia et al., 2013b, Gaitán-Espitia et al., 2014). Therefore, temperature plays a fundamental role in species distribution patterns (Somero, 2005, Deutsch et al., 2008, Calosi et al., 2008). Latitudinal gradients along with intertidal gradients provide natural variation that can be used to investigate how temperature affects thermal physiology (Stillman and Somero, 2000, Helmuth et al., 2006). Coastal areas can be considered as natural laboratories where resident organisms may differ in terms of local adaptation and/or phenotypic plasticity, both mechanisms that allow populations to maximize fitness in response to environmental heterogeneity (Gardiner et al., 2010, Yampolsky et al., 2014).
Variations in physiological traits along environmental gradients are causes and consequences of phenotypic divergence in natural populations (Torres Dowdall et al., 2012), conferring local fitness advantages (Kawecki and Ebert, 2004). In general, these evaluations of phenotypic differentiation have been often correlated usually with latitude (Lindgren and Laurila, 2009, Zippay and Hofmann, 2010). Overall, The problem is that the statistics are limited due to the lack of flexibility by incorporating only univariate and linearity for estimations (Naya et al., 2011, Sunday et al., 2014, Weber et al., 2015) with a limited capacity of data interpretation. Others have shown that environmental variation affects integrated phenotypes involving several co-dependent traits (see, Salazar-Ciudad, 2007; Armbruster et al., 2014). Furthermore, phenotypic integration provides an explanation for how phenotypes are sustained by relationships between traits (Pigliucci and Preston, 2004). Because physiological traits play an important role in fitness (Ricklefs and Wikelski, 2002), the environmental characterization plays an important role in the local adaptation for the effectiveness in the survival and reproduction of the populations (McLean et al., 2014). In this sense, understanding how environmental gradients (for example, temperature) have effects on physiological traits is desirable to understand how the increase in global temperature can affect different populations (Magozzi and Calosi, 2015). In general, measuring physiological traits in a population reflects the costs and benefits associated with somatic maintenance in thermal environments (Heusner, 1985, Clarke and Johnston, 1999, Watson et al., 2014). Metabolic rate is the main parameter used to measure subsistence energy costs being directly related thermal sensitivities (Ruel and Ayres, 1999, Kovac et al., 2014) and thermal safety (see Sunday et al., 2011). Recovery time after critical thermal events also provides an index of how sensitive species are to climate (Castañeda et al., 2004, Castañeda et al., 2005). Also, morphometric characteristics have been shown to follow some biogeographic patterns, mainly latitudinal patterns (Angilletta et al., 2004, Bidau and Martí, 2007, Zamora‐Camacho et al., 2014).
Analyses involving computational intelligence could provide an understanding of the patterns that emerge from the interaction between organismal traits and how these interactions can be modified by the environment (Park and Chon, 2007). In most cases, these interactions between traits are often too complex and do not meet the assumptions of conventional statistical procedures (Recknagel, 2001, Kampichler et al., 2010). Machine learning has many applications (see Olden et al., 2008; Thessen, 2016), but notably it has been shown to be useful when disentangling associated variables to gain a deeper understanding of multiple interactions (Peters et al., 2014). In this sense, the phenotypic divergence in an integrated phenotype has been related to the relatively low amounts of phenotypic covariance in closely related populations (Game and Caley, 2006, Renaud et al., 2006) and other studies have shown otherwise (Arnold and Phillips, 1999), generally adjusting multivariate linear models. Therefore, more studies comparing the relationships between the traits are clearly necessary to understand the link in the divergence between populations. The number of methods used for integration in machine learning has grown steadily (Acevedo et al., 2009, Valletta et al., 2017). Therefore, there are multiple models that differ in the technique of integrating the variables. Overall, in order to better understand associations between multiple associated variables, a reduction in dimensionality is a key factor in the simplification of analysis (Kasun et al., 2016). Machine learning methods, in general can fall into two categories: (1) unsupervised learning (i.e., clustering), that identifies patterns in a heuristic way (Sathya and Abraham, 2013) and (2) supervised learning (i.e., classification) which can be used to infer a function from labeled training data. Due to the existence of many methods that perform similar machine learning functions, it is pertinent to compare different algorithms, since the performance of each algorithm differs given the clustering/classification problem (Caruana and Niculescu-Mizil, 2006, Übeyli, 2007), in order to unmask the patterns of association between the traits that emerge from the population divergence.
Using three crab species, which are distributed along a small vertical intertidal gradient (i.e. intertidal zone), we analyzed variation in thermal exposure at different spatial/temporal scales. Specifically, crabs in the lower intertidal experience acute thermal variation because they are exposed to periods of greater thermal changes only in periods of extremely low tides, while those in the high intertidal experience chronic thermal variation determined by daily tidal cycles. In addition, different populations of these species are distributed along a latitudinal-environmental gradient that covers more than 3000 km and is marked by gradual thermal variation (Barría et al., 2014; Gaitán-Espitia et al., 2014). We performed trait integration using machine learning, which allowed us to unravel differences that exist in the degree of association among physiological traits of crabs and their relation with phenotypic plasticity (see Gianoli and Palacio‐López, 2009). To investigate the physiological divergence among these closely related organisms that inhabit different habitats, we determined the variation in the phenotypic matrix. Finally using both conventional statistics and machine learning methods, we investigated the thermal geographic variation of physiological traits in intertidal crabs to determine the variation in a complex phenotype along the latitudinal and intertidal gradients.
Section snippets
Model species and intertidal variability
Samples of three species of intertidal crustaceans (i.e., Cyclograpsus cinereus, Petrolisthes violaceus, and Petrolisthes tuberculosus) were used for this study. Specifically, samples were collected at three different levels within the intertidal zone: High (0.6–1.0 m), Middle (0.3–0.5 m) and Low (0.1–0.2 m). The high intertidal is characterized by chronic environmental variability, the low intertidal experiences acute variability, and the middle intertidal can be considered as a transition
Thermal variability in the intertidal
The 12 month high resolution temperature data showed a clear latitudinal cline; temperature decreased towards higher latitudes with a thermal difference of 4.5 °C between the north and south extremes in the studied sites (Fig. 1B, C, D). The largest annual thermal range (i.e., the difference between the warmest and coolest temperatures) was found at the Talcaruca site located at the biogeographical break (13.8 ± 2.88 °C, mean ± sd - annual statistic). Comparing the temperature dynamics with the
Discussion
Variation in the phenotypes of ectothermic organisms has been shown to be associated with thermal gradients, which are known to generate strong gradients of selection (Angilletta et al., 2002, Yamahira et al., 2007). In order to understanding how physiological traits vary spatially, it is necessary to quantify the degree of association between the traits of spatially distributed populations (Via and Hawthorne, 2005). Here, we show that the integration of traits of species that inhabit acute
Acknowledgements
The study received financial support from CONICYT FONDECYT 1140092 grant to MAL. The Millennium Nucleus Center for the Study of Multiple-drivers on Marine Socio-Ecological Systems (MUSELS) by MINECON Project NC120086 also gave support to MAL and SJO during final stages of the project. MAL acknowledge the support of PIA CONICYT ACT-172037. SJO acknowledges BECAS CONICYT No. 21150739 for financial support. All experiments were conducted according to common Chilean law.
References (142)
- et al.
Automated classification of bird and amphibian calls using machine learning: a comparison of methods
Ecol. Inform.
(2009) Connectance in Sorghum development: beyond the genotype–phenotype duality
BioSystems
(2000)- et al.
The evolution of thermal physiology in ectotherms
J. Therm. Biol.
(2002) - et al.
Interpopulational variation in recovery time from chill coma along a geographic gradient: a study in the common woodlouse, Porcellio laevis
J. Insect Physiol.
(2005) - et al.
Thermal preference in Drosophila
J. Therm. Biol.
(2009) - et al.
Exploring physiological plasticity and local thermal adaptation in an intertidal crab along a latitudinal cline
J. Therm. Biol.
(2017) - et al.
Intraspecific variation in the metabolic scaling exponent in ectotherms: testing the effect of latitudinal cline, ontogeny and transgenerational change in the land snail Cornu aspersum
Comp. Biochem. Physiol. Part A: Mol. Integr. Physiol.
(2013) - et al.
Environmentally induced epigenetic transgenerational inheritance of phenotype and disease
Mol. Cell. Endocrinol.
(2012) - et al.
Classification in conservation biology: a comparison of five machine-learning methods
Ecol. Inform.
(2010) Major Neogene events of the Southeastern Pacific: the Chilean and Peruvian record
Palaeogeogr. Palaeoclimatol. Palaeoecol.
(1990)