J Plant Ecol ›› 2020, Vol. 13 ›› Issue (6): 732-737 .DOI: 10.1093/jpe/rtaa065

• Research Articles • Previous Articles     Next Articles

Bayesian model predicts the aboveground biomass of Caragana microphylla in sandy lands better than OLS regression models

Yi Tang1, Arshad Ali2,3, *, and Li-Huan Feng1   

  1. 1 School of Life Science, Liaoning University, Shenyang 110036, Liaoning, China, 2 Department of Forest Resources Management, College of Forestry, Nanjing Forestry University, Nanjing 210037, Jiangsu, China, 3 Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing 210037, Jiangsu, China

    *Corresponding author. E-mail: arshadforester@njfu.edu.cn
  • Received:2020-05-21 Revised:2020-08-18 Accepted:2020-10-01 Online:2020-10-10 Published:2020-12-01

Abstract:

Aims

In forest ecosystems, different types of regression models have been frequently used for the estimation of aboveground biomass, where Ordinary Least Squares regression models (OLS) are the most common prediction models. Yet, the relative performance of Bayesian and OLS models in predicting aboveground biomass of shrubs, especially multi-stem shrubs, has relatively been less studied in forests.

Methods

In this study, we developed the biomass prediction models for Caragana microphylla Lam. which is a widely distributed multi-stems shrub, and contributes to the decrease of wind erosion and the fixation of sand dunes in the Horqin Sand Land, one of the largest sand lands in China. We developed six types of formulations under the framework of the regression models, and then, selected the best model based on specific criteria. Consequently, we estimated the parameters of the best model with OLS and Bayesian methods with training and test data under different sample sizes with the bootstrap method. Lastly, we compared the performance of the OLS and Bayesian models in predicting the aboveground biomass of C. microphylla.

Important Findings

The performance of the allometric equation (power=1) was best among six types of equations, even though all of those models were significant. The results showed that mean squared error (MSE) of test data with non-informative prior Bayesian method (NPB) and the informative prior Bayesian method (IPB) was lower than with the OLS method. Among the tested predictors (i.e. plant height and basal diameter), we found that basal diameter was not a significant predictor either in OLS or Bayesian methods, indicating that suitable predictors and well-fitted models should be seriously considered. This study highlights that Bayesian methods, the bootstrap method and the type of allometric equation could help to improve the model accuracy in predicting shrub biomass in sandy lands.

Key words: Bootstrap, Caragana microphylla, Horqin Sandy Land, mean squared error, Prior information

摘要:
回归模型可用于预测森林生态系统地上生物量,其中最为常用的是最小二乘回归模型。在预测灌木,尤其是多茎灌木的地上生物量 时,最小二乘法与贝叶斯方法的比较很少被研究。我们开发了小叶锦鸡儿(Caragana microphylla Lam.)生物量预测模型。小叶锦鸡儿是科尔 沁沙地广泛分布的多茎灌木,对减少风蚀、固定沙丘具有重要作用。本研究建立6种表征生物量的异速增长模型,并基于统计标准选择 在预测生物量方面表现最佳的1种,然后分别用最小二乘法与贝叶斯方法对模型中的参数进行估计。参数估计过程中用自助法考察样本量大 小的影响,同时区分测试集与训练集。最后,我们比较了最小二乘法与贝叶斯方法在小叶锦鸡儿地上生物量预测中的表现。异速增长的6个 模型均达到显著水平,其中幂指数为1的模型表现最佳。研究结果表明,采用无先验信息与有先验信息的贝叶斯方法进行估计,得到的均 方误差在测试集上低于最小二乘法。另外,基径作为预测变量在最小二乘法与贝叶斯方法中均不显著,表明在生物量预测模型中应谨慎选 择合适变量。本研究强调贝叶斯方法、自助法和异速增长模型相结合能够提升沙地灌木生物量预测模型的准确度。

关键词: 自助法, 小叶锦鸡儿, 科尔沁, 均方误差, 先验信息