248x Filetype PDF File size 0.17 MB Source: www.stat.fi
Forest inventory - a challenge for statistics
Erkki Tomppo
Finnish Forest Research Institute
Unioninkatu 40 A
FIN-00170 Helsinki, Finland
erkki.tomppo@metla.
Introduction
Statistically designed forest inventories were introduced simultaneously in three Nordic
countries, Norway, Finland and Sweden, in the beginning of the 1920's (e.g. Ilvessalo 1927).
Estimating the forest area and the volume of growing stock as well as analysing of increment
and drain of growing stock have been original objectives of inventories. The scope of the rst
inventories was, however, already much wider, including, e.g., information on site types, forest
silvicultural state, structure of the growing stock, and applied and required silvicultural and
cutting regimes. Later, new parameters related to forest health and forest biodiversity have
appeared, e.g. species abundances and distributions.
The information needs are increasing, especially at the moment when the awareness of
the forest health status and loss of biological diversity has arisen, the role of forests in pre-
venting global warming has been recognised and, at the same time, the pressure to increase
timber consumption is increasing. For instance, the paper consumption has increased since the
beginning of 1970s from about 130 million tons to 276 million tons in 1995. It is expected to
increase to 420-440 million tons by 2010. On the other hand, one half of the harvest timber is
still used for cooking and heating causing wide area deforestation in the dry tropics.
Forest inventories have traditionally provided information related the biological diversity
of forests, such as the structure of growing stock, areas of site fertility classes and sometimes the
distribution and abundance of plant species. An increasing concern about the loss of diversity,
caused e.g. by deforestation, human induced environmental and climate changes as well as the
extinction of species, has increased interest in the whole forest ecosystem and its biological
diversity. The composition and structure of landscape, fragmentation of forests or land types,
the areas and spatial distributions of important habitat types are examples of characteristics
which can be measured in the context of large area inventories, at least when multi-source
information is utilised. Hanski (1999) has presented mathematical models that connect the
dynamics of species to the structure of fragmented landscapes.
Forests have also been seen as having a role in reducing the eects of global warming
by binding the increasing amount of carbon dioxide in the atmosphere. Global forest area is
known reasonably well. However, the annual incrementplays an importantroleincarbon
ux
and is not known globally.
In order to be able to satisfy the increasing and diverse demands for scientically sub-
stantiated information, ecient methods are needed to measure forest resources, their status
and the components of the whole forest ecosystem.
Examples of spatial variation types present in forests
Forest variables are commonly divided into groups describing an individual tree, a forest
stand (or a sample plot) and a forest region. Each variable has usually its own covariance
structure which depends on the geographical scale. A single tree stem volume is assessed as
an integral of a stem curve, i.e. diameter as a function of height, V = Rhd(h)dh. Trees in a
0
same stand are of similar form while those further apart from each other dier more. Tree stem
form depends also on the tree species and site variables. The within stand and between stand
variation can be modelled, e.g, with mixed models, d (h) = f(x;y)+v (h)+e (h); where
ki k ki
dki(h) is the diameter of a tree i in a stand k, f(x;y) is a function of tree variables x and stand
variables y, vk(h) is a random stand eect and eik a random tree eect (Lappi 1996).
The relative locations of trees, spatial pattern of trees, aects for instance the eciency
of sampling design, the growth of trees and can thus be utilised in planning sampling methods,
in assessing the naturalness of forests, e.g. Spatial point processes, e.g. Gibbs processes, have
been used in modelling spatial patterns of trees (Sarkka and Tomppo 1998).
Variables, like growth factors, site fertility, nutrient availability,cumulative temperature
sum of growing season, e.g., are examples of variations of dierent scales. These can regarded
as a realisations of stochastic processes on the plane. These, on the other hand, very much
determine, the structure of the growing stock and its variability. Present silvicultural practice
has led to forests which are mosaics of stands of dierent age classes and tree species composi-
tions with a specic tree form and spatial patterns of trees. The distribution of stands can be
regarded as an output of mosaic models (Stoyan et al. 1995).
All these variations should be taken into account when planning sampling design of a
forest inventory, in parameter estimation and in deriving condence limits of estimators. On
the other hand, practical questions, such as moving between sampling units aects the costs
and should be considered in minimising standard errors with given costs.
Parameters estimation with eld data
Forest inventories have motivated much of the pioneering work on the general theory
of line surveys, systematic sampling, and spatial statistics. Spatial autocorrelation of trees,
stand and regional variables often leads to multi-stage sampling. An increasing utilisation of
supplementary data, e.g. remote sensing data or other georeferenced data, leads to two-phase
(double) sampling (Cochran 1963).
Multi-stage sampling are commonly used in tree measurements. Few parameters, which
vary much also between trees and which are usually easy to measure, are measured from each
tree to be sampled. A smaller sub-sample is taken of the rst sample for measuring addi-
tional variables which usually vary less within stand or within a eld plot. Statistical questions
are, how many stages should be used, what are sizes of all samples, what variables should
be measured in each stage, and how to estimate the variables which are not measured. Tree
level volume and increment estimates are usually derived from the most intensive measure-
ments. Statistical models, e.g. non-parametric regression analysis, are applied in estimating
the variables for trees of less intensive samples.
The interest in forest inventory is often in the quantity M = R z(t)dt=R y(t)dt; where
A A
A is an inventory area, z(t);t 2 R2 the variable of interest, e.g., an indicator of a land use
class, volume of timber assortment and y(t);t 2 R2 is an indicator function of the stratum
(e.g, forestry land). After estimating all variables for each sampling unit, e.g. volumes for each
tree, estimation of area and volume parameters of a forest region leads to a ratio estimator
P P
n n 2
m= izi= i yi =z= y. A natural reliability measure for the estimator is E(m M) . Unbi-
ased estimator for systematic sampling is not known. Conservative estimators can be derived
utilising the properties of second order stationary processes (Matern 1960). The parameter
estimation of spatial pattern models with Gibbs processes can be based on the properties of
Palm distributions of the process and a chosen test function (Fiksel 1988). Another possibility
is to utilise approximative maximum likelihood approaches, for a review, see Geyer (1998).
Estimation of parameters with multi-source inventory
The increasing availability of supplementary data, e.g. remote sensing data, has changed
the requirements for statistical methods in forest inventories. Supplementary data is usually
cheap but much less accurate than eld measurements. A proper use of the data can, however,
make the inventories more ecient. Some practical questions, like availability of data, due
to weather conditions, e.g., still prevent the full utilisation of data. The estimation with
supplementary data could be done in the framework of two-phase (double) sampling. A non-
parametric k-nn method, adopted in the Finnish national forest inventory, an be considered
as an extension of double sampling (Kilkki and Paivinen 1987, Tomppo 1996). An essential
property is that all inventory variables, typically 100 to 400, can be estimated at the same time.
The procedure utilises a distance measure dened in the feature space of the supplemen-
tary data, denoted here by d, and denes new area weights for each eld plot by computation
units. The weight of eld plot i to pixel p is dened as
k
w = 1 =X 1 ; (1)
i;p 2 2
d d
pi;p j=1 p(j);p
if pixel p is among the k nearest to p, otherwise w = 0. Here, k is a predened xed
i i;p
number. The weights w are summed over pixels p by computation units u (for example by
i;p
municipalities) yielding the weight of plot i to computation unit u
c =Xw : (2)
i;u i;p
p2u
The sum c can be interpreted as that area (in pixels) of unit u, which is most similar to
i;u
sample plot i. The plots outside u may also receive positiveweights (synthetic estimation).
Thek-nnmethodisa
exibleandpracticalwaytocombineeldmeasurementsandsupple-
mentary data into an operational inventory system, to obtain much more detailed information
about forests with very low additional costs compared to the inventory methods employing
sampling and eld measurements only. The method is more statistically oriented than the
old classication-based approach to the use of satellite images. The key feature concerning
the routine operational use is, that the image processing phase does not depend on the forest
variables to be estimated. After computing the sample plot weights c for each computation
i;u
unit u of interest, the image data is no longer used, and in principle, all parameters can be
estimated as weighted averages of eld plot data. Also because the weights are the same for all
variables, the method preserves the natural dependence structures between forest parameters.
Further advantages are the applicabilitytovery dierenttypes of forests and the possibilityto
use dierent kinds of remote sensing material, both with only minor modications.
Forthek-nnmethod,theRMSEofthepixellevelestimatescanbestatisticallyassessed by
cross validation. However, the error in the estimate of a forest parameter in one pixel is highly
dependent on the true value there, and thereby the errors are spatially correlated. The error
structure is made even more complex by the spatial dependencies in the image itself and the
errors of possible other data sources like maps. Developing an operationally usable statistical
error assessment technique is a highly challenging task, and a fully satisfactory solution is yet
to be found.
Conclusions
Forest inventories have motivated pioneering work in statistics, especially in spatial statis-
tics. Global forestry information needs are increasing at the moment when the requirements to
increases timber production, and at the same, to preserve forest ecosystems exist. Increasing
amount of versatile supplementary data makes it possible to increase the eciency of inven-
tories. Analysis of temporally and spatially correlated, multi-temporal, multi-resolution and
multi-source data sets of future forest inventories is achallenging task for statistics.
REFERENCES
Cochran, W.G. (1963) Sampling Techniques. Wiley. New York.
Geyer, C. J. (1998). Likelihood inference for spatial point processes. In O. E. Barndor-
Nielsen, W. S. Kendall and M. N. M. van Lieshout (eds.), Stochastic Geometry, Likelihood
and Computation, no. 80 in Monographs on Statistics and Applied Probability. Chapman and
Hall/CRC.
Hanski, I. (1999): Metapopulation Ecology. Oxford University Press, Oxford.
Fiksel, T. (1988). Estimation of interaction potentials of Gibbsian point processes. Statistics
19, 77-86.
Ilvessalo, Y. (1927) The forests of Suomi (Finland). Results of the general survey of the forests
of the country carried out during the years 1921{1924. (In Finnish with English summary).
Communicationes ex Instituto Quaestionum Forestalium Finlandie 11.
Kilkki, P.andPaivinen, R. (1987). Reference sample plots to combine eld measurements and
satellite data in forest inventory. University of Helsinki, DepartmentofForest mensuration and
management. Research Notes 19, 209-215.
Lappi, J. (1986). Mixed linear models for analyzing and predicting stem form variation of Scots
Pine. Communicationes Instituti Forestalis Fenniae. 134.
Matern, B. (1960) Spatial variation. Medd. Statens Skogsf. Inst. 49(5). Also appeared as
number 36 of Lecture Notes in Statistics. Springer-Verlag, New York, 1986.
Stoyan, D., Kendall, W. S. and Mecke, J. (1995). Stochastic Geometry and its Applications.
2nd edn. Wiley. New York
Sarkka, A. and Tomppo, E. (1998). Modelling interactions between trees by means of eld
observations. Forest Ecology and Management 108, 57-62.
Tomppo,E.(1996)Multi-sourcenationalforestinventory ofFinland. In R.Paivinen, J. Vanclay
and S. Miina (eds.), New Thrusts in Forest Inventory. Proceedings of IUFRO XX World
Congress, 6{12 Aug. 1995, Tampere, Finland, pp. 27{41.
no reviews yet
Please Login to review.