Examples

knitr::opts_chunk$set(warning = FALSE, message = FALSE)
library(iPastoralist)

Example 1 - use of ‘vegetation_abundance’ function

Suppose we want to convert Frequency of occurrence (FO) to Species percentage cover (%SC), considering also occasional species. As the %SC for each survey will (likely) be greater than 100, we want to rescale %SC of each species per each survey to obtain a sum of 100 (i.e. a proportion of %SC). The dataset used as example is shown in “Data input format” vignette, where the total measurements per transect is 25. Therefore, to obtain $SC FO should be multiplied by 4 (so that they refer to 100 measurements.)

library(iPastoralist)
vegetation.sc<-vegetation_abundance(database = vegetation,
                                    species.cover.coefficient = 4,
                                    method = "SRA_SC.fo.occ")

head(vegetation.sc)

	R1	R2	R7	R9	R10	R11	R12	R14
Achmacr	0.0000000	0.0000000	0.0000000	0.000000	0.000000	0.0000000	0.000000	2.643754
Achmill	0.0000000	0.0000000	0.0918555	6.118079	0.103484	0.0927357	11.289867	0.000000
Achmosc	0.0000000	0.0000000	0.0000000	0.000000	0.103484	0.0000000	0.000000	0.000000
Acialpi	0.0000000	0.0000000	0.0000000	0.000000	0.103484	2.4729521	2.257973	0.000000
Acolama	0.0971817	0.0000000	0.0000000	0.000000	0.000000	0.0000000	0.000000	2.643754
Adealli	0.0000000	0.1612903	0.0000000	0.000000	0.000000	0.0000000	0.000000	21.150033

we can check that the sum of %SC for each survey is 100

colSums(vegetation.sc)
#>  R1  R2  R3  R4  R5  R6  R7  R8  R9 R10 R11 R12 R13 R14 R15 R16 R17 R18 R19 R20 
#> 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 100 
#> R21 R22 R23 R24 
#> 100 100 100 100

Example 2 - computation of ecological indexes

In this case we want to compute the average Landolt indicator values for each survey, weighted with species abundance.

If the occasional species are not considered, the SRA will be used. Conversely, if we would like to keep into account also occasional species, the SRA will be calculated with the %SC rescaled to 100 (more detail in “vegetation_abundance” function).

In this case we will consider also occasional species.

The input database is the one with the Frequency of occurrences, i.e. the dataframe used in this tutorial named “vegetation”.

ec.index<-ecological_indexes(database.vegetation = vegetation,
                             database.indexes = data[,c("F_Landolt","R_Landolt","N_Landolt")],
                             occasional.species = TRUE,
                             species.cover.coefficient = 4,
                             weight = TRUE)
#> [1] "INDEX WEIGHTED WITH OCCASIONAL SPECIES"

Notes about the “ecological_indexes” function:

database.indexes = database with Ecological indicators, without the column of species names. NA values must indicated as 999
occasional.species = Logical. TRUE if you want to take into account occasional species.
species.cover.coefficient = only if “occasional.species=TRUE”. Coefficient that multiplies FO so that the number of total touches refer to 100
weight: Logical. TRUE if you want to weight Ecological indicators with abundance.

the output will be as follow:

ec.index
#>    survey F_Landolt R_Landolt N_Landolt
#> 1      R1  2.971600  2.253535  2.649474
#> 2      R2  3.716373  3.265993  2.867099
#> 3      R3  3.215537  2.981710  3.898948
#> 4      R4  3.537678  2.765013  3.392741
#> 5      R5  2.943190  2.734974  3.177504
#> 6      R6  2.926874  2.671846  2.978367
#> 7      R7  2.970695  2.635527  3.145236
#> 8      R8  3.362342  3.126754  4.306474
#> 9      R9  3.071611  2.824006  3.469644
#> 10    R10  2.612803  2.806719  2.820766
#> 11    R11  2.519994  2.808848  2.687676
#> 12    R12  2.812606  2.856213  3.017767
#> 13    R13  3.386566  2.210897  2.851104
#> 14    R14  3.417881  2.941060  4.184106
#> 15    R15  2.996712  2.227660  2.680000
#> 16    R16  3.073070  1.668550  2.091525
#> 17    R17  3.597692  2.285484  3.584541
#> 18    R18  2.464270  2.142214  2.375730
#> 19    R19  3.519828  2.043754  3.045977
#> 20    R20  2.802508  2.937901  3.024198
#> 21    R21  2.739275  2.523727  2.727625
#> 22    R22  2.895071  2.384567  2.791045
#> 23    R23  2.695533  2.274914  2.006186
#> 24    R24  2.683857  1.880419  2.004484

Example 3 - Matching a dendrogram with vegetation data

The aim of this example is to generate a cluster analysis of vegetation data, with associated to the dendrogram the species of each survey ordered by their abundance. This approach would help where to cut the dendrogram.

From the dataset shown in ‘Data input format’ vignette, the columns related to plant species names and all surveys are selected.

vegetation<-data[,c(2,7:30)]

species.name.code	R1	R2	R3	R4	R5	R6	R7	R8	R9	R10	R11	R12	R13	R14
Achmacr	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	1
Achmill	NA	NA	NA	NA	NA	NA	999	NA	5	999	999	10	NA	NA
Achmosc	NA	NA	NA	NA	NA	NA	NA	NA	NA	999	NA	NA	NA	NA
Acialpi	NA	NA	NA	NA	NA	NA	NA	NA	NA	999	2	2	NA	NA
Acolama	999	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	1
Adealli	NA	999	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA	8

Then, we need to compute the Species Relative Abundance

sra<-vegetation_abundance(database = vegetation,method = "SRA_fo")

head(sra)

	R9	R11	R12	R14
Achmacr	0.00	0.000000	0.000000	2.702703
Achmill	6.25	0.000000	11.494253	0.000000
Achmosc	0.00	0.000000	0.000000	0.000000
Acialpi	0.00	2.531646	2.298851	0.000000
Acolama	0.00	0.000000	0.000000	2.702703
Adealli	0.00	0.000000	0.000000	21.621622

Now we can generate a dendrogram using the hclust function of the R ‘stats’ base package.

db.dendro<-t(sra)#for the cluster analysis the database has to be transposed so to have surveys on rows and species on columns

library(amap)#package for computing a distance matrix
d <- Dist(db.dendro, method = "correlation")#distance matrix
cluster <- hclust(d, method="average") # clustering method

#plotting the dendrogram
par(cex=0.5, mar=c(5, 8, 4, 1))#set label size
plot(cluster,cex=0.8,hang =-1,
     main=paste0("dMatrix: Correlation","---","clustMet: UPGMA"))

To extract the first ten species ordered decreasingly by their SRA for each survey, we can run the ‘first_ten_species’ function:

firstTenSpecies<-first_ten_species(data_SRA_SC = sra ,#database with species abundance. In this example is the 'sra' database 
                                   join.dendrogram = TRUE,#LOGICAL. TRUE if species abundance need to be joined with a dendrogram
                                   cluster.hclust = cluster)# object of containing the output from 'hclust' function of 'stats' package

	Survey	Survey.order	V1	V2	V3	V4	V5	V6	V7	V8
16	R23	1	Rhoferr 63.9	Junnana 30.6	Oxaacet 2.8	Vacmyrt 2.8	Achmacr 0	Achmill 0	Achmosc 0	Acialpi 0
17	R24	2	Rhoferr 63.6	Fesscab 18.2	Aveflex 9.1	Agrcapi 3	Arnmont 3	Vacmyrt 3	Achmacr 0	Achmill 0
5	R13	3	Calvill 39.6	Alnviri 18.9	Rubidae 18.9	Rhoferr 17	Gymdryo 3.8	Galtetr 1.9	Achmacr 0	Achmill 0
8	R16	4	Vacmyrt 28.8	Calvill 27.3	Rhoferr 22.7	Aveflex 7.6	Gymdryo 3	Rubidae 3	Vacviti 3	Antalpi 1.5
6	R14	5	Adealli 21.6	Rubidae 16.2	Urtdioi 16.2	Rumalpe 13.5	Alnviri 8.1	Galtetr 8.1	Viobifl 5.4	Achmacr 2.7
9	R17	6	Alnviri 48.9	Agrcapi 14.9	Viobifl 10.6	Adealli 6.4	Alcvulg 4.3	Peuostr 4.3	Verlobe 4.3	Luzalpi 2.1
11	R19	7	Alnviri 45.2	Calvill 11.9	Rhoferr 9.5	Agrcapi 7.1	Vacmyrt 7.1	Antalpi 2.4	Genpurp 2.4	Gersylv 2.4
12	R2	8	Descesp 51.1	Carclav 22.2	Poaalpi 8.9	Fesnigr 4.4	Carechi 2.2	Luzcamp 2.2	Narstri 2.2	Peuostr 2.2
19	R4	9	Descesp 34.3	Phlrhae 16.4	Fesnigr 11.9	Alcvulg 9	Verlobe 7.5	Peuostr 6	Rumalpe 4.5	Caramar 3
3	R11	10	Fesmela 20.3	Thaminu 12.7	Rubidae 8.9	Chahirs 6.3	Rhoferr 6.3	Vacmyrt 6.3	Agrcapi 5.1	Fesovin 3.8
13	R20	11	Rubidae 20.5	Agrcapi 18.1	Poanemo 15.7	Myoalpe 12	Silvulg 6	Rhoferr 3.6	Rumalpe 3.6	Triflav 3.6
10	R18	12	Fesscab 28.3	Narstri 18.9	Fesnigr 15.1	Leohelv 7.5	Antalpi 5.7	Phlrhae 5.7	Agrcapi 3.8	Poaalpi 3.8
21	R6	13	Agrcapi 14.1	Fesnigr 10.6	Rumalpe 10.6	Rhoferr 8.2	Silvulg 8.2	Chahirs 7.1	Triflav 5.9	Alcvulg 4.7
7	R15	14	Vacmyrt 20.9	Rhoferr 13.4	Fesnigr 10.4	Agrcapi 9	Chahirs 7.5	Antalpi 6	Phlrhae 6	Poachai 4.5
2	R10	15	Fesnigr 14.3	Ranmont 12.9	Agrcapi 8.6	Phlrhae 8.6	Chahirs 7.1	Fesscab 5.7	Lashall 5.7	Bislaev 4.3

This output can be graphically merged with the dendrogram. First of all it is needed to create a database for running the cluster analysis with (e.g. four) species ordered by abundance as row labels, with the use of dendspe function:

db.dendroLab<-denspe(nspe=4,
                     SpeAbund = firstTenSpecies,
                     dbClust = db.dendro)

Then, the cluster analysis can be run again by using the new database and then plot the dendrogram:

d.lab <- Dist(db.dendroLab,#database name
           method= "correlation") 
cluster.lab <- hclust(d.lab,#distance matrix
                   method="average") 

cluster.plot<-as.dendrogram(cluster.lab)#coversion of cluster to class 'dendrogram' of 'dendexted' package (which allows a better customisation of the plot)

par(mfrow=c(1,1),mar=c(1,2,2,18))#graphical settings
plot(cluster.plot %>% 
  set("labels_cex", 0.6),horiz=T)

From this view, it can be easier to identify survey groups based on their vegetation composition and abundance. Suppose here to identify 8 groups, that can be visually shown with the following base plotting functions:

par(mfrow=c(1,1),mar=c(1,2,2,18))#graphical settings
cluster.plot %>% 
  set("labels_cex", 0.6) %>% 
  set("branches_k_color", 
      k = 8) %>% 
  plot(main = "Dendrogram",horiz=T)

Now, for each group it would be helpful to compute the average composition. First of all we can extract the ID group for each survey with the ‘clustOrder’ function:

group.id<-clustOrder(cluster.hclust = cluster,# object of containing the output from 'hclust' function of 'stats' package
                      cluster.group = T, #Logical. TRUE if you want to specify the number of groups of the dendrogram. 
                      cluster.number = 8) # Specify the number of groups.

Survey	cluster	Survey.order
R1	1	17
R10	1	15
R11	4	10
R12	1	22
R13	5	3
R14	6	5
R15	1	14
R16	5	4
R17	7	6
R18	1	12

The ‘cluster’ column is the one according to which the vegetation surveys will be pooled together. Therefore, we first need to add this column to the database with the abundances used as input of the dendrogram (i.e. ‘db.dendro’ dataset)

dendro.merge<-merge(group.id,db.dendro,by.x = "Survey",by.y = "row.names")
row.names(dendro.merge)<-dendro.merge[,1]#renaming dataset rows
dendro.merge1<-dendro.merge[,-c(1,3)]#deleting useless columns

	cluster	Achmacr	Achmill	Acialpi	Acolama	Adealli	Agrcapi	Alcvulg	Alnviri
R1	1	0.000000	0.00000	0.000000	0.000000	0.000000	6.666667	0.000000	0.000000
R10	1	0.000000	0.00000	0.000000	0.000000	0.000000	8.571429	0.000000	0.000000
R11	4	0.000000	0.00000	2.531646	0.000000	0.000000	5.063291	0.000000	0.000000
R12	1	0.000000	11.49425	2.298851	0.000000	0.000000	6.896552	3.448276	0.000000
R13	5	0.000000	0.00000	0.000000	0.000000	0.000000	0.000000	0.000000	18.867925
R14	6	2.702703	0.00000	0.000000	2.702703	21.621622	0.000000	0.000000	8.108108
R15	1	0.000000	0.00000	0.000000	0.000000	0.000000	8.955224	0.000000	0.000000
R16	5	0.000000	0.00000	0.000000	0.000000	0.000000	0.000000	0.000000	0.000000
R17	7	0.000000	0.00000	0.000000	0.000000	6.382979	14.893617	4.255319	48.936170
R18	1	0.000000	0.00000	0.000000	0.000000	0.000000	3.773585	0.000000	0.000000

Lastly, with the ‘clustGroupAggregate2’ function (clustGroupAggregate() will be deprecated!!!), the average composition for each group can be computed:

aggregate<-clustGroupAggregate2(dendro.merge1)

Let’s extract the table format of aggregate

aggregate$table

1_species	1_abundance	2_species	2_abundance	3_species	3_abundance	4_species	4_abundance	5_species	5_abundance	6_species	6_abundance	7_species	7_abundance
Fesnigr	17.557039	Descesp	42.719735	Rumalpi	18.333333	Fesmela	20.253165	Rhoferr	41.808414	Adealli	21.621622	Alnviri	47.087133
Phlrhae	8.695403	Carclav	11.111111	Rumalpe	16.892655	Thaminu	12.658228	Calvill	16.723842	Rubidae	16.216216	Agrcapi	11.018237
Agrcapi	6.197290	Phlrhae	9.320066	Triflav	16.850282	Rubidae	8.860760	Vacmyrt	8.648990	Urtdioi	16.216216	Calvill	5.952381
Antalpi	5.326626	Fesnigr	8.192372	Phlrhae	10.946328	Chahirs	6.329114	Junnana	8.017677	Rumalpe	13.513514	Viobifl	5.319149
Chahirs	4.379044	Poaalpi	5.936982	Peuostr	10.169491	Rhoferr	6.329114	Rubidae	5.474557	Alnviri	8.108108	Rhoferr	4.761905
Vacmyrt	4.036279	Alcvulg	4.477612	Alcvulg	7.556497	Vacmyrt	6.329114	Alnviri	4.716981	Galtetr	8.108108	Vacmyrt	4.635258
Alcvulg	3.947029	Peuostr	4.096186	Poaalpi	5.028249	Agrcapi	5.063291	Fesscab	4.545454	Viobifl	5.405405	Adealli	3.191489
Triflav	3.866699	Verlobe	3.731343	Chahirs	5.000000	Fesovin	3.797468	Aveflex	4.166667	Achmacr	2.702703	Luzalpi	2.254306
Ranmont	3.817788	Rumalpe	2.238806	Myoalpe	2.500000	Acialpi	2.531646	Gymdryo	1.700972	Acolama	2.702703	Alcvulg	2.127660
Poaalpi	3.422911	Caramar	1.492537	Agrcapi	1.680791	Antalpi	2.531646	Arnmont	1.136364	Fesnigr	2.702703	Peuostr	2.127660

Marco Pittarello

Example 1 - use of ‘vegetation_abundance’ function

Example 2 - computation of ecological indexes

Example 3 - Matching a dendrogram with vegetation data