Help for package RVAideMemoire

Encoding:

UTF-8

Type:

Package

Title:

Testing and Plotting Procedures for Biostatistics

Version:

0.9-83-12

Date:

2025-07-12

Imports:

ade4 (≥ 1.7-8), boot, car, FactoMineR, graphics, grDevices, lme4 (≥ 1.0-4), MASS, nnet, pls, pspearman, stats, utils, vegan (≥ 2.4-3)

Suggests:

ape, betareg, dgof, emmeans (≥ 1.11.2), EMT, FSA, glmmTMB, labdsv, mixOmics, MuMIn, mvnormtest, ordinal, RGCCA, statmod, survival

Description:

Contains miscellaneous functions useful in biostatistics, mostly univariate and multivariate testing procedures with a special emphasis on permutation tests. Many functions intend to simplify user's life by shortening existing procedures or by implementing plotting functions that can be used with as many methods from different packages as possible.

Enhances:

glmmADMB

Additional_repositories:

http://r-forge.r-project.org

License:

GPL-2

LazyLoad:

yes

NeedsCompilation:

Packaged:

2025-07-15 05:24:13 UTC; maherve

Author:

Maxime HERVE [aut, cre]

Maintainer:

Maxime HERVE <maxime.herve@univ-rennes.fr>

Repository:

CRAN

Date/Publication:

2025-07-15 06:00:02 UTC

Testing and Plotting Procedures for Biostatistics

Description

Details

Package:	RVAideMemoire
Type:	Package
Version:	0.9-83-12
Date:	2025-07-12
License:	GPL-2
LazyLoad:	yes

Author(s)

Maxime HERVE

Maintainer: Maxime HERVE <maxime.herve@univ-rennes.fr>

References

Document : "Aide-memoire de statistique appliquee a la biologie - Construire son etude et analyser les resutats a l'aide du logiciel R" (available on CRAN)

Anova Tables for Cumulative Link (Mixed) Models

Description

These functions are methods for Anova to calculate type-II or type-III analysis-of-deviance tables for model objects produced by clm and clmm. Likelihood-ratio tests are calculated in both cases.

Usage

## S3 method for class 'clm'
Anova(mod, type = c("II", "III", 2, 3), ...)

## S3 method for class 'clmm'
Anova(mod, type = c("II", "III", 2, 3), ...)

Arguments

mod

clm or clmm object.

type

type of test, "II", "III", 2 or 3.

...

additional arguments to Anova. Not usable here.

Details

See help of the Anova for a detailed explanation of what "type II" and "typ III" mean.

Value

See Anova.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Cross validation

Description

Performs cross validation with correspondence discriminant analyses.

Usage

CDA.cv(X, Y, repet = 10, k = 7, ncomp = NULL, method = c("mahalanobis",
  "euclidian"))

Arguments

X

a data frame of dependent variables (typically contingency or presence-absence table).

Y

factor giving the groups.

repet

an integer giving the number of times the whole procedure has to be repeated.

k

an integer giving the number of folds (can be re-set internally if needed).

ncomp

an integer giving the number of components to be used for prediction. If NULL all components are used.

method

criterion used to predict class membership. See predict.coadisc.

Details

The training sets are generated in respect to the relative proportions of the levels of Y in the original data set (see splitf).

Value

repet

number of times the whole procedure was repeated.

k

number of folds.

ncomp

number of components used.

method

criterion used to classify individuals of the test sets.

groups

levels of Y.

models.list

list of of models generated (repet*k models), for PLSR, CPPLS, PLS-DA, PPLS-DA, LDA and QDA.

NMC

Classification error rates (repet values).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(ade4)
data(perthi02)
## Not run: CDA.cv(perthi02$tab,perthi02$cla)

Significance test for CDA

Description

Performs a significance test for correspondence discriminant analysis. See Details.

Usage

CDA.test(X, fact, ncomp = NULL, ...)

Arguments

X

a data frame of dependent variables (typically contingency or presence-absence table).

fact

factor giving the groups.

ncomp

an integer giving the number of components to be used for the test. If NULL nlevels(fact)-1 are used. See Details.

...

other arguments to pass to summary.manova. See Details.

Details

CDA consists in two steps: building a correspondence analysis (CA) on X, then using row coordinates on all CA components as input variables for a linear discriminant analysis. CDA.test builds the intermediate CA, then uses the first ncomp components to test for an effect of fact. If 1 component is used the test is an ANOVA, if more than 1 component are used the test is a MANOVA.

Value

An ANOVA or MANOVA table.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(ade4)
data(perthi02)

CDA.test(perthi02$tab,perthi02$cla)

Cross validation

Description

Performs cross validation with DIABLO (block.plsda or block.splsda).

Usage

DIABLO.cv(x, method = c("mahalanobis.dist", "max.dist", "centroids.dist"),
  validation = c("Mfold", "loo"), k = 7, repet = 10, ...)

Arguments

x

an object of class "sgccda".

method

criterion used to predict class membership. See perf.

validation

a character giving the kind of (internal) validation to use. See perf.

k

an integer giving the number of folds (can be re-set internally if needed).

repet

an integer giving the number of times the whole procedure has to be repeated.

...

other arguments to pass to perf.

Details

The function uses the weighted predicted classification error rate (see perf).

Value

repet

number of times the whole procedure was repeated.

k

number of folds.

validation

kind of validation used.

ncomp

number of components used.

method

criterion used to classify individuals of the test sets.

NMC.mean

mean classification error rate (based on repet values).

NMC.se

standard error of the classification error rate (based on repet values).

Author(s)

Maxime HERVE <mx.herve@gmail.com>

Examples

## Not run: 
require(mixOmics)
data(nutrimouse)
data <- list(gene=nutrimouse$gene,lipid=nutrimouse$lipid,Y=nutrimouse$diet)
DIABLO <- block.plsda(X=data,indY=3)
DIABLO.cv(DIABLO)

## End(Not run)

Significance test based on cross-validation

Description

Performs a permutation significance test based on cross-validation with DIABLO (block.plsda or block.splsda).

Usage

DIABLO.test(x, method = c("mahalanobis.dist", "max.dist", "centroids.dist"),
  validation = c("Mfold", "loo"), k = 7, nperm = 999, progress = TRUE, ...)

Arguments

x

an object of class "sgccda".

method

criterion used to predict class membership. See perf.

validation

a character giving the kind of (internal) validation to use. See perf.

k

an integer giving the number of folds (can be re-set internally if needed).

nperm

number of permutations.

progress

logical indicating if the progress bar should be displayed.

...

other arguments to pass to perf.

Details

The function uses the weighted predicted classification error rate (see perf).

Value

method

a character string indicating the name of the test.

data.name

a character string giving the name of the data, plus additional information.

statistic

the value of the test statistics (classification error rate).

permutations

the number of permutations.

p.value

the p-value of the test.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

## Not run: 
require(mixOmics)
data(nutrimouse)
data <- list(gene=nutrimouse$gene,lipid=nutrimouse$lipid,Y=nutrimouse$diet)
DIABLO <- block.plsda(X=data,indY=3)
DIABLO.test(DIABLO)

## End(Not run)

G-test for binary variables

Description

Performs a G-test for comparing response probabilities (i.e. when the response variable is a binary variable). The function is in fact a wrapper to the G-test for comparison of proportions on a contingency table. If the p-value of the test is significant, the function performs pairwise comparisons by using G-tests.

Usage

G.bintest(formula, data, alpha = 0.05, p.method = "fdr")

Arguments

formula

a formula of the form a ~ b, where a and b give the data values and corresponding groups, respectively. a can be a numeric vector or a factor, with only two possible values (except NA).

data

an optional data frame containing the variables in the formula formula. By default the variables are taken from environment(formula).

alpha

significance level to compute pairwise comparisons.

p.method

method for p-values correction. See help of p.adjust.

Details

If the response is a 0/1 variable, the probability of the '1' group is tested. In any other cases, the response is transformed into a factor and the probability of the second level is tested.

Since a G-test is an approximate test, an exact test is preferable when the number of individuals is small (200 is a reasonable minimum). See fisher.bintest in that case.

Value

method.test

a character string giving the name of the global test computed.

data.name

a character string giving the name(s) of the data.

alternative

a character string describing the alternative hypothesis.

estimate

the estimated probabilities.

null.value

the value of the difference in probabilities under the null hypothesis, always 0.

statistic

test statistics.

parameter

test degrees of freedom.

p.value

p-value of the global test.

alpha

significance level.

p.adjust.method

method for p-values correction.

p.value.multcomp

data frame of pairwise comparisons result.

method.multcomp

a character string giving the name of the test computed for pairwise comparisons.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

response <- c(rep(0:1,c(40,60)),rep(0:1,c(55,45)),rep(0:1,c(65,35)))
fact <- gl(3,100,labels=LETTERS[1:3])
G.bintest(response~fact)

Pairwise comparisons after a G-test

Description

Performs pairwise comparisons after a global G-test.

Usage

G.multcomp(x, p.method = "fdr")

Arguments

x

numeric vector (counts).

p.method

method for p-values correction. See help of p.adjust.

Details

Since a G-test is an approximate test, an exact test is preferable when the number of individuals is small (200 is a reasonable minimum). See multinomial.multcomp in that case.

Value

method

name of the test.

data.name

a character string giving the name(s) of the data.

p.adjust.method

method for p-values correction.

p.value

table of results.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

counts <- c(49,30,63,59)
G.test(counts)
G.multcomp(counts)

G-test

Description

Perfoms a G-test on a contingency table or a vector of counts.

Usage

G.test(x, p = rep(1/length(x), length(x)))

Arguments

x

a numeric vector or matrix (see Details).

p

theoretical proportions (optional).

Details

If x is matrix, it must be constructed like this:

- 2 columns giving number of successes (left) and fails (right)

- 1 row per population.

The function works as chisq.test :

- if x is a vector and theoretical proportions are not given, equality of counts is tested

- if x is a vector and theoretical proportions are given, equality of counts to theoretical counts (given by theoretical proportions) is tested

- if x is a matrix with two columns, equality of proportion of successes between populations is tested.

- if x is a matrix with more than two columns, independence of rows and columns is tested.

Since a G-test is an approximate test, an exact test is preferable when the number of individuals is small (200 is a reasonable minimum). See multinomial.test in that case with a vector, fisher.test with a matrix.

Value

method

name of the test.

statistic

test statistics.

parameter

test degrees of freedom.

p.value

p-value.

data.name

a character string giving the name(s) of the data.

observed

the observed counts.

expected

the expected counts under the null hypothesis.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

counts <- c(49,30,63,59)
G.test(counts)

Pairwise comparisons after a G-test for given probabilities

Description

Performs pairwise comparisons after a global G-test for given probabilities.

Usage

G.theo.multcomp(x, p = rep(1/length(x), length(x)), p.method = "fdr")

Arguments

x

numeric vector (counts).

p

theoretical proportions.

p.method

method for p-values correction. See help of p.adjust.

Details

Since a G-test is an approximate test, an exact test is preferable when the number of individuals is small (200 is a reasonable minimum). See multinomial.theo.multcomp in that case.

Value

method

name of the test.

data.name

a character string giving the name(s) of the data.

observed

observed counts.

expected

expected counts.

p.adjust.method

method for p-values correction.

statistic

statistics of each test.

p.value2

corrected p-values.

p.value

data frame of results.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

counts <- c(49,30,63,59)
p.theo <- c(0.2,0.1,0.45,0.25)
G.test(counts,p=p.theo)
G.theo.multcomp(counts,p=p.theo)

Significance test for GPA

Description

Performs a permutation significance test based on total variance explained for Generalized Procrustes Analysis. The function uses GPA.

Usage

GPA.test(df, group, tolerance = 10^-10, nbiteration = 200, scale = TRUE,
  nperm = 999, progress = TRUE)

Arguments

df

a data frame with n rows (individuals) and p columns (quantitative varaibles), in which all data frames are combined.

group

a vector indicating the number of variables in each group (i.e. data frame).

tolerance

a threshold with respect to which the algorithm stops, i.e. when the difference between the GPA loss function at step n and n+1 is less than tolerance.

nbiteration

the maximum number of iterations until the algorithm stops.

scale

logical, if TRUE (default) scaling is required.

nperm

number of permutations.

progress

logical indicating if the progress bar should be displayed.

Details

Rows of each data frame are randomly and independently permuted.

The function deals with the limitted floating point precision, which can bias calculation of p-values based on a discrete test statistic distribution.

Value

method

a character string indicating the name of the test.

data.name

a character string giving the name(s) of the data, plus additional information.

statistic

the value of the test statistics.

permutations

the number of permutations.

p.value

the p-value of the test.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

References

Wakeling IN, Raats MM and MacFie HJH (1992) A new significance test for consensus in Generalized Procrustes Analysis. Journal of Sensory Studies 7:91-96.

Examples

require(FactoMineR)
data(wine)

## Not run: GPA.test(wine[,-(1:2)],group=c(5,3,10,9,2))

Type II permutation test for constrained multivariate analyses

Description

This function is a wrapper to anova.cca(...,by="terms") but performs type II tests (whereas anova.cca performs type I).

Usage

MVA.anova(object, ...)

Arguments

object

a result object from cca, rda, capscale or dbrda.

...

additional arguments to anova.cca (can be permutations, model, parallel and/or strata). See help of this function.

Details

See anova.cca for detailed explanation of what is done. The only difference with anova.cca is that MVA.anova performs type II tests instead of type I.

See example of adonis.II for the difference between type I (sequential) and type II tests.

Value

a data frame of class "anova".

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Biplot of multivariate analyses

Description

Displays a biplot of a multivariate analysis. This just consists in superimposing a score plot and a correlation circle (plus centroids of factor levels in constrained analyses, RDA or CCA). The correlation circle is adjusted to fit the size of the score plot.

Usage

MVA.biplot(x, xax = 1, yax = 2, scaling = 2, sco.set = c(12, 1, 2),
  cor.set = c(12, 1, 2), space = 1, ratio = 0.9, weights = 1,
  constraints = c("nf", "n", "f", NULL), sco.args = list(),
  cor.args = list(), f.col = 1, f.cex = 1)

Arguments

x

a multivariate analysis (see Details).

xax

the horizontal axis.

yax

the vertical axis.

scaling

type of scaling (see MVA.scoreplot).

sco.set

scores to be displayed, when several sets are available (see MVA.scoreplot).

cor.set

correlations to be displayed, when several sets are available (see MVA.scoreplot).

space

space to use, when several are available (see MVA.scoreplot and MVA.corplot).

ratio

constant for adjustement of correlations to the size of the score plot (0.9 means the longest arrows is 90% of the corresponding axis).

weights

only used with constrained analyses (RDA or CCA) where some constraints are factors. Individual weights, used to calculate barycenter positions.

constraints

only used with constrained analyses (RDA or CCA). Type of constraints to display: quantitative ("n"), factors ("f"), both ("nf", default) or none ("NULL").

sco.args

list containing optional arguments to pass to MVA.scoreplot. All arguments are accepted.

cor.args

list containing optional arguments to pass to MVA.corplot. All arguments are accepted except xlab, ylab, circle, intcircle, drawintaxes, add and add.const.

f.col

color(s) used for barycenters in case of a constraint analysis (RDA or CCA) containing factor constraint(s). Can be a unique value, a vector giving one color per constraint or a vector giving one color per barycenter (all factors confounded).

f.cex

size(s) used for barycenters in case of a constraint analysis (RDA or CCA) containing factor constraint(s). Can be a unique value, a vector giving one size per constraint or a vector giving one size per barycenter (all factors confounded).

Details

This function should not be use directly. Prefer the general MVA.plot, to which all arguments can be passed.

All multivariate analyses covered by MVA.corplot can be used for biplots.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(vegan)
data(iris)
RDA <- rda(iris[,1:4]~Species,data=iris)
MVA.plot(RDA,"biplot",cor.args=list(col="purple"),ratio=0.8,f.col=c("red","green","blue"))

Cross model validation

Description

Performs cross model validation (2CV) with different PLS analyses.

Usage

MVA.cmv(X, Y, repet = 10, kout = 7, kinn = 6, ncomp = 8, scale = TRUE,
  model = c("PLSR", "CPPLS", "PLS-DA", "PPLS-DA", "PLS-DA/LDA", "PLS-DA/QDA",
  "PPLS-DA/LDA", "PPLS-DA/QDA"), crit.inn = c("RMSEP", "Q2", "NMC"),
  Q2diff = 0.05, lower = 0.5, upper = 0.5, Y.add = NULL, weights = rep(1, nrow(X)),
  set.prior = FALSE, crit.DA = c("plug-in", "predictive", "debiased"), ...)

Arguments

X

a data frame of independent variables.

Y

the dependent variable(s): numeric vector, data frame of quantitative variables or factor.

repet

an integer giving the number of times the whole 2CV procedure has to be repeated.

kout

an integer giving the number of folds in the outer loop (can be re-set internally if needed).

kinn

an integer giving the number of folds in the inner loop (can be re-set internally if needed). Cannot be > kout.

ncomp

an integer giving the maximal number of components to be tested in the inner loop (can be re-set depending on the size of the train sets).

scale

logical indicating if data should be scaled (see Details).

model

the model to be fitted (see Details).

crit.inn

the criterion to be used to choose the number of components in the inner loop. Root Mean Square Error of Prediction ("RMSEP", default) and Q2 ("Q2") are only used for PLSR and CPPLS, whereas the Number of MisClassifications ("NMC") is only used for discriminant analyses.

Q2diff

the threshold to be used if the number of components is chosen according to Q2. The next component is added only if it makes the Q2 increase more than Q2diff (5% by default).

lower

a vector of lower limits for power optimisation in CPPLS or PPLS-DA (see cppls.fit).

upper

a vector of upper limits for power optimisation in CPPLS or PPLS-DA (see cppls.fit).

Y.add

a vector or matrix of additional responses containing relevant information about the observations, in CPPLS or PPLS-DA (see cppls.fit).

weights

a vector of individual weights for the observations, in CPPLS or PPLS-DA (see cppls.fit).

set.prior

only used when a second analysis (LDA or QDA) is performed. If TRUE, the prior probabilities of class membership are defined according to the mean weight of individuals belonging to each class. If FALSE, prior probabilities are obtained from the data sets on which LDA/QDA models are built.

crit.DA

criterion used to predict class membership when a second analysis (LDA or QDA) is used. See predict.lda.

...

other arguments to pass to plsr (PLSR, PLS-DA) or cppls (CPPLS, PPLS-DA).

Details

Cross model validation is detailed is Szymanska et al (2012). Some more details about how this function works:

- when a discriminant analysis is used ("PLS-DA", "PPLS-DA", "PLS-DA/LDA", "PLS-DA/QDA", "PPLS-DA/LDA" or "PPLS-DA/QDA"), the training sets (test set itself in the inner loop, test+validation sets in the outer loop) are generated in respect to the relative proportions of the levels of Y in the original data set (see splitf).

- "PLS-DA" is considered as PLS2 on a dummy-coded response. For a PLS-DA based on the CPPLS algorithm, use "PPLS-DA" with lower and upper limits of the power parameters set to 0.5.

- if a second analysis is used ("PLS-DA/LDA", "PLS-DA/QDA", "PPLS-DA/LDA" or "PPLS-DA/QDA"), a LDA or QDA is built on scores of the first analysis (PLS-DA or PPLS-DA) also in the inner loop. The classification error rate, based on this second analysis, is used to choose the number of components.

If scale = TRUE, the scaling is done as this:

- for each step of the outer loop (i.e. kout steps), the rest set is pre-processed by centering and unit-variance scaling. Means and standard deviations of variables in the rest set are then used to scale the test set.

- for each step of the inner loop (i.e. kinn steps), the training set is pre-processed by centering and unit-variance scaling. Means and standard deviations of variables in the training set are then used to scale the validation set.

Value

model

model used.

type

type of model used.

repet

number of times the whole 2CV procedure was repeated.

kout

number of folds in the outer loop.

kinn

number of folds in the inner loop.

crit.inn

criterion used to choose the number of components in the inner loop.

crit.DA

criterion used to classify individuals of the test and validation sets.

Q2diff

threshold used if the number of components is chosen according to Q2.

groups

levels of Y if it is a factor.

models.list

list of of models generated (repet*kout models), for PLSR, CPPLS, PLS-DA and PPLS-DA.

models1.list

list of of (P)PLS-DA models generated (repet*kout models), for PLS-DA/LDA, PLS-DA/QDA, PPLS-DA/LDA and PPLS-DA/QDA.

models2.list

list of of LDA/QDA models generated (repet*kout models), for PLS-DA/LDA, PLS-DA/QDA, PPLS-DA/LDA and PPLS-DA/QDA.

RMSEP

RMSEP computed from the models used in the outer loops (repet values).

Q2

Q2 computed from the models used in the outer loops (repet values).

NMC

Classification error rate computed from the models used in the outer loops (repet values).

confusion

Confusion matrices computed from the models used in the outer loops (repet values).

pred.prob

Probability of each individual of being of each level of Y.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

References

Szymanska E, Saccenti E, Smilde AK and Westerhuis J (2012) Double-check: validation of diagnostic statistics for PLS-DA models in metabolomics studies. Metabolomics (2012) 8:S3-S16.

Examples

require(pls)
require(MASS)

# PLSR
data(yarn)
## Not run: MVA.cmv(yarn$NIR,yarn$density,model="PLSR")

# PPLS-DA coupled to LDA
data(mayonnaise)
## Not run: MVA.cmv(mayonnaise$NIR,factor(mayonnaise$oil.type),model="PPLS-DA/LDA",crit.inn="NMC")

Correlations of multivariate analyses

Description

Returns correlations of a multivariate analysis.

Usage

MVA.cor(x, xax = 1, yax = 2, set = c(12, 1, 2), space = 1, ...)

Arguments

x

a multivariate analysis (see Details).

xax

axis or axes for which to extract correlations.

yax

axis for which to extract correlations (ignored if length(xax) > 1).

set

variables to be displayed, when several sets are available (see Details). 12 (default) for both sets, 1 for X or constraints, 2 for Y or constrained variables.

space

variables to be displayed, when several spaces are available (see Details). space is the number of the space to be plotted.

...

not used.

Details

Many multivariate analyses are supported, from various packages:

- PCA: dudi.pca, rda.

- sPCA: spca.

- IPCA: ipca.

- sIPCA: sipca.

- LDA: lda, discrimin.

- PLS-DA (PLS2 on a dummy-coded factor): plsda. X space only.

- sPLS-DA (sPLS2 on a dummy-coded factor): splsda. X space only.

- CPPLS: mvr. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. X space only.

- PLSR: mvr, pls, plsR (plsRglm package). Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. X space only.

- sPLSR: pls. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. X space only.

- PLS-GLR: plsRglm (plsRglm package). Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. Correlations are computed with Y on the link scale.

- PCR: mvr. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- CDA: discrimin, discrimin.coa.

- NSCOA: dudi.nsc. For NSCOA there is no real correlation, but the classical representation of columns is arrows. This is why MVA.corplot was made able to deal with this analysis.

- CCA: cca, pcaiv. Constraints (only quantitative constraints are extracted) in constrained space only.

- Mix analysis: dudi.mix, dudi.hillsmith. Only quantitative variables are displayed.

- RDA (or PCAIV): pcaiv, pcaivortho, rda. With rda, space 1 is constrained space, space 2 is unconstrained space. Only constrained space is available with pcaiv, the opposite for pcaivortho. Set 1 is constraints (only quantitative constraints are extracted), set 2 is dependent variables (only set 2 is available for pcaivortho). If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- CCorA: CCorA, rcc. Space 1 is X, space 2 is Y. With rcc a third space is available, in which coordinates are means of X and Y coordinates. In this third space, set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- rCCorA: rcc. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates. In space 3, set 1 is X and set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- CIA: coinertia. Space 1 is X, space 2 is Y, space 3 is a "common" space where X and Y scores are normed. In space 3, set 1 is X and set 2 is Y. If set=12 in space 3 (default), fac is not available and pch,cex, col, lws can be defined differently for each set.

- GPA: GPA. Only the consensus ordination can be displayed.

- 2B-PLS: pls. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates. In space 3, set 1 is X and set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- 2B-sPLS: pls. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates. In space 3, set 1 is X and set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- rGCCA: wrapper.rgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- sGCCA: wrapper.sgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- DIABLO: block.plsda, block.splsda. Space can be 1 to n, the number of blocks (i.e. datasets).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Correlation circle of multivariate analyses

Description

Displays a correlation circle of a multivariate analysis.

Usage

MVA.corplot(x, xax = 1, yax = 2, thresh = 0, fac = NULL, set = c(12, 1, 2), space = 1,
  xlab = NULL, ylab = NULL, main = NULL, circle = TRUE, intcircle = 0.5, points = TRUE,
  ident = TRUE, arrows = TRUE, labels = NULL, main.pos = c("bottomleft", "topleft",
  "bottomright", "topright"), main.cex = 1.3, legend = FALSE, legend.pos = c("topleft",
  "topright", "bottomleft", "bottomright"), legend.title = NULL, legend.lab = NULL,
  pch = 16, cex = 1, col = 1, lwd = 1, drawintaxes = TRUE, add = FALSE, add.const = 1,
  keepmar = FALSE)

Arguments

x

a multivariate analysis (see Details).

xax

the horizontal axis.

yax

the vertical axis. This can be set to NULL for a one-dimensional graph, which is a dotchart.

thresh

threshold (in absolute value of the correlation coefficient) of variables to be plotted.

fac

an optional factor defining groups of variables.

set

variables to be displayed, when several sets are available (see Details). 12 (default) for both sets, 1 for X or constraints, 2 for Y or constrained variables.

space

variables to be displayed, when several spaces are available (see Details). space is the number of the space to be plotted.

xlab

legend of the horizontal axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

ylab

only used for two-dimensional graphs. Legend of the vertical axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

main

optional title of the graph.

circle

only used for two-dimensional graphs. Logical indicating if the circle of radius 1 should be plotted.

intcircle

only used for two-dimensional graphs. Vector of one or several values indicating radii of circles to be plotted inside the main circle. Can be set to NULL.

points

only used for two-dimensional graphs. If FALSE, arrows or points (see arrows) are replaced with their corresponding label (defined by labels).

ident

only used for two-dimensional graphs when points=TRUE. A logical indicating if variable names should be displayed.

arrows

only used if points=TRUE. Logical indicating if arrows should be plotted. If FALSE, points are displayed at the extremity of the arrows.

labels

names of the variables. If NULL (default), labels correspond to variable names found in the data used in the multivariate analysis. For two-dimensional graphs, only used if ident=TRUE.

main.pos

position of the title, if main is not NULL. Default to "bottomleft".

main.cex

size of the title, if main is not NULL.

legend

only used for two-dimensional graphs. Logical indicating if a legend should be added to the graph.

legend.pos

position of the legend, if legend is TRUE. Default to "topleft".

legend.title

optional title of the legend, if legend is TRUE.

legend.lab

legend labels, if legend is TRUE. If NULL, levels of the factor defined by fac are used.

pch

symbol(s) used for points, when points are displayed (see arrows). If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

cex

size of the points and/or of the variable names. For two-dimensional graphs: if fac is not NULL, can be a vector of length one or a vector giving one value per group; otherwise a vector of any length can be defined, which is recycled if necessary. For dotcharts, gives the size used for points and all labels (see dotchart).

col

color(s) used for points and/or variable names. If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary (not available for density histograms, see dhist).

lwd

only used if arrows are displayed. Width of arrows. If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

drawintaxes

logical indicating if internal axes should be drawn.

add

only used for two-dimensional graphs. Logical indicating if the correlation circle should be added to an existing graph.

add.const

only used for two-dimensional graphs and if add is TRUE. Constant by which correlations are multiplied to fit onto the original graph.

keepmar

only used for two-dimensional graphs. Logical indicating if margins defined by MVA.corplot should be kept after plotting (necessary in some cases when add=TRUE).

Details

This function should not be use directly. Prefer the general MVA.plot, to which all arguments can be passed.

Many multivariate analyses are supported, from various packages:

- PCA: dudi.pca, rda.

- sPCA: spca.

- IPCA: ipca.

- sIPCA: sipca.

- LDA: lda, discrimin.

- PLS-DA (PLS2 on a dummy-coded factor): plsda. X space only.

- sPLS-DA (sPLS2 on a dummy-coded factor): splsda. X space only.

- CPPLS: mvr. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. X space only.

- sPLSR: pls. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set. X space only.

- PCR: mvr. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col, lwd can be defined differently for each set.

- CDA: discrimin, discrimin.coa.

- NSCOA: dudi.nsc. For NSCOA there is no real correlation, but the classical representation of columns is arrows. This is why MVA.corplot was made able to deal with this analysis.

- CCA: cca, pcaiv. Constraints (only quantitative constraints are extracted) in constrained space only.

- Mix analysis: dudi.mix, dudi.hillsmith. Only quantitative variables are displayed.

- db-RDA: capscale, dbrda. Constraints (only quantitative constraints are extracted) in constrained space only.

- PCIA: procuste. Set 1 is X, set 2 is Y.

- rGCCA: wrapper.rgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- sGCCA: wrapper.sgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- DIABLO: block.plsda, block.splsda. Space can be 1 to n, the number of blocks (i.e. datasets).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(ade4)
data(olympic)
PCA <- dudi.pca(olympic$tab,scannf=FALSE)
MVA.plot(PCA,"corr")

Cross validation

Description

Performs cross validation with different PLS and/or discriminant analyses.

Usage

MVA.cv(X, Y, repet = 10, k = 7, ncomp = 8, scale = TRUE, model = c("PLSR",
  "CPPLS", "PLS-DA", "PPLS-DA", "LDA", "QDA", "PLS-DA/LDA", "PLS-DA/QDA",
  "PPLS-DA/LDA", "PPLS-DA/QDA"), lower = 0.5, upper = 0.5, Y.add = NULL,
  weights = rep(1, nrow(X)), set.prior = FALSE, crit.DA = c("plug-in",
  "predictive", "debiased"), ...)

Arguments

X

a data frame of independent variables.

Y

the dependent variable(s): numeric vector, data frame of quantitative variables or factor.

repet

an integer giving the number of times the whole procedure has to be repeated.

k

an integer giving the number of folds (can be re-set internally if needed).

ncomp

an integer giving the number of components to be used for all models except LDA and QDA (can be re-set depending on the size of the train sets).

scale

logical indicating if data should be scaled (see Details).

model

the model to be fitted (see Details).

lower

a vector of lower limits for power optimisation in CPPLS or PPLS-DA (see cppls.fit).

upper

a vector of upper limits for power optimisation in CPPLS or PPLS-DA (see cppls.fit).

Y.add

a vector or matrix of additional responses containing relevant information about the observations, in CPPLS or PPLS-DA (see cppls.fit).

weights

a vector of individual weights for the observations, in CPPLS or PPLS-DA (see cppls.fit).

set.prior

only used when a LDA or QDA is performed (coupled or not with a PLS model). If TRUE, the prior probabilities of class membership are defined according to the mean weight of individuals belonging to each class. If FALSE, prior probabilities are obtained from the data sets on which LDA/QDA models are built.

crit.DA

criterion used to predict class membership when a LDA or QDA is used. See predict.lda.

...

other arguments to pass to plsr (PLSR, PLS-DA) or cppls (CPPLS, PPLS-DA).

Details

When a discriminant analysis is used ("PLS-DA", "PPLS-DA", "LDA", "QDA", "PLS-DA/LDA", "PLS-DA/QDA", "PPLS-DA/LDA" or "PPLS-DA/QDA"), the training sets are generated in respect to the relative proportions of the levels of Y in the original data set (see splitf).

"PLS-DA" is considered as PLS2 on a dummy-coded response. For a PLS-DA based on the CPPLS algorithm, use "PPLS-DA" with lower and upper limits of the power parameters set to 0.5.

If scale = TRUE, the scaling is done as this: for each step of the validation loop (i.e. k steps), the training set is pre-processed by centering and unit-variance scaling. Means and standard deviations of variables in the training set are then used to scale the test set.

Value

model

model used.

type

type of model used.

repet

number of times the whole procedure was repeated.

k

number of folds.

ncomp

number of components used.

crit.DA

criterion used to classify individuals of the test sets.

groups

levels of Y if it is a factor.

models.list

list of of models generated (repet*k models), for PLSR, CPPLS, PLS-DA, PPLS-DA, LDA and QDA.

models1.list

list of of (P)PLS-DA models generated (repet*k models), for PLS-DA/LDA, PLS-DA/QDA, PPLS-DA/LDA and PPLS-DA/QDA.

models2.list

list of of LDA/QDA models generated (repet*k models), for PLS-DA/LDA, PLS-DA/QDA, PPLS-DA/LDA and PPLS-DA/QDA.

RMSEP

RMSEP vales (repet values).

Q2

Q2 values (repet values).

NMC

Classification error rates (repet values).

confusion

Confusion matrices (repet values).

pred.prob

Probability of each individual of being of each level of Y.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(pls)
require(MASS)

# PLSR
data(yarn)
## Not run: MVA.cv(yarn$NIR,yarn$density,model="PLSR")

# PPLS-DA coupled to LDA
data(mayonnaise)
## Not run: MVA.cv(mayonnaise$NIR,factor(mayonnaise$oil.type),model="PPLS-DA/LDA")

Loadings of multivariate analyses

Description

Returns loadings of a multivariate analysis.

Usage

MVA.load(x, xax = 1, yax = 2, set = c(12, 1, 2), space = 1, ...)

Arguments

x

a multivariate analysis (see Details).

xax

axis or axes for which to extract loadings.

yax

axis for which to extract loadings (ignored if length(xax) > 1).

set

variables to be displayed, when several sets are available (see Details). 12 (default) for both sets, 1 for X, 2 for Y.

space

variables to be displayed, when several spaces are available (see Details). space is the number of the space to be plotted.

...

not used.

Details

Many multivariate analyses are supported, from various packages:

- PCA: prcomp, princomp, dudi.pca, rda, pca, pca.

- sPCA: spca.

- IPCA: ipca.

- sIPCA: sipca.

- LDA: lda, discrimin.

- PLS-DA (PLS2 on a dummy-coded factor): plsda. X space only.

- sPLS-DA (sPLS2 on a dummy-coded factor): splsda. X space only.

- CPPLS: mvr. X space only.

- PLSR: mvr, pls, plsR (plsRglm package). X space only.

- sPLSR: pls. X space only.

- PLS-GLR: plsRglm (plsRglm package).

- PCR: mvr.

- CDA: discrimin, discrimin.coa.

- NSCOA: dudi.nsc.

- MCA: dudi.acm.

- Mix analysis: dudi.mix, dudi.hillsmith.

- PCIA: procuste. Set 1 is X, set 2 is Y.

- CCorA: rcc. Space 1 is X, space 2 is Y.

- rCCorA: rcc. Space 1 is X, space 2 is Y.

- CIA: coinertia. Space 1 is X, space 2 is Y.

- 2B-PLS: pls. Space 1 is X, space 2 is Y.

- 2B-sPLS: pls. Space 1 is X, space 2 is Y.

- rGCCA: wrapper.rgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- sGCCA: wrapper.sgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- DIABLO: block.plsda, block.splsda. Space can be 1 to n, the number of blocks (i.e. datasets).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Loading plot of multivariate analyses

Description

Displays a loading plot of a multivariate analysis.

Usage

MVA.loadplot(x, xax = 1, yax = 2, fac = NULL, set = c(12, 1, 2), space = 1, map = TRUE,
  xlab = NULL, ylab = NULL, main = NULL, points = TRUE, ident = TRUE, links = TRUE, 
  line = TRUE, labels = NULL, main.pos = c("bottomleft", "topleft","bottomright",
  "topright"), main.cex = 1.3, legend = FALSE, legend.pos = c("topleft", "topright",
  "bottomleft", "bottomright"), legend.title = NULL, legend.lab = NULL, pch = 16,
  cex = 1, col = 1, lwd = 1, lty = 1, drawextaxes = TRUE, drawintaxes = TRUE, xlim = NULL,
  ylim = NULL)

Arguments

x

a multivariate analysis (see Details).

xax

the horizontal axis.

yax

the vertical axis. This can be set to NULL for a one-dimensional graph.

fac

only used for one-dimensional graphs. An optional factor defining groups of variables.

set

variables to be displayed, when several sets are available (see Details). 12 (default) for both sets, 1 for X, 2 for Y.

space

variables to be displayed, when several spaces are available (see Details). space is the number of the space to be plotted.

map

logical indicating if a two-dimensional (TRUE, default) or a one-dimensional graph should be drawn. A one-dimensional graph can show loadings for one or two dimensions, both horizontally.

xlab

only used for two-dimensional graphs. Legend of the horizontal axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

ylab

legend of the vertical axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

main

optional title of the graph.

points

only used for two-dimensional graphs. If FALSE, lines or points (see links) are replaced with their corresponding label (defined by labels).

ident

logical indicating if variable names should be displayed. Only used when points=TRUE for two-dimensional graphs.

links

only used for two-dimensional graphs when points=TRUE. Logical indicating if variables should be linked to the origin of the graph. If FALSE, points are displayed at the extremity of the segments.

line

only used for one-dimensional graphs when yax=NULL. Logical indicating if loadings should be linked (default) as displayed as sticks.

labels

only used if ident=TRUE. Names of the variables. If NULL (default), labels correspond to variable names found in the data used in the multivariate analysis.

main.pos

only used for one-dimensional graphs. Position of the title, if main is not NULL. Default to "bottomleft".

main.cex

size of the title, if main is not NULL.

legend

logical indicating if a legend should be added to the graph.

legend.pos

position of the legend, if legend is TRUE. Default to "topleft".

legend.title

optional title of the legend, if legend is TRUE.

legend.lab

legend labels, if legend is TRUE. If NULL for a one-dimensional graph, dimension names are used. If NULL for a two-dimensional graph, levels of the factor defined by fac are used.

pch

only used for two-dimensional graphs. Symbol(s) used for points, when points are displayed (see links). If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

cex

col

color(s) used for points, variable names and/or lines/sticks. For one-dimensional graphs, can be a vector of length one or a vector giving one value per line. For two-dimensional graphs: if fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary (not available for density histograms, see dhist).

lwd

width of lines. For one-dimensional graphs, can be a vector of length one or a vector giving one value per line. For two-dimensional graphs: if fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

lty

only used for one-dimensional graphs. Can be a vector of length one or a vector giving one value per line.

drawextaxes

logical indicating if external axes should be drawn.

drawintaxes

only used for two-dimensional graphs. Logical indicating if internal axes should be drawn.

xlim

only used in two-dimensional graphs. Limits of the horizontal axis. If NULL, limits are computed automatically.

ylim

limits of the vertical axis. If NULL, limits are computed automatically.

Details

This function should not be use directly. Prefer the general MVA.plot, to which all arguments can be passed.

Many multivariate analyses are supported, from various packages:

- PCA: prcomp, princomp, dudi.pca, rda, pca, pca.

- sPCA: spca.

- IPCA: ipca.

- sIPCA: sipca.

- LDA: lda, discrimin.

- PLS-DA (PLS2 on a dummy-coded factor): plsda. X space only.

- sPLS-DA (sPLS2 on a dummy-coded factor): splsda. X space only.

- CPPLS: mvr. X space only.

- PLSR: mvr, pls, plsR (plsRglm package). X space only.

- sPLSR: pls. X space only.

- PLS-GLR: plsRglm (plsRglm package).

- PCR: mvr.

- CDA: discrimin, discrimin.coa.

- NSCOA: dudi.nsc.

- MCA: dudi.acm.

- Mix analysis: dudi.mix, dudi.hillsmith.

- PCIA: procuste. Set 1 is X, set 2 is Y.

- CCorA: rcc. Space 1 is X, space 2 is Y.

- rCCorA: rcc. Space 1 is X, space 2 is Y.

- CIA: coinertia. Space 1 is X, space 2 is Y.

- 2B-PLS: pls. Space 1 is X, space 2 is Y.

- 2B-sPLS: pls. Space 1 is X, space 2 is Y.

- rGCCA: wrapper.rgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- sGCCA: wrapper.sgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- DIABLO: block.plsda, block.splsda. Space can be 1 to n, the number of blocks (i.e. datasets).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(ade4)
data(olympic)
PCA <- dudi.pca(olympic$tab,scannf=FALSE)
MVA.plot(PCA,"load")

Paired plot of multivariate analyses

Description

Displays a paired plot (i.e. a score plot of paired points) of a multivariate analysis.

Usage

MVA.pairplot(x, xax = 1, yax = 2, pairs = NULL, scaling = 2, space = 1, fac = NULL,
  xlab = NULL, ylab = NULL, main = NULL, ident = TRUE, labels = NULL, cex = 0.7, col = 1,
  lwd = 1, main.pos = c("bottomleft", "topleft", "bottomright", "topright"),
  main.cex = 1.3, legend = FALSE, legend.pos = c("topleft", "topright", "bottomleft",
  "bottomright"), legend.title = NULL, legend.lab = NULL, drawextaxes = TRUE,
  drawintaxes = TRUE, xlim = NULL, ylim = NULL)

Arguments

x

a multivariate analysis (see Details).

xax

the horizontal axis.

yax

the vertical axis. Cannot be NULL, only two-dimensional graphs can be drawn.

pairs

two-level factor identifying paired individuals (in the same order in both sets of points). Can be omitted with multivariate analyses where two sets of points are available in the same space (see MVA.scoreplot). In this case these sets are automatically detected.

scaling

type of scaling. Only available with some analyses performed with the vegan package. See Details of MVA.scoreplot.

space

scores to be displayed, when several spaces are available (see Details of MVA.scoreplot). space is the number of the space to be plotted.

fac

an optional factor defining groups pairs.

xlab

legend of the horizontal axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

ylab

legend of the vertical axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

main

optional title of the graph.

ident

logical indicating if variable names should be displayed.

labels

names of the individuals. If NULL (default), labels correspond to row names of the data used in the multivariate analysis.

cex

size of the labels. If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

col

color(s) used for arrows and labels. If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

lwd

width of arrows. If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary.

main.pos

position of the title, if main is not NULL. Default to "bottomleft".

main.cex

size of the title, if main is not NULL.

legend

logical indicating if a legend should be added to the graph.

legend.pos

position of the legend, if legend is TRUE. Default to "topleft".

legend.title

optional title of the legend, if legend is TRUE.

legend.lab

legend labels, if legend is TRUE. If NULL and fac is defined, levels of fac are used.

drawextaxes

logical indicating if external axes should be drawn..

drawintaxes

logical indicating if internal axes should be drawn.

xlim

limits of the horizontal axis. If NULL, limits are computed automatically.

ylim

limits of the vertical axis. If NULL, limits are computed automatically.

Details

This function should not be use directly. Prefer the general MVA.plot, to which all arguments can be passed.

All multivariate analyses supported by MVA.scoreplot can be used for a paired plot.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

require(ade4)
data(macaca)
PCIA <- procuste(macaca$xy1,macaca$xy2)
MVA.plot(PCIA,"pairs")

Plotting of multivariate analyses

Description

Displays several kinds of plots for multivariate analyses.

Usage

MVA.plot(x, type = c("scores", "loadings", "correlations", "biplot", "pairs",
  "trajectories"), ...)

Arguments

x

a multivariate analysis (see Details).

type

the type of plot to be displayed: score plot (default), loading plot, correlation circle, biplot, score plot showing paired samples or score plot showing trajectories, respectively.

...

arguments to be passed to subfunctions. See Details.

Details

Different subfunctions are used depending on the type of plot to be displayed: MVA.scoreplot, MVA.loadplot, MVA.corplot, MVA.biplot, MVA.pairplot or MVA.trajplot. These functions should not be used directly (everything can be done with the general MVA.plot) but for convenience, arguments and analyses supported are detailed in separate help pages.

Warning: the use of attach before running a multivariate analysis can prevent MVA.plot to get the values it needs, and make it fail.

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Score plot of multivariate analyses

Description

Displays a score plot of a multivariate analysis.

Usage

MVA.scoreplot(x, xax = 1, yax = 2, scaling = 2, set = c(12, 1, 2), space = 1,
  byfac = TRUE, fac = NULL, barycenters = TRUE, stars = TRUE, contours = FALSE,
  dhist = TRUE, weights = 1, xlab = NULL, ylab = NULL, main = NULL, pch = 16,
  cex = 1, col = 1, points = TRUE, labels = NULL, main.pos = c("bottomleft",
  "topleft", "bottomright", "topright"), main.cex = 1.3, fac.lab = NULL,
  fac.cex = 1, legend = FALSE, legend.pos = c("topleft", "topright", "bottomleft",
  "bottomright"), legend.title = NULL, legend.lab = NULL, legend.cex = 1,
  drawextaxes = TRUE, drawintaxes = TRUE, xlim = NULL, ylim = NULL,
  keepmar = FALSE)

Arguments

x

a multivariate analysis (see Details).

xax

the horizontal axis.

yax

the vertical axis. This can be set to NULL for a one-dimensional graph. The type of graph to be drawn in this case depends on the value of dhist.

scaling

type of scaling. Only available with some analyses performed with the vegan package. See Details.

set

scores to be displayed, when several sets are available (see Details). 12 (default) for both sets, 1 for rows or X, 2 for columns or Y.

space

scores to be displayed, when several spaces are available (see Details). space is the number of the space to be plotted.

byfac

only used with MCA and mix analyses (see Details). If TRUE, a separate score plot is displayed for each factor included in the analysis. In this case fac cannot be used and if main=NULL, the factor names are displayed as titles on the graphs.

fac

an optional factor defining groups of individuals.

barycenters

only used if fac is not NULL. If TRUE (default), the name of each group (defined by fac.lab) is diplayed at the position of the barycenter of this group. Available for two-dimensional graphs and for dotcharts in the one-dimensional case (see dhist).

stars

only used if fac is not NULL. If TRUE (default), the individual of each group are linked to their corresponding barycenter.

contours

only used if fac is not NULL. If TRUE, a polygon of contour is displayed for each group.

dhist

only used in the one-dimensional case. If TRUE (default), a density histogram is displayed. If FALSE, a dotchart is displayed.

weights

individual weights, used to calculate barycenter positions (see barycenters).

xlab

legend of the horizontal axis. If NULL (default), automatic labels are used depending on the multivariate analysis.

ylab

legend of the vertical axis. If NULL (default), automatic labels are used depending on the multivariate analysis. Available for two-dimensional graphs and for density histograms in the one-dimensional case (see dhist).

main

optional title of the graph. Can be a vector of several values for MCA and mix analyses when byfac=TRUE (see byfac).

pch

symbol(s) used for points, when points are displayed (see points). If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary. Available for two-dimensional graphs and for dotcharts in the one-dimensional case (see dhist). Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

cex

size of the points or of the labels (see points). Available for two-dimensional graphs and for dotcharts in the one-dimensional case (see dhist). For two-dimensional graphs: if fac is not NULL, can be a vector of length one or a vector giving one value per group; otherwise a vector of any length can be defined, which is recycled if necessary. For dotcharts, gives the size used for points and all labels (see dotchart). Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

col

color(s) used for points or labels (see points). If fac is not NULL, can be a vector of length one or a vector giving one value per group. Otherwise a vector of any length can be defined, which is recycled if necessary (not available for density histograms, see dhist). Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

points

only used for two-dimensional graphs. If FALSE, points are replaced with their corresponding label (defined by labels). Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

labels

used in two-dimensional graphs when points=FALSE and in dotcharts (see dhist). Names of the individuals. If NULL (default), labels correspond to row names of the data used in the multivariate analysis. Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

main.pos

position of the title, if main is not NULL. Default to "bottomleft". Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

main.cex

size of the title, if main is not NULL. Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

fac.lab

only used if fac is not NULL. Labels used to display barycenters in two-dimensional graphs or on the vertical axis of a dotchart in the one-dimensional case (see dhist). If NULL, levels of the factor defined by fac are used. In case of a MCA or a mix analysis with byfac=TRUE (see byfac), labels cannot be changed and correspond to the levels of the factor displayed on each graph.

fac.cex

only used if fac is not NULL and in two-dimensional graphs. Labels used to display barycenters. Can be a vector of length one or a vector giving one value per group. Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

legend

logical indicating if a legend should be added to the graph. Available for two-dimensional graphs and for density histograms in the one-dimensional case (see dhist).

legend.pos

position of the legend, if legend is TRUE. Default to "topleft".

legend.title

optional title of the legend, if legend is TRUE. Not available for MCA and mix analyses when byfac=TRUE (see byfac).

legend.lab

legend labels, if legend is TRUE. If NULL, labels defined by fac.labels are used (see fac.labels).

legend.cex

size of legend labels, if legend is TRUE.

drawextaxes

logical indicating if external axes should be drawn. Available for two-dimensional graphs and for density histograms in the one-dimensional case (see dhist).

drawintaxes

logical indicating if internal axes should be drawn.

xlim

limits of the horizontal axis. If NULL, limits are computed automatically. Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

ylim

only used in two-dimensional graphs. Limits of the vertical axis. If NULL, limits are computed automatically. Re-used for all graphs for MCA and mix analyses when byfac=TRUE (see byfac).

keepmar

only used in two-dimensional graphs. Logical indicating if margins defined by MVA.scoreplot should be kept after plotting (necessary for biplots).

Details

This function should not be use directly. Prefer the general MVA.plot, to which all arguments can be passed.

Many multivariate analyses are supported, from various packages:

- PCA: prcomp, princomp (if scores=TRUE), dudi.pca, rda, pca, pca. scaling can be defined for rda (see scores.rda).

- sPCA: spca.

- IPCA: ipca.

- sIPCA: sipca.

- PCoA: cmdscale (with at least on non-default argument), dudi.pco, wcmdscale (with at least one non-default argument), capscale, pco, pcoa.

- nMDS: monoMDS, metaMDS, nmds, isoMDS.

- LDA: lda, discrimin.

- PLS-DA (PLS2 on a dummy-coded factor): plsda. X space only.

- sPLS-DA (sPLS2 on a dummy-coded factor): splsda. X space only.

- CPPLS: mvr. X space only.

- PLSR: mvr, pls, plsR (plsRglm package). X space only.

- sPLSR: pls. X space only.

- PLS-GLR: plsRglm (plsRglm package).

- PCR: mvr.

- CDA: discrimin, discrimin.coa.

- NSCOA: dudi.nsc.

- MCA: dudi.acm.

- Mix analysis: dudi.mix, dudi.hillsmith.

- COA: dudi.coa, cca. Set 1 is rows, set 2 is columns. If set=12 (default), fac is not available and pch,cex, col can be defined differently for each set. scaling can be defined for cca (see scores.cca).

- DCOA: dudi.dec. Set 1 is rows, set 2 is columns. If set=12 (default), fac is not available and pch,cex, col can be defined differently for each set.

- PCIA: procuste. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col can be defined differently for each set.

- Procrustean superimposition: procrustes. Set 1 is X, set 2 is Y. If set=12 (default), fac is not available and pch,cex, col can be defined differently for each set.

- GPA: GPA. Only the consensus ordination can be displayed.

- DPCoA: dpcoa. Set 1 is categories, set 2 is collections. If set=12 (default), fac is not available and pch,cex, col can be defined differently for each set.

- db-RDA (or CAP): capscale, dbrda. Space 1 is constrained space, space 2 is unconstrained space.

- CCA: pcaiv, cca. With rda, space 1 is constrained space, space 2 is unconstrained space. Only constrained space is available with pcaiv. Set 1 is rows, set 2 is columns. scaling can be defined for cca (see scores.cca).

- CCorA: CCorA, rcc. Space 1 is X, space 2 is Y. With rcc a third space is available, in which coordinates are means of X and Y coordinates.

- rCCorA: rcc. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates.

- 2B-PLS: pls. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates.

- 2B-sPLS: pls. Space 1 is X, space 2 is Y, space 3 is a "common" space in which coordinates are means of X and Y coordinates.

- rGCCA: rgcca, wrapper.rgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- sGCCA: rgcca, wrapper.sgcca. Space can be 1 to n, the number of blocks (i.e. datasets).

- DIABLO: block.plsda, block.splsda. Space can be 1 to n, the number of blocks (i.e. datasets).

Author(s)

Maxime HERVE <maxime.herve@univ-rennes1.fr>

Examples

data(iris)
PCA <- prcomp(iris[,1:4])
MVA.plot(PCA,"scores")
MVA.plot(PCA,"scores",fac=iris$Species,col=1:3,pch=15:17)

Scores of multivariate analyses

Description

Returns scores of a multivariate analysis.

Usage

MVA.scores(x, xax = 1, yax = 2, scaling = 2, set = c(12, 1, 2), space = 1, ...)

Arguments

x

a multivariate analysis (see Details).

xax

axis or axes for which to extract scores.

yax

axis for which to extract scores (ignored if length(xax) > 1).