MVApp

Glittery multivariate analysis platform for all kinds of data. Follow us on twitter @MVApp007.

The app is available here or you can run it locally from your device by typing the following command in your R window:

install.packages("shiny") library("shiny") shiny::runGitHub("mmjulkowska/MVApp", "mmjulkowska")

(….it will take some time for the first time to upload all the libraries)

Purpose statement - What is MVApp for?

MVApp was created to streamline data analysis for all kinds of biological queries - from investigating mutant phenotypes, examing the effects of an experimental treatment, to studying natural variation using any biological system.

We believe that MVApp will enhance data transparency and standardize data curation and analysis in the scientific community by empowering researchers to perform complex analyses without extensive knowledge of R or statistics, as well as improve the data analysis literacy in wider scientific community.

Although the MVApp development team is buried armpit-deep in Plant Science, we are trying to make the App as applicable as possible for all biological disciplines and beyond. If you have any suggestions on other types of analyses we can include, please check out our guidelines on how to contribute.

Currently MVApp has following features:

Identification of outliers using different methods based on one or multiple phenotypes
Summary of the data dynamics by fitting simple functions or polynomial curves to data points
Hypothesis testing using parametric and non-parametric tests, including testing the assumptions of normality and equal variance
Correlation analysis of all measured traits in the experiment or within a specific subset of data
Reduction of data dimensionality and identifying the traits that explain the most data variance using principal component analysis and multidimensional scaling
Clustering individual samples using hierarchical or k-means clustering
Estimation of broad-sense heritability of measured traits
Quantile regression analysis that allows the identification of traits with significant contribution to traits of major interest

You can read the instructions below, or watch one of our video-tutorials on youtube.

How to cite the MVApp:

The app is not published yet, but you can find the pre-print version of the MVApp manuscript on figshare: Julkowska, Magdalena; Saade, Stephanie; Agarwal, Gaurav; Gao, Ge; Pailles, Yveline; Morton, Mitchell; Awlia, Mariam; Tester, Mark (2018): MVAPP – Multivariate analysis application for streamlined data analysis and curation. figshare. Paper.

If you wish to cite the app itself, please use the following: Julkowska, M.M., Saade, S., Gao, G., Morton, M.J.L., Awlia, M., Tester, M.A., “MVApp.pre-release_v2.0 mmjulkowska/MVApp: MVApp.pre-release_v2.0”, DOI: 10.5281/zenodo.1067974

column with the genotype or the main Independent Variable (if you use only one genotype - we advise you to include one column with the genotype anyway and give the same name to all of your samples)
One or multiple column(s) with an Independent Variable (e.g. treatment, position, experimental batch number)
One or multiple column(s) containing Dependent Variable - numerical data of the measured traits - also known as phenotypes

If you have a timeseries experiment, or any other gradient, and you want to fit curves to your data, the input data should include columns containing:

Time (or other continuous Independent Variable) - this variable MUST be numeric (e.g. “1” instead of “Day 1”)
Sample ID - an identifier for each individual sample

Your data should look similar to the Example dataset, with ID and TIME column being optional:

mvapp_data

MVApp

Glittery data analysis and multi-variate analysis for all kinds of beautiful, big and small data sets. But remember - rubbish in = rubbish out!

MVApp

Purpose statement - What is MVApp for?

How to cite the MVApp:

Table of contents:

1. DATA UPLOAD

Data format:

Upload and annotate your data

2. SPATIAL VARIATION

Why test spatial variation?

Upload the spatial information into the MVApp

Examine the effect of spatial variation on individual phenotypes

3. CURVE FITTING

Why model your data?

Fit simple functions

Fit curves with MVApp

Visualise goodness of fit of the dynamic curves with fit-plots

Assess and compare the dynamics between Genotypes and / or Independent Variables

Fit polynomial curves with MVApp

4. OUTLIER SELECTION

Why identify potential outliers?

Highlight potential outliers

Examine the data with and without potential outliers

Compare the data with outliers removed

Calculate summary statistics

5. DATA EXPLORATION

Examine distribution

Examine variance

One / two sample test

Test significant differences between groups

Two-way ANOVA

6. CORRELATIONS

Select the dataset

Select the correlation method

Correlation for subsetted data

Customize the correlation plot

Scatterplots

7. PRINCIPAL COMPONENT ANALYSIS

Select data, subsets, and Dependent Variables

Visualize the principal components

Visualize the contribution of each Dependent Variable to the principal components

What are the principal component coordinates for individual samples?

Explain principal components by examining the contribution of Dependent Variables

8. MULTIDIMENSIONAL SCALING

Select data, subsets, and dependent variables

Multidimensional scaling of individual samples

Multidimensional scaling of the selected Dependent Variables

9. HIERARCHICAL CLUSTER ANALYSIS

Selecting the data

View the clusters and select the similarity distance for cluster separation

Cluster Validation

10. K-MEANS CLUSTER ANALYSIS

Selecting the data

Optimal cluster number estimation

Performing k-means clustering

11. HERITABILITY

Selecting the data

Estimated broad-sense heritability

12. QUANTILE REGRESSION

When should you use it?

Select the dataset

Select reponse, explanatory variable, subsets

Results of quantile regression

Visualize the quantile regression results

Quantile plots