Supplementary Dr. S. Mari完整原版文件.docx
《Supplementary Dr. S. Mari完整原版文件.docx》由会员分享,可在线阅读,更多相关《Supplementary Dr. S. Mari完整原版文件.docx(27页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、Supplementary MaterialCurrent Metabolomics, 2013, Vol. 1, No. 2 viiSupplementary MaterialMetabolomic Univariate & Multivariate Analysis (muma)TUTORIALTABLE OF CONTENTSmuma overview3Functions list3Download and Installation4Dataset format5Analysis procedure51| Create the working directory62| Start the
2、 analysis63| Principal Component Analysis Score and Loading plots114| Univariate Analysis135| Merge univariate and multivariate information176| Partial Least Square Discriminant Analysis (PLS-DA)197| Orthogonal Projection to Latent Structures - Discriminant Analysis (OPLS-DA)208| Tools for NMR molec
3、ular assignment and data interpretation21A| Statistical TOtal Correlation SpectroscopY (STOCSY)21B| STOCSY 1D23C| Orthogonal Signal Correction (OSC) STOCSY24D| Ratio Analysis NMR SpectroscopY (RANSY)26References27muma overviewmuma is a tool for the multivariate and univariate statistical analysis of
4、 metabolomic data, written in the form of add-on package for the open source software R. By creating this statistical protocol we wanted to provide guidelines for the whole process of metabolomic data interpretation, from data pre- processing, to dataset exploration and visualization, to identificat
5、ion of potentially interesting variables (or metabolites). For doing so, we implemented the steps that are typically used in metabolomic analyses and created some new tips that can facilitate users work. In fact, muma is designed for those people who are not R experts, but want to perform statistica
6、l analysis in a very short time and with reliable results.Even though muma has been designed for the analysis of metabolomic data generated with different analytical platforms (NMR, MS, NIR.), it provides specific methods for helping the NMR-based metabolomics. In particular, muma is equipped with t
7、wo tools (STOCSY and RANSY) aiding the identification and assignment of molecules present in NMR spectra, or suggesting possible biochemical interaction between different molecules.In this tutorial we provide a workflow for metabolomics data interpretation using muma, describing from the installatio
8、n to the specific usage of all mumas functions, to the recovery of all results generated. Enjoy.Functions listwork.dir()Generate a working directory within which all the files generated are stored.explore.data() Perform data pre-processing (normalization, scaling) and data exploration, through PCA.P
9、lot.pca()Plot the PCA Score and Loading plots for specified principal components.plsda()Perform PLS-DA.univariate()Perform an array of univariate statistical techniques.Plot.plsda()PlotthePLS-DAScoreandw*cplots,forspecified components.oplsda()Perform OPLS-DA.stocsy()Perform STOCSY analysis.stocsy.1d
10、()Perform monodimensional STOCSY analysis. ostocsy()Perform STOCSY analysis on the OSC-filtered dataset. ransy()Perform RANSY analysis.Download and InstallationFirst of all download R (version 2.15 or higher) from the CRAN (www.r- project.org), according to your operating system (Unix, MacOS or Wind
11、ows). Install R as indicated in the R manual.You can open R with its graphic interface or from command line: shell (Unix), Terminal (MacOS) or DOS (Windows).After you have installed and launched R, you can install the package muma, as described in Figure 1. You can install muma by typing the command
12、 install.packages(muma)and by chosing your CRAN mirror from the browser (Figure 1).FIGURE 1It could happen that installation fails, due to diverse R software versions. In this case, it should be sufficient to install the following packages, prior the installation of muma:install.packages(mvtnorm) in
13、stall.packages(robustbase) install.packages(gtools) install.packages(bitops) install.packages(caTools)and then run the commandinstall.packages(muma)Once muma is installed you can load the package by typing library(muma).Dataset formatData table of interest has to be submitted in .csv format and with
14、 a specific form, as indicated in Figure 2.FIGURE 2In particular:- the first column indicates the names of every samples (NOTE: these must be different from each other, even if samples belong to the same class; moreover, for an optimal graphical visualization, short names (4-5 characters) are recomm
15、ended);- the second column indicates the “Class” of each sample, with an integer, positive value, starting from 1.- From the third column to the column N are reported data values of each sample, for each variable.- The first row will be considered as header; within this row variables names must be p
16、rovided, everyone different from each other.The dataset in Figure 2 is provided with this tutorial and derives from a metabolomics analysis of B cell cultures untreated or after one, two, three and four days of LPS treatment (Garcia-Manteiga et al, 2011). As it can be observed from Figure 2, the “Cl
17、ass” column is filled according to the day of treatment.Analysis procedureFor starting the analysis move to the directory in which you have stored your data table, by selecting the option Change Working Directory from the menuMisc of the R Console. If you are not using the R console, but you decided
18、 to launch R from command line, just navigate to the directory in which your data table is stored, then launch R with the command R.1| Create the working directoryBefore starting the analysis it is recommended to create a new directory that will become the working directory from now on. This is reco
19、mmended because muma generates diverse files and directories, that could be useful to store in a unique directory. All the results created from mumas analyses will be stored here. You can use the function work.dir(dir.name=WorkDir)to create a new working directory, as indicated in Figure 3.FIGURE 3A
20、s it can be observed a directory called “WorkDir” has been created. All the files present in the first directory are copied in the new generated one. Automatically, this drectory become the current working directory.2| Start the analysisThe first step in muma analysis can be performed with the funct
21、ion explore.data(), which provides data pre-processing and dataset exploration. Figure 4 indicates a usage of such function. In particular it can be used in the following way: explore.data(file=YourFile.csv, scaling=ScalingType, scal=TRUE, normalize=TRUE, imputation=FALSE, imput=ImputType)This funct
22、ion generates three new directories:- “Groups”, in which are stored the samples of each group as identified by the “Class” column in the data table;- “PCA_Data_scalingused”, in which are stored the principal component analysis files as the matrices of score and loading values, as well as all the plo
23、ts and graphics PCA-related. Note: this directory is given different names according to the scaling used.;- “Preprocessing_Data_scalingused”, in which are stored all the files used for preprocessing the dataset, as the normalized and scaled tables. Note: this directory is given different names accor
24、ding to the scaling used.FIGURE 4A| In particular, this function reads the data table and converts all the negative values to 0 values, because metabolomics measurements resulting with negative values are considered noise or errors, hence are brought to a null baseline. A table called “NegativeValue
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- Supplementary Dr. S. Mari完整原版文件 Dr Mari 完整 原版 文件
限制150内