Parameters
This chapter introduces the parameters of each module in detail.
Genotype Dimensionality Reduction Module (genoidx)
-g
: Genotype file used for genome-wide genotype dimensionality reduction analysis, support plink bed format and hapmap format,the plink bed format file only needs to provide the file prefix, the hapmap format needs to be used together with the parameter -convert
-genome_cluster
: Perform genome-wide genotype dimensionality reduction analysis on genotype file
-w
: Window size for genome-wide genotype dimensionality reduction analysis(bp), default 1000000
-s
: Step size for genome-wide genotype dimensionality reduction analysis(bp), default 500000
-convert
: Convert hapmap genotype format to plink bed format genotype
-clump
: Clumping analysis for genotype file, used to keep a subset of SNPs that are nearly uncorrelated with each other, thereby reducing the number of total SNPs to speed up genome-wide genotype dimensionality reduction analysis
-p
: Number of threads for genome cluster analysis,max threads is total number of genome chromosomes
-o
: The prefix of the output file, default MODAS_genoidx_output
Phenotype preprocessing and transformation module (phenorm)
-phe
: Phenotype file for phenotype preprocessing and transformation, the file format is csv format, the first column and the first row are the names of inbred lines and phenotypes respectively
-g
: Genotype file in plink bed format for principal component analysis, used with the parameter -pca
to correct differences in phenotype caused by population structure
-pca
: Correct the differences in phenotype caused by population structure through PCA
-r
: Phenotype missing ratio filter cutoff
-v
: Phenotype value filter cutoff
-log2
: log2 transformation of phenotype
-log10
: log10 transformation of phenotype
-ln
: ln transformation of phenotype
-qqnorm
: Normal quantile transformation of phenotype
-norm
: Boxcox transformation of phenotype
-o
: The prefix of the output file, default MODAS_phenorm_output
Omics Phenotype Pre-screening Module (prescreen)
-g
: genotype file in plink bed format, used to calculate the kinship matrix for GWAS analysis
-genome_cluster
: pseudo-genotype file for phenotype pre-screening, generated by subcommand genoidx
-phe
: phenotype file for phenotype pre-screening analysis, the file format is csv format, the first column and the first row are the names of inbred lines and phenotypes respectively
-lm_suggest_pvalue
: linear model suggest pvalue for pseudo-SNP filter used for association analysis in mixed linear model, default 1/ pseudo-genotype file pseudo-snp number
-lmm_suggest_pvalue
: lmm suggest pvalue for phenotype prescreen, used to obtain candidate phenotypes with significant association analysis signals and significant association regions of phenotypes, default 1e-6
-p
: number of threads for analysis in prescreen sub command
-o
: The prefix of the output file, default MODAS_prescreen_output
Regional gwas identify QTL module (regiongwas)
-g
: Genotype file in plink bed format, used to calculate the kinship matrix for regional gwas analysis and extract snp in significant association regions of phenotypes
-phe
: Candidate phenotype file generated by subcommand prescreen
-phe_sig_qtl
: Significant association regions of candidate phenotypes file generated by subcommand prescreen
-p1
: Significance threshold for index SNPs, used to determine the phenotype with QTL and index snps of phenotype
-p2
: Secondary significance threshold for clumped SNPs, Used to obtain the secondary significant SNP linked to index snp to determine the reliability of the QTL. MODAS outputs the QTL containing 10 secondary significant SNPs as the phenotypic QTL result
-cluster
: Cluster phenotype by QTL
-p
: Number of threads for regional gwas analysis, default 1
-o
: The prefix of the output file, default MODAS_regiongwas_out
Mendelian randomization analysis module(mr)
-g
: Genotype file in plink bed format
-exposure
: Exposure phenotype file, such as mTrait phenotype, the file format is csv format, the first column and the first row are the names of inbred lines and phenotypes respectively
-outcome
: Outcome phenotype file, such as pTrait phenotype, the file format is csv format, the first column and the first row are the names of inbred lines and phenotypes respectively
-qtl
: Exposure phenotype QTL file, generated by subcommand regiongwas
-lm
: Perform Mendelian Randomization through linear model
-mlm
: Perform Mendelian Randomization through mixed linear model
-p
: Number of threads for Mendelian Randomization analysis
-pvalue
: The pvalue cutoff of Mendelian randomization analysis result output,default 1
-net
: Mendelian Randomization network analysis, used to identify functional modules
-o
: The prefix of the output file, default MODAS_mr_out
Whole genome-wide association analysis and visualization module(visual)
-g
: Genotype file in plink bed format for whole genome-wide association analysis
-phe
: phenotype file for GWAS analysis and visualization
-gwas_model
: model for GWAS analysis, supports the models provided by gemma, rMVP, and GAPIT software, including gemma_LM, gemma_MLM, rMVP_GLM, rMVP_MLM, rMVP_FarmCPU, GAPIT_GLM, GAPIT_MLM, GAPIT_CMLM, GAPIT_MLMM, GAPIT_FarmCPU, GAPIT_Blink. Default is gemma_MLM
-qtl
: phenotype QTL file generated by subcommand regiongwas
-visual
: Generate web-based GWAS visualization results
-anno
: gene annotation file for QTL annotation, used to display QTL information in visualized results
-p
: Number of threads genome-wide association analysis and visualization
-o
: The prefix of the output file,default MODAS_visual_out
Construction of stress response index and identification of stress-responsive QTL(contrast)
-stress_phe
: omics dataset under stress treatment
-control_phe
: omics dataset under control treatment
-alpha
: Contrast strength for contrastive PCA, it represents the trade-off between having the high stress variance and the low control variance, default 1
-n_comp
: Number of components for contrastive PCA, default 2
-beta_test_pvalue
: Significance threshold for stress-responsive CPCA PC, used to determine the CPCA PC representing stress response effect of molecular features, default 1/number of CPCA PC for beta test
-gwas
: perform GWAS on stress response index to identify stress-responsive molecular QTL
-g
: Genotype file in plink bed format, used to perform GWAS on stress response index
-genome_cluster
: Pseudo-genotype file for phenotype pre-screening, generated by subcommand genoidx
-anova_pvalue
: Significance threshold of two-way ANOVA, used to determine stress-responsvie molecular QTL, default 0.05
-gwas_model
: GWAS model for association analysis, supporting LM, GLM and MLM, default MLM
-p
: Number of threads for gwas analysis, default 1
-o
: The prefix of the output file, default MODAS_contrast_out
Co-localization analysis of QTLs for multiple traits(coloc)
-qtl
: molecular QTL file for co-localization analysis
-g
: Genotype file in plink bed format, used to extract genotype of SNPs associated with molecular features
-gwas_dir
: directory containing GWAS results for molecular features, used to extract SNPs associated with molecular features. It is recommended to use the association analysis results generated by the MODAS subcommand regiongwas
-metric
: metric for determining clustering patterns of kinship, it can be calinski_harabasz or silhouette, defualt calinski_harabasz
-eps
: The maximum distance between two traits used to consider a trait to co-localize with the other trait, defualt 0.2
-pvalue
: Threshold of GWAS results for screening SNPs associated with molecular features, defualt 1e-6
-p
: Number of threads for co-localization analysis, default 1
-o
: The prefix of the output file, default MODAS_coloc_out