LDAK-KVIK options
Default parameters in LDAK-KVIK can be modified by adding options to the command line. Below, we present a list of options for each step.
Step 1
| Argument |
Description |
--kvik-step1 |
Name of the output files of the step 1 |
--bfile |
Name of the .bed file to be analyzed |
--bgen |
Name of the .bgen file to be analyzed |
--sample |
Name of the .sample file corresponding to the .bgen file |
--pheno |
Name of the phenotype file |
--covar |
Name of the quantitative covariate file |
--covar-numbers |
Specify a subset of covariates by number. For example, --covar-numbers 1,2,4-6,8 |
--covar-names |
Specify a subset of covariates by names. For example, --covar-names PC1,PC3,age |
--factors |
Name of the categorical covariate file |
--max-threads |
Number of threads (used for parallel computing) |
--binary YES |
Indicates that the analyzed phenotype is binary |
--mpheno |
Specifies the phenotype number to analyze (e.g., --mpheno 2 to analyze the second phenotype in the phenotype file). When using --mpheno ALL, all phenotypes are analyzed |
--num-pedigree-predictors |
The number of SNPs used when testing for structure (default: 512) |
–-check-pedigree NO |
Indicates that there will be no check for structure. In this case, it is assumed that there is structure |
-–num-MCMC |
Number of random vectors used to compute the heritability estimate in randomized Haseman-Elston regression (default: ten if \(n\) < 40000, three if \(n\) > 40000) |
-–num-divide |
Number of partitions used to compute the heritability estimate in randomized Haseman-Elston regression (default: 40) |
-–num-scans |
Number of scans performed by the Variational Bayes algorithm to construct PRS |
-–cv-proportion |
Proportion of individuals used to determine elastic net hyperparameters |
-–tolerance |
This number is multiplied by \(n\), and specifies the threshold for convergence for the likelihood in the Variational Bayes algorithm (default: \(10^{-6}\)) |
–-num-calibration-predictors |
Number of SNPs used to compute the Grammar-Gamma scaling factor (default: 20) |
Step 2
Please note that several arguments must match those used in Step 1 (i.e., you must use the same output filename, provide the same data and phenotype files, and if you used covariates in Step 1, you must also use them in Step 2).
| Argument |
Description |
--kvik-step2 |
Name of the output files of Step 2. Note that this should have the same name as Step 1. |
--bfile |
Name of the .bed file to be analyzed |
--bgen |
Name of the .bgen file to be analyzed |
--sample |
Name of the .sample file corresponding to the .bgen file |
--pheno |
Name of the phenotype file |
--covar |
Name of the quantitative covariate file |
--covar-numbers |
Specify a subset of covariates by number. For example, --covar-numbers 1,2,4-6,8 |
--covar-names |
Specify a subset of covariates by names. For example, --covar-numbers PC1,PC3,age |
--factors |
Name of the categorical covariate file |
--max-threads |
Number of threads (used for parallel computing) |
--mpheno |
Specifies the phenotype number to analyze (e.g., --mpheno 2 to analyze the second phenotype in the phenotype file). When using --mpheno ALL, all phenotypes are analyzed |
–-spa-test NO |
Indicates that no saddlepoint approximation will be used when testing binary phenotypes. Conversely, it is possible to specify --spa-test YES for quantative traits to analyze using SPA |
Step 3
| Argument |
Description |
--kvik-step3 |
Name of the output files of Step 3. Note that this should have the same name as Step 1 and Step 2. |
--bfile |
Name of the .bed file to be analyzed |
--bgen |
Name of the .bgen file to be analyzed |
--sample |
Name of the .sample file corresponding to the .bgen file |
--genefile |
Name of the gene annotation file. Instructions for downloading this can be found on the input page |
--max-threads |
Number of threads (used for parallel computing) |