LDAK-KVIK steps

LDAK-KVIK uses a three-step process to test SNPs and genes for association with the phenotype:

Compute a Leave-One-Chromosome-Out (LOCO) PRS using an Elastic Net model
Use the PRS as offset in single-SNP analysis
Use the summary statistics of single-SNP analysis to run a gene-based association analysis LDAK-GBAT

These steps should be run consecutively; each next step is dependent on the results of the previous step.

An example run of LDAK-KVIK is as follows:

./ldak6.linux --kvik-step1 kvik --bfile data --pheno phenofile --covar covfile --max-threads 2
./ldak6.linux --kvik-step2 kvik --bfile data --pheno phenofile --covar covfile --max-threads 2
./ldak6.linux --kvik-step3 kvik --bfile data --genefile genefile --max-threads 2

An overview of the LDAK-KVIK algorithm is included in the technical details. Here, we describe the command lines used the the main input options. Note that a more extensive list of input options is included in the the data format page.

Step 1

An example command line of Step 1 in LDAK-KVIK is given by:

./ldak6.linux --kvik-step1 kvik --bfile data --pheno phenofile --covar covfile --max-threads 2

The main input options are as follows:

Argument	Description
`--kvik-step1`	Name of the output files of the step 1
`--bfile`	Name of the .bed file to be analyzed
`--pheno`	Name of the phenotype file
`--covar`	Name of the covariate file
`--max-threads`	Integer value, specifying the number of available threads for fitting the elastic net. By default, LDAK assumes `--max-threads` equals 1
`--binary`	Indicates that the analysed trait is binary

When analysing binary traits, the user should add --binary YES to the command. All operations in Step 1 will then be based on weighted linear regression, instead of linear regression. When analysing imputed data, it is not necessary to include all SNPs in Step 1 (see Recommendations for alternative options). A more extensive list of options can be found in data format.

Step 2

An example command line of Step 2 in LDAK-KVIK is given by:

./ldak6.linux --kvik-step2 kvik --bfile data --pheno phenofile --covar covfile --max-threads 2

The main input options are as follows:

Argument	Description
`--kvik-step2`	Name of the output files of Step 2. Note that this should have the same name as Step 1.
`--bfile`	Name of the .bed file to be analyzed
`--pheno`	Name of the phenotype file
`--covar`	Name of the covariate file

Note that the input argument of --kvik-step2 (in this case, ‘kvik’) should match the argument previously used in --kvik-step1. In case the --binary YES flag is used in Step 1, this will automatically be used in Step 2, and a saddlepoint approximation will be used for association analysis. Note that Step 2 can only be run after Step1 has been run. More options for can be found in Data Filtering.

Step 3

An example command line of Step 3 in LDAK-KVIK is given by:

./ldak6.linux --kvik-step3 kvik --bfile data --genefile <annotfile> --max-threads 2

The main input options are as follows:

Argument	Description
`--kvik-step3`	Name of the output files of Step 3. Note that this should have the same name as Step 1 and Step 2.
`--bfile`	Name of the .bed file to be analyzed
`--genefile`	Name of the gene annotation file

Note that a gene annotation file is needed to specify the locations of the genes on the genome. The gene-based testing page includes command lines for downloading gene annotation files.