QIIME (version 1.9)¶

Modules included in this section

qiime_prep
qiime_demult
qiime_chimera
qiime_pick_otus
qiime_pick_rep_set
qiime_align_seqs
qiime_filter_alignment
qiime_assign_taxonomy
qiime_make_phylogeny
qiime_make_otu_table
qiime_filter_samples_from_otu_table
qiime_filter_otus
qiime_sort_otu_table
qiime_divers

`qiime_prep`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for preparing fastq reads for analysis with QIIME (1.9):

The reads stored in each sample are optionally joined and then placed in a directory arranged so that, downstream, QIIME’s demult step can concatenate the sequences while retaining each read’s sample of origin.

The directory will contain symbolic links to the files to be used by demult in the following step.
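
The idea of a links directory can be sketched roughly as follows. This is only an illustration, not the module's actual code: the function name `build_links_dir`, the `.fastq` link suffix and the toy file names are all hypothetical, and the naming scheme (one link per sample, named after the sample) is an assumption.

```python
import os
import tempfile

def build_links_dir(sample_files, links_dir):
    """Create one symbolic link per sample, named after the sample.

    The '<sample>.fastq' naming scheme is hypothetical; it only illustrates
    how a file name can carry the sample of origin.
    """
    os.makedirs(links_dir, exist_ok=True)
    links = {}
    for sample, path in sorted(sample_files.items()):
        link = os.path.join(links_dir, sample + ".fastq")
        if os.path.lexists(link):
            os.remove(link)  # replace a stale link from a previous run
        os.symlink(os.path.abspath(path), link)
        links[sample] = link
    return links

# Toy usage with two empty placeholder read files.
workdir = tempfile.mkdtemp()
sample_files = {}
for name in ("sampleA", "sampleB"):
    path = os.path.join(workdir, name + "_raw_reads.fq")
    open(path, "w").close()
    sample_files[name] = path

links = build_links_dir(sample_files, os.path.join(workdir, "links"))
print(sorted(os.listdir(os.path.join(workdir, "links"))))
```

Because the links only point at the original files, no read data is copied.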

Requires¶

fastq files in one of the following slots:
- sample_data[<sample>]["fastq.F"]
- sample_data[<sample>]["fastq.R"]
- sample_data[<sample>]["fastq.S"]

Output¶

Puts directory of links to files to use with QIIME:
- self.sample_data["project_data"]["qiime.prep_links_dir"]
If join is performed:
- puts the new joined reads in:
  self.sample_data[<sample>]["fastq.J"]
- puts the unjoined forward reads in:
  self.sample_data[<sample>]["fastq.F"]
- puts the unjoined reverse reads in:
  self.sample_data[<sample>]["fastq.R"]

Parameters that can be set¶

Parameter	Values	Comments
join	none, join (or join_cat - not implemented)	Whether to join paired reads.
unjoined	forward, reverse, both or none	What to do with unjoined sequences? Use only forward, only reverse, both or none. If join is none, use this parameter to indicate which reads to take for analysis.
join_algo	forward, reverse, both or none	What to do with unjoined sequences?
parameters		Path to QIIME parameter file to be used downstream
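
One possible reading of how `join` and `unjoined` combine is sketched below. The function `reads_for_analysis` is hypothetical and the selection logic is an assumption based on the table above, not a description of the module's internals.

```python
def reads_for_analysis(slots, join="none", unjoined="forward"):
    """Return the names of the slots whose reads would be passed on.

    Assumed logic: joined reads are used when join == 'join', and
    'unjoined' decides which of the forward/reverse reads are added.
    """
    if join not in ("none", "join"):
        raise ValueError("join must be 'none' or 'join'")
    if unjoined not in ("forward", "reverse", "both", "none"):
        raise ValueError("unjoined must be forward, reverse, both or none")

    chosen = []
    if join == "join":
        chosen.append("fastq.J")
    if unjoined in ("forward", "both"):
        chosen.append("fastq.F")
    if unjoined in ("reverse", "both"):
        chosen.append("fastq.R")
    # Only report slots that actually exist for this sample.
    return [slot for slot in chosen if slot in slots]

# Toy sample with placeholder paths.
sample = {"fastq.F": "fwd.fq", "fastq.R": "rev.fq", "fastq.J": "joined.fq"}
print(reads_for_analysis(sample, join="join", unjoined="forward"))
```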

Lines for parameter file¶

q_prep_1:
    module: qiime_prep
    base: merge1
    script_path: /path/to/join_paired_ends.py
    join: join
    unjoined: forward
    parameters: /path/to/qiime_params.txt
    redirects:
        --pe_join_method: fastq-join

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_demult`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s multiple_split_libraries_fastq.py:

The reads from step qiime_prep are combined into one seqs.fna file.

Note

The module has not been tested on other types of data, such as undemultiplexed reads. It should work but there will probably be unexpected problems.
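
To check that the combined seqs.fna still carries the sample of origin, the reads per sample can be counted from the sequence labels. The sketch below assumes the usual QIIME 1 label layout, `>SampleID_N`, followed by free-text fields; the function name and the toy records are made up for illustration.

```python
from collections import Counter
import io

def reads_per_sample(fasta_handle):
    """Count reads per sample, assuming labels of the form '>SampleID_N ...'."""
    counts = Counter()
    for line in fasta_handle:
        if line.startswith(">"):
            label = line[1:].split()[0]          # e.g. 'sampleA_0'
            sample_id = label.rsplit("_", 1)[0]  # drop the running number
            counts[sample_id] += 1
    return counts

# Toy seqs.fna content (not real data).
toy = io.StringIO(
    ">sampleA_0 read1\nACGT\n"
    ">sampleA_1 read2\nACGG\n"
    ">sampleB_2 read3\nTTGA\n"
)
counts = reads_per_sample(toy)
print(dict(counts))
```

Splitting on the last underscore keeps sample names that themselves contain underscores intact.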

Requires¶

A directory of read files with sample names coded in the file names, such as the directory produced by qiime_prep:
- sample_data["qiime.prep_links_dir"]

Output¶

Puts the resulting seqs.fna file in the following slots:
- self.sample_data["project_data"]["qiime.demult_seqs"]
- self.sample_data["project_data"]["qiime.fasta"]
- self.sample_data["project_data"]["fasta.nucl"]

Lines for parameter file¶

q_demult_1:
    module: qiime_demult
    base: q_prep_1
    script_path: '/path/to/multiple_split_libraries_fastq.py'
    redirects:
        --demultiplexing_method: sampleid_by_file
        --include_input_dir_path: null
        --parameter_fp: /path/to/qiime_params
        --remove_filepath_in_name: null

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_chimera`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s identify_chimeric_seqs.py:

The module can operate on the raw seqs.fna or on an aligned version. The latter is used for ChimeraSlayer and the former for usearch61.

Requires¶

A fasta file in:
- sample_data["qiime.fasta"]
Alternatively, an aligned fasta file in:
- sample_data["fasta.aligned"]
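
The comment in the parameter example further down says the method is guessed from the existing files when it is not set. A guess of that kind might look like the sketch below. The function `guess_chimera_method` is hypothetical and the order of the checks is an assumption.

```python
def guess_chimera_method(project_data, method=None):
    """Pick a chimera detection method from the available slots.

    Assumption: an aligned fasta implies ChimeraSlayer, otherwise the
    unaligned QIIME fasta implies usearch61.
    """
    if method is not None:
        if method not in ("usearch61", "ChimeraSlayer"):
            raise ValueError("method must be usearch61 or ChimeraSlayer")
        return method
    if "fasta.aligned" in project_data:
        return "ChimeraSlayer"
    if "qiime.fasta" in project_data:
        return "usearch61"
    raise KeyError("no fasta slot available for chimera detection")

print(guess_chimera_method({"qiime.fasta": "seqs.fna"}))
print(guess_chimera_method({"fasta.aligned": "aligned.fasta"}))
```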

Output¶

Puts the resulting list of chimeras in:
- self.sample_data["project_data"]["chimeras"]
Puts the filtered fasta file in:
- self.sample_data["project_data"]["fasta.chimera_removed"]
- self.sample_data["project_data"]["fasta.nucl"]

Note

When using parallel_identify_chimeric_seqs.py, the module tries to build the scripts appropriately. It is wise to check the parallel scripts before running them…

Parameters that can be set¶

Parameter	Values	Comments
method	usearch61 or ChimeraSlayer	Method to use for the analysis (passed to the --chimera_detection_method option of `identify_chimeric_seqs.py`).

Lines for parameter file¶

q_chimera_usrch:
    module: qiime_chimera
    base: q_demult_1
    # script_path: '{Vars.qiime_path}/parallel_identify_chimeric_seqs.py'
    script_path: '{Vars.qiime_path}/identify_chimeric_seqs.py'
    method:         usearch61 # Or ChimeraSlayer. Will guess depending on existing files.
    redirects:
        # --jobs_to_start:              20
        --aligned_reference_seqs_fp:  /path/to/reference_files.otus_aligned
        --reference_seqs_fp:  /path/to/reference_files.otus

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_pick_otus`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s pick_otus.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]

Output¶

Puts the resulting OTU table in:
- self.sample_data["project_data"]["otu_table"]

Lines for parameter file¶

q_pick_otu_1:
    module: qiime_pick_otus
    base: q_chimera_usrch
    script_path: '{Vars.qiime_path}/pick_otus.py'
    setenv: {Vars.qiime_env}

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_pick_rep_set`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s pick_rep_set.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]
An OTU table in:
- sample_data["otu_table"]

Output¶

Puts the resulting fasta file in:
- self.sample_data["project_data"]["fasta.nucl"]
Saves the original fasta file in:
- self.sample_data["project_data"]["qiime.full_fasta"]

Lines for parameter file¶

q_rep_set_1:
    module: qiime_pick_rep_set
    base: q_pick_otu_1
    script_path: '{Vars.qiime_path}/pick_rep_set.py'
    setenv: {Vars.qiime_env}

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_align_seqs`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME's align_seqs.py:

Can be used for the parallel versions thereof: parallel_align_seqs_pynast.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]

Output¶

Puts the resulting aligned fasta file in:
- self.sample_data["project_data"]["fasta.nucl"]
- self.sample_data["project_data"]["fasta.aligned"]
Stores the old, unaligned version in:
- self.sample_data["project_data"]["fasta.unaligned"]
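
The slot bookkeeping listed above can be pictured as a small dictionary update. This is a sketch only: `store_alignment` and the file names are hypothetical, and a plain dict stands in for `self.sample_data["project_data"]`.

```python
def store_alignment(project_data, aligned_path):
    """Record an alignment while keeping the previous fasta reachable."""
    # Keep the old, unaligned file before overwriting the active slot.
    project_data["fasta.unaligned"] = project_data["fasta.nucl"]
    project_data["fasta.aligned"] = aligned_path
    project_data["fasta.nucl"] = aligned_path
    return project_data

project_data = {"fasta.nucl": "rep_set.fasta"}
store_alignment(project_data, "rep_set_aligned.fasta")
print(project_data)
```

Since `fasta.nucl` now points at the alignment, any downstream step that reads `fasta.nucl` picks up the aligned file without extra configuration.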

Note

When using parallel_align_seqs_pynast.py, the module tries to build the scripts appropriately. It is wise to check the parallel scripts before running them…

Lines for parameter file¶

q_align_para:
    module: qiime_align_seqs
    base: q_rep_set_1
    script_path: '{Vars.qiime_path}/parallel_align_seqs_pynast.py'
    setenv: {Vars.qiime_env}
    redirects:
        --jobs_to_start: 5
        --retain_temp_files: 

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_filter_alignment`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s filter_alignment.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]

Output¶

Puts the resulting aligned fasta file in:
- self.sample_data["project_data"]["fasta.nucl"]
Saves the original, unfiltered aligned fasta file in:
- self.sample_data["project_data"]["fasta.aligned_unfiltered"]

Lines for parameter file¶

q_filt_align_1:
    module: qiime_filter_alignment
    base: q_align_para
    script_path: '{Vars.qiime_path}/filter_alignment.py'
    setenv: {Vars.qiime_env}

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_assign_taxonomy`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s assign_taxonomy.py

Can also be used to run the parallel versions of the program:

parallel_assign_taxonomy_blast.py

parallel_assign_taxonomy_rdp.py

parallel_assign_taxonomy_uclust.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]

Output¶

Puts the resulting taxonomy assignment in:
- self.sample_data["project_data"]["taxonomy"]

Note

When using the parallel version, the module tries to build the scripts appropriately. It is wise to check the parallel scripts before running them…

Lines for parameter file¶

q_tax_asn_1:
    module: qiime_assign_taxonomy
    base: q_rep_set_1
    script_path: '{Vars.qiime_path}/parallel_assign_taxonomy_rdp.py'
    setenv: {Vars.qiime_env}
    redirects:
        --confidence: 0.5
        --id_to_taxonomy_fp: {Vars.reference_files.id_to_taxonomy}
        --jobs_to_start: 20
        --rdp_max_memory: 50000
        --reference_seqs_fp: {Vars.reference_files.otus}

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_make_phylogeny`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s make_phylogeny.py

Requires¶

A fasta file in:
- sample_data["fasta.nucl"]

Output¶

Puts the resulting phylogenetic tree in:
- self.sample_data["project_data"]["phylotree"]

Lines for parameter file¶

q_phylo_1:
    module: qiime_make_phylogeny
    base: q_filt_align_1
    script_path: '{Vars.qiime_path}/make_phylogeny.py'
    setenv: {Vars.qiime_env}

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_make_otu_table`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s make_otu_table.py:

The module creates a BIOM table based on the OTU table and a taxonomy assignment if available (it will be available if a qiime_assign_taxonomy step is in the branch).

If chimera checking has been performed, the suspected chimeric sequences will be removed from the BIOM table.

The module also adds code for creating a summary of the BIOM table and a tab-delimited version thereof.
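
Conceptually, building the table amounts to counting reads per OTU and per sample while skipping reads flagged as chimeric. The sketch below is not how make_otu_table.py is implemented. It assumes an OTU map with one OTU per line (an OTU ID followed by tab-separated read labels of the form `SampleID_N`), and the function name and toy data are hypothetical.

```python
from collections import defaultdict

def otu_counts(otu_map_lines, chimeras=()):
    """Count reads per (OTU, sample), skipping reads listed as chimeras."""
    chimeras = set(chimeras)
    table = defaultdict(lambda: defaultdict(int))
    for line in otu_map_lines:
        fields = line.rstrip("\n").split("\t")
        otu_id, read_labels = fields[0], fields[1:]
        for label in read_labels:
            if label in chimeras:
                continue  # suspected chimera: leave it out of the table
            sample = label.rsplit("_", 1)[0]
            table[otu_id][sample] += 1
    # Convert to plain dicts for easier printing and comparison.
    return {otu: dict(samples) for otu, samples in table.items()}

# Toy OTU map and chimera list (not real data).
otu_map = [
    "OTU1\tsampleA_0\tsampleA_1\tsampleB_2",
    "OTU2\tsampleB_3",
]
table = otu_counts(otu_map, chimeras=["sampleA_1"])
print(table)
```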

Requires¶

An OTU table:
- sample_data["otu_table"]

Optional¶

A taxonomy assignment of the sequences:
- sample_data["taxonomy"]

Output¶

Puts the BIOM table in:
- self.sample_data["project_data"]["biom_table"]
Puts the BIOM table summary in:
- self.sample_data["project_data"]["biom_table_summary"]
Puts the BIOM table in tab-delimited format in:
- self.sample_data["project_data"]["biom_table_tsv"]
If a fasta.chimera_removed file exists, will put the unfiltered BIOM table in:
- self.sample_data["project_data"]["unfiltered_biom_table"]

Parameters that can be set¶

Parameter	Values	Comments
skip_summary		If passed, will not create the BIOM table summary.
skip_tsv		If passed, will not create the tsv version of the BIOM table.

Lines for parameter file¶

q_mk_otu_1:
    module: qiime_make_otu_table
    base: q_phylo_1
    script_path: '{Vars.qiime_path}/make_otu_table.py'
    setenv: {Vars.qiime_env}
    # skip_summary:
    # skip_tsv:
    redirects:
        --mapping_fp: /path/to/qiime1_mapping.txt

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_filter_samples_from_otu_table`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s filter_samples_from_otu_table.py

Requires¶

A BIOM table in:
- sample_data["biom_table"]

Output¶

Puts the resulting BIOM table in:
- self.sample_data["project_data"]["biom_table"]
Puts the BIOM table summary in:
- self.sample_data["project_data"]["biom_table_summary"]
Puts the BIOM table in tab-delimited format in:
- self.sample_data["project_data"]["biom_table_tsv"]
Puts the unfiltered BIOM table in:
- self.sample_data["project_data"]["prefilter_biom_table"]

Parameters that can be set¶

Parameter	Values	Comments
skip_summary		If passed, will not create the BIOM table summary.
skip_tsv		If passed, will not create the tsv version of the BIOM table.

Lines for parameter file¶

filt_samp_1:
    module: qiime_filter_samples_from_otu_table
    base: q_mk_otu_1
    script_path: '{Vars.qiime_path}/filter_samples_from_otu_table.py'
    setenv: {Vars.qiime_env}
    redirects:
        --mapping_fp: /path/to/mapping.txt
        --min_count: 100000
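
The effect of `--min_count` can be illustrated on a toy table held as a nested dict (OTU, then sample, then count). This is a sketch under assumptions: the function `filter_samples` is hypothetical, the threshold is treated as inclusive, and the real script works on BIOM files rather than dicts.

```python
def filter_samples(table, min_count):
    """Drop samples whose total read count is below min_count."""
    totals = {}
    for samples in table.values():
        for sample, count in samples.items():
            totals[sample] = totals.get(sample, 0) + count
    keep = {sample for sample, total in totals.items() if total >= min_count}
    # OTU rows are kept even if all of their samples were dropped.
    return {
        otu: {s: c for s, c in samples.items() if s in keep}
        for otu, samples in table.items()
    }

# Toy counts (not real data): sampleB has only 3 reads in total.
toy_table = {
    "OTU1": {"sampleA": 120, "sampleB": 2},
    "OTU2": {"sampleA": 30, "sampleB": 1},
}
filtered = filter_samples(toy_table, min_count=100)
print(filtered)
```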

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_filter_otus`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s filter_otus_from_otu_table.py

Requires¶

A BIOM table in:
- sample_data["biom_table"]

Output¶

Puts the resulting BIOM table in:
- self.sample_data["project_data"]["biom_table"]
Puts the BIOM table summary in:
- self.sample_data["project_data"]["biom_table_summary"]
Puts the BIOM table in tab-delimited format in:
- self.sample_data["project_data"]["biom_table_tsv"]
Puts the unfiltered BIOM table in:
- self.sample_data["project_data"]["prefilter_biom_table"]

Parameters that can be set¶

Parameter	Values	Comments
skip_summary		If passed, will not create the BIOM table summary.
skip_tsv		If passed, will not create the tsv version of the BIOM table.

Lines for parameter file¶

q_filt_otus_1:
    module: qiime_filter_otus
    base: filt_samp_1
    script_path: '{Vars.qiime_path}/filter_otus_from_otu_table.py'
    setenv: {Vars.qiime_env}
    redirects:
        --min_count_fraction: 0.00005
        --min_samples: 10
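
The two thresholds in the example can be read as follows: an OTU is kept only if its total count reaches a given fraction of all counts in the table and if it is seen in enough samples. The sketch below illustrates that reading on a nested dict. The function `filter_otus` is hypothetical, both thresholds are treated as inclusive (an assumption), and the real script operates on BIOM files.

```python
def filter_otus(table, min_count_fraction=0.0, min_samples=0):
    """Keep OTUs that are abundant enough and seen in enough samples."""
    grand_total = sum(sum(samples.values()) for samples in table.values())
    min_count = min_count_fraction * grand_total
    kept = {}
    for otu, samples in table.items():
        otu_total = sum(samples.values())
        n_samples = sum(1 for count in samples.values() if count > 0)
        if otu_total >= min_count and n_samples >= min_samples:
            kept[otu] = samples
    return kept

# Toy counts (not real data); the grand total is 100 reads.
toy_table = {
    "OTU1": {"sampleA": 90, "sampleB": 8},
    "OTU2": {"sampleA": 1},
    "OTU3": {"sampleA": 1, "sampleB": 0},
}
print(sorted(filter_otus(toy_table, min_count_fraction=0.02)))
print(sorted(filter_otus(toy_table, min_samples=2)))
```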

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_sort_otu_table`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s sort_otu_table.py

Requires¶

A BIOM table in:
- sample_data["biom_table"]

Output¶

Puts the resulting BIOM table in:
- self.sample_data["project_data"]["biom_table"]
Puts the BIOM table summary in:
- self.sample_data["project_data"]["biom_table_summary"]
Puts the BIOM table in tab-delimited format in:
- self.sample_data["project_data"]["biom_table_tsv"]

Parameters that can be set¶

Parameter	Values	Comments
skip_summary		If passed, will not create the BIOM table summary.
skip_tsv		If passed, will not create the tsv version of the BIOM table.

Lines for parameter file¶

q_sort_otus_1:
    module: qiime_sort_otu_table
    base: filt_samp_1
    script_path: '{Vars.qiime_path}/sort_otu_table.py'
    setenv: {Vars.qiime_env}
    redirects:
        --sort_field:   XXX

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.

`qiime_divers`¶

Authors:	Menachem Sklarz
Affiliation:	Bioinformatics core facility
Organization:	National Institute of Biotechnology in the Negev, Ben Gurion University.

Note

This module was developed as part of a study led by Dr. Jacob Moran Gilad

A module for running QIIME’s core_diversity_analyses.py:

The module creates a BIOM table based on the OTU table and a taxonomy assignment if available (it will be available if a qiime_assign_taxonomy step is in the branch).

If chimera checking has been performed, the suspected chimeric sequences will be removed from the BIOM table.

The module also adds code for creating a summary of the BIOM table and a tab-delimited version thereof.

Requires¶

A BIOM table:
- sample_data["biom_table"]

Optional¶

A phylogenetic tree:
- sample_data["phylotree"]

Output¶

Puts the core diversity directory name in:
- self.sample_data["project_data"]["diversity"]

Parameters that can be set¶

Parameter	Values	Comments
--mapping_fp		A path to the QIIME mapping file (if not set, will use the mapping file passed in `qiime_prep`).
--parameter_fp		A path to a QIIME parameter file.

Lines for parameter file¶

q_divers_1:
    module: qiime_divers
    base: q_filt_otus_1
    script_path: /path/to/QIIME/bin/core_diversity_analyses.py
    qsub_params:
        -pe: shared 20
    sampling_depth: 109897
    redirects:
        --categories: Disease,sex
        --parameter_fp: /path/to/parameter_file
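
The values passed to `--categories` are expected to be column names in the mapping file, so it can be worth checking them before submitting the job. The helper below is a hypothetical pre-flight check, not part of the module. It assumes a tab-delimited mapping file whose header line starts with `#SampleID`, and the toy header is invented.

```python
def missing_categories(mapping_header_line, categories):
    """Return the requested categories that are not mapping-file columns."""
    columns = mapping_header_line.rstrip("\n").lstrip("#").split("\t")
    return [name for name in categories.split(",") if name not in columns]

# Toy mapping-file header (not real data); column names are case-sensitive.
header = "#SampleID\tBarcodeSequence\tLinkerPrimerSequence\tDisease\tDescription"
print(missing_categories(header, "Disease,sex"))
```

An empty list means every requested category was found in the header.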

References¶

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K., Fierer, N., Peña, A.G., Goodrich, J.K., Gordon, J.I. and Huttley, G.A., 2010. “QIIME allows analysis of high-throughput community sequencing data”. Nature methods, 7(5), pp.335-336.