|
|
---
|
|
---
|
|
|
title: Assembly
|
|
title: Assembly
|
|
|
---
|
|
---
|
|
|
|
|
|
|
# Metagenomics assemblies #
|
|
# Metagenomics assemblies #
|
|
|
|
|
|
|
|
Metagenomics assemblies are almost inevitable in metagenomic analysis to reduce the fragmentation of informations and highlight sample characteristics. Reads are assembled into contigs using [megahit](https://github.com/voutcn/megahit) a fast assembler optimized for metagenomes and based on k-mers succinct de Bruijn graphs. Megahit is faster and perform comparably as SPAde and metaSPAdes. SPAde assemblies might be better in term of long contigs and ultra long contigs, it's also show a greater rate of missassemblies which required assembly curation [(1)](https://link.springer.com/article/10.1186/s12864-017-3918-9).
|
|
Metagenomics assemblies are almost inevitable in metagenomic analysis to reduce the fragmentation of informations and highlight sample characteristics. Reads are assembled into contigs using [megahit](https://github.com/voutcn/megahit) a fast assembler optimized for metagenomes and based on k-mers succinct de Bruijn graphs. Megahit is faster and perform comparably as SPAde and metaSPAdes. SPAde assemblies might be better in term of long contigs and ultra long contigs, it's also show a greater rate of missassemblies which required assembly curation [(1)](https://link.springer.com/article/10.1186/s12864-017-3918-9).
|
| ... | @@ -16,19 +17,13 @@ For now, misasassemblies are only detected, therefore, if it's required, assembl |
... | @@ -16,19 +17,13 @@ For now, misasassemblies are only detected, therefore, if it's required, assembl |
|
|
|
|
|
|
|
Single-assembly is a per-sample approach where reads from one sample are assembled into one assembly. This approach is used by the genes_collection module and might be used by the genomes_collection module.
|
|
Single-assembly is a per-sample approach where reads from one sample are assembled into one assembly. This approach is used by the genes_collection module and might be used by the genomes_collection module.
|
|
|
|
|
|
|
|
```magneto run single-assembly **snakemake.args ```
|
|
```magneto run single-assembly --config target=single_assembly **snakemake.args ```
|
|
|
|
|
|
|
|
## Co-assembly ##
|
|
## Co-assembly ##
|
|
|
|
|
|
|
|
Co-assembly require the [simka module](Modules/simka). In this case, reads from each samples of a cluster are assembled together. This approach might be used by the genomes_collection module.
|
|
Co-assembly require the [simka module](Modules/simka). In this case, reads from each samples of a cluster are assembled together. This approach might be used by the genomes_collection module.
|
|
|
|
|
|
|
|
```magneto run co-assembly **snakemake.args ```
|
|
```magneto run assembly --config target=co_assembly **snakemake.args ```
|
|
|
|
|
|
|
|
## Assembly ##
|
|
|
|
|
|
|
|
|
|
Finally, both single-assembly and co-assembly might be ran using the following command line :
|
|
|
|
|
|
|
|
|
|
```magneto run assembly **snakemake.args ```
|
|
|
|
|
|
|
|
|
|
-----
|
|
-----
|
|
|
[Previous - Simka (Module)](Modules/simka)
|
|
[Previous - Simka (Module)](Modules/simka)
|
| ... | |
... | |
| ... | | ... | |