|
|
|
# File format #
|
|
|
|
|
|
|
|
Metagenomics samples must be stored in a unique directory. Because there is a high variability in filename format, the difficulty to automatise paired-end data recognition increase. Filename are constrained and must respect the following format :
|
|
|
|
|
|
|
|
<sample_name>(. | _)(r1 | R1 | r2 | R2)(fastq | fq | fastq.gz | fq.gz)
|
|
|
|
|
|
|
|
exemple :
|
|
|
|
S1.R1.fastq
|
|
|
|
S1.R2.fastq
|
|
|
|
S2_r1.fq
|
|
|
|
S2_r2.fq
|
|
|
|
|
|
|
|
*NOTE* : for now, fastq files must be uncompressed - I'm actually adding a rule for temporary decompressed files.
|
|
|
|
|
|
|
|
# Installation #
|
|
|
|
|
|
|
|
git clone git@gitlab.univ-nantes.fr:bird_pipeline_registry/genosysmics.git
|
|
|
|
cd genosysmics
|
|
|
|
(optional) conda create -n <project>
|
|
|
|
(optional) conda activate <project>
|
|
|
|
pip install .
|
|
|
|
genosysmics init <path/to/dir/samples.fq> --wd <project_dir>
|
|
|
|
|
|
|
|
|
|
|
|
genosysmics should be in your PATH, otherwise source your virtual environment.
|
|
|
|
|
|
|
|
|
|
|
|
[Previous](Genosysmics)
|
|
|
|
[Next](Usage)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
:warning: Here is the current installation method, but we are working on packaging the workflow so that it is directly available in Bioconda :warning:
|
|
|
|
|
|
|
|
# Dependencies
|
|
|
|
|
|
|
|
A working installation of **conda** and **git** is mandatory to build magneto. If you have **mamba** already install on your system, the creation of the main environment will be faster.
|
|
|
|
|
|
|
|
* python 3.7+
|
|
|
|
* python 2.7
|
|
|
|
* snakemake 6.3.0
|
|
|
|
* mamba 1.5.8
|
|
|
|
* conda 4.10.3
|
|
|
|
* click 8.01
|
|
|
|
|
|
|
|
Other dependencies (such as python libraries for analysis, or to compute programs) are installed through the setup.py and conda management.\
|
|
|
|
The default conda libraries management for snakemake is mamba, since a couple of month now. Even if it is possible to use conda instead of mamba for conda libraries management, the design of Magneto set mamba mandatory, as `--conda-frontend conda` does not propagate to the subworkflows.\
|
|
|
|
Except if you use your own databases or already have downloaded them, MAGNETO will also require an internet connection.
|
|
|
|
|
|
|
|
# Installation
|
|
|
|
|
|
|
|
## Main conda environment installation
|
|
|
|
|
|
|
|
Start by creating a conda environment containing snakemake, mamba and the python module click:
|
|
|
|
|
|
|
|
```bash
|
|
|
|
conda create -n magneto snakemake-minimal=6.3.0 click=8.01 mamba=1.5.8 -c bioconda -c conda-forge
|
|
|
|
```
|
|
|
|
|
|
|
|
> Note
|
|
|
|
>
|
|
|
|
> * If you have **mamba** or **micromamba** already installed, you can create the environment with it instead of conda.
|
|
|
|
|
|
|
|
Then, activate your environment:
|
|
|
|
|
|
|
|
```
|
|
|
|
conda activate magneto
|
|
|
|
```
|
|
|
|
|
|
|
|
## Installation of magneto module in the conda environment
|
|
|
|
|
|
|
|
Installation is performed using `pip`:
|
|
|
|
|
|
|
|
```
|
|
|
|
git clone https://gitlab.univ-nantes.fr/bird_pipeline_registry/magneto.git
|
|
|
|
python3 -m pip install magneto/
|
|
|
|
```
|
|
|
|
|
|
|
|
**Magneto** is now installed in the "magneto" conda environment. Activate your environment whenever you need to run the pipeline !!
|
|
|
|
|
|
|
|
[Previous - MAGNETO overview](https://gitlab.univ-nantes.fr/bird_pipeline_registry/magneto/-/wikis/Magneto)
|
|
|
|
[Next - Usage](https://gitlab.univ-nantes.fr/bird_pipeline_registry/magneto/-/wikis/Usage)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|