|
1 | 1 | Metadata-Version: 1.1 |
2 | 2 | Name: secapr |
3 | | -Version: 1.1.12+48.gb3586ef.dirty |
| 3 | +Version: 2.0.2+27.g068c2ad.dirty |
4 | 4 | Summary: Process sequence-capture fastq files into alignments for phylogenetic analyses |
5 | 5 | Home-page: https://github.com/AntonelliLab/seqcap_processor |
6 | 6 | Author: Tobias Andermann |
7 | 7 | |
8 | 8 | License: MIT |
9 | | -Description: # <img src="images/secapr_logo.png" width="100"> SEquence CApture PRocessor (SECAPR) |
| 9 | +Description: # SEquence CApture PRocessor (SECAPR) |
10 | 10 |
|
11 | | - **Original Publication: https://doi.org/10.7717/peerj.5175** |
| 11 | +  |
12 | 12 |
|
13 | | - ___ |
| 13 | + [](http://bioconda.github.io/recipes/secapr/README.html) |
14 | 14 |
|
15 | | - *We are now teaching a **1-week intensive course** on target enrichment data, including practical exercises for all functionalities of the SECAPR pipeline. Don't miss it and sign up online at:* |
| 15 | + **Original Publication: [https://doi.org/10.7717/peerj.5175](https://doi.org/10.7717/peerj.5175)** |
16 | 16 |
|
17 | | - https://www.forbio.uio.no/events/courses/2020/target_capture.html |
| 17 | + <br/> |
| 18 | + <br/> |
18 | 19 |
|
19 | | - ___ |
| 20 | + ## **Documentation: [Click here](http://antonellilab.github.io/seqcap_processor/)** |
20 | 21 |
|
21 | | - ## Installation instructions and documentation [click here](http://htmlpreview.github.io/?https://github.com/AntonelliLab/seqcap_processor/blob/master/docs/documentation/main_doc.html) |
| 22 | + <br/> |
| 23 | + <br/> |
| 24 | + <br/> |
22 | 25 |
|
23 | | - [](http://bioconda.github.io/recipes/secapr/README.html) |
| 26 | + *** |
| 27 | + |
| 28 | + ## Installation & Setup |
| 29 | + SECAPR is available as a conda package on the bioconda channel. This makes installation of the SECAPR pipeline with all dependencies very simple. Follow the instructions on this page to get the SECAPR pipeline set up and ready to use. |
| 30 | + |
| 31 | + <br/> |
| 32 | + |
| 33 | + *** |
| 34 | + |
| 35 | + ### 1. Set up conda |
| 36 | + |
| 37 | + Conda is a software and environment manager, that makes installation of new software and of required dependencies very simple and straightforward. If you don't already have conda installed on your computer, you can rather quickly and easily install the light-weight version **Miniconda**. |
| 38 | + |
| 39 | + Download the **Python3 version** of [Miniconda](https://docs.conda.io/en/latest/miniconda.html) for your operating system. |
| 40 | + |
| 41 | + **Windows** users can download the `exe` file and install it by double clicking on the file and following the instructions. |
| 42 | + |
| 43 | + If you are a **Mac** user, choose the `pkg` conda download file for installing conda using a graphic user interface. Alternatively you can download the `bash` file (same for **Linux**). In that case you need to open a command line window, navigate to the folder containing the miniconda download, and install it with `sh Miniconda3-*.sh`. |
| 44 | + |
| 45 | + Once installed you can use conda via a bash command line terminal. **Mac** and **Linux** users can simply use the default Terminal software. **Windows** users are recommended to use the **Anaconda Powershell Prompt**, which is automatically installed with Miniconda, instead of the regular Command Prompt. |
| 46 | + |
| 47 | + From your bash-command line terminal add the following channels to your conda installation manager. These channels are the locations where conda finds all required software dependencies. |
| 48 | + |
| 49 | + ```bash |
| 50 | + conda config --add channels defaults; |
| 51 | + conda config --add channels conda-forge; |
| 52 | + conda config --add channels bioconda; |
| 53 | + ``` |
| 54 | + |
| 55 | + |
| 56 | + *** |
| 57 | + |
| 58 | + ### 2. Install the SECAPR environment |
| 59 | + |
| 60 | + Besides enabling easy installation, conda also functions as an extremely light-weight **virtual environment** manager. This means you create a virtual environment on your system and install software within that environment without interferring with your main system. For example if you download a conda package that has a specific R version as dependency, you can download it in a virtual environment and it will not affect your current R version that you are using on computer. You can easily connect and disconnect from these virtual environments you create with conda, see below. |
| 61 | + |
| 62 | + Since the SECAPR pipeline has many third-party dependencies, which may effect existing installations of those softwares on your computer, we strongly recommend to install SECAPR in a virtual conda environment. Here we will name this environment `secapr_env`, but you can give it any name you want. |
| 63 | + |
| 64 | + This is super easy with conda. All you have to do to install SECAPR and all its dependencies in a virtual environment is to run the following command: |
| 65 | + |
| 66 | + ```bash |
| 67 | + conda create -n secapr_env secapr |
| 68 | + ``` |
| 69 | + |
| 70 | + More information about installing SECAPR, including installation of previous versions and using docker containers, can be found by clicking on the badge below: |
| 71 | + |
| 72 | + [](http://bioconda.github.io/recipes/secapr/README.html) |
| 73 | + |
| 74 | + *** |
| 75 | + |
| 76 | + ### 3. Activate the environment |
| 77 | + To activate the newly created `secapr_env` environment, type: |
| 78 | + |
| 79 | + ```bash |
| 80 | + conda activate secapr_env |
| 81 | + ``` |
| 82 | + |
| 83 | + When the environment is activated, all the necessary software dependencies will be available in the standarad path, e.g. when you type `samtools` the samtools version required for SECAPR will be executed. After you are done using SECAPR, you can deactivate the environment to switch back to your standard environment with this command: |
| 84 | + |
| 85 | + ```bash |
| 86 | + conda deactivate |
| 87 | + ``` |
| 88 | + |
| 89 | + *** |
| 90 | + |
| 91 | + ### 4. Check active environment |
| 92 | + Check if you are connected to the correct environment (there should eb a star in front of secapr_env in the output of this command): |
| 93 | + |
| 94 | + ```bash |
| 95 | + conda info --envs |
| 96 | + ``` |
24 | 97 |
|
| 98 | + In case you are not connected remember to always connect to the `secapr_env` before using SECAPR, by typing: |
25 | 99 |
|
26 | | - This semi-automated pipeline aims to make the processing and analysis of sequence capture (= target enrichment) data simple and straight forward for all users. The detailed documentation and simple installation makes this pipeline accessible also for users with limited biofinformatic knowledge, while enabling user-defined processing options for the more experienced users. |
| 100 | + ```bash |
| 101 | + conda activate secapr_env |
| 102 | + ``` |
27 | 103 |
|
28 | | - We included an empirical data tutorial in the [pipeline documentation](http://htmlpreview.github.io/?https://github.com/AntonelliLab/seqcap_processor/blob/master/docs/documentation/main_doc.html), which covers the processing from raw Illumina read data into multiple seqeunce alignments (MSAs) for phylogenetic analyses, including the compiling of allele sequence MSAs. This workflow can be applied to any Illumina dataset, independently of the underlying bait set and organism group. |
| 104 | + To test if the SECAPR installation is working, run the basic help function: |
29 | 105 |
|
30 | | - Some functions in this pipeline are inspired by scripts from the [Phyluce pipeline](https://github.com/faircloth-lab/phyluce) by Braint Faircloth, which is aimed at the processing of Ultraconserved Elements (UCEs). To honour some of the ideas belonging to Brant Faircloth, and the generous sharing of all his code as open-source, we ask you to cite the Phyluce pipeline (Faircloth 2016) alongside with ours (Andermann et al. 2018), when using SECAPR. |
| 106 | + ```bash |
| 107 | + secapr -h |
| 108 | + ``` |
31 | 109 |
|
| 110 | + You should see an output with all available SECAPR functions. You can check run the help command for individual functions by typing `secapr name_of_function -h`, e.g.: |
32 | 111 |
|
33 | | - ## Workflow |
| 112 | + ```bash |
| 113 | + secapr clean_reads -h |
| 114 | + ``` |
34 | 115 |
|
| 116 | + *** |
35 | 117 |
|
36 | | - <img src="https://github.com/AntonelliLab/seqcap_processor/blob/master/images/secapr_workflow.png" width="600"> |
| 118 | + ### 5. Install SECAPR development version |
37 | 119 |
|
38 | | - **SECAPR analytical workflow.** The flowchart shows the basic SECAPR functions, which are separated into two separate steps (colored boxes). Blue box (1. reference library from raw data): in this step the raw reads are cleaned and assembled into contigs (de novo assembly); Orange box (2. reference based assembly with custom reference library): the contigs from the previous step are used for reference-based assembly, enabling allele phasing and additional quality control options, e.g. concerning read-coverage. Black boxes show SECAPR commands and white boxes represent the input and output data of the respective function. Boxes marked in grey represent multiple sequence alignments (MSAs) generated with SECAPR, which can be used for phylogenetic inference. |
| 120 | + The development version of SECAPR is stored on this GitHub page and contains the newest updates, which might not yet be available through the conda version. However you need to install the SECAPR environment with conda first by following the steps above. Once the environment is installed, you can update SECAPR to the development version by following these steps: |
39 | 121 |
|
| 122 | + 1. Connect to your SECAPR environment: |
40 | 123 |
|
| 124 | + ```bash |
| 125 | + conda activate secapr_env |
| 126 | + ``` |
41 | 127 |
|
42 | | - ## Please cite: |
| 128 | + 2. [Download](https://github.com/AntonelliLab/seqcap_processor/archive/master.zip) the latest SECAPR version from GitHub |
| 129 | + 3. Unzip the downloaded file |
| 130 | + 4. Move the unzipped directory to a safe location on your computer, i.e. not on your Desktop or Download folder. This will be the path where SECAPR will be executed from in the future. |
| 131 | + 5. Enter the unzipped SECAPR directory from your command line, replace `/path/to` with the path where you storred the folder: |
43 | 132 |
|
44 | | - **Andermann T.**, Cano Á., Zizka A., Bacon C., Antonelli A. 2018. SECAPR—a bioinformatics pipeline for the rapid and user-friendly processing of targeted enriched Illumina sequences, from raw reads to alignments. PeerJ 6:e5175 https://doi.org/10.7717/peerj.5175 |
| 133 | + ```bash |
| 134 | + cd /path/to/seqcap_processor-master |
| 135 | + ``` |
45 | 136 |
|
46 | | - Faircloth B.C., PHYLUCE is a software package for the analysis of conserved genomic loci, Bioinformatics, Volume 32, Issue 5, 1 March 2016, Pages 786–788, https://doi.org/10.1093/bioinformatics/btv646 |
| 137 | + 6. Install SECAPR from the folder |
47 | 138 |
|
| 139 | + ```bash |
| 140 | + python setup.py install |
| 141 | + ``` |
48 | 142 | Platform: UNKNOWN |
49 | 143 | Classifier: Development Status :: 4 - Beta |
50 | 144 | Classifier: Environment :: Console |
|
0 commit comments