Skip to content

Commit 44c6ad5

Browse files
author
Tobias Andermann
committed
updated readme
1 parent 068c2ad commit 44c6ad5

File tree

6 files changed

+132
-25
lines changed

6 files changed

+132
-25
lines changed

README.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -126,7 +126,12 @@ The development version of SECAPR is stored on this GitHub page and contains the
126126
cd /path/to/seqcap_processor-master
127127
```
128128

129-
6. Install SECAPR from the folder
129+
6. Remove the installed conda SECAPR version:
130+
```bash
131+
conda remove --force secapr
132+
```
133+
134+
7. Install SECAPR from the folder
130135

131136
```bash
132137
python setup.py install

build/lib/secapr/_version.py

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,11 @@
88

99
version_json = '''
1010
{
11-
"date": "2020-09-04T17:02:57+0200",
11+
"date": "2021-01-12T12:57:51+0100",
1212
"dirty": true,
1313
"error": null,
14-
"full-revisionid": "b3586ef34b66c56276cfadbb89582c4951054a87",
15-
"version": "1.1.12+48.gb3586ef.dirty"
14+
"full-revisionid": "068c2ad1fc8432b09a14ab20bb037323f8cdf555",
15+
"version": "2.0.2+27.g068c2ad.dirty"
1616
}
1717
''' # END VERSION_JSON
1818

build/lib/secapr/plot_sequence_yield.py

Lines changed: 11 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -157,11 +157,19 @@ def plot_contig_yield(contig_input_file,outdir,alignment_folder=False,read_cov_f
157157

158158
if not read_cov_file:
159159
if not alignment_folder:
160-
fig.savefig(os.path.join(outdir,'contig_yield_overview.png'),bbox_inches='tight', dpi = 500)
160+
filename = 'contig_yield_overview.png'
161161
else:
162-
fig.savefig(os.path.join(outdir,'contig_alignment_yield_overview.png'),bbox_inches='tight', dpi = 500)
162+
filename = 'contig_alignment_yield_overview.png'
163163
else:
164-
fig.savefig(os.path.join(outdir,'contig_alignment_read_cov_yield_overview.png'),bbox_inches='tight', dpi = 500)
164+
filename = 'contig_alignment_read_cov_yield_overview.png'
165+
166+
# try to print in high res, if too large, don't set quality requirement dpi
167+
try:
168+
fig.savefig(os.path.join(outdir,filename),bbox_inches='tight', dpi = 500)
169+
except ValueError:
170+
fig.savefig(os.path.join(outdir,filename),bbox_inches='tight')
171+
172+
165173

166174
def main(args):
167175
target_contig_folder = args.extracted_contigs
167 KB
Binary file not shown.
167 KB
Binary file not shown.

secapr.egg-info/PKG-INFO

Lines changed: 112 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -1,50 +1,144 @@
11
Metadata-Version: 1.1
22
Name: secapr
3-
Version: 1.1.12+48.gb3586ef.dirty
3+
Version: 2.0.2+27.g068c2ad.dirty
44
Summary: Process sequence-capture fastq files into alignments for phylogenetic analyses
55
Home-page: https://github.com/AntonelliLab/seqcap_processor
66
Author: Tobias Andermann
77
Author-email: [email protected]
88
License: MIT
9-
Description: # <img src="images/secapr_logo.png" width="100"> SEquence CApture PRocessor (SECAPR)
9+
Description: # SEquence CApture PRocessor (SECAPR)
1010

11-
**Original Publication: https://doi.org/10.7717/peerj.5175**
11+
![SECAPR](https://raw.githubusercontent.com/AntonelliLab/seqcap_processor/master/images/secapr_logo_small.png)
1212

13-
___
13+
[![downloads](https://anaconda.org/bioconda/secapr/badges/downloads.svg)](http://bioconda.github.io/recipes/secapr/README.html)
1414

15-
*We are now teaching a **1-week intensive course** on target enrichment data, including practical exercises for all functionalities of the SECAPR pipeline. Don't miss it and sign up online at:*
15+
**Original Publication: [https://doi.org/10.7717/peerj.5175](https://doi.org/10.7717/peerj.5175)**
1616

17-
https://www.forbio.uio.no/events/courses/2020/target_capture.html
17+
<br/>
18+
<br/>
1819

19-
___
20+
## **Documentation: [Click here](http://antonellilab.github.io/seqcap_processor/)**
2021

21-
## Installation instructions and documentation [click here](http://htmlpreview.github.io/?https://github.com/AntonelliLab/seqcap_processor/blob/master/docs/documentation/main_doc.html)
22+
<br/>
23+
<br/>
24+
<br/>
2225

23-
[![downloads](https://anaconda.org/bioconda/secapr/badges/downloads.svg)](http://bioconda.github.io/recipes/secapr/README.html)
26+
***
27+
28+
## Installation & Setup
29+
SECAPR is available as a conda package on the bioconda channel. This makes installation of the SECAPR pipeline with all dependencies very simple. Follow the instructions on this page to get the SECAPR pipeline set up and ready to use.
30+
31+
<br/>
32+
33+
***
34+
35+
### 1. Set up conda
36+
37+
Conda is a software and environment manager, that makes installation of new software and of required dependencies very simple and straightforward. If you don't already have conda installed on your computer, you can rather quickly and easily install the light-weight version **Miniconda**.
38+
39+
Download the **Python3 version** of [Miniconda](https://docs.conda.io/en/latest/miniconda.html) for your operating system.
40+
41+
**Windows** users can download the `exe` file and install it by double clicking on the file and following the instructions.
42+
43+
If you are a **Mac** user, choose the `pkg` conda download file for installing conda using a graphic user interface. Alternatively you can download the `bash` file (same for **Linux**). In that case you need to open a command line window, navigate to the folder containing the miniconda download, and install it with `sh Miniconda3-*.sh`.
44+
45+
Once installed you can use conda via a bash command line terminal. **Mac** and **Linux** users can simply use the default Terminal software. **Windows** users are recommended to use the **Anaconda Powershell Prompt**, which is automatically installed with Miniconda, instead of the regular Command Prompt.
46+
47+
From your bash-command line terminal add the following channels to your conda installation manager. These channels are the locations where conda finds all required software dependencies.
48+
49+
```bash
50+
conda config --add channels defaults;
51+
conda config --add channels conda-forge;
52+
conda config --add channels bioconda;
53+
```
54+
55+
56+
***
57+
58+
### 2. Install the SECAPR environment
59+
60+
Besides enabling easy installation, conda also functions as an extremely light-weight **virtual environment** manager. This means you create a virtual environment on your system and install software within that environment without interferring with your main system. For example if you download a conda package that has a specific R version as dependency, you can download it in a virtual environment and it will not affect your current R version that you are using on computer. You can easily connect and disconnect from these virtual environments you create with conda, see below.
61+
62+
Since the SECAPR pipeline has many third-party dependencies, which may effect existing installations of those softwares on your computer, we strongly recommend to install SECAPR in a virtual conda environment. Here we will name this environment `secapr_env`, but you can give it any name you want.
63+
64+
This is super easy with conda. All you have to do to install SECAPR and all its dependencies in a virtual environment is to run the following command:
65+
66+
```bash
67+
conda create -n secapr_env secapr
68+
```
69+
70+
More information about installing SECAPR, including installation of previous versions and using docker containers, can be found by clicking on the badge below:
71+
72+
[![install with bioconda](https://img.shields.io/badge/install%20with-bioconda-brightgreen.svg?style=flat-square)](http://bioconda.github.io/recipes/secapr/README.html)
73+
74+
***
75+
76+
### 3. Activate the environment
77+
To activate the newly created `secapr_env` environment, type:
78+
79+
```bash
80+
conda activate secapr_env
81+
```
82+
83+
When the environment is activated, all the necessary software dependencies will be available in the standarad path, e.g. when you type `samtools` the samtools version required for SECAPR will be executed. After you are done using SECAPR, you can deactivate the environment to switch back to your standard environment with this command:
84+
85+
```bash
86+
conda deactivate
87+
```
88+
89+
***
90+
91+
### 4. Check active environment
92+
Check if you are connected to the correct environment (there should eb a star in front of secapr_env in the output of this command):
93+
94+
```bash
95+
conda info --envs
96+
```
2497

98+
In case you are not connected remember to always connect to the `secapr_env` before using SECAPR, by typing:
2599

26-
This semi-automated pipeline aims to make the processing and analysis of sequence capture (= target enrichment) data simple and straight forward for all users. The detailed documentation and simple installation makes this pipeline accessible also for users with limited biofinformatic knowledge, while enabling user-defined processing options for the more experienced users.
100+
```bash
101+
conda activate secapr_env
102+
```
27103

28-
We included an empirical data tutorial in the [pipeline documentation](http://htmlpreview.github.io/?https://github.com/AntonelliLab/seqcap_processor/blob/master/docs/documentation/main_doc.html), which covers the processing from raw Illumina read data into multiple seqeunce alignments (MSAs) for phylogenetic analyses, including the compiling of allele sequence MSAs. This workflow can be applied to any Illumina dataset, independently of the underlying bait set and organism group.
104+
To test if the SECAPR installation is working, run the basic help function:
29105

30-
Some functions in this pipeline are inspired by scripts from the [Phyluce pipeline](https://github.com/faircloth-lab/phyluce) by Braint Faircloth, which is aimed at the processing of Ultraconserved Elements (UCEs). To honour some of the ideas belonging to Brant Faircloth, and the generous sharing of all his code as open-source, we ask you to cite the Phyluce pipeline (Faircloth 2016) alongside with ours (Andermann et al. 2018), when using SECAPR.
106+
```bash
107+
secapr -h
108+
```
31109

110+
You should see an output with all available SECAPR functions. You can check run the help command for individual functions by typing `secapr name_of_function -h`, e.g.:
32111

33-
## Workflow
112+
```bash
113+
secapr clean_reads -h
114+
```
34115

116+
***
35117

36-
<img src="https://github.com/AntonelliLab/seqcap_processor/blob/master/images/secapr_workflow.png" width="600">
118+
### 5. Install SECAPR development version
37119

38-
**SECAPR analytical workflow.** The flowchart shows the basic SECAPR functions, which are separated into two separate steps (colored boxes). Blue box (1. reference library from raw data): in this step the raw reads are cleaned and assembled into contigs (de novo assembly); Orange box (2. reference based assembly with custom reference library): the contigs from the previous step are used for reference-based assembly, enabling allele phasing and additional quality control options, e.g. concerning read-coverage. Black boxes show SECAPR commands and white boxes represent the input and output data of the respective function. Boxes marked in grey represent multiple sequence alignments (MSAs) generated with SECAPR, which can be used for phylogenetic inference.
120+
The development version of SECAPR is stored on this GitHub page and contains the newest updates, which might not yet be available through the conda version. However you need to install the SECAPR environment with conda first by following the steps above. Once the environment is installed, you can update SECAPR to the development version by following these steps:
39121

122+
1. Connect to your SECAPR environment:
40123

124+
```bash
125+
conda activate secapr_env
126+
```
41127

42-
## Please cite:
128+
2. [Download](https://github.com/AntonelliLab/seqcap_processor/archive/master.zip) the latest SECAPR version from GitHub
129+
3. Unzip the downloaded file
130+
4. Move the unzipped directory to a safe location on your computer, i.e. not on your Desktop or Download folder. This will be the path where SECAPR will be executed from in the future.
131+
5. Enter the unzipped SECAPR directory from your command line, replace `/path/to` with the path where you storred the folder:
43132

44-
**Andermann T.**, Cano Á., Zizka A., Bacon C., Antonelli A. 2018. SECAPR—a bioinformatics pipeline for the rapid and user-friendly processing of targeted enriched Illumina sequences, from raw reads to alignments. PeerJ 6:e5175 https://doi.org/10.7717/peerj.5175
133+
```bash
134+
cd /path/to/seqcap_processor-master
135+
```
45136

46-
Faircloth B.C., PHYLUCE is a software package for the analysis of conserved genomic loci, Bioinformatics, Volume 32, Issue 5, 1 March 2016, Pages 786–788, https://doi.org/10.1093/bioinformatics/btv646
137+
6. Install SECAPR from the folder
47138

139+
```bash
140+
python setup.py install
141+
```
48142
Platform: UNKNOWN
49143
Classifier: Development Status :: 4 - Beta
50144
Classifier: Environment :: Console

0 commit comments

Comments
 (0)