Open
Description
Hi!
I was trying to use the updated version of PERF and its new annotation feature on one of my bacterial strains. However, I am getting the following error:
PERF -i ../raw/Tenacibaculum_discolor_gca_003664185.fa --format fasta -a -g ../raw/Tenacibaculum_discolor_gca_003664185.ASM366418v1.49.gff3 --anno-format GFF --gene-key ID
ERROR:
Processing Ga0183463_112: 100%|█████████████████████████████████████████████████████████| 12/12 [00:04<00:00, 2.97it/s]
GeneKeyError:
The attribute "gene_id" is not among the attributes for gene. Please select a different one.
The available ones are [Parent, Name, constitutive, ensembl_end_phase, ensembl_phase, exon_id, rank]
My GFF file contains the following attributes in the last column, but changing the gene key to ID or any other attribute doesn't work:
ID=gene:C8N27_0080;biotype=protein_coding;description=cyclophilin family peptidyl-prolyl cis-trans isomerase;gene_id=C8N27_0080;logic_name=ena
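For reference, here is a minimal sketch of how the attribute column of that line splits into key/value pairs. This is not PERF's code: `parse_gff3_attributes` is a name made up for illustration, and it assumes the plain GFF3 convention of `key=value` pairs joined by `;`. It shows that both `ID` and `gene_id` are present on this gene feature. The keys listed in the error message (Parent, exon_id, rank, ...) look like the attributes of exon rows in an Ensembl-style GFF3, though that is only a guess about where they come from.

```python
def parse_gff3_attributes(column):
    """Split a GFF3 attribute column (key=value pairs joined by ';') into a dict.

    Hypothetical helper for illustration only; not part of PERF.
    """
    attrs = {}
    for field in column.strip().split(";"):
        if not field:
            continue  # tolerate a trailing ';'
        key, sep, value = field.partition("=")
        if not sep:
            continue  # skip fields that have no '=' instead of failing
        attrs[key.strip()] = value.strip()
    return attrs


# Attribute column copied from the gene line quoted above.
line = (
    "ID=gene:C8N27_0080;biotype=protein_coding;"
    "description=cyclophilin family peptidyl-prolyl cis-trans isomerase;"
    "gene_id=C8N27_0080;logic_name=ena"
)
attrs = parse_gff3_attributes(line)
print(sorted(attrs))
print(attrs["ID"], attrs["gene_id"])
```

Running this on the gene line yields the keys ID, biotype, description, gene_id and logic_name, so either `ID` or `gene_id` should be a usable gene key for this row.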
When I use a GTF file, the error is:
Using length cutoff of 12
Processing Ga0183463_112: 100%|█████████████████████████████████████████████████████████| 14/14 [00:03<00:00, 3.66it/s]
Traceback (most recent call last):
  File "/home/rohit/miniconda3/bin/PERF", line 8, in <module>
    sys.exit(main())
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/core.py", line 162, in main
    ssr_native(args, length_cutoff=args.min_length)
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/core.py", line 106, in ssr_native
    fasta_ssrs(args, repeats_info)
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/rep_utils.py", line 253, in fasta_ssrs
    annotate(args)
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/annotation.py", line 160, in annotate
    gffObject = process_annofile(anno_file, annotype, gene_id)
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/annotation.py", line 112, in process_annofile
    attr_obj = process_attrs(attribute, annotype)
  File "/home/rohit/miniconda3/lib/python3.8/site-packages/PERF/annotation.py", line 66, in process_attrs
    attr_obj[attrName] = attr[1].strip()
IndexError: list index out of range
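The last frame (`attr[1].strip()`) suggests that some attribute field was split into fewer than two parts. The sketch below is not PERF's actual parser. It only reproduces that kind of failure and shows one tolerant way to read a GTF attribute column. The function name `parse_gtf_attributes` and the sample string (including the valueless `orphan_flag` field) are hypothetical, and it assumes the usual GTF layout of `key "value";` pairs.

```python
def parse_gtf_attributes(column):
    """Parse a GTF attribute column of the form: key "value"; key "value";

    Hypothetical helper for illustration only; not part of PERF.
    """
    attrs = {}
    for field in column.strip().split(";"):
        field = field.strip()
        if not field:
            continue  # empty piece after the final ';'
        parts = field.split(None, 1)  # key, then everything after it
        if len(parts) < 2:
            continue  # key without a value: skip rather than index parts[1]
        key, value = parts
        attrs[key] = value.strip().strip('"')
    return attrs


# Made-up sample: the last field has a key but no value.
sample = 'gene_id "C8N27_0080"; gene_biotype "protein_coding"; orphan_flag;'

# Indexing the second element of a one-element split fails the same way
# as the traceback above.
try:
    "orphan_flag".split(" ")[1]
except IndexError as exc:
    naive_error = str(exc)

print(naive_error)
print(parse_gtf_attributes(sample))
```

Whether the real file contains a valueless or otherwise malformed field like this is not known from the traceback alone. Printing the offending attribute string before the split would confirm it.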
I am not sure what is used in the background to process GFF/GTF files, but I would strongly recommend integrating PERF with AGAT, which is an excellent tool for processing and handling GTF/GFF files.