BLAST Database creation error

I was trying to create a BLAST database with makeblastdb, but I got this error:
Building a new DB, current time: 12/03/2015 09:44:18
New DB name:   plant_protein
New DB title:  /home/sanjay/bin/Genomes/plant_protein_from_plantgdb.fa
Sequence type: Protein
Keep Linkouts: T
Keep MBits: T
Maximum file size: 1000000000B

volume: plant_protein

file: plant_protein.pin
file: plant_protein.phr
file: plant_protein.psq

BLAST Database creation error: FASTA-Reader: No residues given
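I then checked whether any sequence in my FASTA file was empty. This command counts the empty lines in the file; a header followed by an empty line instead of residues is one way an empty record shows up: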
grep -c "^$" ~/bin/Genomes/plant_protein_from_plantgdb.fa
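A count of empty lines does not say which record is affected, and it misses a header that is followed directly by the next header. As a rough sketch, the shell function below prints the header of every record that has no residues. The name list_empty_fasta is made up for this example and is not part of BLAST.

```shell
# list_empty_fasta is a hypothetical helper name, not a BLAST tool.
# Usage: list_empty_fasta FILE
# Prints the header line of every FASTA record that has no residue lines.
list_empty_fasta() {
    awk '
        /^>/ {
            # A new header starts: report the previous record if it was empty.
            if (header != "" && !has_seq) print header
            header = $0
            has_seq = 0
            next
        }
        # Any line with a non-blank character counts as sequence data.
        /[^ \t\r]/ { has_seq = 1 }
        END {
            # The last record has no following header, so check it here.
            if (header != "" && !has_seq) print header
        }
    ' "$1"
}
```

Called as list_empty_fasta ~/bin/Genomes/plant_protein_from_plantgdb.fa, it should print one header line per empty record and nothing when every record has residues.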
I found one record that had only a FASTA header and no sequence. To remove the empty record, I ran this command, which writes a cleaned copy to a new .fasta file:
awk 'BEGIN {RS = ">" ; FS = "\n" ; ORS = ""} $2 {print ">"$0}' ~/bin/Genomes/plant_protein_from_plantgdb.fa >~/bin/Genomes/plant_protein_from_plantgdb.fasta
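For readers less familiar with awk, the one-liner above reads as follows: RS = ">" turns each FASTA record into one awk record, and FS = "\n" turns each line into a field, so $1 is the header and $2 is the first sequence line. A record is printed only when $2 is non-empty. The sketch below wraps the same filter in a commented function; the name drop_empty_fasta is hypothetical. It assumes that no header contains a ">" character in its description and that no sequence starts with a blank line, since either would confuse the filter.

```shell
# drop_empty_fasta is a hypothetical wrapper around the awk filter shown above.
# Usage: drop_empty_fasta FILE > CLEANED_FILE
drop_empty_fasta() {
    awk '
        BEGIN {
            RS  = ">"    # each FASTA record becomes one awk record
            FS  = "\n"   # each line of the record becomes one field
            ORS = ""     # records already carry their own newlines
        }
        # $1 is the header text, $2 the first sequence line.
        # Keep the record only if that first sequence line is non-empty.
        $2 { print ">" $0 }
    ' "$1"
}
```

Running grep -c "^>" on the original and the cleaned file afterwards gives the record count of each, so the difference shows how many records were dropped.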
After rerunning the database build on the cleaned file, I finally got the success message:
Building a new DB, current time: 12/03/2015 09:48:01
New DB name:   plant_protein
New DB title:  /home/sanjay/bin/Genomes/plant_protein_from_plantgdb.fasta
Sequence type: Protein
Keep Linkouts: T
Keep MBits: T
Maximum file size: 1000000000B
Adding sequences from FASTA; added 980219 sequences in 24.2583 seconds.

