How to download only viridiplantae miRNA from miRBase
|
|
There is no direct way to download the organism specific miRNA from miRBase database. So I extracted the miRNA of viridiplantae plant from miRBase using some unix command. Steps are as follows
- Download the information regarding organisms from HERE.
- Download the mature miRNA sequence from HERE
- Extract both files in same directory
- Download the fasta dereplicating python script from HERE
- Now run the bash script given from the same directory
#!/bin/bash
#script to extact plant mirna from mirbase database
# convert fasta to tab
awk 'BEGIN{RS=">"}{gsub("\n"," ",$0); print ">"$0}' mature.fa >mature.tab
#extract the organisms belong to Viridiplantae. You can extract the miRNA for other
# organism too by changing the word "Viridiplantae"
grep Viridiplantae organisms.txt >plants_mirbase.txt
# extract name of plants
awk '{ print $3 " " $4 }' plants_mirbase.txt >plant_name.txt
#extract mirna for plants
grep -f plant_name.txt mature.tab >plant_mirna.tab
#convert tab to fasta
awk '{print ""$1" "$2" "$3" "$4" "$5"\n"$6}' plant_mirna.tab > plant_mirna.rna
#convert RNA to DNA
sed '/^[^>]/ y/uU/tT/' plant_mirna.rna >plant_mirna.fasta
#dereplicate mirna file
python derep.py -i plant_mirna.fasta
#cleaning fasta header
cat derep_plant_mirna.fasta | awk -F ';' '{print $1}' >plant_mature_mirna_unique.fasta
rm mature.tab
rm plants_mirbase.txt
rm plant_mirna.tab
rm plant_mirna.rna
rm plant_name.txt
rm derep_plant_mirna.fasta
echo mature mirna from all plants are in plant_mirna.fasta!!!
echo unique mature mirna from all plants are in plant_mature_mirna_unique.fasta!!!
echo all job done!!!
Basically the above bash script extract the miRNA from plant deposited to miRBase database and save them to a file plant_mirna.fasta. In second part, it remove the duplicate miRNAs and save them in another file plant_mature_mirna_unique.fasta.
How to remove duplicate sequences from FASTA file HERE
|
Related Posts HOW TO
|
Was This Post Useful? Add This To Del.icio.us Share on Facebook StumbleUpon This Add to Technorati Share on Twitter |
Labels:
HOW TO
Subscribe to:
Post Comments (Atom)

i got an error when running the script in line 5 syntax error, how can i edit ?
ReplyDeleteThanks so much
ReplyDeleteCan you explain you error please?
ReplyDelete