Pacuta HI 2022
Pacuta 2022 BLAST
These data came from the Pacuta 2022 experiment in Hawaii, done by Federica and myself. In this experiment, larval and spat Pocillopora acuta were subjected to a combination of high pH and temperature treatments. The github for that project is here.
A biomineralization gene list was created by FS (primarily from Stylophora pistillata). To see if any of the biomin genes are in this dataset, the biomin gene list was BLASTed against the Pacuta protein sequences. This was done by Zoe Dellaert in this code. Much of what is in this script comes from her code linked above.
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data
mkdir blast
cd blast
I copied the sequences in fasta format to Andromeda into /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
.
In the scripts folder: nano biomin_blast.sh
#!/bin/bash
#SBATCH --job-name="biomin_blast"
#SBATCH --nodes=1 --ntasks-per-node=20
#SBATCH -t 100:00:00
#SBATCH --export=NONE
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --mem=100GB
#SBATCH --error="blast_out_error"
#SBATCH --output="blast_out"
#SBATCH --account=putnamlab
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.9.0-iimpi-2019b
echo "Blasting Pacuta protein seqs against biomin genes" $(date)
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
makeblastdb -in /data/putnamlab/jillashey/genome/Pacuta/V2/Pocillopora_acuta_HIv2.genes.pep.faa -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -dbtype prot
blastp -query /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineraliztion_Toolkit_FScucchia_ZDrefmt.fasta -db /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineralization_blast_results.txt -outfmt 0
blastp -query /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineraliztion_Toolkit_FScucchia_ZDrefmt.fasta -db /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineralization_blast_results_tab.txt -outfmt 6 -max_target_seqs 1
echo "Blast complete" $(date)
Submitted batch job 305440. Ran in ~45 mins. I copied the output files to this folder on github.
The result of the BLAST was a list of Pacuta proteins that correspondeed with specific biomineralization genes. In this script, I found that 158 out of 172 of the biomineralization genes are represented in the filtered gene counts. This corresponds to 84 unique Pacuta genes. I also found that 7 out of 172 of the biomineralization genes are represented in the DEGs. This corresponds to 5 unique Pacuta genes:
- Pocillopora_acuta_HIv2___RNAseq.g25214.t1
- Pocillopora_acuta_HIv2___RNAseq.g11609.t1
- Pocillopora_acuta_HIv2___TS.g23498.t1
- Pocillopora_acuta_HIv2___RNAseq.g7668.t1
- Pocillopora_acuta_HIv2___RNAseq.g30830.t1
These are the 7 biomineralization genes:
- PFX13778.1
- P33_g8985
- Gene:g38128
- JT016638.1
- P18_g810
- XP_022780303.1
- XP_022805470.1
In the script above, I then investigated where these 5 unique genes fell in the treatment comparisons.
In the high v control treatment comparison, these Pacuta biomineralization genes were differentially expressed:
- Pocillopora_acuta_HIv2___TS.g23498.t1 - α-Collagen, coadhesion, clone g810 alpha collagen-like protein gene
- Pocillopora_acuta_HIv2___RNAseq.g7668.t1 - uncharacterized protein
- Pocillopora_acuta_HIv2___RNAseq.g30830.t1 - uncharacterized protein
Corals in high treatment are downregulating a collagen related gene compared to control corals (LFC = -3.39).
In the high v mid treatment comparison, these Pacuta biomineralization genes were differentially expressed:
- Pocillopora_acuta_HIv2___RNAseq.g25214.t1 - Sacsin
- Pocillopora_acuta_HIv2___RNAseq.g11609.t1 - Flagellar associated protein
- Pocillopora_acuta_HIv2___TS.g23498.t1 - α-Collagen, coadhesion, clone g810 alpha collagen-like protein gene
- Pocillopora_acuta_HIv2___RNAseq.g7668.t1 - uncharacterized protein
Similar to the high v control comparison, corals in high treatment are downregulating a collagen related gene (Pocillopora_acuta_HIv2__TS.g23498.t1) compared to mid corals (LFC = -2.99). Corals in high treatment are also downregulating sacsin (LFC = -1.56) and flagellar-associated protein (LFC = -0.83) compared to mid corals. High v control and high v mid also had differential expression in Pocillopora_acuta_HIv2__RNAseq.g7668.t1, an uncharacterized protein.
There were no DEGs between mid and control treatments related to biomineralization. This is not surprising, as there were only 4 total DEGs between mid and control treatments.
Sequence information
To make it easier for us to reference the specific biomineralization sequences, I will add the Pacuta and Spistallata sequences in this md file. Since there are only 7 sequences, it’s not too difficult.
In the high v control comparison:
- Gene:g38128 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- JT016638.1 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- P18g810 corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
- XP_022780303.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g7668.t1
- XP_022805470.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g30830.t1
In the high v mid comparison:
- PFX13778.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g25214.t1
- P33g8985 corresponds to Pocillopora_acuta_HIv2__RNAseq.g11609.t1
- Gene:g38128 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- JT016638.1 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- P18g810 corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
- XP_022780303.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g7668.t1
Shared between high v control and high v mid:
- Gene:g38128, which corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- JT016638.1, which corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
- P18g810, which corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
Sequences for 7 Stylophora pistillata biomineralization genes
Fasta file for these sequences is here
>PFX13778.1
MASSEASAGASSLQEQKEQRQIDVMFLCDEWRSLKGGLSTFNREFAINLAEARIGNMKIHCYVSKTDPLLCLMSPPSELPNPHLVIGHGRKLGSPAFCLVQNTKCKWIQFVHVCCEDLGKYKKTVAATDTIEENEKKHKMEIECCKAADAVVAVGSRLQQKYSRSLPQVIVEIITPGIFERFSCESQSAMHRSVKNFNVFLFGRATFEDLSLKGYDIVANAIGSLSKNFELTFVGSSPGKHQKVEKWFLDNTRIDRNQLTIRGYCSNPEELKMMFYQSDLVALPSRTEGFGLVALEAISAGVPVLVSGESGVAEALKEVEEEREANAMMLRENYRKVYSWRKECERFRRIIENVMKDGELNINVDVEDVKPKEPNRQITTLATKSKGSQEDQYQRASATLPTEDSVSHSGKEDLQELKDRVLCSIAMNYLRTTPPQSIEEHNKFMEYLEKMKVLIRGFSLGSLVITVKCESLQILEELWTDYSSGHLGKVVQNCFATEKILKELNLAELKLKTTMDIEEYNARKVYFEKRMNKGDDLSFQSHFISSEDERHLEERKRKLEVGELEKRKDDLRSQSPFISSEDQPHLEKRKQKIEVGKSEKTKVVKGRSYGQRRPPLTHILKNILERYPDGGQILKEIIQNADDAGATKVSFLLDSRQGFYGENSLVESTLAQFQGPALYAQNNALFVESDWENLQRLMDSSKKDDPLKVGRFGIGFNSVYHITALKKEAPVILLFLKNIEEIALFETDVRDVQRHVFTVRLSDGCRQEVGEKKQKFLSDLRRLSDGQIENVDLSLDLHVVEVTEGERVVENKWLVYHQVDARNPTLKKLSLELGLLPWVGCATSVNAANLLAISSSAGRIFCFLPLPPDADSKTGLPLHVHGYFGLSDNRRGLKWPGPDCQNDPTAEWNVSLVQHVASKAYANVLLHLQESCDSSVGADFVYKSWPNIQEVEKHWRCILEPMFSILLKENVLWTTANHGQWKNVSNAYLDRIKSQFQNTTNETREVVLETLTQTNEAVVIVPTHVMIAIDNYTPAPPKSITPTFLRALLKKKGEGILEDVPKKKKLLLLEFALADKNLNDMHGVPLLPLANGGFVNFHSLQYNREPGAAVYVSSTATPRSIFHNMDNKFLDDSVQTPAITYLSKLASDANSPHTIQPVQLVALNQTIAVKVLREMLPSEWSGENHVVPWYPGKNGHPPEHWLESVWMWIQRIFPTDLSLLENLPLIPHTCAGNQSLVKLSSSSVVIGRHHHQSISLPPHIVSLLRKIGCILLENLPSYIYHNTLHKYVTTPEPRGVLKIFSTLGQSRCASAIPFCSPNEKRALRSFLSSVDFNVDEKTLLYNLPIFDAADGNSFIAVRNGFQVHEVLPYGFEVPQPLPIPRASSVIALRDSDSHTLIQGLEISPMTKTTFLKNIVFSGIQSNFYSHQQIPALIGWILSQYSIFCREDSSFHVSLQQLAFVSTRTNKLVTPCCVLDPQQPILEQLFENENDKFPNGDFVKDEILLPLRQLGMRSIPNKEDILHVAKTIESVHSDVGFRKALALLEFLDKNPPDKSLGRALMNERWVPRKQSPTSSYPSAMPWFLGTKQFYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWNKSPPVHHLLNQLRFACSVPLNDVNGSALCHFQAMVKQIYEEVSTNGSFIFAVSQDNSFPEWIWHGNGFTSPSKIAFTSSCRIDLKPYLYLVPQEFQHLNLFFQQCGVRDTFQDCDFLHVLTMIKDKHDTSSEYLEDVAVDQKLSHEILHWIVRDGKKLNPELRESLLVPIHTWDSTLKLVPCSECTFCDADWLRQGGSELLIGSKFTMIHEAISSKTAALLGVPPISTRVTCAEALGIEQTGPYEPITTRLKNILNEYKEGDGVFKELVQNADDAGATEVQFVLDWRNHPHQRLLSPGMVECQGPALLAYNNTTFTDDDLKNISRLAGATKREDLEKIGRFGLGFSSVYHFTDVPSIITRSYAVFFDPQTTHLGPHIHDASKPGIKIDLGVHENSLICFPDQFAPYNGLFGCDTATPSDNDTFYFEGTLFRFAFRTKRGDISDKIYTKQEIRSLIFSFRESSPILLLFSQNIKKVSFLELDENSSNSKDPRLLFEISRKPQTGCMPVKNSVTESTFLKSCAAWTRQSTSWSNPVDRFPSPKLTELISVCSTICHMNGQRSQETQSWLVTSCLGTENSFKLATSEEGKKQGLLSASGVAAKMFTKGDGMEKVQAVLGEVFCFLPLSIPTGLPVHTNGYFAVTSNRRSIWEGTTAEVGCQPLEVRWNQSLMEDALTQAYVQLLENMTVMQTQGKIPMYHAFTLWPNPDKLHSSAWEPLIKSFYQRVASDVDLPLVCTGGKWLPVTQCIYQDLKLWELPNSEVILENLDYKIVQLPDFARKGFQQAGCMEIIIQRTMTQEKFLQEVFFPKIAMISKELRDAIVCYLIDECLRGHSSNQSSQLLNLYKSLLSNNRCIPCGPDAGNLAFPKELISPKGTAATLFLADDRRFPVGSCYQTKERLLMLENLGMLSHILDWETLIERANSVSVLCRIAQQDAKERSASLIKYINVHLEEMDHPSELKREELMAISMFPTLAKPANYVMQWRGTDDGNSVMLPAKEMYEERYTFVAGSSRPILDESDSCGCSKLSKKTRDLFGFSSRKPSTQEVLNQLEHTVQAIVHAPHAIAGLEQAFHCLYGYIQEIIAEPDGERIIETLQEKEWVLVRGKCLSASRLAFTWKRFGAPYLNEIPQNLASRYRSLFEAAGVKEHFSTEDVISALYRLSEEKKGEPLSKSEFTVSKSLIEEISGASENSLKKERGKIPLPDHNRCLKPAEQLAINDAPWVTARSGIEYVHRDLSIDLAHRFGAIDIRTKKLARISRPIGREFGQREDLTDRLKGILKSYPCDVGVLKELVQNADDAGATEVHFIFDPRYHNTDQLLCNNWKELQGPALCVYNDRPFSEKDLQGIQRLGIGSKTDDPTKTGQYGIGFNAVYHLTDCPSFISNGDTLCILDPHCRYAPGADKENPGRLIEPISEEERSDFRDMFPCYLEDMFDLKSSTMFRFPLRLQSTCTESLISEQRISCTQMSTFMNHLAVEAKEIILFLNHVKKISLSEIKDDQLKEIHSVSVQITKEDDAERLKLANHVKNCKFLITNEIQWFGITYPLFVQEARLRQEEWLIHQCIGIQKRDGEEIPNGRDYGLLPRGGIAAKVSEKSKFSSYEVGGPTHKAFCFLPLPVPTGLPFHVNGHFFLDSARRNLWSDEKGEGFASQWNHFIKCKVLAQAYISLMLVARGYLPGSKNGDTTSFSKGFKVHEGMRWYHNLFPHYENVQSQWRDLAKAVFNKICYDDAKLLPLIKTPGKNSPTTQSTEAAKSIQPPNDAQATKGDFDCKEAISCLWVSPSQGYFNTLSLSDEWARDLSNVLINIGFKLIYSSKKIFGNFKTAGANVREITPEEVINFLGGNPNSVGILPCPVGETTVGSVANVLLLLRYCMKATTFPKQIFGIPLLLTEDDVLRQFQRDNQVFLSLFADLIPNQRARFVQHALATPLFRFQKEITYGDQGVLKKFDICALASLLPSTVKGGWCETNSLVPWDLENGPSKQWLKLLWEFLFKTYEKEPDTFSLTPLHKWPIIPTKLKELAPISKSKVIFDLTTSDSWSCGQKTVVALLRKLRCPEVDVDLLCNDGRWDLSPILKQHLSYPNSSQDILKVLDHMIGERGRIFELKENLTEAKSCFDSAIFINQNHVASIHHLITTLYHKLGNLVMAEKVLREGINIDLTAFEACDKSKRRSFPSCHRRCSNDKMAERPRKTVKGRSYGQRRPPLTHILKSILERYPDGGQILKEIIQNADDAGATKVSFLLDSREGFYGENSLVEPTLAQFQGSALYAQNNALFQESDWENLQRLMDSSKKDDPLKVGRFGIGFNSVYHITDLPSIVSGDSVVFLDPHETHFGRGETGQRFSLEDELLEIHEDQFKPYENVLDCKISTQFYNGTLFRFPLRSAPSDLSKKVYSKEKVRKLFQALKEEAPVILLFLKNIEEIALFETDERGVQNHVFTVRLSDSCREEVREKKIKFLSDLRRLTNGQIENVNLSLDLHVVEVTEGERVAENRWLVYHQVDAQNSTLKKLSSELGLLPWVGCATPVNAAKLQALSSSTGRIFCFLPLPPDADSRTGLPVHVHGYFGLTDNRRGLKWPGLDCQDDPTAEWNVSLVQHVASKAYANALLCLRELCDSSDGADFVYKSWPNIQDVEKHWQCMLKPMFSILLTKNVLWTRANGGQWKTLSDSYLDKVKSQFQNVTNETRCVVLETLTQANEAVVIVPSHVMIAIDKYTPVPTKSVTPAFLRALLKKKNKGVWNISNVPRNKKLLLLEFALEDKNLSDMHGVPLLPLADGSFIDFRSLQYNREPAAAVYVSSTSSPRSIFHNMDNKFLVDNVQAPAITYLSKIALDVSNPHTTQPVQLVKLNQTIAVRVLREMLPSEWSGGNHSAPWHPGKNGHPPEQWLESVWMWIQRMFPADLSLLENLPLIPHTCAGNRSIVKLSSSSVIIRRYHHQSVSLPSLIVSLLGKIGCVVLENLPSYIHHNNLHRYVATPDPHGVLKIFCTLGQSRCTSTISLCSPDEKRALRSFLSSAYFNGDEKSLICNLPVFDAVDGNSFIAVRNGFEFHEVSPHGFEVPRPLSIPRASSVIALKDTVSQTLIQRLGISPMTKTTFLRNIVFGGIQNNFYSRQQLSTLMHWVLSQYPLLCMEDSSFHAALQQLPFVITRSNKVVTPCCVLDPQQPVLEHLFENENDKFPHGDFVKDEILLRLRQLGMRSRPNTEDILHVAKTVDCVHSDVGSRKASALLEFLDQNPPDKSLGQALMNERWVPRKQSRPPSYPQAMPWFSETNHFYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWNKSPPVHCLLKQLRSACSVRLNDMNGSALYHFQAMVKQIYEEASTSASFIFSVSQDNSFPEWIWHGTGFSSPSKIAFASCCKIDLKPYLYIVPQEFRHLNFFFQQCGVRNTFQDSDLLHVLTMIKDKHDTGSEYQGDVAVDRKLSHEILHWMVREGEKLDPELRESVLVPVQTRDNTFKLVPCSECTFCDADWLRKGGSELLIESKFTMIHEAISSKTADLLGVPPISTRVTCAEALGIEQTGPYEPITTRLKNILNEYKEGVGVFKELVQNADDAGATEVQFVLDWRAHPHQRLFSQGMVECQGPALLAYNNATFTDDDLKNISRLAGATKREDLEKIGRFGLGFSSVYHFTDVPSFITRSYAVFFDPQRTHLGHHIHDASKPGIKIDLAVNENSFICFPDQFAPFYGLFGCDTAPPSDNDKFYFEGTLFRFAFRTKRGEISDKIYTRQEIRSLMFSFRESSPILLLFSQNVKKVSFLEVDENATDSKDLRLLFEISRKPQTDFTPVKKSVTEGTFLKSCAVWTRQSTSQSNPVDIYTSPKLTELISVCSNICRMNGQRSQETQSWLVISCLGTGNSFQLATSEEGKKQGLLSASGVAAKICTQGDGLQKVEAVPGEVFCFLPLSIPTGLPVHANGYFAVTSNRRGIWESTTADVGRQPLEVQWNRSLMEDALTQAYVQLLQSMTVIQTEGKILSYDVFALWPNPDKLQSSAWKPLIKSFYRRIASDVELPLVCAGGNWLPVTQCIYQDFKLRELPKSEMILEKFDYKIVQLPDFARKGFEQAGCMEVINQRTMTPEMFLRDVFFPNIKTISKELRDPVVCHLIDECLRGHASKRSFPHLNLYESLLSTNRCIPCGPETRDLSFPKDLISPKGAAATLFSAEDKRFPVGSCYQTKERLLVLQNLGMISDILDWEILIERANSVSVLCRRAKQDAKKRSALLIKYINEHIEKMAHPSELNREELKAISMFPSLAKPANYVMPWRGSGDCNSGMLPAEEMYDERYKYVAGTSRPILDESESGCNKLSKKTRHLFGFSSRKPSTQEVLNQLEYTVQATIQSPHAIESLEQIFHCIYEYLQDLVLEPDGKRITHALQEKKWILVQGNCLSASRLAFVWKRCGEPYLNELPQNLASKYRSLFKAMGVKEYFSTEEVISALYKLDEEKQGERLSTREFKVSKSLLEEISEASEEFFGTERGRIPLPDHNLILQPAEKLAINDAPWVAPRSGIDYVHKDLSIDLAHKLGAIDIRTKKLSRISRPIGREFGQREELTDRLKGILKAYPCDVGVLKELVQNADDAGATEVHFIVDPRNHPTDQLLSANWKELQGPALCVYNNRPFSEDDLEGIQRLGIGSKTDDPTKTGQYGIGFNAVYHLTDCPSFITNGDKLCILDPHCRYAPEATKGNPGRLIGPIGAEERSDFRDVFPCYLENLFDLESATMFRFPLRRQTTSSISQKQVSCTEMMKFMNLLAYEAKEIILFLNHVKTITLSEIKENQLKKIYSVSAQLTQHDEAQRVRLANHIKISKTLETNQIEWLGITYPLLIQEHGLRQEKWLIHQCIGLQTSTSEEVPNGARFGLLPRGGIAAKVSEKSEKSKFHFNTKSEPRHKVFCFLPLPVTTGLPVHVNGHFYLDSARRNLWRDEKEEGFQYIPMATLQSLQTAEAYAQLLENMIVMQTLGKLPLYGVFTLWPNPEKLQSSAWEPLIKSFYRRIASDVDLPLVSTGGKWLPVTQCIYQDLKLQELPKSEMVLQKFDYKIVQLPDFARKGFQQAGCMEVINQRTMTQEMFLRNVFFPNITRISKELRDPIVCYLIDECLRGHASKRSNPHLKLYESLLSTNRCIPCGPETGDLTFPKDLISPNGAAATLFSADDKRFPVGSCYQTKERLLVLQNLGMISDILDWETLIERANSVSVLCRRAKQDAKKRSALLIKYINGHIEKMAHPSEFNREELMAISMFPSLAKPAKYVIPWRGSGDYKSVMLPAEEMYDERYKYVAGTSRPILDESESGCSKLSEKTRHLFGFSSRKPSSQEVLNQLEHTVQATVQSPHAIESLEEIFHCIYEYLQELVLEPGGKCITHALKEKKWILVQGNCLSASRLAFAWKRCGEPYLNEVPQNLASKYRSLFKATGVKEYFSTEDVISALYKLHEEKQGERLSTKEFTVSKSLIEEISEASEESFETEKGRIPLPDHNLILQTAEKLAINDAPWVAPRSGIDYVHKDLSIDLAHRLGAIDIRTKKLSRISRPIGHEFGQREELTDRLKRILKAYPCDVGVLKELVQNADDARATEIHFIVDPRNHPTDQLLSDNWKELQGPALCVYNNRPFSEDDLEGIQRLGIGSKTDDPTKTATKENPGRLIGPIGAEERSDFRDVFPCYLENLFDLRSATMFRFPLRRQSTSSISQKQVSCTEMMKFMNLLAYEAKEIILFLNHVKTITLSEIKENQLKKIYSVSAQLTQHDEAQRVRLANHIKISKTLETNHIEWLGITYPLLIQEHGLRQEKWLIHQCIGLQTSTSEEVPNGARFGLLPRGGIAAKVSEKSEKSKFHFNTKSEPRHKVFCFLPLPVTTGLPVHVNGHFYLDSARRNLWRDEKEEGIGSYWKQFIKTKLLSQAYISLMLVARGHLPGSKEEDVACFLRDHNLHEGMRWYHNLFPHFKHVESQWKDLATAVFKMICSEDASLLPLTKKTSDKTMEASQVPQAAAVVQVKESDRREVIRCFWLPPSQGFFNNLAPGSESQIELWKILLRIGFKLLYSSATLYRDFKEAGTNVREITPEFVIQFLKENPSSIGNLPCPVEETTLGTVKGVLLLLSHCMKAKKFSSEMFEMPLLLTEDNVLRIFETDSQVFLSLFADLVPTQCSQFIQHTLAIALLHFEEEIFSSGQSVLKRFDVSALASLLPRTANAGWCETDSARRNLWRDEKEEGFGSQWNHFIKTKVLSQAYISLMLAARSHLPGSKEEDGASFPRKHNLHEGMRWYHNLFPNFKSVESQWKVLAEAVFKMICSEDANLLPLTKKTSDKRVEASQLPQAAAGVQVNESDPVIRCFWLPPSQGFFNNLTLGSESQIEQWKILLRIGFKLLYSSATLYKDFKEAGTNVREITPEFVIQFLKENPSSIGNLPCPVEETTLGSVRGVLLLLSYCMEATKFPREMFGLPLLLTEDNVLRRFKTDSQVFLSLFADLVPTQCFQFIQHTLATALLNFEEEIFASDQSVLKKFDISSLASLLPSTANADWRETSDLIPWNMNEQPSKIWMQRLWEFLHKTQQKTPKAFSLDPLHNWPILPTKSGKLAPVFKGKVILDLTPSGSWSPGQEHVATLLKKLKCPEVNVDLISGDGRWNVSDILKSRVSYPNSSQDVLKVLDHLMKEGDISNILFDDEKICMLQFFQDDLTTVKQDRTSTSIVKRLPFFKTFHGAFVSLGNVKSVYEIPVGLPTDESDVWMTGNNCVFLAPEPRLSRMYKDLLGVGDKSHTDCYINFIFPKFPLLQNETRMLHLEYVRRFLLAPYCNEEQQARVLKSLSTLAFIPDANGALQTAYYFHDPGVKVFSVMLPREAKPPEQFNTTKWLELLRKIGLKQKISKAQFQTFANEVASQAVQISNSSYPSLEKKSKTLVEHLLRDDTLHDAKFLADLSPIKFLACANASDNLSLLHRQHLVPSRDQNPPFNQFKHSIPHAHEALAWTTATLLFEWAIPNPKVPLLTNLQVLQKPSLEQVIGHVKNLSQTLSRRADREQPEPKRRRLSQIMTEIYKFLTDTSGCNGTDCNELCTAVCNKICNQLSDIPCILVEDGRVFVRSNQLAFHLDEEQPPYLYKVPREYGIFEHLLKRLGAMEKATPSQFAQVLTRLRESCEDKQMHANELTVVKQAVFGLFTTLHAILCRNEDRRERNPLAEVNTLYLPNSKQELRPSTDLVLFDCTRFKRRLSDSMFEFLDDVTNYNLTMEKPGKLVALLPDHLKTKSLSSLVREELQAECRGKRCQADMQRKCEATDRIRHILYSPDLVNGILRILKFQYDKTKLTEEVRSKVHSFQKSLSISCMETLSTELVDNRTNTVIPNSQMRTHSGCFVGQDNGKKHIFIQHGAKSSDTRRKICHEIYSLTGCFLEEENILHLAAILECTSPANISIVLDNAGVSDDAEATKTPSLEPALGSEVPEEFHELLDQYSDFYFRPGEFVAYEREDSTEEELKYIYAKIVRRVKTSTSTKVKKDRTKRKQKEESNLLSRYLIDIGPERKEVDVLDLYKFRRPRKSEEEEADEEESLSKSMEVVPYAGASGQSTGQAGAESAGPSRGASEPPKPRTLEHALKEVKKTLAEIWKLPEDKRKKAIRRLYLRWHPDKNMDMQDIANEVTKFIQNEVDRLSKGKSSSRDEGGARPPPADFSEFFTRCNERARRQRASYYNFRRHNPRFTGFRSHSRRTYTAPDPRVAKMWIRQSKEDLRSVKHLLSSRDPLYYLVCFQCHQIAEKSLKAALYALAGVADRQLNSHDLVLLAHDLSLLPGAPDVTAQVARLSNYFEGTRYPNKHEPAKVPAEVFQDSQEAQEAFRLATEVLEELERENRGVARTFQRRGGGDHSVSRREYSPDFQVDKHAVFYRMWRKRHDIVISFSPPDYRSSSVDYNEEKVFLYQFEVNRFHKTNTNKITS
>P33_g8985
MTRLGDTFIRQLHEKGEDFTNLWAFSEFLESVDKSGVPDTKTIPVGLKKMKELGIRNAIIEMDLVYAGINYKKFKVEAINELLTERLRWVHANLAKDSKVVVNFRDLPDGMIKRPKRVFKVIRHLSSLPLDIRPFGIVTEESGKYFPEQLAAWISAVRREMDDCGFKDGHLLVHVHEKWGLVDSTQLSCLANGANGIWASMIIQGASMGAASSTVTLMNLVRLGNKKVLKKYNCTALREAAQEICRATTGQEPYPLQPIYGERALDMVFGMPTKLGINEFDLAKFFGEKPLMRMTTLASAEMIITRLKNIFGEDPQFTIERGTRMKEVMLEDLHKNIKEEYMSAAGLAQLFDRSGGQLTGKMADVLAQDEPRKAHAQVLIAEVRAMWDEWDLRDGKRDDQLEFDAFYNGFLAPYFGCYRCDETKQALKAMDMDEDGTVDWNEFAVYLKWAMRQYPETKTAEDLLSVAFRKGLIPAMQDE
>Gene:g38128 Annotated: α-Collagen Blast E.value:0, MS/MS SeqCoverage 42%
MAGGLSGIHGESAALHVEVENDQEHEPVLIHHHLVTDQDVLAALWRLNNAKHNRVQWMANFLRGLNGNNAARRVEEELKPGPGGVIVLLLPLVGRPVLEILSRLEIVAPMSAQLMVYGQTGKVGRSVAGHAVAENNQEYVSVTAHGRPMVGNVVLATIQRQGSAKPRPVQLMADGLNGPQGHAMFNKANAYTTLAATPTPTATAVPTPTIDPNIPKIDLVFAISATSVSSSRSYELMKNTIKRFIDRYGVNSIHYSIIVYGDQVVRVINFNRTFPPSANELKTAIDNQLALSGGPVLINALQEAYRVFKESVGRPGAKRVLVVIADENSGSSPSFLSRAVRPLEDLGVLVISVGVGDRISRSELNIISPNLLDVISARLNINPSLLAVRIMERILRLNFPDVDVGFAISAASANSDEIFSLMKQIINTIIDRYGVSKVRFSFIIYGSRVTTRFTFDNAPITQEELIKAVNGTKKVTGDPDLEKALEEAEKLFTKSSRPNATKVFVVLTDFVGAGDDNSLIANAVRLRKSGVLILSVGFGQQVNAIGNQMTKVVITQSDYIRVPDFTTQRPVVIAETIMFKALQANIPEIDLTFVISATSNSADRTFTLMKSTINSIIEKYGIVRIHYTVIVFGSDFTRSFDFSTSVPNKETLTRLVTQLQRESGTPDLVRALEEVKKVYELREVRPNAKKVVVVILDQKSVNTEVQLKTAVTDLVERNILVIGVGVGRSVDRNQLIYITEENRDIIEVEPTERPEEVAREIMLIILRSEIYSIHS
>JT016638.1
QGNYYSYGGTTPGTPIGCTNLITLSNVKFFASSSSDGPDIPVLNSTDYWCSEFNWKNQSLTVDLGFVTFFDRLLVQGEPFTSRSVSEYFVLTSIDGINYTYILGTNGQSMKFVGPLFNGDQTRDTNLTAPVQARYVQFNPQEPMIAEDDSICMRVGVESCQLVPAAVNGAWSHWSPYGPCTHACLGTAKRTRTCADPAPVFGGSPCEGVNEEEKICNDCVGTVNGGWSPWGLWSRCSTTCNPGQRSRQRTCTNPSPKNGGTDCSGPSTQSEPCQVQFCPVDGGWSAWSGLSRCTRACGGGRQYQSRTCSNPFPGHGGRDCVGVRSLSFTCNTQCCPVHGGWSPWGSFSSCTRTCGGGQKSRTRVCNSPAPSCNGITCPGGNQDIQPCNQQTCPTSPSTSFPINGNYSNWGQWTACSVTCGQGTRERTRLCDNPAPAQGGSQCQGPSSELVGCTEIPCPVNGNWSSWGDWSNCSSGCGPGKSYRYRDCDNPAPANNGLNCTGPDQESKDCNSTACPVDGGWSAWSSTPCSATCGQGTLKRTRECNNPKPQYGGASCFGNETEQEVACNKGPCPTSPPTISPPTTGSPADSNIPELDLVFAVSATSSNRLATYNSMRDTINRFITTYGSNKVHYSIIVYGKAVQRVISFNHTFPPSVGELQEAISRHAPISGPTVLKNALQETQTIFQEIPSRPNAKKVLVVFTDSNSPSDGNLVQAVRPLENNKILVVSVGVGDVNRTELLTISPNPLDVLSVQPTAGPGALSKRIMDRILRRDIPLIDIGFALSATSSDFQDIFVKMKNVIRTIVERYGVERVKFSLIVYGQNVTTVLGDFNRNLTQADLVNYVNNLQRVPQNKNLDSALLEAESLFRQRARPNSKKVFVVLTDGVSTLSNANSLLINTAELRKSDVLILSVGFGSQTNQVGNQMNSVVFAPRDYIAVPNYPAERDVVIAETIMFKALEVNLPLIDLTFALSSSSILSQETFKLMKETVQSLVHTYGIDRIHYGVIVFGSVATRSFDFATNFPDQNELIRKVSQLTRSGGSPDLVAALKEARKVFQLKEVRPYARKVLVVMIDDESSANKNDLNEEVRALRNRSVLVIGVGIGTQTLPKDLGIITDDKRNTLKAGINKNRDELAREIISIILRPSGLSKWSSWSACSKTCRYLGKAGTQIRTRDCKIPELGCDGMRIDTVECNKMDCEGCGQRGPLNESAYTASSNSESPAFLAALNTSDPTAWCLINNENGGYVQLDLGELTRVYKVATKGEQQGDRWVTSYYLTLSEDGETFFDYKAAQRLSGNTDSTSVAFNVVNTTRPYRYVRFHPVNFKGEPCMQAAVFGCNEEKILPPPETIADQADAAKGILIVLWILAGILTFLLLMACCYYCCWHVCCGRGKKRKGLVYRERSIEDDGYLINDEKRWTLGSAPMTPVPRVREDEIQEVTIEMKEDNEQPLGVIQFGIETDETKEKHVTAEDVKSEKPKYSEEASSGTIKSGSTMMRMKANDGSDRRKRTKSEGDAIDAVDGDLDWSYLSDEQGTAFTNEAFVKSQEQFLEPPGSASFRGNKVDMRRSLSADELATLDYDLFEDRQGPLHTATLGRDGYMRMHKANQGSLPPSDGGREMGTVDVAIGGIRVPNSPKDDPIYDTAGQEIHLAVEQAGRSVYPLEDGGYRGEEWYSRWG
>P18_g810
LMEAGVIGSHGQAALRRAEAVHRRASALATIHHHNMAASSVPEAIQIVNLAALKVAQLMEDGLIGHNGLPVTKRVVVATPIGEGNVQTQYHRTVVKHAQVTKMNIGVAILRDAQSVAVGVRGVSGQAAVRLAMDNEQGTAHALILHHPTMEPLAPGQEYNLKRAMWGSVQMAAGQPGVNGRPVQNRVVEEHKGGQDLAPTLHPHMAERSVWVAKRSPSSAKNKHVQWMAGGLSGIHGESAALHVEVENDQEHEPVLIHHHLVTDQDVLAALWRLNNAKHNRVQWMANFLRGLNGNNAARRVEEELKPGPGGVIVLLLPLVGRPVLEILSRLEIVAPMSAQLMVYGQTGKVGRSVAGHAVAENNQEYVSVTAHGRPMVGNVVLATIQRQGSAKPRPVQLMADGLNGPQGHAMLYVAMENATEQEPNAYTTLAATPTPTATAVPTPTIDPNIPKIDLVFAISATSVSSSRSYELMKNTIKRFIDRYGVNSIHYSIIVYGDQVVRVINFNRTFPPSANELKTAIDNQLALSGGPVLINALQEAYRVFKESVGRPGAKRVLVVIADENSGSSPSFLSRAVRPLEDLGVLVISVGVGDRISRSELNIISPNLLDVISARLNINPSLLAVRIMERILRLNFPDVDVGFAISAASANSDEIFSLMKQIINTIIDRYGVSKVRFSFIIYGSRVTTRFTFDNAPITQEELIKAVNGTKKVTGDPDLEKALEEAEKLFTKSSRPNATKVFVVLTDFVGAGDDNSLIANAVRLRKSGVLILSVGFGQQVNAIGNQMTKVVITQSDYIRVPDFTTQRPVVIAETIMFKALQGKVHVYECAAYNHA
>XP_022780303.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111321626 [Stylophora pistillata]
MTVTSSCSIRSPLCHNPCYSCRDYSIANLFDQTFSTSSCAGYFLSGCRYTKLTVDLKSLLFINHIRFYPYCSGSRPNFRVTTNDGVRDLPVTSNYVQMSNGCIQGSYVDMPVRRKATHIYVELNHPTSNTYVGLSELEVFIDGKEVWDRYKYRSFDGAKTWIEPETGADVYSFNEVHIRGSAQLALMPLNGLQGPAHFHADRLYGDKSGFLHVGYNQNFSVAVTDPDIPFGLRVYENGSIMLPRRAFLQTVSFKSSGKIWGVQDLFVFDHGTFYGDSNSSLGRDTVPGEYSVQSLHVQDRGVFELHSTDKELASRLSLTNLTIFGGGHFKSNNKMHIAVKHLVRINSGGRLSHNHAGYITKEGRSGDEFESSEGPGQGIGSVHGASGAGFAGTGGRGTGMGLVGQFYGDYRRPDDYGSAGGFGLHYGNLFYGDYRSSSVGPYKNLITRSGLGGQGGGAIKIVTRHLILDGHLSADGEDGPPPSTAGGGSGGSIWIDCEELDGYGTISANGGAGSPSKGGGGSGGRIAIYQVFMLNFNGTLSAKGGNSAVEPGASGTVFLETRNNSKVEYRVLKINNFGLAYPWAVDKSQGRLRNLMKGIYSETKYVGAVTWLHEADKYTFDEFHLHGNSHVALYGNGSLENVTLYAHTLRGDRSGVFHIGRFQSVVFDFVDLYFPINTLVYFNATLEVPRRLSLREVYMEINGTLADSDDYTIDRDGKLFLWSGGQSLGEKQGHFRFINMSIKSLGLLHTTKLKGHGPVSLHTTRFVVNAGGLASVDDFYLDSVNATIDVAGDVSADFRGYGAEQGPGAGVRRTPYNTGAGGGSHGGRGGRGTAGLQTSFSYGSIYEPTQFGSGGGNGLHGTVGGRGGGRIVFEISRMLRLEGRVHADGEPGNKNSDPSGGGAGGSIVIHAFKFDGEGTISVNGGSYPFTYAPYGGGGAGGRIAVYYNGSYTFIGSLQSYGGTSQAEWGGAGTVYIQNNQNTSQPYSILRIDNRAILSGPSRLNEIEELDLAGNNSKVCDTDNSRMSNLFKSTSNSYYTQNSNPVVTYSFPLPLFLENLLIYPQCNSHYLTRFYVREYFNNIEVIGSNEWINPTNCLQGQPLRMNVRQAVEKVEISLQKTSWYASLSLVRFFVRENPSTTLQTPHYTSPSTSWIVIEDEKTTEHDFSELQIMGRGSLSISGNTFNMRVDKIVGDNSGILTMRPTQSILMRGLEGHLPFSLLSQKGSSVAFPTSMNCREVEVTIRGVMGEMQNLTVGPQCRFVLENSSETEFMLEHMVVQTDGYMAALREDREDVKMVGKTFDIRGGAKMEANSLTLDYVNITIEPFADLSSNGLVDEVPGSRYHGDGWGGSSGSSGAGHGGHGGVGGGQSKVGVSYGKYKRPTTFGSTGGAQVFPFTGGLGGGRFKIIAHDTLVVDGVLSSKGGDARASRSGGGSGGSILAYASRIHGDGEFDVSGGDGDSSTGYRGGGGGAGRICLYYRENHFLGRFLGFGGTSSNEAGGPGTVFLENVPGMNATYGHDRIDEAAHAERILPDENVVNGTQWVRNRTLYANAMGRKPRSPDANLSSSYHDFSVGGSSRVWLILDDEDLEANGTDVELDELQLYGGAQLAIINPVNTKAFISIVIGQMEGDRTGRIHLGFNQTFLSLQSYLPMDMIIYQGGLTTMQGELLVAGVTVEIDGVLRRCQNITVVDDGVIRMKEMYDLEGKPTETFYFEAINVRKKGTMMVTNQERVREFRGTSMQVFGGGTFTAVNLQVHVVNFTIDALAKVHATLEGEKFDYEGKGPGAKYPSPGFAGGSGGGHGGLGGRSTSQVTTGAAYGSIKEPTEFGSSGGQGSSGAHGGRGGGIVFLNISDTLDIEGTLSVDGANNGGSEAGGGSAGSILIRTVLLEGSGTIQANGGNGYTGSSRGGGGGSGGRIALYYQGGFFDGVLEAAGGKGDLENGAAGTVYVEKAVNRSETPHRTLKVDNKGRSPLNERINEIEEVKLNPGGLKDNHAYSNEYTSISGLKFKSDGSTLPLFSGSGRYHSLSLMFDGHLKNAYIVQSSFVNVVIEATLPRLMFIHQMRLYPYCHRHYRVSFTLTTDHPVIGWIDRTKGSRSFADCFDTSVYNDVIIADTISKFTVTLTRLESYVALSELKVFVGRDTSSFQTSTLEQDSSRTWFVFNHQGTTTFEVDELDVRGSAHLGIQNGAGNVEFVVKEYKGDFTGTVHVGAAQNWYLNASNNSAIPFTLRTYQGSQVFLPREVYLQKSSIYAESELEGLQNLFISQGSRFDVTQGVQVNSPLKATILLDRMTILNGGTFYQRTAEPTKLAINLTGELIINAGASMEVSQVHLQAHNIFIDIDGVLTASGRGYSSMKGDEPGRKSDVAASGAGHGGAGGSSSSQEYVGRAYGSFQIPLEFGSGGGQGYQDLPGSSGGGAVKLSASHIVQVDGLLDVSAKAAVNPGTGGGSGGSVLIQATLFLGKGKIFADGGEATNSSLGDAGGGAGGRISAHYEITRFTGSFSAHGGASKSEAGGPGTVYLSENSTHTTMIIDNNGYRASKLYISDYRDRSNDGGRAWLLAGYMNEFTLNLLQLRGGSHFAVYHLKPSFTLNVERLEGDLGGLIHASKKNRVYIKNSPRLFPSSFHIYNEGFLHLPRDVLLKDLFYPKISLEGLISGMDNLTLGGGVEFVVTSKGQTEGYGKKTIHLNSLTIMNGAKVTATDTVLDSPTVTLLLNESLRVMAGGLIQGKWIDISAGDIEVEASGVITAEKQGHDAKNGPGSPSGSAGASHGGRGGLGYTGSFPGSSYGSLYRPLSQGSGSMFAVGGGVIKLATSQSITIDGSVDANGEDGHNKSTGGGSGGSIWILSKVFKGNGVIRASGGSGLFLESGGGGGGRISIAFDNRTFSGKINVLGEQLIVDNNNIGSPLTDDITDVSNDGGRTWLTPEPNTIEMSFDEVDIRGKSELAVLTTPADSPFRWNIGGIRGDRSGILHVRANQEMQMTISDKEGKQPELLWGVNVYPRGDLKLPRNLFVDGIKMIVAGSLRGAQNVSVGNNGRLILRQLIAPNNQISRNLTFDVIEIQGGGRIEIQSDKDGLSIKCTALWIRSGGVLIADRLSIVAESVTIEQSGIIDLNYKAVVAGSGPGVGYSHYSGSSGAGHGGRGGRGQSQNRTGGFYGDFTSPKLFGSSGGGGSGDDIATGGGVFYLHAQQIVHDGEITVNGKDALNNSDYGGGSGGSVFIEVEYFDGSGVIEANGGAGGINGGGGGSGGRIAVYYNQTFFTGKISAYGGASTVENGAAGTIYKRIKYNGKSILEVYNEGKKPWKNEIADYSDLTSDSARTWLTMSHIIDSSVPVTVPDVNLGTTVYKGLTITEVKLGGSAHLAIEPDVTKIRLHTFERFYGMFEGNSFGFVHVGPNQLLVMPDTDYYIPVNLKVYPSGYIKLPDRVMLHKNSLSLDAGYLIGVEDLAISQCAVSFGAGSGALSTGYLQAMFFKIQTLTIMSQGVLDMVAPNSDYSLRVDSLVINSGGALNGRKLDIVAKSVTVDESAKINLDGQGEKCIEPDVYYAGSGGSHAGYGGLGIGSKRQDPFDSVFLPVAFGSAGYAGRSSFSCMGGSGGGSLNLTVDGTLQIDGEISSRGQNAKDSESGGGAGGSILIKVTTLEGTGKFQVQGGDGGVTSGGGGSGGIVTLYYKSSSHPFSYKVQGGGGKKIGASGFLYSKKDLQRRRRQISGSDSVLILEGRDVLSFESPSVLVCDPKLIDFTFEEVKLLKSSTLTMISCSQGSPMTLITTTIKGDKTGWLVVKPNHDVYIGVTSIVKPSMELEFNAEVEDGGTLSVPGKFFISGDTQINLSGSLIGVSDVIVNDNGKLVLKYPGHTGFRITPTQGISVVQISTVRIKDGGSISTTSPNKVKIKSVVLQVDFGGNLGSGISVSSSQRIEFTNGPSLNRQGCPHGYEVVEVASKTLYNPCGVGKHIFNKRNESYLVFKNVSVAISHNETIYVIKNETRYNVTYYIACDYDDFKLLPGQSCNLAPGSYRYNSLEIQGSATMHIEPGTGKGNASTLSVSKLTIFSKGELIARESNLINTNSTSSDYGGSYGGLGGGASKNSALYGNISFPVDYGSNGGGSSQNRGLGGGVVILKVGELSNDGLIDANGGDGSLGAGGGSGGSIQVVTDYLKGSGKFRARGGNAAYPAGGGGGGRIGITVLKGRSEFRGLYDATGGDGRRPGSSGTVFVSDQRQGTSYETIIFWNKVIGYPPAQLPNTSATYTYDEIRLENRGTFLATNQLVVAKSFVTDGTGKLTVSGGARVDILSFPKSSRIFSCDLEVQAGGSIYFYNKPIFLGPGSPTVVIAGILDARGPSLGKGKALNVTSSGEIRADNLRLLKDSVMQVDADASIRKSLAYAHFHLTSLRLDTNAQLTFAQGNVSFRSDSIHLSQGATITSAADTKLLNITGNDIFIDNQARVTADRGGVLGGPGKARGSGSGCGHGGRGGGSQGGESYGSVFKPEHYGSGDNVRGGGIIFLNIKGGFTLYGSMSANGASDSSGGASGGSILVHAETLSGHGEILSNGGEGLSNSAGGSGGRIALYITDKTSFRGALTAYGGCGTTCAAAGTIFVREYVVGLPQNSTIIDNGERKTEANTIIMHEMKISYTMRLLKIVNGARLEVATLPNVGMKIAIQNLVGDGSGSFHVHYNQTLTLGAGKAVSSRPFMFPWAMIVDEGATLNLDPVLFITRTAITPSLYLAGKLTGGEKVTVGQDASVVIAKSGIIGTHSNTPGKYSLLSLKVSSGGRITIEVDEGGKAPVELKSLSVDVAFGGVITGRYLRVDASLLNIAFSGTLQANGLGNPAGVGPGAGSSSLLTGGGYGGCGGGSTNETCVVYGSLFEATEFGSGGGTTQVPDGTYGSGGGIIAVVAKVLIVNGIISSNGQGGNSITTGGGSGGSLEISVSETFSGRGKIEAEGGYVPGQVTGAGGGGRISILITGDNKFSGSFSARGGNSSAKSGSPGTVYTEDGKTVLRYRKLFLDNRGISSNSPLPIFLNQSVVASYNFQEIRLNGQVMLHVDKDMEVGKLVTDPDSVIYVKDNVTFTVEPNSRYLQPDCSFVVDANGEIRIPDEVMFLGRNNVFRGTLTGILDMVIGENRKVFLSASARTARFIDGKYTFITHRGGYRFSSLRIKNGAFFSFENAQLKKVPLTLGRLEVNFGATMQGSWLDIKASDVIIHSGATVDLSAQGHESDKGPGAGGLHQSDGTGAGQGGYGGVSTGNFGTWYGSALNPNNTGSGGGSSSNGKGGRGGGNLRLTVVRVLTLEGRISVNGESGTVLNSGGGSGGSIWISADNIRGNGIINAEGGDGKGAGGGGSGGRVALYLQELMSFEGLLNAKGGSGKDAGAAGTLYLQDNNKRILRKRLWIDNLKVADNKPQTVLYEADKVNFLFDELRLNGMSRFEIYNLQRKLQTIQVTNFISDGVGEIAISKNQTLLAEVIEAKESHLTLTTNIYVEEGANLVVASNLTVDGATLTLDGKLSNVRHLVVESGSAVKFGITSQTTLMENKNFVFQSDPGTQQFASVTLKSGSDFGAPLNLKLSVGKLDMKSGVVLQGKFVDIKSQSLLIGRGAILTTNNIIDIELNPGGRGHSSGSGGSGGGHGSTGGTGYNSLVGGIPYGTIYEPNQPGSPGGHGSSSESAGKGGGVILIDTDVLENDGSITANGGVASQSSQAGGGSGGSVYIITSSVFSGTGTVSANGGRGNGAGGCGAGGRVAIHLQSQYAYRGTLEALGGISSRSGASGGAGTVYIKDVRYKLFFEQLHVDNQGQSWQNYVTLNESKTSYHFHELLLVRKASLRMTPNSNLNQSSTLSIGKLFGDRSGLLHLYNGQKAIIEVVEAQLTTTKTPVNFRIDSGAEAVMATTVYIVGDGAVALQCSGTLNGIRNFYVTQKRVVLLEKGSRTHRDDEQPGTLKFSNVKLFSGSSVTMKDEIVMKIFAGFLNIKFHASLEAHYFEIVTSNLDVETGGLLSVAGDNKARLAVEPSEVSSPPQGAGAGHASNGGSGYGGASGGLYHGSLYKPKESGRRGGRGTNNHIGGRGGGYVNIEAGTLIINDGTITVEGGSAVSGGGAGGGSGGSLLFNTESFIGYGEMNTNGGNGGGTNAGGGSGGRIAIYATENLYRGTYLAFGGSSVSGTYGGPGTVFLQDIRAKRPFKQLRIDNLMRSIEDPVTIDEANLTDHDFSQVHLFGRAAVNMAVRQERTTLKMSRLFGDRTGLLHSRANQTFYLEASATEHSVSKPAVNLKIDEHAEMVFGASLYVIGDGAKGTGQITGDSSFTIDGRMIDVTHLFITKRLKSRFLSHAHSADYQNETLTVSAVGTFVLATFEIQDGSEVFFPDVQGVQCEVGLLHMKYGSVIVADTYRIGVTSLLLETGSRITASGKDRPSSYDSSVLPSSCRGSGGSYGSKGGKGQSGVNELHSHGSIFTPSHYGSAGCPGSQNGGKGGGLIIMDIGDELYLDGTIANEGQDAARGSAGGGGSGGSIWIKCGRFNGHGVITSNGGAGDGLTSGGGSGGRVAIDTPTESKYLGEYTAMGGNSGDPSKETTLYSGGPGTVFLKDARNQYAHTQLRLDNRGRTWDHYVTLNESLKSYTFDELYLVQKASIHLVPDGKPLNLTVHKVEGDRTGLIHVHENQTLKAEFVDSVYTITRTAANFKLDKGANAIMATSVHVVGQGEVAFEWNGRLIDVQHFHVAYGRTVKIGFYAHTAGTKAGKYRFIDGYGTFRFSTLEFGSGTLIHYPPPMGVHFIVSLLDIKFSSYFKAEFFKIEATDVYLEPNATLNCAGRGFENKTDGSGKDSALGGSGAGHGTPGGDGQDVNGGEEVGSVYEPVLPGARGGTRTGSATGSRGGGRVRVVVGFAFRLDGIINVDGDNAAAYSGSGAGSGGSVWITTGYLRGHGVISARGGVGNTHNLATGGSGSGGRIAVHVKIKDEYRGGLYALGGVSSGTQHGGSGTVYIEEMQGDKLFKRLYIDNQNADPPKVFTLDEVNPKTVKANATEENDAEFGFDELMLQRGVVFRIADLKLSKRPAISVTTVLGDGSSTLHVMENQTFFIEYQEYTRRRSFPPVNFKVDYSGELMLVSDFHVAGKNNPAFELDGRITGVSNLSLTENRVLRAGENISSALLKDKVYIETPIDGQLKFGVFIMEASSELHFAKRMKFVVSTLYMRQKAVISANKIHMALNEVHMEGSSRITTSGKGPKAGEGLGPGSTFSNVGSGGGHGGQGGPGSTVDGGSGYGSYVYPVHPGSGGGGNGGGAGGCTTEITVGYSLHLDGIIESEGANGRSNSGGGSGGSILIKTVLFSGHGLVIANGGSGDGNGGGGAGGRIAAHVAWLREYAGQYTAFGGTGFKAGAAGTVYYTDTNQGLSHRPVLINEANHTVFGEGFTKLTVDNFNRNPDIPTMIINENSSYYEVDELEMRNHGLLHVHGSNSSFVVHNFTGDRTGLVHLRQGQKMFVQVVESKSGYSVAPVSYKIDQGAEIVFPSSLTLLGTRCSFDGLVIGVHRLIVAEGANVVFASTTQTGIKEDRKFRFLTTPGNITFAEVYVQKGSKLEFSRINNTLVFTAIIFRLKYHGLVNINHGEIDSSWAWVESEGKLVLDFTGHPAEMGSGKGNTVNLIGSGAGHGGIGGVSKAGQLGGESYGSIYKAVHLGSGGGNGQGKGGFGGGMLHWRIGQEIELDGLVTLRGGDGSGTSAGGGSGGSILIETTNFTGYGEINVMGGDGSGPSGSGGSGGRISAHVRFRHKYAGVFKAYGGDGKTYAAAGTVYVEETARGPQYADLKYEKSTNRTYITATHRYMEVDNEDRKTEVSSIMMESEHLFYELDELFLTRHANLQVRHPPGSPNVTVIVHRFLGDGTGRFHVRVNQTIYVEVVESETNETTAPCSYKIDQGAEVVFPAIVNIYGTRSIIEGRITGVEHLIIASGGFVEFSSTAQTARVENRRYVEIDENGNFSFATVTVERNSRITFSRILNYTLSLRCSEFKIKYEGLMTMNHGYIYSAFAWIESEGILSLDGTGFGPEQGFGHGTTKNNFGSGAGHGGEGGKTEHGEGGSPYDSVYTPRMYGSGGGNGRGIGGSGGGSLFWIVGQRLQINGLLSSRGTNGEGIDTGGGSGGSILITTTNMTGHGEIAVPGGSGTGSGSAGSGGRVGIHCRWRYKYGGKFTDNGGQEGKYGGPAGTIYKEENFRPLQYRHLKYMKETNTTMLAVDHTYVHIDNDGYDVPGATLLMEENTTYYEFDEMELTGYSRLLVYHPGNVTVTAVVHKFIGDKSGQFHIRRDQRIFVEYVESKTNKTEAPCSYRIDVGGQIILPSEFSMHGTRSVFEGMIIGVRDLLVSFGAEADFYSTSQTALIENGDYIAISKPGNISFAIVIVKKGGDVEFRKNTGLLRINVDELKIKYQGKVSMNHGEVFSTFAWLDSQGNFNLDEGGNTAEKGHGAGSTLSSIGLGAGHGGRGARSGGQSYGSVYRPLVLGSGGGNGGGTGGIGGGQLLWEVGKRLELNGFISARGGTGNGGHAGGGSGGSILIKTTNMTGHGEIAVTGGDAVNQGGGGSGGRVGIHCRFSYTFGGKFTDRGGFGTQSQYGAPAGTVYKQENLRPLEYRILKYSKETNTTFLAVDHTYLHVDNEGHDVPEATVLMEEGTTNYEFDEVELTGYSRLIVYHPNETDVTVIAHRFIGDKTGKFHLRVNQTIYVELVESETNRTEAPCSYRIDKGAEIVLPAEFHVHGVRSELYGLMTGVHFLFLEDGATLKIASSAQTALTENRTYIDITQPGNSSFAHIIIKQGGLLDLVRVEDVVVSVTSSVFEVLHKGTVKVNHGIFYSAFADVETKGVVVLDGAGYKAATGPGAGLSDSSNSGSGGGFGGQGGRSHSYNNGGGAYGSVYKPLSYGSGGGHGKWSGGGAGGGSLWWQVGKRIHLDGRLSSKGQSGSNSGGGGSGGSVLIETTNMTGHGEINVNGGDAQSNAGGGAGGRIGIHVDFRNNFGGKFRSAGGNVSGYNDNAGAAGTVYKYESRRGPQYRDLKYNPDTNLTTFKPEHSKVKVDNENNNVATPTVIMENQTVFYEFDEMQVEGHSTVIFYHPETARNVTVIAHEVTGDKTGIIKLVSRQRLFVFVVQSTHTYMDAPCGFHVEDYAEIVFPTEVILRGESSTIRGRITGVERLVVERNGLVEFGGTAHSAQLPEESQWLADNPFDPFTPGLIIVPQLIISNAGLAKVKMSPIRVVLNIADTTVKKGGQLILSTNDVTINADFVTVESGGLIDSSGAGYTAASGPGAGSGSTGGSHASPGGRAALGTQHGSVYWPDEPGSGGGYGAGGGRFYINTGGYVIVEGTIRANGVGSSSRSSGGGSGGAIIVKRPSHEGAGSGGRIAVYLTEHFIFRGTLTALGGNSGTKYHGSPGTVYIDVDVGEEPYRMIQVDNNNRDNLLPVTLAEANTVLYEFERIHLVRKGALAFKEVSGKLVKIVIRKATGDKTGILQALQNTRIYVECHSVRTEAPVNYEAHSGGQIVFPIQTTLLGTRAPALTVNGEIHGIEKLRLSSNVGTLVTEKGFSACLDCHSNYTTDYIGHYWFKKLQVDLGGTFQVQSSVQTISSLAVRLHTGEIALDYTGSLKADAAKLLTEYFRLELDAVTDASGSGWSSQKGPGSSTACSGVAGAGHGGRGGTGYFSGCSSCTAGGGNKYESVSQAIQAGSGGGGTSASGGGVVFVSVEKLLELDGSIKSDGANGDGGGGGASGGTLWVAGRHFEGHGHLTLKGGAGSSRSACCSGNPCSSYRKYNGGGGGGGHLRHFSPDYIRRDIIRKRDVSGGAAGGGSAGNGGSGQISAAGNECSGHGTFSAQEGNCTCDAGSYGVSCLYQCDPSITCLGHGRCSASGGCDCDPGYVGYRCEHKCDAKRDCHGNGRCSVTGKCVCDPCYTGDDCRYECSGNGTCIGGKCKCDPCYIGTHCHSLCSGHGTCTNGTCYCGSEWKGDYCEVPKCPNDCSGNGICNSALLTCFCNPGWRGFDCSELDCPGEPDCYNRGTCSAINGTVMCVNCSVGWMGPACNDPCVNGVQEPMDSGFCKCDPCWAGKGCDSLCMGLGTCSDNEICNCDPLQGWRGDVCQIPGCPGDGKDCTGNGDCNSATHECTCYPGWAGLGCNIPDCPGAPNCNNRGYCNASVTPPQCQNCSRGWMGPACADPCTFGEQTPMDSGQCICWPGYTGVGCDSECSEHGKIVDKSCVCDVGWRGDLCDNPGCPGIGSDCTGHGICNTVTHVCTCNEGWAGKGCEISDCPGTPNCFERGICNASVNPPKCQNCSKGWMGPACNDPCVHGQQIPMDSANCVCEPGWVGVGCDSECSEHGTIVDSKCQCDVGWRGTYCENPGCPGEGEDCSGRGECNSALHTCICQNGWTGDGCHIPDCPGNPNCANRGFCNITYNPPKCTNCIAGWMGPACEDLCTNGTQIPMDSGNCVCDPCFTGRGCNVECTGHGTCLENKCQCDELTGWRGSLCEVPGCPGSNGKDCSGNGKCDSANHKCICDPGWTGVGCHLPDCPGVPNCFGRGNCNATDRVTPKCTDCIQGWMGPACNDPCVHGYPKDGICVCDPCFTGSGCQSECSGFGECIDNKCDCGQEEGTAHMGQYCELPGCPGQCTSPNNGFCSMDTQKCICAQGWAGDDCSTPDCPGEPICTGHGRCSNSNPRRCNCEPDWAGERCEIPCVNGTNYGNSSGCICHPCFSGSGCNIECSLNGKCMNDKCVCDKTLGYKGDVCEIESCPGWPFDCSNHGSCNRATFECTCVPGWSGAACDIPDCPGDPDCNGRGTCTPPITDNETPKCTCQQGWMGVACEKPCKFGTPTADHICVCDDCYNGPACDMLCSNHSSICENRECDCGFDGWRGTYCEKKGCPGYKKDCSGHGQCLSASQTCICDPGWSGIGCEQTDCPGDPDCNNRGQCIPAETPYCGNCAQGWAGIACELPCVNGTQNQVDPTVCDCEPCFNGLSCDVFCSARDNATCSEGKCFCGFEGWRGDFCEKKGCPGLFNKDCSGRGTCNSATQTCDCNPGWAGRGCHEPACPGTPMCSDHGTCESLATISFCSCDKGWMGRACETKCEHGAPQQTGDGSFFCQCNDCYSGISCDLECSGRGNCTNNTCDCGFEGWRGPTCATKGCPGWGSDCSGHGSCITALGICYCRPGWSGRGCHIPKCAGGGNCSGHGVCDGINHDPPVCVSCDSGYMGEGCEQRCINGTVIKGDGDTCKCDSCHTGVDCGVECNGHGKCENEKCVCDSGWRGPKCETVGCPGQGADCTNHGVCLLVTQQCDCFNGWKGEGCDIPDCHGVPDCNALGTCYGGVDPPKCVNCTNNTMGPSCEFPCINGRENPPDSVICECDPCYNGLACDTECSGRGTCREDVNPKRCECDSGWKGQTCETLDCPGEPDCSGRGACVQQGSPPTAVCLCNQGFDGDDCRKLVCPGQPMCSNRGTCTLVGGIPACVCNHGFDGSSCERCLPQFTGSECDECVSNYIGWAVGCNVYCVHGNGTGHNKDICTCHNDANFGYWNGTSCDHCVFGWGLPSCAVCDDAHVGENCDIDCFSAHAQYRDELDGDWGKRPLEPIVSCLYENAPNEVFAWFGYHNKNPHNVYVSVGADNFFTRPYVDIVPGGLKGFVLKTSGADNTTNNLVPLPTQDYGQPIKFVPGRHEKAFKVRLEDTYPIAWVLAYPLSNERNAAVANQSLLHTMKCTNIEGQGSDVSRENYTCSCLDGHWGFACQFDCPGGPQAPCHNNGFCNKTTGLCSCDPNWRGDENCTACSPDWYGLDCSVVNHNLNNHTAAAYAHGYFITIDGAGYKFLGNGEYHLLLSHLWEVQVRMVTCFSSSSCVNAIAVRIEQHTVLMHSRFIDRKEPFVFVNGKKVYSVSFEFGPASQRFTFKRTSRLQFVLSSSYGVRLIVRLYDRYLDVHLRADNQTYCKTSQGLWGNCNLNSFDDLYSRDGKIVTRLNVSQSYVTEIYGKSWEVTEGDSLFVYDINNYHEKRELYGGGYALHFNNAGAHTEEIYSFSLSDITIEFMVRTESENGTLLSYTSTDMFAVILESGKIKLRYDDIILDTLAVIQTNQWNHIALVWSLSTRILQFYHRDDTGKRVNSRNFPIASNVNVFQPGGSLALGYCMPPPGGLTLPVTEGFIGQIDELRIWNQKLDPFSISANWRGNIGCTMRVQNLASLWKFNEGDGIVAHDCVSGAHIYFKSGIWRGPTWVFSTVEIPQFSVDTSTAYSFRFGNSWMSAEQLCYNLIFGSMLKSELILLTTSTLWFYHMSCVTSVTRSNDDSHSYWTLMALSDFNQLIVEHSSWFAQSLCNSVPAFNFPVWYGKDCDHQCKFGLPGTNEHKCFCMKGFFGLNCSSECPGGNNMPCNRQSSCDVSIGTCNCPVSSNTTYDCSVCSPGWIGSDCSVSLGGNRSSSENFTCQGFGATHYTTFDGVGYNFGTYGEFYVIKTNQFTAQVRQIPCMNASFCISSVGVKIGSTEIAIRASYNGTGMPLVWLNRKFTDATSVIMENNFSFHKTSPDVYEIVRPGKILLRVKAWQDYLSFVITSAPQWCFVGSGICSSCDSNVVNDFTNSTGTVYWGNSISEGIIINNLRSQWQVPAFDSLFIFGYTNYKERREITTNGYALSFNGTTASTGSLNAFVKGKDFTIQLFVKVYSSGGTILSYSNQFTFALVNDVRVKIFLGGISYDMGIALPSGTWVQVSIAYRASTGVLTYYQLNAQGDLYYKQTYIGVEMFSSGGTLWLGHWHISHEHITGVPLQPFFGVIDEVRIWSFSLDTLIIRQSFLLVITAKVPSLSALWLFDEGVGRVIANLISGSPDMYLPEVISRRPMWQFSYVREVFPSMVVSTSVQFSITFELLAKKRCLELIYHPHLQGQCGQLLSRAVTQFFFKACLFDVQSSSSLDASHISLIAYADYCMTVLHLSSWPAQRLCEQLPQSLPRRWIGPDCSVKCVFGSADKNNASLCVCHRGYWGEDCANECPGGGNKPCNDHGNCDVKTGSCECDLNWRGNGDCSNCTPGWTGSDCGVAVAVTQLSTCSAFLGGHFTNFDSAHFNFFGVGEFWFVRSIHFHGQLRQIPCHNGESRCISAIAFSFLSEWKVTVHAPYEESKQPVVWVNGKEAVYSSTRLQISSEVFLEKTSSTTYLLSSVLKDLKFQLRVVGRGLVIAGHVNQSFCNGTNALCGNCDGNRDNDFNVTVGSSLEDTWRVSKVESLFIYQKAGYEEERVVTGAEYALMFNSIGICSDLMPDVINASSITIELLFKMYSGPNAGGVLLTYSKAISLTLFIEGTLKVRIGIGIWDTGLSPQVDAWNQVTLVYYNTTGAVYIYHINSVGIVRLATRTMIAGIFNRGSIISIGQWIPSLEVSTKENDSLPGFVGVIDEVRFWNREFSLQDVTKSWGVNVLSTARYIVILWKFNEGQGSVIHDLVSRVHLYIPSIRKAPRWVFSYADIKILPVAPEITFSTSRMKVEAESWCHTHIQNSPLGIACGGLGGGTVAFYVRACLRVIASSNQVSLGISVVVAFADACEIQANLTIWPARQMCTYEAFRNSGLTNWIGVNCDIPCPYGYQPLGLYGRCQCDPGFWGQTCSGVCPGGLVNVCNGHGSCIHSNGICKCSQRWQGALDCSQCTLGFFGKDCSVAVVPPTIQQPVTSVFGTGYIVTLDGIKINVNVAGEFSVLSLFRYGLSIQFRQVRIGSYVXVRCVIVVVQRNVLAIHSSVGVAGQVLVTLNGTPISQNSLVSLGVSGFMFQRTSLNTYVVVGPEGFNFVINSLAIHFDVSITMNKDLCQETCGLLGRCRIPGSRAPPSNCTAGGILDTYDVSNITQELLISYVNSWAIPQNESSFGPILNISGEPQLNSVSGSCLYFNGTSLISAPLLNIFSGNYVTIQFFVKAKNPHVYAGTIISYALIETFAIAVNKTIFIYFGTTVIDTKLVLETGLWNHVSFVYMRRSGQVQFYLVNSIGIIQSRVFFVGVGIFAEGGTLALALWQVTKASLSLPGFVGWIDELSFWNKRFDSVTIQQTWNSNLQAETPGIVLLWKFNEGSGFICRATVGSLNFGLPTPPWKSPVWYPSDAIKVGNVFITPDLSEIVPDNSTRELCSDVFLKGPLFNECANVTGGSEFYYEACISEVSTSGTPESALMIATTFAEECQVALNLSSVPGQGLCNIIPGGRYNDWVGINCSTKCIVGLFTDGNCKCENGYWGINCSIKCPGGAENPCYGHGKCDIISGECNCQPNWDERKNCSKCTPGWIGKTCAVAVSTTETPVTTYTSKVCTILERGYVTGFDGSLFTFTTLGEFIMINSSILQVQVRQVPCEKSSVCLNAIGVTFNDLTISVHAAYESDSFPVVHVNGELTIVGEEPGKDMLKNNISIQPISRSAYRIVISLYLTIQTVFSDRYMSVESTVTSNFCQLVDGLCGSCAKLRVSQNATQGSGGSVSSVDRPTTVLEELGRSNATSGNVNEFVKKELPVQDPIIIIDTEMHKETRVVYGGVYSLYYRFTAVVTQTVVKLFASQTLTFQLLVKSCDPQICGGTVISFASNVTLYVSNHVTVKVVIGLDVFDTGIATEAERWNQITVVFVRERLQLFVFVTFSSGLVQVRKFSFSIDPFISVGTFAIGMWQPASGSISIQPTSFFLGQIDEVYVWARPFDYALVEQSWRSNIQPGAPXLTNLWKFNEGKNSILKDLVTGVTLLFPRYPLGKPEWVFSDAPIASVVAVNPNENNATLRTIAIKACFQFIYEGPIRSACDALGNVTLEFYFRPCVQAVVDTGLTVESIDVVITIADYCQKLFGLPHWPAQSLCNKFPGKRFPNWIGRNCTIPCIFGQAANESEVCVCDPGFYGTNCSGICPGGKGNACNSHGVCDVVTGKCSCELNWQGNENCSTCTRGWAGTDCSIAVTQWPSGSVVIGIGAVLLGGQFTSLNGVSYSLQVTGEYYLIYSIHVSVNVQIRLVTCTQQESCINSIALQIESNRVVLHGPYSAGRGLIIWLNGKVIDIDLHPITHELYGLIVSKITAQLWEVKYTGLYLKIRVIGRFLSLSVEASGLVCKSSIGLLGACNQGLLESLMSYYPTKNCSEEGFMLNLSRNHPNVFSQGDTLGEKNASDTKAKTQDVINTLITTKLKVKECHSLFEYKYGEVVEYREANAGYVLYFDQTTVVSDVIYKAFSFTDLSVKIMFKTVRYGVIISYTLRKTFFVTNTGGKFTIFYGDNVYHTNIAAELNKWNQVSLVFRKSTTVLHFYYFSSGGQLHRLDLNMGVDIFTPGGIIALAGWMPSLDGSGTQPTDFFAGFIDEVRIWTRYFHPAFILHTWNRSVSVKAQDLAHAWKFNEGEGITAIDKVTGMKLDLPFKPWRKPEWRYSDAVLQMPFYDRPLHFNFTNXSLQVAAEQFCNRTILMGTLHSNCKSLGPGVSTFYFRSCLQRIATFESLYMSMEVIIAYADYCQTFNNLTVWPAKHLCNEFPGREFPIWFGERCDKKCVYGKKLASETCVCYHGYWGLECTNACPGGAANPCNNNGLCNVITGECECIVNYNGTQDXGKCSPGWSGLDCSLALVSLNLNRQTSTAISSTDGHYVSFDGYSYTLVSLGEFYLMNLPHLSFQVQVRHVPCRHQTVCVNSIGIRISSTEVSFHAPYNTGGAPLIWVNGKLLLLSGLITTLGSPHLGILLNYKGRNHYQISWRDNFAMGIRIHGRYLSFIVDVTSPYCYNSTGLLGSCDNEPNNDFKASFNESIVPTNVSQPVLNTEIRSHAFVYEKDRVIVLKYKHYHEKRLPTGGIYALLFNQSGASSKPLIKTFNFNADITLEILLQPYQFGGTIFSYAVLQTFAITIESSLRIHFGKAIIDTGVNVTINQWNHVSLVWYHKSRVLEFYHFNFQGKVQRRSYVLSSNPFLPGGILSLGQWELSPGDSEAHIVASFVGTIDEIRVWKRAFNPAFILQNWRMNVVPTHPDLSGLWKMNEGESDIIVNLLTDEHIYLPRSPWQQPHWVFSDADIKTNLTSSDQPFEMHFSNETLERMAKTFCYELFYKSTLHDQCHGQLKSELEFHYLVCLKDIATSSYISAALTAVVTFADHCQAVFNSTTWPAQSLCNKFPGSHFPLWIGDRCDVKCVFGVADPDDRNRCICMEGYWGSDCSQICPGGLLNICAGHGWCDRTTGQCQCQVNWKGDENCSSCSPGWNGTDCQFAVKRVTGITSQNVFVASIGGNGYFTTFFGVSFTYRAVGEFYVLQSASQNFVIQLRQTPCIVDGSYTPLCTTGFSFSLNRNVIVIRAPVTTFSRTVPIFPLVWLNGNFVQVDHRTQLSVDFVMVRISTVAFEIYGLNDVKFGITLGNSLSVTIHLPAMYCQNSTGLLGACTGTSFNNSSSLQAHITSLKQSSVVDKLQTLFIYKYLHYSEYRSPTGAGFNLFFRDHSVRSGPLHLPPVDVLTVELLIKTHQTGGIIFSYLSQNIFAVIDNTTLGIIYKGTVYDTGLKLEIKQWNQLTIVFKQLVGTLYFYHVSSTGAVKVRVFKLDKNVFVDGGFLVLGQWQPSPSSDSMLPQSSFVGEIDEFRIWKRRTNSDLVKSNWRLNVQTGIYPDLLHLWKFNQANGRIILDLLGKNDLFVTKFHEPQWTFSDADIPRVNLEETAFVNLSLQRDAESFCFSLILSGPLYAKCEDLGIQVAQFHYKVCLHDISLSSQLRSAVYPVVTFADHCQNDLNLSEWPAKELCHHFVDQRFPYWIGSSCNTRCVFGYPIPDINSTDGVSCKCEQNYWGVDCANLCPGGLRETCNGHGVCSVTNGTCECESHWKGNTSAEYTAPIVENNSIPPIPCSKCTPGWTGADCAIAEDSSILENSSIPRIAINFGDPHFTSVTGVNFHFEAPGAYHLFNSSVVVAQVLIVPCNNRVSCRRISEVALRTAKRELSVRYNGLETVASSLFDLTSNTSKELSKSDQWVEDADIQYRWLTDNILEVRIQEEIQFNILSYYGTIGTAVEVLEQKDQTDGICGEKESIWIRQQGNKSLKSENLVPDTSNNSTKDQQGLTQATIETQLITRFRIMEKDNFLTTKYAWRSYSGAGYMLEFSSGNTAVMYASNTSLPVLDEFTIERWVCLTNAGVSVASLCTTDQRNTTEPVTGGHAVFSVANAIRDFAIVYKDGLQVKWDKEKFITGINLYEGVWTHLAVTWRRIDGRMQAFVYSNGQHRQSTRYGVKNGKQFSFNGLFVLGRYMRGYMVDSEYDMFGALDEFKVWQYAKTMEQIRMSMSVKFEDYREGLLLSIPFDEGVGQTTVGHLYSPIPTEEALSLFEAQVVNMTNIHLFIHSGDSPGWAPSGVHLTPLANYSLAFLNKTLEEEALKKCYESFYEGKLQEHCSPTLVSQALFYYESCLTDIADSGSLAHHKLSVSLFGFYCQKVLGIKECLLHGTYDAFLRCPGEEKQTKLTPTEIIVITVSSLLFLLFLLIIIIVMCRRRKRRKSEVEQIYLHEAGCERSHKYVAGEEGHHPHADSARRMLDEYDFEPDMDESLQDTPRVTRRPLVRDPAGGVLLDGEEETTL
>XP_022805470.1 uncharacterized protein LOC111342641 [Stylophora pistillata]
MKLSLNDLKHGRKTLERAPPPKTPTVIPTSQDDDNTGALSVASVRERFEQKRIPPQKPNATQSPSQGRGKPTEEKVLSVASARAKFEQELKPVNKGPHPWKPTPKYRNQSKTDLLRLDKNSNAKGHGNKEPPRKELPNIFRIGAAPSKPAKPTNLRFMLKKYKDKIILSNSLTSSTQTSTKADTPVIDTSWNVHQQVRRLLELLPSRALNTEALLRPSLRLLLRTSEEIRVLASTPTISEPSKENPLLRFEHYKETSITSPPSWKQNEGNQKKTGSQRRDGWIYLIDQGNTEPNRKELPTLCDIGAAPQKPARPDYLKFNLRKYWHMIAISKGVRNIAKTSSGSKSNEGAGYLDLEENCDYGAAEPPMITRNVTRLTPGEAEDSKDATASKVITDKGVSLSVKGVNLICPPGAVKDPVSIRLTLEEPYRYCYLIARCGLHNDVLFVSPIVNCRPNGQKFEKYVTLKVTLNRKRVKSDGDLLVLHGTRTGQSQIINWEDITNESKFDLQTKNLEVRINQFSLIAVLARLTLVRTKEIVTRLNLMPFNYNLSVLLKLNKQQSPFDELALVFMSQDTYREEYYRDHEDSAIMRLKKDGFEELPIDSKNAQESICIYNKEIITISVQLGEDYKPANNQQECFEVVVDSCAWWNTGHVIKLPLQVCNTNSKIVCGKILVKGEFGHVRENKFCQQDLCGYVRHVLGVKKAIFDVKAVAQKLELPAETQRQVLTCWQSEAKQLELVLLHWREKHGDAANPNHLKKALEELEPEEYKVKERGQMHIDHLRELAFKIAGLEHHDTDVMKYFDREVEKLCSAVLRDCCIENATSKEEVESAIQTENFASVLVMKKRLPESVGKTCTRMFSSQEDEFNTSSFLCVVSELIQCIKEIFREEKIQRTLSGLRGVTEEATLQKITSGIRDAAYDKSIFKLLSEVCHILEILCRNNNDDRQRESRIQSWGSCAMMAVMLHFLERFFQCAEACRLYRRLKLFSFSIRDIMSNPTAYEGSFLRDFHNVAVSLLKFARFDPSHLVQFELKDPNYSINNQTKDDIKYDTSKEMFSQLFWMIKKCEENLQLEATAHVHVGGQLLTLGAFEAMVILPESFSEVVENAPIASFPVSFKSETDGESSNLRVTLYSTQKDNIGKALEYLQDALDGQLSLSKQTEIKETSLKLLNVAKAGTDVRLRLTHIWGSGTLGVESNTFVPLGYSSPETHNNLEITDEPDQQAVGESSLLPVAGTSEYIADNHNRGNDFMTGPSSAELIPSLTSTTRQTVINVNNYINHTVNMEGHVCVAGERPNLNVQSPSSAAHRFLEAGPGAATNRSITAGDI
Sequences for 5 P. acuta biomineralization genes
Fasta file for these sequences is here
>Pocillopora_acuta_HIv2___RNAseq.g25214.t1
MAEQPRKIVKGRSYGQRRPPLTHILKSILERYPDGGQILKEIIQNADDAGATKVSFLLDSRQGFYGRNSLVAPSLYQFQGPALYAQNNARFEESDWENLQKLMDSDKKDDPLKVGRFGIGFNSVYHLTDLPSIVSGDSIVFLDPQEIHFGRGETGQEFSLEDELLENHEDQFKPYENVLDCKIWTRFYNGTLFRFPLRSAPSDLSEKVYSKEKVRKLFQALEKEAPVILLFLKNIEEIALFETDERGVERHVFTVRLSDSCRQEVREKKRNFLNDLRRLSDGEIENINLSLDLHVLEKTEGGREVENKWLVYHQVDARNSTLKKLSLELGLLPWVACATPVNAVTLQALSSRTGRIFCFLPLPPDADSKTGLPVHVHGYFGLTDNRRGLKWPGLDCQDDSTAEWNVSLVQHVASEAYANVLLLLRDSCDSSDGTDLVYKSWPNIREVENHWQCMLEHMFSILLKENILWTPANHGQWRNLSDAYLDRMTTQFQSTSDETRRVVLDTLTQANEAVVIVPSHVMTAIDKYRSVPTKSITPAFLRALLKKKEKGVWKISNVLKEKKLLLLEFALADKNLDDMRGVPLLPLADGSFVDFRSIQYNREPAAAVYVSSTSNPRSIFHNMDNKFLDDSVQTSAVTYLSKVATDANNSHTIEPVQLVKLNQTIAVKVLREMLPSEWSGGNHSVPWYPGKNGHPPEHWLESVWVWIKRMLPTDLSLLENLPLIPHTCAGNRSIVKLSSSSVVIRRHYQSISLPPFIVSLLGKIGCIVLENLPSYVHHNTLNRYVVTPDPHGVLKIFCTLGQSGCIPAITHCSPDEKRALRSFVSSASLSGDQRNLLYDLPIFDAADGYSFIAVRNGFQFHGVSPYDFKLPQSLPIPRASSIIALRDSQSHTLIHRLGISPMTKTTFLRDIVFSGIQNNFYNHQQISTLMCWVLSQYSLFCGEDSSFHSSLQQLPFVLTMSNKLVTPCCVLDPQQPILKQLFESEYDKFPNGSFVKEETLLRLRQLGMRSTPNKEDILHVAKTVDKIHSDVGSRKASALLEFLDGNPPDKSLGQTLMNERWVPRKQSRPSSYPGAMPWFSGTTHLYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWDKSPPVHHLLNQLRSACLVRLNDMNRSALYHFQVMLRQIYEEGSSNASLIDAVNQDNSFPEWIWHGNGFSSPSKTAFTSCCRIDLRPYLYIVPKEFKILYPFFQRCGVREKFQDSDLLRILTMIKDKHNTSSDYLKDVADDRKLSHEILHWIVRDGKKLNPDLRESLLVPIHTWDNTLKLVPCSECTFCDAEWLRKGGTELLITSEFPMIHKAISPETAALLGVPPISTRVTCAEAVGIEQTGPHEPITTRLKNILNEYKEGVGVFNELVQNADDAGATEVQFVLDWRNHPNQRLLSPGMVECQGPALLAYNNATFTNDDLKNISRLAGATKKEDLEKIGRFGLGFSSVYHFTDVPSFITGSYAVFFDPETTHLGSHISHASKPGIKIDLAVNENSLTCFPDQFAPFNGLFGCDTTPYSDHDKFYFQGTLFRFAFRTKRGEISDKIYNKKEIRSLMFSFRESSPTLLLFSQNVKKVSFLEVDENATDTKDSRLLFEICRKSQSECTPVKNSVTESTFLKSCAAWTRQSTSQSEPFARYPSPKLTELISVCFTICRKNEKRCQETQSWLVTSCLGTGNSFKLATSEEGKKEGLLSASGVAAKICTQGDGSQKVEAVPGEMFCFLPLSIQTGLPVHANGYFAVTSNRRGIWEGTTADIGRQPLEVRWNQSLMEDALTQAYVQLLENMIVMQTQGKIPSYDIFTLWPNPDKLQSSAWEPLTKSFYRRIASYDLPLVRKAGKWLPVTQCIYQDLKLRELPNSKIVLEKFDYKIAQLPDFVKKGFQQAGCMEVIDQRTVTQEKFLRYVFFPNITAIPEELRNPIVCYLIDECLRRHASNWSKQLLDLCEFLLSTNRCIPCGLDTGHLAFPKELISPKSAAATLFSADDRRFPFGSCYQTEERLLVLQNLGMLSDILDWETLIERANSVSVLCRRAKQDDRKRSALLIKYINVYLEKMDPPSELNREELMAISMFPTLAKPPNYVMPWKGTADWNSDILPAKEMYDTRYKFVAGSSRPILDESESGCSRLSERARHLFGFNSRKPSAHEVLSQLEHAVQAIFHSPHAVESLEKVFHCIYDYLQELVLEPDGERIIHALQEKKWILVQGKCLPASRLAFAWRRFGEPYLNEVPQNLASKYRSLFQATGIKEHFSTKEIISALYELNEEKQGERLSTKEFKVSKSLIDEISEASEESLVTERGKIPLPDHNLILQPAEKLAINDAPWVAARSDIGYVHKDLSIDLAHRLGAIDIRTKKLSRISRPIGQKFGQREELTDRLKGILKAYPCDVGVLKELVQNADDAGATEVHLIFDPRYHNTDQLLCDNWKELQGPALCVYNNRPFSKNDLEGIQRLGIGSKANDPTRTGQYGIGFNAVYHLTDCPSFISNGDTLCILDPHYRYAPGADKENPGRLIEPIGEEERSDFRDVFPCYLEDMFDLKSSTMFRFPLRLQSTCTESMISEQRISSTEMNQFMNQLASEAREIILFLNHVKKISLSEIKDDRLKEIHSVSVQLTADDNAKRLKLTNHIKNCRFLNTNEIQWFGITYPLFVHEGRVRQEQWLVHQCIGIQKRGGDKIPNGRDYGLLPRGGIAAKVSEKSKLHSHIGSEPTFKAFCFLPLPEHTGLPVHVNGHFFLDSARRNLWNDEKGEGFGNQWNHFIMSKVLAQAYISLMLEARGHLPGSKRGETTSFSKEFKVHNGMRWYHNLFPHFGTVQSRWRVLAEAFFHNICKEDAELLQLTTRLFGKNSPTSQSAKATQSIQPLDNAQVAKNDFIDREEPIRCFWLSPSQGYFNTLSFTDESAKDLSKVLFNIDFKLFYSPFKLFNDFKTAGTDVREITPEGVIKFLEKNPNSVGILPCPVSETTVGSVVNVVLLLNYCMETPTFLNQIFGIPLLLTEDGVLRRFQMDKQVFLSRFADLLPNQRSEFIHHILAAPLLHFEKEIFQADQGVLKRFDICALASLLPSIAKGKWRETNSLIPWDWKDGPTEPWLKPSEAWSVGQKTVVELLRKLRCPELDVELIGSGNRWDLSPVLKQHLSYPNSSQDILEVLDHLMRTEDISGYLSDDEMISLLQFFQHDAASLKQDRSLTSILKRLPFFKTFMGTFVSLENAKSVYTIPKGLPTDECDVWMRGINCLFLAPEPLLDHLYNRVLNVVSRTHADCYINFIFPKFSSLKEQTRMLHLNYVKRFLLAPFSDEDQHTRVLQSLRTLEFIPDTNGSLRTASYFHDPREKVFAVLLPREAKPPEPFNEKSGWLDLLCEVGLKKQVSRDQFLEFSKEVAKQAENMSKKTRITLAEKSQTLVTHLLGEESFHEAEYLRKLSMIKFVASSKASDNLLSLHKHYHCSKEVQDGTLPFIHFHGSVPKMNERLAWTTAPLLPDWAVSDSGKEMLALGVFPYPSLDQVINHVTILSQNVPKDIDGEIPQPKRRLLGDIMTEVYTFLQRMSRCQESDSLNSCSRECHEIRKRLSDKSCVLVEDGRVFVRGDQLAFRLDEQLAPYLYKVPREYGSFQHLLLRLGAVEEATPEQFAKLLNKLNASCPGKKMRRNEVSIAKHAVHGLFTSLKALQDHNKKGEPVNNALSKIERLYLPSSEKKLERSVDLVLFDFLWYKLRIPTAMYKQLDPLQKYNLTFATPQQIVDLLPAHLKIPSITALVREELHSECREKKCRADVEKKCHETNRLRHILFSPKFVDGMIRILKNQYQKAKLNDEVRGNVRRFQNELKISCMEMLSTELVENKSNTSIPGSQRSRNIDCFVERDESGRKHIFIKHGVGPRNVRRILCEEINQLTGCYIDKVSWLHLADILECESPEDISSTLDKARVSEDVDTTDTPNVEPDLGTDVREEFHYLQEQYGDFYFGKGEFVAFEKDDSTDEEPRYIYAKIMNKVTTKIKPKKDRTKRKQKDESKLLDRYLIDIGHVKKEVDVLDLYKIRRPQTFLEKDEKPEGECFSETMELVPHEGTYRQSAEPEGAECSTTPQSKDDCGELPKPRTLDAARKEVRKKLSEIWKLPEDKRKKAVRRLYLRWHADKNMDMQDITNEVMKYIQTEVERLSKGKSSSRDEGCARPPPPDFSDLFKLWDVIARRQRSSFENYRRHNPRFTGFASHSRRTYTASNLRVAKMWIRQSKEDLRSVKLLLTARDPLYYLVCFQCHQIAEKSLKATLYALSGVADRQLKYNDLVLLAHDLSRLPRAPDVTPQVARLSDYYDGTRYPNKHVPPKVPAEVYQDSQQAQEAFRLATEVLEVLEQFVGP
>Pocillopora_acuta_HIv2___RNAseq.g11609.t1
MKTSDNNYGGQAYFEIPSRKPVASVFSNQMSVMANKAYYKERIQKMQKEIDEAGKESKVLKNLDVFVLDNSLRESTVGQLRGHTIENKWKIYKEIKKVGFKDIIVASFSHMTRLGDTFIKQLHDAGEDFTNLWAFTELLESVDKSGVPDTETIPVGLRKMKELGIRNAIIEMDLVYVGINYKKFKVEAINDLLTERLKWIHANLAKDSKVVVNLRDLPDGMIKKPKRVFKVIRHLSSLPLAIRPFGIVTEESGKYFPEQLAAWIRAVRKEMDDCGFKDGHLLVHVHEKWGHVDTTQLSCLANGANGIWASMIIQGASMGAASSTVTLMNLVRLGNKKVLKKYNCSALREAAQEICRVTTGYEPYPLQPIYGERALDMVFGMPTKLGINEFDLAKFFGEEPLMRMTTLASAEMIVTRLKNLFGEDPQFTIERGTRMKEVMLEDLHKNIKEEYMSAAGLAMLFDRSGGHLTEKMAEVLTKEKPTRAHAQVLIAEIRAMWDEWDLRDGQRDDKLEFDAFYNGFLAPYFGCYRCDETKQALKAMDMDEDGTVDWNEFAVYLKWAMRQYPETKTSEDLLSVAFRKGLIPAMQDEVVRKQEGEPMEA
>Pocillopora_acuta_HIv2___TS.g23498.t1
MSRSSWKWLFVFIIAIVFSQARGTYYGHGYYPAPTTQSPACPSYITSNNNVKFTASTSSLGPGHPIINGNEIWCAATIDKAQHLIVDLGGVVKIDHVVIQGKAGTKQSVSMYYVKTSKDGSTFNYILDDSGTRPKAFWGAIDDGDIVAKTNLKKPVKARYVSFNPREPQTDANGLCLRVDVNICNGELTPINGGWTSWSSWSECSQQCHIYGQGSGYAIKSRHRTCSNPIPTFEGAACKGNSGEDAFCLNSCSSQVNGNWGYWSQWSACSKTCGNGTATRTRKCDSPPPSGGGSSCVGDAKQTKYCSDRLHCPINGGWSPWSSFGGCSLMAAGLIGSHGQVALRRAEEVHRRASALATIPHHNMAAFTVKEAIQIINLATLKIAPLMEAGLSGHNGLPVTKRVAVAAPIDEGNVPAHHLRTVAKHALEIVTKFRNAIRKDAQWTAGGLSGPNGQDAVQHAEVEYDQEDVPVPIHHRPMTDQDVLAITWRLKGATHNRVQWMANFLHGLHGNNAARCVEEELKAGHGSVIIPLPPRVGSPVLEIPSRLECVAPTSVQLMVAGQIGQVGPPVAGHAVAEHSQEDVSAAARNLDGGWTNWATGPCNVLCGDGKRNRTRTCTNPPASGGGDVCMGPAFETENCNSGPCKATTAPTATALPTPTIDPNIPKIDLVFAISATSASFQQSYELMKNTVKKFIDTYGVNKIHYSVIVYGNQVIRVVNFNRTFPLSANELKTAIDRQPALPGGPVLTNALQEAYRVFKESVGRSGAKKVLVVITDRNSGSSTNSLSQAVRPLEDLGVLVISIGVGNEVSRSELNIISPNPLDVISARLNINPSVLAVRIMERILRLNFPDVDVGFAISAASADSDKIFSLMKQIINTIVDRYGVNKVRFSFIVYGSRVTTRFTFDNAPITQEELIKAVNGTEKVTGDPDLEKALEEAEKLFTKTSRPNATRVFVVLSDIVGSGDDNSLIATSARLRKGGVLILSVGFGQQVNAISNQMTKVVIAQSDYISVPDFTTQRPVVIAETIMFKALQANIPEIDLTFVISATSASTDRTFTLMKSTINNIIDKYGISRIHYTVIVFGSGFTTSVDFSTNVPDKETLTRLVTSLQRETGTPDLAKALQEVKRAYELREVRPNAKKVLVVILDKKTDNSKVQLETIVTDLVQKSILIIGVGIGRSVDRNELIYITEENRNIIEVEPTERPEEVAREIMKIILRSCAERTSLNDTNYSASTVEQPAQFAKLDTSDPSASQRAWCRDINDVGGYLQIDLGETADVYQVATKGQEQNNRWVKSYYITLSSDGNSFFNYTQGGRTKVFSGNRDSRTVVFNNLNATQAGRYVRFYPITFNQQPCMQASVFACTEIIDPTANPARISEDAGNGILIALWILAGILTFLLLLACLYYCCWHVCCKRGKKSKGLTTFSETYTEEDGGYLIEDGESKRWNPRAVPMVARAPKVDKVPEDEVQEVSIEMKEDSPQLGVIQFGIEADNTKDQHVTAETVHSETPLYSQEVNTGTVKKGATKMTMSSTDTQKRKRAQSESAAATLEEAEATSKQWSYQTEQRQSSAFANEGYMRSQESIAPTQVRTRTQELRRAQSADELSAIDYDMFEQRRESAQSALKGEMSRDGYMRMKQSSRGSSVDETDRGFQMGTVDLAIGGIEAPNVQRGSSQFYGMEEQEMTFSADNGGRHYYELEDGGYRTEEWYARGGHEPGRLRDEGFREIHVEHQPIYHEIEHDNKVSEERVCYSYVKYDKFVGEF
>Pocillopora_acuta_HIv2___RNAseq.g7668.t1
MFKTRRKNALLVAVLLLLTPSFVVATLSSCTSNSNADISSPCKFAPGVHSYTSLTVRTEVFLETTSGSSQHFFNVSQTFDIKANGRLVLDYNKESGSNPGAGVLTSGSTGRSGGSYGGRAGAAAKTPLSTSQAESYGSAFVVTQPGSNGGGDPNTRGRGGGFLKIYARKLVVAGTVQANGQRAQQNDGGGSGGGISVDCFEIDGNGRMEASGGMGSGEGGGGSGGRISVQFQHGSFLGQAKSYGGRTENEVKAQGVAAGSHSLTASSRKSVSAPHPSSPNIVYDPSKGRLNYQGSTGGWMPATNNIGSEWLEVGLSKEAYITDVATQGRYSGSEYTSSYTLKYLDPYISSPETWRDVKENGEVKTFTGNSNTNTVKQNALPTDVYTRAIRFYPKAFSGEIALRVELYGYAADLSSNPGCSYVLPQDNNPEPGGPGTVYFDGYKSGSRHRVLSVDNQGRRPRINSSSDVSVFLQTGAAAWVTVSADHKIEEVEVSGGAQLVIAGSAKNLTVDSITDDNTGYLYLLSSMQLQISTLLSPCTLIYHGAELILSDLAPNVSLERSVDVHGTLSTLGAPSVYVGAQKGVFTMHPGSGPSSLSFGELIIESSGHLQLLNYDKQSPASCRWSVALTSSKFTLSDNSKMDVECPFNLTGDQMNIGLNSALKIDGNSSVSYISMNGVSIRGTFDPGVLSLLEGWKTLRIERYGDMSFFPHGDVRINTFYSNGNFHVEGVIYLRGRDPAVTQLIEVDEYGSVQFDLPLSSNSLVFVHNNSHEFGSHGRLSLNGVSLVHADIVVVDGTWLPNKLKIEPGCKELTVEHGGKFHFDPVGIFQLNKLLLDGDVKSLNAVKMEGLSQQKVQQCEVGFHGTVLIDSADLTTILCEYVLLSGTLRVGNLSIGSVWNDLHVNGTNGKFYFETSEPLNINQIRVSGLIDTGSAIGPSAPLTSNNFTIESAGEVNIHFQAQPAITVDGAINSTLYVTTLEVNGLFKLGSLYLITDNLSVGSSGRISVNGGGAQGGMGPGAGSQKDGGASGASHGGRGGRGSQTLAQQLIYGDIFSPGGWGSGGGNGSGNSGGGRGGGVIFLQINQTFDVNGVIQMNGLPGQTSTSGGGSGGSLWATCQQFTGSGKIEAKGGNGNTYGGGGAGGRITVNYVTGGFHSDGTDASGGQAGSGSNVEHGGPGVIYLDGKSPIVKNLRIDNKGLKPLTSLSGDYSTFIYSGAVAYLNPPSDNFVYDFTLVEVYGGGQLVFPRAGTEVRVDTINGDDTGYIHVPPFNTLNVTGPSEYRRINVTWAPFIYEDATFVLPNGTVEIRKAESLLYPNIERSSHISFWGSVLGFKAHLMVAYGATVSFENSCPRNLNFVGITVQKTSRLLFKSNMSVEADGWTVEVTKDNGPVYRDGIVTIEGDGLIEARALTIKAASLIVDPRGKLTLDGKGYFAGSGSGSGSSDGSGAGYGGTGGNGRVTTTTGIPYGDYTNPRLFGSGGGLGSNAGFGGGALLLKIVDTLVVEGTITVSGSQGVSSNSGGGSGGSILIRTRALEGSGVIAVNGGAGNSNGGGGAGGRMAIYWQDREWWRGSLTAFGGSSSQGGNGGAGTVYLVDTQHGVNNRTLIFDNNNLSPSAMEISDYSSPYTNGGRAWILPIDQEFELEEVHILRKAHVALHPNITRPHGLKVYSFVGDRTGVLHVGPNQVVIGQFADHELFEVNVFVYRDGHFHLPPTFACYGIDITVRGYLGIEDMTIAKSCRLFLALTGSTELANSEGTYRYNSLTVADGGELTSTSDVGNNSLTLDVDDITIQGGGVLHMVRMRIFVGNFTVDDLGHLRGDTFDNSCTSGAGVTNSNGGSSGAGHGGTGGLGRYGTRVGVAYGHVYEPEHFGCRGGGSGGLGGGIIRMSVRGTLQIDGTISCNGDNGQQSRSGGGSGGSIWIDTILMKGYGTVQANGGSGHVETSHSLHGGGGAGGRIAVYFRSNRTYSGVFESLGGDSTGDALPGGAGTVFLYHLIHKHRTLLISNAGRKALPTEHLVIKDYSEPALISGKTWLLPSSGEHNFSRNQNYYFEELQIYGAAHLAVLTEPHNRSATIYFRNMIGDRTGTIHVGYNQTMDLIRPTIDLPFNVRVYRGGFLGLAPATEVHGVEIHVDGVISYVKNLTLHHGGLLALNENSRTGNEATENDFKFDFLRVQFEGVIQMTSSPVTHNGMNLTVRVLHIEGGGKVEGSDLRILAENISINTEGQLTVSGRGYKHEDGTGEGVHGKINQGLGSGSSSGASGGGHGGTGGRGKHTSKVGLPYGNMYEPIEFGSSGGGVNKKQGVGGGTIFLNVTNLLEIDGALSADGGDALPQGGGGSGGSVWINCYIIKGFGKITANGGSSPSDGYGNHGGGGAGGRVAVYFIKNDTFSYFSYQAHGGQAKEGQDKVENGGPGTVFLYHLVHTHRTLLIDNNGGKPLNKHINYGRLDEEGGKAWIMPESGFHHFAAEEDKFHFEELQIYSKGHLAIWPRAGNDSRNVSMFFKYMIGDRSGMIHIGDKQVMDLKRPEIDLPFSAQVYLGGFLGLAPYTQVHGIEIIVRGILAYIRNMTIHNGGDLWLNHGGRTDHEIINHYDFDFIRVQDTGTIHCVTSPVNDPGVLFTTRAVFIEGGGLMRGSRLTFVTENITIDDGGRLIADGLGYNTSHGYQGNDISGAPINPGHGVDDNEGASGAGHGGSGGRGSLTYGTPKTGFAYGDLYEPYIYGSAGGKGRGGTRGGNGGGMLWMNVTGLIDVDGLVSANGEDASSLTGSGGGSGGSIWMYCKTIRGYGRIAANGGAGSKDSSYPGGGGAGGRVAIYFQINETSTYFVYEARGGSALGCEVGKEHLCKAEAGGPGTVFLYHMIHTHRTLLIHNGGQKPLVSAIADYKDLSEDGCRAWILPQSANHDFAGRGRDFHFEELQVYGGGHLAVLTEPVGEKASLFFLHMIGDRTGTVHVSKNQTMDLHRPEIDTPFSAHVYAGGYLGLAPYTEVHGVTLFISGTVDHIQNMTIHHGGAFWMYHGGNTANQTNSSFEFDAVRVQDNGVIQAITSPIIHPGITIIARAFFVEGGGLFHGTKMTVLGENITVDDGGLISADGEGYNRTHPQGSGLHGVINPGIGSSHIYGSSGAGCGGRGGRGDHSAVVGAAYGDLYEPVRFGSSGGGDKSGRGGGVIWFNVTNVIQIDGEVSADGRKGDNSGSGGGSGGSIWMHCYRIKGTGAIKVNGGAGGGSSGGGAGGRIAVYFTENTTYTGSFQSRGGAKGGGSNTEAGGPGTAFLYHLVHTHRTLLVDNGGQHPLTRRISDYSDLSRDGGRAWILPESGGHDFANGSHDFHFEELQIYGGAHLAILTEPVNRAASLFFRYMIGDRTGMIHISQNQVMNLHRLFLDIPFSAYVYDGGYLGLAPISEMNKIIVYVEGTLDHIRNLTILNGGELHCYLTGSTGERIQRHYNFNETVRIMARSQIQSHSPNAHKETFSLTAKILLVEGGAAISTFNMNITAVNLTVDDGGSIDASDGGYTATKGPGSLLTNNWRRSGAGHGGTGGRGSCGGYHTCRLKKGLPYGNLYYPRDFGSGGDGNGGKGGGILSISVAHTLQVDGNIFSNSRAVNNDNGGGSGGSILIHTQVLSGGHTGVIQSKGGSGTAGSGGGSGGRIAVYYSNNDTHHPYRGKFDTSGGSVTSGAEAGASGTVYLKHTGSGFSTLRVDNNGQQALDDEIPNAGVLLDLSGGRADQGTTYNAPNGMTVTSSCAIRSPLCHNPCQSCRDYSIANLFDQTFSTSSCAGYFLSGCHYTKLTVDLKSLLFINHIRFYPFCSGSRPNFRVTTNDGIRNVPVTSNYVQISNGCIQGSYVDMPVRRKATQIYVELNHPTSNTYSGLSELEVFIDGKEVWDRYKYRSFDGAKTWIEPATGTDVYSFSEVHIRGSAQLAVMPLNGLQAPVHFHADRLYGDKSGFLHVGYNQNFSVAVTDPDIPFGLRVYENGSMMLPRRAFLQTVSFKSSGKIWGVQDLFVFDHGTFYGDSNSSLGKDTVPGQYLVKSLHVQDRGVFELHSTDKKLTSRLSLTNLTIFGGGHFKSNNKLHIAVKHLLRINSGGRLSHNHAGYVTKEGRSGDEFEPSEGPGQGIGSVHGASGAGFAGTGGRGTGTGLVGQFYGDYRRPDDYGSAGGFGLHYGNLYYGDYRTSSVGPYRNFITRSGLGGQGGGAIEIVTRHLILDGHLSADGEDGPPPSTAGGGSGGSIWIDCEELDGYGTISANGGAGSPSKGGGGSGGRIAIYQAFMLNFNGTLSAKGGNSAVEPGASGTVFLETRNNSKVEYRVLKINNFGMAYPWAVDKSQGRLRNLMRGIYTDTKYVGAVTWLHEADKYTLDEFHLHGNSHVALYGNGSRGNVTLYAHTLRGDRSGVFHVGRFQSVVFDFIDLYFPINTLVYFNATLEVPRRLSLREVYMEINGTLADSDDYTIDRDGKLFLWSGGQSLGEKQGHFRFINMSIKSLGLLHTTKIQGHGPVSLHTTRFVVNAGGLANVDDFFLYSVNATIDVAGDVSADFRGYGAEQGPGTAVQRTPYYTGAGGGSHGGRGGRGTAGLHTPSSYGSIYEPTQFGSGGGNGLNGMGGGQGGGRIVFEISDMLRLEGHVHADGEPGSRSSDPSGGGAGGSIVIRSFKFDGEGTVSVNGGSCPSTYAPYGGGGAGGRIAVYYNGSYTFIGSFQSYGGISQAEWGGAGTVYIKNNQNVSQPYSILRIDNRATRSGPSRLNEIQELHLAGNSADYPYYLTSYTAPNGVTLITTGIPYCGRTSHHDSKICDTDDSKISNLFKSTSNSYYTQNSNPVVTYRFPLPLFLEYLLIYPHCNSYHLTQFYVRVYFNDSEVVGSNGWINPTNCLQGQPLRMNVRQTVEKVEVSLQKISSYSSLSLVRFFVRENPSTTLQTPHYTSPSTSWIVTDEEKTTQHDFSELQIMGQGSLSMSGNSVKMSVNKVVGDNSGILTMRPTQSILMRGSEGHLPFSLLSQKGSSVAFPTSMTCREVEVTIRGVMGEMKNLTVGPQCRFVLDNSSETEFMLDHMVVQTDGYMAVLREDREDVKMVGKTFDIRGGAKMEANSLTLDYINITIEPFADLSSDGLVDEVSGSRYHGDGWGGDSGGGSSGAGHGGHGGVGGRQQKVGISYGKYRRPTTFGSTGGAQVFPFTGGLGGGRFKIIAHDTLVVDGVLSSKGGNARASRSGGGSGGSILAYTSRIHGDGEFDVSGGNGDSSTGYHGGGGGAGRICLYYRENHFLGRFLGFGGTSSIEPGGPGTVFLENVPGMNATYGHDRIDEAAHAERVLLDENVVNGTQWVRNRTLYANAMGRKPQSPDANLSSSYRDFSVGGSSRVWLILDDEDLEANGTDVELDELQLYGGAQLAIINPANTKAYISIVIGQMEGDRTGRIHLGFNQTFLSLQSYLPMDMIIYQGGLTTMQGELLVAGVTVEIDGVLRRCQNITVVDDGVIRMKEMYDLEGKPTETFYFEAINVRNKGTMMVTNQERVREFRGKSMQIYGGGTFTAVNLHVHVVNFTIDALATVHATLEGEKFDYEGKGPGAKYPTPGFAGGSGGGHGGLGGRSTSQVTTGAAYGSVKEPTEFGSSGGKGSSGAHGGRGGGILFLNISNTLDIEGTLSVDGANNGGSEAGGGSAGSILIRTVLLEGSGTVQANGGNGYTGSSRAGGGGSGGRVALYYQGGFFDGVLEAAGGKGDLENGAAGTVYVEKAVNNSDTPHRTLKVDNKGRPPLNERVNEIEEVKLYPGRLKDGRVYSDEYTSISGLRFKSDGSTMPLSLGSSSYYSLSRMFDGDLKNAYIVQSSSVHVVIEATLPRLMFIHHIRLYPYCHQHYRVSFTLTTDHPVIGWKDRTKGSRSFANCFDTLVYNDVTIEDTISKFTVTLTRLESNVALSELKVFVGRDTSSFKAITLEQDSSRTWLVFNDQGTMTFEVDELDVRGSAHLGIQNGAGKLDFKVKEYKGDFTGTVHVGGAQNWYLNASNNSVIPFTLRTYQGSQVFLPREVYLQKSSIYADSKLEGLKNLFISQGSRFDVTQGAHVNSPSKATILLDRITILNGGTFYQRTAEPAKLALNLTGELIINAGASMEVSKVHLQAHNIFIDIDGVLSASGRGYSSMKGDEPGRKSDVAASGAGHGGAGGSSTSQEYVGRAYGSFQIPLDFGSGGGQGYQELPGSSGGGAVKLSASHIVQVDGLLDVSAGAAVNPGTGGGSGGSVLIQATLFLGKGKIFADGGEVAHSSLGDAGGGAGGRISAHYKSTRFAGSFSAHGGASRSEAGGPGTVFLSENSTHTTMIIDNNGYRASKLYISDYRDRSNDGGRAWLLAGYMDEFTLNLLQLRGGTHFAVYHLKPSFTLNVERLEGDLGGLIHVSKKNRVYIKNAPRLFPSSFHIYNEGFLHLPRDVLLKDLFYPRISLEGLISGMDNLTLGGGAEFVVTAEGQTDGYSKKTIHLNSLTIMNDAKLVATDTVLDSPTVTLLLNESLRVMAGGWIQGKWIDINAGDIEVEASGVITAEKQGHDAKSGSGSPSGSAGAGHGGRGGLGDSGSSPGNSYGSLFRPLSHGSGSMFAVGGGVIKLATSQSVTIDGLVDANGEDALNKSSGGGSGGSIWILSKVFKGNGVIRASGGSGLFFESGGGGGGRISIAFENRTFSGKINVFGGASNKTAGGAGSLYLHNKYTDFKQLIVDNNNIGSPLTDDITDVSNDGGRTWLTPEPNTIEMSFDEVDIRGQSQLAVLTTPPDSPFRWNIGGIRGDRSGILHVRANQEMHMTISDNEGKQPQLLWGVNVYPRGDLKLPHNLVVDGIKIITAGSLSGAQNVTVGNNGRLILRQLISPNNQMSRNLTFDVIEIQGGGRIEIQSDKDGLSIKCTALWIRSGGVLIADRLSIVADSVTIEQSGIIDLNFKAVVAGSGPGAGYSHYSGSSGAGHGGRGGRGQSENRTGGFYGDFISPKMFGSSGGGGSGDDIATGGGVFYLHAQRIVHDGEISVNGKDALNNSDYGGGSGGSVFIEVEYFDGSGIIEANGGAGGINGGGGGSGGRIAIYYNQTFFTGQIFAYGGGSTVESGAAGTIYKKNKHNGKSILEVYNEGKKPLKKAIADYSDLTSDSARTWLTISHIIGSPVPVTVPDINLGTTVYKGLTITEVKLGGSAHLAIEPDATKIRLHTFAQFYGMFEGNSFGFVHVGPKQLLAMPDTDYYIPVNLKVYPSGYIKLPDRVMLHKNSLSLDAGYLIGVEDLAISQCTVSFGAGSGAQSTGSLQAMYFKIQTLTIMSQGILDMVAPNSNYSLHIDSLVINSGGALNGRKVDIVAKSVTVDESGKINLDGQGEKCLDPNVYYAGSGGSHSGYGGFGIGSKRQDRFDSVFLPVAFGTAGYAGRSSFSCMGGSGGGSLNLTVDGTLQIDGQISSRGQNAKDSESGGGAGGSILIRITTLEGTGTFEVQGGDGGVTSGGGGSGGIVAIYYKISSHQFSYKVRGGGGKKIGASGFLYTKRDLQRSRRQVSESDSVLILEGREVLSFESPSVLVCDPKLIDFTFEEVKLLKSSTLTMISCSQGSPMTLIAKTIKGDKTAWLVVKPNHDVYIGVTSIVEPSMELEFNAEIEDGGTLSVPGNFFISGDTQINLSGSLIGVSDLTVNDKGKLVLNYPGHTGFRITPDQGKSVVQISTVRIKDGGSITTTSPSKVEMKSDLLQKDFGGNLGPGISVSSNQRIEHTNGPSLNKQGCPHGYEVVEVASKTLYNPCGVGKHIFNKRNESYLVLKNVSVAISHNETIYVIKNETRFNVTYYIACDYDDFKLLPGQSCNLAPGSYKYNSLEIQGSAAMYFEPGTEKGNASTLSVSKLTIFSQGQLIAKTSNFIDAISTPSDYGGSYGGLGGGASENSALYGNISFPVDYGSNGGGSSQNHGWGGGVIILKTTELFNDGLIDASGGDGSSGAGAGSGGSIQVVTDYMKGSGIFRARGGNANFPAGGGGGGRVGITIRKGQSEFRGLYDATGGDGRRPGSSGTVFVRDQRQGASYEMIMFWNKIIGYPPAQLPNTSAPYTYDEIRLENRGTFLATAQLVVAKSFVTDGTGKLTISGGARVDILSFSKSSRTFSCDLEIQAGGSLYFYNQPIFLGPGSPTVVVAGILDAREPSVGKGKSIKITSTGEIRLDKLRLLKDSVMRVDSDASIKKSFSYAQFHLTSLRLDTNAQLIFAQENVSLRADLIHLSQGAAITSDTDTKLINITSNDILIDNQARITADEGGFLGGPGKANGSGSGCGHGGRGGGGQGGESYGSVFEPQHYGSGNNARGGGVIFLNIKGGFTLYGSVSANGANDSRGGASGGSILVHAGTLSGHGEVLSNGGEGLSNSAGGSGGRIALYITDRTSFKGVLTTYGGCGTTCAAAGTIFIREYVVGLPQNSTVIDNGDRKTEANTIIMHEMKISYTMRLLKLVNGARLEVATVPNVEMKIAIQNLEGDGSGSFHVHHNQTLTLGAGKAVSSRPFMFPWAMIVDEGATLNLDPVLFITRTAISPSLYLAGKLTGGEKVTVGQDASVVIAKSGVIGTHSNTPGKYSFLSLKVSSGGRITIEVDEDANAPVELKSLSVDVAFGGVIIGRYLRVDTSLLNIAFSGTLQANGLGNPAGVGPGAGSSSLLTGGGYGGCGGGNTNETCVVYGSLFEATEFGSGGGTTQVPDGIFGSGGGIIEVEAQVLIVDGTISSNGESGSSTTGGGSGGSVDISISQTFSGRGKIKAEGGYVSGQVTGAGGGGRISILITGDNKFSGSLSARGGSSSAKSGSPGTVYTEDGKTVLRKRKLFLDNGGISSNSPLPIFLNQSVVASYDFQEIHLNGLVMLHVDKDMEVEKLVTDSDSVIYIKDNVTFTVEPNSKYLQPDCSFIVDANGEIRIPDKVMFLGRNNIFKGTLTGILDMVIGENRKVYLLASARTARYIDGKYTFITHRGEYRFSSLRIKNGAFFSFENAHLKKVPLTLGRLEVNFGASMQGSWLDIKASDVIIHSGATIDLSAQGYESDKGPGAGGLHQSDGTGAGHGGYGGISTVNFGKWYGSALNPNNTGSGGGSSSSGKGGKGGGYLRLTVVRLLTLEGTISVDGDGGTVLNSGGGSGGSIWISADNIQGNGIISAEGGDGNGTGGGGSGGRVALYLQGLMSFEGLLNAKGGDGKDAGAAGTLYIQDNNKRIPRKRLWIDNLKVGNNKPQTVLYEADRVNFLFDELRLNGMSRFEIYNLQRKLQTIQVTNFISDGVGEIAIRKNQTLLAEVLEAKESHLTLTTNIYVEEGANLVVASNLTIDGATLTLDGKLSNVRHLVVESGSAIKFGVTSQTTLMENKNFVFQSDPGTQQFASVTLKSGSDFGAPLNLKLSVGKLNMKSGVILQGKFVDIKSQSLLIGRGATLTTNDIMEIELNAGGRGHSSSNGGSGGGHGSIGGTGYNTLAGGIPYGTIYEPNQPGSPGGDGGSGDSGGKGGGVISIDTDILENDGSITANGGDASQSSQAGGGSGGSVYIIASSVFSGTGTVSAHGGRGDGAGGCGAGGRVAIHLKSQYAYRGTLEALGGISSSSGASGGPGTVYIKDVRYKLYFEQLHVDNQGQSWQNYVTLNESKTSYHFHELHLVHKASLRMTPSSNLTQSSTLSIGKLFGDRSGLLHLYNGHKAIIEVVEAQLTTTKTPVNLRIDSGAEAVMATTVYIVGDGAVALHCNGTLNGVRNLYVTQKRVVLLEQGSRTLRDDEQPGTFMFSNVKLFSGSSVTMKDEIVMKIIAGFLNIKFHASLEAHYFDIVTSNLDVETGGLLSVAGDNKARLAVEPSEVSSLPQGAGAGHASNGGSGYGGAAGGLYHGSLYKPKESGRRGGKGTNNGIGGRGGGYVKIEAGTLIINDGIITVEGGSAVSGGGAGGGSGGSLLFNTESFIGYGEMNSNGGNGGGTNAGGGSGGRIAIYATENLYRGTYQAFGGSSVSGAYGGPGTVFLQDIRSKRPFKQLRIDNLMRSIKDPVTIDEANLTNHDFSQVHLFGRAAINMAVRKERTTLKMSRLFGDRTGLLHSRANQTFYLEASATEHSVSKPAVNLRIDENAEMVFGASLYVIGDGAKGTGQITGDSSFTIDGRMIDVTHLFLTKRLKSRFLSHAHSADYHNETLTVSAVGTFVLATFEIQDGSEVFLPDVQGVQCEVGLLHMKYGSVIVADTYRIGVTSLLLETGSKITASGKVRPSGYDSSVLPSSCKGSGGSYGSKGGKGHNGVNELHSYGSIFTPGHYGSAGCPGSQNGGKGGGLIIMEVGDELYLDGTIANDGQDAASGSAGGGGSGGSIWIKCGRFNGHGLITSNGGAGDGLTSGGGSGGRIAVDTPTENKYVGEYTAIGGDSGDPSKDTTQYSGGPGTVFLKDARNQYAHTQLRLDNKGRTWDHYVTLNESLKSYTFDELYLARKAAIHLVPDGKPLNLTVHKVEGDRTGLIHVHENQTLKAEFLDAVYTITRTAANFKLDKGANAIMATSVHVVGQGEVAFEWNGRLIDVQHFHVAYGRTIKIGFYAHTAGTKAGKYRFIDGYGTFRFSTLEFGSGTLIHYPPPMGVHFIVSLLDIKFSSYFEAEFFKIEATDFYLEPNATLNCAGRGFESKTEGSGKDSASGGSGAGHGTPGGDGKDVSGGEEVGSVYEPVLPGARGGTRTGRTTGSRGGGRVRASVGFAFRLDGIINVDGDDAATNSGSGAGSGGSVWITTGYLRGHGDISARGGVGNTDGLATGGSGSGGRIAVHVKIKDEYRGGFYALGGVSSGTQHGGSGTVYIEEIQGDKLFRRLYIDNQNANPPKIFTLDEMNPKTVKANATEENDAEFGFDELMLQRGVVFRIADMRLSKRPAISVITVLGDGSSVLHVMENQTFFIEYQEYTRRRSFPPVNFKVDYGGELMLVSDFHVAGKNNPAFELEGRITGVSNLSLTEGRVLRAGENMSSALLKDKVYIETPIDGQLKFGVFIMEASSGLYFAKRMKFVVSTLYMRQKAVISADKIHMALNEVHMEGSSRITTSGKGPKAGEGLGPGSSFSNVGSGGGHGGQGGPGSTVDGGTGYGSYVYPVHPGSGGGGDGGGAGGCTTEITVGYSLHLDGIIESEGANGTSNSGGGSGGSILIKTVLFSGHGLIVANGGRGDGNGGGGAGGRIAAHVAWLREYAGQYTAFGGTGFKAGAAGTVYYTDTNQGLSHRPVLINKANHTVFGDGFTKLTVDNFNRNPDIPTIIINENSSYYEVDELEMRNHGLLHIHGNNSSFVIHNFTGDRTGLVHLRQGQKMFVQVVESKSGYSVAPVSYKIDEGAEIVFPSSLTLLGTRCSFDGLIIGVHRLIVAEGADVVFASTTQTGIKEDRKFRFLTTPGNVTFAEVYVQKGSRLEFSRINNTLVFTAIIFRLKYHGLVNINHGEIDSSWAWVESEGKLVLDYTGHPAEMGSGPGNTVNLIGSGAGHGGMGEVSQAGQLGGEPYGSIYKAVHLGSGGGNGNGKGGSGGGMLHWRIGQEIELDGLVTLRGGDGSGASAGGGSGGSILIETTNFTGYGEINVMGGDGSGPSGSGGAGGRISAHVRFRHKYAGVFKAYGGDGKTYAAAGTVYIEETARGPQYADLKYDKSTNTTYITATHRYMEVDNEDRKTEVSTMMMESEHLFYELDELFLTRHANLQVRHPPGSLNVTVIVHRFLGDGTGRFHVRINQTIYVEVVESEINETTAPCSYKIDQGAEVVFPAIVNIYGTRSIIEGRITGVEHLIIASGGFVEFTSTAQTARVENRRYVEIDENGNFSFATVTVERNSRLTFSRILNYTLSLRCSEFRIKYEGLMTMNHGYIYSAFAWIESEGILSLDGTGFGPEQGFGHGTTKNNFGSGAGHGGEGGKTDYGEGGIPYDSVYTPRLYGSGGGNGRGIGGSGGGSLFWIVGQRLQINGLLSSKGTDGEGIDAGGGSGGSILITTTNMTGHGEIAVPGGSGTGSGSAGSGGRVGIHCRWRYKYGGKFTDHGGQKGRYGGPAGTIYKEENFRPLQYRHLKYMKETNTTMLAVDHTYVHIDNDGFDVPGATLLMEENTTYYEFDEMELTGYSRLLVYHPGNVTVTAVVHKFIGDKSGQFHIRRDQKIFVEYVESETNKTEAPCSYRIDVGGQIILPSEFSMHGTRSVFEGMIIGVRDLLVSLGAEADFYSTSQTALIENGDYIAISKPGNISFAIVIVKKGGDIEFRKNTGFLRVNVDELKIKYQGKLSMNHGEMFSTFAWLDSQGHFNLNEGGNTAAKGQGAGSTVNSIGLGAGHGGRGARSGGQAYGSVYRPLVLGSGGGNGGGTGGTGGGQLLWEVGKRLELNGLVSAIGGTGNGGHAGGGSGGSILIKTTNMTGHGEIAVTGGDAINQGGGGSGGRVGIHCRFRYTFGGKFTDRGGFGTQSQYGAPAGTVYKQENLRPLEYRILKYSKETNETFLAVDHTYLHVDNEGHDVPEATVLMEEGTTDYEFDEVELTGYSRLIVYHPNETDVTVIAHRFIGDKTGQFHLRVNQTIYVEVVESETNRTEAPCSYRIDEGAEIVLPAEFHVHGVRSELYGLMTGVHFLFLEDGGTLKIASSAQTALTENRTYIDITQPGNSSFAHIIIKQGGLLDLVRVEDVAVSVTSSVFEVLHKGTVRVNHGVFYSAFAEVETKGVVVLDGAGYKAATGPGAGSSYSSNSGSGGGFGGQGGRSHSNSNGGSAYGSVYKPLSYGSGGGHGRWNGGGAGGGSLWWQVGKLVHLDGLLSSKGESGSSNGGGGSGGSVLIETTNMTGHGEINVNGGDGQSNAGGGAGGRIGIHVDFQNNFGGKFRSAGGNVSGYPANAGAAGTVYKYESRRGPQYRDLKYNPDANLTSFKPEHSKVKVDNENNNVATPTVIMENQTVFYEFDEMQVEGHSTAIFYHPETARNVTVIAHEVTGDKTGIIKLVSRQRLFVFVVESTHTYMDAPCGFHVEDYAEIIFPTEVILRGESSTIRGRITGVERLVIERNGFIEFGGTAHTAQLPEESQWIADNPFDPFTPGLIIVPQLIISNTGVVKVKMTPIRVVLDIADTNVKKGGQLILYTNHVTINADFVTVESGGLIDSSGAGYTAASGPGAGSGSTGGSHASPGGGAASGTQYGSVYLPDEPGSGGGYGAGGGQVYIKTGGYVIVEGTIRANGNGLSSKSSGGGSGGAIVVRSLFLKGYGTIECHGGPSHDGAGSGGRIAVYLTEHFIFRGSLTALGGDSGTTRYGSPGTVYIDVNVGEEPYRIVQIDNKNRDSLLPVTLAEANTSLYEFERIHLVRKGALAFKAVSGKLVKIFIGKATGDKTGILLALRNTRIYVESHSVRTEAPVNYQADTGGQIVFPIQTTLLGTRAPALTVNGEIHGIEELRLSSNVGSLVAEKGFSACLDCHSNYTSDYIGHYWFKKLQVDLGGTFEVQSSVQTISSQAVRLHMGEIALDYTGSLKADAAKLLTEYFSLEFDAATDASSSGWSSKQGPGSSTTCSGVAGAGHGGRGGTGYTSGCTSCTANGGNTYENVSQAIQAGSGGGVNLADGGGVVFVSVEKLLELDGSIKSDGANGDYGGGGASGGTLWVAGRHFEGHGHLTVKGGAGSHRSECCSVSPCNSHRNYHGGGGGGGHLRHFSPDYIRRDIIRNRGVSGGASGGGSAGNGGSGQISAAGNQCSGHGTFSVQEGSCTCDAGSYGVSCLYQCDASITCLGHGRCSASGGCDCDAGYVGYRCEHMCDATRDCHGNGRCSVTGKCVCDPCYSGDDCRYECSGNGTCIGGKCKCDPCYIGTHCHSLCSGHGTCNNGTCYCGSKWKGDYCEVPKCPNDCSGNGICNSALLTCFCNPGWRGLDCSELDCPGEPDCHNRGTCSSINGTVMCVNCSVGWMGPACNDPCVNGVQEPMDSGFCKCNPCWAGKGCDALCMGRGTCSDNGICKCDPLQGWRGDVCQIPGCPGVGKDCTGNGDCNSATHECTCYPGWAGLGCDIPDCPGAPNCNNRGYCNASVTPPQCQNCSRGWMGAACADPCTFGEQTPMDSGQCVCWPGYTGVGCDSECSEHGKIVNNSCVCDIGWRGDLCDNPGCPGIGSDCTGHGICNSATHICTCNEGWAGEGCEIPDCPGTPNCFERGLCNASVNPPKCQNCSKGWMGPACNNPCVHGQQVPMDSGNCVCEPGWVGVGCDSECSEHGTIVNSKCQCDIGWRGTYCENPGCPGDGEDCSGHGECNSALHTCICQNGWTGDGCHIPDCPGNPNCADRGVCNVTYNPPKCTNCIAGWMGPACEDLCTNGTQVPMDSGNCVCDPCFAGRGCNVECNGYGTCLENKCRCDELTGWRGSLCEVPGCPGSNGKDCSGNGKCDSANHICICDPGWTGVGCHLPDCPGVPNCFGRGHCNATNRVTPECTDCIQGWMGPACNDPCVHGYPKDGICVCDPCFTGSGCQSECSGFGECIDNKCDCGQEEGIAHMGEYCELPGCPGQCTSLDNGFCSMDTQKCICAQGWAGDDCNTPDCPGEPICSGHGSCSNSNPRRCNCEPDWAGVMCELPCVNGTNYGNSSGCICHSCFSGSGCNVECSLNGKCVNDKCVCDKILGYKGDVCEIASCPGWPFDCSNHGSCNGATFECTCVPGWSGAACDIPDCPGDPDCNGRGACTPSIADNETPKCICQQGWMGVACEKPCKFGTPTADHICDCDDCHNGPACDMHCSNHSSNCVNKKCDCGFDGWRGNYCEKKGCPGYKKDCSGHGQCLSASQTCICDPGWSGIGCEQTDCPGDPDCNNRGQCIPAETPYCGNCAQGWAGIACELPCVNGTQNQVDPTVCDCEPCFNGLSCDVFCSARGNATCAEGKCYCGFEGWRGDFCEKKGCPGLFNMDCSGRGTCNSATQTCDCNPGWAGRGCHEPACPGTPMCSDHGTCESLATISFCSCDKGWMGRACETKCEHGTPQQTADGSFFCQCDDCFSGISCDMECSGRGNCTNNTCDCGFEGWRGPTCDTKGCPGWGSDCSGHGSCITALGICYCRPGWSGRGCHIPQCAGGGNCSGHGVCDGVNHDPPVCVSCDSGYMGEGCEQRCINGTVIKSGEGDTCKCDSCHTGVDCGVECNGHGKCNNGKCACDSGWRGSKCETIGCPGQGVDCTNHGVCLLVTQQCDCFNGWKGEGCDIPDCLGVPDCNALGTCYGGVDPPKCVNCTNNTMGPSCEFPCIHGRENPPDSVICECDPCYIGLACDTECSGRGTCREDVNPKRCECDSGWKGPTCETLDCPGEPDCSGRGACVQQGTPPTAVCLCNQGFDGDDCSKLVCPGRPMCSNRGTCTLVGGIPVCVCNHGFDGSSCERCLPQFTGSECDKCITNYIGWAVGCNIYCVHGNGTGQNEDICTCHNDANLGYWNGTSCDRCVFGWGLPSCAVCDDAHVGENCNIDCFSAHAQYRDELDGDWGKHPVAPILNCLYENAHDEVFAWFGYHNKNPHNVYLNVGADNFLTRPYLDIVPGGLKGFVLKTGGADNATDNLVPLPTQDYGQPNKFVPGRHDKAFKVRMEDAFPIAWVLAFPLSNERNAAVANQSLLHTMKCTDIEAQESNVSSENYICSCLDGHWGFACQFDCPGGPQAPCHNNGFCNKTTGSCSCDPNWRGDENCTTCSPGWYGLDCSVVNQSTNNYTAAAYGHGYFITIDGAGYKFLGNGEYHLLLSHLWEVQARMVTCFSSSSCVNAVAVRIEQHTLLLHSRFVNRKEPVVFANGKRVYSVDFEFGPASHRFTFKRTSRLQFVLSSSYGVRLIIRLYDRYLDVHLRVDNQTYCKTSQGLWGNCNLNSLDDLYSRDGKIVTGLNVSQSYVTELYAKSWKVTERDSLFVYDINNYHEQRELYGGGYALYFNNSGAHTEEIYSFSLSDITIEFMVRAESENGTLLSYTSTDMFAVILESGKIKLRYDDIILDTLAVIQRHEWNHIALVWSLTTRILQFYHRDDTRQRVNSRNFPITSNVNVFQPGGILALGYCIPPPGGLTLSLTEGFIGQIDELRIWNQKLDPFSISANWRGNLGCTMRVPNLASLWKFNEGDGVVAHDCVSGAHIYFKSGTWKGPMWVFSMVEIPQFSVDTSTAYSFRFGSSWMSAEQLCYNLIFGSMLKSGHILLTTSTLWFYYMSCVTSVTRSNDHGHAYWTLMALSDFNQLIVEQSSWFAQSLCNSVSVFNFPVWYGKNCDYQCKFGLPGTNEHKCFCMKGFFGLNCSSECPGGNNVPCNGLSSCDISIGTCNCPVSSNTTYDCSVCSPGWIGSDCSVSLGENRSSSENFTCQGFGATHYTTFDGVGYNFRTYGEFYLMKTNQFTAQVRQIPCMNASFCISSVGVKIGSIEIVIRASYNGTGMPLVWLNRKLTDATSVILENNFSFQRASPKVYEIAKPERILLRVKAWQDYLSFELTSASQWCIVGSGICSSCDSNVVNDFTNSTGTMYWGGSISESIIINILRSQWQVSAIDSLFIFGYTSYKERREITSNGYALSFNGTTASTGILNAFFKDFTIQMFVKVHSSGGTILSYSNKFTFALVNDVRVKIFLGGASYDMGITLPSGTWVLVSIAYRASTGVLTYYQLNAQGDLYFKQTYIGVEMLTSGGTLWLGHWHITKEHITGVPLQPFFGVIDEVRIWSFSLDTLLIRQSFRLVITANLPSLSALWLFDEGVGRVIANFISTSPNMYLPEVISRRPTWQFSYVRDVFPSMVVSASVQFSVSFGLLAKKRCLELIYHHHLQAQCGKLLSAAVTQFYFKACLFDVQSSSSLDTAYIALIAYADYCMTVLHLSSWPAQRLCKQLPQSLPRGWIGPDCSVKCVFGSADKNNASLCVCHRGYWGEDCANECLGGGNKPCNDHGNCNVRTGSCECDLNWRGNGDCSNCTPGWTGSDCAIAVAVTQLPTCSAFLGGHFTNFDGAHFNFFGVGEFWFVRSIHFNGQLRQIPCYNGESRCINAVAFSFLSGWKVVVHAPYEESKQPVVWLNGREAVYSSTRIQISSDVFLEKTSSTTYLLSSVLKGFKFQLRVVGRGLVIAGHVNQSFCNGTNVLCGNCDSNRDNDFNVTAGSSLEEIWRVSMEESLFIYHDTGYMEERAVTGAEYALMFNGVGVCSDLMPDVLNASSITMELLFKMYSEPKVGGVLLTYSKAISLTLFIEGTLKVRIGIEIWDTGLSPQVDSWNQVTLVYYNTTGAVYIYHINSIGVVRLATRTMTAGIINRGSIISIGQWIPSLEINTKESDSLPGFVGVIDEVRFWNREFSLQDVTTSWRVNVLSNARYLVILWKFNEGQSGVIHDLISRVHLYIPSIREAPRWVFSYADIKVLPVTPEITFSTSKVRVEAESWCHTHIQNSPLGIACGGLGGGTVAFYVRACLRVIASGKQVSLGISVVVAFADTCEIQANLTIWPARQMCTYEVFRNSRLMNWIGLDCNIPCPYGYQPLGLYGSCQCDSGFWGQTCNGVCPGGLVNVCSGHGNCIDSNGICKCKRRWQGALDCSQCTPGFFGKDCSVAVAPPITELPVTSVFGTGYIVTLDGIKISVNVAGEFRVLALSRYGLSIQFRQVRIGSYVRVRCVIVVVQQDVLAIHSSVGVAGQVLVTLNGLPISQNSLVSLGVSGFEFRRTSLNTYVMVGPEGFNFVINSLAIHFDVSITMNKDLCQETCGLLGRCHIPGSRVPPSNCTAGGILDTQEVSNITQELLISYVNSWAVPQNESSFGPILNISGEPQLSSVAGSCLYFNGTSVISAPLLNTFVGNYITIQFFVKAKNPDVYTGTIISYALNETFAIAVNKTIHIYFGTTVIDTQLVLERELWNHISFVYMRSSGQVQFYLVNSIGIIQSKVFLVGVGIFADGGTLALALWQVTKVSLSLPGFVGWIDELSFWNKRFDSVTVLQTWNSNLQSGTPGIALLWKFNEGSGFICRATVGSLNFGLPTPPWKSPLWYPSDAIKEANVFITSDLSEKEPDKSTQDLCSDVFLKGPLLNECANVTGGSKFYYEACLSEVSTSGTPESALMIATNFAKECQAALNLSSLPGKGLCNVIPGGRYNDWVGVNCTTKCEVGWFTDGDCKCDNGYWGINCSRECPGGAANPCYGNGKCDIRSGKCNCHPNWDESENCFKCAPGWIGKSCSVAVSTTESSVTRQTSKVCIILERGYVTGFDGSLFTFTTLGEFIMINSSILQVQVRQVPCEKSSVCLNAIGVRFNDLTISVHAAYESDSFPVVYVNEELTKVGGEPSKDMLKNNISIQPIYRSAYRIVVSHYLSIQTVFSDRYMSVESTVTSNFCQLVDGLCGSCAKLRVGQNATQGSGGSVTSIDRPTTVLEELGKPNATSDNVNEFVTKVFPVQDPIIVIDAEMHKETRVVYGGLYSLYYRFTAVVTQTVVKLFVSQTLTFQLLVKSCNPQICGGTVISYTSNVTFYISNHVTVRVVIGLDVFDTGIATEADVWNQITVVFVREKLQLFVFVTFSSGLVQVRKFSFTIDPFISAGTFAIGMWQPASGSISVQPTNVFLGQIDEVSVWERPFDYALVEQSWRSNIQLGAPSLTNLWKFNEGKNSIVKDLVADVALLFPRYPLGKPEWVFSDAPITSVVTVNPNEDNATLHTIAIKVCFEFIYEGPIHSACNALGNVTLEFYLRACVQAVVDTGLTVESIDVVITIADYCQKIFGLPYWPAQSLCNKFPGKRFPNWMGKNCTIPCIFGQAANESEVCVCDPGFYGTNCSGICPGGKGNACNNHGVCDVVTGKCSCELNWQGNENCSTCARGWVGTDCSIAVTQWPSGSVIIGIGAVSLGGQFTSLSGVSYSLQVTGEYYLIYSIHLSVNVQIRLVSCTQQESCINSIALQIASYKVVLHGPYSSGGSLIVWLNGKVIDIDLHPITLDVYGFTVSKITAHLWEVKYAGLYLKIRVTGRFLSVSVEASGLVCKSSIGLLGSCNQGLVQSLLSYYPTKDCSEEGFMFNVSGNHSNVFRQGSDILSEKNASDTKAKTQDIISTLITTKLKVKKCHSLFEYKYKEIVEYREANAGYALYFDHTTVVTDVIYKAFSFTDITVKIMFKTVRYGVIISYTMRKTFFVTNTGGKFTIFYGENVYHTNIVAERNKWNQVSLVFRKSTTVLQFYYFSSGGQLHRLDINVGFDIFTPGGTIALAGWMPSLDGSGIQPTDFFAGFIDEVRIWTRYFHPAFILQTWNRSVSVNAQDLAHAWKFNEGEGIAAVDKVTGMKLVLPFKPWRKPEWRYSDVELQLPFYDRPLDFSFTNKTLQVAAELFCNRTLLMGTLHSHCKSLGPGVSTFYFRSCLQRIATSESLYMSMEVIIAYADYCQAFNNLTVWPAKHLCNEFPGREFPIWFGERCERKCIFGKKLASETCVCYHGYWGLECSNTCPGGAANPCNNNGMCNVITGECECNVNYNGTQDCGKCSPGWHGFDCSLALVSLNLNRHISIGMSSTGGHYVSFDGYSFTLVSLGQFYLMNLPDLSFQIQVRHVPCRQQTVCVNAIGIRITTTVVSFHAPYTTGGAPVIWVNGKLLLLSGLISTLGSPHLGILLKYNGRNYYQIIWKDNFAMAIRIHGRYLSFKVDVTSAYCYNSTGLLGSCDNEPDNDLKVSPNGSIIPANVTQPVLNTEIGSHAIVYDKDSLIVLKYEHYHETRLPTGGMYALLFNKTGASSKPLIKTFNLNVDITLEILLKPYQFSGTIFSYAVLQTFAVLIESSLRIHFGKAIIDTGVNVTINQWSHVSLVWYHKSRVLEFYHFNFKGKVQRRSYVLPSNPFLPGGILSLGQWELSPGDSETHTVASFVGTIDEIRVWKRAFNPAFILQNWRMNVVPTHPDLTGLWKMNEGESDVIMNLVTDEHIYLPRSPWQQPHWVFSDADINTNLTSSDKPFEMHFLNKTLEKMAKAFCFELFYKSTLHDQCHGQLKSELEFHYLVCLIDIATTDDISAALTVIVTFADHCQAVLNYSTWPAQPLCNKFPGSRFPLWIGDRCDIKCVFGAADPDDRNLCICMEGYWGSDCSQICPGGLLNICGGHGWCDSSTGQCQCQVNWRGNENCSSCSPGWNGTDCQFAVELVTSITSQTVLVAAIGGNGYFTTFFGVSFTYRVVGEFYVLRSASQNLVIQLRQAPCPIDGSYIPLCTTGFSFSLNNNVIAIRAPVATFSRTVPIFPLIWLNGNLVQVDHRTQLSVDFVMLRISTVAFEIYGPNGVKFGITVGHSLSVTIHLPAIYCRNSTGLLGACTGVSFNNSNSLESYITSLKHNSVVDKSQTLFIYKYLHYSEYRSPSGAGFNLFFKDHSVRSGPLQLPPVDVLTIELLIKTHQTGGIIFSYLSQNIFAVIDNTTLEIIYNGSVFDTGLKLEIKQWNQLTIVLKQLVGTLYFYHVSSSGVVKVRVFKLDGNVFSNGGVLALGQWQPSPSSDSMLPQSSFVGEIDEFRIWKRRTNSDLVKSNWRLNVQDGIYPDLLHLWKFNQANGRVIPDILGKNDLFLTKFHEPQWTFSDADIPRLNPEETTFVNLSLQRDAESFCFSLILSGPLYANCEDLSIQVAQFYYKVCLHDISLSSKLRSAVYAVVTFADYCQNTLNLSEWPGKELCHHFVDQRFPYWIGSRCNTRCVFGYPRPDINSTDGVSCKCEQNYWGVDCANLCPGGLRETCNGHGVCSVTNGTCECEPHWKGNISAEYNAPIDENNSVSPIPCSRCTPGWTGADCAIAEDSSILDNSSIPRIAINFGDPHFTSVTGVNFHFEAPGAYHLFNSSIVDAQVLIVPCNNRVSCRRISEVSLRTSKTELSVRYNDFETVVSSLFDKTSNTSKELSKSDEWAEDADIRYRWLTDNILEVRIQDEIQFNILSYYGTIGTAIEVLKQRDQTDGICGEKESSWIRQQGNQSLTSKNQIADTSNNDTTDQQGLTQATIQKRLITRFRIMEKDNSLTTKYASRSYSGAGYMLEFSSGNAAVMYASNTSLPVLDEFTIEIWVCLTNAGESVARFCSSDQRNNTEPVTGSHAVFSVVTAIGDFAIVCNDGLQVKWDKEKFITDINLYEGVWTHLAVTWRTIDGRMQAFVYSNGKHRQSTTYGIKNGKQFSFNGLFILGRYMRGYMVDSEYDMLGALDELKVWQYAKTMEQIRASMSVKFEDYREGLVLNVPFDEGMGRTTVGHLYSPISVEVALSLFEAQVVNVTNIHLFIHSGDYPGWAPSGVHLSPLANYSLAFLNKTLEGKALEKCYESFYEGKLQEHCSPKLVSQALFYYESCLADIADSGSLAHSKLSVSLFGFYCQKVLGIKECLLHGTYDAFLRCPGDDEKQTKFTPLEIIVTTVSSLLFLLFLLIILILVCRRRKRRKSEVEQIYLHEAGGERSHKYVAEDEGDHPQAYSMRQMLDEYDFEPDMDDSPHDTPSVVRKPLVRNPAGGVLPEGEEESAV
>Pocillopora_acuta_HIv2___RNAseq.g30830.t1
MARLDRTRRWSMDPTRNITPSYYGGQELRRKRLPPLSEIGPAPTKPAKPLHLKLLLKKFQEMVFLSGRDYSIPTTVYGPNIYTTLKEKENIAIGELNEDVSTIVEPPMMIRNRRGLTLRKEEGHLAMEEIYDDVSTTVEPPMIKRNRRELTPGKDENSKDATASKIVTSNGASLSVKGVTLTFPPGAVEDPVTIRLTLEEPYRYCYLFARCGLQNDLIFVAPIVNCQPNGQKFKNHITVEVTLNGKRANSHGDLLVLHGTRTGQSQKPNWEDITDKSKFDFETKELKVKELKVKVSHFSLIAVLARLTWVRTKEIVTRLNLMPFNYKLSVLLKSNRQQYPFDELALAFMSQDTYQEEYYRDHDDYAIMRLKKDGFEELSMDCRNSQENNCIYNKEILTISIQLGEDYKPANNQQECFKVVVDSTAWWNAGHVIKLPLQVSNANSKIFYGKILVKGEYGHVRENKFCQQDLCGYVRHVLAVKRAIFDVKSVAQKLELPVETQQQVVACWQGEEEQLELAIQHWREKHGDAANPNNLKKAIEELEPEEFKVKERGQMHIEHLRDLAFKIASLRHQDSDLMKYFGREVEMLCGAVLRDCCIENATSKGGVESATQTENFASVLVMKKRLPESVGKMCTRMFSSQEDEFITSSFLCIVSEFIQAIKEIFREEKIQRTPSGLRGVTEEATLQNVTSGIQGAAHDKNILKLLNEVCYILEILCRNNNNDDRQEESRIQRWGSGVMMGDMLEFLERFFQSTEARRLYKRLDLFSVSLRDIIKNPTDFEGSFLRDFHNIAVSLLKFAKFDPSHLVQFELKDPTNSINNQTRDDIKYDMSKETFSQLFWMIKESEETLQLEATAHVHIGGHLLTIGAFEAVIVLPESSGEVVENAPITSFPVSFKSETGGDSGNLRVTLYSRQKNDISKALEYLQNALRGHLILPKQPEIKETSLKLVNVAKAGTDVALRLTHRWGSGTIEVESNSFVPQPLGYSSPESQHNLEITGRSECIANYYNSGNNFMTGPSSAELSPLSPSTTQSTVINVNNYINHTVNMEGHVCVDGENPSLNVQAPSSAVQRFLEAGPGAATNRSITEGANEDS
I also copied the subset fasta files to /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
on Andromeda. I want to blast them against the nr database. In the scripts folder: nano blast_biomin_subset.sh
#!/bin/bash
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=36
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.13.0-gompi-2022a
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
echo "Blasting Pacuta subset biomin genes against nr database" $(date)
blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db nr -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt
echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)
blastp -query Biomineralization_Spist_subset_sequences.fasta -db nr -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt
echo "Blast complete" $(date)
Submitted batch job 305441. Job was running for about a day but there still wasn’t any data in the output file so I cancelled the job. Going to edit the script so that I’m using the nr database that we have in the putnam lab shared folder.
In the scripts folder: nano blast_biomin_subset.sh
#!/bin/bash
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.13.0-gompi-2022a
gunzip /data/putnamlab/shared/databases/nr.gz
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
echo "Blasting Pacuta subset biomin genes against nr database" $(date)
blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db /data/putnamlab/shared/databases/nr -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt
echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)
blastp -query Biomineralization_Spist_subset_sequences.fasta -db /data/putnamlab/shared/databases/nr -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt
echo "Blast complete" $(date)
Submitted batch job 308861. Completed in 1.5 hours, but didn’t work. There was nothing in the output files and I got this error:
BLAST Database error: No alias or index file found for protein database [/data/putnamlab/shared/databases/nr] in search path [/glfs/brick01/gv0/putnamlab/jillashey/Pacuta_HI_2022/data/blast::/glfs/brick01/gv0/shared/ncbi-db/2024-03-11:]
BLAST Database error: No alias or index file found for protein database [/data/putnamlab/shared/databases/nr] in search path [/glfs/brick01/gv0/putnamlab/jillashey/Pacuta_HI_2022/data/blast::/glfs/brick01/gv0/shared/ncbi-db/2024-03-11:]
I’ll revisit this….
I think I will retry to run the code above but use the remote NCBI server? Idk if this will work. Editing the script to include the -remote
flag:
#!/bin/bash
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=20
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.13.0-gompi-2022a
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
echo "Blasting Pacuta subset biomin genes against remote nr database" $(date)
blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db nr -remote -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt
echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)
blastp -query Biomineralization_Spist_subset_sequences.fasta -db nr -remote -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt
echo "Blast complete" $(date)
Submitted batch job 309006. Failed with this error:
Error: [blastp] internal_error: (Severe Error) Blast search error: Details: search failed. # Informational Message: [blastsrv4.REAL]: Error: CPU usage limit was exceeded, resulting in SIGXCPU (24).
Removed -remote
flag. Submitted batch job 314563. Ran for about 4 days, then timed out. Going to add: -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6
to the script. This will likely make it run faster. Also changed tasks per node to 10, -t
to 40 hrs, and --mem
to 125GB. Submitted batch job 315190.
I also want to blast the down and upregulated genes of interest against the nt db. In the /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
folder, make two fasta files: downreg_subset_seqs.fasta
and upreg_subset_seqs.fasta
.
Here are the sequences for the downregulated genes of interest in downreg_subset_seqs.fasta
:
>Pocillopora_acuta_HIv2___RNAseq.g24121.t1
ATGAGTCTTAAATTCCTCGCTGCTCTTATACCGCTCATATGTTTCACACATGCATCTTCTAAAAAAATTCCGCTGGTCATCGGCGGAGTGTTTGACATCGACACAAAGCTTGGAGATGAAAACTCTGCAAGTATGATCCCAATAGTGCGCATGGCAATAAAACATGTGAATGAATGTCCCAAAACACTGCTTAACTATGAACTGCAAATGGAAGTCAAAGACGTCAAGTGTCAAGACTCCGATGCAATCCACGCGTTCACTGAGTTCATTCGAGAGGAGAAGAAGAAAATCATGATCTTAGGACCGGGTTGTTCCAAGTCAGCAGAACCGTTTGCCAAGGCTACTCCTTTCTACAACTTGGTTCAAGTGGCTATGGCGGGAGCAAACCCGGCTCTATCATGCGCGAAGATCTTTCCAACGTTCATGCGCACCATACCACCGGAGCATTTTCAAAACCAGGGAAGAGTGGCGATCGTGAAACATTTCAAATGGAGAAGAGTGGCCATACTACGAGAGAATATGGACACTTACCAAGGACTTTCCAACGATTTGGTCAAGAGGTTGAAAAAGGCTGGGATACAGTTGGCCAGTTATCAGACTTTCACCGGCAACGCTGAGTTGCAGATTGAAAACATACAGAAAAACGACTTGCGAATAATTTTTGGCATGTTTTCAGAGAAAGCCGCAAGAAAAGTTATTTGCACGGCTTTTCAGAAGGGGTTTTATGGTCCTAAAGTCGTCTGGATTCTGATGGCGGGTGGGTACAGAGAACAATGGTGGGGCTACAATGATGTGAATTGCACCAAAGAGGAACTTTGCAAGGCGCTCGGGAACTACCTTAGTACCGAGGCGTTGATGATTGGAAAAGATAATCAGGATACGATTGCCAGAAAGACCATAGAAGAACTCAGAGCTGAGTACAAGGCAGAGCTATCCAAGACTCCTTACAAAGAGCCCAGCCGTCATGCCGCCTTTGCTTATGACGCTGTGTGGACTATAGCCCTTACTTTGCATAAGTCTATCTCAGCACTTCAACAGCAAAACGAAACCAGAAACATGTCATTGGAAGACTTTGACTATAACAATACTGCTATGAGGAAGACATTCATGAAGGTTGCAAAAGGGGTATCATTTCAGGGAATGTCTGGCTTGGTACAATTCTACAAGAACTCTGACAGGCTAAGCTTGCTTAACATTGATCAGCGTCAACGTGGCGAAATGAAACGAATTGGAACTTATGATATGAAAAGGAAAGTCATCGTAGTCGATAAGAGCCAAACAGCGCAGTGGGAAGATAGAAAGGTTCCAACCTGCACGATCAAAAAAATCCTTGAGCCACGGTACCTTCCCAAGAGTATGGTATTTACCATGGATGGATTGTGCGTGCTCGGGATCCTCTTTGCGGCAGGGCTCTTCTTTTTCAACTTTAAGTATCGAAATGTCAGGTATATTCGAATGTCCAGCCCGAATATGAATAACATTATTATTCTGGGATGTGTTTTGATCTACATTTCTGGAATCCTTTTCGGCATTGACGCTGAAATTGTCTCAAAGAAGACCCACGAGAAAGTGTGTCAGACCAGTGCATGGACGGCATCATTTGGATTTACCATGGCTTTCGGTGCCTTATTCTCTAAGACATGGAGGGTTCATCGCATTTTTATGAACAAACTTAAAAAAACGGTGATCCAAGACTACCAGCTAATCATTGTGGTAATTCTTCTCCTGATGATTGACGCCTGCGTCCTTTCCACCTGGCAAATACTGGACCCCATATATACCACTTCTAAGACCTTCCCAAGGAGGATTGACCAAGACGGTGACATTGCCATTTACCCATACCAGACCTTCTGCACTTCAAAACATGAAAACTTATGGACAGGTTTGCTATATGGTTACAAAGGCTTCCTTCTGCTATTCTGCACCTACCTGGCATGGGAAACTCGCAAAGTCCATATGCCAGCGCTAAACGATTCCAAGCAGATCGGCTTTGCTGTGTACAATGTGTTCATTCCTTGTGTCATTATCATTCCTATACTGAACCTACTGGGGAGTCACAGCGATGCAGTGTACTTGTTAAGCACTTTGCTGTGCCTTTTCTGTACCACCATCACTCAGTGCCTGATCTTTGTTCCAAAGATTTTTGCGATGAAACGCACAAATGGTGACCCGTCTAGTTACGAGAAGGGTTCACTCTCTTCTGGCAGTACCGTCGACAGCAAAATTTGCCCATCATCTTCAGCAAAAAGCCTTGATCGTAATTACAAAGCGGAGAAAACAACCTTTCCGCCACATGCTGGTTAA
>Pocillopora_acuta_HIv2___RNAseq.g13974.t1
ATGGCAGACACCTCCTTCGCGATCCAACAGTCCATGTCTGAGAGGAAGCCCCCTCGCATGCCTAAGTGCGCCCGTTGTCGCATCCACGGTATGGTGTCATGGTTAAAAGGGCACAAGCGATATTGTAGGTGGAGAGATTGCAACTGTGCTCAATGCACACTGATCGCAGAGAGGCAGCGTGTCATGGCGGCGCAAGTGGCGCTTAGAAGACAGCAGACACAGGAGGAGACCATGCGGGTACAGATAGCTAAGAAAGCACAGGCGTACGTACCTCCCCTTGTCAGCCCTGAACCGCAGTTTAACCACGGTGTCGCAAGAACTCACTCAGAAGAGGCACAGCCCGTGTTCTCGTACCACCGCGCTGACAGTCAAGAGCTGGAAAAGGAAGAGATTAAAAAAACCCCGATAACCTCTCAATCAAAATCTGTTGAGTCCTATGAGGTTAAAATCAAGGAGGAACCAGTGAGTCCGGAAGATTTTGAGAAAGATCACAGCTCAGATGCTGAAGAGAACCACAAGAGATCTTCCATTGATGAAGATAAAGAACCTGATTTCGAGCGACGAAAAGTACCCCGCCTCTCCCCCCACAGGAAAGAAACGGAAGCATTTAAATTTTCTTTAGAACTTCTCCAGCGGATTTTTCCTGATCAAAGTAGAGCCATTCTAGAGCTCATCCTCGGGGCTTGTGAAGAAGACCTGGTCAAAGCTATAGAGTCTCTTCTACCGGAGAACAATCAAAGACCATTTAGCCTGCCTCTTCCTCTTAGAAGCTACGGTTCTGCTTCCTTTATCCCTTGCGACGGCAATCAGGCTAAGTCAGCGTTCTCTCCCATCGCCAAGTCTCCGTCGTATATGTTTCCAGGAGCTCTTGCTGCGCAAGCGCAGTCGTTGAAGAGTCCAAACGACAAAAGCCCCAATACACCTAGCGCATTTCAGCCCGTTCATTCCACGTGCAGCCCACCTGAACGTAGCCCGTCAATGGCAGATAGGTTCCAGTTTCCCGTCGTTGCGGGGTATTTTTTTAACAGGCCTGGCACGTCGGCTCTTCTCTCGCTTAACCCCGCACAGCAGAGGAATGGGGCATCTCAGCCGGGAACAAGATTTTGTAGGCATTGTGGGCACCCGAGCAAAATTGGAGACAAGTTTTGCAGTGACTGCGGCAAGTCGTTGGAGTGA
>Pocillopora_acuta_HIv2___TS.g25049.t1
ATGTATTGGGTGTACAATGATGGCACCAATACAGTCAATTTTGTTTTGGAGGTTAGCACGCTCGGATGGGTCGGTTTCGGATTTGCCAACAAAATCAACAGAATGAAAAACTATGATGTGATTGTTGGAAAGATTGAAAATGGAAGGGGATGGCTCACCGATCGTTTTACCGGAGGTTATTCAGAACCTTTAGCAGATCTATACCAAGATTATAATCTTACCGCTTTCAATGAAAGCAACGGCAAAACATTCTTGGAATTCTACAGAAAAAGAGACACTGGCGACAAGAAAGACATTGAAATCAAGCCAGGACCGATGCTTTTGGTGTACGCTTACCACACACTGGATTATCCATCGCCTTATAAAACAAAGCACGAAAAGCAGGGCTTCAAAATAGTCACCTTAATCCCAGCAGATACAAGCACGACCCAGGCACCCAAACGCAGCACCAATGTCAGAAGTCTGAGGCTCACAATAGAAAATAATCAGACGAAAACCACCACACCAGCGCCCTTCACAGCTACATTGGAGGAGACTGAGAAAGCAAGAGTTCTTGGATCCATATCTTCATACACTCGTGGTTCAACTTTCCTGACCTGTGGGGTAACTCATTCTCCCCGACCAAGCCTATCAGCATCGTATTCATTGAAAGAAAGGTGCAGGTGGAGGGTACCGCTAAGGCCTGACGAGTTGCCAGTTGATGGTTTTGGTCACGGATAG
>Pocillopora_acuta_HIv2___TS.g8000.t1
ATGGAGTCGAAATTGAACGAACTGAAAGAATTATGCCAGTGTTTCGATTGGATAAGACTTAAATTTAAAGGGATCAAACATGTCGACATTTCAGGAGACGAAAAGATGACGTCAGAATATATAATCACTGTCGTTACGGGAAACCGCAAAGGAGCTGGTACAGATGCATCAGTGTCTCTTATAATCAAAGGTAGCAACGGCGAAACAAAACCTCTGTCTATGGACAAATGGTTCCATAATGACTTTGAAGCCGGACAGAAGGACGACTACTACATAACCGCCAAGGATGTTGGCGAGCTTTTGATGATCACTCTAAAAAATGATGGCGGCGGGTACAGAAGTGATTGGTTCGTCGACCGAGTAACAATCAAAACTAAGAACGTCACCTATGTTTTTCCTTGCAATCGTTGGGTAGAGAGTGAAGTCACTTTCTTTGAAGGAAAAGCTAAACTGCCCACGGATGAGCAACATCCTGAATTGAAAAGTCGACGTGAAGCCGAGCTTAAGGAGAGGAGGGCACTGTATGAGTGGGGTAAAGATGAGGTGTATGAGGATCTGCCAGGCTACGTGAAAGCATCTGGAGTGAAGAATCTTCCCAAGGACGTGCAGTTTACCGAGGAAGCTGCCTATGACCTTCATCGAGCCAGGAAAAACGCACTCATCAACCTGGGTCTTGTGCACTTGTTGAATTTTTTTGACCAGTGGGACGACTTTGATGACTACTGCAAGGCATTCACTGGCTTTGTTGGAGAGGTTCCACTAGCTGCAAAATACTGGAAAGAAGACCGTTTCTTTGGATACCAGTTTCTCAATGGCTGTAATCCTGATTCAATCATGAGGTGCACAAAACTGCCACCCCACTTTCCTGTCACACAGGAGTTGGTTGGTAACCTGCTGGACAGTGGGGACACTTTAGAGAAAGCCATGGCGGATGGCCGTATTTACATGGTGGATTATAAGATCTTGGAGGACATCCCACACTACGGGCAGGACCGACCGGACCTAGAAAGGAGATACATGTGCGCAAGCCTAGGCCTGTTTTACGTGAAAGGTAACGGAGACCTGGTGCCGATAGCAGTTCAGTTCCATCAGGAACCACACCATGAAAACCCTATATGGACCCCAAATGATTCCGAGATGGACTGGACCTGTGCTAAGCTGTGGCTGCGTAACTCTGACACTCAATTCCATCAGATGGTCACCCATCTTCTCCGCACCCATCTCTTCATGGAACCCATTGCCGTTGCCAGCTATAGACAGCTACCCACAATCCACCCCGTTTGGAAATTGTTGGCCCCTCATATCCGAGGAGTTCTGGCCATCAACACTCTTGGCAGAGATGTCTTGATAGCGGAGGGAGGAGTGGCTGATAACACTTTGACTGTTGGCGGAGGAGGGCATGTCACCCTAATGAAGAAATTCTACAAGAGCAGCAGTACCTGGCCCTCGTACATCCTGCCACAAGTGCTAAAAGACAGAGGGGTGGATGATCCCGAAAAGCTACCCAACTTCCACTACCGCGAAGATTCTTTAAAACTCTGGGCAGCCATTGCAGACTTCGTCAAGGAGATCTTGTCTGACTATTACCACTCTGATGGTGAAGTGCAGAAGGACTACGAGCTTCAGAATTGGGTCAAAGATCTGCATGACGATGGCTATCCCAATAAACCAGGCCATACAAACCATGGTGTTCCGCCATCTTTTACCAGTTGCGTCCAGCTGTACGAATTCTTGACCTCCATCATCTTCACCTGCGCCTGCCAACACGCAGCAGTTAACTTCTCCCAGATGGACGTATACGGTTTCCCGCCAAATTCTCCAGCACTCATGCGTCAGCCACCACCGACCAAGAAAGGAGTTGTGGGGCAAGCAGACCTCATGAAATGCTTGGCTACCAAGCACCAATCTTCCCTCACCATTGCCACGGTGTACGACTTGACCCGTATTTTTAAAGACGAGAAATTCATTGGTGACTACCCAGAGGAGCTATTCATCGAAGAACCTGCCAAAGCCGCCATTGAAGTATTTCAGAGGAAACTGAAGGGCATATCCGCTGAAATTAAGGCGCGAAACGCAAAACTCTGTGTCCCGTACCCATATCTTTTGCCAGAGCAAATTCCAAATAGTATTGCCATCTGA
Here are the sequences for the upregulated genes of interest in upreg_subset_seqs.fasta
:
>Pocillopora_acuta_HIv2___RNAseq.g22884.t1
ATGAATGTGTTCAACCCAAATCGAAACATCCAGTTAACCAGGATAAAGAATTTTCTACTGGACAACGTTGGATATCAAGAATTTCTATCGCCTGGTGCAACGGTAGTGAATCCGCTGCAGCCTTCTCGCATCGATTCCCTAACGATGGGCTCCTCACAAATAGCTCCACGGAAAAGACTGCCAGACTGTCCTCAAGAAGAAAATACAGAAAATCGGGCAGCAAGCAACAAGCATCGAGCAAGAAAAAGAAATCGAGGTCCAAAACCAAAGAAAATGTACGACGGCAATGGACCAATTCAGCTGTGGCAACTAATTTTGTCAGAGCTCGTTTCTTCATCGTCAGAGCCTCTTGTGGAATGGACTAAGAAAGACAAATACGAATTTCGCATTTTGCAGCCTGATAAACTGGCGGCCCTATGGGGAGAGCAGAAGAAAAAGACCAATATGAATTTTGCAAAGCTTGCGCGAGGTTTGCGGTATTACTATGGAAAGTCCATTTTGGAAAAGGTTCGTGGCCAACAGTTTACCTATCAGTTTGTTATGGACATCGATGCAATTCTCGCGAACGATTCTGATGGCGAAGCTTCCGACGGTGGAGGCAGGAGCACGCCTGATGTTTTTTGTGAAGCACCGAGAACTCTGGAACAAAGGGAGGGGTATGGGGAAACAGAGACACAACTCACTAGTACAAGGGTGGGTGGATATGTGGGTGTATGGGGGGACTTTGGATGCCCCATGGAGGCATCGAGTGACACTGGGGGTGCTGAACAGGGTGTTATTGACATAGAGGGGATAGTGCAGAACGCAGGAGAGACATTTGGCTGTAAGGGGGAGAGTTCAAGGCTCCCTATGGGGATATCAAGGGATACTGAGGGAGGAAAAGAGGTTGCTATAGGCACTAAGGGGGTAGAATGGAACTTAGGGGAAGCTTTTAGGGGAACAGGAGAGGGAAAGGTGAGAAAATCGTGGGGCCAAGAGAATACAGGGATGGGATGTGATACCACAGATAAAGATTTTGGAATTGTGCCTGGAGGACTTGAAAGCCTCCCTGGGGGGTGTGGGGTCATTAAGGGGGAGCTAGGGGAGATAAAGTCATTCGATTTAAGTGATCTTATTGAATCCAATGACACAAGTAGCTTCTATGAAGCACTATAA
>Pocillopora_acuta_HIv2___TS.g23786.t1
ATGAATATGTCGTCAATGGAGCGTGAAAGCCATGTTGATCACTCAATGCAGAAACAACCTGAAACCTTTACCGAAGAATGTTTGAACCAAGTAGATCTTCAGAAGTCCGAAATGACCTCAGTCGCAAGAGAACCTCGTTACAAAAACATAGTCCACTTGTGGGAGTTTCTATTAGAGCTGTTGGCGAGCGATGGCTGCAAAGGGATCATTTGCTGGAGTAGAAAAGACCACAGGGAGTTCAGGTTGAACAACCCTCACGAGGTTGCTAAAAGATGGGGACGCTTAAAAGGGAAGACAGGAATGAACTATGAAAAACTCAGTCGAGCTTTGAGATACTACTATCAACAAGGAATTATCAAAAAGGTCCGTGGTCAAAGGCTCGTGTACAAGTTTAACAAACTTCCATATCGATACGAGCCTGGTGTAACAAGATCTCAACATCAATTAAAGAAAATAAATAAAAGCAACACCGAAGAACAAGATGAACATCAAGCACCATCTCCTCAGACAGTAACTGTGCCATCGCCTGTACCTTCAGCTTTCCTTCCACCAAGTCCAACAAGTCCCATTAGCCCCGTTATTACCCCACTCAGTAAAGATTGGTCATGGCCAGTTGTTCCCGTGCCTACTCGGCCGATGTTGTGGTATCAAGGCTCCTCGCTTCTTAAACCATCAGCAATCCTCATTGGCAGAAGCAGGATTATGGTTCCGGTCATGGATCCGTTGACGTCACTTCCATTAGGTTTTAAACCAATTCAGCCTACACCTTTTAATACGTCAATACCAGTTTCAGTTATTCAGCGAACCATTTAA
>Pocillopora_acuta_HIv2___TS.g26760.t1
ATGGCCAAACTTTGCCATGCGCTGTGTATTTCGTTGTACTTTGTAGGTGCATTTGCAAAATCCGTCGAAGATGGAAAAAAAAGCGGAAAACTTTCGGAACAGGAGCATTACAATAAGGACGGTCAACACAACACGGAATACGATCACGAAGCTTTCCTGGGAAAACAGAAGAAAACTTTTGATCAGCTTACTCTTGAAGAGTCCAAAGAAAGACTTGGGCTCCAACACAACTATCTGTTTCGCAGGAAAATTGTTGATAAAATAGACAAAGACAAAGATGGGAAAATTACTAAAGAAGAACTTGGAGAGTGGATTAAATTCACAAAGGATCAACACAATGAGGAAACTATTGATAAAAAGTGGAAAGATGTCATTGCAAGATTACAGAAAGTAATGTCCCGAAAGGATGCTTCATCTGCAAAAACTGTTGATCCTGATGGTGCGATCACATGGGAGGACCACAATGAAGTCAGCTATGGAGGAAAGCCAGAGGAGGAACTAGATGACATGTACAAGGGACAAGTAAAGATAGAAAAACGAAGATGGAAGATGGCAGATCTTGATCAAGATGGCAAACTGTCCAGAGAAGAATTCAGTGCCTTTTATCATCCATGGGAACATGAACAAACACATGATGCAGTTGTACAGGAAACTATTGAGGACATGGACAAAGATAAAGATGGAGTCTTATCACTCAAGGAGTACTTAGACGAAGTGCACCCTAGGGATGAAAAAGACCTGACAAAGGAACAAATGGCTCAGAAGAAAGCGGATGAGAACTATTTCCATACAAACCGTGACATAAATAAGGATGGTGTGATGGACAAGGAGGAAGTCAAGGAATGGATGTTTCCTTCAAATTATGATGCTGTCAAATCTGAAGTGTCTCATTTAATATACCATGCTGATACTGATAAGGATACAATGTTGACAAAAGAAGAAATTCTCAACAATCATAAGTATTTTGTTGGAAGCAAAGCAACAAATTATGGCAAAGATCTGACCAGACATGAGGAGTTTTAA
>Pocillopora_acuta_HIv2___RNAseq.g19477.t1
ATGGTTGCAATGGATGAGAATGACTCTGTATGGCCCCTTGCCCCACTCACTTTTGTAAACATGACCAGTTCTCTGAGTCAGAAATGTATAGTTATTGACTGTCGTTCCTTCTTATCATTTAACGTCGCCCACATCAAAGGATCGCTTAACGTTCACTGCCCTCCTATCTTAAAAAGAAGATTTCATCGTGGGTCATCAACGTTAGATTGTCTACTCAAATCTCCTGAGTTGAAACGAAGGGTTGCGGACGCAGAGACTTTAATTTTGTACGGGGAAGGAACTCAGGATTGGATTGACTTGGAGAAGGACAATACGATGAAGATACTCCACATGCTTTTGAGAAGAGAGAGAGTAGACAAGACTCTATATTTCATTAAAGGAGGATTTGAAAAATTCGCGTCGTCTTTCCCCTCCATGTGCTACTTTGCAAATCCCCATGCGGCATCACAAGCATGCTCCAGTCCTCTCGGATTAAAGCTCAAAACGAAAGATTTTAAAAGATTCAGCCCGAGAAACGAGATCGATCCTGTCAGGGATTATGCCGAGTCGCGCCCAAATGTGTCAGCCAAGCCAAATGAGCCTGTTGAAATCTTACCACATCTGTATCTTGGAAGCGAATTCCACTCGTCTCAAAAGGAGCTCCTTCAACACTTGGGTATCACTGCCATAGTCAATGTTTCAAGTAATATTCCGAACTTTTTTGAGGATACATTTGACTACAAGTCTATTCCGGTTGACGATACTTACACCGCAGATATCGGCCGATGGTTTGAGGAGGCAGCGATGTTTATAGATTCAGTGAAGAAATCGAAAGGACGGGTACTAGTACATTGCCAGGCTGGGATCTCAAGATCAGCCACTATTTGCCTTGCCTATCTTATAAGTAGACATCAACTCAGGTTGGACGAAGCTTATGAGTACGTTAAGAAGCGCCGTTCAGTTATATCACCAAACTTTAACTTCATGGGGCAACTGCTTAATTGGGAATCAGAGACTCAGCTTACAAATAGAGTATCGAGCACACACACACCCACCACTCCCTTTGGATTTTTCAGTTTTTCTCCGTTGCCATGTGGATCCGAAATGACTACCTCTGGTAACAAACAGAACTCACCTGGGCTCGTGACTTCACCCATGTAA
I’m going to blast these sequences against the BLAST nt database using similar code to the blastp I did above. In the scripts folder: nano blast_downreg.sh
#!/bin/bash
#SBATCH -t 40:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=125GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.13.0-gompi-2022a
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
echo "Blasting Pacuta downregulated genes of interest against remote nt database" $(date)
blastn -query downreg_subset_seqs.fasta -db nt -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6 -out downreg_subset_blast_results_Pacuta.txt
echo "Blast complete" $(date)
Submitted batch job 315195. In the scripts folder: nano blast_upreg.sh
#!/bin/bash
#SBATCH -t 40:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=125GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error
module load BLAST+/2.13.0-gompi-2022a
cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast
echo "Blasting Pacuta upregulated genes of interest against remote nt database" $(date)
blastn -query upreg_subset_seqs.fasta -db nt -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6 -out upreg_subset_blast_results_Pacuta.txt
echo "Blast complete" $(date)
Submitted batch job 315196