Pacuta HI 2022

Pacuta 2022 BLAST

These data came from the Pacuta 2022 experiment in Hawaii, done by Federica and myself. In this experiment, larval and spat Pocillopora acuta were subjected to a combination of high pH and temperature treatments. The github for that project is here.

A biomineralization gene list was created by FS (primarily from Stylophora pistillata). To see if any of the biomin genes are in this dataset, the biomin gene list was BLASTed against the Pacuta protein sequences. This was done by Zoe Dellaert in this code. Much of what is in this script comes from her code linked above.

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data
mkdir blast 
cd blast 

I copied the sequences in fasta format to Andromeda into /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast.

In the scripts folder: nano biomin_blast.sh

#!/bin/bash
#SBATCH --job-name="biomin_blast"
#SBATCH --nodes=1 --ntasks-per-node=20
#SBATCH -t 100:00:00
#SBATCH --export=NONE
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --mem=100GB
#SBATCH --error="blast_out_error"
#SBATCH --output="blast_out"
#SBATCH --account=putnamlab
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts            
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.9.0-iimpi-2019b

echo "Blasting Pacuta protein seqs against biomin genes" $(date)

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

makeblastdb -in /data/putnamlab/jillashey/genome/Pacuta/V2/Pocillopora_acuta_HIv2.genes.pep.faa -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -dbtype prot

blastp -query /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineraliztion_Toolkit_FScucchia_ZDrefmt.fasta -db /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineralization_blast_results.txt -outfmt 0

blastp -query /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineraliztion_Toolkit_FScucchia_ZDrefmt.fasta -db /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Pacuta_prot -out /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast/Biomineralization_blast_results_tab.txt -outfmt 6 -max_target_seqs 1

echo "Blast complete" $(date)

Submitted batch job 305440. Ran in ~45 mins. I copied the output files to this folder on github.

The result of the BLAST was a list of Pacuta proteins that correspondeed with specific biomineralization genes. In this script, I found that 158 out of 172 of the biomineralization genes are represented in the filtered gene counts. This corresponds to 84 unique Pacuta genes. I also found that 7 out of 172 of the biomineralization genes are represented in the DEGs. This corresponds to 5 unique Pacuta genes:

  • Pocillopora_acuta_HIv2___RNAseq.g25214.t1
  • Pocillopora_acuta_HIv2___RNAseq.g11609.t1
  • Pocillopora_acuta_HIv2___TS.g23498.t1
  • Pocillopora_acuta_HIv2___RNAseq.g7668.t1
  • Pocillopora_acuta_HIv2___RNAseq.g30830.t1

These are the 7 biomineralization genes:

  • PFX13778.1
  • P33_g8985
  • Gene:g38128
  • JT016638.1
  • P18_g810
  • XP_022780303.1
  • XP_022805470.1

In the script above, I then investigated where these 5 unique genes fell in the treatment comparisons.

In the high v control treatment comparison, these Pacuta biomineralization genes were differentially expressed:

  • Pocillopora_acuta_HIv2___TS.g23498.t1 - α-Collagen, coadhesion, clone g810 alpha collagen-like protein gene
  • Pocillopora_acuta_HIv2___RNAseq.g7668.t1 - uncharacterized protein
  • Pocillopora_acuta_HIv2___RNAseq.g30830.t1 - uncharacterized protein

Corals in high treatment are downregulating a collagen related gene compared to control corals (LFC = -3.39).

In the high v mid treatment comparison, these Pacuta biomineralization genes were differentially expressed:

  • Pocillopora_acuta_HIv2___RNAseq.g25214.t1 - Sacsin
  • Pocillopora_acuta_HIv2___RNAseq.g11609.t1 - Flagellar associated protein
  • Pocillopora_acuta_HIv2___TS.g23498.t1 - α-Collagen, coadhesion, clone g810 alpha collagen-like protein gene
  • Pocillopora_acuta_HIv2___RNAseq.g7668.t1 - uncharacterized protein

Similar to the high v control comparison, corals in high treatment are downregulating a collagen related gene (Pocillopora_acuta_HIv2__TS.g23498.t1) compared to mid corals (LFC = -2.99). Corals in high treatment are also downregulating sacsin (LFC = -1.56) and flagellar-associated protein (LFC = -0.83) compared to mid corals. High v control and high v mid also had differential expression in Pocillopora_acuta_HIv2__RNAseq.g7668.t1, an uncharacterized protein.

There were no DEGs between mid and control treatments related to biomineralization. This is not surprising, as there were only 4 total DEGs between mid and control treatments.

Sequence information

To make it easier for us to reference the specific biomineralization sequences, I will add the Pacuta and Spistallata sequences in this md file. Since there are only 7 sequences, it’s not too difficult.

In the high v control comparison:

  • Gene:g38128 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • JT016638.1 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • P18g810 corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
  • XP_022780303.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g7668.t1
  • XP_022805470.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g30830.t1

In the high v mid comparison:

  • PFX13778.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g25214.t1
  • P33g8985 corresponds to Pocillopora_acuta_HIv2__RNAseq.g11609.t1
  • Gene:g38128 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • JT016638.1 corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • P18g810 corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
  • XP_022780303.1 corresponds to Pocillopora_acuta_HIv2___RNAseq.g7668.t1

Shared between high v control and high v mid:

  • Gene:g38128, which corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • JT016638.1, which corresponds to Pocillopora_acuta_HIv2___TS.g23498.t1
  • P18g810, which corresponds to Pocillopora_acuta_HIv2__TS.g23498.t1
Sequences for 7 Stylophora pistillata biomineralization genes

Fasta file for these sequences is here

>PFX13778.1
MASSEASAGASSLQEQKEQRQIDVMFLCDEWRSLKGGLSTFNREFAINLAEARIGNMKIHCYVSKTDPLLCLMSPPSELPNPHLVIGHGRKLGSPAFCLVQNTKCKWIQFVHVCCEDLGKYKKTVAATDTIEENEKKHKMEIECCKAADAVVAVGSRLQQKYSRSLPQVIVEIITPGIFERFSCESQSAMHRSVKNFNVFLFGRATFEDLSLKGYDIVANAIGSLSKNFELTFVGSSPGKHQKVEKWFLDNTRIDRNQLTIRGYCSNPEELKMMFYQSDLVALPSRTEGFGLVALEAISAGVPVLVSGESGVAEALKEVEEEREANAMMLRENYRKVYSWRKECERFRRIIENVMKDGELNINVDVEDVKPKEPNRQITTLATKSKGSQEDQYQRASATLPTEDSVSHSGKEDLQELKDRVLCSIAMNYLRTTPPQSIEEHNKFMEYLEKMKVLIRGFSLGSLVITVKCESLQILEELWTDYSSGHLGKVVQNCFATEKILKELNLAELKLKTTMDIEEYNARKVYFEKRMNKGDDLSFQSHFISSEDERHLEERKRKLEVGELEKRKDDLRSQSPFISSEDQPHLEKRKQKIEVGKSEKTKVVKGRSYGQRRPPLTHILKNILERYPDGGQILKEIIQNADDAGATKVSFLLDSRQGFYGENSLVESTLAQFQGPALYAQNNALFVESDWENLQRLMDSSKKDDPLKVGRFGIGFNSVYHITALKKEAPVILLFLKNIEEIALFETDVRDVQRHVFTVRLSDGCRQEVGEKKQKFLSDLRRLSDGQIENVDLSLDLHVVEVTEGERVVENKWLVYHQVDARNPTLKKLSLELGLLPWVGCATSVNAANLLAISSSAGRIFCFLPLPPDADSKTGLPLHVHGYFGLSDNRRGLKWPGPDCQNDPTAEWNVSLVQHVASKAYANVLLHLQESCDSSVGADFVYKSWPNIQEVEKHWRCILEPMFSILLKENVLWTTANHGQWKNVSNAYLDRIKSQFQNTTNETREVVLETLTQTNEAVVIVPTHVMIAIDNYTPAPPKSITPTFLRALLKKKGEGILEDVPKKKKLLLLEFALADKNLNDMHGVPLLPLANGGFVNFHSLQYNREPGAAVYVSSTATPRSIFHNMDNKFLDDSVQTPAITYLSKLASDANSPHTIQPVQLVALNQTIAVKVLREMLPSEWSGENHVVPWYPGKNGHPPEHWLESVWMWIQRIFPTDLSLLENLPLIPHTCAGNQSLVKLSSSSVVIGRHHHQSISLPPHIVSLLRKIGCILLENLPSYIYHNTLHKYVTTPEPRGVLKIFSTLGQSRCASAIPFCSPNEKRALRSFLSSVDFNVDEKTLLYNLPIFDAADGNSFIAVRNGFQVHEVLPYGFEVPQPLPIPRASSVIALRDSDSHTLIQGLEISPMTKTTFLKNIVFSGIQSNFYSHQQIPALIGWILSQYSIFCREDSSFHVSLQQLAFVSTRTNKLVTPCCVLDPQQPILEQLFENENDKFPNGDFVKDEILLPLRQLGMRSIPNKEDILHVAKTIESVHSDVGFRKALALLEFLDKNPPDKSLGRALMNERWVPRKQSPTSSYPSAMPWFLGTKQFYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWNKSPPVHHLLNQLRFACSVPLNDVNGSALCHFQAMVKQIYEEVSTNGSFIFAVSQDNSFPEWIWHGNGFTSPSKIAFTSSCRIDLKPYLYLVPQEFQHLNLFFQQCGVRDTFQDCDFLHVLTMIKDKHDTSSEYLEDVAVDQKLSHEILHWIVRDGKKLNPELRESLLVPIHTWDSTLKLVPCSECTFCDADWLRQGGSELLIGSKFTMIHEAISSKTAALLGVPPISTRVTCAEALGIEQTGPYEPITTRLKNILNEYKEGDGVFKELVQNADDAGATEVQFVLDWRNHPHQRLLSPGMVECQGPALLAYNNTTFTDDDLKNISRLAGATKREDLEKIGRFGLGFSSVYHFTDVPSIITRSYAVFFDPQTTHLGPHIHDASKPGIKIDLGVHENSLICFPDQFAPYNGLFGCDTATPSDNDTFYFEGTLFRFAFRTKRGDISDKIYTKQEIRSLIFSFRESSPILLLFSQNIKKVSFLELDENSSNSKDPRLLFEISRKPQTGCMPVKNSVTESTFLKSCAAWTRQSTSWSNPVDRFPSPKLTELISVCSTICHMNGQRSQETQSWLVTSCLGTENSFKLATSEEGKKQGLLSASGVAAKMFTKGDGMEKVQAVLGEVFCFLPLSIPTGLPVHTNGYFAVTSNRRSIWEGTTAEVGCQPLEVRWNQSLMEDALTQAYVQLLENMTVMQTQGKIPMYHAFTLWPNPDKLHSSAWEPLIKSFYQRVASDVDLPLVCTGGKWLPVTQCIYQDLKLWELPNSEVILENLDYKIVQLPDFARKGFQQAGCMEIIIQRTMTQEKFLQEVFFPKIAMISKELRDAIVCYLIDECLRGHSSNQSSQLLNLYKSLLSNNRCIPCGPDAGNLAFPKELISPKGTAATLFLADDRRFPVGSCYQTKERLLMLENLGMLSHILDWETLIERANSVSVLCRIAQQDAKERSASLIKYINVHLEEMDHPSELKREELMAISMFPTLAKPANYVMQWRGTDDGNSVMLPAKEMYEERYTFVAGSSRPILDESDSCGCSKLSKKTRDLFGFSSRKPSTQEVLNQLEHTVQAIVHAPHAIAGLEQAFHCLYGYIQEIIAEPDGERIIETLQEKEWVLVRGKCLSASRLAFTWKRFGAPYLNEIPQNLASRYRSLFEAAGVKEHFSTEDVISALYRLSEEKKGEPLSKSEFTVSKSLIEEISGASENSLKKERGKIPLPDHNRCLKPAEQLAINDAPWVTARSGIEYVHRDLSIDLAHRFGAIDIRTKKLARISRPIGREFGQREDLTDRLKGILKSYPCDVGVLKELVQNADDAGATEVHFIFDPRYHNTDQLLCNNWKELQGPALCVYNDRPFSEKDLQGIQRLGIGSKTDDPTKTGQYGIGFNAVYHLTDCPSFISNGDTLCILDPHCRYAPGADKENPGRLIEPISEEERSDFRDMFPCYLEDMFDLKSSTMFRFPLRLQSTCTESLISEQRISCTQMSTFMNHLAVEAKEIILFLNHVKKISLSEIKDDQLKEIHSVSVQITKEDDAERLKLANHVKNCKFLITNEIQWFGITYPLFVQEARLRQEEWLIHQCIGIQKRDGEEIPNGRDYGLLPRGGIAAKVSEKSKFSSYEVGGPTHKAFCFLPLPVPTGLPFHVNGHFFLDSARRNLWSDEKGEGFASQWNHFIKCKVLAQAYISLMLVARGYLPGSKNGDTTSFSKGFKVHEGMRWYHNLFPHYENVQSQWRDLAKAVFNKICYDDAKLLPLIKTPGKNSPTTQSTEAAKSIQPPNDAQATKGDFDCKEAISCLWVSPSQGYFNTLSLSDEWARDLSNVLINIGFKLIYSSKKIFGNFKTAGANVREITPEEVINFLGGNPNSVGILPCPVGETTVGSVANVLLLLRYCMKATTFPKQIFGIPLLLTEDDVLRQFQRDNQVFLSLFADLIPNQRARFVQHALATPLFRFQKEITYGDQGVLKKFDICALASLLPSTVKGGWCETNSLVPWDLENGPSKQWLKLLWEFLFKTYEKEPDTFSLTPLHKWPIIPTKLKELAPISKSKVIFDLTTSDSWSCGQKTVVALLRKLRCPEVDVDLLCNDGRWDLSPILKQHLSYPNSSQDILKVLDHMIGERGRIFELKENLTEAKSCFDSAIFINQNHVASIHHLITTLYHKLGNLVMAEKVLREGINIDLTAFEACDKSKRRSFPSCHRRCSNDKMAERPRKTVKGRSYGQRRPPLTHILKSILERYPDGGQILKEIIQNADDAGATKVSFLLDSREGFYGENSLVEPTLAQFQGSALYAQNNALFQESDWENLQRLMDSSKKDDPLKVGRFGIGFNSVYHITDLPSIVSGDSVVFLDPHETHFGRGETGQRFSLEDELLEIHEDQFKPYENVLDCKISTQFYNGTLFRFPLRSAPSDLSKKVYSKEKVRKLFQALKEEAPVILLFLKNIEEIALFETDERGVQNHVFTVRLSDSCREEVREKKIKFLSDLRRLTNGQIENVNLSLDLHVVEVTEGERVAENRWLVYHQVDAQNSTLKKLSSELGLLPWVGCATPVNAAKLQALSSSTGRIFCFLPLPPDADSRTGLPVHVHGYFGLTDNRRGLKWPGLDCQDDPTAEWNVSLVQHVASKAYANALLCLRELCDSSDGADFVYKSWPNIQDVEKHWQCMLKPMFSILLTKNVLWTRANGGQWKTLSDSYLDKVKSQFQNVTNETRCVVLETLTQANEAVVIVPSHVMIAIDKYTPVPTKSVTPAFLRALLKKKNKGVWNISNVPRNKKLLLLEFALEDKNLSDMHGVPLLPLADGSFIDFRSLQYNREPAAAVYVSSTSSPRSIFHNMDNKFLVDNVQAPAITYLSKIALDVSNPHTTQPVQLVKLNQTIAVRVLREMLPSEWSGGNHSAPWHPGKNGHPPEQWLESVWMWIQRMFPADLSLLENLPLIPHTCAGNRSIVKLSSSSVIIRRYHHQSVSLPSLIVSLLGKIGCVVLENLPSYIHHNNLHRYVATPDPHGVLKIFCTLGQSRCTSTISLCSPDEKRALRSFLSSAYFNGDEKSLICNLPVFDAVDGNSFIAVRNGFEFHEVSPHGFEVPRPLSIPRASSVIALKDTVSQTLIQRLGISPMTKTTFLRNIVFGGIQNNFYSRQQLSTLMHWVLSQYPLLCMEDSSFHAALQQLPFVITRSNKVVTPCCVLDPQQPVLEHLFENENDKFPHGDFVKDEILLRLRQLGMRSRPNTEDILHVAKTVDCVHSDVGSRKASALLEFLDQNPPDKSLGQALMNERWVPRKQSRPPSYPQAMPWFSETNHFYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWNKSPPVHCLLKQLRSACSVRLNDMNGSALYHFQAMVKQIYEEASTSASFIFSVSQDNSFPEWIWHGTGFSSPSKIAFASCCKIDLKPYLYIVPQEFRHLNFFFQQCGVRNTFQDSDLLHVLTMIKDKHDTGSEYQGDVAVDRKLSHEILHWMVREGEKLDPELRESVLVPVQTRDNTFKLVPCSECTFCDADWLRKGGSELLIESKFTMIHEAISSKTADLLGVPPISTRVTCAEALGIEQTGPYEPITTRLKNILNEYKEGVGVFKELVQNADDAGATEVQFVLDWRAHPHQRLFSQGMVECQGPALLAYNNATFTDDDLKNISRLAGATKREDLEKIGRFGLGFSSVYHFTDVPSFITRSYAVFFDPQRTHLGHHIHDASKPGIKIDLAVNENSFICFPDQFAPFYGLFGCDTAPPSDNDKFYFEGTLFRFAFRTKRGEISDKIYTRQEIRSLMFSFRESSPILLLFSQNVKKVSFLEVDENATDSKDLRLLFEISRKPQTDFTPVKKSVTEGTFLKSCAVWTRQSTSQSNPVDIYTSPKLTELISVCSNICRMNGQRSQETQSWLVISCLGTGNSFQLATSEEGKKQGLLSASGVAAKICTQGDGLQKVEAVPGEVFCFLPLSIPTGLPVHANGYFAVTSNRRGIWESTTADVGRQPLEVQWNRSLMEDALTQAYVQLLQSMTVIQTEGKILSYDVFALWPNPDKLQSSAWKPLIKSFYRRIASDVELPLVCAGGNWLPVTQCIYQDFKLRELPKSEMILEKFDYKIVQLPDFARKGFEQAGCMEVINQRTMTPEMFLRDVFFPNIKTISKELRDPVVCHLIDECLRGHASKRSFPHLNLYESLLSTNRCIPCGPETRDLSFPKDLISPKGAAATLFSAEDKRFPVGSCYQTKERLLVLQNLGMISDILDWEILIERANSVSVLCRRAKQDAKKRSALLIKYINEHIEKMAHPSELNREELKAISMFPSLAKPANYVMPWRGSGDCNSGMLPAEEMYDERYKYVAGTSRPILDESESGCNKLSKKTRHLFGFSSRKPSTQEVLNQLEYTVQATIQSPHAIESLEQIFHCIYEYLQDLVLEPDGKRITHALQEKKWILVQGNCLSASRLAFVWKRCGEPYLNELPQNLASKYRSLFKAMGVKEYFSTEEVISALYKLDEEKQGERLSTREFKVSKSLLEEISEASEEFFGTERGRIPLPDHNLILQPAEKLAINDAPWVAPRSGIDYVHKDLSIDLAHKLGAIDIRTKKLSRISRPIGREFGQREELTDRLKGILKAYPCDVGVLKELVQNADDAGATEVHFIVDPRNHPTDQLLSANWKELQGPALCVYNNRPFSEDDLEGIQRLGIGSKTDDPTKTGQYGIGFNAVYHLTDCPSFITNGDKLCILDPHCRYAPEATKGNPGRLIGPIGAEERSDFRDVFPCYLENLFDLESATMFRFPLRRQTTSSISQKQVSCTEMMKFMNLLAYEAKEIILFLNHVKTITLSEIKENQLKKIYSVSAQLTQHDEAQRVRLANHIKISKTLETNQIEWLGITYPLLIQEHGLRQEKWLIHQCIGLQTSTSEEVPNGARFGLLPRGGIAAKVSEKSEKSKFHFNTKSEPRHKVFCFLPLPVTTGLPVHVNGHFYLDSARRNLWRDEKEEGFQYIPMATLQSLQTAEAYAQLLENMIVMQTLGKLPLYGVFTLWPNPEKLQSSAWEPLIKSFYRRIASDVDLPLVSTGGKWLPVTQCIYQDLKLQELPKSEMVLQKFDYKIVQLPDFARKGFQQAGCMEVINQRTMTQEMFLRNVFFPNITRISKELRDPIVCYLIDECLRGHASKRSNPHLKLYESLLSTNRCIPCGPETGDLTFPKDLISPNGAAATLFSADDKRFPVGSCYQTKERLLVLQNLGMISDILDWETLIERANSVSVLCRRAKQDAKKRSALLIKYINGHIEKMAHPSEFNREELMAISMFPSLAKPAKYVIPWRGSGDYKSVMLPAEEMYDERYKYVAGTSRPILDESESGCSKLSEKTRHLFGFSSRKPSSQEVLNQLEHTVQATVQSPHAIESLEEIFHCIYEYLQELVLEPGGKCITHALKEKKWILVQGNCLSASRLAFAWKRCGEPYLNEVPQNLASKYRSLFKATGVKEYFSTEDVISALYKLHEEKQGERLSTKEFTVSKSLIEEISEASEESFETEKGRIPLPDHNLILQTAEKLAINDAPWVAPRSGIDYVHKDLSIDLAHRLGAIDIRTKKLSRISRPIGHEFGQREELTDRLKRILKAYPCDVGVLKELVQNADDARATEIHFIVDPRNHPTDQLLSDNWKELQGPALCVYNNRPFSEDDLEGIQRLGIGSKTDDPTKTATKENPGRLIGPIGAEERSDFRDVFPCYLENLFDLRSATMFRFPLRRQSTSSISQKQVSCTEMMKFMNLLAYEAKEIILFLNHVKTITLSEIKENQLKKIYSVSAQLTQHDEAQRVRLANHIKISKTLETNHIEWLGITYPLLIQEHGLRQEKWLIHQCIGLQTSTSEEVPNGARFGLLPRGGIAAKVSEKSEKSKFHFNTKSEPRHKVFCFLPLPVTTGLPVHVNGHFYLDSARRNLWRDEKEEGIGSYWKQFIKTKLLSQAYISLMLVARGHLPGSKEEDVACFLRDHNLHEGMRWYHNLFPHFKHVESQWKDLATAVFKMICSEDASLLPLTKKTSDKTMEASQVPQAAAVVQVKESDRREVIRCFWLPPSQGFFNNLAPGSESQIELWKILLRIGFKLLYSSATLYRDFKEAGTNVREITPEFVIQFLKENPSSIGNLPCPVEETTLGTVKGVLLLLSHCMKAKKFSSEMFEMPLLLTEDNVLRIFETDSQVFLSLFADLVPTQCSQFIQHTLAIALLHFEEEIFSSGQSVLKRFDVSALASLLPRTANAGWCETDSARRNLWRDEKEEGFGSQWNHFIKTKVLSQAYISLMLAARSHLPGSKEEDGASFPRKHNLHEGMRWYHNLFPNFKSVESQWKVLAEAVFKMICSEDANLLPLTKKTSDKRVEASQLPQAAAGVQVNESDPVIRCFWLPPSQGFFNNLTLGSESQIEQWKILLRIGFKLLYSSATLYKDFKEAGTNVREITPEFVIQFLKENPSSIGNLPCPVEETTLGSVRGVLLLLSYCMEATKFPREMFGLPLLLTEDNVLRRFKTDSQVFLSLFADLVPTQCFQFIQHTLATALLNFEEEIFASDQSVLKKFDISSLASLLPSTANADWRETSDLIPWNMNEQPSKIWMQRLWEFLHKTQQKTPKAFSLDPLHNWPILPTKSGKLAPVFKGKVILDLTPSGSWSPGQEHVATLLKKLKCPEVNVDLISGDGRWNVSDILKSRVSYPNSSQDVLKVLDHLMKEGDISNILFDDEKICMLQFFQDDLTTVKQDRTSTSIVKRLPFFKTFHGAFVSLGNVKSVYEIPVGLPTDESDVWMTGNNCVFLAPEPRLSRMYKDLLGVGDKSHTDCYINFIFPKFPLLQNETRMLHLEYVRRFLLAPYCNEEQQARVLKSLSTLAFIPDANGALQTAYYFHDPGVKVFSVMLPREAKPPEQFNTTKWLELLRKIGLKQKISKAQFQTFANEVASQAVQISNSSYPSLEKKSKTLVEHLLRDDTLHDAKFLADLSPIKFLACANASDNLSLLHRQHLVPSRDQNPPFNQFKHSIPHAHEALAWTTATLLFEWAIPNPKVPLLTNLQVLQKPSLEQVIGHVKNLSQTLSRRADREQPEPKRRRLSQIMTEIYKFLTDTSGCNGTDCNELCTAVCNKICNQLSDIPCILVEDGRVFVRSNQLAFHLDEEQPPYLYKVPREYGIFEHLLKRLGAMEKATPSQFAQVLTRLRESCEDKQMHANELTVVKQAVFGLFTTLHAILCRNEDRRERNPLAEVNTLYLPNSKQELRPSTDLVLFDCTRFKRRLSDSMFEFLDDVTNYNLTMEKPGKLVALLPDHLKTKSLSSLVREELQAECRGKRCQADMQRKCEATDRIRHILYSPDLVNGILRILKFQYDKTKLTEEVRSKVHSFQKSLSISCMETLSTELVDNRTNTVIPNSQMRTHSGCFVGQDNGKKHIFIQHGAKSSDTRRKICHEIYSLTGCFLEEENILHLAAILECTSPANISIVLDNAGVSDDAEATKTPSLEPALGSEVPEEFHELLDQYSDFYFRPGEFVAYEREDSTEEELKYIYAKIVRRVKTSTSTKVKKDRTKRKQKEESNLLSRYLIDIGPERKEVDVLDLYKFRRPRKSEEEEADEEESLSKSMEVVPYAGASGQSTGQAGAESAGPSRGASEPPKPRTLEHALKEVKKTLAEIWKLPEDKRKKAIRRLYLRWHPDKNMDMQDIANEVTKFIQNEVDRLSKGKSSSRDEGGARPPPADFSEFFTRCNERARRQRASYYNFRRHNPRFTGFRSHSRRTYTAPDPRVAKMWIRQSKEDLRSVKHLLSSRDPLYYLVCFQCHQIAEKSLKAALYALAGVADRQLNSHDLVLLAHDLSLLPGAPDVTAQVARLSNYFEGTRYPNKHEPAKVPAEVFQDSQEAQEAFRLATEVLEELERENRGVARTFQRRGGGDHSVSRREYSPDFQVDKHAVFYRMWRKRHDIVISFSPPDYRSSSVDYNEEKVFLYQFEVNRFHKTNTNKITS

>P33_g8985
MTRLGDTFIRQLHEKGEDFTNLWAFSEFLESVDKSGVPDTKTIPVGLKKMKELGIRNAIIEMDLVYAGINYKKFKVEAINELLTERLRWVHANLAKDSKVVVNFRDLPDGMIKRPKRVFKVIRHLSSLPLDIRPFGIVTEESGKYFPEQLAAWISAVRREMDDCGFKDGHLLVHVHEKWGLVDSTQLSCLANGANGIWASMIIQGASMGAASSTVTLMNLVRLGNKKVLKKYNCTALREAAQEICRATTGQEPYPLQPIYGERALDMVFGMPTKLGINEFDLAKFFGEKPLMRMTTLASAEMIITRLKNIFGEDPQFTIERGTRMKEVMLEDLHKNIKEEYMSAAGLAQLFDRSGGQLTGKMADVLAQDEPRKAHAQVLIAEVRAMWDEWDLRDGKRDDQLEFDAFYNGFLAPYFGCYRCDETKQALKAMDMDEDGTVDWNEFAVYLKWAMRQYPETKTAEDLLSVAFRKGLIPAMQDE

>Gene:g38128 Annotated: α-Collagen Blast E.value:0, MS/MS SeqCoverage 42%
MAGGLSGIHGESAALHVEVENDQEHEPVLIHHHLVTDQDVLAALWRLNNAKHNRVQWMANFLRGLNGNNAARRVEEELKPGPGGVIVLLLPLVGRPVLEILSRLEIVAPMSAQLMVYGQTGKVGRSVAGHAVAENNQEYVSVTAHGRPMVGNVVLATIQRQGSAKPRPVQLMADGLNGPQGHAMFNKANAYTTLAATPTPTATAVPTPTIDPNIPKIDLVFAISATSVSSSRSYELMKNTIKRFIDRYGVNSIHYSIIVYGDQVVRVINFNRTFPPSANELKTAIDNQLALSGGPVLINALQEAYRVFKESVGRPGAKRVLVVIADENSGSSPSFLSRAVRPLEDLGVLVISVGVGDRISRSELNIISPNLLDVISARLNINPSLLAVRIMERILRLNFPDVDVGFAISAASANSDEIFSLMKQIINTIIDRYGVSKVRFSFIIYGSRVTTRFTFDNAPITQEELIKAVNGTKKVTGDPDLEKALEEAEKLFTKSSRPNATKVFVVLTDFVGAGDDNSLIANAVRLRKSGVLILSVGFGQQVNAIGNQMTKVVITQSDYIRVPDFTTQRPVVIAETIMFKALQANIPEIDLTFVISATSNSADRTFTLMKSTINSIIEKYGIVRIHYTVIVFGSDFTRSFDFSTSVPNKETLTRLVTQLQRESGTPDLVRALEEVKKVYELREVRPNAKKVVVVILDQKSVNTEVQLKTAVTDLVERNILVIGVGVGRSVDRNQLIYITEENRDIIEVEPTERPEEVAREIMLIILRSEIYSIHS

>JT016638.1
QGNYYSYGGTTPGTPIGCTNLITLSNVKFFASSSSDGPDIPVLNSTDYWCSEFNWKNQSLTVDLGFVTFFDRLLVQGEPFTSRSVSEYFVLTSIDGINYTYILGTNGQSMKFVGPLFNGDQTRDTNLTAPVQARYVQFNPQEPMIAEDDSICMRVGVESCQLVPAAVNGAWSHWSPYGPCTHACLGTAKRTRTCADPAPVFGGSPCEGVNEEEKICNDCVGTVNGGWSPWGLWSRCSTTCNPGQRSRQRTCTNPSPKNGGTDCSGPSTQSEPCQVQFCPVDGGWSAWSGLSRCTRACGGGRQYQSRTCSNPFPGHGGRDCVGVRSLSFTCNTQCCPVHGGWSPWGSFSSCTRTCGGGQKSRTRVCNSPAPSCNGITCPGGNQDIQPCNQQTCPTSPSTSFPINGNYSNWGQWTACSVTCGQGTRERTRLCDNPAPAQGGSQCQGPSSELVGCTEIPCPVNGNWSSWGDWSNCSSGCGPGKSYRYRDCDNPAPANNGLNCTGPDQESKDCNSTACPVDGGWSAWSSTPCSATCGQGTLKRTRECNNPKPQYGGASCFGNETEQEVACNKGPCPTSPPTISPPTTGSPADSNIPELDLVFAVSATSSNRLATYNSMRDTINRFITTYGSNKVHYSIIVYGKAVQRVISFNHTFPPSVGELQEAISRHAPISGPTVLKNALQETQTIFQEIPSRPNAKKVLVVFTDSNSPSDGNLVQAVRPLENNKILVVSVGVGDVNRTELLTISPNPLDVLSVQPTAGPGALSKRIMDRILRRDIPLIDIGFALSATSSDFQDIFVKMKNVIRTIVERYGVERVKFSLIVYGQNVTTVLGDFNRNLTQADLVNYVNNLQRVPQNKNLDSALLEAESLFRQRARPNSKKVFVVLTDGVSTLSNANSLLINTAELRKSDVLILSVGFGSQTNQVGNQMNSVVFAPRDYIAVPNYPAERDVVIAETIMFKALEVNLPLIDLTFALSSSSILSQETFKLMKETVQSLVHTYGIDRIHYGVIVFGSVATRSFDFATNFPDQNELIRKVSQLTRSGGSPDLVAALKEARKVFQLKEVRPYARKVLVVMIDDESSANKNDLNEEVRALRNRSVLVIGVGIGTQTLPKDLGIITDDKRNTLKAGINKNRDELAREIISIILRPSGLSKWSSWSACSKTCRYLGKAGTQIRTRDCKIPELGCDGMRIDTVECNKMDCEGCGQRGPLNESAYTASSNSESPAFLAALNTSDPTAWCLINNENGGYVQLDLGELTRVYKVATKGEQQGDRWVTSYYLTLSEDGETFFDYKAAQRLSGNTDSTSVAFNVVNTTRPYRYVRFHPVNFKGEPCMQAAVFGCNEEKILPPPETIADQADAAKGILIVLWILAGILTFLLLMACCYYCCWHVCCGRGKKRKGLVYRERSIEDDGYLINDEKRWTLGSAPMTPVPRVREDEIQEVTIEMKEDNEQPLGVIQFGIETDETKEKHVTAEDVKSEKPKYSEEASSGTIKSGSTMMRMKANDGSDRRKRTKSEGDAIDAVDGDLDWSYLSDEQGTAFTNEAFVKSQEQFLEPPGSASFRGNKVDMRRSLSADELATLDYDLFEDRQGPLHTATLGRDGYMRMHKANQGSLPPSDGGREMGTVDVAIGGIRVPNSPKDDPIYDTAGQEIHLAVEQAGRSVYPLEDGGYRGEEWYSRWG

>P18_g810
LMEAGVIGSHGQAALRRAEAVHRRASALATIHHHNMAASSVPEAIQIVNLAALKVAQLMEDGLIGHNGLPVTKRVVVATPIGEGNVQTQYHRTVVKHAQVTKMNIGVAILRDAQSVAVGVRGVSGQAAVRLAMDNEQGTAHALILHHPTMEPLAPGQEYNLKRAMWGSVQMAAGQPGVNGRPVQNRVVEEHKGGQDLAPTLHPHMAERSVWVAKRSPSSAKNKHVQWMAGGLSGIHGESAALHVEVENDQEHEPVLIHHHLVTDQDVLAALWRLNNAKHNRVQWMANFLRGLNGNNAARRVEEELKPGPGGVIVLLLPLVGRPVLEILSRLEIVAPMSAQLMVYGQTGKVGRSVAGHAVAENNQEYVSVTAHGRPMVGNVVLATIQRQGSAKPRPVQLMADGLNGPQGHAMLYVAMENATEQEPNAYTTLAATPTPTATAVPTPTIDPNIPKIDLVFAISATSVSSSRSYELMKNTIKRFIDRYGVNSIHYSIIVYGDQVVRVINFNRTFPPSANELKTAIDNQLALSGGPVLINALQEAYRVFKESVGRPGAKRVLVVIADENSGSSPSFLSRAVRPLEDLGVLVISVGVGDRISRSELNIISPNLLDVISARLNINPSLLAVRIMERILRLNFPDVDVGFAISAASANSDEIFSLMKQIINTIIDRYGVSKVRFSFIIYGSRVTTRFTFDNAPITQEELIKAVNGTKKVTGDPDLEKALEEAEKLFTKSSRPNATKVFVVLTDFVGAGDDNSLIANAVRLRKSGVLILSVGFGQQVNAIGNQMTKVVITQSDYIRVPDFTTQRPVVIAETIMFKALQGKVHVYECAAYNHA

>XP_022780303.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111321626 [Stylophora pistillata]
MTVTSSCSIRSPLCHNPCYSCRDYSIANLFDQTFSTSSCAGYFLSGCRYTKLTVDLKSLLFINHIRFYPYCSGSRPNFRVTTNDGVRDLPVTSNYVQMSNGCIQGSYVDMPVRRKATHIYVELNHPTSNTYVGLSELEVFIDGKEVWDRYKYRSFDGAKTWIEPETGADVYSFNEVHIRGSAQLALMPLNGLQGPAHFHADRLYGDKSGFLHVGYNQNFSVAVTDPDIPFGLRVYENGSIMLPRRAFLQTVSFKSSGKIWGVQDLFVFDHGTFYGDSNSSLGRDTVPGEYSVQSLHVQDRGVFELHSTDKELASRLSLTNLTIFGGGHFKSNNKMHIAVKHLVRINSGGRLSHNHAGYITKEGRSGDEFESSEGPGQGIGSVHGASGAGFAGTGGRGTGMGLVGQFYGDYRRPDDYGSAGGFGLHYGNLFYGDYRSSSVGPYKNLITRSGLGGQGGGAIKIVTRHLILDGHLSADGEDGPPPSTAGGGSGGSIWIDCEELDGYGTISANGGAGSPSKGGGGSGGRIAIYQVFMLNFNGTLSAKGGNSAVEPGASGTVFLETRNNSKVEYRVLKINNFGLAYPWAVDKSQGRLRNLMKGIYSETKYVGAVTWLHEADKYTFDEFHLHGNSHVALYGNGSLENVTLYAHTLRGDRSGVFHIGRFQSVVFDFVDLYFPINTLVYFNATLEVPRRLSLREVYMEINGTLADSDDYTIDRDGKLFLWSGGQSLGEKQGHFRFINMSIKSLGLLHTTKLKGHGPVSLHTTRFVVNAGGLASVDDFYLDSVNATIDVAGDVSADFRGYGAEQGPGAGVRRTPYNTGAGGGSHGGRGGRGTAGLQTSFSYGSIYEPTQFGSGGGNGLHGTVGGRGGGRIVFEISRMLRLEGRVHADGEPGNKNSDPSGGGAGGSIVIHAFKFDGEGTISVNGGSYPFTYAPYGGGGAGGRIAVYYNGSYTFIGSLQSYGGTSQAEWGGAGTVYIQNNQNTSQPYSILRIDNRAILSGPSRLNEIEELDLAGNNSKVCDTDNSRMSNLFKSTSNSYYTQNSNPVVTYSFPLPLFLENLLIYPQCNSHYLTRFYVREYFNNIEVIGSNEWINPTNCLQGQPLRMNVRQAVEKVEISLQKTSWYASLSLVRFFVRENPSTTLQTPHYTSPSTSWIVIEDEKTTEHDFSELQIMGRGSLSISGNTFNMRVDKIVGDNSGILTMRPTQSILMRGLEGHLPFSLLSQKGSSVAFPTSMNCREVEVTIRGVMGEMQNLTVGPQCRFVLENSSETEFMLEHMVVQTDGYMAALREDREDVKMVGKTFDIRGGAKMEANSLTLDYVNITIEPFADLSSNGLVDEVPGSRYHGDGWGGSSGSSGAGHGGHGGVGGGQSKVGVSYGKYKRPTTFGSTGGAQVFPFTGGLGGGRFKIIAHDTLVVDGVLSSKGGDARASRSGGGSGGSILAYASRIHGDGEFDVSGGDGDSSTGYRGGGGGAGRICLYYRENHFLGRFLGFGGTSSNEAGGPGTVFLENVPGMNATYGHDRIDEAAHAERILPDENVVNGTQWVRNRTLYANAMGRKPRSPDANLSSSYHDFSVGGSSRVWLILDDEDLEANGTDVELDELQLYGGAQLAIINPVNTKAFISIVIGQMEGDRTGRIHLGFNQTFLSLQSYLPMDMIIYQGGLTTMQGELLVAGVTVEIDGVLRRCQNITVVDDGVIRMKEMYDLEGKPTETFYFEAINVRKKGTMMVTNQERVREFRGTSMQVFGGGTFTAVNLQVHVVNFTIDALAKVHATLEGEKFDYEGKGPGAKYPSPGFAGGSGGGHGGLGGRSTSQVTTGAAYGSIKEPTEFGSSGGQGSSGAHGGRGGGIVFLNISDTLDIEGTLSVDGANNGGSEAGGGSAGSILIRTVLLEGSGTIQANGGNGYTGSSRGGGGGSGGRIALYYQGGFFDGVLEAAGGKGDLENGAAGTVYVEKAVNRSETPHRTLKVDNKGRSPLNERINEIEEVKLNPGGLKDNHAYSNEYTSISGLKFKSDGSTLPLFSGSGRYHSLSLMFDGHLKNAYIVQSSFVNVVIEATLPRLMFIHQMRLYPYCHRHYRVSFTLTTDHPVIGWIDRTKGSRSFADCFDTSVYNDVIIADTISKFTVTLTRLESYVALSELKVFVGRDTSSFQTSTLEQDSSRTWFVFNHQGTTTFEVDELDVRGSAHLGIQNGAGNVEFVVKEYKGDFTGTVHVGAAQNWYLNASNNSAIPFTLRTYQGSQVFLPREVYLQKSSIYAESELEGLQNLFISQGSRFDVTQGVQVNSPLKATILLDRMTILNGGTFYQRTAEPTKLAINLTGELIINAGASMEVSQVHLQAHNIFIDIDGVLTASGRGYSSMKGDEPGRKSDVAASGAGHGGAGGSSSSQEYVGRAYGSFQIPLEFGSGGGQGYQDLPGSSGGGAVKLSASHIVQVDGLLDVSAKAAVNPGTGGGSGGSVLIQATLFLGKGKIFADGGEATNSSLGDAGGGAGGRISAHYEITRFTGSFSAHGGASKSEAGGPGTVYLSENSTHTTMIIDNNGYRASKLYISDYRDRSNDGGRAWLLAGYMNEFTLNLLQLRGGSHFAVYHLKPSFTLNVERLEGDLGGLIHASKKNRVYIKNSPRLFPSSFHIYNEGFLHLPRDVLLKDLFYPKISLEGLISGMDNLTLGGGVEFVVTSKGQTEGYGKKTIHLNSLTIMNGAKVTATDTVLDSPTVTLLLNESLRVMAGGLIQGKWIDISAGDIEVEASGVITAEKQGHDAKNGPGSPSGSAGASHGGRGGLGYTGSFPGSSYGSLYRPLSQGSGSMFAVGGGVIKLATSQSITIDGSVDANGEDGHNKSTGGGSGGSIWILSKVFKGNGVIRASGGSGLFLESGGGGGGRISIAFDNRTFSGKINVLGEQLIVDNNNIGSPLTDDITDVSNDGGRTWLTPEPNTIEMSFDEVDIRGKSELAVLTTPADSPFRWNIGGIRGDRSGILHVRANQEMQMTISDKEGKQPELLWGVNVYPRGDLKLPRNLFVDGIKMIVAGSLRGAQNVSVGNNGRLILRQLIAPNNQISRNLTFDVIEIQGGGRIEIQSDKDGLSIKCTALWIRSGGVLIADRLSIVAESVTIEQSGIIDLNYKAVVAGSGPGVGYSHYSGSSGAGHGGRGGRGQSQNRTGGFYGDFTSPKLFGSSGGGGSGDDIATGGGVFYLHAQQIVHDGEITVNGKDALNNSDYGGGSGGSVFIEVEYFDGSGVIEANGGAGGINGGGGGSGGRIAVYYNQTFFTGKISAYGGASTVENGAAGTIYKRIKYNGKSILEVYNEGKKPWKNEIADYSDLTSDSARTWLTMSHIIDSSVPVTVPDVNLGTTVYKGLTITEVKLGGSAHLAIEPDVTKIRLHTFERFYGMFEGNSFGFVHVGPNQLLVMPDTDYYIPVNLKVYPSGYIKLPDRVMLHKNSLSLDAGYLIGVEDLAISQCAVSFGAGSGALSTGYLQAMFFKIQTLTIMSQGVLDMVAPNSDYSLRVDSLVINSGGALNGRKLDIVAKSVTVDESAKINLDGQGEKCIEPDVYYAGSGGSHAGYGGLGIGSKRQDPFDSVFLPVAFGSAGYAGRSSFSCMGGSGGGSLNLTVDGTLQIDGEISSRGQNAKDSESGGGAGGSILIKVTTLEGTGKFQVQGGDGGVTSGGGGSGGIVTLYYKSSSHPFSYKVQGGGGKKIGASGFLYSKKDLQRRRRQISGSDSVLILEGRDVLSFESPSVLVCDPKLIDFTFEEVKLLKSSTLTMISCSQGSPMTLITTTIKGDKTGWLVVKPNHDVYIGVTSIVKPSMELEFNAEVEDGGTLSVPGKFFISGDTQINLSGSLIGVSDVIVNDNGKLVLKYPGHTGFRITPTQGISVVQISTVRIKDGGSISTTSPNKVKIKSVVLQVDFGGNLGSGISVSSSQRIEFTNGPSLNRQGCPHGYEVVEVASKTLYNPCGVGKHIFNKRNESYLVFKNVSVAISHNETIYVIKNETRYNVTYYIACDYDDFKLLPGQSCNLAPGSYRYNSLEIQGSATMHIEPGTGKGNASTLSVSKLTIFSKGELIARESNLINTNSTSSDYGGSYGGLGGGASKNSALYGNISFPVDYGSNGGGSSQNRGLGGGVVILKVGELSNDGLIDANGGDGSLGAGGGSGGSIQVVTDYLKGSGKFRARGGNAAYPAGGGGGGRIGITVLKGRSEFRGLYDATGGDGRRPGSSGTVFVSDQRQGTSYETIIFWNKVIGYPPAQLPNTSATYTYDEIRLENRGTFLATNQLVVAKSFVTDGTGKLTVSGGARVDILSFPKSSRIFSCDLEVQAGGSIYFYNKPIFLGPGSPTVVIAGILDARGPSLGKGKALNVTSSGEIRADNLRLLKDSVMQVDADASIRKSLAYAHFHLTSLRLDTNAQLTFAQGNVSFRSDSIHLSQGATITSAADTKLLNITGNDIFIDNQARVTADRGGVLGGPGKARGSGSGCGHGGRGGGSQGGESYGSVFKPEHYGSGDNVRGGGIIFLNIKGGFTLYGSMSANGASDSSGGASGGSILVHAETLSGHGEILSNGGEGLSNSAGGSGGRIALYITDKTSFRGALTAYGGCGTTCAAAGTIFVREYVVGLPQNSTIIDNGERKTEANTIIMHEMKISYTMRLLKIVNGARLEVATLPNVGMKIAIQNLVGDGSGSFHVHYNQTLTLGAGKAVSSRPFMFPWAMIVDEGATLNLDPVLFITRTAITPSLYLAGKLTGGEKVTVGQDASVVIAKSGIIGTHSNTPGKYSLLSLKVSSGGRITIEVDEGGKAPVELKSLSVDVAFGGVITGRYLRVDASLLNIAFSGTLQANGLGNPAGVGPGAGSSSLLTGGGYGGCGGGSTNETCVVYGSLFEATEFGSGGGTTQVPDGTYGSGGGIIAVVAKVLIVNGIISSNGQGGNSITTGGGSGGSLEISVSETFSGRGKIEAEGGYVPGQVTGAGGGGRISILITGDNKFSGSFSARGGNSSAKSGSPGTVYTEDGKTVLRYRKLFLDNRGISSNSPLPIFLNQSVVASYNFQEIRLNGQVMLHVDKDMEVGKLVTDPDSVIYVKDNVTFTVEPNSRYLQPDCSFVVDANGEIRIPDEVMFLGRNNVFRGTLTGILDMVIGENRKVFLSASARTARFIDGKYTFITHRGGYRFSSLRIKNGAFFSFENAQLKKVPLTLGRLEVNFGATMQGSWLDIKASDVIIHSGATVDLSAQGHESDKGPGAGGLHQSDGTGAGQGGYGGVSTGNFGTWYGSALNPNNTGSGGGSSSNGKGGRGGGNLRLTVVRVLTLEGRISVNGESGTVLNSGGGSGGSIWISADNIRGNGIINAEGGDGKGAGGGGSGGRVALYLQELMSFEGLLNAKGGSGKDAGAAGTLYLQDNNKRILRKRLWIDNLKVADNKPQTVLYEADKVNFLFDELRLNGMSRFEIYNLQRKLQTIQVTNFISDGVGEIAISKNQTLLAEVIEAKESHLTLTTNIYVEEGANLVVASNLTVDGATLTLDGKLSNVRHLVVESGSAVKFGITSQTTLMENKNFVFQSDPGTQQFASVTLKSGSDFGAPLNLKLSVGKLDMKSGVVLQGKFVDIKSQSLLIGRGAILTTNNIIDIELNPGGRGHSSGSGGSGGGHGSTGGTGYNSLVGGIPYGTIYEPNQPGSPGGHGSSSESAGKGGGVILIDTDVLENDGSITANGGVASQSSQAGGGSGGSVYIITSSVFSGTGTVSANGGRGNGAGGCGAGGRVAIHLQSQYAYRGTLEALGGISSRSGASGGAGTVYIKDVRYKLFFEQLHVDNQGQSWQNYVTLNESKTSYHFHELLLVRKASLRMTPNSNLNQSSTLSIGKLFGDRSGLLHLYNGQKAIIEVVEAQLTTTKTPVNFRIDSGAEAVMATTVYIVGDGAVALQCSGTLNGIRNFYVTQKRVVLLEKGSRTHRDDEQPGTLKFSNVKLFSGSSVTMKDEIVMKIFAGFLNIKFHASLEAHYFEIVTSNLDVETGGLLSVAGDNKARLAVEPSEVSSPPQGAGAGHASNGGSGYGGASGGLYHGSLYKPKESGRRGGRGTNNHIGGRGGGYVNIEAGTLIINDGTITVEGGSAVSGGGAGGGSGGSLLFNTESFIGYGEMNTNGGNGGGTNAGGGSGGRIAIYATENLYRGTYLAFGGSSVSGTYGGPGTVFLQDIRAKRPFKQLRIDNLMRSIEDPVTIDEANLTDHDFSQVHLFGRAAVNMAVRQERTTLKMSRLFGDRTGLLHSRANQTFYLEASATEHSVSKPAVNLKIDEHAEMVFGASLYVIGDGAKGTGQITGDSSFTIDGRMIDVTHLFITKRLKSRFLSHAHSADYQNETLTVSAVGTFVLATFEIQDGSEVFFPDVQGVQCEVGLLHMKYGSVIVADTYRIGVTSLLLETGSRITASGKDRPSSYDSSVLPSSCRGSGGSYGSKGGKGQSGVNELHSHGSIFTPSHYGSAGCPGSQNGGKGGGLIIMDIGDELYLDGTIANEGQDAARGSAGGGGSGGSIWIKCGRFNGHGVITSNGGAGDGLTSGGGSGGRVAIDTPTESKYLGEYTAMGGNSGDPSKETTLYSGGPGTVFLKDARNQYAHTQLRLDNRGRTWDHYVTLNESLKSYTFDELYLVQKASIHLVPDGKPLNLTVHKVEGDRTGLIHVHENQTLKAEFVDSVYTITRTAANFKLDKGANAIMATSVHVVGQGEVAFEWNGRLIDVQHFHVAYGRTVKIGFYAHTAGTKAGKYRFIDGYGTFRFSTLEFGSGTLIHYPPPMGVHFIVSLLDIKFSSYFKAEFFKIEATDVYLEPNATLNCAGRGFENKTDGSGKDSALGGSGAGHGTPGGDGQDVNGGEEVGSVYEPVLPGARGGTRTGSATGSRGGGRVRVVVGFAFRLDGIINVDGDNAAAYSGSGAGSGGSVWITTGYLRGHGVISARGGVGNTHNLATGGSGSGGRIAVHVKIKDEYRGGLYALGGVSSGTQHGGSGTVYIEEMQGDKLFKRLYIDNQNADPPKVFTLDEVNPKTVKANATEENDAEFGFDELMLQRGVVFRIADLKLSKRPAISVTTVLGDGSSTLHVMENQTFFIEYQEYTRRRSFPPVNFKVDYSGELMLVSDFHVAGKNNPAFELDGRITGVSNLSLTENRVLRAGENISSALLKDKVYIETPIDGQLKFGVFIMEASSELHFAKRMKFVVSTLYMRQKAVISANKIHMALNEVHMEGSSRITTSGKGPKAGEGLGPGSTFSNVGSGGGHGGQGGPGSTVDGGSGYGSYVYPVHPGSGGGGNGGGAGGCTTEITVGYSLHLDGIIESEGANGRSNSGGGSGGSILIKTVLFSGHGLVIANGGSGDGNGGGGAGGRIAAHVAWLREYAGQYTAFGGTGFKAGAAGTVYYTDTNQGLSHRPVLINEANHTVFGEGFTKLTVDNFNRNPDIPTMIINENSSYYEVDELEMRNHGLLHVHGSNSSFVVHNFTGDRTGLVHLRQGQKMFVQVVESKSGYSVAPVSYKIDQGAEIVFPSSLTLLGTRCSFDGLVIGVHRLIVAEGANVVFASTTQTGIKEDRKFRFLTTPGNITFAEVYVQKGSKLEFSRINNTLVFTAIIFRLKYHGLVNINHGEIDSSWAWVESEGKLVLDFTGHPAEMGSGKGNTVNLIGSGAGHGGIGGVSKAGQLGGESYGSIYKAVHLGSGGGNGQGKGGFGGGMLHWRIGQEIELDGLVTLRGGDGSGTSAGGGSGGSILIETTNFTGYGEINVMGGDGSGPSGSGGSGGRISAHVRFRHKYAGVFKAYGGDGKTYAAAGTVYVEETARGPQYADLKYEKSTNRTYITATHRYMEVDNEDRKTEVSSIMMESEHLFYELDELFLTRHANLQVRHPPGSPNVTVIVHRFLGDGTGRFHVRVNQTIYVEVVESETNETTAPCSYKIDQGAEVVFPAIVNIYGTRSIIEGRITGVEHLIIASGGFVEFSSTAQTARVENRRYVEIDENGNFSFATVTVERNSRITFSRILNYTLSLRCSEFKIKYEGLMTMNHGYIYSAFAWIESEGILSLDGTGFGPEQGFGHGTTKNNFGSGAGHGGEGGKTEHGEGGSPYDSVYTPRMYGSGGGNGRGIGGSGGGSLFWIVGQRLQINGLLSSRGTNGEGIDTGGGSGGSILITTTNMTGHGEIAVPGGSGTGSGSAGSGGRVGIHCRWRYKYGGKFTDNGGQEGKYGGPAGTIYKEENFRPLQYRHLKYMKETNTTMLAVDHTYVHIDNDGYDVPGATLLMEENTTYYEFDEMELTGYSRLLVYHPGNVTVTAVVHKFIGDKSGQFHIRRDQRIFVEYVESKTNKTEAPCSYRIDVGGQIILPSEFSMHGTRSVFEGMIIGVRDLLVSFGAEADFYSTSQTALIENGDYIAISKPGNISFAIVIVKKGGDVEFRKNTGLLRINVDELKIKYQGKVSMNHGEVFSTFAWLDSQGNFNLDEGGNTAEKGHGAGSTLSSIGLGAGHGGRGARSGGQSYGSVYRPLVLGSGGGNGGGTGGIGGGQLLWEVGKRLELNGFISARGGTGNGGHAGGGSGGSILIKTTNMTGHGEIAVTGGDAVNQGGGGSGGRVGIHCRFSYTFGGKFTDRGGFGTQSQYGAPAGTVYKQENLRPLEYRILKYSKETNTTFLAVDHTYLHVDNEGHDVPEATVLMEEGTTNYEFDEVELTGYSRLIVYHPNETDVTVIAHRFIGDKTGKFHLRVNQTIYVELVESETNRTEAPCSYRIDKGAEIVLPAEFHVHGVRSELYGLMTGVHFLFLEDGATLKIASSAQTALTENRTYIDITQPGNSSFAHIIIKQGGLLDLVRVEDVVVSVTSSVFEVLHKGTVKVNHGIFYSAFADVETKGVVVLDGAGYKAATGPGAGLSDSSNSGSGGGFGGQGGRSHSYNNGGGAYGSVYKPLSYGSGGGHGKWSGGGAGGGSLWWQVGKRIHLDGRLSSKGQSGSNSGGGGSGGSVLIETTNMTGHGEINVNGGDAQSNAGGGAGGRIGIHVDFRNNFGGKFRSAGGNVSGYNDNAGAAGTVYKYESRRGPQYRDLKYNPDTNLTTFKPEHSKVKVDNENNNVATPTVIMENQTVFYEFDEMQVEGHSTVIFYHPETARNVTVIAHEVTGDKTGIIKLVSRQRLFVFVVQSTHTYMDAPCGFHVEDYAEIVFPTEVILRGESSTIRGRITGVERLVVERNGLVEFGGTAHSAQLPEESQWLADNPFDPFTPGLIIVPQLIISNAGLAKVKMSPIRVVLNIADTTVKKGGQLILSTNDVTINADFVTVESGGLIDSSGAGYTAASGPGAGSGSTGGSHASPGGRAALGTQHGSVYWPDEPGSGGGYGAGGGRFYINTGGYVIVEGTIRANGVGSSSRSSGGGSGGAIIVKRPSHEGAGSGGRIAVYLTEHFIFRGTLTALGGNSGTKYHGSPGTVYIDVDVGEEPYRMIQVDNNNRDNLLPVTLAEANTVLYEFERIHLVRKGALAFKEVSGKLVKIVIRKATGDKTGILQALQNTRIYVECHSVRTEAPVNYEAHSGGQIVFPIQTTLLGTRAPALTVNGEIHGIEKLRLSSNVGTLVTEKGFSACLDCHSNYTTDYIGHYWFKKLQVDLGGTFQVQSSVQTISSLAVRLHTGEIALDYTGSLKADAAKLLTEYFRLELDAVTDASGSGWSSQKGPGSSTACSGVAGAGHGGRGGTGYFSGCSSCTAGGGNKYESVSQAIQAGSGGGGTSASGGGVVFVSVEKLLELDGSIKSDGANGDGGGGGASGGTLWVAGRHFEGHGHLTLKGGAGSSRSACCSGNPCSSYRKYNGGGGGGGHLRHFSPDYIRRDIIRKRDVSGGAAGGGSAGNGGSGQISAAGNECSGHGTFSAQEGNCTCDAGSYGVSCLYQCDPSITCLGHGRCSASGGCDCDPGYVGYRCEHKCDAKRDCHGNGRCSVTGKCVCDPCYTGDDCRYECSGNGTCIGGKCKCDPCYIGTHCHSLCSGHGTCTNGTCYCGSEWKGDYCEVPKCPNDCSGNGICNSALLTCFCNPGWRGFDCSELDCPGEPDCYNRGTCSAINGTVMCVNCSVGWMGPACNDPCVNGVQEPMDSGFCKCDPCWAGKGCDSLCMGLGTCSDNEICNCDPLQGWRGDVCQIPGCPGDGKDCTGNGDCNSATHECTCYPGWAGLGCNIPDCPGAPNCNNRGYCNASVTPPQCQNCSRGWMGPACADPCTFGEQTPMDSGQCICWPGYTGVGCDSECSEHGKIVDKSCVCDVGWRGDLCDNPGCPGIGSDCTGHGICNTVTHVCTCNEGWAGKGCEISDCPGTPNCFERGICNASVNPPKCQNCSKGWMGPACNDPCVHGQQIPMDSANCVCEPGWVGVGCDSECSEHGTIVDSKCQCDVGWRGTYCENPGCPGEGEDCSGRGECNSALHTCICQNGWTGDGCHIPDCPGNPNCANRGFCNITYNPPKCTNCIAGWMGPACEDLCTNGTQIPMDSGNCVCDPCFTGRGCNVECTGHGTCLENKCQCDELTGWRGSLCEVPGCPGSNGKDCSGNGKCDSANHKCICDPGWTGVGCHLPDCPGVPNCFGRGNCNATDRVTPKCTDCIQGWMGPACNDPCVHGYPKDGICVCDPCFTGSGCQSECSGFGECIDNKCDCGQEEGTAHMGQYCELPGCPGQCTSPNNGFCSMDTQKCICAQGWAGDDCSTPDCPGEPICTGHGRCSNSNPRRCNCEPDWAGERCEIPCVNGTNYGNSSGCICHPCFSGSGCNIECSLNGKCMNDKCVCDKTLGYKGDVCEIESCPGWPFDCSNHGSCNRATFECTCVPGWSGAACDIPDCPGDPDCNGRGTCTPPITDNETPKCTCQQGWMGVACEKPCKFGTPTADHICVCDDCYNGPACDMLCSNHSSICENRECDCGFDGWRGTYCEKKGCPGYKKDCSGHGQCLSASQTCICDPGWSGIGCEQTDCPGDPDCNNRGQCIPAETPYCGNCAQGWAGIACELPCVNGTQNQVDPTVCDCEPCFNGLSCDVFCSARDNATCSEGKCFCGFEGWRGDFCEKKGCPGLFNKDCSGRGTCNSATQTCDCNPGWAGRGCHEPACPGTPMCSDHGTCESLATISFCSCDKGWMGRACETKCEHGAPQQTGDGSFFCQCNDCYSGISCDLECSGRGNCTNNTCDCGFEGWRGPTCATKGCPGWGSDCSGHGSCITALGICYCRPGWSGRGCHIPKCAGGGNCSGHGVCDGINHDPPVCVSCDSGYMGEGCEQRCINGTVIKGDGDTCKCDSCHTGVDCGVECNGHGKCENEKCVCDSGWRGPKCETVGCPGQGADCTNHGVCLLVTQQCDCFNGWKGEGCDIPDCHGVPDCNALGTCYGGVDPPKCVNCTNNTMGPSCEFPCINGRENPPDSVICECDPCYNGLACDTECSGRGTCREDVNPKRCECDSGWKGQTCETLDCPGEPDCSGRGACVQQGSPPTAVCLCNQGFDGDDCRKLVCPGQPMCSNRGTCTLVGGIPACVCNHGFDGSSCERCLPQFTGSECDECVSNYIGWAVGCNVYCVHGNGTGHNKDICTCHNDANFGYWNGTSCDHCVFGWGLPSCAVCDDAHVGENCDIDCFSAHAQYRDELDGDWGKRPLEPIVSCLYENAPNEVFAWFGYHNKNPHNVYVSVGADNFFTRPYVDIVPGGLKGFVLKTSGADNTTNNLVPLPTQDYGQPIKFVPGRHEKAFKVRLEDTYPIAWVLAYPLSNERNAAVANQSLLHTMKCTNIEGQGSDVSRENYTCSCLDGHWGFACQFDCPGGPQAPCHNNGFCNKTTGLCSCDPNWRGDENCTACSPDWYGLDCSVVNHNLNNHTAAAYAHGYFITIDGAGYKFLGNGEYHLLLSHLWEVQVRMVTCFSSSSCVNAIAVRIEQHTVLMHSRFIDRKEPFVFVNGKKVYSVSFEFGPASQRFTFKRTSRLQFVLSSSYGVRLIVRLYDRYLDVHLRADNQTYCKTSQGLWGNCNLNSFDDLYSRDGKIVTRLNVSQSYVTEIYGKSWEVTEGDSLFVYDINNYHEKRELYGGGYALHFNNAGAHTEEIYSFSLSDITIEFMVRTESENGTLLSYTSTDMFAVILESGKIKLRYDDIILDTLAVIQTNQWNHIALVWSLSTRILQFYHRDDTGKRVNSRNFPIASNVNVFQPGGSLALGYCMPPPGGLTLPVTEGFIGQIDELRIWNQKLDPFSISANWRGNIGCTMRVQNLASLWKFNEGDGIVAHDCVSGAHIYFKSGIWRGPTWVFSTVEIPQFSVDTSTAYSFRFGNSWMSAEQLCYNLIFGSMLKSELILLTTSTLWFYHMSCVTSVTRSNDDSHSYWTLMALSDFNQLIVEHSSWFAQSLCNSVPAFNFPVWYGKDCDHQCKFGLPGTNEHKCFCMKGFFGLNCSSECPGGNNMPCNRQSSCDVSIGTCNCPVSSNTTYDCSVCSPGWIGSDCSVSLGGNRSSSENFTCQGFGATHYTTFDGVGYNFGTYGEFYVIKTNQFTAQVRQIPCMNASFCISSVGVKIGSTEIAIRASYNGTGMPLVWLNRKFTDATSVIMENNFSFHKTSPDVYEIVRPGKILLRVKAWQDYLSFVITSAPQWCFVGSGICSSCDSNVVNDFTNSTGTVYWGNSISEGIIINNLRSQWQVPAFDSLFIFGYTNYKERREITTNGYALSFNGTTASTGSLNAFVKGKDFTIQLFVKVYSSGGTILSYSNQFTFALVNDVRVKIFLGGISYDMGIALPSGTWVQVSIAYRASTGVLTYYQLNAQGDLYYKQTYIGVEMFSSGGTLWLGHWHISHEHITGVPLQPFFGVIDEVRIWSFSLDTLIIRQSFLLVITAKVPSLSALWLFDEGVGRVIANLISGSPDMYLPEVISRRPMWQFSYVREVFPSMVVSTSVQFSITFELLAKKRCLELIYHPHLQGQCGQLLSRAVTQFFFKACLFDVQSSSSLDASHISLIAYADYCMTVLHLSSWPAQRLCEQLPQSLPRRWIGPDCSVKCVFGSADKNNASLCVCHRGYWGEDCANECPGGGNKPCNDHGNCDVKTGSCECDLNWRGNGDCSNCTPGWTGSDCGVAVAVTQLSTCSAFLGGHFTNFDSAHFNFFGVGEFWFVRSIHFHGQLRQIPCHNGESRCISAIAFSFLSEWKVTVHAPYEESKQPVVWVNGKEAVYSSTRLQISSEVFLEKTSSTTYLLSSVLKDLKFQLRVVGRGLVIAGHVNQSFCNGTNALCGNCDGNRDNDFNVTVGSSLEDTWRVSKVESLFIYQKAGYEEERVVTGAEYALMFNSIGICSDLMPDVINASSITIELLFKMYSGPNAGGVLLTYSKAISLTLFIEGTLKVRIGIGIWDTGLSPQVDAWNQVTLVYYNTTGAVYIYHINSVGIVRLATRTMIAGIFNRGSIISIGQWIPSLEVSTKENDSLPGFVGVIDEVRFWNREFSLQDVTKSWGVNVLSTARYIVILWKFNEGQGSVIHDLVSRVHLYIPSIRKAPRWVFSYADIKILPVAPEITFSTSRMKVEAESWCHTHIQNSPLGIACGGLGGGTVAFYVRACLRVIASSNQVSLGISVVVAFADACEIQANLTIWPARQMCTYEAFRNSGLTNWIGVNCDIPCPYGYQPLGLYGRCQCDPGFWGQTCSGVCPGGLVNVCNGHGSCIHSNGICKCSQRWQGALDCSQCTLGFFGKDCSVAVVPPTIQQPVTSVFGTGYIVTLDGIKINVNVAGEFSVLSLFRYGLSIQFRQVRIGSYVXVRCVIVVVQRNVLAIHSSVGVAGQVLVTLNGTPISQNSLVSLGVSGFMFQRTSLNTYVVVGPEGFNFVINSLAIHFDVSITMNKDLCQETCGLLGRCRIPGSRAPPSNCTAGGILDTYDVSNITQELLISYVNSWAIPQNESSFGPILNISGEPQLNSVSGSCLYFNGTSLISAPLLNIFSGNYVTIQFFVKAKNPHVYAGTIISYALIETFAIAVNKTIFIYFGTTVIDTKLVLETGLWNHVSFVYMRRSGQVQFYLVNSIGIIQSRVFFVGVGIFAEGGTLALALWQVTKASLSLPGFVGWIDELSFWNKRFDSVTIQQTWNSNLQAETPGIVLLWKFNEGSGFICRATVGSLNFGLPTPPWKSPVWYPSDAIKVGNVFITPDLSEIVPDNSTRELCSDVFLKGPLFNECANVTGGSEFYYEACISEVSTSGTPESALMIATTFAEECQVALNLSSVPGQGLCNIIPGGRYNDWVGINCSTKCIVGLFTDGNCKCENGYWGINCSIKCPGGAENPCYGHGKCDIISGECNCQPNWDERKNCSKCTPGWIGKTCAVAVSTTETPVTTYTSKVCTILERGYVTGFDGSLFTFTTLGEFIMINSSILQVQVRQVPCEKSSVCLNAIGVTFNDLTISVHAAYESDSFPVVHVNGELTIVGEEPGKDMLKNNISIQPISRSAYRIVISLYLTIQTVFSDRYMSVESTVTSNFCQLVDGLCGSCAKLRVSQNATQGSGGSVSSVDRPTTVLEELGRSNATSGNVNEFVKKELPVQDPIIIIDTEMHKETRVVYGGVYSLYYRFTAVVTQTVVKLFASQTLTFQLLVKSCDPQICGGTVISFASNVTLYVSNHVTVKVVIGLDVFDTGIATEAERWNQITVVFVRERLQLFVFVTFSSGLVQVRKFSFSIDPFISVGTFAIGMWQPASGSISIQPTSFFLGQIDEVYVWARPFDYALVEQSWRSNIQPGAPXLTNLWKFNEGKNSILKDLVTGVTLLFPRYPLGKPEWVFSDAPIASVVAVNPNENNATLRTIAIKACFQFIYEGPIRSACDALGNVTLEFYFRPCVQAVVDTGLTVESIDVVITIADYCQKLFGLPHWPAQSLCNKFPGKRFPNWIGRNCTIPCIFGQAANESEVCVCDPGFYGTNCSGICPGGKGNACNSHGVCDVVTGKCSCELNWQGNENCSTCTRGWAGTDCSIAVTQWPSGSVVIGIGAVLLGGQFTSLNGVSYSLQVTGEYYLIYSIHVSVNVQIRLVTCTQQESCINSIALQIESNRVVLHGPYSAGRGLIIWLNGKVIDIDLHPITHELYGLIVSKITAQLWEVKYTGLYLKIRVIGRFLSLSVEASGLVCKSSIGLLGACNQGLLESLMSYYPTKNCSEEGFMLNLSRNHPNVFSQGDTLGEKNASDTKAKTQDVINTLITTKLKVKECHSLFEYKYGEVVEYREANAGYVLYFDQTTVVSDVIYKAFSFTDLSVKIMFKTVRYGVIISYTLRKTFFVTNTGGKFTIFYGDNVYHTNIAAELNKWNQVSLVFRKSTTVLHFYYFSSGGQLHRLDLNMGVDIFTPGGIIALAGWMPSLDGSGTQPTDFFAGFIDEVRIWTRYFHPAFILHTWNRSVSVKAQDLAHAWKFNEGEGITAIDKVTGMKLDLPFKPWRKPEWRYSDAVLQMPFYDRPLHFNFTNXSLQVAAEQFCNRTILMGTLHSNCKSLGPGVSTFYFRSCLQRIATFESLYMSMEVIIAYADYCQTFNNLTVWPAKHLCNEFPGREFPIWFGERCDKKCVYGKKLASETCVCYHGYWGLECTNACPGGAANPCNNNGLCNVITGECECIVNYNGTQDXGKCSPGWSGLDCSLALVSLNLNRQTSTAISSTDGHYVSFDGYSYTLVSLGEFYLMNLPHLSFQVQVRHVPCRHQTVCVNSIGIRISSTEVSFHAPYNTGGAPLIWVNGKLLLLSGLITTLGSPHLGILLNYKGRNHYQISWRDNFAMGIRIHGRYLSFIVDVTSPYCYNSTGLLGSCDNEPNNDFKASFNESIVPTNVSQPVLNTEIRSHAFVYEKDRVIVLKYKHYHEKRLPTGGIYALLFNQSGASSKPLIKTFNFNADITLEILLQPYQFGGTIFSYAVLQTFAITIESSLRIHFGKAIIDTGVNVTINQWNHVSLVWYHKSRVLEFYHFNFQGKVQRRSYVLSSNPFLPGGILSLGQWELSPGDSEAHIVASFVGTIDEIRVWKRAFNPAFILQNWRMNVVPTHPDLSGLWKMNEGESDIIVNLLTDEHIYLPRSPWQQPHWVFSDADIKTNLTSSDQPFEMHFSNETLERMAKTFCYELFYKSTLHDQCHGQLKSELEFHYLVCLKDIATSSYISAALTAVVTFADHCQAVFNSTTWPAQSLCNKFPGSHFPLWIGDRCDVKCVFGVADPDDRNRCICMEGYWGSDCSQICPGGLLNICAGHGWCDRTTGQCQCQVNWKGDENCSSCSPGWNGTDCQFAVKRVTGITSQNVFVASIGGNGYFTTFFGVSFTYRAVGEFYVLQSASQNFVIQLRQTPCIVDGSYTPLCTTGFSFSLNRNVIVIRAPVTTFSRTVPIFPLVWLNGNFVQVDHRTQLSVDFVMVRISTVAFEIYGLNDVKFGITLGNSLSVTIHLPAMYCQNSTGLLGACTGTSFNNSSSLQAHITSLKQSSVVDKLQTLFIYKYLHYSEYRSPTGAGFNLFFRDHSVRSGPLHLPPVDVLTVELLIKTHQTGGIIFSYLSQNIFAVIDNTTLGIIYKGTVYDTGLKLEIKQWNQLTIVFKQLVGTLYFYHVSSTGAVKVRVFKLDKNVFVDGGFLVLGQWQPSPSSDSMLPQSSFVGEIDEFRIWKRRTNSDLVKSNWRLNVQTGIYPDLLHLWKFNQANGRIILDLLGKNDLFVTKFHEPQWTFSDADIPRVNLEETAFVNLSLQRDAESFCFSLILSGPLYAKCEDLGIQVAQFHYKVCLHDISLSSQLRSAVYPVVTFADHCQNDLNLSEWPAKELCHHFVDQRFPYWIGSSCNTRCVFGYPIPDINSTDGVSCKCEQNYWGVDCANLCPGGLRETCNGHGVCSVTNGTCECESHWKGNTSAEYTAPIVENNSIPPIPCSKCTPGWTGADCAIAEDSSILENSSIPRIAINFGDPHFTSVTGVNFHFEAPGAYHLFNSSVVVAQVLIVPCNNRVSCRRISEVALRTAKRELSVRYNGLETVASSLFDLTSNTSKELSKSDQWVEDADIQYRWLTDNILEVRIQEEIQFNILSYYGTIGTAVEVLEQKDQTDGICGEKESIWIRQQGNKSLKSENLVPDTSNNSTKDQQGLTQATIETQLITRFRIMEKDNFLTTKYAWRSYSGAGYMLEFSSGNTAVMYASNTSLPVLDEFTIERWVCLTNAGVSVASLCTTDQRNTTEPVTGGHAVFSVANAIRDFAIVYKDGLQVKWDKEKFITGINLYEGVWTHLAVTWRRIDGRMQAFVYSNGQHRQSTRYGVKNGKQFSFNGLFVLGRYMRGYMVDSEYDMFGALDEFKVWQYAKTMEQIRMSMSVKFEDYREGLLLSIPFDEGVGQTTVGHLYSPIPTEEALSLFEAQVVNMTNIHLFIHSGDSPGWAPSGVHLTPLANYSLAFLNKTLEEEALKKCYESFYEGKLQEHCSPTLVSQALFYYESCLTDIADSGSLAHHKLSVSLFGFYCQKVLGIKECLLHGTYDAFLRCPGEEKQTKLTPTEIIVITVSSLLFLLFLLIIIIVMCRRRKRRKSEVEQIYLHEAGCERSHKYVAGEEGHHPHADSARRMLDEYDFEPDMDESLQDTPRVTRRPLVRDPAGGVLLDGEEETTL

>XP_022805470.1 uncharacterized protein LOC111342641 [Stylophora pistillata]
MKLSLNDLKHGRKTLERAPPPKTPTVIPTSQDDDNTGALSVASVRERFEQKRIPPQKPNATQSPSQGRGKPTEEKVLSVASARAKFEQELKPVNKGPHPWKPTPKYRNQSKTDLLRLDKNSNAKGHGNKEPPRKELPNIFRIGAAPSKPAKPTNLRFMLKKYKDKIILSNSLTSSTQTSTKADTPVIDTSWNVHQQVRRLLELLPSRALNTEALLRPSLRLLLRTSEEIRVLASTPTISEPSKENPLLRFEHYKETSITSPPSWKQNEGNQKKTGSQRRDGWIYLIDQGNTEPNRKELPTLCDIGAAPQKPARPDYLKFNLRKYWHMIAISKGVRNIAKTSSGSKSNEGAGYLDLEENCDYGAAEPPMITRNVTRLTPGEAEDSKDATASKVITDKGVSLSVKGVNLICPPGAVKDPVSIRLTLEEPYRYCYLIARCGLHNDVLFVSPIVNCRPNGQKFEKYVTLKVTLNRKRVKSDGDLLVLHGTRTGQSQIINWEDITNESKFDLQTKNLEVRINQFSLIAVLARLTLVRTKEIVTRLNLMPFNYNLSVLLKLNKQQSPFDELALVFMSQDTYREEYYRDHEDSAIMRLKKDGFEELPIDSKNAQESICIYNKEIITISVQLGEDYKPANNQQECFEVVVDSCAWWNTGHVIKLPLQVCNTNSKIVCGKILVKGEFGHVRENKFCQQDLCGYVRHVLGVKKAIFDVKAVAQKLELPAETQRQVLTCWQSEAKQLELVLLHWREKHGDAANPNHLKKALEELEPEEYKVKERGQMHIDHLRELAFKIAGLEHHDTDVMKYFDREVEKLCSAVLRDCCIENATSKEEVESAIQTENFASVLVMKKRLPESVGKTCTRMFSSQEDEFNTSSFLCVVSELIQCIKEIFREEKIQRTLSGLRGVTEEATLQKITSGIRDAAYDKSIFKLLSEVCHILEILCRNNNDDRQRESRIQSWGSCAMMAVMLHFLERFFQCAEACRLYRRLKLFSFSIRDIMSNPTAYEGSFLRDFHNVAVSLLKFARFDPSHLVQFELKDPNYSINNQTKDDIKYDTSKEMFSQLFWMIKKCEENLQLEATAHVHVGGQLLTLGAFEAMVILPESFSEVVENAPIASFPVSFKSETDGESSNLRVTLYSTQKDNIGKALEYLQDALDGQLSLSKQTEIKETSLKLLNVAKAGTDVRLRLTHIWGSGTLGVESNTFVPLGYSSPETHNNLEITDEPDQQAVGESSLLPVAGTSEYIADNHNRGNDFMTGPSSAELIPSLTSTTRQTVINVNNYINHTVNMEGHVCVAGERPNLNVQSPSSAAHRFLEAGPGAATNRSITAGDI
Sequences for 5 P. acuta biomineralization genes

Fasta file for these sequences is here

>Pocillopora_acuta_HIv2___RNAseq.g25214.t1
MAEQPRKIVKGRSYGQRRPPLTHILKSILERYPDGGQILKEIIQNADDAGATKVSFLLDSRQGFYGRNSLVAPSLYQFQGPALYAQNNARFEESDWENLQKLMDSDKKDDPLKVGRFGIGFNSVYHLTDLPSIVSGDSIVFLDPQEIHFGRGETGQEFSLEDELLENHEDQFKPYENVLDCKIWTRFYNGTLFRFPLRSAPSDLSEKVYSKEKVRKLFQALEKEAPVILLFLKNIEEIALFETDERGVERHVFTVRLSDSCRQEVREKKRNFLNDLRRLSDGEIENINLSLDLHVLEKTEGGREVENKWLVYHQVDARNSTLKKLSLELGLLPWVACATPVNAVTLQALSSRTGRIFCFLPLPPDADSKTGLPVHVHGYFGLTDNRRGLKWPGLDCQDDSTAEWNVSLVQHVASEAYANVLLLLRDSCDSSDGTDLVYKSWPNIREVENHWQCMLEHMFSILLKENILWTPANHGQWRNLSDAYLDRMTTQFQSTSDETRRVVLDTLTQANEAVVIVPSHVMTAIDKYRSVPTKSITPAFLRALLKKKEKGVWKISNVLKEKKLLLLEFALADKNLDDMRGVPLLPLADGSFVDFRSIQYNREPAAAVYVSSTSNPRSIFHNMDNKFLDDSVQTSAVTYLSKVATDANNSHTIEPVQLVKLNQTIAVKVLREMLPSEWSGGNHSVPWYPGKNGHPPEHWLESVWVWIKRMLPTDLSLLENLPLIPHTCAGNRSIVKLSSSSVVIRRHYQSISLPPFIVSLLGKIGCIVLENLPSYVHHNTLNRYVVTPDPHGVLKIFCTLGQSGCIPAITHCSPDEKRALRSFVSSASLSGDQRNLLYDLPIFDAADGYSFIAVRNGFQFHGVSPYDFKLPQSLPIPRASSIIALRDSQSHTLIHRLGISPMTKTTFLRDIVFSGIQNNFYNHQQISTLMCWVLSQYSLFCGEDSSFHSSLQQLPFVLTMSNKLVTPCCVLDPQQPILKQLFESEYDKFPNGSFVKEETLLRLRQLGMRSTPNKEDILHVAKTVDKIHSDVGSRKASALLEFLDGNPPDKSLGQTLMNERWVPRKQSRPSSYPGAMPWFSGTTHLYKPSETFSQSKATLVGASAPIVSKPCSKALEAVFGWDKSPPVHHLLNQLRSACLVRLNDMNRSALYHFQVMLRQIYEEGSSNASLIDAVNQDNSFPEWIWHGNGFSSPSKTAFTSCCRIDLRPYLYIVPKEFKILYPFFQRCGVREKFQDSDLLRILTMIKDKHNTSSDYLKDVADDRKLSHEILHWIVRDGKKLNPDLRESLLVPIHTWDNTLKLVPCSECTFCDAEWLRKGGTELLITSEFPMIHKAISPETAALLGVPPISTRVTCAEAVGIEQTGPHEPITTRLKNILNEYKEGVGVFNELVQNADDAGATEVQFVLDWRNHPNQRLLSPGMVECQGPALLAYNNATFTNDDLKNISRLAGATKKEDLEKIGRFGLGFSSVYHFTDVPSFITGSYAVFFDPETTHLGSHISHASKPGIKIDLAVNENSLTCFPDQFAPFNGLFGCDTTPYSDHDKFYFQGTLFRFAFRTKRGEISDKIYNKKEIRSLMFSFRESSPTLLLFSQNVKKVSFLEVDENATDTKDSRLLFEICRKSQSECTPVKNSVTESTFLKSCAAWTRQSTSQSEPFARYPSPKLTELISVCFTICRKNEKRCQETQSWLVTSCLGTGNSFKLATSEEGKKEGLLSASGVAAKICTQGDGSQKVEAVPGEMFCFLPLSIQTGLPVHANGYFAVTSNRRGIWEGTTADIGRQPLEVRWNQSLMEDALTQAYVQLLENMIVMQTQGKIPSYDIFTLWPNPDKLQSSAWEPLTKSFYRRIASYDLPLVRKAGKWLPVTQCIYQDLKLRELPNSKIVLEKFDYKIAQLPDFVKKGFQQAGCMEVIDQRTVTQEKFLRYVFFPNITAIPEELRNPIVCYLIDECLRRHASNWSKQLLDLCEFLLSTNRCIPCGLDTGHLAFPKELISPKSAAATLFSADDRRFPFGSCYQTEERLLVLQNLGMLSDILDWETLIERANSVSVLCRRAKQDDRKRSALLIKYINVYLEKMDPPSELNREELMAISMFPTLAKPPNYVMPWKGTADWNSDILPAKEMYDTRYKFVAGSSRPILDESESGCSRLSERARHLFGFNSRKPSAHEVLSQLEHAVQAIFHSPHAVESLEKVFHCIYDYLQELVLEPDGERIIHALQEKKWILVQGKCLPASRLAFAWRRFGEPYLNEVPQNLASKYRSLFQATGIKEHFSTKEIISALYELNEEKQGERLSTKEFKVSKSLIDEISEASEESLVTERGKIPLPDHNLILQPAEKLAINDAPWVAARSDIGYVHKDLSIDLAHRLGAIDIRTKKLSRISRPIGQKFGQREELTDRLKGILKAYPCDVGVLKELVQNADDAGATEVHLIFDPRYHNTDQLLCDNWKELQGPALCVYNNRPFSKNDLEGIQRLGIGSKANDPTRTGQYGIGFNAVYHLTDCPSFISNGDTLCILDPHYRYAPGADKENPGRLIEPIGEEERSDFRDVFPCYLEDMFDLKSSTMFRFPLRLQSTCTESMISEQRISSTEMNQFMNQLASEAREIILFLNHVKKISLSEIKDDRLKEIHSVSVQLTADDNAKRLKLTNHIKNCRFLNTNEIQWFGITYPLFVHEGRVRQEQWLVHQCIGIQKRGGDKIPNGRDYGLLPRGGIAAKVSEKSKLHSHIGSEPTFKAFCFLPLPEHTGLPVHVNGHFFLDSARRNLWNDEKGEGFGNQWNHFIMSKVLAQAYISLMLEARGHLPGSKRGETTSFSKEFKVHNGMRWYHNLFPHFGTVQSRWRVLAEAFFHNICKEDAELLQLTTRLFGKNSPTSQSAKATQSIQPLDNAQVAKNDFIDREEPIRCFWLSPSQGYFNTLSFTDESAKDLSKVLFNIDFKLFYSPFKLFNDFKTAGTDVREITPEGVIKFLEKNPNSVGILPCPVSETTVGSVVNVVLLLNYCMETPTFLNQIFGIPLLLTEDGVLRRFQMDKQVFLSRFADLLPNQRSEFIHHILAAPLLHFEKEIFQADQGVLKRFDICALASLLPSIAKGKWRETNSLIPWDWKDGPTEPWLKPSEAWSVGQKTVVELLRKLRCPELDVELIGSGNRWDLSPVLKQHLSYPNSSQDILEVLDHLMRTEDISGYLSDDEMISLLQFFQHDAASLKQDRSLTSILKRLPFFKTFMGTFVSLENAKSVYTIPKGLPTDECDVWMRGINCLFLAPEPLLDHLYNRVLNVVSRTHADCYINFIFPKFSSLKEQTRMLHLNYVKRFLLAPFSDEDQHTRVLQSLRTLEFIPDTNGSLRTASYFHDPREKVFAVLLPREAKPPEPFNEKSGWLDLLCEVGLKKQVSRDQFLEFSKEVAKQAENMSKKTRITLAEKSQTLVTHLLGEESFHEAEYLRKLSMIKFVASSKASDNLLSLHKHYHCSKEVQDGTLPFIHFHGSVPKMNERLAWTTAPLLPDWAVSDSGKEMLALGVFPYPSLDQVINHVTILSQNVPKDIDGEIPQPKRRLLGDIMTEVYTFLQRMSRCQESDSLNSCSRECHEIRKRLSDKSCVLVEDGRVFVRGDQLAFRLDEQLAPYLYKVPREYGSFQHLLLRLGAVEEATPEQFAKLLNKLNASCPGKKMRRNEVSIAKHAVHGLFTSLKALQDHNKKGEPVNNALSKIERLYLPSSEKKLERSVDLVLFDFLWYKLRIPTAMYKQLDPLQKYNLTFATPQQIVDLLPAHLKIPSITALVREELHSECREKKCRADVEKKCHETNRLRHILFSPKFVDGMIRILKNQYQKAKLNDEVRGNVRRFQNELKISCMEMLSTELVENKSNTSIPGSQRSRNIDCFVERDESGRKHIFIKHGVGPRNVRRILCEEINQLTGCYIDKVSWLHLADILECESPEDISSTLDKARVSEDVDTTDTPNVEPDLGTDVREEFHYLQEQYGDFYFGKGEFVAFEKDDSTDEEPRYIYAKIMNKVTTKIKPKKDRTKRKQKDESKLLDRYLIDIGHVKKEVDVLDLYKIRRPQTFLEKDEKPEGECFSETMELVPHEGTYRQSAEPEGAECSTTPQSKDDCGELPKPRTLDAARKEVRKKLSEIWKLPEDKRKKAVRRLYLRWHADKNMDMQDITNEVMKYIQTEVERLSKGKSSSRDEGCARPPPPDFSDLFKLWDVIARRQRSSFENYRRHNPRFTGFASHSRRTYTASNLRVAKMWIRQSKEDLRSVKLLLTARDPLYYLVCFQCHQIAEKSLKATLYALSGVADRQLKYNDLVLLAHDLSRLPRAPDVTPQVARLSDYYDGTRYPNKHVPPKVPAEVYQDSQQAQEAFRLATEVLEVLEQFVGP

>Pocillopora_acuta_HIv2___RNAseq.g11609.t1
MKTSDNNYGGQAYFEIPSRKPVASVFSNQMSVMANKAYYKERIQKMQKEIDEAGKESKVLKNLDVFVLDNSLRESTVGQLRGHTIENKWKIYKEIKKVGFKDIIVASFSHMTRLGDTFIKQLHDAGEDFTNLWAFTELLESVDKSGVPDTETIPVGLRKMKELGIRNAIIEMDLVYVGINYKKFKVEAINDLLTERLKWIHANLAKDSKVVVNLRDLPDGMIKKPKRVFKVIRHLSSLPLAIRPFGIVTEESGKYFPEQLAAWIRAVRKEMDDCGFKDGHLLVHVHEKWGHVDTTQLSCLANGANGIWASMIIQGASMGAASSTVTLMNLVRLGNKKVLKKYNCSALREAAQEICRVTTGYEPYPLQPIYGERALDMVFGMPTKLGINEFDLAKFFGEEPLMRMTTLASAEMIVTRLKNLFGEDPQFTIERGTRMKEVMLEDLHKNIKEEYMSAAGLAMLFDRSGGHLTEKMAEVLTKEKPTRAHAQVLIAEIRAMWDEWDLRDGQRDDKLEFDAFYNGFLAPYFGCYRCDETKQALKAMDMDEDGTVDWNEFAVYLKWAMRQYPETKTSEDLLSVAFRKGLIPAMQDEVVRKQEGEPMEA

>Pocillopora_acuta_HIv2___TS.g23498.t1
MSRSSWKWLFVFIIAIVFSQARGTYYGHGYYPAPTTQSPACPSYITSNNNVKFTASTSSLGPGHPIINGNEIWCAATIDKAQHLIVDLGGVVKIDHVVIQGKAGTKQSVSMYYVKTSKDGSTFNYILDDSGTRPKAFWGAIDDGDIVAKTNLKKPVKARYVSFNPREPQTDANGLCLRVDVNICNGELTPINGGWTSWSSWSECSQQCHIYGQGSGYAIKSRHRTCSNPIPTFEGAACKGNSGEDAFCLNSCSSQVNGNWGYWSQWSACSKTCGNGTATRTRKCDSPPPSGGGSSCVGDAKQTKYCSDRLHCPINGGWSPWSSFGGCSLMAAGLIGSHGQVALRRAEEVHRRASALATIPHHNMAAFTVKEAIQIINLATLKIAPLMEAGLSGHNGLPVTKRVAVAAPIDEGNVPAHHLRTVAKHALEIVTKFRNAIRKDAQWTAGGLSGPNGQDAVQHAEVEYDQEDVPVPIHHRPMTDQDVLAITWRLKGATHNRVQWMANFLHGLHGNNAARCVEEELKAGHGSVIIPLPPRVGSPVLEIPSRLECVAPTSVQLMVAGQIGQVGPPVAGHAVAEHSQEDVSAAARNLDGGWTNWATGPCNVLCGDGKRNRTRTCTNPPASGGGDVCMGPAFETENCNSGPCKATTAPTATALPTPTIDPNIPKIDLVFAISATSASFQQSYELMKNTVKKFIDTYGVNKIHYSVIVYGNQVIRVVNFNRTFPLSANELKTAIDRQPALPGGPVLTNALQEAYRVFKESVGRSGAKKVLVVITDRNSGSSTNSLSQAVRPLEDLGVLVISIGVGNEVSRSELNIISPNPLDVISARLNINPSVLAVRIMERILRLNFPDVDVGFAISAASADSDKIFSLMKQIINTIVDRYGVNKVRFSFIVYGSRVTTRFTFDNAPITQEELIKAVNGTEKVTGDPDLEKALEEAEKLFTKTSRPNATRVFVVLSDIVGSGDDNSLIATSARLRKGGVLILSVGFGQQVNAISNQMTKVVIAQSDYISVPDFTTQRPVVIAETIMFKALQANIPEIDLTFVISATSASTDRTFTLMKSTINNIIDKYGISRIHYTVIVFGSGFTTSVDFSTNVPDKETLTRLVTSLQRETGTPDLAKALQEVKRAYELREVRPNAKKVLVVILDKKTDNSKVQLETIVTDLVQKSILIIGVGIGRSVDRNELIYITEENRNIIEVEPTERPEEVAREIMKIILRSCAERTSLNDTNYSASTVEQPAQFAKLDTSDPSASQRAWCRDINDVGGYLQIDLGETADVYQVATKGQEQNNRWVKSYYITLSSDGNSFFNYTQGGRTKVFSGNRDSRTVVFNNLNATQAGRYVRFYPITFNQQPCMQASVFACTEIIDPTANPARISEDAGNGILIALWILAGILTFLLLLACLYYCCWHVCCKRGKKSKGLTTFSETYTEEDGGYLIEDGESKRWNPRAVPMVARAPKVDKVPEDEVQEVSIEMKEDSPQLGVIQFGIEADNTKDQHVTAETVHSETPLYSQEVNTGTVKKGATKMTMSSTDTQKRKRAQSESAAATLEEAEATSKQWSYQTEQRQSSAFANEGYMRSQESIAPTQVRTRTQELRRAQSADELSAIDYDMFEQRRESAQSALKGEMSRDGYMRMKQSSRGSSVDETDRGFQMGTVDLAIGGIEAPNVQRGSSQFYGMEEQEMTFSADNGGRHYYELEDGGYRTEEWYARGGHEPGRLRDEGFREIHVEHQPIYHEIEHDNKVSEERVCYSYVKYDKFVGEF

>Pocillopora_acuta_HIv2___RNAseq.g7668.t1
MFKTRRKNALLVAVLLLLTPSFVVATLSSCTSNSNADISSPCKFAPGVHSYTSLTVRTEVFLETTSGSSQHFFNVSQTFDIKANGRLVLDYNKESGSNPGAGVLTSGSTGRSGGSYGGRAGAAAKTPLSTSQAESYGSAFVVTQPGSNGGGDPNTRGRGGGFLKIYARKLVVAGTVQANGQRAQQNDGGGSGGGISVDCFEIDGNGRMEASGGMGSGEGGGGSGGRISVQFQHGSFLGQAKSYGGRTENEVKAQGVAAGSHSLTASSRKSVSAPHPSSPNIVYDPSKGRLNYQGSTGGWMPATNNIGSEWLEVGLSKEAYITDVATQGRYSGSEYTSSYTLKYLDPYISSPETWRDVKENGEVKTFTGNSNTNTVKQNALPTDVYTRAIRFYPKAFSGEIALRVELYGYAADLSSNPGCSYVLPQDNNPEPGGPGTVYFDGYKSGSRHRVLSVDNQGRRPRINSSSDVSVFLQTGAAAWVTVSADHKIEEVEVSGGAQLVIAGSAKNLTVDSITDDNTGYLYLLSSMQLQISTLLSPCTLIYHGAELILSDLAPNVSLERSVDVHGTLSTLGAPSVYVGAQKGVFTMHPGSGPSSLSFGELIIESSGHLQLLNYDKQSPASCRWSVALTSSKFTLSDNSKMDVECPFNLTGDQMNIGLNSALKIDGNSSVSYISMNGVSIRGTFDPGVLSLLEGWKTLRIERYGDMSFFPHGDVRINTFYSNGNFHVEGVIYLRGRDPAVTQLIEVDEYGSVQFDLPLSSNSLVFVHNNSHEFGSHGRLSLNGVSLVHADIVVVDGTWLPNKLKIEPGCKELTVEHGGKFHFDPVGIFQLNKLLLDGDVKSLNAVKMEGLSQQKVQQCEVGFHGTVLIDSADLTTILCEYVLLSGTLRVGNLSIGSVWNDLHVNGTNGKFYFETSEPLNINQIRVSGLIDTGSAIGPSAPLTSNNFTIESAGEVNIHFQAQPAITVDGAINSTLYVTTLEVNGLFKLGSLYLITDNLSVGSSGRISVNGGGAQGGMGPGAGSQKDGGASGASHGGRGGRGSQTLAQQLIYGDIFSPGGWGSGGGNGSGNSGGGRGGGVIFLQINQTFDVNGVIQMNGLPGQTSTSGGGSGGSLWATCQQFTGSGKIEAKGGNGNTYGGGGAGGRITVNYVTGGFHSDGTDASGGQAGSGSNVEHGGPGVIYLDGKSPIVKNLRIDNKGLKPLTSLSGDYSTFIYSGAVAYLNPPSDNFVYDFTLVEVYGGGQLVFPRAGTEVRVDTINGDDTGYIHVPPFNTLNVTGPSEYRRINVTWAPFIYEDATFVLPNGTVEIRKAESLLYPNIERSSHISFWGSVLGFKAHLMVAYGATVSFENSCPRNLNFVGITVQKTSRLLFKSNMSVEADGWTVEVTKDNGPVYRDGIVTIEGDGLIEARALTIKAASLIVDPRGKLTLDGKGYFAGSGSGSGSSDGSGAGYGGTGGNGRVTTTTGIPYGDYTNPRLFGSGGGLGSNAGFGGGALLLKIVDTLVVEGTITVSGSQGVSSNSGGGSGGSILIRTRALEGSGVIAVNGGAGNSNGGGGAGGRMAIYWQDREWWRGSLTAFGGSSSQGGNGGAGTVYLVDTQHGVNNRTLIFDNNNLSPSAMEISDYSSPYTNGGRAWILPIDQEFELEEVHILRKAHVALHPNITRPHGLKVYSFVGDRTGVLHVGPNQVVIGQFADHELFEVNVFVYRDGHFHLPPTFACYGIDITVRGYLGIEDMTIAKSCRLFLALTGSTELANSEGTYRYNSLTVADGGELTSTSDVGNNSLTLDVDDITIQGGGVLHMVRMRIFVGNFTVDDLGHLRGDTFDNSCTSGAGVTNSNGGSSGAGHGGTGGLGRYGTRVGVAYGHVYEPEHFGCRGGGSGGLGGGIIRMSVRGTLQIDGTISCNGDNGQQSRSGGGSGGSIWIDTILMKGYGTVQANGGSGHVETSHSLHGGGGAGGRIAVYFRSNRTYSGVFESLGGDSTGDALPGGAGTVFLYHLIHKHRTLLISNAGRKALPTEHLVIKDYSEPALISGKTWLLPSSGEHNFSRNQNYYFEELQIYGAAHLAVLTEPHNRSATIYFRNMIGDRTGTIHVGYNQTMDLIRPTIDLPFNVRVYRGGFLGLAPATEVHGVEIHVDGVISYVKNLTLHHGGLLALNENSRTGNEATENDFKFDFLRVQFEGVIQMTSSPVTHNGMNLTVRVLHIEGGGKVEGSDLRILAENISINTEGQLTVSGRGYKHEDGTGEGVHGKINQGLGSGSSSGASGGGHGGTGGRGKHTSKVGLPYGNMYEPIEFGSSGGGVNKKQGVGGGTIFLNVTNLLEIDGALSADGGDALPQGGGGSGGSVWINCYIIKGFGKITANGGSSPSDGYGNHGGGGAGGRVAVYFIKNDTFSYFSYQAHGGQAKEGQDKVENGGPGTVFLYHLVHTHRTLLIDNNGGKPLNKHINYGRLDEEGGKAWIMPESGFHHFAAEEDKFHFEELQIYSKGHLAIWPRAGNDSRNVSMFFKYMIGDRSGMIHIGDKQVMDLKRPEIDLPFSAQVYLGGFLGLAPYTQVHGIEIIVRGILAYIRNMTIHNGGDLWLNHGGRTDHEIINHYDFDFIRVQDTGTIHCVTSPVNDPGVLFTTRAVFIEGGGLMRGSRLTFVTENITIDDGGRLIADGLGYNTSHGYQGNDISGAPINPGHGVDDNEGASGAGHGGSGGRGSLTYGTPKTGFAYGDLYEPYIYGSAGGKGRGGTRGGNGGGMLWMNVTGLIDVDGLVSANGEDASSLTGSGGGSGGSIWMYCKTIRGYGRIAANGGAGSKDSSYPGGGGAGGRVAIYFQINETSTYFVYEARGGSALGCEVGKEHLCKAEAGGPGTVFLYHMIHTHRTLLIHNGGQKPLVSAIADYKDLSEDGCRAWILPQSANHDFAGRGRDFHFEELQVYGGGHLAVLTEPVGEKASLFFLHMIGDRTGTVHVSKNQTMDLHRPEIDTPFSAHVYAGGYLGLAPYTEVHGVTLFISGTVDHIQNMTIHHGGAFWMYHGGNTANQTNSSFEFDAVRVQDNGVIQAITSPIIHPGITIIARAFFVEGGGLFHGTKMTVLGENITVDDGGLISADGEGYNRTHPQGSGLHGVINPGIGSSHIYGSSGAGCGGRGGRGDHSAVVGAAYGDLYEPVRFGSSGGGDKSGRGGGVIWFNVTNVIQIDGEVSADGRKGDNSGSGGGSGGSIWMHCYRIKGTGAIKVNGGAGGGSSGGGAGGRIAVYFTENTTYTGSFQSRGGAKGGGSNTEAGGPGTAFLYHLVHTHRTLLVDNGGQHPLTRRISDYSDLSRDGGRAWILPESGGHDFANGSHDFHFEELQIYGGAHLAILTEPVNRAASLFFRYMIGDRTGMIHISQNQVMNLHRLFLDIPFSAYVYDGGYLGLAPISEMNKIIVYVEGTLDHIRNLTILNGGELHCYLTGSTGERIQRHYNFNETVRIMARSQIQSHSPNAHKETFSLTAKILLVEGGAAISTFNMNITAVNLTVDDGGSIDASDGGYTATKGPGSLLTNNWRRSGAGHGGTGGRGSCGGYHTCRLKKGLPYGNLYYPRDFGSGGDGNGGKGGGILSISVAHTLQVDGNIFSNSRAVNNDNGGGSGGSILIHTQVLSGGHTGVIQSKGGSGTAGSGGGSGGRIAVYYSNNDTHHPYRGKFDTSGGSVTSGAEAGASGTVYLKHTGSGFSTLRVDNNGQQALDDEIPNAGVLLDLSGGRADQGTTYNAPNGMTVTSSCAIRSPLCHNPCQSCRDYSIANLFDQTFSTSSCAGYFLSGCHYTKLTVDLKSLLFINHIRFYPFCSGSRPNFRVTTNDGIRNVPVTSNYVQISNGCIQGSYVDMPVRRKATQIYVELNHPTSNTYSGLSELEVFIDGKEVWDRYKYRSFDGAKTWIEPATGTDVYSFSEVHIRGSAQLAVMPLNGLQAPVHFHADRLYGDKSGFLHVGYNQNFSVAVTDPDIPFGLRVYENGSMMLPRRAFLQTVSFKSSGKIWGVQDLFVFDHGTFYGDSNSSLGKDTVPGQYLVKSLHVQDRGVFELHSTDKKLTSRLSLTNLTIFGGGHFKSNNKLHIAVKHLLRINSGGRLSHNHAGYVTKEGRSGDEFEPSEGPGQGIGSVHGASGAGFAGTGGRGTGTGLVGQFYGDYRRPDDYGSAGGFGLHYGNLYYGDYRTSSVGPYRNFITRSGLGGQGGGAIEIVTRHLILDGHLSADGEDGPPPSTAGGGSGGSIWIDCEELDGYGTISANGGAGSPSKGGGGSGGRIAIYQAFMLNFNGTLSAKGGNSAVEPGASGTVFLETRNNSKVEYRVLKINNFGMAYPWAVDKSQGRLRNLMRGIYTDTKYVGAVTWLHEADKYTLDEFHLHGNSHVALYGNGSRGNVTLYAHTLRGDRSGVFHVGRFQSVVFDFIDLYFPINTLVYFNATLEVPRRLSLREVYMEINGTLADSDDYTIDRDGKLFLWSGGQSLGEKQGHFRFINMSIKSLGLLHTTKIQGHGPVSLHTTRFVVNAGGLANVDDFFLYSVNATIDVAGDVSADFRGYGAEQGPGTAVQRTPYYTGAGGGSHGGRGGRGTAGLHTPSSYGSIYEPTQFGSGGGNGLNGMGGGQGGGRIVFEISDMLRLEGHVHADGEPGSRSSDPSGGGAGGSIVIRSFKFDGEGTVSVNGGSCPSTYAPYGGGGAGGRIAVYYNGSYTFIGSFQSYGGISQAEWGGAGTVYIKNNQNVSQPYSILRIDNRATRSGPSRLNEIQELHLAGNSADYPYYLTSYTAPNGVTLITTGIPYCGRTSHHDSKICDTDDSKISNLFKSTSNSYYTQNSNPVVTYRFPLPLFLEYLLIYPHCNSYHLTQFYVRVYFNDSEVVGSNGWINPTNCLQGQPLRMNVRQTVEKVEVSLQKISSYSSLSLVRFFVRENPSTTLQTPHYTSPSTSWIVTDEEKTTQHDFSELQIMGQGSLSMSGNSVKMSVNKVVGDNSGILTMRPTQSILMRGSEGHLPFSLLSQKGSSVAFPTSMTCREVEVTIRGVMGEMKNLTVGPQCRFVLDNSSETEFMLDHMVVQTDGYMAVLREDREDVKMVGKTFDIRGGAKMEANSLTLDYINITIEPFADLSSDGLVDEVSGSRYHGDGWGGDSGGGSSGAGHGGHGGVGGRQQKVGISYGKYRRPTTFGSTGGAQVFPFTGGLGGGRFKIIAHDTLVVDGVLSSKGGNARASRSGGGSGGSILAYTSRIHGDGEFDVSGGNGDSSTGYHGGGGGAGRICLYYRENHFLGRFLGFGGTSSIEPGGPGTVFLENVPGMNATYGHDRIDEAAHAERVLLDENVVNGTQWVRNRTLYANAMGRKPQSPDANLSSSYRDFSVGGSSRVWLILDDEDLEANGTDVELDELQLYGGAQLAIINPANTKAYISIVIGQMEGDRTGRIHLGFNQTFLSLQSYLPMDMIIYQGGLTTMQGELLVAGVTVEIDGVLRRCQNITVVDDGVIRMKEMYDLEGKPTETFYFEAINVRNKGTMMVTNQERVREFRGKSMQIYGGGTFTAVNLHVHVVNFTIDALATVHATLEGEKFDYEGKGPGAKYPTPGFAGGSGGGHGGLGGRSTSQVTTGAAYGSVKEPTEFGSSGGKGSSGAHGGRGGGILFLNISNTLDIEGTLSVDGANNGGSEAGGGSAGSILIRTVLLEGSGTVQANGGNGYTGSSRAGGGGSGGRVALYYQGGFFDGVLEAAGGKGDLENGAAGTVYVEKAVNNSDTPHRTLKVDNKGRPPLNERVNEIEEVKLYPGRLKDGRVYSDEYTSISGLRFKSDGSTMPLSLGSSSYYSLSRMFDGDLKNAYIVQSSSVHVVIEATLPRLMFIHHIRLYPYCHQHYRVSFTLTTDHPVIGWKDRTKGSRSFANCFDTLVYNDVTIEDTISKFTVTLTRLESNVALSELKVFVGRDTSSFKAITLEQDSSRTWLVFNDQGTMTFEVDELDVRGSAHLGIQNGAGKLDFKVKEYKGDFTGTVHVGGAQNWYLNASNNSVIPFTLRTYQGSQVFLPREVYLQKSSIYADSKLEGLKNLFISQGSRFDVTQGAHVNSPSKATILLDRITILNGGTFYQRTAEPAKLALNLTGELIINAGASMEVSKVHLQAHNIFIDIDGVLSASGRGYSSMKGDEPGRKSDVAASGAGHGGAGGSSTSQEYVGRAYGSFQIPLDFGSGGGQGYQELPGSSGGGAVKLSASHIVQVDGLLDVSAGAAVNPGTGGGSGGSVLIQATLFLGKGKIFADGGEVAHSSLGDAGGGAGGRISAHYKSTRFAGSFSAHGGASRSEAGGPGTVFLSENSTHTTMIIDNNGYRASKLYISDYRDRSNDGGRAWLLAGYMDEFTLNLLQLRGGTHFAVYHLKPSFTLNVERLEGDLGGLIHVSKKNRVYIKNAPRLFPSSFHIYNEGFLHLPRDVLLKDLFYPRISLEGLISGMDNLTLGGGAEFVVTAEGQTDGYSKKTIHLNSLTIMNDAKLVATDTVLDSPTVTLLLNESLRVMAGGWIQGKWIDINAGDIEVEASGVITAEKQGHDAKSGSGSPSGSAGAGHGGRGGLGDSGSSPGNSYGSLFRPLSHGSGSMFAVGGGVIKLATSQSVTIDGLVDANGEDALNKSSGGGSGGSIWILSKVFKGNGVIRASGGSGLFFESGGGGGGRISIAFENRTFSGKINVFGGASNKTAGGAGSLYLHNKYTDFKQLIVDNNNIGSPLTDDITDVSNDGGRTWLTPEPNTIEMSFDEVDIRGQSQLAVLTTPPDSPFRWNIGGIRGDRSGILHVRANQEMHMTISDNEGKQPQLLWGVNVYPRGDLKLPHNLVVDGIKIITAGSLSGAQNVTVGNNGRLILRQLISPNNQMSRNLTFDVIEIQGGGRIEIQSDKDGLSIKCTALWIRSGGVLIADRLSIVADSVTIEQSGIIDLNFKAVVAGSGPGAGYSHYSGSSGAGHGGRGGRGQSENRTGGFYGDFISPKMFGSSGGGGSGDDIATGGGVFYLHAQRIVHDGEISVNGKDALNNSDYGGGSGGSVFIEVEYFDGSGIIEANGGAGGINGGGGGSGGRIAIYYNQTFFTGQIFAYGGGSTVESGAAGTIYKKNKHNGKSILEVYNEGKKPLKKAIADYSDLTSDSARTWLTISHIIGSPVPVTVPDINLGTTVYKGLTITEVKLGGSAHLAIEPDATKIRLHTFAQFYGMFEGNSFGFVHVGPKQLLAMPDTDYYIPVNLKVYPSGYIKLPDRVMLHKNSLSLDAGYLIGVEDLAISQCTVSFGAGSGAQSTGSLQAMYFKIQTLTIMSQGILDMVAPNSNYSLHIDSLVINSGGALNGRKVDIVAKSVTVDESGKINLDGQGEKCLDPNVYYAGSGGSHSGYGGFGIGSKRQDRFDSVFLPVAFGTAGYAGRSSFSCMGGSGGGSLNLTVDGTLQIDGQISSRGQNAKDSESGGGAGGSILIRITTLEGTGTFEVQGGDGGVTSGGGGSGGIVAIYYKISSHQFSYKVRGGGGKKIGASGFLYTKRDLQRSRRQVSESDSVLILEGREVLSFESPSVLVCDPKLIDFTFEEVKLLKSSTLTMISCSQGSPMTLIAKTIKGDKTAWLVVKPNHDVYIGVTSIVEPSMELEFNAEIEDGGTLSVPGNFFISGDTQINLSGSLIGVSDLTVNDKGKLVLNYPGHTGFRITPDQGKSVVQISTVRIKDGGSITTTSPSKVEMKSDLLQKDFGGNLGPGISVSSNQRIEHTNGPSLNKQGCPHGYEVVEVASKTLYNPCGVGKHIFNKRNESYLVLKNVSVAISHNETIYVIKNETRFNVTYYIACDYDDFKLLPGQSCNLAPGSYKYNSLEIQGSAAMYFEPGTEKGNASTLSVSKLTIFSQGQLIAKTSNFIDAISTPSDYGGSYGGLGGGASENSALYGNISFPVDYGSNGGGSSQNHGWGGGVIILKTTELFNDGLIDASGGDGSSGAGAGSGGSIQVVTDYMKGSGIFRARGGNANFPAGGGGGGRVGITIRKGQSEFRGLYDATGGDGRRPGSSGTVFVRDQRQGASYEMIMFWNKIIGYPPAQLPNTSAPYTYDEIRLENRGTFLATAQLVVAKSFVTDGTGKLTISGGARVDILSFSKSSRTFSCDLEIQAGGSLYFYNQPIFLGPGSPTVVVAGILDAREPSVGKGKSIKITSTGEIRLDKLRLLKDSVMRVDSDASIKKSFSYAQFHLTSLRLDTNAQLIFAQENVSLRADLIHLSQGAAITSDTDTKLINITSNDILIDNQARITADEGGFLGGPGKANGSGSGCGHGGRGGGGQGGESYGSVFEPQHYGSGNNARGGGVIFLNIKGGFTLYGSVSANGANDSRGGASGGSILVHAGTLSGHGEVLSNGGEGLSNSAGGSGGRIALYITDRTSFKGVLTTYGGCGTTCAAAGTIFIREYVVGLPQNSTVIDNGDRKTEANTIIMHEMKISYTMRLLKLVNGARLEVATVPNVEMKIAIQNLEGDGSGSFHVHHNQTLTLGAGKAVSSRPFMFPWAMIVDEGATLNLDPVLFITRTAISPSLYLAGKLTGGEKVTVGQDASVVIAKSGVIGTHSNTPGKYSFLSLKVSSGGRITIEVDEDANAPVELKSLSVDVAFGGVIIGRYLRVDTSLLNIAFSGTLQANGLGNPAGVGPGAGSSSLLTGGGYGGCGGGNTNETCVVYGSLFEATEFGSGGGTTQVPDGIFGSGGGIIEVEAQVLIVDGTISSNGESGSSTTGGGSGGSVDISISQTFSGRGKIKAEGGYVSGQVTGAGGGGRISILITGDNKFSGSLSARGGSSSAKSGSPGTVYTEDGKTVLRKRKLFLDNGGISSNSPLPIFLNQSVVASYDFQEIHLNGLVMLHVDKDMEVEKLVTDSDSVIYIKDNVTFTVEPNSKYLQPDCSFIVDANGEIRIPDKVMFLGRNNIFKGTLTGILDMVIGENRKVYLLASARTARYIDGKYTFITHRGEYRFSSLRIKNGAFFSFENAHLKKVPLTLGRLEVNFGASMQGSWLDIKASDVIIHSGATIDLSAQGYESDKGPGAGGLHQSDGTGAGHGGYGGISTVNFGKWYGSALNPNNTGSGGGSSSSGKGGKGGGYLRLTVVRLLTLEGTISVDGDGGTVLNSGGGSGGSIWISADNIQGNGIISAEGGDGNGTGGGGSGGRVALYLQGLMSFEGLLNAKGGDGKDAGAAGTLYIQDNNKRIPRKRLWIDNLKVGNNKPQTVLYEADRVNFLFDELRLNGMSRFEIYNLQRKLQTIQVTNFISDGVGEIAIRKNQTLLAEVLEAKESHLTLTTNIYVEEGANLVVASNLTIDGATLTLDGKLSNVRHLVVESGSAIKFGVTSQTTLMENKNFVFQSDPGTQQFASVTLKSGSDFGAPLNLKLSVGKLNMKSGVILQGKFVDIKSQSLLIGRGATLTTNDIMEIELNAGGRGHSSSNGGSGGGHGSIGGTGYNTLAGGIPYGTIYEPNQPGSPGGDGGSGDSGGKGGGVISIDTDILENDGSITANGGDASQSSQAGGGSGGSVYIIASSVFSGTGTVSAHGGRGDGAGGCGAGGRVAIHLKSQYAYRGTLEALGGISSSSGASGGPGTVYIKDVRYKLYFEQLHVDNQGQSWQNYVTLNESKTSYHFHELHLVHKASLRMTPSSNLTQSSTLSIGKLFGDRSGLLHLYNGHKAIIEVVEAQLTTTKTPVNLRIDSGAEAVMATTVYIVGDGAVALHCNGTLNGVRNLYVTQKRVVLLEQGSRTLRDDEQPGTFMFSNVKLFSGSSVTMKDEIVMKIIAGFLNIKFHASLEAHYFDIVTSNLDVETGGLLSVAGDNKARLAVEPSEVSSLPQGAGAGHASNGGSGYGGAAGGLYHGSLYKPKESGRRGGKGTNNGIGGRGGGYVKIEAGTLIINDGIITVEGGSAVSGGGAGGGSGGSLLFNTESFIGYGEMNSNGGNGGGTNAGGGSGGRIAIYATENLYRGTYQAFGGSSVSGAYGGPGTVFLQDIRSKRPFKQLRIDNLMRSIKDPVTIDEANLTNHDFSQVHLFGRAAINMAVRKERTTLKMSRLFGDRTGLLHSRANQTFYLEASATEHSVSKPAVNLRIDENAEMVFGASLYVIGDGAKGTGQITGDSSFTIDGRMIDVTHLFLTKRLKSRFLSHAHSADYHNETLTVSAVGTFVLATFEIQDGSEVFLPDVQGVQCEVGLLHMKYGSVIVADTYRIGVTSLLLETGSKITASGKVRPSGYDSSVLPSSCKGSGGSYGSKGGKGHNGVNELHSYGSIFTPGHYGSAGCPGSQNGGKGGGLIIMEVGDELYLDGTIANDGQDAASGSAGGGGSGGSIWIKCGRFNGHGLITSNGGAGDGLTSGGGSGGRIAVDTPTENKYVGEYTAIGGDSGDPSKDTTQYSGGPGTVFLKDARNQYAHTQLRLDNKGRTWDHYVTLNESLKSYTFDELYLARKAAIHLVPDGKPLNLTVHKVEGDRTGLIHVHENQTLKAEFLDAVYTITRTAANFKLDKGANAIMATSVHVVGQGEVAFEWNGRLIDVQHFHVAYGRTIKIGFYAHTAGTKAGKYRFIDGYGTFRFSTLEFGSGTLIHYPPPMGVHFIVSLLDIKFSSYFEAEFFKIEATDFYLEPNATLNCAGRGFESKTEGSGKDSASGGSGAGHGTPGGDGKDVSGGEEVGSVYEPVLPGARGGTRTGRTTGSRGGGRVRASVGFAFRLDGIINVDGDDAATNSGSGAGSGGSVWITTGYLRGHGDISARGGVGNTDGLATGGSGSGGRIAVHVKIKDEYRGGFYALGGVSSGTQHGGSGTVYIEEIQGDKLFRRLYIDNQNANPPKIFTLDEMNPKTVKANATEENDAEFGFDELMLQRGVVFRIADMRLSKRPAISVITVLGDGSSVLHVMENQTFFIEYQEYTRRRSFPPVNFKVDYGGELMLVSDFHVAGKNNPAFELEGRITGVSNLSLTEGRVLRAGENMSSALLKDKVYIETPIDGQLKFGVFIMEASSGLYFAKRMKFVVSTLYMRQKAVISADKIHMALNEVHMEGSSRITTSGKGPKAGEGLGPGSSFSNVGSGGGHGGQGGPGSTVDGGTGYGSYVYPVHPGSGGGGDGGGAGGCTTEITVGYSLHLDGIIESEGANGTSNSGGGSGGSILIKTVLFSGHGLIVANGGRGDGNGGGGAGGRIAAHVAWLREYAGQYTAFGGTGFKAGAAGTVYYTDTNQGLSHRPVLINKANHTVFGDGFTKLTVDNFNRNPDIPTIIINENSSYYEVDELEMRNHGLLHIHGNNSSFVIHNFTGDRTGLVHLRQGQKMFVQVVESKSGYSVAPVSYKIDEGAEIVFPSSLTLLGTRCSFDGLIIGVHRLIVAEGADVVFASTTQTGIKEDRKFRFLTTPGNVTFAEVYVQKGSRLEFSRINNTLVFTAIIFRLKYHGLVNINHGEIDSSWAWVESEGKLVLDYTGHPAEMGSGPGNTVNLIGSGAGHGGMGEVSQAGQLGGEPYGSIYKAVHLGSGGGNGNGKGGSGGGMLHWRIGQEIELDGLVTLRGGDGSGASAGGGSGGSILIETTNFTGYGEINVMGGDGSGPSGSGGAGGRISAHVRFRHKYAGVFKAYGGDGKTYAAAGTVYIEETARGPQYADLKYDKSTNTTYITATHRYMEVDNEDRKTEVSTMMMESEHLFYELDELFLTRHANLQVRHPPGSLNVTVIVHRFLGDGTGRFHVRINQTIYVEVVESEINETTAPCSYKIDQGAEVVFPAIVNIYGTRSIIEGRITGVEHLIIASGGFVEFTSTAQTARVENRRYVEIDENGNFSFATVTVERNSRLTFSRILNYTLSLRCSEFRIKYEGLMTMNHGYIYSAFAWIESEGILSLDGTGFGPEQGFGHGTTKNNFGSGAGHGGEGGKTDYGEGGIPYDSVYTPRLYGSGGGNGRGIGGSGGGSLFWIVGQRLQINGLLSSKGTDGEGIDAGGGSGGSILITTTNMTGHGEIAVPGGSGTGSGSAGSGGRVGIHCRWRYKYGGKFTDHGGQKGRYGGPAGTIYKEENFRPLQYRHLKYMKETNTTMLAVDHTYVHIDNDGFDVPGATLLMEENTTYYEFDEMELTGYSRLLVYHPGNVTVTAVVHKFIGDKSGQFHIRRDQKIFVEYVESETNKTEAPCSYRIDVGGQIILPSEFSMHGTRSVFEGMIIGVRDLLVSLGAEADFYSTSQTALIENGDYIAISKPGNISFAIVIVKKGGDIEFRKNTGFLRVNVDELKIKYQGKLSMNHGEMFSTFAWLDSQGHFNLNEGGNTAAKGQGAGSTVNSIGLGAGHGGRGARSGGQAYGSVYRPLVLGSGGGNGGGTGGTGGGQLLWEVGKRLELNGLVSAIGGTGNGGHAGGGSGGSILIKTTNMTGHGEIAVTGGDAINQGGGGSGGRVGIHCRFRYTFGGKFTDRGGFGTQSQYGAPAGTVYKQENLRPLEYRILKYSKETNETFLAVDHTYLHVDNEGHDVPEATVLMEEGTTDYEFDEVELTGYSRLIVYHPNETDVTVIAHRFIGDKTGQFHLRVNQTIYVEVVESETNRTEAPCSYRIDEGAEIVLPAEFHVHGVRSELYGLMTGVHFLFLEDGGTLKIASSAQTALTENRTYIDITQPGNSSFAHIIIKQGGLLDLVRVEDVAVSVTSSVFEVLHKGTVRVNHGVFYSAFAEVETKGVVVLDGAGYKAATGPGAGSSYSSNSGSGGGFGGQGGRSHSNSNGGSAYGSVYKPLSYGSGGGHGRWNGGGAGGGSLWWQVGKLVHLDGLLSSKGESGSSNGGGGSGGSVLIETTNMTGHGEINVNGGDGQSNAGGGAGGRIGIHVDFQNNFGGKFRSAGGNVSGYPANAGAAGTVYKYESRRGPQYRDLKYNPDANLTSFKPEHSKVKVDNENNNVATPTVIMENQTVFYEFDEMQVEGHSTAIFYHPETARNVTVIAHEVTGDKTGIIKLVSRQRLFVFVVESTHTYMDAPCGFHVEDYAEIIFPTEVILRGESSTIRGRITGVERLVIERNGFIEFGGTAHTAQLPEESQWIADNPFDPFTPGLIIVPQLIISNTGVVKVKMTPIRVVLDIADTNVKKGGQLILYTNHVTINADFVTVESGGLIDSSGAGYTAASGPGAGSGSTGGSHASPGGGAASGTQYGSVYLPDEPGSGGGYGAGGGQVYIKTGGYVIVEGTIRANGNGLSSKSSGGGSGGAIVVRSLFLKGYGTIECHGGPSHDGAGSGGRIAVYLTEHFIFRGSLTALGGDSGTTRYGSPGTVYIDVNVGEEPYRIVQIDNKNRDSLLPVTLAEANTSLYEFERIHLVRKGALAFKAVSGKLVKIFIGKATGDKTGILLALRNTRIYVESHSVRTEAPVNYQADTGGQIVFPIQTTLLGTRAPALTVNGEIHGIEELRLSSNVGSLVAEKGFSACLDCHSNYTSDYIGHYWFKKLQVDLGGTFEVQSSVQTISSQAVRLHMGEIALDYTGSLKADAAKLLTEYFSLEFDAATDASSSGWSSKQGPGSSTTCSGVAGAGHGGRGGTGYTSGCTSCTANGGNTYENVSQAIQAGSGGGVNLADGGGVVFVSVEKLLELDGSIKSDGANGDYGGGGASGGTLWVAGRHFEGHGHLTVKGGAGSHRSECCSVSPCNSHRNYHGGGGGGGHLRHFSPDYIRRDIIRNRGVSGGASGGGSAGNGGSGQISAAGNQCSGHGTFSVQEGSCTCDAGSYGVSCLYQCDASITCLGHGRCSASGGCDCDAGYVGYRCEHMCDATRDCHGNGRCSVTGKCVCDPCYSGDDCRYECSGNGTCIGGKCKCDPCYIGTHCHSLCSGHGTCNNGTCYCGSKWKGDYCEVPKCPNDCSGNGICNSALLTCFCNPGWRGLDCSELDCPGEPDCHNRGTCSSINGTVMCVNCSVGWMGPACNDPCVNGVQEPMDSGFCKCNPCWAGKGCDALCMGRGTCSDNGICKCDPLQGWRGDVCQIPGCPGVGKDCTGNGDCNSATHECTCYPGWAGLGCDIPDCPGAPNCNNRGYCNASVTPPQCQNCSRGWMGAACADPCTFGEQTPMDSGQCVCWPGYTGVGCDSECSEHGKIVNNSCVCDIGWRGDLCDNPGCPGIGSDCTGHGICNSATHICTCNEGWAGEGCEIPDCPGTPNCFERGLCNASVNPPKCQNCSKGWMGPACNNPCVHGQQVPMDSGNCVCEPGWVGVGCDSECSEHGTIVNSKCQCDIGWRGTYCENPGCPGDGEDCSGHGECNSALHTCICQNGWTGDGCHIPDCPGNPNCADRGVCNVTYNPPKCTNCIAGWMGPACEDLCTNGTQVPMDSGNCVCDPCFAGRGCNVECNGYGTCLENKCRCDELTGWRGSLCEVPGCPGSNGKDCSGNGKCDSANHICICDPGWTGVGCHLPDCPGVPNCFGRGHCNATNRVTPECTDCIQGWMGPACNDPCVHGYPKDGICVCDPCFTGSGCQSECSGFGECIDNKCDCGQEEGIAHMGEYCELPGCPGQCTSLDNGFCSMDTQKCICAQGWAGDDCNTPDCPGEPICSGHGSCSNSNPRRCNCEPDWAGVMCELPCVNGTNYGNSSGCICHSCFSGSGCNVECSLNGKCVNDKCVCDKILGYKGDVCEIASCPGWPFDCSNHGSCNGATFECTCVPGWSGAACDIPDCPGDPDCNGRGACTPSIADNETPKCICQQGWMGVACEKPCKFGTPTADHICDCDDCHNGPACDMHCSNHSSNCVNKKCDCGFDGWRGNYCEKKGCPGYKKDCSGHGQCLSASQTCICDPGWSGIGCEQTDCPGDPDCNNRGQCIPAETPYCGNCAQGWAGIACELPCVNGTQNQVDPTVCDCEPCFNGLSCDVFCSARGNATCAEGKCYCGFEGWRGDFCEKKGCPGLFNMDCSGRGTCNSATQTCDCNPGWAGRGCHEPACPGTPMCSDHGTCESLATISFCSCDKGWMGRACETKCEHGTPQQTADGSFFCQCDDCFSGISCDMECSGRGNCTNNTCDCGFEGWRGPTCDTKGCPGWGSDCSGHGSCITALGICYCRPGWSGRGCHIPQCAGGGNCSGHGVCDGVNHDPPVCVSCDSGYMGEGCEQRCINGTVIKSGEGDTCKCDSCHTGVDCGVECNGHGKCNNGKCACDSGWRGSKCETIGCPGQGVDCTNHGVCLLVTQQCDCFNGWKGEGCDIPDCLGVPDCNALGTCYGGVDPPKCVNCTNNTMGPSCEFPCIHGRENPPDSVICECDPCYIGLACDTECSGRGTCREDVNPKRCECDSGWKGPTCETLDCPGEPDCSGRGACVQQGTPPTAVCLCNQGFDGDDCSKLVCPGRPMCSNRGTCTLVGGIPVCVCNHGFDGSSCERCLPQFTGSECDKCITNYIGWAVGCNIYCVHGNGTGQNEDICTCHNDANLGYWNGTSCDRCVFGWGLPSCAVCDDAHVGENCNIDCFSAHAQYRDELDGDWGKHPVAPILNCLYENAHDEVFAWFGYHNKNPHNVYLNVGADNFLTRPYLDIVPGGLKGFVLKTGGADNATDNLVPLPTQDYGQPNKFVPGRHDKAFKVRMEDAFPIAWVLAFPLSNERNAAVANQSLLHTMKCTDIEAQESNVSSENYICSCLDGHWGFACQFDCPGGPQAPCHNNGFCNKTTGSCSCDPNWRGDENCTTCSPGWYGLDCSVVNQSTNNYTAAAYGHGYFITIDGAGYKFLGNGEYHLLLSHLWEVQARMVTCFSSSSCVNAVAVRIEQHTLLLHSRFVNRKEPVVFANGKRVYSVDFEFGPASHRFTFKRTSRLQFVLSSSYGVRLIIRLYDRYLDVHLRVDNQTYCKTSQGLWGNCNLNSLDDLYSRDGKIVTGLNVSQSYVTELYAKSWKVTERDSLFVYDINNYHEQRELYGGGYALYFNNSGAHTEEIYSFSLSDITIEFMVRAESENGTLLSYTSTDMFAVILESGKIKLRYDDIILDTLAVIQRHEWNHIALVWSLTTRILQFYHRDDTRQRVNSRNFPITSNVNVFQPGGILALGYCIPPPGGLTLSLTEGFIGQIDELRIWNQKLDPFSISANWRGNLGCTMRVPNLASLWKFNEGDGVVAHDCVSGAHIYFKSGTWKGPMWVFSMVEIPQFSVDTSTAYSFRFGSSWMSAEQLCYNLIFGSMLKSGHILLTTSTLWFYYMSCVTSVTRSNDHGHAYWTLMALSDFNQLIVEQSSWFAQSLCNSVSVFNFPVWYGKNCDYQCKFGLPGTNEHKCFCMKGFFGLNCSSECPGGNNVPCNGLSSCDISIGTCNCPVSSNTTYDCSVCSPGWIGSDCSVSLGENRSSSENFTCQGFGATHYTTFDGVGYNFRTYGEFYLMKTNQFTAQVRQIPCMNASFCISSVGVKIGSIEIVIRASYNGTGMPLVWLNRKLTDATSVILENNFSFQRASPKVYEIAKPERILLRVKAWQDYLSFELTSASQWCIVGSGICSSCDSNVVNDFTNSTGTMYWGGSISESIIINILRSQWQVSAIDSLFIFGYTSYKERREITSNGYALSFNGTTASTGILNAFFKDFTIQMFVKVHSSGGTILSYSNKFTFALVNDVRVKIFLGGASYDMGITLPSGTWVLVSIAYRASTGVLTYYQLNAQGDLYFKQTYIGVEMLTSGGTLWLGHWHITKEHITGVPLQPFFGVIDEVRIWSFSLDTLLIRQSFRLVITANLPSLSALWLFDEGVGRVIANFISTSPNMYLPEVISRRPTWQFSYVRDVFPSMVVSASVQFSVSFGLLAKKRCLELIYHHHLQAQCGKLLSAAVTQFYFKACLFDVQSSSSLDTAYIALIAYADYCMTVLHLSSWPAQRLCKQLPQSLPRGWIGPDCSVKCVFGSADKNNASLCVCHRGYWGEDCANECLGGGNKPCNDHGNCNVRTGSCECDLNWRGNGDCSNCTPGWTGSDCAIAVAVTQLPTCSAFLGGHFTNFDGAHFNFFGVGEFWFVRSIHFNGQLRQIPCYNGESRCINAVAFSFLSGWKVVVHAPYEESKQPVVWLNGREAVYSSTRIQISSDVFLEKTSSTTYLLSSVLKGFKFQLRVVGRGLVIAGHVNQSFCNGTNVLCGNCDSNRDNDFNVTAGSSLEEIWRVSMEESLFIYHDTGYMEERAVTGAEYALMFNGVGVCSDLMPDVLNASSITMELLFKMYSEPKVGGVLLTYSKAISLTLFIEGTLKVRIGIEIWDTGLSPQVDSWNQVTLVYYNTTGAVYIYHINSIGVVRLATRTMTAGIINRGSIISIGQWIPSLEINTKESDSLPGFVGVIDEVRFWNREFSLQDVTTSWRVNVLSNARYLVILWKFNEGQSGVIHDLISRVHLYIPSIREAPRWVFSYADIKVLPVTPEITFSTSKVRVEAESWCHTHIQNSPLGIACGGLGGGTVAFYVRACLRVIASGKQVSLGISVVVAFADTCEIQANLTIWPARQMCTYEVFRNSRLMNWIGLDCNIPCPYGYQPLGLYGSCQCDSGFWGQTCNGVCPGGLVNVCSGHGNCIDSNGICKCKRRWQGALDCSQCTPGFFGKDCSVAVAPPITELPVTSVFGTGYIVTLDGIKISVNVAGEFRVLALSRYGLSIQFRQVRIGSYVRVRCVIVVVQQDVLAIHSSVGVAGQVLVTLNGLPISQNSLVSLGVSGFEFRRTSLNTYVMVGPEGFNFVINSLAIHFDVSITMNKDLCQETCGLLGRCHIPGSRVPPSNCTAGGILDTQEVSNITQELLISYVNSWAVPQNESSFGPILNISGEPQLSSVAGSCLYFNGTSVISAPLLNTFVGNYITIQFFVKAKNPDVYTGTIISYALNETFAIAVNKTIHIYFGTTVIDTQLVLERELWNHISFVYMRSSGQVQFYLVNSIGIIQSKVFLVGVGIFADGGTLALALWQVTKVSLSLPGFVGWIDELSFWNKRFDSVTVLQTWNSNLQSGTPGIALLWKFNEGSGFICRATVGSLNFGLPTPPWKSPLWYPSDAIKEANVFITSDLSEKEPDKSTQDLCSDVFLKGPLLNECANVTGGSKFYYEACLSEVSTSGTPESALMIATNFAKECQAALNLSSLPGKGLCNVIPGGRYNDWVGVNCTTKCEVGWFTDGDCKCDNGYWGINCSRECPGGAANPCYGNGKCDIRSGKCNCHPNWDESENCFKCAPGWIGKSCSVAVSTTESSVTRQTSKVCIILERGYVTGFDGSLFTFTTLGEFIMINSSILQVQVRQVPCEKSSVCLNAIGVRFNDLTISVHAAYESDSFPVVYVNEELTKVGGEPSKDMLKNNISIQPIYRSAYRIVVSHYLSIQTVFSDRYMSVESTVTSNFCQLVDGLCGSCAKLRVGQNATQGSGGSVTSIDRPTTVLEELGKPNATSDNVNEFVTKVFPVQDPIIVIDAEMHKETRVVYGGLYSLYYRFTAVVTQTVVKLFVSQTLTFQLLVKSCNPQICGGTVISYTSNVTFYISNHVTVRVVIGLDVFDTGIATEADVWNQITVVFVREKLQLFVFVTFSSGLVQVRKFSFTIDPFISAGTFAIGMWQPASGSISVQPTNVFLGQIDEVSVWERPFDYALVEQSWRSNIQLGAPSLTNLWKFNEGKNSIVKDLVADVALLFPRYPLGKPEWVFSDAPITSVVTVNPNEDNATLHTIAIKVCFEFIYEGPIHSACNALGNVTLEFYLRACVQAVVDTGLTVESIDVVITIADYCQKIFGLPYWPAQSLCNKFPGKRFPNWMGKNCTIPCIFGQAANESEVCVCDPGFYGTNCSGICPGGKGNACNNHGVCDVVTGKCSCELNWQGNENCSTCARGWVGTDCSIAVTQWPSGSVIIGIGAVSLGGQFTSLSGVSYSLQVTGEYYLIYSIHLSVNVQIRLVSCTQQESCINSIALQIASYKVVLHGPYSSGGSLIVWLNGKVIDIDLHPITLDVYGFTVSKITAHLWEVKYAGLYLKIRVTGRFLSVSVEASGLVCKSSIGLLGSCNQGLVQSLLSYYPTKDCSEEGFMFNVSGNHSNVFRQGSDILSEKNASDTKAKTQDIISTLITTKLKVKKCHSLFEYKYKEIVEYREANAGYALYFDHTTVVTDVIYKAFSFTDITVKIMFKTVRYGVIISYTMRKTFFVTNTGGKFTIFYGENVYHTNIVAERNKWNQVSLVFRKSTTVLQFYYFSSGGQLHRLDINVGFDIFTPGGTIALAGWMPSLDGSGIQPTDFFAGFIDEVRIWTRYFHPAFILQTWNRSVSVNAQDLAHAWKFNEGEGIAAVDKVTGMKLVLPFKPWRKPEWRYSDVELQLPFYDRPLDFSFTNKTLQVAAELFCNRTLLMGTLHSHCKSLGPGVSTFYFRSCLQRIATSESLYMSMEVIIAYADYCQAFNNLTVWPAKHLCNEFPGREFPIWFGERCERKCIFGKKLASETCVCYHGYWGLECSNTCPGGAANPCNNNGMCNVITGECECNVNYNGTQDCGKCSPGWHGFDCSLALVSLNLNRHISIGMSSTGGHYVSFDGYSFTLVSLGQFYLMNLPDLSFQIQVRHVPCRQQTVCVNAIGIRITTTVVSFHAPYTTGGAPVIWVNGKLLLLSGLISTLGSPHLGILLKYNGRNYYQIIWKDNFAMAIRIHGRYLSFKVDVTSAYCYNSTGLLGSCDNEPDNDLKVSPNGSIIPANVTQPVLNTEIGSHAIVYDKDSLIVLKYEHYHETRLPTGGMYALLFNKTGASSKPLIKTFNLNVDITLEILLKPYQFSGTIFSYAVLQTFAVLIESSLRIHFGKAIIDTGVNVTINQWSHVSLVWYHKSRVLEFYHFNFKGKVQRRSYVLPSNPFLPGGILSLGQWELSPGDSETHTVASFVGTIDEIRVWKRAFNPAFILQNWRMNVVPTHPDLTGLWKMNEGESDVIMNLVTDEHIYLPRSPWQQPHWVFSDADINTNLTSSDKPFEMHFLNKTLEKMAKAFCFELFYKSTLHDQCHGQLKSELEFHYLVCLIDIATTDDISAALTVIVTFADHCQAVLNYSTWPAQPLCNKFPGSRFPLWIGDRCDIKCVFGAADPDDRNLCICMEGYWGSDCSQICPGGLLNICGGHGWCDSSTGQCQCQVNWRGNENCSSCSPGWNGTDCQFAVELVTSITSQTVLVAAIGGNGYFTTFFGVSFTYRVVGEFYVLRSASQNLVIQLRQAPCPIDGSYIPLCTTGFSFSLNNNVIAIRAPVATFSRTVPIFPLIWLNGNLVQVDHRTQLSVDFVMLRISTVAFEIYGPNGVKFGITVGHSLSVTIHLPAIYCRNSTGLLGACTGVSFNNSNSLESYITSLKHNSVVDKSQTLFIYKYLHYSEYRSPSGAGFNLFFKDHSVRSGPLQLPPVDVLTIELLIKTHQTGGIIFSYLSQNIFAVIDNTTLEIIYNGSVFDTGLKLEIKQWNQLTIVLKQLVGTLYFYHVSSSGVVKVRVFKLDGNVFSNGGVLALGQWQPSPSSDSMLPQSSFVGEIDEFRIWKRRTNSDLVKSNWRLNVQDGIYPDLLHLWKFNQANGRVIPDILGKNDLFLTKFHEPQWTFSDADIPRLNPEETTFVNLSLQRDAESFCFSLILSGPLYANCEDLSIQVAQFYYKVCLHDISLSSKLRSAVYAVVTFADYCQNTLNLSEWPGKELCHHFVDQRFPYWIGSRCNTRCVFGYPRPDINSTDGVSCKCEQNYWGVDCANLCPGGLRETCNGHGVCSVTNGTCECEPHWKGNISAEYNAPIDENNSVSPIPCSRCTPGWTGADCAIAEDSSILDNSSIPRIAINFGDPHFTSVTGVNFHFEAPGAYHLFNSSIVDAQVLIVPCNNRVSCRRISEVSLRTSKTELSVRYNDFETVVSSLFDKTSNTSKELSKSDEWAEDADIRYRWLTDNILEVRIQDEIQFNILSYYGTIGTAIEVLKQRDQTDGICGEKESSWIRQQGNQSLTSKNQIADTSNNDTTDQQGLTQATIQKRLITRFRIMEKDNSLTTKYASRSYSGAGYMLEFSSGNAAVMYASNTSLPVLDEFTIEIWVCLTNAGESVARFCSSDQRNNTEPVTGSHAVFSVVTAIGDFAIVCNDGLQVKWDKEKFITDINLYEGVWTHLAVTWRTIDGRMQAFVYSNGKHRQSTTYGIKNGKQFSFNGLFILGRYMRGYMVDSEYDMLGALDELKVWQYAKTMEQIRASMSVKFEDYREGLVLNVPFDEGMGRTTVGHLYSPISVEVALSLFEAQVVNVTNIHLFIHSGDYPGWAPSGVHLSPLANYSLAFLNKTLEGKALEKCYESFYEGKLQEHCSPKLVSQALFYYESCLADIADSGSLAHSKLSVSLFGFYCQKVLGIKECLLHGTYDAFLRCPGDDEKQTKFTPLEIIVTTVSSLLFLLFLLIILILVCRRRKRRKSEVEQIYLHEAGGERSHKYVAEDEGDHPQAYSMRQMLDEYDFEPDMDDSPHDTPSVVRKPLVRNPAGGVLPEGEEESAV

>Pocillopora_acuta_HIv2___RNAseq.g30830.t1
MARLDRTRRWSMDPTRNITPSYYGGQELRRKRLPPLSEIGPAPTKPAKPLHLKLLLKKFQEMVFLSGRDYSIPTTVYGPNIYTTLKEKENIAIGELNEDVSTIVEPPMMIRNRRGLTLRKEEGHLAMEEIYDDVSTTVEPPMIKRNRRELTPGKDENSKDATASKIVTSNGASLSVKGVTLTFPPGAVEDPVTIRLTLEEPYRYCYLFARCGLQNDLIFVAPIVNCQPNGQKFKNHITVEVTLNGKRANSHGDLLVLHGTRTGQSQKPNWEDITDKSKFDFETKELKVKELKVKVSHFSLIAVLARLTWVRTKEIVTRLNLMPFNYKLSVLLKSNRQQYPFDELALAFMSQDTYQEEYYRDHDDYAIMRLKKDGFEELSMDCRNSQENNCIYNKEILTISIQLGEDYKPANNQQECFKVVVDSTAWWNAGHVIKLPLQVSNANSKIFYGKILVKGEYGHVRENKFCQQDLCGYVRHVLAVKRAIFDVKSVAQKLELPVETQQQVVACWQGEEEQLELAIQHWREKHGDAANPNNLKKAIEELEPEEFKVKERGQMHIEHLRDLAFKIASLRHQDSDLMKYFGREVEMLCGAVLRDCCIENATSKGGVESATQTENFASVLVMKKRLPESVGKMCTRMFSSQEDEFITSSFLCIVSEFIQAIKEIFREEKIQRTPSGLRGVTEEATLQNVTSGIQGAAHDKNILKLLNEVCYILEILCRNNNNDDRQEESRIQRWGSGVMMGDMLEFLERFFQSTEARRLYKRLDLFSVSLRDIIKNPTDFEGSFLRDFHNIAVSLLKFAKFDPSHLVQFELKDPTNSINNQTRDDIKYDMSKETFSQLFWMIKESEETLQLEATAHVHIGGHLLTIGAFEAVIVLPESSGEVVENAPITSFPVSFKSETGGDSGNLRVTLYSRQKNDISKALEYLQNALRGHLILPKQPEIKETSLKLVNVAKAGTDVALRLTHRWGSGTIEVESNSFVPQPLGYSSPESQHNLEITGRSECIANYYNSGNNFMTGPSSAELSPLSPSTTQSTVINVNNYINHTVNMEGHVCVDGENPSLNVQAPSSAVQRFLEAGPGAATNRSITEGANEDS

I also copied the subset fasta files to /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast on Andromeda. I want to blast them against the nr database. In the scripts folder: nano blast_biomin_subset.sh

#!/bin/bash 
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=36
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.13.0-gompi-2022a

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

echo "Blasting Pacuta subset biomin genes against nr database" $(date)

blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db nr -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt 

echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)

blastp -query Biomineralization_Spist_subset_sequences.fasta -db nr -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt 

echo "Blast complete" $(date)

Submitted batch job 305441. Job was running for about a day but there still wasn’t any data in the output file so I cancelled the job. Going to edit the script so that I’m using the nr database that we have in the putnam lab shared folder.

In the scripts folder: nano blast_biomin_subset.sh

#!/bin/bash 
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.13.0-gompi-2022a

gunzip /data/putnamlab/shared/databases/nr.gz

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

echo "Blasting Pacuta subset biomin genes against nr database" $(date)

blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db /data/putnamlab/shared/databases/nr -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt 

echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)

blastp -query Biomineralization_Spist_subset_sequences.fasta -db /data/putnamlab/shared/databases/nr -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt 

echo "Blast complete" $(date)

Submitted batch job 308861. Completed in 1.5 hours, but didn’t work. There was nothing in the output files and I got this error:

BLAST Database error: No alias or index file found for protein database [/data/putnamlab/shared/databases/nr] in search path [/glfs/brick01/gv0/putnamlab/jillashey/Pacuta_HI_2022/data/blast::/glfs/brick01/gv0/shared/ncbi-db/2024-03-11:]
BLAST Database error: No alias or index file found for protein database [/data/putnamlab/shared/databases/nr] in search path [/glfs/brick01/gv0/putnamlab/jillashey/Pacuta_HI_2022/data/blast::/glfs/brick01/gv0/shared/ncbi-db/2024-03-11:]

I’ll revisit this….

I think I will retry to run the code above but use the remote NCBI server? Idk if this will work. Editing the script to include the -remote flag:

#!/bin/bash 
#SBATCH -t 100:00:00
#SBATCH --nodes=1 --ntasks-per-node=20
#SBATCH --export=NONE
#SBATCH --mem=250GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.13.0-gompi-2022a

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

echo "Blasting Pacuta subset biomin genes against remote nr database" $(date)

blastp -query Biomineralization_Pacuta_subset_sequences.fasta -db nr -remote -outfmt 6 -out Biomineralization_blast_results_Pacuta_subset.txt 

echo "Pacuta subset biomin genes blast complete, now blasting Spist subset biomin genes" $(date)

blastp -query Biomineralization_Spist_subset_sequences.fasta -db nr -remote -outfmt 6 -out Biomineralization_blast_results_Spist_subset.txt 

echo "Blast complete" $(date)

Submitted batch job 309006. Failed with this error:

Error: [blastp] internal_error: (Severe Error) Blast search error: Details: search failed. # Informational Message: [blastsrv4.REAL]: Error: CPU usage limit was exceeded, resulting in SIGXCPU (24).  

Removed -remote flag. Submitted batch job 314563. Ran for about 4 days, then timed out. Going to add: -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6 to the script. This will likely make it run faster. Also changed tasks per node to 10, -t to 40 hrs, and --mem to 125GB. Submitted batch job 315190.

I also want to blast the down and upregulated genes of interest against the nt db. In the /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast folder, make two fasta files: downreg_subset_seqs.fasta and upreg_subset_seqs.fasta.

Here are the sequences for the downregulated genes of interest in downreg_subset_seqs.fasta:

>Pocillopora_acuta_HIv2___RNAseq.g24121.t1
ATGAGTCTTAAATTCCTCGCTGCTCTTATACCGCTCATATGTTTCACACATGCATCTTCTAAAAAAATTCCGCTGGTCATCGGCGGAGTGTTTGACATCGACACAAAGCTTGGAGATGAAAACTCTGCAAGTATGATCCCAATAGTGCGCATGGCAATAAAACATGTGAATGAATGTCCCAAAACACTGCTTAACTATGAACTGCAAATGGAAGTCAAAGACGTCAAGTGTCAAGACTCCGATGCAATCCACGCGTTCACTGAGTTCATTCGAGAGGAGAAGAAGAAAATCATGATCTTAGGACCGGGTTGTTCCAAGTCAGCAGAACCGTTTGCCAAGGCTACTCCTTTCTACAACTTGGTTCAAGTGGCTATGGCGGGAGCAAACCCGGCTCTATCATGCGCGAAGATCTTTCCAACGTTCATGCGCACCATACCACCGGAGCATTTTCAAAACCAGGGAAGAGTGGCGATCGTGAAACATTTCAAATGGAGAAGAGTGGCCATACTACGAGAGAATATGGACACTTACCAAGGACTTTCCAACGATTTGGTCAAGAGGTTGAAAAAGGCTGGGATACAGTTGGCCAGTTATCAGACTTTCACCGGCAACGCTGAGTTGCAGATTGAAAACATACAGAAAAACGACTTGCGAATAATTTTTGGCATGTTTTCAGAGAAAGCCGCAAGAAAAGTTATTTGCACGGCTTTTCAGAAGGGGTTTTATGGTCCTAAAGTCGTCTGGATTCTGATGGCGGGTGGGTACAGAGAACAATGGTGGGGCTACAATGATGTGAATTGCACCAAAGAGGAACTTTGCAAGGCGCTCGGGAACTACCTTAGTACCGAGGCGTTGATGATTGGAAAAGATAATCAGGATACGATTGCCAGAAAGACCATAGAAGAACTCAGAGCTGAGTACAAGGCAGAGCTATCCAAGACTCCTTACAAAGAGCCCAGCCGTCATGCCGCCTTTGCTTATGACGCTGTGTGGACTATAGCCCTTACTTTGCATAAGTCTATCTCAGCACTTCAACAGCAAAACGAAACCAGAAACATGTCATTGGAAGACTTTGACTATAACAATACTGCTATGAGGAAGACATTCATGAAGGTTGCAAAAGGGGTATCATTTCAGGGAATGTCTGGCTTGGTACAATTCTACAAGAACTCTGACAGGCTAAGCTTGCTTAACATTGATCAGCGTCAACGTGGCGAAATGAAACGAATTGGAACTTATGATATGAAAAGGAAAGTCATCGTAGTCGATAAGAGCCAAACAGCGCAGTGGGAAGATAGAAAGGTTCCAACCTGCACGATCAAAAAAATCCTTGAGCCACGGTACCTTCCCAAGAGTATGGTATTTACCATGGATGGATTGTGCGTGCTCGGGATCCTCTTTGCGGCAGGGCTCTTCTTTTTCAACTTTAAGTATCGAAATGTCAGGTATATTCGAATGTCCAGCCCGAATATGAATAACATTATTATTCTGGGATGTGTTTTGATCTACATTTCTGGAATCCTTTTCGGCATTGACGCTGAAATTGTCTCAAAGAAGACCCACGAGAAAGTGTGTCAGACCAGTGCATGGACGGCATCATTTGGATTTACCATGGCTTTCGGTGCCTTATTCTCTAAGACATGGAGGGTTCATCGCATTTTTATGAACAAACTTAAAAAAACGGTGATCCAAGACTACCAGCTAATCATTGTGGTAATTCTTCTCCTGATGATTGACGCCTGCGTCCTTTCCACCTGGCAAATACTGGACCCCATATATACCACTTCTAAGACCTTCCCAAGGAGGATTGACCAAGACGGTGACATTGCCATTTACCCATACCAGACCTTCTGCACTTCAAAACATGAAAACTTATGGACAGGTTTGCTATATGGTTACAAAGGCTTCCTTCTGCTATTCTGCACCTACCTGGCATGGGAAACTCGCAAAGTCCATATGCCAGCGCTAAACGATTCCAAGCAGATCGGCTTTGCTGTGTACAATGTGTTCATTCCTTGTGTCATTATCATTCCTATACTGAACCTACTGGGGAGTCACAGCGATGCAGTGTACTTGTTAAGCACTTTGCTGTGCCTTTTCTGTACCACCATCACTCAGTGCCTGATCTTTGTTCCAAAGATTTTTGCGATGAAACGCACAAATGGTGACCCGTCTAGTTACGAGAAGGGTTCACTCTCTTCTGGCAGTACCGTCGACAGCAAAATTTGCCCATCATCTTCAGCAAAAAGCCTTGATCGTAATTACAAAGCGGAGAAAACAACCTTTCCGCCACATGCTGGTTAA
>Pocillopora_acuta_HIv2___RNAseq.g13974.t1
ATGGCAGACACCTCCTTCGCGATCCAACAGTCCATGTCTGAGAGGAAGCCCCCTCGCATGCCTAAGTGCGCCCGTTGTCGCATCCACGGTATGGTGTCATGGTTAAAAGGGCACAAGCGATATTGTAGGTGGAGAGATTGCAACTGTGCTCAATGCACACTGATCGCAGAGAGGCAGCGTGTCATGGCGGCGCAAGTGGCGCTTAGAAGACAGCAGACACAGGAGGAGACCATGCGGGTACAGATAGCTAAGAAAGCACAGGCGTACGTACCTCCCCTTGTCAGCCCTGAACCGCAGTTTAACCACGGTGTCGCAAGAACTCACTCAGAAGAGGCACAGCCCGTGTTCTCGTACCACCGCGCTGACAGTCAAGAGCTGGAAAAGGAAGAGATTAAAAAAACCCCGATAACCTCTCAATCAAAATCTGTTGAGTCCTATGAGGTTAAAATCAAGGAGGAACCAGTGAGTCCGGAAGATTTTGAGAAAGATCACAGCTCAGATGCTGAAGAGAACCACAAGAGATCTTCCATTGATGAAGATAAAGAACCTGATTTCGAGCGACGAAAAGTACCCCGCCTCTCCCCCCACAGGAAAGAAACGGAAGCATTTAAATTTTCTTTAGAACTTCTCCAGCGGATTTTTCCTGATCAAAGTAGAGCCATTCTAGAGCTCATCCTCGGGGCTTGTGAAGAAGACCTGGTCAAAGCTATAGAGTCTCTTCTACCGGAGAACAATCAAAGACCATTTAGCCTGCCTCTTCCTCTTAGAAGCTACGGTTCTGCTTCCTTTATCCCTTGCGACGGCAATCAGGCTAAGTCAGCGTTCTCTCCCATCGCCAAGTCTCCGTCGTATATGTTTCCAGGAGCTCTTGCTGCGCAAGCGCAGTCGTTGAAGAGTCCAAACGACAAAAGCCCCAATACACCTAGCGCATTTCAGCCCGTTCATTCCACGTGCAGCCCACCTGAACGTAGCCCGTCAATGGCAGATAGGTTCCAGTTTCCCGTCGTTGCGGGGTATTTTTTTAACAGGCCTGGCACGTCGGCTCTTCTCTCGCTTAACCCCGCACAGCAGAGGAATGGGGCATCTCAGCCGGGAACAAGATTTTGTAGGCATTGTGGGCACCCGAGCAAAATTGGAGACAAGTTTTGCAGTGACTGCGGCAAGTCGTTGGAGTGA
>Pocillopora_acuta_HIv2___TS.g25049.t1
ATGTATTGGGTGTACAATGATGGCACCAATACAGTCAATTTTGTTTTGGAGGTTAGCACGCTCGGATGGGTCGGTTTCGGATTTGCCAACAAAATCAACAGAATGAAAAACTATGATGTGATTGTTGGAAAGATTGAAAATGGAAGGGGATGGCTCACCGATCGTTTTACCGGAGGTTATTCAGAACCTTTAGCAGATCTATACCAAGATTATAATCTTACCGCTTTCAATGAAAGCAACGGCAAAACATTCTTGGAATTCTACAGAAAAAGAGACACTGGCGACAAGAAAGACATTGAAATCAAGCCAGGACCGATGCTTTTGGTGTACGCTTACCACACACTGGATTATCCATCGCCTTATAAAACAAAGCACGAAAAGCAGGGCTTCAAAATAGTCACCTTAATCCCAGCAGATACAAGCACGACCCAGGCACCCAAACGCAGCACCAATGTCAGAAGTCTGAGGCTCACAATAGAAAATAATCAGACGAAAACCACCACACCAGCGCCCTTCACAGCTACATTGGAGGAGACTGAGAAAGCAAGAGTTCTTGGATCCATATCTTCATACACTCGTGGTTCAACTTTCCTGACCTGTGGGGTAACTCATTCTCCCCGACCAAGCCTATCAGCATCGTATTCATTGAAAGAAAGGTGCAGGTGGAGGGTACCGCTAAGGCCTGACGAGTTGCCAGTTGATGGTTTTGGTCACGGATAG
>Pocillopora_acuta_HIv2___TS.g8000.t1
ATGGAGTCGAAATTGAACGAACTGAAAGAATTATGCCAGTGTTTCGATTGGATAAGACTTAAATTTAAAGGGATCAAACATGTCGACATTTCAGGAGACGAAAAGATGACGTCAGAATATATAATCACTGTCGTTACGGGAAACCGCAAAGGAGCTGGTACAGATGCATCAGTGTCTCTTATAATCAAAGGTAGCAACGGCGAAACAAAACCTCTGTCTATGGACAAATGGTTCCATAATGACTTTGAAGCCGGACAGAAGGACGACTACTACATAACCGCCAAGGATGTTGGCGAGCTTTTGATGATCACTCTAAAAAATGATGGCGGCGGGTACAGAAGTGATTGGTTCGTCGACCGAGTAACAATCAAAACTAAGAACGTCACCTATGTTTTTCCTTGCAATCGTTGGGTAGAGAGTGAAGTCACTTTCTTTGAAGGAAAAGCTAAACTGCCCACGGATGAGCAACATCCTGAATTGAAAAGTCGACGTGAAGCCGAGCTTAAGGAGAGGAGGGCACTGTATGAGTGGGGTAAAGATGAGGTGTATGAGGATCTGCCAGGCTACGTGAAAGCATCTGGAGTGAAGAATCTTCCCAAGGACGTGCAGTTTACCGAGGAAGCTGCCTATGACCTTCATCGAGCCAGGAAAAACGCACTCATCAACCTGGGTCTTGTGCACTTGTTGAATTTTTTTGACCAGTGGGACGACTTTGATGACTACTGCAAGGCATTCACTGGCTTTGTTGGAGAGGTTCCACTAGCTGCAAAATACTGGAAAGAAGACCGTTTCTTTGGATACCAGTTTCTCAATGGCTGTAATCCTGATTCAATCATGAGGTGCACAAAACTGCCACCCCACTTTCCTGTCACACAGGAGTTGGTTGGTAACCTGCTGGACAGTGGGGACACTTTAGAGAAAGCCATGGCGGATGGCCGTATTTACATGGTGGATTATAAGATCTTGGAGGACATCCCACACTACGGGCAGGACCGACCGGACCTAGAAAGGAGATACATGTGCGCAAGCCTAGGCCTGTTTTACGTGAAAGGTAACGGAGACCTGGTGCCGATAGCAGTTCAGTTCCATCAGGAACCACACCATGAAAACCCTATATGGACCCCAAATGATTCCGAGATGGACTGGACCTGTGCTAAGCTGTGGCTGCGTAACTCTGACACTCAATTCCATCAGATGGTCACCCATCTTCTCCGCACCCATCTCTTCATGGAACCCATTGCCGTTGCCAGCTATAGACAGCTACCCACAATCCACCCCGTTTGGAAATTGTTGGCCCCTCATATCCGAGGAGTTCTGGCCATCAACACTCTTGGCAGAGATGTCTTGATAGCGGAGGGAGGAGTGGCTGATAACACTTTGACTGTTGGCGGAGGAGGGCATGTCACCCTAATGAAGAAATTCTACAAGAGCAGCAGTACCTGGCCCTCGTACATCCTGCCACAAGTGCTAAAAGACAGAGGGGTGGATGATCCCGAAAAGCTACCCAACTTCCACTACCGCGAAGATTCTTTAAAACTCTGGGCAGCCATTGCAGACTTCGTCAAGGAGATCTTGTCTGACTATTACCACTCTGATGGTGAAGTGCAGAAGGACTACGAGCTTCAGAATTGGGTCAAAGATCTGCATGACGATGGCTATCCCAATAAACCAGGCCATACAAACCATGGTGTTCCGCCATCTTTTACCAGTTGCGTCCAGCTGTACGAATTCTTGACCTCCATCATCTTCACCTGCGCCTGCCAACACGCAGCAGTTAACTTCTCCCAGATGGACGTATACGGTTTCCCGCCAAATTCTCCAGCACTCATGCGTCAGCCACCACCGACCAAGAAAGGAGTTGTGGGGCAAGCAGACCTCATGAAATGCTTGGCTACCAAGCACCAATCTTCCCTCACCATTGCCACGGTGTACGACTTGACCCGTATTTTTAAAGACGAGAAATTCATTGGTGACTACCCAGAGGAGCTATTCATCGAAGAACCTGCCAAAGCCGCCATTGAAGTATTTCAGAGGAAACTGAAGGGCATATCCGCTGAAATTAAGGCGCGAAACGCAAAACTCTGTGTCCCGTACCCATATCTTTTGCCAGAGCAAATTCCAAATAGTATTGCCATCTGA

Here are the sequences for the upregulated genes of interest in upreg_subset_seqs.fasta:

>Pocillopora_acuta_HIv2___RNAseq.g22884.t1
ATGAATGTGTTCAACCCAAATCGAAACATCCAGTTAACCAGGATAAAGAATTTTCTACTGGACAACGTTGGATATCAAGAATTTCTATCGCCTGGTGCAACGGTAGTGAATCCGCTGCAGCCTTCTCGCATCGATTCCCTAACGATGGGCTCCTCACAAATAGCTCCACGGAAAAGACTGCCAGACTGTCCTCAAGAAGAAAATACAGAAAATCGGGCAGCAAGCAACAAGCATCGAGCAAGAAAAAGAAATCGAGGTCCAAAACCAAAGAAAATGTACGACGGCAATGGACCAATTCAGCTGTGGCAACTAATTTTGTCAGAGCTCGTTTCTTCATCGTCAGAGCCTCTTGTGGAATGGACTAAGAAAGACAAATACGAATTTCGCATTTTGCAGCCTGATAAACTGGCGGCCCTATGGGGAGAGCAGAAGAAAAAGACCAATATGAATTTTGCAAAGCTTGCGCGAGGTTTGCGGTATTACTATGGAAAGTCCATTTTGGAAAAGGTTCGTGGCCAACAGTTTACCTATCAGTTTGTTATGGACATCGATGCAATTCTCGCGAACGATTCTGATGGCGAAGCTTCCGACGGTGGAGGCAGGAGCACGCCTGATGTTTTTTGTGAAGCACCGAGAACTCTGGAACAAAGGGAGGGGTATGGGGAAACAGAGACACAACTCACTAGTACAAGGGTGGGTGGATATGTGGGTGTATGGGGGGACTTTGGATGCCCCATGGAGGCATCGAGTGACACTGGGGGTGCTGAACAGGGTGTTATTGACATAGAGGGGATAGTGCAGAACGCAGGAGAGACATTTGGCTGTAAGGGGGAGAGTTCAAGGCTCCCTATGGGGATATCAAGGGATACTGAGGGAGGAAAAGAGGTTGCTATAGGCACTAAGGGGGTAGAATGGAACTTAGGGGAAGCTTTTAGGGGAACAGGAGAGGGAAAGGTGAGAAAATCGTGGGGCCAAGAGAATACAGGGATGGGATGTGATACCACAGATAAAGATTTTGGAATTGTGCCTGGAGGACTTGAAAGCCTCCCTGGGGGGTGTGGGGTCATTAAGGGGGAGCTAGGGGAGATAAAGTCATTCGATTTAAGTGATCTTATTGAATCCAATGACACAAGTAGCTTCTATGAAGCACTATAA
>Pocillopora_acuta_HIv2___TS.g23786.t1
ATGAATATGTCGTCAATGGAGCGTGAAAGCCATGTTGATCACTCAATGCAGAAACAACCTGAAACCTTTACCGAAGAATGTTTGAACCAAGTAGATCTTCAGAAGTCCGAAATGACCTCAGTCGCAAGAGAACCTCGTTACAAAAACATAGTCCACTTGTGGGAGTTTCTATTAGAGCTGTTGGCGAGCGATGGCTGCAAAGGGATCATTTGCTGGAGTAGAAAAGACCACAGGGAGTTCAGGTTGAACAACCCTCACGAGGTTGCTAAAAGATGGGGACGCTTAAAAGGGAAGACAGGAATGAACTATGAAAAACTCAGTCGAGCTTTGAGATACTACTATCAACAAGGAATTATCAAAAAGGTCCGTGGTCAAAGGCTCGTGTACAAGTTTAACAAACTTCCATATCGATACGAGCCTGGTGTAACAAGATCTCAACATCAATTAAAGAAAATAAATAAAAGCAACACCGAAGAACAAGATGAACATCAAGCACCATCTCCTCAGACAGTAACTGTGCCATCGCCTGTACCTTCAGCTTTCCTTCCACCAAGTCCAACAAGTCCCATTAGCCCCGTTATTACCCCACTCAGTAAAGATTGGTCATGGCCAGTTGTTCCCGTGCCTACTCGGCCGATGTTGTGGTATCAAGGCTCCTCGCTTCTTAAACCATCAGCAATCCTCATTGGCAGAAGCAGGATTATGGTTCCGGTCATGGATCCGTTGACGTCACTTCCATTAGGTTTTAAACCAATTCAGCCTACACCTTTTAATACGTCAATACCAGTTTCAGTTATTCAGCGAACCATTTAA
>Pocillopora_acuta_HIv2___TS.g26760.t1
ATGGCCAAACTTTGCCATGCGCTGTGTATTTCGTTGTACTTTGTAGGTGCATTTGCAAAATCCGTCGAAGATGGAAAAAAAAGCGGAAAACTTTCGGAACAGGAGCATTACAATAAGGACGGTCAACACAACACGGAATACGATCACGAAGCTTTCCTGGGAAAACAGAAGAAAACTTTTGATCAGCTTACTCTTGAAGAGTCCAAAGAAAGACTTGGGCTCCAACACAACTATCTGTTTCGCAGGAAAATTGTTGATAAAATAGACAAAGACAAAGATGGGAAAATTACTAAAGAAGAACTTGGAGAGTGGATTAAATTCACAAAGGATCAACACAATGAGGAAACTATTGATAAAAAGTGGAAAGATGTCATTGCAAGATTACAGAAAGTAATGTCCCGAAAGGATGCTTCATCTGCAAAAACTGTTGATCCTGATGGTGCGATCACATGGGAGGACCACAATGAAGTCAGCTATGGAGGAAAGCCAGAGGAGGAACTAGATGACATGTACAAGGGACAAGTAAAGATAGAAAAACGAAGATGGAAGATGGCAGATCTTGATCAAGATGGCAAACTGTCCAGAGAAGAATTCAGTGCCTTTTATCATCCATGGGAACATGAACAAACACATGATGCAGTTGTACAGGAAACTATTGAGGACATGGACAAAGATAAAGATGGAGTCTTATCACTCAAGGAGTACTTAGACGAAGTGCACCCTAGGGATGAAAAAGACCTGACAAAGGAACAAATGGCTCAGAAGAAAGCGGATGAGAACTATTTCCATACAAACCGTGACATAAATAAGGATGGTGTGATGGACAAGGAGGAAGTCAAGGAATGGATGTTTCCTTCAAATTATGATGCTGTCAAATCTGAAGTGTCTCATTTAATATACCATGCTGATACTGATAAGGATACAATGTTGACAAAAGAAGAAATTCTCAACAATCATAAGTATTTTGTTGGAAGCAAAGCAACAAATTATGGCAAAGATCTGACCAGACATGAGGAGTTTTAA
>Pocillopora_acuta_HIv2___RNAseq.g19477.t1
ATGGTTGCAATGGATGAGAATGACTCTGTATGGCCCCTTGCCCCACTCACTTTTGTAAACATGACCAGTTCTCTGAGTCAGAAATGTATAGTTATTGACTGTCGTTCCTTCTTATCATTTAACGTCGCCCACATCAAAGGATCGCTTAACGTTCACTGCCCTCCTATCTTAAAAAGAAGATTTCATCGTGGGTCATCAACGTTAGATTGTCTACTCAAATCTCCTGAGTTGAAACGAAGGGTTGCGGACGCAGAGACTTTAATTTTGTACGGGGAAGGAACTCAGGATTGGATTGACTTGGAGAAGGACAATACGATGAAGATACTCCACATGCTTTTGAGAAGAGAGAGAGTAGACAAGACTCTATATTTCATTAAAGGAGGATTTGAAAAATTCGCGTCGTCTTTCCCCTCCATGTGCTACTTTGCAAATCCCCATGCGGCATCACAAGCATGCTCCAGTCCTCTCGGATTAAAGCTCAAAACGAAAGATTTTAAAAGATTCAGCCCGAGAAACGAGATCGATCCTGTCAGGGATTATGCCGAGTCGCGCCCAAATGTGTCAGCCAAGCCAAATGAGCCTGTTGAAATCTTACCACATCTGTATCTTGGAAGCGAATTCCACTCGTCTCAAAAGGAGCTCCTTCAACACTTGGGTATCACTGCCATAGTCAATGTTTCAAGTAATATTCCGAACTTTTTTGAGGATACATTTGACTACAAGTCTATTCCGGTTGACGATACTTACACCGCAGATATCGGCCGATGGTTTGAGGAGGCAGCGATGTTTATAGATTCAGTGAAGAAATCGAAAGGACGGGTACTAGTACATTGCCAGGCTGGGATCTCAAGATCAGCCACTATTTGCCTTGCCTATCTTATAAGTAGACATCAACTCAGGTTGGACGAAGCTTATGAGTACGTTAAGAAGCGCCGTTCAGTTATATCACCAAACTTTAACTTCATGGGGCAACTGCTTAATTGGGAATCAGAGACTCAGCTTACAAATAGAGTATCGAGCACACACACACCCACCACTCCCTTTGGATTTTTCAGTTTTTCTCCGTTGCCATGTGGATCCGAAATGACTACCTCTGGTAACAAACAGAACTCACCTGGGCTCGTGACTTCACCCATGTAA

I’m going to blast these sequences against the BLAST nt database using similar code to the blastp I did above. In the scripts folder: nano blast_downreg.sh

#!/bin/bash 
#SBATCH -t 40:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=125GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.13.0-gompi-2022a

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

echo "Blasting Pacuta downregulated genes of interest against remote nt database" $(date)

blastn -query downreg_subset_seqs.fasta -db nt -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6 -out downreg_subset_blast_results_Pacuta.txt 

echo "Blast complete" $(date)

Submitted batch job 315195. In the scripts folder: nano blast_upreg.sh

#!/bin/bash 
#SBATCH -t 40:00:00
#SBATCH --nodes=1 --ntasks-per-node=10
#SBATCH --export=NONE
#SBATCH --mem=125GB
#SBATCH --mail-type=BEGIN,END,FAIL #email you when job starts, stops and/or fails
#SBATCH --mail-user=jillashey@uri.edu #your email to send notifications
#SBATCH --account=putnamlab
#SBATCH --exclusive
#SBATCH -D /data/putnamlab/jillashey/Pacuta_HI_2022/scripts
#SBATCH -o slurm-%j.out
#SBATCH -e slurm-%j.error

module load BLAST+/2.13.0-gompi-2022a

cd /data/putnamlab/jillashey/Pacuta_HI_2022/data/blast

echo "Blasting Pacuta upregulated genes of interest against remote nt database" $(date)

blastn -query upreg_subset_seqs.fasta -db nt -evalue 1E-40 -num_threads 10 -max_target_seqs 1 -max_hsps 1 -outfmt 6 -out upreg_subset_blast_results_Pacuta.txt 

echo "Blast complete" $(date)

Submitted batch job 315196

Written on March 11, 2024