From bda39896f22ecae8cb1260ad0a7183153b0c7f44 Mon Sep 17 00:00:00 2001
From: noah
Date: Tue, 6 Sep 2022 22:48:19 -0500
Subject: [PATCH] Completion of code

---
 .~lock.Schrick-Noah_CS-6643_Lab1-Report.doc# |   1 +
 README.md                                    |   8 +-
 Schrick-Noah_CS-6643_Lab1-Report.doc         | Bin 0 -> 37376 bytes
 Schrick-Noah_CS-6643_Lab1.R                  |  92 +++-
 apoe.fasta                                   |  54 +++
 covid.fasta                                  | 430 +++++++++++++++++++
 6 files changed, 576 insertions(+), 9 deletions(-)
 create mode 100644 .~lock.Schrick-Noah_CS-6643_Lab1-Report.doc#
 create mode 100644 Schrick-Noah_CS-6643_Lab1-Report.doc
 create mode 100644 apoe.fasta
 create mode 100644 covid.fasta

diff --git a/.~lock.Schrick-Noah_CS-6643_Lab1-Report.doc# b/.~lock.Schrick-Noah_CS-6643_Lab1-Report.doc#
new file mode 100644
index 0000000..afb4326
--- /dev/null
+++ b/.~lock.Schrick-Noah_CS-6643_Lab1-Report.doc#
@@ -0,0 +1 @@
+,noah,NovaArchSys,06.09.2022 17:52,file:///home/noah/.config/libreoffice/4;
\ No newline at end of file
diff --git a/README.md b/README.md
index a04e870..754ee90 100644
--- a/README.md
+++ b/README.md
@@ -9,7 +9,7 @@ d) Use the == logic operator and other R functions on your my.dna variable to d
 e) Confirm your answer in d with the table(my.dna). From the output of table, create a pie chart and barplot. Add x and y labels to your barplot.
 f) Use the sample function with the option prob=c(.1,.4,.4,.1)to create a vector with variable name my.dna2 that consists of 20 non-uniformly random letters “A”, “C”, “G”, and “T”. Use table to show the nucleotide counts.
 
-## Part B: NCBI Search
+## Part B - NCBI Search
 
 ### Setup
 
@@ -21,7 +21,7 @@ a) What is the name of the gene?
 b) What chromosome is the gene on?
 c) What species has the most similar gene to the human version?
 
-## Part C: Reading fasta files, nucleotide and dinucleotide frequencies
+## Part C - Reading fasta files, nucleotide and dinucleotide frequencies
 
 ### Setup
 
@@ -39,13 +39,13 @@ e) How many of each nucleotide are there in the sequence?
 f) Create a barplot of the counts, including axes labels
 g) Calculate the probability of each nucleotide
 
-## Part D: GC Content
+## Part D - GC Content
 
 a) Add code to your R script to calculate the G+C content of the fasta vector
 b) How many gc pairs are there?
 c) Show a barplot of all dinucleotide counts
 
-## Part E: Coronavirus
+## Part E - Coronavirus
 
 ### Setup
 Paper: https://www.ncbi.nlm.nih.gov/pubmed/32015508
diff --git a/Schrick-Noah_CS-6643_Lab1-Report.doc b/Schrick-Noah_CS-6643_Lab1-Report.doc
new file mode 100644
index 0000000000000000000000000000000000000000..ac039103fcd080b7162a71255541a19bbf40b4c4
Binary files /dev/null and b/Schrick-Noah_CS-6643_Lab1-Report.doc differ
diff --git a/Schrick-Noah_CS-6643_Lab1.R b/Schrick-Noah_CS-6643_Lab1.R
index e7c50b3..96bf23d 100644
--- a/Schrick-Noah_CS-6643_Lab1.R
+++ b/Schrick-Noah_CS-6643_Lab1.R
@@ -3,6 +3,9 @@
 # Professor: Dr. McKinney, Fall 2022
 # Noah L. Schrick - 1492657
 
+## Set Working Directory to file directory - RStudio approach
+setwd(dirname(rstudioapi::getActiveDocumentContext()$path))
+
 #### Part A: Seq Function
 ## a
 AAvec <- seq(from = 1, to = 33, by = 2)
@@ -49,17 +52,96 @@ my.dna2.table
 
 #### Part C: Reading fasta files, nucelotide and dinucleotide frequencies
 ## Pre-cursor: Load associated supportive libraries
+if (!require("seqinr")) install.packages("seqinr")
+library(seqinr)
 
-## 1
+## Load in the fasta file as a string
+myfasta <- read.fasta(file = "apoe.fasta", as.string = TRUE)
 
-## 2
+## a
+class(myfasta)
 
-## 3
+## b
+fasta2vec <- function(fasta.file){
+  if (!require("seqinr")) install.packages("seqinr")
+  library(seqinr)
+  fasta <- read.fasta(file = fasta.file, as.string = TRUE)
+  fasta.string <- fasta[[1]][1]
+  fasta.list <- strsplit(fasta.string, "")
+  fasta.vec <- unlist(fasta.list)
+  return(fasta.vec)
+}
+
+fasta.vec <- fasta2vec("apoe.fasta")
+
+## c
+fasta.len <- length(fasta.vec)
+
+## d
+fasta.vec[1:20]
+
+## e
+fasta.table <- table(fasta.vec)
+
+## f
+fasta.cols <- rainbow(nrow(fasta.table))
+
+fasta.bp <- barplot(as.matrix(fasta.table), beside = TRUE, xlab = "Letter",
+                    ylab = "Frequency", ylim = c(-0.25*max(as.numeric(fasta.table)),
+                                                 1.25*max(as.numeric(fasta.table))),
+                    main = "Bar Plot Representation of APOE Nucleotides",
+                    col = fasta.cols, legend.text = TRUE)
+
+text(x = fasta.bp, y = 1.1*fasta.table, labels = as.numeric(fasta.table))
+text(x = fasta.bp, y = -0.10*max(as.numeric(fasta.table)), labels = names(fasta.table))
+
+## g
+fasta.nuc_prob <- fasta.table/fasta.len
 
 #### Part D: GC Content
 
-## 1
+## a
+fasta.gc <- (sum(fasta.vec=="g") + sum(fasta.vec=="c"))/fasta.len
+# Verify (all.equal tolerates floating-point rounding, unlike ==):
+all.equal(fasta.gc, GC(fasta.vec))
+
+## b
+fasta.pairs <- seqinr::count(fasta.vec, 2)
+fasta.pairs.gc <- fasta.pairs['gc']
+
+## c
+fasta.pairs.cols <- rainbow(nrow(fasta.pairs))
+
+fasta.pairs.bp <- barplot(as.matrix(fasta.pairs), beside = TRUE, xlab = "Pairs",
+                          ylab = "Frequency",
+                          ylim = c(-0.25*max(as.numeric(fasta.pairs)),
+                                   1.25*max(as.numeric(fasta.pairs))),
+                          main = "Bar Plot Representation of APOE Nucleotide Pairs",
+                          col = fasta.pairs.cols)
+
+text(x = fasta.pairs.bp, y = 1.1*fasta.pairs, labels = as.numeric(fasta.pairs))
+text(x = fasta.pairs.bp, y = -0.10*max(as.numeric(fasta.pairs)),
+     labels = names(fasta.pairs))
 
 #### Part E: Coronavirus
 
-## 1
+## a
+covid.vec <- fasta2vec("covid.fasta")
+covid.table <- table(covid.vec)
+covid.nuc_prob <- covid.table / length(covid.vec)
+
+compare.bind <- rbind(fasta.nuc_prob, covid.nuc_prob)
+
+compare.bp <- barplot(as.matrix(compare.bind), beside = TRUE,
+                      xlab = "Nucleotide",
+                      ylab = "Probability",
+                      ylim = c(0,
+                               1.25*max(c(fasta.nuc_prob, covid.nuc_prob))),
+                      main = "Bar Plot Representation and Comparison of APOE to COVID Nucleotide Probabilities",
+                      legend.text = TRUE)
+
+ycords <- 1.03*c(rbind(as.vector(fasta.nuc_prob), as.vector(covid.nuc_prob)))
+tlabs <- c(rbind(as.vector(fasta.nuc_prob), as.vector(covid.nuc_prob)))
+text(x = compare.bp, y = ycords,
+     labels = round(as.numeric(tlabs), digits = 3))
+
diff --git a/apoe.fasta b/apoe.fasta
new file mode 100644
index 0000000..82b1eaa
--- /dev/null
+++ b/apoe.fasta
@@ -0,0 +1,54 @@
+>NC_000019.10:44905796-44909393 Homo sapiens chromosome 19, GRCh38.p14 Primary Assembly
+CTACTCAGCCCCAGCGGAGGTGAAGGACGTCCTTCCCCAGGAGCCGGTGAGAAGCGCAGTCGGGGGCACG
+GGGATGAGCTCAGGGGCCTCTAGAAAGAGCTGGGACCCTGGGAACCCCTGGCCTCCAGGTAGTCTCAGGA
+GAGCTACTCGGGGTCGGGCTTGGGGAGAGGAGGAGCGGGGGTGAGGCAAGCAGCAGGGGACTGGACCTGG
+GAAGGGCTGGGCAGCAGAGACGACCCGACCCGCTAGAAGGTGGGGTGGGGAGAGCAGCTGGACTGGGATG
+TAAGCCATAGCAGGACTCCACGAGTTGTCACTATCATTTATCGAGCACCTACTGGGTGTCCCCAGTGTCC
+TCAGATCTCCATAACTGGGGAGCCAGGGGCAGCGACACGGTAGCTAGCCGTCGATTGGAGAACTTTAAAA
+TGAGGACTGAATTAGCTCATAAATGGAACACGGCGCTTAACTGTGAGGTTGGAGCTTAGAATGTGAAGGG
+AGAATGAGGAATGCGAGACTGGGACTGAGATGGAACCGGCGGTGGGGAGGGGGTGGGGGGATGGAATTTG
+AACCCCGGGAGAGGAAGATGGAATTTTCTATGGAGGCCGACCTGGGGATGGGGAGATAAGAGAAGACCAG
+GAGGGAGTTAAATAGGGAATGGGTTGGGGGCGGCTTGGTAAATGTGCTGGGATTAGGCTGTTGCAGATAA
+TGCAACAAGGCTTGGAAGGCTAACCTGGGGTGAGGCCGGGTTGGGGCCGGGCTGGGGGTGGGAGGAGTCC +TCACTGGCGGTTGATTGACAGTTTCTCCTTCCCCAGACTGGCCAATCACAGGCAGGAAGATGAAGGTTCT +GTGGGCTGCGTTGCTGGTCACATTCCTGGCAGGTATGGGGGCGGGGCTTGCTCGGTTCCCCCCGCTCCTC +CCCCTCTCATCCTCACCTCAACCTCCTGGCCCCATTCAGGCAGACCCTGGGCCCCCTCTTCTGAGGCTTC +TGTGCTGCTTCCTGGCTCTGAACAGCGATTTGACGCTCTCTGGGCCTCGGTTTCCCCCATCCTTGAGATA +GGAGTTAGAAGTTGTTTTGTTGTTGTTGTTTGTTGTTGTTGTTTTGTTTTTTTGAGATGAAGTCTCGCTC +TGTCGCCCAGGCTGGAGTGCAGTGGCGGGATCTCGGCTCACTGCAAGCTCCGCCTCCCAGGTCCACGCCA +TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGCACATGCCACCACACCCGACTAACTTTTTTG +TATTTTCAGTAGAGACGGGGTTTCACCATGTTGGCCAGGCTGGTCTGGAACTCCTGACCTCAGGTGATCT +GCCCGTTTCGATCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCACCTGGCTGGGAGTTAGAGGT +TTCTAATGCATTGCAGGCAGATAGTGAATACCAGACACGGGGCAGCTGTGATCTTTATTCTCCATCACCC +CCACACAGCCCTGCCTGGGGCACACAAGGACACTCAATACATGCTTTTCCGCTGGGCGCGGTGGCTCACC +CCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCCAGGAGTTCAACACCAGCCT +GGGCAACATAGTGAGACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGGCATGGTGCCACACACCTGT +GCTCTCAGCTACTCAGGAGGCTGAGGCAGGAGGATCGCTTGAGCCCAGAAGGTCAAGGTTGCAGTGAACC +ATGTTCAGGCCGCTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTTTATAAATACATAATGCTTTCC +AAGTGATTAAACCGACTCCCCCCTCACCCTGCCCACCATGGCTCCAAAGAAGCATTTGTGGAGCACCTTC +TGTGTGCCCCTAGGTACTAGATGCCTGGACGGGGTCAGAAGGACCCTGACCCACCTTGAACTTGTTCCAC +ACAGGATGCCAGGCCAAGGTGGAGCAAGCGGTGGAGACAGAGCCGGAGCCCGAGCTGCGCCAGCAGACCG +AGTGGCAGAGCGGCCAGCGCTGGGAACTGGCACTGGGTCGCTTTTGGGATTACCTGCGCTGGGTGCAGAC +ACTGTCTGAGCAGGTGCAGGAGGAGCTGCTCAGCTCCCAGGTCACCCAGGAACTGAGGTGAGTGTCCCCA +TCCTGGCCCTTGACCCTCCTGGTGGGCGGCTATACCTCCCCAGGTCCAGGTTTCATTCTGCCCCTGTCGC +TAAGTCTTGGGGGGCCTGGGTCTCTGCTGGTTCTAGCTTCCTCTTCCCATTTCTGACTCCTGGCTTTAGC +TCTCTGGAATTCTCTCTCTCAGCTTTGTCTCTCTCTCTTCCCTTCTGACTCAGTCTCTCACACTCGTCCT +GGCTCTGTCTCTGTCCTTCCCTAGCTCTTTTATATAGAGACAGAGAGATGGGGTCTCACTGTGTTGCCCA +GGCTGGTCTTGAACTTCTGGGCTCAAGCGATCCTCCCGCCTCGGCCTCCCAAAGTGCTGGGATTAGAGGC +ATGAGCCACCTTGCCCGGCCTCCTAGCTCCTTCTTCGTCTCTGCCTCTGCCCTCTGCATCTGCTCTCTGC 
+ATCTGTCTCTGTCTCCTTCTCTCGGCCTCTGCCCCGTTCCTTCTCTCCCTCTTGGGTCTCTCTGGCTCAT +CCCCATCTCGCCCGCCCCATCCCAGCCCTTCTCCCCGCCTCCCACTGTGCGACACCCTCCCGCCCTCTCG +GCCGCAGGGCGCTGATGGACGAGACCATGAAGGAGTTGAAGGCCTACAAATCGGAACTGGAGGAACAACT +GACCCCGGTGGCGGAGGAGACGCGGGCACGGCTGTCCAAGGAGCTGCAGGCGGCGCAGGCCCGGCTGGGC +GCGGACATGGAGGACGTGTGCGGCCGCCTGGTGCAGTACCGCGGCGAGGTGCAGGCCATGCTCGGCCAGA +GCACCGAGGAGCTGCGGGTGCGCCTCGCCTCCCACCTGCGCAAGCTGCGTAAGCGGCTCCTCCGCGATGC +CGATGACCTGCAGAAGCGCCTGGCAGTGTACCAGGCCGGGGCCCGCGAGGGCGCCGAGCGCGGCCTCAGC +GCCATCCGCGAGCGCCTGGGGCCCCTGGTGGAACAGGGCCGCGTGCGGGCCGCCACTGTGGGCTCCCTGG +CCGGCCAGCCGCTACAGGAGCGGGCCCAGGCCTGGGGCGAGCGGCTGCGCGCGCGGATGGAGGAGATGGG +CAGCCGGACCCGCGACCGCCTGGACGAGGTGAAGGAGCAGGTGGCGGAGGTGCGCGCCAAGCTGGAGGAG +CAGGCCCAGCAGATACGCCTGCAGGCCGAGGCCTTCCAGGCCCGCCTCAAGAGCTGGTTCGAGCCCCTGG +TGGAAGACATGCAGCGCCAGTGGGCCGGGCTGGTGGAGAAGGTGCAGGCTGCCGTGGGCACCAGCGCCGC +CCCTGTGCCCAGCGACAATCACTGAACGCCGAAGCCTGCAGCCATGCGACCCCACGCCACCCCGTGCCTC +CTGCCTCCGCGCAGCCTGCAGCGGGAGACCCTGTCCCCGCCCCAGCCGTCCTCCTGGGGTGGACCCTAGT +TTAATAAAGATTCACCAAGTTTCACGCA + diff --git a/covid.fasta b/covid.fasta new file mode 100644 index 0000000..c92695d --- /dev/null +++ b/covid.fasta @@ -0,0 +1,430 @@ +>MN908947.3 Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1, complete genome +ATTAAAGGTTTATACCTTCCCAGGTAACAAACCAACCAACTTTCGATCTCTTGTAGATCTGTTCTCTAAA +CGAACTTTAAAATCTGTGTGGCTGTCACTCGGCTGCATGCTTAGTGCACTCACGCAGTATAATTAATAAC +TAATTACTGTCGTTGACAGGACACGAGTAACTCGTCTATCTTCTGCAGGCTGCTTACGGTTTCGTCCGTG +TTGCAGCCGATCATCAGCACATCTAGGTTTCGTCCGGGTGTGACCGAAAGGTAAGATGGAGAGCCTTGTC +CCTGGTTTCAACGAGAAAACACACGTCCAACTCAGTTTGCCTGTTTTACAGGTTCGCGACGTGCTCGTAC +GTGGCTTTGGAGACTCCGTGGAGGAGGTCTTATCAGAGGCACGTCAACATCTTAAAGATGGCACTTGTGG +CTTAGTAGAAGTTGAAAAAGGCGTTTTGCCTCAACTTGAACAGCCCTATGTGTTCATCAAACGTTCGGAT +GCTCGAACTGCACCTCATGGTCATGTTATGGTTGAGCTGGTAGCAGAACTCGAAGGCATTCAGTACGGTC +GTAGTGGTGAGACACTTGGTGTCCTTGTCCCTCATGTGGGCGAAATACCAGTGGCTTACCGCAAGGTTCT +TCTTCGTAAGAACGGTAATAAAGGAGCTGGTGGCCATAGTTACGGCGCCGATCTAAAGTCATTTGACTTA 
+GGCGACGAGCTTGGCACTGATCCTTATGAAGATTTTCAAGAAAACTGGAACACTAAACATAGCAGTGGTG +TTACCCGTGAACTCATGCGTGAGCTTAACGGAGGGGCATACACTCGCTATGTCGATAACAACTTCTGTGG +CCCTGATGGCTACCCTCTTGAGTGCATTAAAGACCTTCTAGCACGTGCTGGTAAAGCTTCATGCACTTTG +TCCGAACAACTGGACTTTATTGACACTAAGAGGGGTGTATACTGCTGCCGTGAACATGAGCATGAAATTG +CTTGGTACACGGAACGTTCTGAAAAGAGCTATGAATTGCAGACACCTTTTGAAATTAAATTGGCAAAGAA +ATTTGACACCTTCAATGGGGAATGTCCAAATTTTGTATTTCCCTTAAATTCCATAATCAAGACTATTCAA +CCAAGGGTTGAAAAGAAAAAGCTTGATGGCTTTATGGGTAGAATTCGATCTGTCTATCCAGTTGCGTCAC +CAAATGAATGCAACCAAATGTGCCTTTCAACTCTCATGAAGTGTGATCATTGTGGTGAAACTTCATGGCA +GACGGGCGATTTTGTTAAAGCCACTTGCGAATTTTGTGGCACTGAGAATTTGACTAAAGAAGGTGCCACT +ACTTGTGGTTACTTACCCCAAAATGCTGTTGTTAAAATTTATTGTCCAGCATGTCACAATTCAGAAGTAG +GACCTGAGCATAGTCTTGCCGAATACCATAATGAATCTGGCTTGAAAACCATTCTTCGTAAGGGTGGTCG +CACTATTGCCTTTGGAGGCTGTGTGTTCTCTTATGTTGGTTGCCATAACAAGTGTGCCTATTGGGTTCCA +CGTGCTAGCGCTAACATAGGTTGTAACCATACAGGTGTTGTTGGAGAAGGTTCCGAAGGTCTTAATGACA +ACCTTCTTGAAATACTCCAAAAAGAGAAAGTCAACATCAATATTGTTGGTGACTTTAAACTTAATGAAGA +GATCGCCATTATTTTGGCATCTTTTTCTGCTTCCACAAGTGCTTTTGTGGAAACTGTGAAAGGTTTGGAT +TATAAAGCATTCAAACAAATTGTTGAATCCTGTGGTAATTTTAAAGTTACAAAAGGAAAAGCTAAAAAAG +GTGCCTGGAATATTGGTGAACAGAAATCAATACTGAGTCCTCTTTATGCATTTGCATCAGAGGCTGCTCG +TGTTGTACGATCAATTTTCTCCCGCACTCTTGAAACTGCTCAAAATTCTGTGCGTGTTTTACAGAAGGCC +GCTATAACAATACTAGATGGAATTTCACAGTATTCACTGAGACTCATTGATGCTATGATGTTCACATCTG +ATTTGGCTACTAACAATCTAGTTGTAATGGCCTACATTACAGGTGGTGTTGTTCAGTTGACTTCGCAGTG +GCTAACTAACATCTTTGGCACTGTTTATGAAAAACTCAAACCCGTCCTTGATTGGCTTGAAGAGAAGTTT +AAGGAAGGTGTAGAGTTTCTTAGAGACGGTTGGGAAATTGTTAAATTTATCTCAACCTGTGCTTGTGAAA +TTGTCGGTGGACAAATTGTCACCTGTGCAAAGGAAATTAAGGAGAGTGTTCAGACATTCTTTAAGCTTGT +AAATAAATTTTTGGCTTTGTGTGCTGACTCTATCATTATTGGTGGAGCTAAACTTAAAGCCTTGAATTTA +GGTGAAACATTTGTCACGCACTCAAAGGGATTGTACAGAAAGTGTGTTAAATCCAGAGAAGAAACTGGCC +TACTCATGCCTCTAAAAGCCCCAAAAGAAATTATCTTCTTAGAGGGAGAAACACTTCCCACAGAAGTGTT +AACAGAGGAAGTTGTCTTGAAAACTGGTGATTTACAACCATTAGAACAACCTACTAGTGAAGCTGTTGAA 
+GCTCCATTGGTTGGTACACCAGTTTGTATTAACGGGCTTATGTTGCTCGAAATCAAAGACACAGAAAAGT +ACTGTGCCCTTGCACCTAATATGATGGTAACAAACAATACCTTCACACTCAAAGGCGGTGCACCAACAAA +GGTTACTTTTGGTGATGACACTGTGATAGAAGTGCAAGGTTACAAGAGTGTGAATATCACTTTTGAACTT +GATGAAAGGATTGATAAAGTACTTAATGAGAAGTGCTCTGCCTATACAGTTGAACTCGGTACAGAAGTAA +ATGAGTTCGCCTGTGTTGTGGCAGATGCTGTCATAAAAACTTTGCAACCAGTATCTGAATTACTTACACC +ACTGGGCATTGATTTAGATGAGTGGAGTATGGCTACATACTACTTATTTGATGAGTCTGGTGAGTTTAAA +TTGGCTTCACATATGTATTGTTCTTTCTACCCTCCAGATGAGGATGAAGAAGAAGGTGATTGTGAAGAAG +AAGAGTTTGAGCCATCAACTCAATATGAGTATGGTACTGAAGATGATTACCAAGGTAAACCTTTGGAATT +TGGTGCCACTTCTGCTGCTCTTCAACCTGAAGAAGAGCAAGAAGAAGATTGGTTAGATGATGATAGTCAA +CAAACTGTTGGTCAACAAGACGGCAGTGAGGACAATCAGACAACTACTATTCAAACAATTGTTGAGGTTC +AACCTCAATTAGAGATGGAACTTACACCAGTTGTTCAGACTATTGAAGTGAATAGTTTTAGTGGTTATTT +AAAACTTACTGACAATGTATACATTAAAAATGCAGACATTGTGGAAGAAGCTAAAAAGGTAAAACCAACA +GTGGTTGTTAATGCAGCCAATGTTTACCTTAAACATGGAGGAGGTGTTGCAGGAGCCTTAAATAAGGCTA +CTAACAATGCCATGCAAGTTGAATCTGATGATTACATAGCTACTAATGGACCACTTAAAGTGGGTGGTAG +TTGTGTTTTAAGCGGACACAATCTTGCTAAACACTGTCTTCATGTTGTCGGCCCAAATGTTAACAAAGGT +GAAGACATTCAACTTCTTAAGAGTGCTTATGAAAATTTTAATCAGCACGAAGTTCTACTTGCACCATTAT +TATCAGCTGGTATTTTTGGTGCTGACCCTATACATTCTTTAAGAGTTTGTGTAGATACTGTTCGCACAAA +TGTCTACTTAGCTGTCTTTGATAAAAATCTCTATGACAAACTTGTTTCAAGCTTTTTGGAAATGAAGAGT +GAAAAGCAAGTTGAACAAAAGATCGCTGAGATTCCTAAAGAGGAAGTTAAGCCATTTATAACTGAAAGTA +AACCTTCAGTTGAACAGAGAAAACAAGATGATAAGAAAATCAAAGCTTGTGTTGAAGAAGTTACAACAAC +TCTGGAAGAAACTAAGTTCCTCACAGAAAACTTGTTACTTTATATTGACATTAATGGCAATCTTCATCCA +GATTCTGCCACTCTTGTTAGTGACATTGACATCACTTTCTTAAAGAAAGATGCTCCATATATAGTGGGTG +ATGTTGTTCAAGAGGGTGTTTTAACTGCTGTGGTTATACCTACTAAAAAGGCTGGTGGCACTACTGAAAT +GCTAGCGAAAGCTTTGAGAAAAGTGCCAACAGACAATTATATAACCACTTACCCGGGTCAGGGTTTAAAT +GGTTACACTGTAGAGGAGGCAAAGACAGTGCTTAAAAAGTGTAAAAGTGCCTTTTACATTCTACCATCTA +TTATCTCTAATGAGAAGCAAGAAATTCTTGGAACTGTTTCTTGGAATTTGCGAGAAATGCTTGCACATGC +AGAAGAAACACGCAAATTAATGCCTGTCTGTGTGGAAACTAAAGCCATAGTTTCAACTATACAGCGTAAA 
+TATAAGGGTATTAAAATACAAGAGGGTGTGGTTGATTATGGTGCTAGATTTTACTTTTACACCAGTAAAA +CAACTGTAGCGTCACTTATCAACACACTTAACGATCTAAATGAAACTCTTGTTACAATGCCACTTGGCTA +TGTAACACATGGCTTAAATTTGGAAGAAGCTGCTCGGTATATGAGATCTCTCAAAGTGCCAGCTACAGTT +TCTGTTTCTTCACCTGATGCTGTTACAGCGTATAATGGTTATCTTACTTCTTCTTCTAAAACACCTGAAG +AACATTTTATTGAAACCATCTCACTTGCTGGTTCCTATAAAGATTGGTCCTATTCTGGACAATCTACACA +ACTAGGTATAGAATTTCTTAAGAGAGGTGATAAAAGTGTATATTACACTAGTAATCCTACCACATTCCAC +CTAGATGGTGAAGTTATCACCTTTGACAATCTTAAGACACTTCTTTCTTTGAGAGAAGTGAGGACTATTA +AGGTGTTTACAACAGTAGACAACATTAACCTCCACACGCAAGTTGTGGACATGTCAATGACATATGGACA +ACAGTTTGGTCCAACTTATTTGGATGGAGCTGATGTTACTAAAATAAAACCTCATAATTCACATGAAGGT +AAAACATTTTATGTTTTACCTAATGATGACACTCTACGTGTTGAGGCTTTTGAGTACTACCACACAACTG +ATCCTAGTTTTCTGGGTAGGTACATGTCAGCATTAAATCACACTAAAAAGTGGAAATACCCACAAGTTAA +TGGTTTAACTTCTATTAAATGGGCAGATAACAACTGTTATCTTGCCACTGCATTGTTAACACTCCAACAA +ATAGAGTTGAAGTTTAATCCACCTGCTCTACAAGATGCTTATTACAGAGCAAGGGCTGGTGAAGCTGCTA +ACTTTTGTGCACTTATCTTAGCCTACTGTAATAAGACAGTAGGTGAGTTAGGTGATGTTAGAGAAACAAT +GAGTTACTTGTTTCAACATGCCAATTTAGATTCTTGCAAAAGAGTCTTGAACGTGGTGTGTAAAACTTGT +GGACAACAGCAGACAACCCTTAAGGGTGTAGAAGCTGTTATGTACATGGGCACACTTTCTTATGAACAAT +TTAAGAAAGGTGTTCAGATACCTTGTACGTGTGGTAAACAAGCTACAAAATATCTAGTACAACAGGAGTC +ACCTTTTGTTATGATGTCAGCACCACCTGCTCAGTATGAACTTAAGCATGGTACATTTACTTGTGCTAGT +GAGTACACTGGTAATTACCAGTGTGGTCACTATAAACATATAACTTCTAAAGAAACTTTGTATTGCATAG +ACGGTGCTTTACTTACAAAGTCCTCAGAATACAAAGGTCCTATTACGGATGTTTTCTACAAAGAAAACAG +TTACACAACAACCATAAAACCAGTTACTTATAAATTGGATGGTGTTGTTTGTACAGAAATTGACCCTAAG +TTGGACAATTATTATAAGAAAGACAATTCTTATTTCACAGAGCAACCAATTGATCTTGTACCAAACCAAC +CATATCCAAACGCAAGCTTCGATAATTTTAAGTTTGTATGTGATAATATCAAATTTGCTGATGATTTAAA +CCAGTTAACTGGTTATAAGAAACCTGCTTCAAGAGAGCTTAAAGTTACATTTTTCCCTGACTTAAATGGT +GATGTGGTGGCTATTGATTATAAACACTACACACCCTCTTTTAAGAAAGGAGCTAAATTGTTACATAAAC +CTATTGTTTGGCATGTTAACAATGCAACTAATAAAGCCACGTATAAACCAAATACCTGGTGTATACGTTG +TCTTTGGAGCACAAAACCAGTTGAAACATCAAATTCGTTTGATGTACTGAAGTCAGAGGACGCGCAGGGA 
+ATGGATAATCTTGCCTGCGAAGATCTAAAACCAGTCTCTGAAGAAGTAGTGGAAAATCCTACCATACAGA +AAGACGTTCTTGAGTGTAATGTGAAAACTACCGAAGTTGTAGGAGACATTATACTTAAACCAGCAAATAA +TAGTTTAAAAATTACAGAAGAGGTTGGCCACACAGATCTAATGGCTGCTTATGTAGACAATTCTAGTCTT +ACTATTAAGAAACCTAATGAATTATCTAGAGTATTAGGTTTGAAAACCCTTGCTACTCATGGTTTAGCTG +CTGTTAATAGTGTCCCTTGGGATACTATAGCTAATTATGCTAAGCCTTTTCTTAACAAAGTTGTTAGTAC +AACTACTAACATAGTTACACGGTGTTTAAACCGTGTTTGTACTAATTATATGCCTTATTTCTTTACTTTA +TTGCTACAATTGTGTACTTTTACTAGAAGTACAAATTCTAGAATTAAAGCATCTATGCCGACTACTATAG +CAAAGAATACTGTTAAGAGTGTCGGTAAATTTTGTCTAGAGGCTTCATTTAATTATTTGAAGTCACCTAA +TTTTTCTAAACTGATAAATATTATAATTTGGTTTTTACTATTAAGTGTTTGCCTAGGTTCTTTAATCTAC +TCAACCGCTGCTTTAGGTGTTTTAATGTCTAATTTAGGCATGCCTTCTTACTGTACTGGTTACAGAGAAG +GCTATTTGAACTCTACTAATGTCACTATTGCAACCTACTGTACTGGTTCTATACCTTGTAGTGTTTGTCT +TAGTGGTTTAGATTCTTTAGACACCTATCCTTCTTTAGAAACTATACAAATTACCATTTCATCTTTTAAA +TGGGATTTAACTGCTTTTGGCTTAGTTGCAGAGTGGTTTTTGGCATATATTCTTTTCACTAGGTTTTTCT +ATGTACTTGGATTGGCTGCAATCATGCAATTGTTTTTCAGCTATTTTGCAGTACATTTTATTAGTAATTC +TTGGCTTATGTGGTTAATAATTAATCTTGTACAAATGGCCCCGATTTCAGCTATGGTTAGAATGTACATC +TTCTTTGCATCATTTTATTATGTATGGAAAAGTTATGTGCATGTTGTAGACGGTTGTAATTCATCAACTT +GTATGATGTGTTACAAACGTAATAGAGCAACAAGAGTCGAATGTACAACTATTGTTAATGGTGTTAGAAG +GTCCTTTTATGTCTATGCTAATGGAGGTAAAGGCTTTTGCAAACTACACAATTGGAATTGTGTTAATTGT +GATACATTCTGTGCTGGTAGTACATTTATTAGTGATGAAGTTGCGAGAGACTTGTCACTACAGTTTAAAA +GACCAATAAATCCTACTGACCAGTCTTCTTACATCGTTGATAGTGTTACAGTGAAGAATGGTTCCATCCA +TCTTTACTTTGATAAAGCTGGTCAAAAGACTTATGAAAGACATTCTCTCTCTCATTTTGTTAACTTAGAC +AACCTGAGAGCTAATAACACTAAAGGTTCATTGCCTATTAATGTTATAGTTTTTGATGGTAAATCAAAAT +GTGAAGAATCATCTGCAAAATCAGCGTCTGTTTACTACAGTCAGCTTATGTGTCAACCTATACTGTTACT +AGATCAGGCATTAGTGTCTGATGTTGGTGATAGTGCGGAAGTTGCAGTTAAAATGTTTGATGCTTACGTT +AATACGTTTTCATCAACTTTTAACGTACCAATGGAAAAACTCAAAACACTAGTTGCAACTGCAGAAGCTG +AACTTGCAAAGAATGTGTCCTTAGACAATGTCTTATCTACTTTTATTTCAGCAGCTCGGCAAGGGTTTGT +TGATTCAGATGTAGAAACTAAAGATGTTGTTGAATGTCTTAAATTGTCACATCAATCTGACATAGAAGTT 
+ACTGGCGATAGTTGTAATAACTATATGCTCACCTATAACAAAGTTGAAAACATGACACCCCGTGACCTTG +GTGCTTGTATTGACTGTAGTGCGCGTCATATTAATGCGCAGGTAGCAAAAAGTCACAACATTGCTTTGAT +ATGGAACGTTAAAGATTTCATGTCATTGTCTGAACAACTACGAAAACAAATACGTAGTGCTGCTAAAAAG +AATAACTTACCTTTTAAGTTGACATGTGCAACTACTAGACAAGTTGTTAATGTTGTAACAACAAAGATAG +CACTTAAGGGTGGTAAAATTGTTAATAATTGGTTGAAGCAGTTAATTAAAGTTACACTTGTGTTCCTTTT +TGTTGCTGCTATTTTCTATTTAATAACACCTGTTCATGTCATGTCTAAACATACTGACTTTTCAAGTGAA +ATCATAGGATACAAGGCTATTGATGGTGGTGTCACTCGTGACATAGCATCTACAGATACTTGTTTTGCTA +ACAAACATGCTGATTTTGACACATGGTTTAGCCAGCGTGGTGGTAGTTATACTAATGACAAAGCTTGCCC +ATTGATTGCTGCAGTCATAACAAGAGAAGTGGGTTTTGTCGTGCCTGGTTTGCCTGGCACGATATTACGC +ACAACTAATGGTGACTTTTTGCATTTCTTACCTAGAGTTTTTAGTGCAGTTGGTAACATCTGTTACACAC +CATCAAAACTTATAGAGTACACTGACTTTGCAACATCAGCTTGTGTTTTGGCTGCTGAATGTACAATTTT +TAAAGATGCTTCTGGTAAGCCAGTACCATATTGTTATGATACCAATGTACTAGAAGGTTCTGTTGCTTAT +GAAAGTTTACGCCCTGACACACGTTATGTGCTCATGGATGGCTCTATTATTCAATTTCCTAACACCTACC +TTGAAGGTTCTGTTAGAGTGGTAACAACTTTTGATTCTGAGTACTGTAGGCACGGCACTTGTGAAAGATC +AGAAGCTGGTGTTTGTGTATCTACTAGTGGTAGATGGGTACTTAACAATGATTATTACAGATCTTTACCA +GGAGTTTTCTGTGGTGTAGATGCTGTAAATTTACTTACTAATATGTTTACACCACTAATTCAACCTATTG +GTGCTTTGGACATATCAGCATCTATAGTAGCTGGTGGTATTGTAGCTATCGTAGTAACATGCCTTGCCTA +CTATTTTATGAGGTTTAGAAGAGCTTTTGGTGAATACAGTCATGTAGTTGCCTTTAATACTTTACTATTC +CTTATGTCATTCACTGTACTCTGTTTAACACCAGTTTACTCATTCTTACCTGGTGTTTATTCTGTTATTT +ACTTGTACTTGACATTTTATCTTACTAATGATGTTTCTTTTTTAGCACATATTCAGTGGATGGTTATGTT +CACACCTTTAGTACCTTTCTGGATAACAATTGCTTATATCATTTGTATTTCCACAAAGCATTTCTATTGG +TTCTTTAGTAATTACCTAAAGAGACGTGTAGTCTTTAATGGTGTTTCCTTTAGTACTTTTGAAGAAGCTG +CGCTGTGCACCTTTTTGTTAAATAAAGAAATGTATCTAAAGTTGCGTAGTGATGTGCTATTACCTCTTAC +GCAATATAATAGATACTTAGCTCTTTATAATAAGTACAAGTATTTTAGTGGAGCAATGGATACAACTAGC +TACAGAGAAGCTGCTTGTTGTCATCTCGCAAAGGCTCTCAATGACTTCAGTAACTCAGGTTCTGATGTTC +TTTACCAACCACCACAAACCTCTATCACCTCAGCTGTTTTGCAGAGTGGTTTTAGAAAAATGGCATTCCC +ATCTGGTAAAGTTGAGGGTTGTATGGTACAAGTAACTTGTGGTACAACTACACTTAACGGTCTTTGGCTT 
+GATGACGTAGTTTACTGTCCAAGACATGTGATCTGCACCTCTGAAGACATGCTTAACCCTAATTATGAAG +ATTTACTCATTCGTAAGTCTAATCATAATTTCTTGGTACAGGCTGGTAATGTTCAACTCAGGGTTATTGG +ACATTCTATGCAAAATTGTGTACTTAAGCTTAAGGTTGATACAGCCAATCCTAAGACACCTAAGTATAAG +TTTGTTCGCATTCAACCAGGACAGACTTTTTCAGTGTTAGCTTGTTACAATGGTTCACCATCTGGTGTTT +ACCAATGTGCTATGAGGCCCAATTTCACTATTAAGGGTTCATTCCTTAATGGTTCATGTGGTAGTGTTGG +TTTTAACATAGATTATGACTGTGTCTCTTTTTGTTACATGCACCATATGGAATTACCAACTGGAGTTCAT +GCTGGCACAGACTTAGAAGGTAACTTTTATGGACCTTTTGTTGACAGGCAAACAGCACAAGCAGCTGGTA +CGGACACAACTATTACAGTTAATGTTTTAGCTTGGTTGTACGCTGCTGTTATAAATGGAGACAGGTGGTT +TCTCAATCGATTTACCACAACTCTTAATGACTTTAACCTTGTGGCTATGAAGTACAATTATGAACCTCTA +ACACAAGACCATGTTGACATACTAGGACCTCTTTCTGCTCAAACTGGAATTGCCGTTTTAGATATGTGTG +CTTCATTAAAAGAATTACTGCAAAATGGTATGAATGGACGTACCATATTGGGTAGTGCTTTATTAGAAGA +TGAATTTACACCTTTTGATGTTGTTAGACAATGCTCAGGTGTTACTTTCCAAAGTGCAGTGAAAAGAACA +ATCAAGGGTACACACCACTGGTTGTTACTCACAATTTTGACTTCACTTTTAGTTTTAGTCCAGAGTACTC +AATGGTCTTTGTTCTTTTTTTTGTATGAAAATGCCTTTTTACCTTTTGCTATGGGTATTATTGCTATGTC +TGCTTTTGCAATGATGTTTGTCAAACATAAGCATGCATTTCTCTGTTTGTTTTTGTTACCTTCTCTTGCC +ACTGTAGCTTATTTTAATATGGTCTATATGCCTGCTAGTTGGGTGATGCGTATTATGACATGGTTGGATA +TGGTTGATACTAGTTTGTCTGGTTTTAAGCTAAAAGACTGTGTTATGTATGCATCAGCTGTAGTGTTACT +AATCCTTATGACAGCAAGAACTGTGTATGATGATGGTGCTAGGAGAGTGTGGACACTTATGAATGTCTTG +ACACTCGTTTATAAAGTTTATTATGGTAATGCTTTAGATCAAGCCATTTCCATGTGGGCTCTTATAATCT +CTGTTACTTCTAACTACTCAGGTGTAGTTACAACTGTCATGTTTTTGGCCAGAGGTATTGTTTTTATGTG +TGTTGAGTATTGCCCTATTTTCTTCATAACTGGTAATACACTTCAGTGTATAATGCTAGTTTATTGTTTC +TTAGGCTATTTTTGTACTTGTTACTTTGGCCTCTTTTGTTTACTCAACCGCTACTTTAGACTGACTCTTG +GTGTTTATGATTACTTAGTTTCTACACAGGAGTTTAGATATATGAATTCACAGGGACTACTCCCACCCAA +GAATAGCATAGATGCCTTCAAACTCAACATTAAATTGTTGGGTGTTGGTGGCAAACCTTGTATCAAAGTA +GCCACTGTACAGTCTAAAATGTCAGATGTAAAGTGCACATCAGTAGTCTTACTCTCAGTTTTGCAACAAC +TCAGAGTAGAATCATCATCTAAATTGTGGGCTCAATGTGTCCAGTTACACAATGACATTCTCTTAGCTAA +AGATACTACTGAAGCCTTTGAAAAAATGGTTTCACTACTTTCTGTTTTGCTTTCCATGCAGGGTGCTGTA 
+GACATAAACAAGCTTTGTGAAGAAATGCTGGACAACAGGGCAACCTTACAAGCTATAGCCTCAGAGTTTA +GTTCCCTTCCATCATATGCAGCTTTTGCTACTGCTCAAGAAGCTTATGAGCAGGCTGTTGCTAATGGTGA +TTCTGAAGTTGTTCTTAAAAAGTTGAAGAAGTCTTTGAATGTGGCTAAATCTGAATTTGACCGTGATGCA +GCCATGCAACGTAAGTTGGAAAAGATGGCTGATCAAGCTATGACCCAAATGTATAAACAGGCTAGATCTG +AGGACAAGAGGGCAAAAGTTACTAGTGCTATGCAGACAATGCTTTTCACTATGCTTAGAAAGTTGGATAA +TGATGCACTCAACAACATTATCAACAATGCAAGAGATGGTTGTGTTCCCTTGAACATAATACCTCTTACA +ACAGCAGCCAAACTAATGGTTGTCATACCAGACTATAACACATATAAAAATACGTGTGATGGTACAACAT +TTACTTATGCATCAGCATTGTGGGAAATCCAACAGGTTGTAGATGCAGATAGTAAAATTGTTCAACTTAG +TGAAATTAGTATGGACAATTCACCTAATTTAGCATGGCCTCTTATTGTAACAGCTTTAAGGGCCAATTCT +GCTGTCAAATTACAGAATAATGAGCTTAGTCCTGTTGCACTACGACAGATGTCTTGTGCTGCCGGTACTA +CACAAACTGCTTGCACTGATGACAATGCGTTAGCTTACTACAACACAACAAAGGGAGGTAGGTTTGTACT +TGCACTGTTATCCGATTTACAGGATTTGAAATGGGCTAGATTCCCTAAGAGTGATGGAACTGGTACTATC +TATACAGAACTGGAACCACCTTGTAGGTTTGTTACAGACACACCTAAAGGTCCTAAAGTGAAGTATTTAT +ACTTTATTAAAGGATTAAACAACCTAAATAGAGGTATGGTACTTGGTAGTTTAGCTGCCACAGTACGTCT +ACAAGCTGGTAATGCAACAGAAGTGCCTGCCAATTCAACTGTATTATCTTTCTGTGCTTTTGCTGTAGAT +GCTGCTAAAGCTTACAAAGATTATCTAGCTAGTGGGGGACAACCAATCACTAATTGTGTTAAGATGTTGT +GTACACACACTGGTACTGGTCAGGCAATAACAGTTACACCGGAAGCCAATATGGATCAAGAATCCTTTGG +TGGTGCATCGTGTTGTCTGTACTGCCGTTGCCACATAGATCATCCAAATCCTAAAGGATTTTGTGACTTA +AAAGGTAAGTATGTACAAATACCTACAACTTGTGCTAATGACCCTGTGGGTTTTACACTTAAAAACACAG +TCTGTACCGTCTGCGGTATGTGGAAAGGTTATGGCTGTAGTTGTGATCAACTCCGCGAACCCATGCTTCA +GTCAGCTGATGCACAATCGTTTTTAAACGGGTTTGCGGTGTAAGTGCAGCCCGTCTTACACCGTGCGGCA +CAGGCACTAGTACTGATGTCGTATACAGGGCTTTTGACATCTACAATGATAAAGTAGCTGGTTTTGCTAA +ATTCCTAAAAACTAATTGTTGTCGCTTCCAAGAAAAGGACGAAGATGACAATTTAATTGATTCTTACTTT +GTAGTTAAGAGACACACTTTCTCTAACTACCAACATGAAGAAACAATTTATAATTTACTTAAGGATTGTC +CAGCTGTTGCTAAACATGACTTCTTTAAGTTTAGAATAGACGGTGACATGGTACCACATATATCACGTCA +ACGTCTTACTAAATACACAATGGCAGACCTCGTCTATGCTTTAAGGCATTTTGATGAAGGTAATTGTGAC +ACATTAAAAGAAATACTTGTCACATACAATTGTTGTGATGATGATTATTTCAATAAAAAGGACTGGTATG 
+ATTTTGTAGAAAACCCAGATATATTACGCGTATACGCCAACTTAGGTGAACGTGTACGCCAAGCTTTGTT +AAAAACAGTACAATTCTGTGATGCCATGCGAAATGCTGGTATTGTTGGTGTACTGACATTAGATAATCAA +GATCTCAATGGTAACTGGTATGATTTCGGTGATTTCATACAAACCACGCCAGGTAGTGGAGTTCCTGTTG +TAGATTCTTATTATTCATTGTTAATGCCTATATTAACCTTGACCAGGGCTTTAACTGCAGAGTCACATGT +TGACACTGACTTAACAAAGCCTTACATTAAGTGGGATTTGTTAAAATATGACTTCACGGAAGAGAGGTTA +AAACTCTTTGACCGTTATTTTAAATATTGGGATCAGACATACCACCCAAATTGTGTTAACTGTTTGGATG +ACAGATGCATTCTGCATTGTGCAAACTTTAATGTTTTATTCTCTACAGTGTTCCCACCTACAAGTTTTGG +ACCACTAGTGAGAAAAATATTTGTTGATGGTGTTCCATTTGTAGTTTCAACTGGATACCACTTCAGAGAG +CTAGGTGTTGTACATAATCAGGATGTAAACTTACATAGCTCTAGACTTAGTTTTAAGGAATTACTTGTGT +ATGCTGCTGACCCTGCTATGCACGCTGCTTCTGGTAATCTATTACTAGATAAACGCACTACGTGCTTTTC +AGTAGCTGCACTTACTAACAATGTTGCTTTTCAAACTGTCAAACCCGGTAATTTTAACAAAGACTTCTAT +GACTTTGCTGTGTCTAAGGGTTTCTTTAAGGAAGGAAGTTCTGTTGAATTAAAACACTTCTTCTTTGCTC +AGGATGGTAATGCTGCTATCAGCGATTATGACTACTATCGTTATAATCTACCAACAATGTGTGATATCAG +ACAACTACTATTTGTAGTTGAAGTTGTTGATAAGTACTTTGATTGTTACGATGGTGGCTGTATTAATGCT +AACCAAGTCATCGTCAACAACCTAGACAAATCAGCTGGTTTTCCATTTAATAAATGGGGTAAGGCTAGAC +TTTATTATGATTCAATGAGTTATGAGGATCAAGATGCACTTTTCGCATATACAAAACGTAATGTCATCCC +TACTATAACTCAAATGAATCTTAAGTATGCCATTAGTGCAAAGAATAGAGCTCGCACCGTAGCTGGTGTC +TCTATCTGTAGTACTATGACCAATAGACAGTTTCATCAAAAATTATTGAAATCAATAGCCGCCACTAGAG +GAGCTACTGTAGTAATTGGAACAAGCAAATTCTATGGTGGTTGGCACAACATGTTAAAAACTGTTTATAG +TGATGTAGAAAACCCTCACCTTATGGGTTGGGATTATCCTAAATGTGATAGAGCCATGCCTAACATGCTT +AGAATTATGGCCTCACTTGTTCTTGCTCGCAAACATACAACGTGTTGTAGCTTGTCACACCGTTTCTATA +GATTAGCTAATGAGTGTGCTCAAGTATTGAGTGAAATGGTCATGTGTGGCGGTTCACTATATGTTAAACC +AGGTGGAACCTCATCAGGAGATGCCACAACTGCTTATGCTAATAGTGTTTTTAACATTTGTCAAGCTGTC +ACGGCCAATGTTAATGCACTTTTATCTACTGATGGTAACAAAATTGCCGATAAGTATGTCCGCAATTTAC +AACACAGACTTTATGAGTGTCTCTATAGAAATAGAGATGTTGACACAGACTTTGTGAATGAGTTTTACGC +ATATTTGCGTAAACATTTCTCAATGATGATACTCTCTGACGATGCTGTTGTGTGTTTCAATAGCACTTAT +GCATCTCAAGGTCTAGTGGCTAGCATAAAGAACTTTAAGTCAGTTCTTTATTATCAAAACAATGTTTTTA 
+TGTCTGAAGCAAAATGTTGGACTGAGACTGACCTTACTAAAGGACCTCATGAATTTTGCTCTCAACATAC +AATGCTAGTTAAACAGGGTGATGATTATGTGTACCTTCCTTACCCAGATCCATCAAGAATCCTAGGGGCC +GGCTGTTTTGTAGATGATATCGTAAAAACAGATGGTACACTTATGATTGAACGGTTCGTGTCTTTAGCTA +TAGATGCTTACCCACTTACTAAACATCCTAATCAGGAGTATGCTGATGTCTTTCATTTGTACTTACAATA +CATAAGAAAGCTACATGATGAGTTAACAGGACACATGTTAGACATGTATTCTGTTATGCTTACTAATGAT +AACACTTCAAGGTATTGGGAACCTGAGTTTTATGAGGCTATGTACACACCGCATACAGTCTTACAGGCTG +TTGGGGCTTGTGTTCTTTGCAATTCACAGACTTCATTAAGATGTGGTGCTTGCATACGTAGACCATTCTT +ATGTTGTAAATGCTGTTACGACCATGTCATATCAACATCACATAAATTAGTCTTGTCTGTTAATCCGTAT +GTTTGCAATGCTCCAGGTTGTGATGTCACAGATGTGACTCAACTTTACTTAGGAGGTATGAGCTATTATT +GTAAATCACATAAACCACCCATTAGTTTTCCATTGTGTGCTAATGGACAAGTTTTTGGTTTATATAAAAA +TACATGTGTTGGTAGCGATAATGTTACTGACTTTAATGCAATTGCAACATGTGACTGGACAAATGCTGGT +GATTACATTTTAGCTAACACCTGTACTGAAAGACTCAAGCTTTTTGCAGCAGAAACGCTCAAAGCTACTG +AGGAGACATTTAAACTGTCTTATGGTATTGCTACTGTACGTGAAGTGCTGTCTGACAGAGAATTACATCT +TTCATGGGAAGTTGGTAAACCTAGACCACCACTTAACCGAAATTATGTCTTTACTGGTTATCGTGTAACT +AAAAACAGTAAAGTACAAATAGGAGAGTACACCTTTGAAAAAGGTGACTATGGTGATGCTGTTGTTTACC +GAGGTACAACAACTTACAAATTAAATGTTGGTGATTATTTTGTGCTGACATCACATACAGTAATGCCATT +AAGTGCACCTACACTAGTGCCACAAGAGCACTATGTTAGAATTACTGGCTTATACCCAACACTCAATATC +TCAGATGAGTTTTCTAGCAATGTTGCAAATTATCAAAAGGTTGGTATGCAAAAGTATTCTACACTCCAGG +GACCACCTGGTACTGGTAAGAGTCATTTTGCTATTGGCCTAGCTCTCTACTACCCTTCTGCTCGCATAGT +GTATACAGCTTGCTCTCATGCCGCTGTTGATGCACTATGTGAGAAGGCATTAAAATATTTGCCTATAGAT +AAATGTAGTAGAATTATACCTGCACGTGCTCGTGTAGAGTGTTTTGATAAATTCAAAGTGAATTCAACAT +TAGAACAGTATGTCTTTTGTACTGTAAATGCATTGCCTGAGACGACAGCAGATATAGTTGTCTTTGATGA +AATTTCAATGGCCACAAATTATGATTTGAGTGTTGTCAATGCCAGATTACGTGCTAAGCACTATGTGTAC +ATTGGCGACCCTGCTCAATTACCTGCACCACGCACATTGCTAACTAAGGGCACACTAGAACCAGAATATT +TCAATTCAGTGTGTAGACTTATGAAAACTATAGGTCCAGACATGTTCCTCGGAACTTGTCGGCGTTGTCC +TGCTGAAATTGTTGACACTGTGAGTGCTTTGGTTTATGATAATAAGCTTAAAGCACATAAAGACAAATCA +GCTCAATGCTTTAAAATGTTTTATAAGGGTGTTATCACGCATGATGTTTCATCTGCAATTAACAGGCCAC 
+AAATAGGCGTGGTAAGAGAATTCCTTACACGTAACCCTGCTTGGAGAAAAGCTGTCTTTATTTCACCTTA +TAATTCACAGAATGCTGTAGCCTCAAAGATTTTGGGACTACCAACTCAAACTGTTGATTCATCACAGGGC +TCAGAATATGACTATGTCATATTCACTCAAACCACTGAAACAGCTCACTCTTGTAATGTAAACAGATTTA +ATGTTGCTATTACCAGAGCAAAAGTAGGCATACTTTGCATAATGTCTGATAGAGACCTTTATGACAAGTT +GCAATTTACAAGTCTTGAAATTCCACGTAGGAATGTGGCAACTTTACAAGCTGAAAATGTAACAGGACTC +TTTAAAGATTGTAGTAAGGTAATCACTGGGTTACATCCTACACAGGCACCTACACACCTCAGTGTTGACA +CTAAATTCAAAACTGAAGGTTTATGTGTTGACATACCTGGCATACCTAAGGACATGACCTATAGAAGACT +CATCTCTATGATGGGTTTTAAAATGAATTATCAAGTTAATGGTTACCCTAACATGTTTATCACCCGCGAA +GAAGCTATAAGACATGTACGTGCATGGATTGGCTTCGATGTCGAGGGGTGTCATGCTACTAGAGAAGCTG +TTGGTACCAATTTACCTTTACAGCTAGGTTTTTCTACAGGTGTTAACCTAGTTGCTGTACCTACAGGTTA +TGTTGATACACCTAATAATACAGATTTTTCCAGAGTTAGTGCTAAACCACCGCCTGGAGATCAATTTAAA +CACCTCATACCACTTATGTACAAAGGACTTCCTTGGAATGTAGTGCGTATAAAGATTGTACAAATGTTAA +GTGACACACTTAAAAATCTCTCTGACAGAGTCGTATTTGTCTTATGGGCACATGGCTTTGAGTTGACATC +TATGAAGTATTTTGTGAAAATAGGACCTGAGCGCACCTGTTGTCTATGTGATAGACGTGCCACATGCTTT +TCCACTGCTTCAGACACTTATGCCTGTTGGCATCATTCTATTGGATTTGATTACGTCTATAATCCGTTTA +TGATTGATGTTCAACAATGGGGTTTTACAGGTAACCTACAAAGCAACCATGATCTGTATTGTCAAGTCCA +TGGTAATGCACATGTAGCTAGTTGTGATGCAATCATGACTAGGTGTCTAGCTGTCCACGAGTGCTTTGTT +AAGCGTGTTGACTGGACTATTGAATATCCTATAATTGGTGATGAACTGAAGATTAATGCGGCTTGTAGAA +AGGTTCAACACATGGTTGTTAAAGCTGCATTATTAGCAGACAAATTCCCAGTTCTTCACGACATTGGTAA +CCCTAAAGCTATTAAGTGTGTACCTCAAGCTGATGTAGAATGGAAGTTCTATGATGCACAGCCTTGTAGT +GACAAAGCTTATAAAATAGAAGAATTATTCTATTCTTATGCCACACATTCTGACAAATTCACAGATGGTG +TATGCCTATTTTGGAATTGCAATGTCGATAGATATCCTGCTAATTCCATTGTTTGTAGATTTGACACTAG +AGTGCTATCTAACCTTAACTTGCCTGGTTGTGATGGTGGCAGTTTGTATGTAAATAAACATGCATTCCAC +ACACCAGCTTTTGATAAAAGTGCTTTTGTTAATTTAAAACAATTACCATTTTTCTATTACTCTGACAGTC +CATGTGAGTCTCATGGAAAACAAGTAGTGTCAGATATAGATTATGTACCACTAAAGTCTGCTACGTGTAT +AACACGTTGCAATTTAGGTGGTGCTGTCTGTAGACATCATGCTAATGAGTACAGATTGTATCTCGATGCT +TATAACATGATGATCTCAGCTGGCTTTAGCTTGTGGGTTTACAAACAATTTGATACTTATAACCTCTGGA 
+ACACTTTTACAAGACTTCAGAGTTTAGAAAATGTGGCTTTTAATGTTGTAAATAAGGGACACTTTGATGG +ACAACAGGGTGAAGTACCAGTTTCTATCATTAATAACACTGTTTACACAAAAGTTGATGGTGTTGATGTA +GAATTGTTTGAAAATAAAACAACATTACCTGTTAATGTAGCATTTGAGCTTTGGGCTAAGCGCAACATTA +AACCAGTACCAGAGGTGAAAATACTCAATAATTTGGGTGTGGACATTGCTGCTAATACTGTGATCTGGGA +CTACAAAAGAGATGCTCCAGCACATATATCTACTATTGGTGTTTGTTCTATGACTGACATAGCCAAGAAA +CCAACTGAAACGATTTGTGCACCACTCACTGTCTTTTTTGATGGTAGAGTTGATGGTCAAGTAGACTTAT +TTAGAAATGCCCGTAATGGTGTTCTTATTACAGAAGGTAGTGTTAAAGGTTTACAACCATCTGTAGGTCC +CAAACAAGCTAGTCTTAATGGAGTCACATTAATTGGAGAAGCCGTAAAAACACAGTTCAATTATTATAAG +AAAGTTGATGGTGTTGTCCAACAATTACCTGAAACTTACTTTACTCAGAGTAGAAATTTACAAGAATTTA +AACCCAGGAGTCAAATGGAAATTGATTTCTTAGAATTAGCTATGGATGAATTCATTGAACGGTATAAATT +AGAAGGCTATGCCTTCGAACATATCGTTTATGGAGATTTTAGTCATAGTCAGTTAGGTGGTTTACATCTA +CTGATTGGACTAGCTAAACGTTTTAAGGAATCACCTTTTGAATTAGAAGATTTTATTCCTATGGACAGTA +CAGTTAAAAACTATTTCATAACAGATGCGCAAACAGGTTCATCTAAGTGTGTGTGTTCTGTTATTGATTT +ATTACTTGATGATTTTGTTGAAATAATAAAATCCCAAGATTTATCTGTAGTTTCTAAGGTTGTCAAAGTG +ACTATTGACTATACAGAAATTTCATTTATGCTTTGGTGTAAAGATGGCCATGTAGAAACATTTTACCCAA +AATTACAATCTAGTCAAGCGTGGCAACCGGGTGTTGCTATGCCTAATCTTTACAAAATGCAAAGAATGCT +ATTAGAAAAGTGTGACCTTCAAAATTATGGTGATAGTGCAACATTACCTAAAGGCATAATGATGAATGTC +GCAAAATATACTCAACTGTGTCAATATTTAAACACATTAACATTAGCTGTACCCTATAATATGAGAGTTA +TACATTTTGGTGCTGGTTCTGATAAAGGAGTTGCACCAGGTACAGCTGTTTTAAGACAGTGGTTGCCTAC +GGGTACGCTGCTTGTCGATTCAGATCTTAATGACTTTGTCTCTGATGCAGATTCAACTTTGATTGGTGAT +TGTGCAACTGTACATACAGCTAATAAATGGGATCTCATTATTAGTGATATGTACGACCCTAAGACTAAAA +ATGTTACAAAAGAAAATGACTCTAAAGAGGGTTTTTTCACTTACATTTGTGGGTTTATACAACAAAAGCT +AGCTCTTGGAGGTTCCGTGGCTATAAAGATAACAGAACATTCTTGGAATGCTGATCTTTATAAGCTCATG +GGACACTTCGCATGGTGGACAGCCTTTGTTACTAATGTGAATGCGTCATCATCTGAAGCATTTTTAATTG +GATGTAATTATCTTGGCAAACCACGCGAACAAATAGATGGTTATGTCATGCATGCAAATTACATATTTTG +GAGGAATACAAATCCAATTCAGTTGTCTTCCTATTCTTTATTTGACATGAGTAAATTTCCCCTTAAATTA +AGGGGTACTGCTGTTATGTCTTTAAAAGAAGGTCAAATCAATGATATGATTTTATCTCTTCTTAGTAAAG 
+GTAGACTTATAATTAGAGAAAACAACAGAGTTGTTATTTCTAGTGATGTTCTTGTTAACAACTAAACGAA +CAATGTTTGTTTTTCTTGTTTTATTGCCACTAGTCTCTAGTCAGTGTGTTAATCTTACAACCAGAACTCA +ATTACCCCCTGCATACACTAATTCTTTCACACGTGGTGTTTATTACCCTGACAAAGTTTTCAGATCCTCA +GTTTTACATTCAACTCAGGACTTGTTCTTACCTTTCTTTTCCAATGTTACTTGGTTCCATGCTATACATG +TCTCTGGGACCAATGGTACTAAGAGGTTTGATAACCCTGTCCTACCATTTAATGATGGTGTTTATTTTGC +TTCCACTGAGAAGTCTAACATAATAAGAGGCTGGATTTTTGGTACTACTTTAGATTCGAAGACCCAGTCC +CTACTTATTGTTAATAACGCTACTAATGTTGTTATTAAAGTCTGTGAATTTCAATTTTGTAATGATCCAT +TTTTGGGTGTTTATTACCACAAAAACAACAAAAGTTGGATGGAAAGTGAGTTCAGAGTTTATTCTAGTGC +GAATAATTGCACTTTTGAATATGTCTCTCAGCCTTTTCTTATGGACCTTGAAGGAAAACAGGGTAATTTC +AAAAATCTTAGGGAATTTGTGTTTAAGAATATTGATGGTTATTTTAAAATATATTCTAAGCACACGCCTA +TTAATTTAGTGCGTGATCTCCCTCAGGGTTTTTCGGCTTTAGAACCATTGGTAGATTTGCCAATAGGTAT +TAACATCACTAGGTTTCAAACTTTACTTGCTTTACATAGAAGTTATTTGACTCCTGGTGATTCTTCTTCA +GGTTGGACAGCTGGTGCTGCAGCTTATTATGTGGGTTATCTTCAACCTAGGACTTTTCTATTAAAATATA +ATGAAAATGGAACCATTACAGATGCTGTAGACTGTGCACTTGACCCTCTCTCAGAAACAAAGTGTACGTT +GAAATCCTTCACTGTAGAAAAAGGAATCTATCAAACTTCTAACTTTAGAGTCCAACCAACAGAATCTATT +GTTAGATTTCCTAATATTACAAACTTGTGCCCTTTTGGTGAAGTTTTTAACGCCACCAGATTTGCATCTG +TTTATGCTTGGAACAGGAAGAGAATCAGCAACTGTGTTGCTGATTATTCTGTCCTATATAATTCCGCATC +ATTTTCCACTTTTAAGTGTTATGGAGTGTCTCCTACTAAATTAAATGATCTCTGCTTTACTAATGTCTAT +GCAGATTCATTTGTAATTAGAGGTGATGAAGTCAGACAAATCGCTCCAGGGCAAACTGGAAAGATTGCTG +ATTATAATTATAAATTACCAGATGATTTTACAGGCTGCGTTATAGCTTGGAATTCTAACAATCTTGATTC +TAAGGTTGGTGGTAATTATAATTACCTGTATAGATTGTTTAGGAAGTCTAATCTCAAACCTTTTGAGAGA +GATATTTCAACTGAAATCTATCAGGCCGGTAGCACACCTTGTAATGGTGTTGAAGGTTTTAATTGTTACT +TTCCTTTACAATCATATGGTTTCCAACCCACTAATGGTGTTGGTTACCAACCATACAGAGTAGTAGTACT +TTCTTTTGAACTTCTACATGCACCAGCAACTGTTTGTGGACCTAAAAAGTCTACTAATTTGGTTAAAAAC +AAATGTGTCAATTTCAACTTCAATGGTTTAACAGGCACAGGTGTTCTTACTGAGTCTAACAAAAAGTTTC +TGCCTTTCCAACAATTTGGCAGAGACATTGCTGACACTACTGATGCTGTCCGTGATCCACAGACACTTGA +GATTCTTGACATTACACCATGTTCTTTTGGTGGTGTCAGTGTTATAACACCAGGAACAAATACTTCTAAC 
+CAGGTTGCTGTTCTTTATCAGGATGTTAACTGCACAGAAGTCCCTGTTGCTATTCATGCAGATCAACTTA +CTCCTACTTGGCGTGTTTATTCTACAGGTTCTAATGTTTTTCAAACACGTGCAGGCTGTTTAATAGGGGC +TGAACATGTCAACAACTCATATGAGTGTGACATACCCATTGGTGCAGGTATATGCGCTAGTTATCAGACT +CAGACTAATTCTCCTCGGCGGGCACGTAGTGTAGCTAGTCAATCCATCATTGCCTACACTATGTCACTTG +GTGCAGAAAATTCAGTTGCTTACTCTAATAACTCTATTGCCATACCCACAAATTTTACTATTAGTGTTAC +CACAGAAATTCTACCAGTGTCTATGACCAAGACATCAGTAGATTGTACAATGTACATTTGTGGTGATTCA +ACTGAATGCAGCAATCTTTTGTTGCAATATGGCAGTTTTTGTACACAATTAAACCGTGCTTTAACTGGAA +TAGCTGTTGAACAAGACAAAAACACCCAAGAAGTTTTTGCACAAGTCAAACAAATTTACAAAACACCACC +AATTAAAGATTTTGGTGGTTTTAATTTTTCACAAATATTACCAGATCCATCAAAACCAAGCAAGAGGTCA +TTTATTGAAGATCTACTTTTCAACAAAGTGACACTTGCAGATGCTGGCTTCATCAAACAATATGGTGATT +GCCTTGGTGATATTGCTGCTAGAGACCTCATTTGTGCACAAAAGTTTAACGGCCTTACTGTTTTGCCACC +TTTGCTCACAGATGAAATGATTGCTCAATACACTTCTGCACTGTTAGCGGGTACAATCACTTCTGGTTGG +ACCTTTGGTGCAGGTGCTGCATTACAAATACCATTTGCTATGCAAATGGCTTATAGGTTTAATGGTATTG +GAGTTACACAGAATGTTCTCTATGAGAACCAAAAATTGATTGCCAACCAATTTAATAGTGCTATTGGCAA +AATTCAAGACTCACTTTCTTCCACAGCAAGTGCACTTGGAAAACTTCAAGATGTGGTCAACCAAAATGCA +CAAGCTTTAAACACGCTTGTTAAACAACTTAGCTCCAATTTTGGTGCAATTTCAAGTGTTTTAAATGATA +TCCTTTCACGTCTTGACAAAGTTGAGGCTGAAGTGCAAATTGATAGGTTGATCACAGGCAGACTTCAAAG +TTTGCAGACATATGTGACTCAACAATTAATTAGAGCTGCAGAAATCAGAGCTTCTGCTAATCTTGCTGCT +ACTAAAATGTCAGAGTGTGTACTTGGACAATCAAAAAGAGTTGATTTTTGTGGAAAGGGCTATCATCTTA +TGTCCTTCCCTCAGTCAGCACCTCATGGTGTAGTCTTCTTGCATGTGACTTATGTCCCTGCACAAGAAAA +GAACTTCACAACTGCTCCTGCCATTTGTCATGATGGAAAAGCACACTTTCCTCGTGAAGGTGTCTTTGTT +TCAAATGGCACACACTGGTTTGTAACACAAAGGAATTTTTATGAACCACAAATCATTACTACAGACAACA +CATTTGTGTCTGGTAACTGTGATGTTGTAATAGGAATTGTCAACAACACAGTTTATGATCCTTTGCAACC +TGAATTAGACTCATTCAAGGAGGAGTTAGATAAATATTTTAAGAATCATACATCACCAGATGTTGATTTA +GGTGACATCTCTGGCATTAATGCTTCAGTTGTAAACATTCAAAAAGAAATTGACCGCCTCAATGAGGTTG +CCAAGAATTTAAATGAATCTCTCATCGATCTCCAAGAACTTGGAAAGTATGAGCAGTATATAAAATGGCC +ATGGTACATTTGGCTAGGTTTTATAGCTGGCTTGATTGCCATAGTAATGGTGACAATTATGCTTTGCTGT 
+ATGACCAGTTGCTGTAGTTGTCTCAAGGGCTGTTGTTCTTGTGGATCCTGCTGCAAATTTGATGAAGACG +ACTCTGAGCCAGTGCTCAAAGGAGTCAAATTACATTACACATAAACGAACTTATGGATTTGTTTATGAGA +ATCTTCACAATTGGAACTGTAACTTTGAAGCAAGGTGAAATCAAGGATGCTACTCCTTCAGATTTTGTTC +GCGCTACTGCAACGATACCGATACAAGCCTCACTCCCTTTCGGATGGCTTATTGTTGGCGTTGCACTTCT +TGCTGTTTTTCAGAGCGCTTCCAAAATCATAACCCTCAAAAAGAGATGGCAACTAGCACTCTCCAAGGGT +GTTCACTTTGTTTGCAACTTGCTGTTGTTGTTTGTAACAGTTTACTCACACCTTTTGCTCGTTGCTGCTG +GCCTTGAAGCCCCTTTTCTCTATCTTTATGCTTTAGTCTACTTCTTGCAGAGTATAAACTTTGTAAGAAT +AATAATGAGGCTTTGGCTTTGCTGGAAATGCCGTTCCAAAAACCCATTACTTTATGATGCCAACTATTTT +CTTTGCTGGCATACTAATTGTTACGACTATTGTATACCTTACAATAGTGTAACTTCTTCAATTGTCATTA +CTTCAGGTGATGGCACAACAAGTCCTATTTCTGAACATGACTACCAGATTGGTGGTTATACTGAAAAATG +GGAATCTGGAGTAAAAGACTGTGTTGTATTACACAGTTACTTCACTTCAGACTATTACCAGCTGTACTCA +ACTCAATTGAGTACAGACACTGGTGTTGAACATGTTACCTTCTTCATCTACAATAAAATTGTTGATGAGC +CTGAAGAACATGTCCAAATTCACACAATCGACGGTTCATCCGGAGTTGTTAATCCAGTAATGGAACCAAT +TTATGATGAACCGACGACGACTACTAGCGTGCCTTTGTAAGCACAAGCTGATGAGTACGAACTTATGTAC +TCATTCGTTTCGGAAGAGACAGGTACGTTAATAGTTAATAGCGTACTTCTTTTTCTTGCTTTCGTGGTAT +TCTTGCTAGTTACACTAGCCATCCTTACTGCGCTTCGATTGTGTGCGTACTGCTGCAATATTGTTAACGT +GAGTCTTGTAAAACCTTCTTTTTACGTTTACTCTCGTGTTAAAAATCTGAATTCTTCTAGAGTTCCTGAT +CTTCTGGTCTAAACGAACTAAATATTATATTAGTTTTTCTGTTTGGAACTTTAATTTTAGCCATGGCAGA +TTCCAACGGTACTATTACCGTTGAAGAGCTTAAAAAGCTCCTTGAACAATGGAACCTAGTAATAGGTTTC +CTATTCCTTACATGGATTTGTCTTCTACAATTTGCCTATGCCAACAGGAATAGGTTTTTGTATATAATTA +AGTTAATTTTCCTCTGGCTGTTATGGCCAGTAACTTTAGCTTGTTTTGTGCTTGCTGCTGTTTACAGAAT +AAATTGGATCACCGGTGGAATTGCTATCGCAATGGCTTGTCTTGTAGGCTTGATGTGGCTCAGCTACTTC +ATTGCTTCTTTCAGACTGTTTGCGCGTACGCGTTCCATGTGGTCATTCAATCCAGAAACTAACATTCTTC +TCAACGTGCCACTCCATGGCACTATTCTGACCAGACCGCTTCTAGAAAGTGAACTCGTAATCGGAGCTGT +GATCCTTCGTGGACATCTTCGTATTGCTGGACACCATCTAGGACGCTGTGACATCAAGGACCTGCCTAAA +GAAATCACTGTTGCTACATCACGAACGCTTTCTTATTACAAATTGGGAGCTTCGCAGCGTGTAGCAGGTG +ACTCAGGTTTTGCTGCATACAGTCGCTACAGGATTGGCAACTATAAATTAAACACAGACCATTCCAGTAG 
+CAGTGACAATATTGCTTTGCTTGTACAGTAAGTGACAACAGATGTTTCATCTCGTTGACTTTCAGGTTAC +TATAGCAGAGATATTACTAATTATTATGAGGACTTTTAAAGTTTCCATTTGGAATCTTGATTACATCATA +AACCTCATAATTAAAAATTTATCTAAGTCACTAACTGAGAATAAATATTCTCAATTAGATGAAGAGCAAC +CAATGGAGATTGATTAAACGAACATGAAAATTATTCTTTTCTTGGCACTGATAACACTCGCTACTTGTGA +GCTTTATCACTACCAAGAGTGTGTTAGAGGTACAACAGTACTTTTAAAAGAACCTTGCTCTTCTGGAACA +TACGAGGGCAATTCACCATTTCATCCTCTAGCTGATAACAAATTTGCACTGACTTGCTTTAGCACTCAAT +TTGCTTTTGCTTGTCCTGACGGCGTAAAACACGTCTATCAGTTACGTGCCAGATCAGTTTCACCTAAACT +GTTCATCAGACAAGAGGAAGTTCAAGAACTTTACTCTCCAATTTTTCTTATTGTTGCGGCAATAGTGTTT +ATAACACTTTGCTTCACACTCAAAAGAAAGACAGAATGATTGAACTTTCATTAATTGACTTCTATTTGTG +CTTTTTAGCCTTTCTGCTATTCCTTGTTTTAATTATGCTTATTATCTTTTGGTTCTCACTTGAACTGCAA +GATCATAATGAAACTTGTCACGCCTAAACGAACATGAAATTTCTTGTTTTCTTAGGAATCATCACAACTG +TAGCTGCATTTCACCAAGAATGTAGTTTACAGTCATGTACTCAACATCAACCATATGTAGTTGATGACCC +GTGTCCTATTCACTTCTATTCTAAATGGTATATTAGAGTAGGAGCTAGAAAATCAGCACCTTTAATTGAA +TTGTGCGTGGATGAGGCTGGTTCTAAATCACCCATTCAGTACATCGATATCGGTAATTATACAGTTTCCT +GTTTACCTTTTACAATTAATTGCCAGGAACCTAAATTGGGTAGTCTTGTAGTGCGTTGTTCGTTCTATGA +AGACTTTTTAGAGTATCATGACGTTCGTGTTGTTTTAGATTTCATCTAAACGAACAAACTAAAATGTCTG +ATAATGGACCCCAAAATCAGCGAAATGCACCCCGCATTACGTTTGGTGGACCCTCAGATTCAACTGGCAG +TAACCAGAATGGAGAACGCAGTGGGGCGCGATCAAAACAACGTCGGCCCCAAGGTTTACCCAATAATACT +GCGTCTTGGTTCACCGCTCTCACTCAACATGGCAAGGAAGACCTTAAATTCCCTCGAGGACAAGGCGTTC +CAATTAACACCAATAGCAGTCCAGATGACCAAATTGGCTACTACCGAAGAGCTACCAGACGAATTCGTGG +TGGTGACGGTAAAATGAAAGATCTCAGTCCAAGATGGTATTTCTACTACCTAGGAACTGGGCCAGAAGCT +GGACTTCCCTATGGTGCTAACAAAGACGGCATCATATGGGTTGCAACTGAGGGAGCCTTGAATACACCAA +AAGATCACATTGGCACCCGCAATCCTGCTAACAATGCTGCAATCGTGCTACAACTTCCTCAAGGAACAAC +ATTGCCAAAAGGCTTCTACGCAGAAGGGAGCAGAGGCGGCAGTCAAGCCTCTTCTCGTTCCTCATCACGT +AGTCGCAACAGTTCAAGAAATTCAACTCCAGGCAGCAGTAGGGGAACTTCTCCTGCTAGAATGGCTGGCA +ATGGCGGTGATGCTGCTCTTGCTTTGCTGCTGCTTGACAGATTGAACCAGCTTGAGAGCAAAATGTCTGG +TAAAGGCCAACAACAACAAGGCCAAACTGTCACTAAGAAATCTGCTGCTGAGGCTTCTAAGAAGCCTCGG 
+CAAAAACGTACTGCCACTAAAGCATACAATGTAACACAAGCTTTCGGCAGACGTGGTCCAGAACAAACCC +AAGGAAATTTTGGGGACCAGGAACTAATCAGACAAGGAACTGATTACAAACATTGGCCGCAAATTGCACA +ATTTGCCCCCAGCGCTTCAGCGTTCTTCGGAATGTCGCGCATTGGCATGGAAGTCACACCTTCGGGAACG +TGGTTGACCTACACAGGTGCCATCAAATTGGATGACAAAGATCCAAATTTCAAAGATCAAGTCATTTTGC +TGAATAAGCATATTGACGCATACAAAACATTCCCACCAACAGAGCCTAAAAAGGACAAAAAGAAGAAGGC +TGATGAAACTCAAGCCTTACCGCAGAGACAGAAGAAACAGCAAACTGTGACTCTTCTTCCTGCTGCAGAT +TTGGATGATTTCTCCAAACAATTGCAACAATCCATGAGCAGTGCTGACTCAACTCAGGCCTAAACTCATG +CAGACCACACAAGGCAGATGGGCTATATAAACGTTTTCGCTTTTCCGTTTACGATATATAGTCTACTCTT +GTGCAGAATGAATTCTCGTAACTACATAGCACAAGTAGATGTAGTTAACTTTAATCTCACATAGCAATCT +TTAATCAGTGTGTAACATTAGGGAGGACTTGAAAGAGCCACCACATTTTCACCGAGGCCACGCGGAGTAC +GATCGAGTGTACAGTGAACAATGCTAGGGAGAGCTGCCTATATGGAAGAGCCCTAATGTGTAAAATTAAT +TTTAGTAGTGCTATCCCCATGTGATTTTAATAGCTTCTTAGGAGAATGACAAAAAAAAAAAAAAAAAAAA +AAAAAAAAAAAAA +