!transposon_sequence_set.README.v9.42 !February 3 2009 !Comments, corrections to http://flybase.org/cgi-bin/mailto-fbhelp.html TRANSPOSON SEQUENCE CANONICAL SETS FOR DROSOPHILA This is a file of 'canonical' sequences of the transposable elements from Drosophila. History: These sequences were originally compiled by Takis Benos (EBI), Leyla Bayraktaroglu (Harvard) and Michael Ashburner (EBI & Cambridge) with help from Aubrey de Grey (Cambridge), Joe Chillemi (Harvard) and Martin Reese (LBL). We thank Suzi Lewis (Berkeley) for inspiration and discussion, and Guochun Liao (Berkeley) for his repeat sequence set and newly discovered transposable element sequences from the Berkeley P1 clones, and Lynn Crosby (Harvard) for her annotations of some elements. Subsequent curation of these sequences has been in the context of the Drosophila Genome Project and was a collaboration between M. Ashburner (Cambridge), Josh Kaminker (Berkeley) and Casey Bergman (Berkeley). From Version 8.0 this set has been maintained by Michael Ashburner and Casey Bergman in Cambridge. We thank Margi Butler, Elena Casacuberta, Madeline Crosby, Bob Levis, Mary-Lou Pardue, Kevin O'Hare, Horacio Naveira, Dmitri Petrov, Steve Schaeffer, Todd Schlenke, Alfredo Villesante & the authors of REBASE for sequences and/or annotations. =============================================================================== March 16 2004. v.8.0 ==================== 1. Updates of FB identifiers. 2. Added from REPBASE (drorep.ref.4.5.3, January 2003): BS3, BS4, Doc4-element, Doc5-element, Fw2, Fw3, Helitron, R1-2, Tc1-2, G5A, G7, accord2, gypsy7, gypsy8, gypsy9, gypsy10, gypsy11, gypsy12, invader6. 3. ORF1 of Juan added from Repbase record. 4. A new line for synonyms has been introduced into records, e.g.: SY synonym:BEL This is not a complete list of synonyms (for which see FlyBase), just those in widespread use. April 6 2004. v.9.0 =================== 1. New records for elements from species other than D. melanogaster: Dana\Tom, Dvir\Dv, Dhyd\Bungy, Dbuz\Osvaldo, Dkoe\Gandalf, Dmau\mariner, Dhyd\Minos, Dfun\Isfun-1, Dsub\bilbo, Dsil\Loa, Dhet\Uhu, Dvir\Ulysses, Dsim\ninja, Dvir\Helena, Dvir\Penelope, Dvir\Tv1, Dvir\Tel, Dmir\TRAM, Dmir\TRIM, Dvir\Paris, Dmir\spock, Dmir\worf, Dwil\Vege, Dwil\Mar, Damb\P-element_T, Dbif\P-element_M, Dbif\P-element_O, Dsub\SGM, Ddip\Bari1, Dpse\mini-me, Dbuz\BuT1, Dbuz\BuT2, Dbuz\BuT3, Dbuz\BuT4, Dbuz\BuT5, Dbuz\BuT6, Dbuz\INE-1, Dbuz\ISBu2, Dbuz\Galileo, Dbuz\Kepler, Dbuz\Newton, Dyak\TART, Dyak\HeT-A, Dvir\TART, Dvir\HeT-A. 2. D. melanogaster Helena consensus sequence added. The Circe sequence has been replaced from a consensus from A. Villesante. 3. The micropia sequence has been replaced by one that lacks the 4bp deletion within the CDS present in the previous record. 4. New sequences for TART-A and TART-C subfamilies added (the only previous TART sequence (U14101) was TART-B). The sequences for Tc3 & Beagle2 have been extracted from the R3 genome and added. The sequence of the Q element has been extracted from the R1 genome and added. 5. Annotations improved for some of the sequences. 6. All annotations now use SO terms. The syntax is: FT SO_feature ; :.. e.g.: FT SO_feature five_prime_UTR ; SO:0000204:1..730 April 7 2004. v.9.1 =================== 1. The following R1-element variants have been added: Dnet\R1A, Dtak\R1A2, Dmer\R1A3, Dnet\R1B. The original melanogaster element has been re-named:R1A1-element. April 7 2004. v.9.2 =================== 1. The Dbuz\ISBu3, Dsub\GEM and Dvir\Uvir elements have been added. April 18 2004. v.9.2.2 ====================== 1. Update of annotation of TART-C. 2. Dtei\I-element sequence added. May 1 2004. v.9.2.3 =================== 1. Added prygun as a synonym of Tirant. June 23 2004. v.9.2.4 ===================== 1. accord2 and qbert found to be the same element; accord2 sequence (AC008256) removed and qbert sequence (AF541947) renamed as accord2. September 3 2004. v.9.2.5 ========================= 1. Added Doc5-element as synonym of Porto1. December 10 2004. v.9.3 ======================= 1. Added the cDNA sequence for a D. melanogaster Osvaldo-like element. April 22 2005. v.9.4 ==================== 1. Updates on FBgn identifiers. 2. Added sequence of the TAHRE element. April 22 2005. v.9.41 ===================== 1. A formatting error in the Helana sequence corrected. February 3 2009. v.9.42 ===================== 1. Dmel\TART-A, Dmel\TART-B, and Dmel\TART-C annotations corrected, as per Mary-Lou Pardue and Greg DeBaryshe. 2. Dmel\TAHRE annotation corrected, as per Alfredo Villasante. ===================================================================== The current data set includes 179 elements: FB gene ID Symbol EMBL Size Comment Retroviral elements: FBgn0000004 17.6 X01472 7439bp complete FBgn0000007 1731 X07656 4648bp complete FBgn0000005 297 X03431 6995bp complete FBgn0005384 3S18 U23420 6126bp complete FBgn0000006 412 nnnnnnnn 7567bp complete FBgn0063447 accord nnnnnnnn 7404bp complete FBgn0063782 accord2 AF541947 7650bp complete FBgn0010103 aurora-element AB022762 4263bp ?complete FBgn0000199 blood nnnnnnnn 7410bp complete FBgn0010302 Burdock U89994 6411bp complete FBgn0022937 Circe nnnnnnnn 7450bp complete FBgn0000349 copia X02599 5143bp complete FBgn0043969 diver AC004377 6112bp complete FBgn0063439 diver2 nnnnnnnn 4917bp complete FBgn0062343 Dm88 nnnnnnnn 4558bp complete FBgn0014947 flea Z27119 5034bp complete FBgn0061513 frogger AF492763 2483bp ?complete FBgn0015945 GATE AJ010298 8507bp complete FBgn0063436 gtwin nnnnnnnn 7411bp complete FBgn0001167 gypsy M12927 7469bp complete FBgn0063435 gypsy2 nnnnnnnn 6841bp complete FBgn0063434 gypsy3 nnnnnnnn 6973bp complete FBgn0063433 gypsy4 nnnnnnnn 7369bp complete FBgn0063432 gypsy5 nnnnnnnn 6852bp complete FBgn0063431 gypsy6 nnnnnnnn 7826bp complete FBgn0067384 gypsy7 AE003788 5486bp incomplete FBgn0067383 gypsy8 AE003788 4955bp incomplete FBgn0067382 gypsy9 AE002591 5349bp incomplete FBgn0067387 gypsy10 nnnnnnnn 6006bp incomplete FBgn0067386 gypsy11 nnnnnnnn 4428bp incomplete FBgn0067385 gypsy12 nnnnnnnn 10218bp incomplete FBgn0001207 HMS-Beagle AF365402 7062bp complete FBgnnnnnnnn HMS-Beagle2 nnnnnnnn 7220bp complete FBgn0026065 Idefix AJ009736 7411bp complete FBgn0063430 invader1 nnnnnnnn 4032bp complete FBgn0063429 invader2 nnnnnnnn 5124bp complete FBgn0063428 invader3 nnnnnnnn 5484bp complete FBgn0063427 invader4 nnnnnnnn 3105bp complete FBgn0063426 invader5 nnnnnnnn 4038bp complete FBgn0067380 invader6 NT_033778 4885bp incomplete FBgn0063919 Max-element AJ487856 8556bp complete FBgn0063917 McClintock AF541948 6450bp complete FBgn0002697 mdg1 X59545 7480bp complete FBgn0002698 mdg3 X95908 5519bp complete FBgn0002745 micropia X14037,X15066 5461bp complete FBgn0003007 opus AY180918 7521bp complete FBgn0063755 Osvaldo AY089271 1543bp incomplete FBgn0044355 Quasimodo AF364550 7387bp complete FBgn0000155 roo AY180917 9092bp complete FBgn0063394 rooA nnnnnnnn 7621bp complete FBgn0061485 rover AF492764 7318bp complete FBgn0003490 springer AF364549 7546bp complete FBgn0003519 Stalker AF420242 7256bp complete FBgn0063455 Stalker2 nnnnnnnn 7672bp complete FBgn0063454 Stalker3 nnnnnnnn 372bp LTR FBgn0063897 Stalker4 AF541949 7359bp complete FBgn0045970 Tabor AC007146 7345bp complete FBgn0004082 Tirant nnnnnnnn 8526bp complete FBgn0063450 Tom1 nnnnnnnn 410bp LTR FBgn0040267 Transpac AF222049 5249bp complete FBgn0023131 ZAM AJ000387 8435bp complete FBgn0004357 Dana\Tom Z24451 7060bp complete FBgn0013796 Dbuz\Osvaldo AJ133521 9045bp complete FBgn0005772 Dmir\TRAM Y08905 3452bp ?complete FBgn0004642 Dmir\TRIM X59239 3111bp ?complete FBgn0015168 Dsim\ninja D83207 6644bp complete FBgn0004146 Dvir\Ulysses X56645 10653bp complete FBgn0020675 Dvir\Tel AF009439 2485bp incomplete FBgn0013099 Dvir\Tv1 AF056940 6898bp complete non-LTR retrotransposons: FBgn0063440 baggins nnnnnnnn 5453bp complete FBgn0000224 BS nnnnnnnn 5142bp complete FBgn0067624 BS3 nnnnnnnn 1790bp ?complete FBgn0067623 BS4 nnnnnnnn 754bp incomplete FBgn0063594 Cr1a nnnnnnnn 4470bp complete FBgn0000481 Doc X17551 4725bp ?incomplete FBgn0063534 Doc2-element nnnnnnnn 4789bp complete FBgn0063533 Doc3-element nnnnnnnn 4740bp complete FBgn0069587 Doc4-element nnnnnnnn 2791bp incomplete FBgn0000652 F-element AC005198 4708bp complete FBgn0067421 Fw2 nnnnnnnn 3961bp ?complete FBgn0067420 Fw3 nnnnnnnn 3132bp ?complete FBgn0001100 G-element X06950 4346bp ?complete FBgn0063507 G2 nnnnnnnn 3102bp complete FBgn0063506 G3 nnnnnnnn 4605bp complete FBgn0063505 G4 nnnnnnnn 3856bp complete FBgn0063504 G5 nnnnnnnn 4856bp complete FBgn0069433 G5A nnnnnnnn 2841bp incomplete FBgn0063503 G6 nnnnnnnn 2042bp complete FBgn0067419 G7 AC003788 1192bp incomplete FBgn0020425 Helena nnnnnnnn 1318bp incomplete FBgn0004141 HeT-A U06920 6083bp complete FBgn0001249 I-element M14954 5371bp complete FBgn0043055 Ivk nnnnnnnn 5402bp complete FBgn0046110 Juan AY180919 4236bp complete FBgn0001283 jockey M22874 5020bp complete FBgn0063425 jockey2 nnnnnnnn 3428bp complete FBgn0046701 Penelope AF418572 804bp incomplete FBgn0015786 Porto1 nnnnnnnn 4682bp ?complete FBgn0063900 Q-element AE002612 759bp incomplete FBgn0003908 R1A1-element X51968 5356bp complete FBgn0067405 R1-2 nnnnnnnn 3216bp incomplete FBgn0003909 R2-element X51967 3607bp complete FBgn0041728 Rt1a AJ278684 5108bp complete FBgn0063467 Rt1c nnnnnnnn 5443bp complete FBgn0042682 Rt1b AF281636 5171bp complete FBgn0069343 TAHRE AJ542581 10463bp complete FBgn0004904 TART-A AY561850 13424bp complete FBgn0004904 TART-B U14101 10654bp complete FBgn0004904 TART-C AY600955 11124bp complete FBgn0042231 X-element AF237761 4740bp complete FBgn0013836 Dmer\R1A3 AF015277 3772bp incomplete FBgn0015678 Dmir\spock AY144571 4952bp ?complete FBgn0064494 Dmir\worf AY144572 4174bp ?complete FBgn0013854 Dnet\R1A AF248067 1757bp incomplete FBgn0013854 Dnet\R1B AF248068 2038bp incomplete FBgnnnnnnnn Dpse\mini-me AC131959 4622bp complete FBgn0005661 Dsil\Loa X60177 7779bp ?complete FBgn0023239 Dsub\bilbo U73803 5540bp complete FBgn0013903 Dtak\R1A2 U23198 1753bp incomplete FBgn0013017 Dtei\I-element M28878 5386bp complete FBgn0011601 Dvir\Helena U26847 691bp incomplete FBgn0015679 Dvir\Penelope U49102 4158bp ?complete FBgn0067468 Dvir\HeT-A AY369259 6610bp complete FBgn0066148 Dvir\TART AY219709 8500bp complete FBgn0067460 Dvir\Uvir AY369259 6564bp ?complete FBgn0024768 Dyak\HeT-A AF043258 5691bp complete FBgn0026443 Dyak\TART AF468026 8444bp incomplete SINE-like elements: FBgn0026416 INE-1 U66884 611bp ?incomplete FBgn0012361 Dhyd\Bungy U14600 227bp ?complete IR-elements: FBgn0005673 1360 nnnnnnnn 3409bp complete FBgn0005773 Bari1 X67681 1728bp complete FBgn0064134 Bari2 AF541951 1064bp complete FBgn0001181 HB X01748 1653bp ?incomplete FBgn0001210 hobo M69216 2959bp complete FBgn0014967 hopper X80025 1435bp incomplete FBgn0067381 hopper2 AF541950 1593bp incomplete FBgn0063402 looper1 nnnnnnnn 1881bp incomplete FBgn0063401 mariner2 nnnnnnnn 912bp complete FBgn0002949 NOF X15469;X51937 4347bp complete FBgn0003055 P-element X06779 2907bp complete FBgn0003122 pogo X59837 2121bp complete FBgn0004905 S-element U33463 1736bp ?incomplete FBgn0063466 S2 nnnnnnnn 1735bp complete FBgn0026410 Tc1 nnnnnnnn 1666bp complete FBgn0069340 Tc1-2 nnnnnnnn 1644bp complete FBgn0061191 Tc3 AC009537 1743bp complete FBgn0063372 transib1 nnnnnnnn 2167bp complete FBgn0063371 transib2 nnnnnnnn 2844bp complete FBgn0063370 transib3 nnnnnnnn 2883bp complete FBgn0063369 transib4 nnnnnnnn 2656bp complete FBgn0020218 Damb\P-element_T AF012414 3329bp ?complete FBgn0012207 Dbif\P-element_M X60990 2935bp complete FBgn0012207 Dbif\P-element_O X71634 2986bp complete FBgn0063576 Dbuz\BuT1 AF162798 769bp ?incomplete FBgn0063575 Dbuz\BuT2 AF368884 2775bp ?incomplete FBgn0063575 Dbuz\BuT3 AF368870 795bp ?incomplete FBgn0063573 Dbuz\BuT4 AF368868 1447bp ?incomplete FBgn0063572 Dbuz\BuT5 AF368868 669bp ?incomplete FBgn0069879 Dbuz\BuT6 AY187768 387bp ?incomplete FBgn0045754 Dbuz\INE-1 AF368900 1467bp ?incomplete FBgn0045754 Dbuz\ISBu2 AF368867 726bp ?incomplete FBgn0045754 Dbuz\ISBu3 AY313771 993bp ?incomplete FBgn0020486 Ddip\Bari1 Y13852 1676bp incomplete FBgn0044997 Dfun\Isfun-1 AJ309320 928bp incomplete FBgn0003948 Dhet\Uhu X63028 1658bp ?complete FBgn0010242 Dhyd\Minos Z29098 1773bp complete FBgn0014755 Dkoe\Gandalf U29466 979bp incomplete FBgn0002651 Dmau\mariner M14653 1286bp complete FBgn0026463 Dsub\GEM AJ131629 1730bp ?complete FBgn0015678 Dvir\Paris Z49253 1728bp complete MITE elements: FBgn0066141 Dwil\Mar AF518731 610bp ?complete FBgn0066140 Dwil\Vege AF518730 884bp ?complete FBgn0069871 Dsub\SGM AF043638 823bp ?complete Foldback elements: FBgn0000638 FB V00246 1106bp ?incomplete FBgn0027840 Dbuz\Galileo AY187769 2304bp ?incomplete FBgn0063570 Dbuz\Kepler AF368884 722bp ?incomplete FBgn0063569 Dbuz\Newton AF368890 1510bp ?incomplete Helitron elements: FBgn0067418 Helitron AE002840 564bp ?incomplete Class uncertain: FBgn0000513 Dvir\Dv X03936 845bp ?incomplete =============================================================================== ________________________________________________________________________ !transposon_sequence_set.embl.v.9.42 !February 3 2009 !See transposon_sequence_set.readme.v9.42 for description & comments. !Comments, corrections to http://flybase.org/cgi-bin/mailto-fbhelp.html ! ID DME9736 standard; DNA; INV; 7411 BP. XX AC AJ009736; XX DR FLYBASE; FBgn0026065; Idefix. XX FT source AJ009736:1..7411 FT SO_feature five_prime_LTR ; SO:0000425:1..600 FT SO_feature three_prime_LTR ; SO:0000426:6841..7411 FT SO_feature CDS ; SO:0000316:<988..2031 FT /db_xref="FLYBASE:FBgn0027381; Idefix\gag" FT /db_xref="SPTREMBL:O96739" FT /protein_id="CAA08806.1" FT /translation="ARKLKDIMAVPQLSETHLNQLLNQIKELNYYDGAPGKLSGFVNQV FT EQLLSLYPTQEARQAHVIYGAVKRLLVDSALEVVTQERANTWLDMKKALAMAFKDHRPY FT VTLIRQLEDISYPGSICKFIEKLETQYWIMFDKLELESDHVDKSNYTEMLNKTVKSVID FT RKLPDRIYMSLARKDIDTIYKLKQASMELGLYDAIPENHRSNRTEMNKRRNRGNYNQNN FT NQKYYNNRNHNYSNYYPSMNQNHNTQPPQNPTQPMTNQNQYSPRFIPNNQRGNYYAFRR FT DLTQAQQNNPLNNTLNFQPSTSNNINRQGPVKRQRESQSDQSRMDVNFHQAASDTQMIE FT KDIQVPM" FT SO_feature CDS ; SO:0000316:<1950..5402 FT /db_xref="FLYBASE:FBgn0027380; Idefix\pol" FT /db_xref="SPTREMBL:O96740" FT /protein_id="CAA08807.1" FT /translation="PKQDGCKFSSSCLGHSNDREGHTSPYVKIIHHNKNYKGMIDTGSS FT INIIRENFENLEEKEENLIVYTIKGPITLKRSIIIKPTSVCPSAQKFYIHKFSDNYDFL FT LGRKYLEDTKAKIDYANETVTLGSKVFKFLYEEKKGETASKCLDPQEKNDSALVDRTKP FT KMQKVKTAPKCLKPKHQQQKKETALPKCLISNVVKDTVDNDVTHLDPMSVDNDIVNFAI FT NNELRECNEYRLEHLNAEEVECLKKFLYEYRDIQYKEGENLTFTSTIKHVIQTQHEDPV FT YRKPYKYPQSVDQEVNKQIKEMIEQGIVRKSKSPYCSPIWVVPKKADASGKQKFRLVVD FT YRNLNEITVNDKFPIPRMDEILDKLGRCQYFTTIDLAKGFHQIQMDENSIAKTAFSTKH FT GHYEYTRMPFGLKNAPATFQRCMNNLLEDLIYKDCLVYLDDIIVYSTPLEEHILSLKKV FT FEKLRDANLKLQLDKCEFMKKETEFLGHIVTTNGIKPNPNKTKAITNFPLPKTPKQIKS FT FLGLCGFYRKFIPNFAKIVKPMTLKLKKGAIIDTKCKEYIESFEKLKVLITSDPILIYP FT DFSKPFSLTTDASNVAIGAVLSQNHKPVCYASRTLNEHEINYATIEKELLAIVWATKYF FT RSYLFGRPFEVLSDHKPLVWLNNIKEPNMKLQRWKIKLNEFDYKIKYLPGKENHVADAL FT SRTKIEVMVGEVANSADATIHSAIEDNLNYIPITERPINYFSRQIEIEKGDNDTTSVQH FT LFQKLKIKIVYKEMTPELAKNLIKEYVCTKKSAIYFPNDEDFLIFQRAFTEIISPNNFT FT KLLRCTTKLIDILTYAEFKDLILKKHKELLHPGIEKTINLFKEEYYYPDSQKLIQTIIN FT ECQICYLAKTEHQTQMTYETTPEIFNTREKYMIDFYLTGNQIFLSCIDIYSKFASLVEL FT KSRDWLEAKRAITKIFNDMGKPQEIKADKDSAFMCLALQNWLRSEGVQISISTSKNGIS FT DIERFHKTVNEKLRIIGSQQNVEDRCTKFERILYIYNHKTKHNSTKRFPADIFLYAGSP FT DFNVQQNKIDRIEYLNKNRHDFEVDIKYRQAPLVKSKITNPFKKTGRIGQVDDKHFEET FT NRGRKIVHYKSKFKKQKKFNKSKYDNSRPTKEAQSTQHTSNNA" FT SO_feature CDS ; SO:0000316:5248..6780 FT /db_xref="FLYBASE:FBgn0027382; Idefix\env" FT /db_xref="SPTREMBL:O96741" FT /protein_id="CAA08808.1" FT /translation="MINISKKQIVAGRSFTISQNLRNRKSLIRANMIIPDQPKKHKVHN FT ILLIMLSCILSLIITVKCNNIEVNPVNAKNGYLIFQTGTMEIPTSYEYHYLSINITKTM FT LMFEDIVSEANNYPNVPQIQYLVDKLKREINGLRIISRSKRGLLNVVGKAYKYLFGTLD FT EDDREELEEKINNMSEDSVKTHDLNTILDVINSGIDIINKLKVDKEQHQQIAVLIFNLE FT QFTEYIEDIELGLQLTRLGIFNPRLLKHDYLKHVNSEKMLKIKTSTWLKTDTNEILIIS FT HIPSEVTKVPIFQIVPYPDEHNYILTEQIFDKFYIFDNQVFHKDTNRDIFDKCIIGIIK FT QEQTQCKYIKTHKNYQINYIEPNILLTWNIPETAVNQDCTHNKILISGNNIIKIKNCTI FT QIDEFLISNNLADFTQTIYITNNVTRLEPINHLQTREMIETHVKHYNFFQIICITTFVI FT MIISLTLYVAYKFKNIPKKIIVNIVSKKNTRTLKIMSMKIFNKEIILPYTQI" XX CC Derived from AJ009736 (e1371475) (Rel. 58, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 1-Feb-1999. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7411 BP; 3047 A; 1363 C; 1109 G; 1892 T; 0 other; GTGACATATC CATAAGTCCC TAAGACTTAA GCATATGCCT ACATACTAAT ACACTTACAA 60 CACATACACC CCAATACAAC ATACACTACT CCGGATGTAC CCAACAGATA CCAGATAAGA 120 ATAAGATTGT TATATGATCC TCGAGAATGG AAAAAACCCC AATTCTAGAT AAGTCACCCA 180 CTGGTAGACT AAACATCCGT CCCCTAATTT AAACAATTCC TTGCTTAAGC CTCACCCCAT 240 CGTCACATTC CCACGTTCAA AGCTCGGAGC CGCAATCCCG AAAAACAAAA GTATCGATTT 300 CAATAAACAA ATTATAAGAA TCTAAGAGCA CTTGTATCCA AGAGCAAATG CACTTGAATC 360 CAAGAGAAAC GCAAAGCTTT TTCTCTTTAC GATCAGAATC CTAAAGTCTA AAGTCCATAT 420 TAGAAAAGCT CGATACCGAG GCTTGAACGT CAACCAAATC AGAATAATTA TCAGAGTTCA 480 GTTTGAGACC TAATTGTAAA AGGTTCGGTG TTCTTCTCAA ATAAAAAGAT TGTAATCATT 540 TAGTGAAATA AAAATTATAT TTTTTTCACT TATAAATATT GCAAGTATTT AATTGGCGCA 600 GTCGGTTAGG ATCCAATAAA ATAAAAGAGT CCTTTTAGTA CGGTACTGAT CAACTGAAGG 660 ATATGCTATA CGACTAGCTA TCCAAGATCA GCGAATTAAA ATAGTGATTC AAAAATATTT 720 TTTAATCCGC AAAAGAATCT ACGTGAAAGT AGTATTCAAA ATAAAATCCC GTGCGGTCGG 780 AAACAAAAAT TAATTTAAAT TTTTTAATTC CGAAACTTAA AACCAAGTTT AAAGAAAACT 840 TAAAATCAAG AAAACTTAAA ACCAAGTTTA AAGAAAACTT AAAATCAAGA AAACTTAAAA 900 CCAAGTTTAA AGAAAACTTA AAATCAAGAA AACTTAAAAC CAAGTTTAAA GAAAACTCAA 960 AATCAAGAAA ACTTAAAGCC AAAATAAGCT AGAAAACTAA AAGACATCAT GGCAGTCCCA 1020 CAACTCTCAG AAACACACCT AAACCAACTG CTAAACCAAA TCAAAGAATT AAACTACTAC 1080 GATGGCGCAC CTGGCAAATT ATCTGGATTC GTCAACCAAG TGGAACAACT GCTCAGTTTA 1140 TACCCAACAC AGGAAGCAAG ACAGGCACAC GTCATATATG GAGCAGTGAA GCGGTTATTA 1200 GTGGATTCAG CCTTAGAAGT CGTAACCCAG GAAAGAGCTA ACACATGGCT GGACATGAAG 1260 AAAGCACTGG CAATGGCATT CAAAGACCAT AGACCTTATG TAACTCTCAT CAGACAATTA 1320 GAAGACATAT CATACCCAGG AAGTATCTGT AAGTTTATAG AAAAATTAGA AACACAATAC 1380 TGGATTATGT TCGATAAGTT AGAATTAGAA AGTGACCATG TTGATAAATC GAATTATACC 1440 GAAATGTTAA ACAAAACTGT TAAATCAGTA ATAGATCGAA AACTGCCGGA TAGAATTTAT 1500 ATGTCTTTGG CACGTAAAGA TATTGATACA ATTTATAAAT TAAAACAAGC ATCAATGGAA 1560 TTAGGCCTTT ATGATGCTAT TCCAGAAAAT CACCGTTCTA ATAGAACAGA AATGAATAAA 1620 CGTAGGAACA GGGGAAACTA TAATCAAAAT AATAATCAAA AATATTACAA TAATAGAAAT 1680 CACAACTACA GTAATTATTA TCCTAGCATG AATCAGAATC ATAATACACA ACCACCTCAG 1740 AATCCGACTC AACCTATGAC AAATCAAAAC CAATATTCAC CGCGTTTCAT ACCGAATAAT 1800 CAAAGAGGGA ATTATTATGC ATTTAGACGA GACTTAACAC AAGCTCAGCA GAACAACCCA 1860 CTTAATAACA CCCTTAACTT CCAACCTTCG ACATCGAATA ATATTAACAG ACAAGGGCCA 1920 GTAAAAAGAC AACGCGAGAG TCAGAGTGAC CAAAGCAGGA TGGATGTAAA TTTTCATCAA 1980 GCTGCCTCGG ACACTCAAAT GATAGAGAAG GACATACAAG TCCCTATGTA AAAATAATTC 2040 ATCATAATAA AAATTATAAG GGAATGATCG ATACAGGATC ATCAATTAAC ATCATAAGAG 2100 AAAATTTTGA GAACTTAGAA GAAAAGGAAG AAAACCTAAT AGTATACACT ATTAAAGGAC 2160 CAATAACACT AAAGAGAAGT ATAATAATAA AACCTACTTC AGTATGTCCG TCTGCTCAAA 2220 AATTCTACAT TCACAAATTT TCTGATAACT ATGATTTCTT GTTAGGTCGA AAGTATTTAG 2280 AAGATACAAA AGCTAAAATA GATTATGCTA ACGAAACAGT AACACTAGGC TCAAAAGTAT 2340 TTAAGTTTCT CTATGAAGAA AAGAAGGGCG AGACCGCATC CAAATGCCTT GACCCACAAG 2400 AAAAGAATGA TTCCGCTCTA GTGGACAGAA CCAAACCAAA AATGCAAAAG GTTAAGACCG 2460 CACCTAAGTG CCTTAAACCA AAGCATCAAC AGCAGAAGAA AGAGACCGCA TTACCCAAAT 2520 GCCTCATTTC AAATGTTGTT AAAGACACAG TGGACAATGA TGTAACACAT CTCGATCCCA 2580 TGTCCGTTGA CAACGATATA GTCAACTTCG CGATTAACAA TGAGTTACGC GAATGTAACG 2640 AGTATAGACT CGAACACTTA AATGCAGAGG AAGTTGAATG TTTAAAGAAG TTCCTATACG 2700 AATATAGAGA CATTCAGTAC AAAGAGGGCG AAAATTTGAC CTTCACCAGT ACTATTAAAC 2760 ATGTCATCCA GACTCAACAC GAAGACCCAG TATACCGTAA ACCCTACAAG TACCCTCAAA 2820 GCGTTGACCA AGAAGTTAAC AAACAAATTA AAGAAATGAT AGAACAAGGG ATTGTTCGCA 2880 AATCGAAGTC CCCTTATTGT TCTCCTATTT GGGTGGTCCC CAAGAAGGCA GACGCCTCTG 2940 GGAAACAAAA ATTCAGGTTG GTAGTCGATT ACAGGAACCT AAATGAGATA ACTGTTAACG 3000 ACAAATTTCC CATTCCCCGA ATGGATGAGA TATTGGACAA ACTAGGTAGA TGCCAATACT 3060 TTACCACTAT AGATCTAGCC AAGGGTTTTC ACCAAATCCA AATGGATGAA AATTCTATTG 3120 CAAAAACAGC TTTTTCAACT AAGCATGGGC ATTATGAATA TACTCGTATG CCCTTTGGTT 3180 TAAAAAACGC TCCAGCTACT TTTCAGAGAT GCATGAATAA TCTTCTGGAA GATTTAATCT 3240 ACAAAGACTG TTTAGTCTAT TTAGACGATA TTATTGTTTA TTCCACTCCA TTGGAAGAAC 3300 ACATTTTATC CCTAAAGAAA GTCTTTGAAA AACTGAGAGA CGCTAATTTA AAGTTGCAAC 3360 TAGATAAATG TGAATTCATG AAGAAAGAAA CTGAATTCCT AGGACACATC GTCACAACAA 3420 ATGGCATCAA ACCAAATCCA AATAAAACTA AAGCAATTAC AAATTTTCCA TTACCCAAGA 3480 CACCTAAGCA AATAAAATCA TTTTTGGGAT TATGTGGATT CTATCGCAAG TTTATTCCTA 3540 ACTTTGCCAA AATAGTTAAA CCCATGACCC TCAAATTAAA GAAAGGTGCT ATAATAGACA 3600 CCAAATGTAA AGAATACATC GAATCATTTG AAAAATTAAA AGTTTTGATA ACTTCAGACC 3660 CGATATTAAT CTATCCTGAT TTTTCAAAAC CTTTTTCTTT GACAACTGAT GCTAGCAACG 3720 TAGCTATTGG TGCAGTGTTA TCACAAAATC ACAAGCCAGT TTGTTATGCC AGTAGAACGC 3780 TAAACGAACA TGAAATCAAC TATGCTACGA TTGAAAAAGA ATTGTTAGCT ATAGTTTGGG 3840 CTACAAAATA TTTCAGGTCA TACTTATTCG GCAGACCATT TGAAGTATTA AGTGATCACA 3900 AGCCACTGGT ATGGCTCAAC AACATTAAAG AACCAAACAT GAAATTGCAA AGATGGAAAA 3960 TAAAACTTAA TGAATTCGAT TATAAAATCA AATATCTTCC AGGCAAAGAA AACCATGTCG 4020 CGGATGCTCT TTCCCGCACG AAAATAGAAG TTATGGTTGG CGAGGTCGCA AATAGCGCAG 4080 ACGCAACTAT ACACAGTGCC ATTGAAGATA ATCTAAATTA CATACCCATA ACAGAAAGAC 4140 CAATAAATTA CTTCTCTAGA CAAATAGAGA TAGAAAAAGG CGATAACGAT ACAACAAGTG 4200 TACAACATTT GTTTCAAAAA TTAAAGATTA AGATAGTCTA TAAAGAAATG ACACCTGAAC 4260 TCGCCAAAAA CCTCATTAAG GAATATGTGT GCACCAAAAA GAGTGCAATT TATTTCCCTA 4320 ATGACGAAGA TTTTCTGATC TTCCAGAGAG CGTTTACCGA AATTATAAGC CCTAACAATT 4380 TCACAAAACT CTTGAGATGT ACCACAAAGT TAATTGATAT ACTAACGTAT GCAGAATTCA 4440 AAGATTTAAT CTTAAAGAAA CATAAGGAAC TTTTACATCC GGGTATAGAA AAAACAATCA 4500 ATTTATTTAA AGAAGAATAT TACTATCCTG ATAGTCAAAA GCTTATTCAA ACCATTATCA 4560 ATGAATGTCA AATTTGTTAT CTAGCAAAAA CGGAACATCA AACACAAATG ACATATGAGA 4620 CTACACCAGA AATATTTAAC ACAAGAGAAA AATACATGAT AGATTTTTAT CTCACAGGAA 4680 ACCAGATCTT CTTATCTTGC ATTGATATCT ATTCGAAATT TGCATCACTA GTTGAATTAA 4740 AAAGTAGAGA TTGGCTAGAA GCAAAAAGAG CCATTACTAA AATATTCAAT GACATGGGAA 4800 AACCGCAAGA AATTAAAGCA GACAAAGACT CAGCTTTTAT GTGTTTAGCC TTACAAAATT 4860 GGTTAAGATC TGAAGGTGTA CAAATTTCTA TAAGCACTAG CAAAAATGGT ATATCTGATA 4920 TAGAAAGATT CCACAAGACC GTAAACGAAA AGCTAAGAAT CATTGGTAGC CAACAAAATG 4980 TTGAAGATAG GTGCACAAAA TTCGAAAGAA TTCTATACAT ATACAATCAC AAAACTAAAC 5040 ATAATAGTAC TAAAAGATTT CCAGCAGACA TTTTCCTATA TGCAGGCAGT CCAGATTTTA 5100 ATGTACAACA AAACAAAATC GATAGGATAG AATACCTCAA TAAGAATAGA CACGATTTTG 5160 AAGTTGATAT AAAATATAGA CAAGCCCCAC TTGTAAAAAG TAAAATAACC AATCCATTTA 5220 AAAAGACAGG AAGAATTGGA CAAGTAGATG ATAAACATTT CGAAGAACAA AATCGTGGCA 5280 GGAAGATCGT TCACTATAAG TCAAAATTTA AGAAACAGAA AAAGTTTAAT AAGAGCAAAT 5340 ATGATAATTC CAGACCAACC AAAGAAGCAC AAAGTACACA ACATACTTCT AATAATGCTT 5400 AGTTGCATAC TATCACTTAT CATCACGGTC AAGTGCAACA ATATAGAAGT AAATCCAGTA 5460 AACGCGAAAA ATGGATACCT TATATTCCAA ACAGGAACAA TGGAAATTCC AACCAGCTAT 5520 GAATACCATT ATTTAAGCAT AAACATAACA AAGACAATGC TCATGTTCGA AGATATAGTA 5580 AGTGAAGCAA ACAACTATCC TAATGTACCA CAAATACAAT ATTTAGTCGA CAAATTAAAA 5640 CGAGAAATAA ATGGGTTAAG AATTATTAGT CGAAGTAAAA GAGGTCTTTT AAACGTAGTA 5700 GGAAAAGCAT ACAAATACTT ATTCGGCACA TTAGATGAGG ATGACAGAGA AGAGTTAGAA 5760 GAAAAAATAA ACAACATGTC AGAAGACTCT GTAAAAACCC ATGACCTAAA CACGATTCTA 5820 GATGTAATCA ATAGTGGTAT AGATATAATT AATAAGCTCA AAGTAGATAA AGAACAACAC 5880 CAACAAATTG CGGTACTAAT ATTTAACCTA GAGCAATTTA CAGAATATAT AGAAGACATA 5940 GAATTGGGTC TGCAATTAAC CAGACTAGGA ATTTTCAATC CAAGATTACT AAAGCATGAC 6000 TATTTAAAAC ATGTAAATTC AGAAAAAATG CTAAAGATAA AAACGTCAAC CTGGCTTAAA 6060 ACAGACACGA ACGAAATTTT GATTATTTCC CATATTCCTA GCGAAGTTAC TAAAGTTCCA 6120 ATATTCCAAA TTGTTCCGTA CCCAGATGAA CATAATTATA TTCTAACCGA GCAAATATTC 6180 GATAAATTCT ACATATTTGA TAACCAAGTA TTCCATAAAG ATACCAATAG GGATATATTC 6240 GACAAATGTA TTATTGGAAT CATCAAACAA GAGCAAACTC AATGCAAATA TATTAAAACA 6300 CATAAAAATT ACCAAATAAA TTATATAGAA CCAAATATAC TATTAACATG GAATATTCCT 6360 GAAACAGCTG TTAACCAAGA CTGTACACAC AATAAAATAT TAATTTCAGG AAACAACATC 6420 ATTAAAATTA AAAATTGTAC CATACAAATA GATGAATTCT TAATCTCTAA TAATCTAGCA 6480 GACTTTACAC AAACAATTTA TATCACCAAC AATGTAACAC GTCTAGAACC AATAAATCAC 6540 TTACAAACGA GAGAAATGAT AGAAACCCAT GTAAAACACT ATAACTTTTT TCAAATTATA 6600 TGCATTACAA CGTTCGTCAT AATGATAATT AGTTTGACTC TGTATGTAGC ATATAAGTTT 6660 AAAAATATAC CTAAGAAAAT TATTGTCAAT ATCGTAAGCA AAAAGAACAC ACGCACCTTG 6720 AAAATAATGT CAATGAAAAT ATTCAACAAG GAAATAATAT TACCTTATAC CCAAATTTAA 6780 CGACCTGAGG ACAGGCCAAA TTCAAAGGTT GGGGGAGTGA CATATCCATA AGTCCCTAAG 6840 ACTTAAGCAT ATGCCTACAT ACTAATACAC TTACAACACA TACACCCCAA TACAACATAC 6900 ACTACTCCGG ATGTACCCAA CAGATACCAG ATAAGAATAA GATTGTTATA TGATCCTCGA 6960 GAATGGAAAA AACCCCAATT CTAGATAAGT CACCCACTGG TAGACTAAAC ATCCGTTCCC 7020 CTAATTTAAA CAATTCCTTG CTTAAGCCTC ACCCCATCGT CACATTCCCA CGTTCAAAGC 7080 TCGGAGCCGC AATCCCGAAA AACAAAAGTA TCGATTTCAA TAAACAAATT ATAAGAATCT 7140 AAGAGCACTT GTATCCAAGA GCAAATGCAC TTGAATCCAA GAGAAACGCA AAGCTTTTTC 7200 TCTTTACGAT CAGAATCCTA AAGTCTAAAG TCCATATTAG AAAAGCTCGA TACCGAGGCT 7260 TGAACGTCAA CCAAATCAGA ATAATTATCA GAGTTCAGTT TGAGACCTAA TTGTAAAAGG 7320 TTCGGTGTTC TTCTCAAATA AAAAGATTGT AATCATTTAG TGAAATAAAA ATTATATTTT 7380 TTTCACTTAT AAATATTGCA AGTATTTAAT T 7411 // ID DMIS176 standard; DNA; INV; 7439 BP. XX AC X01472; J01060; J01061; XX DR FLYBASE; FBgn0000004; 17.6. XX FT source X01472:1..7439 FT SO_feature five_prime_LTR ; SO:0000425:1..512 FT SO_feature three_prime_LTR ; SO:0000426:6928..7439 FT SO_feature TATA_box ; SO:0000174:372..377 FT SO_feature TATA_box ; SO:0000174:7271..7277 FT SO_feature primer_binding_site ; SO:0005850:511..529 FT SO_feature polyA_signal_sequence ; SO:0000551:372..377 FT SO_feature polyA_signal_sequence ; SO:0000551:7299.7304 FT SO_feature RR_tract ; SO:0000435:6917..6927 FT SO_feature CDS ; SO:0000316:1074..2393 FT /db_xref="FLYBASE:FBgn0044339; 17.6\gag" FT /db_xref="SWISS-PROT:P04282" FT /protein_id="CAA25701.1" FT /translation="MAQEPAIVPPLSDSNMTQVAYQIGNVEKFNGDPGSLYTFVSRIDY FT ILALYATGDERQQQIIFGHIERSISGEVMRCIGAYDMYTWQQLRRQLVLNYKPQTPNHV FT LLEEFRKTPFRGNVRAFLEEAESRRQTLTSKLELEQDLEEKTFYLKLIKSSIESLIEKL FT PTHIYLRINNHNIPDLRSLINLLQEKGMYEQINHTSTHVQKQNFSDKPQKSFNQNTNQS FT NNIRKYPTPFLHYNSPIPYQAPQIYQTPPTNNPLYRHPIPYHPNPNNVFQPSQQNNVFQ FT PSQQNNAFQPNQRTNFTSRPIFNTNRNNAFDQNRFGQQPQYQNQQSTQNSSSYVPNRPI FT KRLRPANSGQTGMSVDETLYQEDAFYQQCVPYDYFYYPTYDHSDYYPENQYQIDENNQN FT LQRTQQLQQINTDETNNDNQEPNVEQAENFQPQALENPNI" FT SO_feature CDS ; SO:0000316:2345..5518 FT /db_xref="FLYBASE:FBgn0014453; 17.6\pol" FT /db_xref="SWISS-PROT:P04323" FT /protein_id="CAA25702.1" FT /translation="TGRKFSATSLGKPQYITIKYKENNLKCLIDTGSTVNMTSKNIFDL FT PIQNTSTFIHTSNGPLIVNKSIIIPSKILFPTTNEFLLHPFSENYDLLLGRKLLAEAKA FT TISYRDQEVTLYNNKYKLIEGIATHEQSHFQNVNMIPDTMLRQPNKISPILESDLYRLE FT HLNNEEKQRLCALLQKYHDIQYHEGDKLTFTNQTKHTINTKHNLPLYSKYSYPQAYEQE FT VESQIQDMLNQGIIRTSNSPYNSPIWVVPKKQDASGKQKFRIVIDYRKLNEITVGDRHP FT IPNMDEILGKLGRCNYFTTIDLAKGFHQIEMDPESVSKTAFSTKHGHYEYLRMPFGLKN FT APATFQRCMNDILRPLLNKHCLVYLDDIIVFSTSLDEHLQSLGLVFEKLAKANLKLQLD FT KCEFLKQETTFLGHVLTPDGIKPNPEKIEAIQKYPIPTKPKEIKAFLGLTGYYRKFIPN FT FADIAKPMTKCLKKNMKIDTTNPEYDSAFKKLKYLISEDPILKVPDFTKKFTLTTDASD FT VALGAVLSQDGHPLSYISRTLNEHEINYSTIEKELLAIVWATKTFRHYLLGRHFEISSD FT HQPLSWLYRMKDPNSKLTRWRVKLSEFDFDIKYIKGKENCVADALSRIKLEETYLSEQT FT QHSAEEDNSDLIFITERPLNTFNRQVIFSKGPPDIKVTKYFKKHITQIFYDIMTREKAE FT QYLIDHFCGKKSALYIESDADFEVIQAAHKLAINTKYTKILRSTILLKNITTYAEFKEL FT ILTAHEKLLHPGIQKTTKLFGETYYFPNSQLLIQNIINECSICNLAKTEHRNTDMPTKT FT TPKPEHCREKFMIDIYSSEGKHYVSCIDIYSKFATLEEIKTKDWIECKNALMRIFNQLG FT KPKLLKADRDGAFSSLALKRWLESEEVELQLNTTKTGVADIERLHKTINEKIRIIKTSD FT DEETKLSKMETVLNIYNHKTKHDTTGQTPAHIFLYAGQPILDTQQNKENKINKINNDRV FT EYEVDTRYRKGPLQKGKLENPFKPTKNVEQTDSDHYKITNRNRITHYYKTQFKKRKKNN FT QLSISQAPGT" FT SO_feature CDS ; SO:0000316:5488..6903 FT /db_xref="FLYBASE:FBgn0027624; 17.6\env" FT /db_xref="SWISS-PROT:P04283" FT /protein_id="CAA25703.1" FT /translation="SALNFTGTWHLITLLLMLITTVHGQQIEINNIDTNHGYLLFSDKP FT VQIPSSFEHHCLRINLTEIDTIADYFEQRLRTDYHAPQVKFLYNKMRRELAGIALRHRN FT KRGLINIVGSVFKYLFGTLDENDRVDIQRKLETNAHNSVNLHELNDAIQLINDGMQKIQ FT NYENNSNIINSLLYELMQFTEYIEDVEMGMQLSRLGLFNPKLLNYDKLENVNSQNILNI FT KTSTWINYNDNQLLIISHIPINFSLINTVKIIPYPDSNGYQLEYTDTQSYFERENKVYN FT NENKEINNECVTNIIKHLKPICNFESIHTDEIIKYIEPNTIVTWNLTQTSLKQNCQNSF FT NNIKIKGNKMIKVTQCKIEINSIILSENLFKPEIDLTPLYTPLNITKIKTVKHNDINEM FT ISQNNITLYIFMTTVIIILILLYLYLRYVSFNPFMMLYAKLKLRKNQNQNTAQQIEMED FT VPLPLLYPSIPAQV" XX CC Derived from X01472 (g8142) (Rel. 36, Last updated, Version 2). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7439 BP; 2985 A; 1512 C; 1048 G; 1894 T; 0 other; AGTGACATAT TCACATACAA AACCACATAA CATAGAGTAA ACATATTGAA AAGCCGCATA 60 CGTAAACAAT AAGTGACCAC CATGCTAATG TGGATCAAAT AACAAAAATA TCCACTCTGC 120 ATTTTGACAC CCCCATACTG TATGCCATCT GCGCAGTATG CATTCTAATA AACAAATTCT 180 TTGACAGCGG CACTTAGCCA TTCTTGTAAA CAAATCTTAA AGTCTGCCTG CTCTCTCTGA 240 GGCTTCTCCT CCACTTAAGA ATCCAAGAGC AATGCTCTCC CAAAAACACT AACATATTCT 300 TTAAGCAAGC ACAGAGGCTT CTCCTCATTT TCACTTTCAT TTGATTTTCA GTCTTAAGCT 360 GAACGTTAAT CAATAAACAA CACAATCGAT ACCGAAATTT TGATTCGTTT TATTTTGGCA 420 AAACTCAATT TTCAGCGTTG GTCTTAGTTC ATATTCGGAA CGGTCCATTT AATAGACTCA 480 AAACTATTTA TTGCAACCAT TTATTTGCAA TTGGCGCAGT CGATGTGATC AGTGTTAAAG 540 TTCCTTGATG CGGTAACCAG ATTTGCCAAT TCCTGTGTTC TTTTTGTTCT CTGACAAAAG 600 TACCACGATA ACGGGCACCC ACGTGACGGT TAATATCGCT TTAAGTTTTT AATTAAACCT 660 CGACAATAAA GTGAAACCGA AAAATCACAA TTTGCCTAAA CAAACCTGAA TTTATTATCA 720 GGAAGACGCT ATTGAATTTG TGAGAGGCTG TAAATCCAAT TGGTTACCTC AAAGACCCAC 780 GAAAAAGCTA TAGTGCAACC CTTGCGAAAA TCAAAACCTA TCTTAAAAAA AAAAAAAAAA 840 TATAAATAAT AAATTAATAA GCGAAAATTA AAACGTATTA AAAGTAAGAA TAATAAATAA 900 ATAAGTGAAA ATTCTATATG ATAAAAATTA AAAATAAGAA TAATAAATAA AAAGACAACA 960 TTTTAAATTA AACAATATTA AAAAAATATA AAAATATTAA AAACTATATT AAAAAAAAAA 1020 AAAAAACAAA AAAACAAAAA AAAAAAAATA AATAAATAAT CCAAAAATCA AAAATGGCTC 1080 AAGAACCAGC AATTGTGCCA CCACTATCAG ACAGCAACAT GACCCAGGTT GCCTACCAGA 1140 TTGGCAATGT GGAGAAATTC AACGGTGATC CAGGCTCACT ATACACCTTT GTGAGTCGAA 1200 TTGATTACAT ACTGGCTCTT TATGCTACCG GAGATGAACG CCAACAGCAG ATCATATTTG 1260 GGCATATTGA ACGCAGCATC AGCGGAGAAG TTATGCGCTG CATTGGAGCC TATGACATGT 1320 ACACCTGGCA GCAGCTTAGA AGACAATTGG TACTCAACTA TAAACCCCAG ACCCCTAACC 1380 ACGTTCTTTT AGAAGAGTTT CGAAAGACCC CATTTCGAGG CAATGTACGA GCATTCCTGG 1440 AAGAAGCAGA AAGCCGCAGA CAAACACTTA CTAGTAAGCT TGAATTAGAG CAAGATCTTG 1500 AAGAAAAGAC TTTTTATTTG AAATTAATAA AATCCAGTAT AGAATCACTA ATTGAAAAAT 1560 TACCTACACA CATTTATTTA AGAATAAATA ACCACAACAT ACCAGATTTG CGATCACTTA 1620 TAAACCTTTT ACAAGAGAAG GGCATGTACG AACAAATAAA TCATACAAGT ACACATGTCC 1680 AAAAACAAAA TTTCTCTGAT AAGCCACAAA AGTCCTTTAA TCAAAATACT AATCAGTCTA 1740 ACAATATCAG AAAATATCCA ACACCTTTCC TACATTATAA TTCACCAATA CCATATCAAG 1800 CTCCACAAAT TTATCAAACA CCACCAACTA ATAACCCACT TTATCGTCAT CCAATACCCT 1860 ACCACCCTAA TCCAAACAAT GTTTTTCAAC CAAGCCAACA AAACAATGTT TTCCAACCAA 1920 GCCAACAAAA CAATGCTTTT CAACCAAATC AACGAACAAA CTTTACATCT CGACCAATTT 1980 TTAACACCAA TCGAAACAAT GCATTCGATC AGAATAGGTT CGGACAACAA CCCCAATATC 2040 AAAATCAACA ATCAACACAA AATTCAAGTT CCTATGTACC CAATCGACCA ATAAAACGAT 2100 TAAGACCAGC TAATAGTGGA CAGACTGGGA TGAGTGTTGA CGAAACATTA TATCAAGAGG 2160 ACGCTTTTTA TCAGCAGTGT GTTCCATATG ACTATTTTTA TTATCCAACT TACGACCATT 2220 CAGACTATTA TCCAGAAAAT CAATATCAAA TTGACGAAAA CAACCAAAAT TTACAAAGAA 2280 CACAACAGTT ACAGCAGATT AATACAGACG AGACAAACAA TGACAACCAA GAACCCAATG 2340 TTGAACAGGC CGAAAATTTT CAGCCACAAG CCTTGGAAAA CCCCAATATA TAACAATTAA 2400 ATACAAAGAA AATAATTTGA AATGCCTTAT TGATACCGGA TCAACAGTTA ACATGACATC 2460 TAAAAATATA TTTGATTTAC CAATCCAGAA TACTAGTACT TTTATTCATA CCAGCAATGG 2520 ACCGCTCATT GTCAACAAAA GTATAATCAT ACCTTCAAAG ATTTTGTTCC CAACAACAAA 2580 TGAATTTTTA TTGCACCCTT TCTCTGAGAA TTACGATCTT TTATTAGGAA GAAAACTTTT 2640 AGCAGAAGCA AAAGCAACAA TAAGTTACCG CGATCAAGAG GTAACTCTTT ACAACAACAA 2700 ATACAAATTA ATAGAAGGAA TAGCAACACA TGAACAGAGT CATTTTCAAA ATGTAAATAT 2760 GATACCTGAC ACCATGCTCA GACAGCCAAA TAAAATTTCA CCCATTTTAG AATCAGACCT 2820 ATACAGATTG GAACATTTAA ATAACGAAGA AAAACAAAGA TTGTGCGCAC TCCTGCAGAA 2880 ATACCATGAC ATACAGTACC ATGAAGGTGA TAAGTTGACA TTTACTAATC AAACCAAACA 2940 TACTATCAAT ACAAAGCACA ATCTACCACT TTACTCTAAA TACAGTTACC CACAGGCTTA 3000 TGAACAGGAG GTCGAAAGCC AAATACAAGA TATGCTAAAT CAAGGTATTA TACGTACCAG 3060 TAATTCACCT TACAATAGCC CCATCTGGGT GGTTCCAAAG AAACAAGATG CATCAGGCAA 3120 ACAGAAATTT AGAATTGTAA TAGACTACCG AAAATTAAAT GAAATAACAG TAGGAGACAG 3180 ACACCCAATC CCAAACATGG ACGAAATCTT GGGAAAATTG GGCAGATGTA ATTACTTCAC 3240 AACTATAGAC TTGGCAAAGG GTTTCCACCA GATCGAAATG GATCCAGAAT CAGTTTCAAA 3300 GACAGCCTTT TCTACCAAGC ACGGTCATTA TGAATATTTG CGCATGCCAT TCGGATTAAA 3360 AAACGCGCCA GCCACCTTTC AACGGTGCAT GAATGATATT TTAAGACCAC TCTTAAACAA 3420 ACACTGTCTT GTGTATTTGG ACGACATAAT TGTATTCTCG ACATCCCTTG ATGAACACCT 3480 GCAATCGCTC GGACTAGTTT TCGAAAAATT AGCAAAAGCC AACCTTAAAT TACAACTTGA 3540 CAAATGTGAG TTTCTCAAGC AAGAAACCAC ATTTTTAGGA CATGTTCTAA CACCAGATGG 3600 AATAAAACCA AACCCTGAAA AAATTGAAGC CATTCAAAAA TATCCAATTC CCACTAAACC 3660 AAAAGAAATA AAAGCTTTTC TTGGACTGAC AGGATATTAT CGTAAATTTA TTCCAAACTT 3720 TGCAGACATA GCCAAACCCA TGACTAAGTG TTTAAAAAAG AACATGAAAA TTGACACTAC 3780 CAACCCAGAA TATGACTCTG CATTTAAAAA ATTAAAATAT CTAATATCAG AAGACCCAAT 3840 TCTTAAAGTA CCCGACTTTA CAAAGAAATT CACTTTAACC ACAGACGCAA GTGATGTCGC 3900 TTTGGGGGCA GTACTGTCAC AAGATGGACA CCCACTTAGC TACATTAGCC GAACACTTAA 3960 TGAACACGAA ATAAATTACA GCACAATTGA AAAAGAACTC TTAGCAATTG TATGGGCGAC 4020 AAAGACTTTT CGACACTACC TACTTGGAAG ACACTTTGAA ATATCCAGTG ACCATCAACC 4080 ATTGAGCTGG TTGTACCGTA TGAAAGACCC AAATTCAAAA CTGACCCGAT GGAGAGTAAA 4140 ATTATCCGAA TTCGATTTTG ATATAAAATA TATAAAAGGA AAAGAAAATT GCGTGGCGGA 4200 TGCTCTGTCC AGAATAAAAC TTGAGGAGAC ATATTTGAGC GAACAAACCC AACATAGTGC 4260 AGAAGAGGAC AATAGTGATT TAATTTTTAT TACAGAAAGA CCTCTAAATA CATTTAACAG 4320 ACAAGTTATA TTTTCAAAAG GACCACCAGA CATTAAAGTT ACGAAATATT TCAAAAAACA 4380 CATCACCCAA ATATTTTACG ACATTATGAC CAGGGAAAAA GCCGAACAAT ATTTGATAGA 4440 CCATTTTTGT GGTAAGAAAA GTGCGTTGTA TATTGAGAGT GACGCTGATT TCGAAGTCAT 4500 TCAAGCCGCA CATAAATTAG CCATAAACAC CAAATATACA AAAATCCTGC GTAGCACGAT 4560 TTTGTTAAAA AACATAACCA CTTATGCGGA ATTTAAGGAA TTGATCTTGA CTGCTCATGA 4620 AAAACTTCTA CACCCAGGCA TACAGAAAAC TACTAAACTT TTCGGAGAAA CTTACTATTT 4680 CCCTAATAGC CAGCTACTTA TTCAGAATAT AATAAATGAG TGCAGTATTT GCAATCTGGC 4740 AAAAACAGAG CACCGAAATA CAGACATGCC AACGAAAACC ACACCCAAAC CAGAACATTG 4800 CCGCGAAAAA TTCATGATAG ACATTTACTC ATCCGAAGGC AAACATTACG TTAGTTGCAT 4860 AGACATTTAT TCGAAATTTG CCACATTAGA AGAAATAAAA ACAAAAGACT GGATAGAATG 4920 CAAAAACGCG CTTATGCGCA TATTCAACCA GCTTGGCAAG CCAAAGTTAC TAAAGGCGGA 4980 CAGAGACGGC GCATTTTCCA GTTTAGCCCT CAAGAGATGG CTGGAGAGTG AGGAAGTCGA 5040 ATTGCAGCTT AACACAACAA AAACTGGTGT GGCGGACATA GAAAGACTAC ATAAAACAAT 5100 TAATGAAAAG ATTCGCATAA TCAAAACATC CGATGACGAA GAAACCAAAT TGAGCAAAAT 5160 GGAAACAGTA CTTAACATAT ACAATCATAA AACCAAACAC GACACCACTG GACAGACCCC 5220 TGCACACATA TTTCTCTACG CTGGACAACC AATATTAGAT ACCCAACAAA ACAAAGAAAA 5280 CAAAATAAAC AAAATAAATA ATGACAGAGT GGAGTACGAA GTCGACACAA GATACAGAAA 5340 AGGTCCACTA CAGAAAGGCA AATTAGAAAA TCCTTTTAAG CCAACAAAAA ATGTGGAGCA 5400 GACTGACTCT GATCATTATA AAATTACTAA TAGAAATAGA ATTACTCACT ACTACAAAAC 5460 ACAATTCAAA AAACGAAAGA AAAATAATCA GCTCTCAATT TCACAGGCAC CTGGCACTTG 5520 ATAACATTGC TGCTGATGCT GATCACAACA GTTCATGGAC AACAAATTGA AATTAATAAT 5580 ATTGACACAA ACCACGGATA TCTCCTTTTT TCTGATAAAC CAGTCCAGAT ACCATCATCC 5640 TTTGAACATC ATTGCTTGAG AATCAATTTA ACTGAAATAG ACACCATAGC TGATTATTTT 5700 GAGCAAAGAC TACGTACCGA CTACCATGCA CCCCAGGTCA AATTTTTATA CAACAAAATG 5760 AGAAGAGAAC TAGCTGGAAT AGCCTTGCGA CATAGAAATA AACGGGGACT TATTAACATT 5820 GTAGGTTCAG TTTTTAAATA CCTATTTGGC ACACTTGACG AAAATGATCG AGTGGATATA 5880 CAGAGGAAAC TTGAAACAAA CGCCCATAAC TCGGTAAATT TACATGAACT CAATGACGCT 5940 ATTCAATTAA TAAATGACGG AATGCAAAAG ATACAGAATT ATGAAAACAA CAGCAACATC 6000 ATTAACAGTC TTTTATATGA ACTCATGCAG TTTACAGAAT ACATAGAAGA TGTGGAAATG 6060 GGAATGCAGC TTTCCAGACT CGGTCTATTT AATCCCAAAC TACTAAACTA CGATAAACTT 6120 GAGAATGTAA ACAGCCAAAA TATTTTAAAC ATTAAAACAT CCACTTGGAT TAATTACAAT 6180 GATAACCAAT TATTAATCAT ATCTCACATA CCTATTAACT TTTCATTAAT AAATACAGTA 6240 AAAATAATCC CTTACCCAGA CTCGAACGGC TATCAGCTAG AATACACAGA CACACAATCA 6300 TATTTTGAAA GAGAAAATAA AGTTTACAAT AACGAAAATA AAGAAATAAA CAATGAGTGT 6360 GTCACCAACA TTATTAAACA TTTAAAACCA ATTTGTAATT TTGAGTCAAT CCACACAGAT 6420 GAAATAATAA AATACATAGA ACCAAACACA ATTGTAACCT GGAATTTAAC CCAAACAAGT 6480 CTCAAACAAA ATTGTCAAAA TTCATTTAAT AATATAAAAA TAAAAGGAAA CAAAATGATA 6540 AAAGTAACCC AATGTAAAAT AGAAATCAAT AGCATAATTC TAAGTGAAAA TCTCTTTAAA 6600 CCAGAAATAG ATTTGACACC ATTATACACA CCACTTAACA TAACAAAAAT AAAAACTGTT 6660 AAACACAACG ACATTAATGA AATGATTTCA CAAAACAATA TTACACTTTA CATATTTATG 6720 ACTACTGTCA TCATTATACT TATTTTATTG TACTTATATT TAAGATACGT ATCATTTAAC 6780 CCATTCATGA TGCTGTATGC AAAACTAAAA TTAAGAAAAA ATCAAAATCA AAACACAGCA 6840 CAACAAATAG AAATGGAAGA CGTTCCATTA CCCCTACTAT ATCCATCAAT CCCAGCCCAA 6900 GTATAGGCTT CTCTTTAAGG GAAGGGAAGT GACATATTCA CATACAAAAC CACATAACGT 6960 AGAGTAAACA TATTGAAAAG CCGCATACGT CAACAATAAG TGACCACCAT GCTAATGTGG 7020 ATCAAATAAC AAAAATATCC ACTCTGCATT TTGACACCCC CATACTGTAT GCCATCTGCG 7080 CAGTATGCAT TCTAATAAAC AAATTCTTTG ACAGCGGCAC TTAGCCATTC TTGTAAACAA 7140 ATCTTAAAGT CTGCCTGCTC TCTCTGAGGC TTCTCCTCCA CTTAAGAATC CAAGAGCAAT 7200 GCTCTCCCAA AAACACTAAC ATATTCTTTA AGCAAGCACA GAGGCTTCTC CTCATTTTCA 7260 CTTTCATTTG ATTTTCAGTC TTAAGCTGAA CGTTAATCAA TAAACAACAC AATCGATACC 7320 GAAATTTTGA TTCGTTTTAT TTTGGCAAAA CTCAATTTTC AGCGTTGGTC TTAGTTCATA 7380 TTCGGAACGG TCCATTTAAT AGACTCAAAA CTATTTATTG CAACCATTTA TTTGCAATT 7439 // ID DMTN1731 standard; DNA; INV; 4648 BP. XX AC X07656; XX DR FLYBASE; FBgn0000007; 1731. XX FT source X07656:1..4648 FT SO_feature five_prime_LTR ; SO:0000425:1..336 FT SO_feature three_prime_LTR ; SO:0000426:4313..4648 FT SO_feature TATA_box ; SO:0000174:110..116 FT SO_feature primer_binding_site ; SO:0005850:342..352 FT SO_feature CDS ; SO:0000316:431..1252 FT /db_xref="FLYBASE:FBgn0020768; 1731\gag" FT /db_xref="REMTREMBL:CAA30502" FT /protein_id="CAA30502.1" FT /translation="MSNLYQIDKLEDGSYETWSIQMRSVLVHACLWKVVSGESVKPEVD FT TGGAWQSQDEKALATIILSVKSSQLGYVKGCLTAAEAWKVLQDVHQPKGPLRTVMLYKK FT LLSKRLLEGQSISSHIKEFKEIFDALDAVEIGITEKLRSVVLLSSLPESFENFVVAIET FT RDDVPLFDALCIKLIEEDTRRGGAEQQREKQTESAKAFTAVHKPQAPAREARPSAKKRK FT DVVCYNCGERRHFKANCRREKVNKESATQEQCSLLNALDSGGFWQNTVVSR" FT SO_feature CDS ; SO:0000316:1203..4151 FT /db_xref="FLYBASE:FBgn0012032; 1731\RTase" FT /db_xref="REMTREMBL:CAA30503" FT /protein_id="CAA30503.1" FT /translation="MRWIVVVFGKTQWCLDSGATSHMCCDRSVFTEFEEHTEKISLAGN FT GFLLAKGIGTVKLKTDLCTLVLNNVLFVPDLNGNFMSVSRAAQYKCFVNFGPHYADVIQ FT EGERILRVMRAGNLYMFQGKHNSCFAAVDADGSLWHKRNGHLNTSSLQEMVRKKMVYGV FT EKVVFKPDAVCKTCMLAKIHVQPFPKTTRSRAEELLDMIHSDLCGPFSTPSLAGSKYFL FT TFIDDKSRRIFVYFLRKKDEVFTKFVEFKKLVERQTGRKIKCIRSDNGGEFVNNVFDDY FT LKAHGIARQLTIPHTPQQNGVAERANRTLVEMARCMLLQSELGEALWAEAINTAVYLRN FT RSTSRALQSKTPMEEWTGKIPAVSHLRVFGAIAVALDKGVHKGKFESKGKEYRMIGYSI FT AAKGYRLFDKEKRCVIEKQDVLFDESGSLVNHGNTIEFQFPATDDPEPQSDSNAREGDD FT TEPVGSSDDYESAAEAEEAEVHVGPGRPKIVRTGRPGRPKKQYNVLGVLMASDVEIPKS FT YEEAINSQYSAKWEEAMGLEYKALLANETWKLADLPRNRRCVACKWVYSLKRDVSGRIE FT RFKARLVAKGCSQKFGVDYFETFSPVCRLESVRLILALAAEMQLYLHHMDVCTAYLNSE FT LKDTVYMKQPQGFTDAANPDQVLLLRKAIYGLKQSGREWNSKLDGVLKDLGFKACNHEP FT CLYQQSGQGNLMLILVYVDDLILACQSREDMEDLKAKISESFECTDKGPLHLFLGMEVQ FT RDGDLGEITLGHSQYIKELLRDYGSENCRPATTPLDAGHQVLCAGEQCQKVDAGQYQST FT IGELMWLGLTTRPDMLHSVAKLAQRNQDPHSEHMVAVKHILRYLASTVDVKLHYQKCGQ FT AFTGFVDADWGGDRLDRKSYTGYVFFLSGGPVSWRSEKQQSVALSSTEAEYMALTTACK FT EAIALRRLIVEIVCGDLKTPTVMHGDNLKCAAQLAKNPVHHSRTKHIDIRYH" XX CC Derived from X07656 (g8700) (Rel. 36, Last updated, Version 6). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4648 BP; 1316 A; 880 C; 1268 G; 1184 T; 0 other; TGTTGAATAT AGGCAATGCC CACATGTGTG TTGAATATAG GCAATTTCCA CATGTGCATA 60 TGTAATTTTG TATGAGAACA TACATACATA CACATGAACT GTATGTATGT ATATATATTA 120 GCAAATAAGC AGCCGCATGA AGGTGGCATT TTTATGTGTA TCAGTTTCAG TTTCAAATAA 180 AACTTCTTCG TGTTCGGACA CGCGGCTCAA GACTTTTTAT TTCGCGTTTA CTCTTTCAGC 240 CTTTGCTCTC AATTCGCTGA GTTTGGGTGA AGATTAGGAT CTTCCCATTA TGATTGTCAG 300 TGTTCCACAC TTGGAGCACC TTTTCAATAA ACAACAGGTT AATGGGCCCA GCGCCCTAGG 360 AGCTGCCTAA AGGAGAAACG TGTAGTGAAA CTCAGGAGTT AGATTTTGGA GTCTACTCAA 420 GATTGCCGGA ATGAGTAACC TGTATCAGAT CGATAAGCTG GAGGATGGAT CCTATGAAAC 480 GTGGAGCATC CAGATGCGTT CAGTGTTGGT GCACGCATGT TTGTGGAAGG TGGTTTCAGG 540 AGAGTCCGTG AAACCTGAGG TTGATACTGG AGGTGCTTGG CAATCCCAAG ATGAAAAAGC 600 ATTGGCCACG ATCATCTTGA GTGTGAAGTC TTCGCAACTT GGTTATGTAA AAGGGTGTCT 660 CACTGCGGCT GAGGCATGGA AAGTTTTACA GGATGTCCAC CAGCCGAAAG GGCCGTTACG 720 AACGGTCATG CTGTATAAGA AGTTGCTGAG CAAACGTCTG TTGGAAGGGC AGAGTATATC 780 GTCACATATT AAAGAATTTA AGGAAATCTT TGATGCCCTT GATGCGGTGG AAATTGGTAT 840 CACCGAGAAA TTGCGCAGTG TTGTTTTGCT GTCGAGCCTT CCAGAGAGTT TCGAGAATTT 900 CGTTGTCGCC ATTGAGACGC GCGACGACGT GCCGCTTTTC GATGCTCTAT GTATAAAGCT 960 GATCGAGGAA GACACGAGAA GGGGAGGAGC GGAGCAGCAG AGAGAAAAAC AAACGGAGAG 1020 CGCAAAGGCA TTTACTGCAG TACATAAGCC ACAGGCGCCG GCGAGAGAAG CTCGGCCGAG 1080 CGCAAAGAAG AGGAAAGACG TAGTTTGTTA TAACTGTGGA GAGCGTAGGC ATTTTAAAGC 1140 GAACTGTCGT CGCGAGAAAG TAAACAAAGA GAGCGCGACA CAAGAACAAT GCAGTTTGTT 1200 AAATGCGCTG GATAGTGGTG GTTTTTGGCA AAACACAGTG GTGTCTCGAT AGCGGGGCTA 1260 CCAGTCACAT GTGCTGTGAC AGAAGTGTTT TTACTGAGTT TGAAGAGCAC ACTGAAAAAA 1320 TTAGTCTTGC TGGAAATGGA TTCCTACTAG CAAAGGGCAT AGGAACAGTG AAGCTGAAGA 1380 CTGATTTATG TACTCTGGTA TTGAATAACG TACTCTTCGT CCCAGATTTG AACGGCAACT 1440 TTATGTCAGT CAGCCGTGCA GCTCAGTATA AATGTTTTGT CAATTTTGGA CCACATTACG 1500 CTGACGTCAT TCAGGAAGGC GAGCGAATAC TGCGTGTAAT GAGAGCTGGT AATTTATATA 1560 TGTTTCAAGG GAAACATAAC AGTTGTTTTG CGGCCGTTGA TGCTGATGGT TCACTATGGC 1620 ATAAAAGGAA TGGCCATTTG AATACAAGCA GCCTACAGGA GATGGTGAGG AAGAAGATGG 1680 TGTACGGTGT TGAAAAGGTC GTTTTCAAAC CAGACGCAGT ATGCAAGACG TGCATGCTGG 1740 CAAAAATCCA TGTGCAACCA TTTCCGAAGA CAACGAGGAG CAGAGCTGAG GAGCTGTTGG 1800 ATATGATCCA TTCAGACCTG TGCGGGCCAT TTAGCACACC GTCACTTGCT GGATCAAAGT 1860 ACTTTCTCAC TTTCATAGAC GACAAGTCCA GGCGGATTTT TGTATATTTC TTGCGGAAGA 1920 AGGACGAAGT CTTCACTAAG TTTGTCGAGT TTAAGAAACT GGTCGAGCGA CAAACAGGTA 1980 GAAAGATAAA ATGTATCCGG AGCGATAATG GTGGTGAGTT CGTCAATAAT GTTTTTGATG 2040 ACTATTTAAA GGCACATGGG ATCGCTAGAC AGCTGACTAT TCCACACACT CCCCAACAAA 2100 ATGGAGTTGC AGAACGAGCC AACCGCACGC TAGTAGAAAT GGCTAGGTGC ATGTTGCTGC 2160 AATCGGAGTT GGGTGAGGCT CTATGGGCTG AGGCGATAAA CACTGCGGTG TATCTGAGGA 2220 ACCGATCAAC GAGCAGAGCA TTACAAAGCA AAACCCCTAT GGAAGAGTGG ACCGGAAAAA 2280 TACCAGCAGT GAGCCACTTG AGGGTTTTTG GTGCCATAGC AGTGGCATTG GACAAAGGAG 2340 TCCATAAAGG CAAATTCGAA TCCAAAGGAA AGGAATATCG TATGATTGGA TATTCAATAG 2400 CTGCTAAGGG GTACCGTCTG TTTGACAAAG AGAAGCGGTG TGTGATCGAG AAGCAAGATG 2460 TCCTTTTTGA TGAGTCTGGT AGTTTGGTAA ATCATGGAAA TACCATTGAG TTCCAGTTTC 2520 CCGCAACTGA TGACCCGGAG CCGCAGAGTG ATTCGAATGC ACGGGAAGGT GACGATACAG 2580 AACCCGTGGG CAGCAGCGAC GACTATGAGA GTGCAGCTGA GGCAGAAGAA GCTGAAGTAC 2640 ATGTGGGGCC TGGACGGCCA AAGATTGTTC GGACGGGCAG ACCAGGGCGC CCGAAGAAGC 2700 AATACAATGT ACTTGGCGTG TTGATGGCTA GCGACGTCGA AATTCCCAAG TCCTATGAGG 2760 AGGCCATCAA TTCGCAGTAT TCTGCAAAGT GGGAAGAGGC AATGGGCCTG GAGTACAAGG 2820 CGCTACTTGC AAATGAGACA TGGAAGCTGG CTGACTTACC AAGAAATCGC CGGTGTGTGG 2880 CTTGCAAGTG GGTGTATTCC CTGAAACGAG ACGTCTCTGG TAGAATTGAG CGCTTCAAGG 2940 CACGACTAGT AGCAAAGGGG TGTTCGCAGA AGTTCGGAGT GGACTACTTC GAGACTTTTT 3000 CACCCGTGTG CAGGCTCGAG AGTGTGAGGC TCATTTTGGC ATTGGCAGCA GAGATGCAAT 3060 TGTACTTGCA TCACATGGAC GTATGCACGG CGTACTTAAA TAGCGAGCTA AAGGATACTG 3120 TGTACATGAA GCAGCCCCAA GGGTTCACAG ATGCTGCTAA TCCCGACCAG GTGTTATTGC 3180 TGAGGAAGGC AATATACGGC TTGAAGCAGT CAGGCAGAGA GTGGAACTCC AAGCTCGACG 3240 GTGTTCTAAA AGACTTGGGA TTTAAGGCCT GTAATCATGA ACCATGTCTT TATCAGCAAA 3300 GTGGTCAAGG TAATCTGATG CTCATCTTAG TATATGTTGA TGATTTAATT CTAGCGTGCC 3360 AGTCAAGAGA AGATATGGAG GATCTGAAAG CCAAGATTTC AGAGTCTTTC GAGTGCACGG 3420 ACAAGGGTCC ACTGCATTTG TTCTTAGGCA TGGAGGTGCA ACGAGATGGC GACCTTGGAG 3480 AAATCACTTT GGGCCATTCG CAATATATCA AGGAACTATT GCGGGATTAT GGCAGCGAGA 3540 ACTGTAGACC AGCGACGACA CCTTTGGATG CAGGGCATCA AGTTTTGTGC GCGGGTGAGC 3600 AGTGCCAGAA GGTCGACGCA GGGCAGTATC AGTCTACAAT TGGTGAGCTA ATGTGGCTTG 3660 GGCTTACTAC CAGACCAGAC ATGCTACATT CGGTGGCGAA GTTGGCTCAG AGGAATCAGG 3720 ACCCGCATTC TGAGCACATG GTGGCTGTGA AGCACATCCT CCGGTACTTG GCGTCAACTG 3780 TGGACGTCAA GCTGCATTAT CAAAAGTGCG GTCAGGCATT TACCGGCTTT GTGGATGCAG 3840 ATTGGGGAGG CGACCGTTTG GACCGAAAGT CATACACAGG GTATGTGTTT TTCCTGTCTG 3900 GCGGACCAGT ATCATGGAGG TCCGAGAAGC AGCAGAGCGT GGCGTTGAGC AGTACTGAAG 3960 CCGAGTATAT GGCTCTGACC ACGGCTTGCA AGGAAGCTAT AGCTTTACGA AGGCTAATAG 4020 TGGAGATCGT ATGCGGTGAT CTGAAGACCC CGACGGTTAT GCATGGCGAC AACCTGAAGT 4080 GCGCAGCACA GTTAGCGAAG AACCCGGTTC ATCACTCTAG GACGAAGCAC ATCGACATTC 4140 GATATCATTA GAGAAGTCAT GAAAGAGGGT CACGTTGTGT TAGAGTACAC TTCTACGAAT 4200 GAGATGATAG CAGACATTAT GACAAAGAAT CTTTCAAAGG GAAAGCATAA TGGGTTTATG 4260 AAAATGTTAA ATTTGTTTTA ATTTTTGTAA ACATGTTGGC ATTGAGGAAG GCTGTTGAAT 4320 ATAGGCAATG CCCACATGTG TGTTGAATAT AGGCAATTTC CACATGTGCA TATGTAATTT 4380 TGTATGAGAA CATACATACA TACACATGAA CTATATGTAT GTATATATAT TAGTAAATAA 4440 GCAGCCGCAT GAAGCTGGCA TTTTTATGTG TATCAGTTTC AGTTTCAAAT AAAACTTCTT 4500 CGTGTTCGGA CGCTCGGCTC AAGACTTTTT ATTTCGCGTT TACTCATTCG GCCTTTGCTC 4560 TCAATGCGCT GAGTTTGGGT GAAGATTAGG ATCTTCCCAT TATGGTTGTC AGTGTTCCAC 4620 ACTGGGAGCA CCTTTTCAAC AAACCACA 4648 // ID DMIS297 standard; DNA; INV; 6995 BP. XX AC X03431; XX DR FLYBASE; FBgn0000005; 297. XX FT source X03431:1..6995 FT SO_feature five_prime_LTR ; SO:0000425:1..414 FT SO_feature three_prime_LTR ; SO:0000426:6582..6995 FT SO_feature TATA_box ; SO:0000174:276..282 FT SO_feature TATA_box ; SO:0000174:6857..6863 FT SO_feature polyA_signal_sequence ; SO:0000551:304..309 FT SO_feature polyA_signal_sequence ; SO:0000551:6885..6890 FT SO_feature primer_binding_site ; SO:0005850:414..431 FT SO_feature RR_tract ; SO:0000435:6571..6581 FT SO_feature CDS ; SO:0000316:803..2047 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044338; 297\gag" FT /db_xref="SWISS-PROT:P20828" FT /protein_id="CAA27159.1" FT /translation="MSQPIIALSDINLAEARRQLKDIMPFKGDPETLHTFISRVDYVIS FT LYQTNDVRQQRILLGAIERNLDGQITRSLGLPNVEDWPTLKARLIAEFKIQTPNYKLLE FT NFRETPYRGSLRAFCEEAERRRQLLISKLHLEGNQSDFLIYIQGIKESIKILIRKLPIQ FT LFTILAHHDITDLRSLITIAQNEGIYEEHINFEFYEKPEYRNKNSNSNQNSKTQKFNTN FT VQTQNRPSYSQYSQPFQPNFNQYIQPFRPSYTQQITNNPPMWHAPNYFRPNQYINPQPI FT IQKNHFQQYPNKAQFPQTTHFRGNTYPRLQQPSTYKNTNFPITKRLRPSDSEQTKMSID FT EIRFQDAHEFEQVQPNYYEQQYFNQNQYNPYQNHSFINEGQQQVQFVQINNKQNQNNSE FT LNENFRLTVPENTNT" FT SO_feature CDS ; SO:0000316:<1999..5178 FT /db_xref="FLYBASE:FBgn0027622; 297\pol" FT /db_xref="SWISS-PROT:P20825" FT /protein_id="CAB57796.1" FT /translation="TKRKFSVNSSGKYEYIKIVYKGRSYKCLLDTGSTINMINENIFCL FT PIQNSRCEVLTSNGPITLNDLIMLPRNSIFKKTEPFYVHRFSNNYDMLIGRKLLKNAQS FT VINYKNDTVTLFDQTYKLITSESERNQNLYIQRTPESIASSDQESIKKLDFSQFRLDHL FT NQEETFKLKGLLNKFRNLEYKEGEKLTFTNTIKHVLNTTHNSPIYSKQYPLAQTHEIEV FT ENQVQEMLNQGLIRESNSPYNSPTWVVPKKPDASGANKYRVVIDYRKLNEITIPDRYPI FT PNMDEILGKLGKCQYFTTIDLAKGFHQIEMDEESISKTAFSTKSGHYEYLRMPFGLRNA FT PATFQRCMNNILRPLLNKHCLVYLDDIIIFSTSLTEHLNSIQLVFTKLADANLKLQLDK FT CEFLKKEANFLGHIVTPDGIKPNPIKVKAIVSYPIPTKDKEIRAFLGLTGYYRKFIPNY FT ADIAKPMTSCLKKRTKIDTQKLEYIEAFEKLKALIIRDPILQLPDFEKKFVLTTDASNL FT ALGAVLSQNGHPISFISRTLNDHELNYSAIEKELLAIVWATKTFRHYLLGRQFLIASDH FT QPLRWLHNLKEPGAKLERWRVRLSEYQFKIDYIKGKENSVADALSRIKIEENHHSEATQ FT HSAEEDNSNLIHLTEKPINYFKKQIIFIKSDKNKVEHSKIFGNSITTIQYDVMTLEKAK FT QILLDHFIHRNITIYIESDVDFEIVQRAHIEIVNTTYTKVIRSLFLLKNVGSYAEFKEI FT ILQSHEKLLHPGIQKMTKLFKENHFFPNSQLLIQNIINECNICNLAKTEHRNTKMPLKI FT TPNPEHCREKFVVDIYSSEGKHYISCIDIYSKFATLEQIKTKDWIECRNALMRIFNQLG FT KPKLLKADRDGAFSSLALKRWLEEEEVELQLNTAKNGVADVERLHKTINEKIRIINSSD FT DEEVKLSKIETILYTYNQKIKHDTTGQRPAQIFLYAGHPILDTQKIKEKKIEKINEDRR FT EFNIDTNYRKGPLQKGKLENPFKPTKNVEQTDPDHYKITNRNRVTHYYKTQFKKQKKNN FT KLSISQAPGTR" FT SO_feature CDS ; SO:0000316:5145..6560 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0027623; 297\env" FT /db_xref="SWISS-PROT:P20829" FT /protein_id="CAB57797.1" FT /translation="TLNFTGTWYPITLLFILITAVHGQQIQINNIDTNHGYLLFSDKPV FT QIPSSFEHHSLKINLTEIDIVVDYFEQRLRTDYHAPQINFLYNKIKRELARITLKHRNK FT RGFINIVGSGFKYLFGTLDENDRVEIQKKLEINVHNSVKLHELNDAIRLINDGMQKIQN FT YENNHTIIDSLLFELMQFTEYIEDLEMAMQLSRLGLFNPKLLNYDKLENVNSQNILNIK FT TSTWINYNDNQVLIISHIPIYLSLISTIKIIPYPDSNGYQLDYTDTQSYFEKENKVYNT FT ENKEVKNECVTNIIKHLNPICNFKPVHTNEIIKYIEPNTIVTWNLTQTILNQNCQNSIN FT KIKIEGNKMIRVTQCKIEINNINFSETLLEPEIDLTPLYTPLNITKIKIVKHNDIIEMI FT SENNITLYIQMIIVIIALILLYSYLRYVSFKPFMMLYAKLKIRKNQNQNTPQQTEIEEI FT PFPTLYPSIPAQV" XX CC Derived from X03431 (g8146) (Rel. 36, Last updated, Version 2). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6995 BP; 2811 A; 1356 C; 972 G; 1856 T; 0 other; AGTGACGTAT TTGGGTGGTC CAAACCAGCC ACTTCCATTA TTTCAAAGAA ATCAGTAATG 60 CACTCTAGTA ATTTTCCATA ACTGTATCCC AGCTGCGCAG ACTCGTTTAT CTTTTGCAGC 120 GCAGCGTTCT TTGTAAACAT CCTAAAGACC TGCCTAAGCA GATTTGACTG CCCTCTTTCA 180 ACGCTACCTA ATCTTAAGAA CCCAAGAGCG AGGCTCTCCC GAAATACAAA TATTGTTCAA 240 ATACTGAGGC TTCTCCTCAA TCCAATTTGC ATTTGATTTT TAGTCTTAAG CTGAGATCCA 300 AAGAATAAAG TCGTGAAACT ATTTCTCCTA AAAACTATTT TTTATTTCTT GGCGTTGTCC 360 TTAGTCAACT GACGGGACAT TAGTTCGACT CATAAATAAA ACAACAATTT TACTGGCGCA 420 GTCGGTAGGA TACAAAAGTA TCCGAAAAAA AAGAACCTTC GAATGGAAAA TAAGTTAAAT 480 TTTATAGTCC TGTGCTCGAA ACATCTCCCA AAATAAATTC GTGAAAACTC TTCAACTTCA 540 ATTATAATTC CAATTCGGTT ATCCAATAAT AAGTGGAAGT GAAATACGAA ACAAAAATAT 600 TAAGTCCAAA GGCAACTAAG TTTTAAAACC AACATATAAA AATAAAAAAT TAAAACAATA 660 TAGAATTTTA ATAATACAAC ACAAAAATTT ACAAAACAAA AAAACAAACA AGTGAAACTA 720 GAAAGCTTAA AAATAATAAT AACATTGAAT CCGAAACAAA ACAAAAAAAT AAAACACAAA 780 AGTTAAAAAT TTTACAATAA AAATGTCACA ACCAATTATT GCGCTGAGCG ACATAAACCT 840 TGCCGAAGCC CGTCGGCAGC TTAAAGACAT TATGCCATTC AAGGGTGATC CAGAAACCCT 900 TCACACCTTT ATCAGCAGAG TGGATTACGT AATTTCGCTC TACCAAACAA ATGATGTCCG 960 ACAACAGAGG ATTCTACTGG GAGCCATCGA AAGGAACTTG GACGGACAAA TTACACGATC 1020 TTTGGGACTT CCGAACGTCG AAGATTGGCC TACCCTTAAA GCAAGACTCA TCGCGGAATT 1080 TAAAATTCAA ACACCAAACT ACAAACTTCT GGAGAACTTC AGGGAGACAC CATACAGAGG 1140 AAGCCTAAGA GCATTCTGCG AAGAAGCGGA GAGACGACGT CAATTACTAA TTTCGAAACT 1200 ACACCTGGAA GGTAACCAAT CGGATTTTCT TATTTATATT CAGGGTATTA AAGAATCTAT 1260 TAAGATACTG ATAAGGAAAC TACCAATACA ATTATTCACT ATTTTAGCCC ATCACGATAT 1320 TACAGACTTA AGATCCTTAA TTACCATTGC ACAAAATGAG GGAATTTATG AAGAACACAT 1380 TAATTTTGAA TTTTATGAAA AACCAGAATA TCGTAATAAA AATTCAAATT CTAACCAGAA 1440 TTCGAAAACA CAAAAATTCA ATACAAATGT TCAAACTCAA AATCGACCAA GTTACTCACA 1500 ATATTCCCAA CCCTTCCAAC CTAATTTTAA TCAATACATT CAACCATTTA GACCTAGCTA 1560 TACACAGCAG ATAACTAACA ACCCACCCAT GTGGCACGCA CCTAATTATT TCAGACCCAA 1620 CCAATACATA AACCCACAAC CCATTATTCA AAAAAATCAT TTCCAACAAT ATCCCAACAA 1680 AGCCCAATTT CCCCAAACAA CGCATTTTAG AGGAAATACA TACCCTCGAC TACAACAACC 1740 CTCTACATAT AAAAATACTA ACTTCCCGAT TACTAAACGA CTAAGACCAT CGGACAGTGA 1800 ACAAACTAAA ATGTCTATTG ACGAAATTAG ATTCCAAGAC GCGCATGAAT TCGAACAAGT 1860 CCAACCTAAT TATTACGAGC AACAGTATTT TAACCAAAAT CAATACAATC CGTATCAAAA 1920 TCATAGCTTC ATTAATGAAG GGCAACAACA AGTTCAATTT GTACAAATTA ATAACAAACA 1980 AAACCAAAAT AATTCTGAAC TAAACGAAAA TTTTCGGTTA ACAGTTCCGG AAAATACGAA 2040 TACATAAAAA TAGTATACAA AGGGCGTTCA TACAAATGCC TTCTAGACAC AGGATCAACA 2100 ATTAATATGA TCAATGAAAA TATATTTTGT CTTCCCATTC AAAATAGTAG ATGTGAAGTT 2160 TTAACATCAA ATGGCCCTAT TACCTTGAAC GACTTGATTA TGTTACCCAG AAATAGTATT 2220 TTCAAAAAAA CCGAACCATT TTATGTGCAC AGATTTTCTA ATAATTACGA TATGCTAATT 2280 GGCAGAAAAT TGTTGAAAAA TGCTCAATCA GTTATTAATT ACAAAAATGA TACAGTTACC 2340 CTTTTTGATC AAACATACAA ATTAATTACT TCAGAATCCG AAAGAAACCA AAATTTGTAT 2400 ATCCAAAGGA CACCAGAATC AATTGCAAGC TCAGATCAGG AATCAATAAA AAAATTAGAT 2460 TTTTCACAGT TTCGATTAGA TCACCTAAAT CAGGAGGAAA CTTTTAAGTT AAAAGGCTTG 2520 TTAAATAAAT TTAGAAATCT TGAATATAAG GAGGGAGAGA AATTAACATT TACAAATACA 2580 ATTAAACACG TACTAAATAC AACACATAAC TCCCCAATTT ATTCGAAACA ATACCCACTT 2640 GCGCAAACAC ACGAAATCGA AGTAGAAAAC CAAGTACAGG AAATGCTGAA TCAGGGATTA 2700 ATTAGGGAAA GTAATTCTCC ATACAATAGT CCTACTTGGG TCGTACCAAA GAAACCGGAT 2760 GCTTCTGGTG CAAATAAGTA CAGGGTAGTA ATTGATTATA GAAAGCTAAA TGAAATAACC 2820 ATACCTGACA GATATCCAAT TCCAAATATG GACGAAATTC TTGGCAAACT GGGTAAATGC 2880 CAATATTTTA CAACGATCGA TCTGGCAAAG GGATTTCATC AAATAGAAAT GGACGAAGAA 2940 TCAATTTCTA AAACTGCATT CTCCACAAAA AGCGGTCATT ACGAATACCT TCGAATGCCA 3000 TTTGGCCTTA GGAATGCACC CGCTACTTTT CAAAGGTGCA TGAATAATAT CCTTCGACCG 3060 TTGCTTAACA AACACTGTTT GGTGTATCTG GATGATATTA TAATTTTTTC AACATCCCTT 3120 ACAGAACATT TAAATTCAAT ACAATTAGTT TTTACAAAGC TTGCAGATGC AAATTTAAAA 3180 TTGCAACTAG ACAAATGTGA GTTCTTAAAA AAGGAAGCTA ACTTTCTTGG TCACATAGTT 3240 ACCCCTGATG GTATTAAACC AAATCCTATT AAAGTTAAAG CCATAGTTTC ATACCCAATT 3300 CCGACAAAAG ATAAAGAGAT AAGAGCTTTC CTTGGATTAA CAGGTTATTA TCGCAAATTT 3360 ATTCCAAATT ACGCAGACAT AGCAAAACCC ATGACCAGCT GCTTAAAAAA AAGGACAAAG 3420 ATAGATACAC AAAAACTTGA GTACATAGAG GCATTCGAAA AACTTAAGGC TTTGATAATT 3480 CGTGACCCAA TTTTACAATT ACCTGATTTT GAAAAGAAAT TTGTTTTAAC CACAGATGCA 3540 AGTAACTTGG CCCTCGGGGC TGTCCTTTCT CAAAACGGTC ATCCTATATC TTTTATTAGT 3600 AGAACACTTA ACGATCACGA ATTAAATTAC AGTGCTATCG AAAAAGAATT ACTTGCCATA 3660 GTTTGGGCCA CAAAAACTTT TCGACATTAT TTACTAGGAC GACAATTTCT CATTGCCAGT 3720 GACCATCAAC CTCTTAGATG GCTTCATAAC TTAAAGGAAC CAGGTGCTAA GTTAGAAAGA 3780 TGGAGAGTTA GATTAAGCGA ATACCAATTT AAAATAGATT ATATTAAAGG GAAAGAAAAT 3840 TCAGTTGCCG ATGCATTATC AAGAATTAAA ATTGAAGAAA ATCATCATAG TGAAGCTACT 3900 CAACATAGTG CAGAAGAGGA CAATAGCAAC CTTATTCATT TAACAGAAAA ACCAATAAAT 3960 TATTTCAAAA AACAAATAAT CTTTATTAAA TCCGATAAAA ATAAAGTAGA GCATTCAAAA 4020 ATATTCGGTA ACTCCATTAC CACAATTCAA TATGACGTAA TGACACTTGA AAAGGCCAAA 4080 CAAATTTTAC TCGATCACTT TATCCATAGA AACATTACCA TTTATATTGA GAGCGATGTA 4140 GATTTTGAAA TCGTTCAAAG AGCACACATA GAAATTGTTA ATACCACCTA CACAAAAGTA 4200 ATTCGCAGTC TTTTCCTATT AAAGAACGTT GGTTCATACG CCGAATTCAA AGAAATCATA 4260 CTTCAATCAC ATGAAAAACT TTTACACCCT GGTATACAGA AAATGACAAA ATTATTTAAA 4320 GAAAATCACT TCTTTCCAAA TAGCCAACTA TTAATTCAGA ATATAATAAA CGAATGCAAC 4380 ATATGCAATT TGGCCAAAAC AGAACATAGA AACACCAAAA TGCCTTTAAA AATCACACCC 4440 AACCCGGAAC ATTGCCGAGA AAAATTTGTA GTAGATATTT ATTCATCTGA GGGAAAACAT 4500 TACATCAGTT GCATTGATAT TTATTCTAAA TTCGCTACAC TTGAGCAAAT TAAAACTAAG 4560 GATTGGATAG AATGCAGAAA CGCATTAATG CGCATTTTTA ATCAACTAGG AAAACCCAAA 4620 TTATTAAAGG CAGACAGAGA CGGAGCTTTC TCCAGTTTAG CTTTAAAGCG ATGGCTTGAA 4680 GAAGAAGAAG TCGAATTACA GCTCAATACA GCAAAAAACG GAGTAGCAGA CGTCGAAAGA 4740 TTACACAAAA CAATAAATGA AAAAATTCGT ATAATCAATT CATCTGATGA TGAAGAAGTA 4800 AAATTAAGCA AGATAGAAAC AATCCTCTAC ACATACAACC AAAAAATTAA ACATGACACT 4860 ACTGGACAGA GACCTGCTCA AATTTTCTTA TACGCTGGGC ATCCCATATT AGACACTCAA 4920 AAAATTAAAG AGAAGAAAAT AGAGAAAATA AATGAAGACA GACGGGAATT TAATATTGAC 4980 ACTAATTACA GAAAAGGTCC ACTACAGAAA GGCAAATTAG AAAACCCATT TAAACCAACC 5040 AAAAATGTAG AACAGACAGA CCCTGACCAT TACAAAATCA CTAATAGAAA TAGAGTTACG 5100 CACTACTACA AAACACAATT CAAAAAACAA AAGAAAAATA ATAAACTCTC AATTTCACAG 5160 GCACCTGGTA CCCGATAACA CTATTGTTTA TACTGATCAC AGCTGTTCAT GGACAACAAA 5220 TTCAAATTAA TAATATTGAC ACCAACCACG GATATCTCCT TTTTTCTGAT AAGCCAGTAC 5280 AGATACCATC CTCCTTTGAA CATCACTCCT TAAAAATCAA TTTAACTGAA ATAGACATCG 5340 TGGTTGACTA TTTTGAGCAA AGACTACGAA CCGATTACCA TGCACCCCAG ATCAATTTTT 5400 TATACAATAA AATAAAAAGA GAACTAGCCA GAATAACCCT GAAACATAGA AACAAACGGG 5460 GTTTTATTAA CATTGTGGGT TCAGGTTTTA AATACCTATT TGGAACACTA GATGAAAATG 5520 ATCGAGTCGA AATACAGAAA AAACTTGAAA TCAACGTCCA TAACTCAGTA AAATTACATG 5580 AACTCAACGA CGCCATACGA TTGATAAATG ACGGAATGCA AAAAATACAG AATTATGAAA 5640 ATAACCACAC CATCATTGAC AGTCTTTTGT TCGAACTAAT GCAGTTTACG GAATACATAG 5700 AAGATTTGGA AATGGCTATG CAGCTTTCCA GACTTGGACT GTTTAACCCC AAATTACTAA 5760 ACTACGACAA ACTTGAAAAT GTGAACAGCC AAAACATTTT GAACATTAAA ACATCCACTT 5820 GGATTAACTA CAATGATAAC CAAGTATTAA TCATATCCCA CATACCCATT TACCTTTCAC 5880 TAATAAGCAC AATTAAAATA ATTCCTTACC CAGACTCCAA CGGCTATCAG CTAGATTACA 5940 CAGACACACA ATCATATTTT GAAAAAGAAA ATAAAGTTTA TAATACCGAA AATAAAGAAG 6000 TAAAAAATGA ATGTGTCACC AATATTATTA AACACTTAAA TCCAATTTGT AATTTTAAGC 6060 CAGTACACAC GAACGAAATA ATAAAATACA TAGAACCAAA CACAATTGTA ACTTGGAACT 6120 TAACCCAAAC AATTCTTAAC CAAAATTGCC AAAATTCAAT TAATAAAATA AAAATAGAAG 6180 GAAACAAAAT GATAAGAGTA ACGCAATGCA AAATAGAAAT CAATAATATA AATTTTAGTG 6240 AAACTCTGTT AGAACCAGAA ATAGATTTGA CACCACTATA CACACCACTT AATATAACAA 6300 AAATAAAAAT TGTAAAACAC AACGACATTA TTGAGATGAT TTCAGAGAAC AATATTACAC 6360 TTTACATACA AATGATCATT GTAATAATCG CACTAATTTT GTTGTACTCA TATTTAAGAT 6420 ATGTATCATT TAAACCATTT ATGATGTTGT ATGCAAAACT TAAAATAAGA AAAAATCAAA 6480 ATCAAAACAC ACCACAACAA ACAGAAATAG AAGAAATTCC ATTTCCCACA CTATATCCAT 6540 CAATCCCAGC CCAAGTATAG GCTTCTCTTT AAGGGAAGGG GAGTGACGTA TTTGGGTGGT 6600 CCAAACCAGC CACTTCCATT ATTTCAAAGA AATCAGTAAT GCACTCTAGT AATTTTCCAT 6660 AACTGTATCC CAGCTGCGCA GACTCGTTTA TCTTTTGCAG CGCAGCGTTC TTTGTAAACA 6720 TCCTAAAGAC CTGCCTAAGC AGATTTGACT GCCCTCTTTC AACGCTACCT AATCTTAAGA 6780 ACCCAAGAGC GAGGCTCTCC CGAAATACAA ATATTGTTCA AATACTGAGG CTTCTCCTCA 6840 ATCCAATTTG CATTTGATTT TTAGTCTTAA GCTGAGATCC AAAGAATAAA GTCGTGAAAC 6900 TATTTCTCCT AAAAACTATT TTTTATTTCT TGGCGTTGTC CTTAGTCAAC TGACGGGACA 6960 TTAGTTCGAC TCATAAATAA AACAACAATT TTACT 6995 // ID DM23420 standard; DNA; INV; 6126 BP. XX AC U23420; XX DR FLYBASE; FBgn0005384; 3S18. XX SY synonym: BEL XX FT source U23420:1..6126 FT SO_feature five_prime_LTR ; SO:0000425:1..361 FT SO_feature three_prime_LTR ; SO:0000426:5766..6126 FT SO_feature CDS ; SO:0000316:919..5742 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044337; 3S18\ORF" FT /db_xref="REMTREMBL:AAB03640" FT /protein_id="AAB03640.1" FT /translation="MFIGSIASNSSLTDCQRFHYLKSYLAGDALALVKHIPVTNDNYRE FT AWERLEQRYNKQSLIIRSFLNSFMSLPSAINSNIGTVRKIADGADEVIRGLRALNCEER FT DPWLIFILLSKLDSDTRQAWAQCAESEEKGVTINRFLKFLTSRCDTLEAFELTRSTQAR FT RAATTHHADTHPRREEPKCTSCQQNHQLFKCPQFIALDIASRRDFLKSRKLCFNCLSPA FT HMVGNCTSRHTCRICRRKHHTLVHGSSQPIQNGNNIDTASVDSRDRPAVSHAGSTIGHN FT QPLAREGHRLGSETPAENNFTHHTLENIPAAGSQTLLPTILADVIDAWGNTTTCRLLLD FT TGSTITLASESFVQRIGVRRTHARISILGLAANSAGVTRGRAHIKLRSRHSGQTVELVS FT FILTSLTSSLPAQVIDTSSSTWRQICELPLADPTFCTPGAIDVIVGSDQLWSLYTGDRK FT HFGNDFPIALNTVFGWILAGSYSAFDDHPTSAVTHHADLDTMVRSFMEMDSIQPNQALL FT DASDPTERHFAATHKRSTDGVYVVEYPFKEKAPPIDSTLPQAINRFFSLERKFRRYPEL FT KQQYEAFLDDYLQRGHMEKLTSAQVEESPDTCFYLPHHAVIKLDSLTTKCRVVFDGSGK FT DSSGVSLNDRLHIGPPIQRDLFGVCLRFRQHQYVLCADVEKMFRGIKVFKPHTNFQRIV FT WRTTENEPLLHFRLLTVTYGLAPSPFLAVRVLKQLADDHGHEYPAAAHALLHDAYVDDI FT PTGANTFEELMILKDELIALLDKGKFKLRKWSSNSWRLLKSLPEEDRCFEPIQLLNKSA FT ADSPVKVLGIQWNPGKDVLYLNLKGCDATISPTKRELLSQLSRIYDPLGLVAPVTVLLK FT LIFQESWTSVLQWDDPIPESLRTRWRALVEDLPALTQCQVPRYIASPFRDVQLHGFADA FT SSHAYGAVVYARVAVGCSFQVTLVAAKTRVAPIKPVSIPRLELNAALLLSRLLSIVKTS FT LTIPLFSTSCWTDSEIVLHWLSAPPRRWNTYVCNRTSEILSDFPRSCWNHVRTEDNPAD FT CASRGLHPSKLLEHRLWWKGPSWLATPTSEWPPSTSKFSVSSSFDVNTEERAIKPTTLH FT NFPDESIHELLIHKFSTWTRLIRVSSYCHRFIHTLRSHHRNSAPFLTSEELLDAQRRLI FT RHVQQKSFAREYEQLENRRQLNAKSHLIRFSPFLDDYGVMRVGGRIEQSTLNYNAKHPI FT LIPKDTPLAGLLVRHFHVSYLHTGVDATFTNLRQQYWILGARNLVRKAVFQCKSCFLQR FT KGTSNQIMGELPIPRVQASRCFQHTGLDYAGPIAIKESKGRTPRIGKAWFSIFVCLTTK FT ALHIEVVSELTTQAFIAAFQRFIARRAKPTDLYSDNGTTFHGGKKTLDDMRRLAIQQAK FT DEELAGFFANEGISWHFIPPSAPHFGGMWEAGVRSIKLHMKRILGSKALTFEELSTVLT FT QIEAILNSRPLCPTGDNSLDPLTPAHFLTGSPYTALPEPCRLDMQVNRLERWNQLQAMV FT QGFWKRWHMEYLTSLHERTKWHLETENLKIDTLVVLKEPNLPPSKWILGRITAVHAGID FT NKVRVVTVKTAHGLYKRPIAKIAVLPLC" XX CC Derived from U23420 (g733531) (Rel. 48, Last updated, Version 3). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6126 BP; 1623 A; 1556 C; 1346 G; 1601 T; 0 other; TGTTTGGGAA CGAGACACCC TGTATACGCG AACAAGTCAC CCTTTATCTT TATTTACATT 60 CTTATTTGTC TGCAGCTTCA TCGGAGCTTA TCAGCGGAAT CAATGTAAGC ATCGCACCGC 120 TGTAATTGTC CGCGAGCTTG CCCAGTACTT TTCCAAACTT CTAACTCCCT TCTAACTGTA 180 ACTTGTTTAC GTCTTATGCT AGACTAATCG TATGGCGTGA TTACAGCCAA AGCTGAAGTC 240 AGTCACAATT TTGATCTGCG AGAAAACGTA CGCATCGGTG TCGAAATAAT TAATATTAAG 300 TGTCTGAACT TAACCAATAA ATGAAAATTA ACAGTAACAC TGGCGGTTTT ATTTATAAAC 360 ATAAAAATTG GTCCTTCGAG CCGGATAACC GGAAGTGCGT TTCGTTCGGG CATTTGATTT 420 TGATTATTGG CCTTTTGGCA AACGATAATC TATAGATTCC TACATCGTGT AGAATCGTTC 480 CCTTCTTTCG ACCACCATGC GGAGTGTGAT TCAACAACGG GGCTTCTGCA AAAGCCAAAT 540 TACTCGTGCG CATAATAATG CCTTAAAATT TGTTGATGAC ATTCACTCAG TGCAAACAAT 600 AGTTGTCCGC CTGGCGCAAC TACAGGAAAA TTATTTGCGG TTCGTACGGC TCTCGGAAGA 660 GCTGTATGCA TTTCAATCGG AAGCCGATTG GGAGAACCCT GACGAGGATT TTGACGCATA 720 TGAGGACAAA CATTATGCTA CACACGCTAT TCTCAGCAAT ACTTTGGAGG AGTTGAGACG 780 GGATGTCACC TCAAACAGTA TTGATGCCAC AGTTCAAGCG CAGGCACACC CCAGAGAAGT 840 CATGTCGATT TTCAGTTCGA GAGAATTAAA CTTCCGACTT TTTCTGGAAA TTATGAGGAC 900 TGGAAACATT TTTCGGACAT GTTTATTGGA TCGATTGCTT CCAATTCGAG CCTGACGGAT 960 TGCCAACGAT TTCATTATTT AAAATCGTAC CTTGCCGGAG ACGCGCTTGC ATTAGTTAAA 1020 CATATTCCAG TTACTAATGA CAACTATCGG GAAGCATGGG AGCGGCTGGA ACAGCGATAT 1080 AACAAACAAT CGCTAATTAT TCGATCGTTC TTAAACAGTT TCATGAGCCT TCCGAGTGCT 1140 ATAAATTCAA ATATCGGCAC AGTGCGGAAA ATTGCCGATG GTGCAGACGA AGTTATTCGT 1200 GGTCTACGAG CTCTTAATTG CGAAGAGAGG GATCCCTGGC TAATTTTCAT TCTACTTTCA 1260 AAATTAGATA GCGATACCCG CCAAGCCTGG GCTCAGTGCG CAGAATCCGA GGAAAAAGGT 1320 GTGACCATCA ACCGATTCTT GAAATTTCTC ACATCACGCT GCGATACGTT GGAGGCTTTT 1380 GAATTAACTC GATCAACCCA AGCTCGACGC GCAGCTACCA CGCACCACGC AGACACGCAT 1440 CCAAGACGGG AAGAGCCGAA GTGCACATCG TGCCAGCAGA ATCACCAACT GTTTAAGTGT 1500 CCTCAATTCA TCGCACTCGA CATTGCATCT CGCCGAGACT TCCTCAAATC AAGAAAGCTC 1560 TGTTTCAATT GCCTCAGCCC GGCTCATATG GTGGGCAACT GTACATCGAG GCATACTTGT 1620 CGGATCTGCC GCCGCAAGCA TCATACTTTG GTTCATGGCT CGTCGCAGCC AATTCAAAAT 1680 GGCAACAACA TTGACACAGC AAGTGTTGAC AGCCGCGATC GACCAGCAGT CTCACATGCG 1740 GGATCTACAA TTGGCCACAA TCAACCGCTA GCTCGAGAAG GTCATCGCTT GGGAAGCGAG 1800 ACTCCCGCGG AAAACAACTT TACGCATCAT ACTCTGGAGA ATATTCCGGC GGCTGGTTCT 1860 CAGACTCTGT TGCCAACCAT CCTTGCTGAC GTCATCGACG CCTGGGGAAA CACTACAACC 1920 TGCAGGCTGC TCCTGGACAC TGGATCTACA ATAACCTTGG CATCGGAATC ATTTGTTCAG 1980 CGAATAGGCG TGCGTCGAAC GCACGCACGG ATTTCTATTC TCGGTCTCGC CGCCAACAGC 2040 GCGGGCGTTA CCCGAGGACG CGCACATATC AAGCTGCGCT CTCGTCATTC GGGCCAAACT 2100 GTCGAATTGG TCTCGTTCAT TCTCACCTCG CTGACGTCAT CACTTCCTGC CCAAGTTATT 2160 GACACCTCAT CCTCTACGTG GAGGCAAATC TGCGAGCTTC CTTTGGCAGA CCCAACGTTC 2220 TGCACACCTG GAGCAATCGA TGTCATTGTT GGATCGGATC AACTTTGGTC TCTATACACA 2280 GGAGATCGGA AACACTTTGG TAACGACTTT CCTATCGCTC TCAATACTGT ATTTGGTTGG 2340 ATTCTTGCAG GCTCTTACTC TGCATTCGAT GATCACCCTA CTTCTGCGGT TACTCATCAC 2400 GCGGACCTAG ACACGATGGT TCGTTCATTC ATGGAGATGG ACAGCATTCA GCCTAACCAG 2460 GCTCTCCTGG ACGCCAGCGA TCCCACAGAG CGTCATTTTG CTGCCACACA CAAGCGCTCG 2520 ACGGACGGGG TGTACGTCGT CGAGTATCCC TTCAAGGAAA AGGCACCGCC TATTGATTCG 2580 ACCTTGCCAC AGGCCATCAA TCGCTTCTTC TCGCTGGAAC GCAAATTTCG TCGGTATCCA 2640 GAATTGAAGC AGCAGTACGA AGCTTTCCTG GACGACTACT TGCAACGTGG ACATATGGAA 2700 AAACTGACCT CGGCTCAGGT TGAAGAGTCC CCAGACACCT GCTTCTATTT GCCGCACCAC 2760 GCTGTCATCA AACTGGACAG TCTGACTACC AAATGTCGTG TAGTTTTTGA TGGATCAGGA 2820 AAAGACAGCT CTGGAGTATC GCTCAATGAC AGACTACATA TTGGTCCACC GATTCAACGC 2880 GATCTTTTTG GCGTTTGTCT ACGCTTCCGG CAGCACCAAT ATGTTTTATG TGCAGATGTC 2940 GAAAAGATGT TTCGAGGCAT TAAAGTCTTT AAGCCACACA CCAATTTTCA GCGCATTGTT 3000 TGGCGCACGA CTGAGAATGA ACCTCTGCTT CATTTTCGCC TGCTGACGGT TACCTACGGA 3060 TTGGCACCGT CACCATTTCT GGCTGTTCGA GTTCTAAAGC AACTTGCCGA CGATCATGGC 3120 CATGAATACC CTGCAGCAGC TCACGCTCTT CTGCACGATG CCTATGTGGA CGATATCCCG 3180 ACAGGCGCCA ACACATTCGA GGAGCTTATG ATTCTCAAGG ACGAGCTTAT AGCCCTCTTG 3240 GATAAGGGAA AATTCAAGCT ACGCAAATGG AGTTCTAATA GTTGGCGTCT TCTGAAATCA 3300 TTACCAGAGG AAGATAGATG TTTTGAACCT ATCCAGCTCC TCAACAAATC AGCTGCGGAT 3360 TCACCTGTCA AAGTTCTTGG TATCCAATGG AACCCTGGGA AGGACGTCCT GTATCTCAAC 3420 CTAAAGGGAT GCGATGCGAC CATTTCTCCG ACGAAAAGAG AACTCTTGTC TCAGCTATCA 3480 AGAATTTATG ATCCGCTTGG ACTGGTAGCG CCGGTCACAG TTCTACTCAA GCTAATCTTC 3540 CAAGAAAGCT GGACAAGTGT CCTGCAGTGG GACGACCCCA TACCTGAAAG TCTACGTACG 3600 CGCTGGAGAG CCTTAGTAGA GGATTTGCCA GCACTTACGC AATGCCAAGT ACCACGGTAT 3660 ATTGCGTCAC CATTTCGAGA TGTTCAACTA CACGGATTCG CCGACGCATC CTCGCACGCC 3720 TACGGTGCGG TAGTTTACGC TCGAGTTGCA GTTGGATGCA GCTTTCAAGT AACTCTGGTT 3780 GCCGCCAAAA CACGGGTGGC CCCGATCAAG CCCGTATCAA TTCCACGTTT GGAGCTAAAC 3840 GCTGCGTTAC TTCTATCTCG ATTGCTTTCT ATTGTCAAAA CATCACTAAC AATTCCTCTT 3900 TTCAGCACGA GCTGCTGGAC AGATTCAGAA ATTGTGCTAC ACTGGCTTTC AGCTCCCCCT 3960 CGACGGTGGA ACACCTACGT CTGCAACCGA ACTTCTGAGA TATTGAGCGA CTTTCCCCGT 4020 AGCTGCTGGA ACCATGTTCG CACGGAAGAC AATCCTGCAG ATTGTGCTTC CCGAGGACTT 4080 CATCCGTCAA AGCTTCTGGA GCATCGACTG TGGTGGAAAG GTCCGTCTTG GCTGGCCACA 4140 CCCACCTCTG AGTGGCCACC TTCTACAAGC AAGTTCAGCG TATCTTCAAG TTTCGATGTC 4200 AACACCGAAG AACGAGCCAT AAAGCCCACG ACTCTACATA ACTTTCCTGA TGAAAGTATA 4260 CACGAGTTAC TCATCCACAA ATTCTCAACC TGGACGCGTC TTATAAGGGT ATCTAGCTAC 4320 TGTCATCGCT TTATTCACAC TCTTCGATCC CATCATAGGA ATTCGGCACC ATTCCTTACG 4380 TCTGAAGAGT TGCTGGACGC ACAGCGCCGA CTTATTCGAC ATGTGCAACA AAAATCCTTT 4440 GCCAGAGAAT ATGAGCAGCT AGAGAATCGA CGCCAGCTTA ACGCTAAATC GCATCTTATC 4500 CGGTTTTCTC CGTTTCTGGA TGATTATGGA GTAATGCGAG TCGGTGGGAG AATCGAGCAA 4560 TCTACACTCA ACTATAACGC CAAGCACCCG ATTCTGATAC CTAAAGATAC ACCACTAGCT 4620 GGACTCCTGG TTCGACATTT TCATGTCTCC TATCTGCACA CTGGAGTTGA TGCAACGTTC 4680 ACCAATCTTC GTCAGCAGTA CTGGATTCTG GGAGCCCGCA ATCTCGTCAG AAAGGCAGTC 4740 TTCCAATGCA AATCCTGTTT TCTTCAACGA AAGGGCACAA GCAACCAGAT CATGGGAGAG 4800 CTACCAATTC CTCGAGTTCA AGCTAGCCGC TGCTTTCAAC ACACAGGGCT GGACTACGCT 4860 GGACCGATCG CAATCAAGGA ATCAAAGGGA AGAACTCCAC GCATCGGAAA GGCATGGTTT 4920 TCTATTTTCG TGTGTCTCAC TACAAAGGCA CTTCACATCG AGGTTGTTAG TGAGCTAACT 4980 ACACAGGCTT TCATCGCAGC CTTTCAACGA TTCATTGCCC GCCGAGCGAA GCCTACTGAC 5040 CTGTATTCGG ATAATGGAAC AACATTTCAT GGAGGCAAGA AAACTTTGGA TGACATGAGA 5100 CGTCTGGCCA TTCAACAAGC CAAAGATGAG GAACTAGCAG GATTCTTTGC CAATGAAGGG 5160 ATTTCTTGGC ACTTTATACC CCCGTCTGCT CCACATTTTG GAGGGATGTG GGAAGCTGGA 5220 GTTCGCTCAA TTAAACTCCA TATGAAACGA ATACTTGGAT CAAAGGCTTT AACGTTTGAG 5280 GAGCTCTCTA CTGTCCTGAC CCAAATTGAA GCTATCCTGA ATTCACGCCC GCTGTGCCCA 5340 ACTGGGGATA ATTCTTTGGA TCCACTGACG CCTGCTCATT TTTTGACTGG ATCTCCGTAT 5400 ACTGCATTGC CTGAACCCTG TCGTCTGGAT ATGCAAGTCA ATCGATTGGA GAGGTGGAAT 5460 CAGCTGCAAG CCATGGTTCA AGGCTTTTGG AAAAGGTGGC ATATGGAATA CCTGACATCT 5520 CTTCATGAGC GGACAAAGTG GCATCTGGAA ACCGAGAATC TGAAGATCGA CACACTGGTA 5580 GTACTCAAGG AGCCCAATCT ACCGCCCTCT AAATGGATTC TTGGCCGCAT CACAGCAGTG 5640 CACGCAGGAA TCGACAACAA GGTCCGAGTC GTTACAGTGA AGACTGCTCA CGGATTATAC 5700 AAACGCCCAA TTGCCAAAAT CGCTGTACTG CCTCTCTGCT GAACAACCGT TCAGGGGGGC 5760 CGGTATGTTT GGGAACGAGA CACCCTGTAT ACGCGAACAA GTCACCCTTT ATCTTTATTT 5820 ACATTCTTAT TTGTCTGCAG CTTCATCGGA GCTTATCAGC GGAATCAATG TAAGCATCGC 5880 ACCGCTGTAA TTGTCCGCGA GCTTGCCCAG TACTTTTCCA AACTTCTAAC TCCCTTCTAA 5940 CTGTAACTTG TTTACGTCTT ATGCTAGACT AATCGTATGG CGTGATTACA GCCAAAGCTG 6000 AAGTCAGTCA CAATTTTGAT CTGCGAGAAA ACGTACGCAT CGGTGTCGAA ATAATTAATA 6060 TTAAGTGTCT GAACTTAACC AATAAATGAA AATTAACAGT AACACTGGCG GTTTTATTTA 6120 TAAACA 6126 // ID 412 standard; DNA; INV; 7567 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0000006; 412. XX SY synonym: mdg2 XX FT source nnnnnnnn:1..7567 FT SO_feature five_prime_LTR ; SO:0000425:1..514 FT SO_feature three_prime_LTR ; SO:0000426:7054..7567 FT SO_feature CDS ; SO:0000316:679..1044 FT SO_feature CDS ; SO:0000316:1408..1722 FT SO_feature CDS ; SO:0000316:1888..3243 FT SO_feature CDS ; SO:0000316:3864..6866 XX CC Berkeley Drosophila Genome Project. XX SQ Sequence 7567 BP; 2982 A; 1367 C; 1323 G; 1895 T; 0 other; TGTAGTATGT GCCTATGCAA TATTAAGAAC AATTAAATAA AATAGCATAT TAACTTATGG 60 CAGCACTTTG TTGCTATGTT TATGTTTATG TTTATGCACG CAGTTAGGCC AGGGCGGATG 120 TAACATGATC ACCCACTCGA AGGCCACAAA GTATAAGTGC ATTGCCCAAT CGAAGGCAAA 180 AAGTATAAGT GCATGGTCAG CATTCACACG CCGACCAAAT ACATATTACA TACGTACATA 240 CATATCTCGC TCTCCCGATA AGCCTAGATA TATAAGATAT ACATAAGAAC GCCGCTCCGC 300 TGCTGGCGTA CCCGGCAGCG CAGCTACGCG GATTAGCCTA AGTCCAAATA TATAAAAAAC 360 TGTAAAATCG GAGAGACTCT GTAGACGTTG AGCTGACAGA ACCATTTCTG CCTACTCTAA 420 AATCAAAAGA AGAAATTGAA TAAATATATG TCAGCCCGAC GGCTGCCTTA AACTTAAAAC 480 GGACTTGTGT TCTTAATTGG AGTTCATCAT TACATGGCGA CCGTGACAGT CGTCCAACGC 540 TGGACGAATT GACCAAAGCT GGTGAAAACA AAGGAACAAA GGAACACTGG ACTGGAAGAA 600 GACTGGACTA ATTAAATGGA ACTGCAAAAA CCAAGGAAAA ATCTGAGTGA GTAGAGTTCT 660 ATTGAGTATG GGCAAACACC GTGGCGGTTT GAAAACTAAG CTGAATAAAC GTATAGCCCA 720 CGTAAGGTGG CTAATATACG GTCAGCAAAC GCCACCGGTT TGGTCGAAAG CTCTAAAGCT 780 ACATGCAGAG CTAGACCACT TGTTGCAATA TCAGCAAGAA TTAAAGACCC ATAAGCTCGA 840 GAAAACTCAC TCAGATAATA TTAAAAATAT ACCCACAATT AATGAAGTTC CAAAATACCA 900 GGCATGTCCA GCACCAGCAC CAGCATTAAC AAAACCAAAG AAGTCCTGCC CCCCTGGCTG 960 CGAAGGAATC TGGAGTCCCC ACTGCCTGGG GACTTGTGAG CGACCATCGA CGTCTTCAGC 1020 GGCGAAGAAA TAGACAGCAG CGAGGGAGTG TCAGCGTGCC ACCCCCGGCG ACGCCCAGCT 1080 GACACCCAAC AAATAGACAG CAGCGAGGGA GTGTCAGCGT GCCACCCCCG GCGACGCCCA 1140 GCTGACACCT GATGAGCATC ATCAACAGCA GAATATAATA ATAAATATAT ATAAATATAA 1200 AGTAAATATA AAATATATAT AGATAAGAAA AATTGTAAGA AATATTGTAA AACGGAGCAT 1260 ATACTATTAT GCCCTGTTAA CCCAATATGG CCCGTGAAGC CATAGCTAGA ATCAGGCAGG 1320 CAACAATGTA AAATACAATT TTTTTTTACT CTTGCGAACA TTGAAAGATT TTATAAATAG 1380 ATAATTCCAA ACATAAATGT CTATAGAGAC AAATGAAATA AGTAAAACTG AAAATAAAAG 1440 TATATACAAA GGAAATTTTC TATTCTATTC TCCAAAATAT AAAATTAGTA TACCCAAAAT 1500 GGGTCTAATA GACACTAAAA CTGTGGACTC TACAGCCAAT GTAATAAATA AAGTAGAAGT 1560 CCAAAATGCA GACTTGTTCT GGATAACCAT AATACTAATT GTAATTGCAT TAATTATGGT 1620 ATCCAATGCA TTAATAAAAA TATACAAACT GCATAACAAG TGTCTTAAGA AACGATACCG 1680 TAGCACTGCT AACGGTATAG ATAATATTTA AGGAAGATCT TTAATAAAGT CAATTATGAA 1740 TGAAAATATG AGAAAAATTA TATGAAAAAA AAAAATAATA AATAAAAAAA AAATATAAAA 1800 CGTAATATTG AATTTATCTA CATTAAAAAA AAATATATAC AAATGAATAA ATTTGAAGTT 1860 ATGAGTATAC CACAGCATGG ACTGGGAAAA GCTTGTTGAT CAGATAAAAG ATCAAAATGA 1920 AAATTTCAGA AAATCCTATA AGTGCTTAAC GCAAAACAGA TCAACACAAG CTGTAACAAT 1980 CAATAGGAAT GCCCAAGTCT TGGTAAATAG TTATAATGAA ATCAGAGAGT TGATCCAACA 2040 AAATAGAAAG AATTTGGAAC GCAAACAGTG TGCTAAGGCT TTGAACCTAC TGGTGACATT 2100 AAGAGAAAAA TTAATATTTA TAAAAAATAA ATTCAGTCTC CAGATAGAAA TTCCAACCAT 2160 AGTAAACACC CCACTAAGAA TAAATTTGAA TGAAGACAGC ACTAACTCTG ACGAGGAAGA 2220 TAGGACTATA GTCAAGGAAG ACATTAAAGA GGAAGATCTT CACGATCTAA CTATACCAGC 2280 AAAATTAATG CTGAAGAACG ACGATAAAAC AAATAACGCA GCCGACTCCG AAAATAACTT 2340 AACCATGGCA GAAGAAGCAG CTGCCATTAG GTCTTACATT AGGGAAGTCG CCTGCACAGT 2400 GCCAGAATTT GATGGGCAAA AGATCCATTT ACAAAGATTC ATTAAGGCAA TCAAATTGGT 2460 AGACCTAGCT AAGGGACCAT TTGAAGACAT TGCAGTTGAG GTCATTAAGT CAAAAATAGT 2520 TGGCACAATT TTGAACTCAG TTGACAATGA AACGACAATT CCAGCAATTA TAAACAAATT 2580 GCAGAAAGTA GTTGTCGGTG AGACATCCAG TAATGTCAAA GCAAAGCTAG CAACAGTTCA 2640 GCAGAGAGGT AAAACTGCAA CGCAATTTAC CGCTGAAGTT GATAGCCTGA GAAAACTTTT 2700 AGAAGCTTCC TATATCGATG AGGGTATACC TCTAGAACAT GCCACTGGTC TAAGCACCAA 2760 AGAGGCAATT GAAACCATGA TACATCGTGC TGAGCACGAA AGTATCAAAA CAGTACTGGA 2820 AGCAGGGACT TGCACCACTA TGGATGCAGC GATAAGCGCA TACATAAGAA CGAGTACAAG 2880 AGTTACCGGT GACATCAATA AAGTGATGTA CTTTAGAGGT AACAGACCCA ATAGAGGATA 2940 CGGAAATGCC AATAGAGGTA GTAACCGCGG TAGAGGCTTT AATAACAATA GTATTAGAGG 3000 CAACTACCAT AACGGTTACC AAAATAACGG TTACCAAAAT AACGGTTACC AGAATAACGG 3060 TTATCAAAAC CGCTATAATG GAAATAATAA CCGTTATAAT GGCTATAACA GAGGCCGTTA 3120 TAATGGAAAC AGAGGCCGTA ACAACAGTCA GAACAACTAC AACAGAAACA ATGCCAATGT 3180 ACGAGTAATC CAAGAACAGG GAAACTCGCA ACAGCCTTTA GGTACTCAGT AGAAGAAGAT 3240 CGTAGAGTAT ACACCATCAA TTATAATCTC AACATATTTT CTACATTCAT TCATGCCAAA 3300 ACAGGCGTAA AACTAGTTTT TCTACTTGAT ACAGGTGCAG ATATCTCTAT TCTCAAAGAG 3360 AACTCTGACA AATTTTCTAA TATTCAAATA ACCAATAAAA TAAACATTCA AGGCATAGGC 3420 CAACAGAAAA TTCAGTCTCG AGGACAGACT TTTATTGAGA TACAGACAGG TAAATACGTT 3480 ATCCCACACG ATTTTCATTT AGTAGATAAA AACTTTCCAA TACCGTGTGA TGGAATAATC 3540 GGAATAGATT TCATAAAAAA ATATAATTGC CAAATCGATT TAAACCAAGA AGAAGATTGG 3600 TTTATAATTA GACCAAACAA TTTGAAATTT CCAATATATA TTCCCATAGC ATACAGCTCT 3660 GGTATTAACA CAACGTTATT ACCAGCAAGA TCCCAAGTTG TCCGAAGATT AATAGTATCA 3720 TCAAAAGATG ATAACATTTT AATTCCAAAC CAGGAAATTC AAACTGGTAT TTATGTTGCA 3780 AATACAATCG CAACATCAAG TAATACATTT GTCCGAATTT TAAATACAAC CGATTCCGAC 3840 CAATTAGTCA ATATGGACAC TCTAAAATAT GAGCCACTTT CGAACTACAA TGTAGTTCAG 3900 GCAAATAGTG AACACAGAAA TAAAACTGTC TTATCTCAAT TAAAGAAAAA TTTCCCCGAA 3960 TTGTTTAAAT CACAATTAGA AAATATATGC AGCGAATATA TAGATATATT TGCATTAGAA 4020 TCAGAACCTA TAACAGTTAA TAATTTGTAT AAACAACAGT TGAGATTAAA AGATGATGAG 4080 CCAGTATACA CGAAAAATTA TAGAAGTCCT CATAGTCAAG TGGAAGAAAT ACAAGCCCAA 4140 GTTCAGAAAT TAATAAAAGA TAAAATAGTT GAACCATCAG TTTCACAGTA CAATAGCCCT 4200 TTGCTATTAG TACCCAAAAA GTCAAGCCCG AATTCTGATA AAAAGAAATG GAGATTAGTA 4260 ATAGACTATC GCCAAATTAA TAAGAAACTT TTAGCTGACA AATTTCCACT ACCGAGAATA 4320 GATGATATTT TGGACCAACT TGGTCGAGCA AAATATTTCT CCTGCCTTGA TTTAATGTCA 4380 GGTTTTCATC AAATCGAACT GGATGAAGGC TCGAGAGATA TAACATCTTT CTCAACCAGC 4440 AATGGCTCAT ATCGTTTCAC GCGATTGCCA TTTGGCTTAA AAATAGCGCC TAATTCATTC 4500 CAAAGAATGA TGACTATAGC ATTCTCCGGA ATAGAACCGT CTCAAGCATT CCTTTATATG 4560 GATGACTTAA TAGTCATAGG TTGTTCCGAA AAACATATGC TTAAAAACCT CACTGAAGTT 4620 TTTGGTAAAT GCAGGGAATA CAACCTAAAG TTACATCCTG AAAAATGTTC ATTTTTCATG 4680 CATGAAGTCA CATTTTTGGG ACACAAATGC ACAGACAAAG GAATTTTGCC GGATGACAAA 4740 AAATATGATG TCATTCAGAA CTACCCAGTT CCACATGATG CGGACAGCGC TAGACGTTTT 4800 GTAGCATTTT GCAATTACTA CAGACGTTTT ATCAAAAATT TCGCCGACTA TTCGCGGCAC 4860 ATAACAAGAT TATGTAAAAA GAATGTTCCA TTCGAGTGGA CAGATGAATG TCAAAAAGCA 4920 TTCATACATT TAAAATCTCA GCTAATTAAC CCAACACTCT TGCAGTACCC AGACTTCAGC 4980 AAAGAATTTT GCATAACAAC AGATGCAAGC AAGCAAGCGT GTGGCGCAGT TTTAACTCAA 5040 AACCATAATG GCCACCAACT CCCAGTTGCT TATGCATCCA GAGCTTTTAC GAAAGGTGAA 5100 AGCAATAAGA GTACAACAGA ACAAGAGTTA GCAGCAATTC ATTGGGCAAT AATACATTTC 5160 AGACCATACA TTTACGGAAA ACATTTCACT GTGAAAACAG ACCATAGACC ATTGACATAT 5220 TTATTCTCGA TGGTGAACCC CAGCTCTAAA TTAACTAGAA TAAGGCTTGA ACTAGAGGAA 5280 TATAATTTTA CAGTAGAGTA TCTAAAGGGC AAGGACAATC ATGTAGCAGA TGCGTTATCA 5340 AGAATAACCA TCAAAGAGCT AAAAGATATA ACTGGAAATA TATTAAAAGT CACTACAAGA 5400 TTTCAAAGTA GACAAAAATC CTGCGCAGGA AAAGAACAAT TGGATTTGCA AAAGCAAACC 5460 AAAGAAATAG CTTCAGAGCC CAACGTATAC GAAGTCATAA CAAATGACGA GGTACGAAAA 5520 GTAGTGACAT TGCAATTGAA TGACTCGATA TGTTTATTTA AACATGGAAA GAAAATTATT 5580 GCAAGATATG ATGTTGGTGA TCTTTATACT AATGGAATTC TTGATTTAGA TCAATTTCTC 5640 CAAAGGCTTG AATTGCAGGC CGGTATATAT GATATCAGCC AAATCAAAAT GGCACCGTGG 5700 AAAAAAATCT TTGAACACGT TTCAATAGAT AAATTTAAAA ATATGGGCAA TAAAATATTA 5760 AAGAATTTAA AAGTAGCGCT ACTTAACCCG GTGACCCAAA TAAATAATGA AAAAGAAAAA 5820 GAAGCTATAT TGTCTACATT ACATGATGAT CCAATACAAG GAGGGCATAC AGGCATTACA 5880 AAAACCTTGG CCAAGGTCAA AAGACATTAT TACTGGAAAA ATATGAGTAA ATACATAAAA 5940 GAGTACGTAA GAAAATGTCA AAAATGCCAA AAAGCAAAAA CAACAAAGCA CACAAAGACT 6000 CCAATGACGA TAACTGAAAC ACCAGAACAT GCTTTCGATA GAGTTGTTGT GGACACAATT 6060 GGTCCACTAC CCAAGTCAGA AAATGGTAAC GAGTACGCAG TCACTCTCAT ATGTGATTTA 6120 ACCAAGTACT TAGTTGCCAT ACCAATAGCA AATAAAAGCG CAAAAACAGT CGCAAAAGCT 6180 ATATTTGAAT CTTTTATTCT AAAGTACGGT CCAATGAAGA CGTTCATAAC GGACATGGGA 6240 ACAGAGTATA AGAATTCAAT AATTACTGAC CTGTGTAAAT ATTTGAAAAT AAAAAATATA 6300 ACATCAACAG CTCATCACCA CCAGACAGTT GGAGTAGTAG AAAGAAGTCA TAGAACCTTA 6360 AACGAGTATA TACGATCCTA CATATCGACG GACAAAACCG ATTGGGACGT ATGGCTTCAA 6420 TATTTCGTAT ACTGCTTCAA CACGACCCAA TCTATGGTAC ATAATTATTG TCCATATGAA 6480 TTAGTTTTCG GTAGAACAAG TAATTTACCA AAACATTTTA ATAAACTACA TAGCATAGAA 6540 CCAATATATA ACATAGATGA TTACGCTAAG GAGAGTAAAT ATAGGTTAGA GGTAGCATAT 6600 GCTCGAGCAA GAAAACTTCT CGAAGCACAC AAAGAAAAAA ATAAAGAAAA TTATGACTTA 6660 AAAATAAAAG ACATAGAATT AGAAGTAGGA GATAAAGTTT TACTAAGAAA TGAGGTAGGT 6720 CATAAATTAG ACTTTAAATA TACGGGGCCC TATAAGATAG AAAGCATAGG AGATAATAAC 6780 AATATTACGC TACTTACTAA TAAAAACAAA AAACAAATAG TTCATAAAGA TAGATTAAAG 6840 AAATTTCATT CATGATTGAA TTTAAACTTA TATTTTCCTT AATCATTTAC ACAAATTTTC 6900 CATACACTAC GTATATTTTT ATCTTTGCAT TATAAAATCA ACTATTGTTG TTCAAACAAA 6960 AACACAAACA AAATAAAAAT AAAAATAAAA TAATTTGCAT TTAATAATCA AAATAACTTC 7020 ACTAGGTTAC GTTATTTTTC AAAAGGAGGG AGATGTAGTA TGTGCCTATG CAATATTAAG 7080 AACAATTAAA TAAAATAGCA TATTAACTTA TGGCAGCACT TTGTTGCTAT GTTTATGTTT 7140 ATGTTTATGC ACGCAGTTAG GCCAGGGCGG ATGTAACATG ATCACCCACT CGAAGGCCAC 7200 AAAGTATAAG TGCATTGCCC AATCGAAGGC AAAAAGTATA AGTGCATGGT CAGCATTCAC 7260 ACGCCGACCA AATACATATT ACATACGTAC ATACATATCT CGCTCTCCCG ATAAGCCTAG 7320 ATATATAAGA TATACATAAG AACGCCGCTC CGCTGCTGGC GTACCCGGCA GCGCAGCTAC 7380 GCGGATTAGC CTAAGTCCAA ATATATAAAA AACTGTAAAA TCGGAGAGAC TCTGTAGACG 7440 TTGAGCTGAC AGAACCATTT CTGCCTACTC TAAAATCAAA AGAAGAAATT GAATAAATAT 7500 ATGTCAGCCC GACGGCTGCC TTAAACTTAA AACGGACTTG TGTTCTTAAT TGGAGTTCAT 7560 CATTACA 7567 // ID DMAURA standard; DNA; INV; 4263 BP. XX AC AB022762; XX DR FLYBASE; FBgn0010103; aurora-element. XX FT source AB022762:1..4263 FT SO_feature five_prime_LTR ; SO:0000425:<1..112 FT SO_feature three_prime_LTR ; SO:0000426:4046..>4263 FT SO_feature primer_binding_site ; SO:0005850:119..134 FT SO_feature primer_binding_site ; SO:0005850:4035..4044 XX CC Derived from AB022762 (d1268008) (Rel. 59, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 27-March-1999. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4263 BP; 1021 A; 1018 C; 1375 G; 849 T; 0 other; GAATTCTTCA AGAATAAAAC GTGTTCTACT ACCACGGATT AGTCTGCCCT TTCTTTCGGG 60 AACCAATGTG TGGGGTAGCC GTTTAAGGCA ACTCCCTGGA CGCACGACGA CAACCTTTTA 120 TTCGCAGTCC TAGGGCGACT GCAGGGGCAA CTTGCGCTGG AATGACGGTT TAGACGGCCA 180 GCTAGAGAGT TGCCGGAGCT GGAGTGACGG TTTAGACGGC CAGCGAGGAG GATTTGTGTG 240 AGCGCAGCCA GCGCTACGTA CCGGCAGAGG AGTCGCAGTC AGCGACATAG AGGGACGCAG 300 CCAGCGTCGA ACGCCGGTAC GAAAGGGTCG CAGCCAGCGA CAAGGAGACG CAAGAAGCGT 360 CATTTGTGGA GACCGCAGCC AAGCATCCGT GGCCGCAGCC AGCGGCACGA GGCGTCAGAG 420 ACGCCATTTC GGACGCGCAG AGGCGCCGCC ATTTTTGGAG CTGGGAAAGA TGCAGCATTC 480 CCCCAGGAAG AGTGCCCGGC TGAACGGAGG GGAAGTCACC CCTATAACAA CAGTGAGTCA 540 GCAGCCAGCC AGTAGTGGAG CAGGAACTCG GACGCGGGTG AACATCACGG CGGCGTCGAT 600 TCCTTGCCCG GCCACTACGG TGACTACAGT AGCTTCCCAA CCTAGAAGTA CTGCTGTCAC 660 AGCTGCGAGT TCAGTACCGG AGGTGAACCA GCCCCTCGTG TTGGAACTCA TGGAGAGGAT 720 CGCAGCGTTG GAGAGGGAGC TGGAGAAGAC TAGATCCCTA GAAAGTGTGA GCACCGCCAA 780 TTGCGCGCCA ATCGCAGTTG GCCCAAGCGC AGTTGGCGCC AACAGTGGAG CGTCGGGGCG 840 GCCGCCATTT TGGAGCGGCC AGCTAATACC CACATCTAAC GGAGAGGCCT TACATAACGG 900 GGACTGGGCC AGGCATGCTG CAACGATTGC GCCCTTTCCC ACTGTAGTCC ACTTCAGCGC 960 GTGGCTACAG GAGTACGCAA ACGTGGTGTG CACGGTTTTG GACGTCGAGG GAAAGGAGCC 1020 GAGGCGTCGA CTTCTACATG CAAGCGTCGA CCATAATGAA TGCGATCAAC AGGATGATCG 1080 GCATGGAGGT TGTCCCATCT GTGGAGGACA GCATGAAATA TTGAACTGCA GAAAATTTAT 1140 TGGAGCTTCG CCACAGGAAA GGTGGAGCAA TGTGAAGAGG CATCGGCTCT GCTTCAATTG 1200 CCTGCGAAGC GGGCACACGG CTAGATCCTG CTATACGCAA GGTGAGTGCC AGGTTAATGG 1260 ATGCCGAAGG GAGCATCACC GTCTGCTACA TGGTGCGGAC GGAGGAACGA AGGCCGCTGC 1320 AGCGAGGTGG CTTCAGACGC CACGAAGGGA ACCAGCAGCC AGCAGTTTCC AGACGCAGCC 1380 TAAAGGGGAG GCCTTCGCTA CGAGATGGTC ACAGGGACCA GGAGAGGAAC CGGCAGCCAG 1440 CCGTTCCAAG CAACAGTCTG GAGAGAGGAG CTCCACGTGA AGCGGGAGCG CCCATGCAGA 1500 GGAATTTGAG CTGCGTTGAC GCCGAAGGAG GCCGTCTACT GTTCCGTATA CTGCCGGTTA 1560 CGCTGTACGG AGCGGGGCGA AAGGTGGATA CATATGCGCT CCTAGATGAG GGATCCTCCG 1620 TCACGATGAT CGATGACGAA CTACGAAGGG ATCTTGGAGT GCAAGGAGAG CGTCGGCAGC 1680 TAAATATCCA ATGGTTTGGT GGTAAGGCAA CCAGAGAGCC TACCAACGTG GTGAGTCCGA 1740 AGATAAGTGG AGTTGGAAAG CCCACTCGCC ATGTATTGAG AAACGTTTAT GCCGTTTCGA 1800 GCTTGAGTTT GCCGATGCAG ACATTGAGCC GACGAGATGT CCAGGGCGTG CACAGGGATG 1860 CGCGTCTGCC CGATGAAGCC TTACAGCAAC GTGGTGCCGA AGCTGCTCAT CGGTCTGGAT 1920 CACGGACATC TGGGGTTGCC ACTTAGGACG AGGCGGTTCG CTCGAGAGGG ACCGTATGCG 1980 GCCGCAACCG AGCTGGGCTG GGTTGTGTTT GGGCCTGTAA GTGGGCAACC GACCACGCCG 2040 TCACCGAGGT CCTGCCTACT TGCCGTGTCA GTGGATGACG CGATGGAGAA GATGGTGGAG 2100 GACTATTTCG ACATGGAGAA CTTTGGAGTG AAGACCGCGC CGCCGGTCGC AGCCAGCGAC 2160 GATGTCCGGG CCCAAAGGAT ACTCGAAGAC ACCACGGTGA AAGTGGGGCG TCGCTACCAG 2220 ACGGGATTAC TCTGGAAGGA CGACCACGTT GTGCTGCCAC CGAGATATGA GGACGACGAC 2280 GTGCAAGTGA GCTTCGTGAG TGCGAGGACG AAGTGTGCCC CAATGAGAAC GATGACGATC 2340 CCACGGCTGG AGCTGCAAGC AGCAGTTCTT GGAACCAGGC TGATGAACAC TGTCAAGGAG 2400 GAGCACAGTG TGGTCATCAC GGACCTGGTG TTATGGACGG ACTCTAAGAC GGTGCTGAGA 2460 TGGATCGGCA GCACCCACCG CCGCTGACAA TGCGGCTGAT GATGCGACGC GGTCGCAGAA 2520 AAGGAGTCGA CCTTAGCCAG GAATCAAGGT GGCTAAGAGG ACCTGCATTT TTGATGCAGC 2580 CAGCAGCCAG CTGGCCGGGG TCTGAGGAAG GAACTGAGCG TGTTCCAGAT GTCCCTGATG 2640 AAGAAGAGAT GCCCAGTGAG TTTGCATTAG TTGCGGTAGA CGATTTTGTC ATTCCGTTTC 2700 AGAGATTCTC GAGCTTCAGT CGCCTGGTGA GGACCACAGC CTGGGTCCTA CGGTTTGCGC 2760 GCTGGTGCCG CAAACAGCGA AACGATCTCG AGGAATACGG CCTTACCGCA GCCAGAATGT 2820 AAGGCCGCCG GAACCGCACT GTGCATCCCG TACAGTGCGA GGAGGGCCGT ATTACTGTCA 2880 CACAGGCACA GTCTGACGGA GCTGATTGTG AGAGACTTCC ACGCCAGGAT GAAGCATCAA 2940 AATGTGGATG CTACGATCGC GGAGATCCGG ACAATGTTCT GGGTCACAAA GATGAGGCGT 3000 GTGATGCGGA GAGTCATCTC ATCGTGCAAC GAGTGCAAGT TGCAGCGAGC GCGGCCGATG 3060 CCGCCGATAA TGGGACCCCA TCCGGAAGAC AAACTGGATG CGGGTGGATG GCCATTCAAA 3120 TACACAGGAC TGGACTACTT TGGGCCACTG CTGGTGACTG TGTCCCGTCA CAAGGAGAAG 3180 CTTGGGTCGC CTTGTTTACG TGTTTGACGA CAAGGGCGAT TCACCTGGAG CTGGCGCATG 3240 ACCTGTCGAC GGATTCCTGC ATAATTGCGA TCAGGAACTT CGTCTGCCGT AGAGGGCCAG 3300 TATATAGACT GCGCAGCGAT AACGGCAAGA ACTTCGTGGG AGCTGACAGG GAAGCCAGGC 3360 GCTTTGGCGA CGTATTCGAG ATGGAGAAGC TTCAGAGTGA GTTGACAAGC AGAAGCATTG 3420 AATGGGTGTT TAATTGTCCA GCGAACCCGT CTGAGGGCGG AGTTTGGGAG CGCATGGTGC 3480 AGTGCGTCAA GAGAGTACTG CGTCATACCC AGAAGGAAGT TGCGCCGAGG GACCATGTAT 3540 TGGAGAGTTT CCTGATTGAG GCGGAGAATA TTGTAAACTC GCGTCCGCTC ACCCACTTGC 3600 CTGTGGATGT GGACCAGGAG GCGCCGTTGA CGCCAAACGA TCTTCTCAAG GGAGTAGCCA 3660 ATCTGCCGGA TACGCCTGGA TTGGATGCGG AGCTGCCCAA GGAAGGTACT ACGAGGAAGC 3720 AGTGGAGAAT TTCTCGCCTG CTACGAGACC GTTTCTGGAG GAAGTGGGTC ATGGAGTACC 3780 TGCCTACGCT TGTGCGCCGC GAGAAGTGGT GCCGACGAAC GGAGCCCATC CACCAGGGTG 3840 ATGTGGTCTT CGTCTGCGAT CCTGCCTTGG CCCGACGAGA GTGCCGCAAG GGTATCGTGG 3900 AGGAGATCTA CAGCGGAGCT GATGGAGTTG TCAGACGCGC TAAGGTGCGC GTGAACGAAA 3960 ACGGCCTATC TAGGACAATG ATGCGACCCG TCTCTAAACT TGCAGTTATG GATTTGAGTG 4020 AAGCGGTTCT TCACGGGGTC GGGGATGTCG CGGATCGAAT ATTGTTATCG ATAGGCTCTA 4080 GTTAGTATTT TTGAGAAGTC CGAATGTGGA AGGATTTGTA AGCCCATATG TGTCTGGGCA 4140 CGTTGTTTTT GGCCATTGTA AATTACCGGG AAAATTTAGC TTTTCATTGT CGTGTAAGAG 4200 TTGGAGGACA CACTGCGGTG AGCTAATAAG TTAAGTTAGT TGCAATTGTG AAACATTGAA 4260 TTC 4263 // ID DMBARI1 standard; DNA; INV; 1728 BP. XX AC X67681; S55767; XX DR FLYBASE; FBgn0005773; Bari1. XX FT source X67681:1..1728 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..28 FT SO_feature terminal_inverted_repeat ; SO:0000481:1701..1728 FT SO_feature CDS ; SO:0000316:379..1398 FT /db_xref="FLYBASE:FBgn0043784; Bari1\ORF" FT /db_xref="SPTREMBL:Q24258" FT /protein_id="CAA47913.1" FT /translation="MPKTKELTVEARAGIVARFKAGTPAAKIAEIYQISRRTVYYLIKK FT FDTVGTLKNKKRSGRKPVLDQRQCRQILGVVAKNPSASPVKIALESKNTIGKQVSSSTI FT RRRLKEADFKTYVVRKTIEITPTNKTKRLRFALEYVKKPLDFWFNILWTDESAFQYQGS FT YSKHFMHLKNNQKHLAAQPTNRFGGGTVMFWGCLSYYGFGDLVPIEGTLNQNGYLLILN FT NHAFTSGNRLFPTTEWILQQDNAPCHKGRIPTKFLNDLNLAVLPWPPQSPDLNIIENVW FT AFIKNQRTIDKNRKREGAIIEIAEIWSKLTLEFAQTLVRSIPKRLQAVIDAKGGVTKY" XX CC Derived from X67681 (g7640) (Rel. 36, Last updated, Version 6). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1728 BP; 596 A; 291 C; 332 G; 509 T; 0 other; ACAGTCATGG TCAAAATTAT TTTCACAAAG TGCATTTTTG TGCATGGGTC ACAAACAGTT 60 GCTTGTGCAG CAAGTGGGGG GAGGTGAAAT GCAAAAAAAC TTTTGCTTTT GCAAATTCAA 120 ACCTATGCAG AGTCAGATGA AAGAAGAATT GAAAAAATAA CTGTTCCTAT GCGCAAGGAA 180 GAGGCAAATG AAGAGATCTT TATCAGTTGT CAGAAGTATT TGCACACGGT TTCGTCGCAT 240 CACAATTATT TTCACAACGC AATTTCTTCT TCAGTGATTG GTTTAGAGTG ACAAGTGCCG 300 GTTTGTTTGC TTAAATACAT TTAAATTATT GAATAAAAAT TAGATTTAAT CATTTTCCTA 360 TTACAGTTAT TAAATAAAAT GCCCAAAACA AAAGAGTTAA CAGTTGAGGC CCGGGCTGGT 420 ATTGTTGCTA GGTTTAAAGC CGGTACACCT GCGGCCAAAA TAGCTGAAAT ATATCAAATT 480 TCGCGTAGAA CTGTCTACTA CTTAATAAAA AAGTTTGATA CAGTTGGCAC ATTAAAAAAT 540 AAAAAAAGAT CAGGCCGAAA ACCTGTGCTG GACCAAAGGC AATGCAGGCA AATACTTGGA 600 GTTGTGGCGA AGAATCCTAG TGCCAGTCCG GTAAAAATTG CCTTAGAATC AAAAAATACA 660 ATTGGCAAAC AAGTTAGTAG TTCTACAATT CGTCGCAGGC TAAAAGAAGC TGATTTTAAG 720 ACATACGTTG TTCGCAAAAC GATTGAGATC ACACCAACCA ACAAAACAAA ACGTCTTCGA 780 TTTGCGTTGG AATATGTTAA GAAGCCTCTT GACTTTTGGT TTAATATTTT ATGGACTGAT 840 GAGTCTGCAT TTCAGTACCA GGGGTCATAC AGCAAGCATT TTATGCATTT GAAAAATAAT 900 CAAAAGCATT TGGCAGCCCA GCCAACCAAT AGATTTGGTG GGGGCACAGT CATGTTTTGG 960 GGATGTCTTT CCTATTATGG ATTCGGAGAC TTGGTACCGA TAGAAGGAAC TTTAAATCAG 1020 AACGGATACC TTCTTATCTT AAACAACCAT GCTTTTACGT CTGGAAATAG ACTTTTTCCA 1080 ACTACTGAAT GGATTCTTCA GCAGGACAAT GCTCCATGCC ATAAGGGTAG GATACCAACA 1140 AAATTTTTAA ACGACCTTAA TCTGGCGGTT CTTCCGTGGC CCCCCCAAAG CCCAGACCTT 1200 AATATCATTG AAAACGTTTG GGCTTTTATT AAAAACCAAC GAACTATTGA TAAAAATAGA 1260 AAACGAGAGG GAGCCATCAT TGAAATAGCG GAGATTTGGT CCAAATTGAC ATTAGAATTT 1320 GCACAAACTT TGGTAAGGTC AATACCAAAA AGACTTCAAG CAGTTATTGA TGCCAAAGGT 1380 GGTGTTACAA AATATTAGTA TTGTATTTAT ATAAAATAAA GAAATTCTTA TGTTGAAATT 1440 AGATGTTAAG CTGAAATTTA CTAAATTAAG TTGAGTGAAA ATACTTTTGA AGCGCAATAA 1500 ACATGTGAAA ATACTATTGA CAACTTGCAT GCATATTTTC TTTTGCTTTA AGCTTTGTAC 1560 TATGAACCGT TATCTTTCGT ATTTCTTTTC GACTACCTTC TGCATAGATC AAGCTAAGCG 1620 ATAAGAACTA TTTCAGGCAA ATCGGACAAC AACAAGAAGA AATATAACAA AAAGAAGTTG 1680 AAGTTTGCAA ATATTGTGCG TTGTGAAAAT ACTTTTGACC ACCTCTGT 1728 // ID BS standard; DNA; INV; 5142 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0000224; BS. XX FT source nnnnnnnn:1..5142 FT SO_feature CDS ; SO:0000316:341..2248 FT SO_feature CDS ; SO:0000316:2245..2965 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. CC This replaces that from complement(X77571:651..5776) in versions CC previous to 4.8. XX SQ Sequence 5142 BP; 1652 A; 1222 C; 1075 G; 1193 T; 0 other; AAATCTGCAT TCATAGAGAT CGGTTGTGTC GCGCGTATGC AAAAGTGATC TATTTTGCTT 60 TATTGTTGCA ATTTCTTGGG TGCTTAAAAT AGCACTCACC AGTACATTCG GGCGCTGCTT 120 CGTGCGGTGT CGGCATCTGG CCAACAACAA AAAGCGTTAA TCGAAGTGCG GTGTAGCTAC 180 GATACCTGCC CTTCGGGCAA CTTATTCCCC TCACCCCGCG CAAAGCCGCT GAAGGGGGCA 240 ATAAAATCTA TGCTTATCAG CAAAACTGAT CCGTATTTGA TCTGTTTTGT GGTCAGTTAA 300 GCAAGCTATT TTGTAAATAT TAAGAATTAT TATTAAGACA ATGGATGAGA ACAATTCTGA 360 TGACACCCAG CTTTTAAATA AGCAGAGTAA CCATAGAACA ATGTTCTCAA TAGCTGGCAA 420 ATTACCTCAC GAGATTAGAA ACGAGTGCCG ATCAGCAATT CAACGCTTTA CAAGCAGCGT 480 AACTCAAAGC AGTAGCGTCA CCACAACAAC GGTGACATTT ACTAGTGCCA ATAACAGCAC 540 CATATATACA ATGGCAAATG CCGCAATAAG CAGCCCGTGC CTTGGAACAA GATCCACTCA 600 CCAGGAAAGT TCCACATTGA TAAACTCCGG AATCGTAGAA GATAATCTCA GCGATGCTGC 660 CAGAAGGTTA TTAAATGACC AAAATCAGAG AGCGGGTAAA AGGAAAAATG GAAAGCCCTT 720 GTCCCCCATC TCCAACCCGA AAAGAGGGAG TAGCAGCCAA GTTTTACACT CGCCCCCTAC 780 GACTAGCCTG AAGATAAGCT CTAATAATAG GTTTGCCATT CTGGACACGG ATATTTCTAC 840 TAACGAAGAA AGCGTGGAAG GCATGATGAT AGAGGGTGCT GATATTGACA GTGCCCATAT 900 GGATGATTCT CAACTCGATG GTTCCAATAC TGGTCGAAAC TTGCAGGAAA CACACAATAC 960 AGCCAATCAA CTTAATGATC ACAAAAAACC ACCACAAATT GTTGTAAATA TCAGAAACTT 1020 GAATGATCTG TTTGAGCTTA TAAAAGAAAA GACAAGCTTA GATAACGTTG TCGTTAAAGC 1080 TAATCAAGGG GAAACGGTCA GAATATTTCC AAAAGACAGC GACACTTACA GGAAAATAGT 1140 GAGCCATATG GATGACATTG GTATTCAGTT TCACACTTAC CAAATGCTGA CAGATAAGCC 1200 ACACAGAATT GTAGTAAGGG ACTTACATCA CAGTACATCA AACAAAGACA TAACCGCCGA 1260 TCTGAAATGT TTAGGCTACG AAGTGCTCCA CATTCACAAC CCTAGTTCTA GGACTAATAA 1320 GGACGAAAAA CTAAACATCT TTTTCATTAA TATAAAGCCC TGTGCAAAAA TTAATGAAAT 1380 TTACCATGTC AAGACCCTTT GCCGACAGAA AATACGGATT GAAAGGATGA GAAAGTCTTC 1440 TGAAATTGCG CAATGTCGTC GTTGTCAGGA GTACGGCCAT ACAGCTAAAT ACTGCCGCAG 1500 ACACCCAAAT TGTGCCAGAT GTGGCGAAAA TCACCAAACC ATGCAATGCA CCCGACCGAT 1560 AGACGCACTG CCCACATGTT ACCATTGCTC TGAAAATCAT ACGGCTAGCT TCAAAGGTTG 1620 CCTAAAGTAT CAGGAGCTTC TTCGCAGATC TATGGGGCCT GCAAGAAATG GAAACAGGTT 1680 AAATAAGAAC ACCCATCATC ACTCTCCTAG AGACCGGCAA GAGCTTCCTG CCTTGCAGCC 1740 CAATTACCGC AAGAACAACA CCCAATCAAC AGTACAGCAG TTATCGACAC AACCACAGCT 1800 TAATTTTGCC CAAAGCCAAC CATCTATAGG CACTGGTGGA AACAGAGCAG TATCCTATGC 1860 TACAGTAGTA AAAGGATACC CAAAAATAGC GCCCTCCAAG GACGGACCAG CCCAGCGTCA 1920 ACGCTTAAAC AACCCACAAA CGAAACAAAT ACTGCAGCAA CACCGATCGA ATACACAGCA 1980 GAATAACTCA TCTGATGTGC AAGTATTCTT ACAACAGCAA CAACAACAGT TTCTGGAATG 2040 GCAACAGCAG ATCCAACAAC AACAACACCA ACAGTTTCTT ATGTGGTTGC AACAGCAGCA 2100 GCAAGAACAA CTACAGTATA AAAGCCAAAC CAATCAACGA CTGGAAAAGC TTGAAAAAAT 2160 GGTTCTTGAA CTAGCGAATA TGTTAAAAGA ATGGGCTGGG AGTGAACTTA AGCCCCAGCT 2220 CTTTAACAAC GTCTCAGCCT CCCTATGAAT CCACTAAAGA TTCTTATTTG GAATGCTAAC 2280 GGCATTTCAA GAAAAGCCAA AGATGTTGAG CTGTTCGCGC ACAACAAAAA GATAGACATC 2340 CTTCTTGTGA CTGAACTAAG ACTCAAAAGA GGGGAAACTG TAAAGATATA TGGATATGCG 2400 TACTATCCAG CATATAGGCC ATCCCTTAAT AATAATAGTG TTGGCGGAGT AGCGGTGTTC 2460 GTGAGGACAA CTCTTCGCCA CTTTCCACAA AGGGTCATTG AGACACGCCA CATACAATTG 2520 TCATCAGTAA AAGTAGCCAC AGGACTCGGG GACCTGCAGT TTAGCGCTAT TTACTGCTCC 2580 CCAAGTACTA GAATCGAGGA AAGACATTTT ACTGACATAA TACGCGCCTG CGGCCAAAGG 2640 TACTTGGTAG GTGGCGACTG GAATGCCCGC CACTGGCTTT GGGGCGACAC TTGCAATTCA 2700 CCTCGCGGGC GGGAACTAGC AGAAGCCTTG TCCGTGACTG GAGCTAAGAT CCTCGCAACT 2760 GGCTCTCCGA CAAGGTATCC GTATGTGCCC AGCCATACGC CCTCATGCAT AGATTTCGCA 2820 GTGTATCATG GTATACCAGA CCACCTAGCA ACTATAACAC AAAGCTGGGA CTTGGATTCT 2880 GATCACTTGC CTCTTATCAT TAGCATTGAG ACAGACAGTA TTCATGTCAA TCCAAGTCCC 2940 AGGCTAGTCA CCAAACACAC TGACCTCCTT GCCTTTAGCC GACAATTGGA GAGCCTTATT 3000 TCGCTGAACA CCACGCTTAA TTCTGGTGAG GAAATTGAAA TGGCTGTTGA CAACCTAACT 3060 GAAAGCATAC ATAGGGCCGC GGCTGTCTCT ACTTCTCCCG TCCCTCGGAT AGGCACCACA 3120 TATGGGATAG TCTTGACAAG AGAGGCTAGA GAGCTTCTGA CACAGAAAAG AAGACTCCGA 3180 AGGCGAGCAA TCCGATCTCA AGACCCCTGG GACCGACTTT TATGGAACCG TGCTGCAAAG 3240 CAACTACGAA ACGTCCTCAG AGAACTTCGA AGCAACTTTT TTGAGCAGAA ACTAGCTAGT 3300 ATGGACTACA CAGTGGATGC TGGATACTCG CTATGGAAAT GCACCAAGTC CCTTAAAAGA 3360 CAGCCGTTTA GACAGGTTCC TATAAGGTGT CCGGGAGGCG AACTTGCTAA AAATGAAGAG 3420 GAGCAGGCTA ATTGTTTTGC AAATCATCTG GAGACAAGGT TCACCCACTT CCAATTCGCT 3480 ACAACGGAGC AGTATCAAGA GACGCTTGAT AGCCTAGAGA CACCTCTGCA AATGTCACTA 3540 CCCATTAAGC CCATCAGGGT TGAGGAAATT GTCGAAGCTA TCAAATCTCT TCCGTTAAAG 3600 AAGTCTCCTG GCATCGACAA CGTTTGCAAT GCCACACTAA AAGCACTACC TGTTCGAGCA 3660 ATTCTCTACT TGGCGCTGAT ATATAATGCC ATACTCAGGG TGCAGTTTTT CCCAAAGCAG 3720 TGGAAAATGG CAGCAATCCT AATGATACAT AAGCCTGGTA AACCTGAAGA GAGCCCTGAA 3780 TCGTACCGAC CCATAAGTCT TTTATCTTCG CTATCCAAGC TATGGGAACG ACTGATTGCC 3840 AACAGATTAA ATGACATTAT GACCGAGCGT CGTATCCTGC CGGATCATCA GTTTGGCTTT 3900 CGTCAGGGAC ACAGTACTGT GGAGCAGGTA CACAGACTGA CAAAACATAT CCTTCAGGCC 3960 TTTGATGATA AGGAATACTG CAATGCTGTG TTCATTGACA TGCAACAGGC ATTCGATAGG 4020 GTCTGGCATG ACGGCCTTAT CAGCAAAGTT AAAAAGTTAT TCCCAGCACC ATACTATGGA 4080 GTCCTAAAAT CATACTTGGA AGATCGGAGA TTCATGGTCA GGGTCAGAAA CTCCTACTCG 4140 ATTCCCCGCG TTATGAGAGC TGGAGTTCCG CAGGGCAGCG TACTGGGACC GTTGCTCTAC 4200 TCAGTATTTA CTGCAGATCT GCCCTGCCCA AACGCCTATC ATATGGCAGA TCCCAGGAAG 4260 GCCCTTCTTG CTACGTACGC TGACGATATT GCCCTGCTGT ACAGCTCTAA TTGTTGCAAC 4320 GAGGCAGCAA GGGGTCTCCA AGAGTACCTC ACCACTCTGG CTGCATGGTG CAAAAGATGG 4380 AATTTAAAGG TCAATCCGCA AAAGACCATC AATCCCTGCT TCACCTTGAA GACCTTAAGT 4440 CCCGTCACCG CACCCATAGA GCTGGAAGGT GTAATCCTAG ATCAACCTTC ACAGGCTAAG 4500 TACCTCGGGA TTACCCTTGA TAAACGGTTG ACTTTCGGCC CGCACCTGAA AGCTACGACT 4560 CGGAGATGTT ATCAAAGGAT GCAACAACTT CGATGGCTGT TAAACAGAAA AAGCACCATG 4620 ACACTGAGAG CCAAAAGAGC AGTCTACGTC CACTGCGTAG CCCCGATCTG GCTGTACGGA 4680 ATACAGATCT GGGGTATCGC AGCAAAATCC AACTACAACC GCATTCAGGT ATTGCAAAAT 4740 CGTGCCATGC GTGCAATTAC AGACTGCCCA TACTATGTAC GTGGCACTAC CCTTCACCGT 4800 GATCTGAATC TTCATACAGT GGAAGAGCAG ATCTCCAGGC ACACCAGCAG ATATAGTGAT 4860 AGACTAAGAC GACACCACAG TATACTTGCT AGACGCTTAC TCCCTGCTAG GCCTCTAAGG 4920 AGATTAAAAA GGAAGGGTTT CGCCAAAACA CTTGGACAAC CCTAAAGACC CCCTCGAAAT 4980 ATGAGACAAA GTTGTAAGTC CTCACATGAT TAGTGAGAGG TTTGGTTCTA TCTTTTATAT 5040 GTTAATTGCG CTGTTATGTT ACTGTTATTG CATTGTATTG ATTCATCGCT TCTAAATAAA 5100 TAATAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 5142 // ID DMU89994 standard; DNA; INV; 6411 BP. XX AC U89994; XX DR FLYBASE; FBgn0010302; Burdock. XX FT source U89994:1..6411 FT SO_feature five_prime_LTR ; SO:0000425:1..275 FT SO_feature three_prime_LTR ; SO:0000426:6136..6411 FT SO_feature CDS ; SO:0000316:564..2057 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0043782; Burdock\gag" FT /db_xref="SPTREMBL:O01350" FT /protein_id="AAB50147.1" FT /translation="MSDSDNLLDNLVSSLNKWSAHQASRQNSAEKNNKSSDNWWSKTKT FT TSEMEFEAQLKAIVESAVAGALAVQKQSFEKQLQEMNERIGKLTVNTPEVETYVDAEIR FT PGVVCSEPLDILKSLPDFDGKSETYVSWRKAAHVAFKVFKDYEGSSTFYQALGIMRNKI FT KGPANTVLASFNTPLHFKAMISRLDFTYSDKRPIYLIEQELSTLRQGDMTLTEFYDEVE FT KKLTLLTNKTIMTFDSALAMSLNEKYRTDALRVFVTGAKKSLSDILFAKGPKDLPTALA FT LAQEVESNHERYQFALIYSKNIGDRGQKIEQRHSDKDRNSIMPMQTKNPYFSKRQVHTY FT DNQERQDPVQLTNPDVSMRSRRTGNFGQTPFPTQGNIWPSQQQNSWPSQQQYSWPSQQQ FT NSFRTQNQFASQPQQQNTSQAQGHFGYAQASKRPTSGSARFTGPKQQRINYLPHEKGQC FT EEDTDGYQKEAEAEVDDYEDELVNYDHVHFLATNPCYRT" FT SO_feature CDS ; SO:0000316:<1994..5119 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0043781; Burdock\pol" FT /db_xref="SPTREMBL:O01351" FT /protein_id="AAB50148.1" FT /translation="GRTSELRSCSFFSHKSLLPYIEREIAGRTIKLLIDTGASKNYIQP FT LPELKNIMPVQNKFTVKSLHGCNTVKQKCFIKLFNTSVQFFILPSLSSFDAIIGLDLLK FT QGNATLDFKNKTLNINNEVESIQFLRCDSVNFANIENIVVPNQISNKFHTMLRNRLAVF FT AEPEEALPYNTNIVATIRTEDDQPIYSKLYPYPMGVSDFVNKETHALLKDGIIRPSSSP FT YNNPVWVVDKKGTDEEGNTKKRLVIDFRKLNLKTIDDKYPIPNVVWILSNLGKARFFTT FT LDLKSAFHQILLAEKDRAKTAFSVGNGKYEFCRLPFGLKNAPSIFQRAIDDVVRDRIGK FT SCYVYVDDVIIFSNGIEDHVNDVAWVLDRLSGANMRVSKEKSFFFKESVEYLGFMVSSG FT GITTSPSKVEAIQKYNQPTNLFSVRSFLGLASYYRCFIKDFASIARPLTDILKGENGKV FT SASQSKKIPISFDERQCSAFEKLKNVLVSENVMLLYPDYRKAFDLTTDASAFGLGAVLS FT QDGKPVTMISRTLQDRELNFATNERELLAIVWALKSLRNYLYGVKNLNIFTDHQPLTYA FT VSDRNPNAKIKRWKAFIDEHNAKIFYKPGKETYVADALSRQAIHVLEDEPQSDIATIHS FT EISLTFTIETIDKPVNCFRNQIVIDEGTADSTRTFVIFGSKTRHLIQFLDKETLIGRIR FT DVVKPDVVNAIHCELPVLAFIQNSLVNDFPATTFRHTMKMVSDIFNQTEQREIVSLEHN FT RAHRAAQENVKQILQYYFFPKMSQIAATFVSNCLVCQKAKYDRHPQKQILGRTPIPSHV FT GETLHIDIFSTGRNYFLTCIDKFSKFAIVQPIGSRTITDLEPAIMQLMNFFPHSKTIFC FT DNEPSINSESIKSLLKNRFNVDIANAPPLHSTSNGQVERFHSTLLEIARCLKLDSGMND FT TVNLILQATIEYNKTVHSVTNRRPIDIIHSTPPELANEIVEMVNEAQEKQLRRENVTRR FT DRTFEVGETVMVKQNNRLGNKLTPRYREELIEADLGTTVLIKGRVVHKDNLR" XX CC Derived from U89994 (g1905850) (Rel. 51, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6411 BP; 2219 A; 1259 C; 1204 G; 1729 T; 0 other; AGTTAACACA ATCACAAAAC ACCCGAAATA TAGTCGTAAG CCTCAAGTGC TTTTCCCATC 60 TATAGATCGA GCTTTACCTA TAAGAAACTG TAACTTGTTA AGCTTTAGAG ATAAGAACTC 120 TTGCTATACT TAAGTCAGTC GATTTTGGAA GATTAGAAGC GTCGGTCATC GCCACGTACT 180 TACTATTCGT CTCATTAAGT GCAGACCGCG CAAGCCTATT GTAATTAATA AACTTACGCT 240 AATAAATATA TGGAAAATCT ACTAAAATGA TAATTGGCGC CCAAACGGAT ATAAAAACCT 300 ACGATAACTG AATAATTATA AATAAATAAC AAAAGGAGGA TCCGGAGACA AAACCAGCGG 360 CTTTGGCTAA TTAACTCTAA CCTAAGAAAT AAAAATTTGC TGATTACATA AAATATAATA 420 TTAATTACTA AGACCATCTA CCTTAAAATT GTTTGTTAAT CACTATTATT ATATTGTAAG 480 TATAACGCTT ATTGAACGAA TTAAAAATAT TATTATTATT ATTATATTAT AACCTATGCA 540 AAGAGTATTG ATAATAAAAA TACATGAGTG ACAGTGATAA CCTTTTAGAC AACCTAGTGT 600 CAAGCTTAAA TAAATGGTCA GCGCACCAGG CAAGTAGGCA AAACAGTGCA GAAAAAAATA 660 ATAAGTCATC AGATAATTGG TGGTCAAAAA CAAAGACAAC TAGCGAAATG GAATTTGAAG 720 CTCAGTTAAA AGCGATCGTA GAGAGTGCTG TTGCCGGTGC GCTCGCAGTC CAAAAACAAT 780 CATTTGAAAA GCAATTGCAG GAGATGAATG AGCGAATCGG GAAATTAACA GTGAACACCC 840 CAGAGGTGGA AACTTATGTA GATGCTGAAA TTAGACCAGG TGTTGTCTGT AGCGAGCCTC 900 TAGATATACT TAAATCTCTG CCAGATTTTG ATGGCAAAAG TGAAACATAT GTGTCGTGGA 960 GAAAAGCGGC TCATGTCGCT TTTAAAGTTT TCAAAGATTA CGAGGGAAGT TCAACATTTT 1020 ACCAAGCTCT TGGTATTATG CGAAATAAAA TAAAAGGTCC AGCGAATACA GTATTGGCTT 1080 CTTTTAATAC TCCGTTACAT TTCAAAGCAA TGATCAGCCG TCTTGATTTC ACATATTCTG 1140 ACAAAAGGCC GATCTATCTA ATCGAACAAG AGCTATCAAC TTTGCGACAG GGAGACATGA 1200 CTCTTACTGA ATTCTACGAT GAAGTCGAGA AAAAACTGAC CCTACTTACC AACAAGACAA 1260 TAATGACATT TGATAGTGCC TTGGCGATGT CACTGAATGA AAAGTACAGG ACGGACGCGT 1320 TACGTGTATT TGTAACCGGA GCTAAGAAAT CGTTGAGCGA CATTCTTTTT GCAAAAGGTC 1380 CAAAAGATTT ACCAACTGCT CTCGCTTTAG CGCAAGAGGT CGAGTCGAAC CATGAGCGTT 1440 ACCAATTCGC CCTTATTTAT TCTAAAAATA TTGGAGACAG GGGTCAGAAA ATCGAACAAA 1500 GGCACAGCGA TAAGGATAGA AACTCAATCA TGCCCATGCA AACTAAAAAC CCATATTTTA 1560 GCAAGCGTCA GGTGCATACT TATGATAACC AGGAAAGACA AGATCCAGTC CAGTTAACAA 1620 ATCCTGATGT ATCCATGCGA TCTAGAAGAA CTGGAAATTT TGGACAAACT CCATTTCCGA 1680 CTCAGGGAAA TATTTGGCCA TCCCAACAGC AAAATTCTTG GCCATCTCAA CAACAATATT 1740 CTTGGCCATC CCAACAACAA AATTCATTTC GAACACAAAA TCAATTCGCA TCGCAACCCC 1800 AACAGCAAAA CACAAGTCAG GCTCAGGGAC ATTTTGGGTA TGCGCAAGCA TCAAAAAGAC 1860 CAACGAGTGG CAGTGCAAGG TTTACAGGGC CAAAACAGCA GAGGATCAAC TACTTACCTC 1920 ATGAGAAAGG TCAATGTGAG GAAGATACAG ACGGTTATCA AAAGGAGGCA GAAGCGGAGG 1980 TTGATGATTA TGAGGACGAA CTAGTGAATT ACGATCATGT TCATTTTTTA GCCACAAATC 2040 CCTGCTACCG TACATAGAAA GAGAGATAGC AGGGAGAACC ATAAAACTTT TGATTGACAC 2100 CGGGGCTTCG AAAAATTACA TACAGCCCCT CCCTGAATTA AAAAACATAA TGCCGGTACA 2160 AAATAAATTC ACGGTAAAAT CGCTTCATGG TTGCAACACC GTCAAACAGA AATGCTTTAT 2220 TAAGCTATTT AACACATCTG TTCAATTCTT TATTCTTCCA AGTCTCTCTA GTTTTGACGC 2280 AATAATAGGA CTTGACCTTT TGAAACAGGG AAATGCAACG TTAGATTTTA AGAACAAAAC 2340 GTTGAATATC AACAATGAAG TGGAATCTAT TCAGTTTTTG AGATGTGACA GCGTAAATTT 2400 CGCCAACATA GAGAATATTG TGGTTCCAAA TCAGATATCT AATAAATTCC ATACAATGCT 2460 TCGAAACCGA TTGGCCGTCT TTGCGGAACC GGAAGAAGCA CTGCCGTATA ATACCAACAT 2520 TGTTGCCACA ATACGTACTG AGGACGACCA ACCCATTTAC TCAAAACTCT ATCCGTACCC 2580 CATGGGCGTA TCGGATTTTG TGAATAAGGA GACACATGCT TTGTTAAAGG ACGGAATTAT 2640 CAGGCCCTCG TCGTCACCTT ACAACAATCC GGTTTGGGTA GTCGATAAAA AAGGTACAGA 2700 TGAAGAGGGA AATACTAAGA AAAGGTTGGT TATAGATTTT AGAAAACTAA ATTTAAAAAC 2760 AATCGACGAC AAGTACCCTA TACCAAACGT AGTATGGATC TTGTCAAATT TGGGAAAAGC 2820 CAGATTCTTT ACAACCCTTG ACCTTAAATC GGCGTTTCAC CAAATTCTGC TCGCAGAAAA 2880 GGATAGAGCG AAAACTGCCT TTTCAGTAGG AAATGGAAAA TACGAGTTTT GCCGTTTGCC 2940 GTTTGGCTTG AAAAATGCCC CAAGTATTTT TCAACGTGCT ATTGATGATG TTGTTAGGGA 3000 CCGTATAGGA AAGTCATGTT ACGTTTACGT TGACGACGTA ATAATATTTT CAAACGGAAT 3060 TGAGGACCAC GTAAACGACG TTGCTTGGGT ACTAGACAGA CTGTCTGGGG CAAACATGAG 3120 GGTTTCTAAA GAGAAATCGT TTTTCTTCAA GGAAAGCGTC GAGTATCTCG GATTCATGGT 3180 GTCAAGTGGA GGTATCACAA CCAGTCCTAG CAAAGTAGAG GCTATTCAGA AATATAATCA 3240 ACCTACTAAT CTGTTTAGTG TTCGATCGTT TTTAGGGCTA GCAAGTTATT ACCGCTGCTT 3300 TATTAAGGAC TTCGCCTCTA TTGCTAGACC ACTCACTGAC ATTCTGAAGG GTGAAAACGG 3360 AAAGGTTTCC GCAAGCCAGT CTAAAAAGAT ACCAATTTCT TTCGATGAAA GACAATGTTC 3420 TGCTTTTGAG AAGCTTAAAA ATGTTCTTGT CTCCGAAAAT GTAATGTTAT TGTATCCCGA 3480 TTATAGAAAA GCCTTTGACT TAACAACAGA CGCTTCGGCT TTTGGCCTGG GGGCAGTCTT 3540 ATCACAGGAT GGCAAGCCTG TTACAATGAT TTCGAGAACT TTACAGGATA GAGAACTTAA 3600 TTTCGCAACA AATGAACGAG AACTTTTGGC CATCGTTTGG GCTTTAAAGT CTCTTAGGAA 3660 CTATCTATAT GGTGTCAAAA ACTTAAACAT TTTTACAGAT CACCAGCCGT TAACATACGC 3720 CGTGTCAGAT AGGAATCCAA ATGCAAAAAT CAAGAGATGG AAGGCGTTTA TAGACGAACA 3780 TAATGCTAAA ATTTTCTATA AACCTGGCAA GGAGACCTAT GTTGCCGATG CACTATCCAG 3840 GCAGGCTATT CATGTCCTAG AGGACGAACC CCAGTCAGAC ATTGCAACAA TACATAGCGA 3900 AATTTCATTG ACTTTTACAA TCGAAACTAT CGACAAGCCG GTTAACTGTT TTAGAAACCA 3960 AATTGTGATA GATGAGGGCA CCGCAGACTC AACTCGAACT TTTGTTATTT TCGGAAGCAA 4020 GACAAGGCAT CTAATACAGT TTCTAGACAA AGAGACCTTA ATCGGAAGAA TTCGTGATGT 4080 GGTTAAGCCG GATGTAGTGA ATGCGATACA CTGCGAATTA CCTGTACTAG CTTTCATTCA 4140 AAACAGTCTT GTAAATGACT TTCCAGCAAC AACCTTCCGA CACACTATGA AAATGGTCAG 4200 CGACATTTTT AATCAAACTG AGCAACGGGA AATAGTGTCT TTGGAGCACA ACAGAGCGCA 4260 TAGGGCAGCA CAGGAGAATG TAAAACAAAT TCTTCAATAC TACTTTTTCC CTAAAATGTC 4320 ACAAATAGCC GCTACCTTTG TTTCTAACTG CTTGGTTTGT CAAAAAGCCA AATACGACCG 4380 CCATCCGCAA AAGCAAATCC TCGGGAGAAC ACCTATTCCG TCACATGTAG GCGAGACATT 4440 GCATATTGAT ATATTTTCTA CGGGCAGGAA TTACTTTTTG ACATGTATTG ACAAATTTTC 4500 CAAATTCGCT ATTGTGCAAC CAATCGGCTC TCGAACGATA ACTGATTTAG AACCTGCAAT 4560 TATGCAACTA ATGAACTTTT TTCCCCATTC AAAGACAATA TTTTGTGACA ATGAACCGTC 4620 CATAAATTCC GAGTCAATCA AGTCACTTTT GAAAAATCGT TTTAATGTTG ACATAGCGAA 4680 CGCACCTCCA CTTCATAGTA CCTCAAACGG ACAGGTTGAA AGGTTTCACA GCACGCTTTT 4740 AGAAATAGCT CGATGCCTGA AACTTGACAG TGGAATGAAT GATACAGTCA ACCTTATTCT 4800 TCAGGCAACA ATAGAATACA ATAAGACGGT GCACTCAGTC ACCAATAGAA GACCGATCGA 4860 CATTATTCAT TCAACTCCTC CCGAATTGGC TAACGAGATA GTAGAAATGG TTAACGAAGC 4920 TCAGGAAAAA CAGCTAAGAA GAGAAAATGT AACAAGACGA GACAGAACCT TTGAGGTGGG 4980 AGAAACCGTC ATGGTAAAAC AAAACAATCG CTTGGGAAAT AAACTAACCC CACGGTATAG 5040 GGAAGAACTA ATCGAAGCAG ACCTCGGGAC AACGGTCCTC ATAAAAGGGA GGGTCGTTCA 5100 TAAAGATAAT CTACGCTAGG TTTAGTATTT CTTTTCCTTT TGTGACCATC GCCAAGTTAG 5160 CAAAATACAA ACGTGAAATC TGAACACTAG TAAAAGAGTT TGCAAACATT TTTCAATTAA 5220 ATATTTGTCA AATCCTTCTT ATTTAATCTT TAAACATTTT GTATTATTTC CGCTTCATCC 5280 TCTTTAGAAA ATTTTAAAGG TATGTGATGA AATGCTAGAC CCGAATGATT TGAAAACTTA 5340 AAGTCCACGC AACCACAAAT ATTTCCTGAA ACTACCATAG AAAATAAATG CATTACCAAA 5400 ACGGCATAAT AACAGTATAG CGCACTCACT CTAATTAGAT TTCAAATTCC CGATTAAAAA 5460 AAAAATAAAA CACTAATGTT ATCAATACCC TTTCCTGATT CTGTTCAACT AAAATAGGAA 5520 AATCAATACT TGCAATCAAT AAGCGTTTTA CTACATACTT TAATATCAAA ATATCTGAAT 5580 GAACTTTATT ATAAAATTAT AATTGTTATA CTTAATTATT GTCAAAACTT TAGTATTAAA 5640 ACTGTAACTA CCTCTTAAGT AGATGAGAAG AGTAGAAGAG GGAATTAAGA TCTATCAACG 5700 TAGTATCTGC TAAAGACGTA AAGATGCGGC AACTATTTCT GCGCCTGGGT ACTGAAACGA 5760 CGAACTGAAT AATATCTGCC ATCAGACGCC AACCAGAGTG CGTTCAACAC ATACGTTTTG 5820 ATGGTCAACT AGTTCAACCA ACATCAGCAT CATCGTCGTC AACAAGTCGA CGGTTACAAT 5880 AAAGATTTTT TCCAAGTTCG CTACGATCAT CTCCAGAACC TTGTTGCGAA CCCATGACAT 5940 GGAGAATCAG CAGCATTTAC GAACTTCTCG GATCATCCAG ACACGCAGAG CTGCCTTCCC 6000 TTCGATGGTT TAACGCAGTA CCAGGTTGGC AGTATGGGAA CTTAGTGCAC AACCAATGTT 6060 ACCCGTAAGA TCCGCTTTCA AATAGATTTG CCAATTGTAA AAAGTCTGTG GACAGCCTTC 6120 GTCTTAGAAG GGGAGGAGTT AACACAATCA CAAAACACCC GAAATATAGT CGTAAGCCTC 6180 AAGTGCTTTT CCCATCTATA GATCGAGCTT TACCTATAAG AAACTGTAAC TTGTTAAGCT 6240 TTAGAGATAA GAACTCTTGC TATACTTAAG TCAGTCGATT TTGGAAGATT AGAAGCGTCG 6300 GTCATCGCCA CGTACTTACT ATTCGTCTCA TTAAGTGCAG ACCGCGCAAG CCTATTGTAA 6360 TTAATAAACT TACGCTAATA AATATATGGA AAATCTACTA AAATGATAAT T 6411 // ID DMCOPIA standard; DNA; INV; 5143 BP. XX AC X02599; XX DR FLYBASE; FBgn0000349; copia. XX FT source X02599:21..5163 FT SO_feature five_prime_LTR ; SO:0000425:1..276 FT SO_feature three_prime_LTR ; SO:0000426:4867..5143 FT SO_feature polyA_signal_sequence ; SO:0000551:1990..1999 FT SO_feature polyA_signal_sequence ; SO:0000551:5063..5073 FT SO_feature primer_binding_site ; SO:0005850:277..291 FT /bound_moiety="tRNA:M-i-RB" FT SO_feature CDS ; SO:0000316:432..4661 FT /db_xref="FLYBASE:FBgn0013437; copia\GIP" FT /db_xref="SWISS-PROT:P04146" FT /protein_id="CAA26444.1" FT /translation="MDKAKRNIKPFDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEVD FT DSWKKAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLL FT SLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLS FT EENLTLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIF FT KGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNT FT SVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRND FT HEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSGMLNNVPV FT INFQAYSINAKHKNNFRLWHERFGHISDGKLLEIKRKNMFSDQSLLNNLELSCEICEPC FT LNGKQARLPFKQLKDKTHIKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTY FT LIKYKSDVFSMFQDFVAKSEAHFNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVP FT HTPQLNGVSERMIRTITEKARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKT FT PYEMWHNKKPYLKHLRVFGATVYVHIKNKQGKFDDKSFKSIFVGYEPNGFKLWDAVNEK FT FIVARDVVVDETNMVNSRAVKFETVFLKDSKESENKNFPNDSRKIIQTEFPNESKECDN FT IQFLKDSKESENKNFPNDSRKIIQTEFPNESKECDNIQFLKDSKESNKYFLNESKKRKR FT DDHLNESKGSGNPNESRESETAEHLKEIGIDNPTKNDGIEIINRRSERLKTKPQISYNE FT EDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAINTELNAHKINNTWTITKRPE FT NKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDYEETFAPVARISSFRFILS FT LVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDNVCKLNKAIYGLKQAARCW FT FEVFEQALKECEFVNSSVDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRY FT LMEKFRMTDLNEIKHFIGIRIEMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKI FT NYELLNSDEDCNTPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVL FT RYLKGTIDMKLIFKKNLAFENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLICWNTK FT RQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIANNPS FT CHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLGLLQ FT DDQSNAE" FT SO_feature CDS ; SO:0000316:join(432..1605,4555..4661) FT /db_xref="FLYBASE:; copia\GIP-RB" FT /db_xref="SWISS-PROT:P04146" FT /protein_id="CAA26445.1" FT /translation="MDKAKRNIKPFDGEKYAIWKFRIRALLAEQDVLKVVDGLMPNEVD FT DSWKKAERCAKSTIIEYLSDSFLNFATSDITARQILENLDAVYERKSLASQLALRKRLL FT SLKLSSEMSLLSHFHIFDELISELLAAGAKIEEMDKISHLLITLPSCYDGIITAIETLS FT EENLTLAFVKNRLLDQEIKIKNDHNDTSKKVMNAIVHNNNNTYKNNLFKNRVTKPKKIF FT KGNSKYKVKCHHCGREGHIKKDCFHYKRILNNKNKENEKQVQTATSHGIAFMVKEVNNT FT SVMDNCGFVLDSGASDHLINDESLYTDSVEVVPPLKIAVAKQGEFIYATKRGIVRLRND FT HEITLEDVLFCKEAAGNLMSVKRLQEAGMSIEFDKSGVTISKNGLMVVKNSENQLADIF FT TKPLPAARFVELRDKLGLLQDDQSNAE" XX CC Derived from X02599 (g7740) (Rel. 49, Last updated, Version 4). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5143 BP; 1874 A; 727 C; 971 G; 1571 T; 0 other; TGTTGGAATA TACTATTCAA CCTACAAAAA TAACGTTAAA CAACACTACT TTATATTTGA 60 TATGAATGGC CACACCTTTT ATGCCATAAA ACATATTGTA AGAGAATACC ACTCTTTTTA 120 TTCCTTCTTT CCTTCTTGTA CGTTTTTTGC TGTGAGTAGG TCGTGGTGCT GGTGTTGCAG 180 TTGAAATAAC TTAAAATATA AATCATAAAA CTCAAACATA AACTTGACTA TTTATTTATT 240 TATTAAGAAA GGAAATATAA ATTATAAATT ACAACAGGTT ATGGGCCCAG TCCATGCCTA 300 ATAAACAATT AAATTGTGAA TTAAAGATTG TGAAAATAAA TTGTGAAATA GCATTTTTTC 360 ACATTCTTGT GAAATAGCTT TTTTTTTCAC ATTCTTGTGA AATTATTTCC TTCTCAGAAT 420 TTGAGTGAAA AATGGACAAG GCTAAACGTA ATATTAAGCC GTTTGATGGC GAGAAGTACG 480 CGATTTGGAA ATTTAGAATT AGGGCTCTTT TAGCCGAGCA AGATGTGCTT AAAGTAGTTG 540 ATGGTTTAAT GCCTAACGAG GTAGATGACT CCTGGAAAAA GGCAGAGCGT TGTGCAAAAA 600 GTACAATAAT AGAGTACCTA AGCGACTCGT TTTTAAATTT CGCAACAAGC GACATTACGG 660 CGCGTCAGAT TCTTGAGAAT TTGGACGCCG TTTATGAACG AAAAAGTTTG GCGTCGCAAC 720 TGGCGCTGCG AAAACGTTTG CTTTCTCTGA AGCTATCGAG TGAGATGTCA CTATTAAGCC 780 ATTTTCATAT TTTTGACGAA CTTATAAGTG AATTGTTGGC AGCTGGTGCA AAAATAGAAG 840 AGATGGATAA AATTTCTCAT CTACTGATCA CATTGCCTTC GTGTTACGAT GGAATTATTA 900 CAGCGATAGA GACATTATCT GAAGAAAATT TGACATTGGC GTTTGTGAAA AATAGATTGC 960 TGGATCAAGA AATTAAAATT AAAAATGACC ACAACGATAC AAGCAAGAAA GTTATGAACG 1020 CGATCGTGCA CAACAATAAT AACACTTATA AAAATAATTT GTTTAAAAAT CGGGTAACTA 1080 AACCAAAGAA AATATTCAAG GGAAATTCAA AGTATAAAGT CAAGTGTCAC CACTGTGGCA 1140 GAGAAGGCCA CATTAAAAAA GATTGTTTCC ATTATAAAAG AATATTAAAT AATAAAAATA 1200 AAGAAAATGA AAAACAAGTT CAAACTGCAA CATCACACGG CATTGCGTTT ATGGTAAAAG 1260 AAGTGAATAA TACTTCAGTG ATGGACAACT GCGGGTTTGT CCTTGATTCT GGTGCTAGTG 1320 ACCATCTTAT AAATGATGAG TCGCTGTATA CCGACAGTGT GGAGGTTGTG CCTCCACTTA 1380 AGATTGCAGT GGCCAAGCAA GGCGAATTTA TTTATGCCAC TAAGCGTGGT ATTGTCCGAC 1440 TACGGAATGA CCATGAGATT ACACTGGAGG ATGTACTCTT TTGTAAGGAA GCTGCTGGTA 1500 ATTTGATGTC CGTAAAGCGT CTCCAAGAGG CAGGAATGTC GATCGAATTT GACAAAAGCG 1560 GTGTAACCAT TTCGAAAAAT GGGTTAATGG TTGTCAAAAA TTCAGGTATG TTAAACAATG 1620 TACCTGTGAT CAATTTTCAA GCATATTCTA TAAATGCTAA GCATAAAAAT AATTTTCGTT 1680 TATGGCATGA GAGGTTTGGC CATATAAGCG ATGGCAAATT ATTAGAAATA AAACGAAAGA 1740 ATATGTTTAG TGATCAAAGT CTTCTAAACA ACTTAGAGTT ATCATGTGAA ATTTGTGAAC 1800 CCTGTTTAAA TGGTAAACAG GCAAGACTTC CTTTTAAACA ATTGAAAGAT AAGACCCATA 1860 TTAAAAGACC ACTTTTTGTA GTACACTCAG ATGTCTGTGG GCCTATTACT CCAGTTACTT 1920 TAGATGATAA AAATTATTTT GTGATCTTTG TTGATCAGTT TACACATTAT TGTGTAACTT 1980 ATTTAATTAA ATATAAATCT GATGTGTTTA GCATGTTTCA AGATTTTGTA GCCAAGAGTG 2040 AAGCTCATTT TAATTTAAAG GTTGTGTACT TATACATTGA CAATGGTAGA GAATACTTGT 2100 CAAATGAGAT GAGACAATTT TGTGTTAAGA AAGGAATTTC TTATCACTTA ACAGTGCCAC 2160 ATACACCTCA GTTAAATGGT GTTTCTGAGA GAATGATAAG AACCATTACG GAAAAAGCTC 2220 GAACCATGGT TAGTGGTGCA AAGCTAGATA AAAGCTTTTG GGGCGAAGCA GTATTAACTG 2280 CTACTTATTT AATCAACAGA ATTCCTAGTA GAGCACTTGT TGATAGTTCA AAGACCCCAT 2340 ATGAGATGTG GCACAATAAG AAGCCATACT TAAAACATTT GAGAGTGTTT GGTGCAACTG 2400 TTTATGTGCA TATTAAAAAC AAACAAGGAA AGTTTGATGA TAAATCATTT AAAAGTATTT 2460 TTGTGGGCTA TGAACCCAAT GGTTTTAAGT TGTGGGATGC TGTAAATGAA AAATTTATTG 2520 TCGCAAGAGA TGTTGTTGTC GATGAAACCA ATATGGTTAA TTCTAGAGCT GTTAAATTTG 2580 AAACAGTGTT CCTGAAAGAT AGTAAGGAAA GTGAAAATAA AAATTTTCCG AATGACAGTA 2640 GGAAAATAAT ACAAACAGAA TTCCCGAATG AGAGTAAGGA ATGCGACAAC ATACAATTCC 2700 TGAAAGATAG TAAGGAAAGT GAAAATAAAA ATTTTCCGAA TGACAGTAGG AAAATAATAC 2760 AAACAGAATT CCCGAATGAG AGTAAGGAAT GCGACAACAT ACAATTCCTG AAAGATAGTA 2820 AGGAAAGTAA TAAATATTTT CTGAATGAGA GTAAGAAAAG AAAGCGAGAT GATCACCTGA 2880 ATGAAAGTAA GGGATCAGGC AACCCGAATG AGAGTAGGGA AAGTGAAACA GCAGAGCACT 2940 TAAAAGAAAT TGGAATTGAT AATCCAACTA AAAATGATGG CATAGAAATT ATTAATAGAA 3000 GAAGTGAGAG ATTAAAGACT AAGCCTCAGA TATCCTATAA TGAAGAGGAT AATAGTCTAA 3060 ATAAAGTTGT TCTAAATGCT CACACTATAT TTAACGATGT CCCAAATTCA TTTGATGAAA 3120 TTCAATATAG GGATGATAAA TCTTCTTGGG AAGAAGCCAT CAATACAGAG TTAAATGCTC 3180 ATAAAATTAA TAATACTTGG ACAATTACAA AAAGGCCTGA AAACAAAAAT ATTGTAGATA 3240 GCAGATGGGT ATTTTCTGTT AAATATAATG AACTTGGAAA TCCAATTAGA TACAAAGCTA 3300 GATTGGTTGC ACGAGGATTC ACTCAAAAAT ACCAAATAGA CTATGAAGAG ACATTTGCTC 3360 CTGTAGCTAG AATTTCAAGT TTCCGATTTA TATTGTCATT AGTAATACAG TATAACTTGA 3420 AAGTCCATCA AATGGATGTA AAAACAGCTT TCTTAAATGG CACGTTAAAA GAGGAAATTT 3480 ATATGAGACT TCCTCAAGGT ATATCGTGTA ATAGTGACAA TGTGTGTAAA TTGAATAAGG 3540 CAATTTACGG ACTCAAGCAA GCGGCTAGAT GCTGGTTTGA AGTATTTGAG CAAGCATTGA 3600 AAGAGTGTGA GTTTGTAAAC TCTTCAGTTG ATCGCTGTAT ATATATTTTA GACAAAGGTA 3660 ACATCAATGA AAACATATAT GTATTATTAT ATGTAGATGA TGTGGTTATA GCTACAGGAG 3720 ATATGACAAG AATGAATAAC TTCAAAAGGT ATTTAATGGA AAAGTTTAGG ATGACTGACC 3780 TAAATGAAAT AAAACATTTT ATTGGAATTA GGATAGAGAT GCAGGAAGAT AAAATCTATT 3840 TAAGCCAATC TGCATATGTT AAAAAAATTT TAAGTAAATT TAACATGGAA AATTGTAATG 3900 CAGTTAGTAC TCCTTTACCT AGTAAAATAA ATTATGAATT ACTTAATTCA GATGAAGACT 3960 GCAATACCCC ATGCCGTAGC CTCATAGGAT GTTTAATGTA CATAATGCTT TGTACACGCC 4020 CAGATTTAAC TACTGCAGTA AATATCTTGA GCAGATATAG TAGCAAAAAT AACTCCGAAT 4080 TATGGCAGAA CTTAAAAAGA GTTCTTAGAT ATTTGAAGGG CACTATCGAT ATGAAATTGA 4140 TTTTTAAAAA GAACTTGGCA TTTGAAAATA AAATTATTGG TTATGTGGAT TCTGATTGGG 4200 CTGGTAGTGA AATTGATAGA AAAAGTACAA CAGGGTATTT ATTCAAAATG TTTGATTTTA 4260 ATCTCATTTG TTGGAATACA AAGAGACAGA ACTCAGTAGC AGCCTCATCA ACTGAAGCTG 4320 AGTATATGGC CCTATTTGAA GCCGTGAGAG AAGCTCTATG GCTTAAATTT TTATTAACTA 4380 GTATTAACAT TAAACTAGAA AACCCCATTA AAATTTACGA AGACAATCAA GGCTGTATTA 4440 GCATAGCAAA CAACCCCTCA TGTCATAAAC GAGCTAAACA TATTGATATT AAATATCATT 4500 TTGCCAGAGA GCAAGTTCAG AATAATGTGA TTTGTCTTGA GTATATTCCT ACAGAGAATC 4560 AACTGGCTGA CATATTTACA AAACCGTTGC CTGCTGCGAG ATTTGTGGAG TTACGAGACA 4620 AATTGGGTTT GCTGCAAGAC GACCAATCGA ATGCTGAATG AAATTTTTAT ATATATTTTT 4680 CAAATTTAAA TTCCTGTAAA CATATTTTGT TACAATGATC TGATCGGGTT TTTCTGGGTT 4740 TTCCCCGTAT CCTCGCAGCA AATGCTGGAT CAGTTAACAC TTCCCAGAAT GCACACCACC 4800 CACATTTGAT AGTTACTAAT GAATATTATT GTTATGTTTT TAATTATAGA CGTTATTTTT 4860 GAGGGGGCGT GTTGGAATAT ACTATTCAAC CTACAAAAAT AACGTTAAAC AACACTACTT 4920 TATATTTGAT ATGAATGGCC ACACCTTTTA TGCCATAAAA CATATTGTAA GAGAATACCA 4980 CTCTTTTTAT TCCTTCTTTC CTTCTTGTAC GTTTTTTGCT GTGAGTAGGT CGTGGTGCTG 5040 GTGTTGCAGT TGAAATAACT TAAAATATAA ATCATAAAAC TCAAACATAA ACTTGACTAT 5100 TTATTTATTA TTAAGAAAGG AAAATAAATT ATAAATTACA ACA 5143 // ID DMW1DOC standard; DNA; INV; 4725 BP. XX AC X17551; XX DR FLYBASE; FBgn0000481; Doc. XX FT source X17551:1..4725 FT SO_feature CDS ; SO:0000316:213..1910 FT /db_xref="FLYBASE:FBgn0024789; Doc\gag" FT /db_xref="SPTREMBL:Q04134" FT /protein_id="CAA35586.1" FT /translation="MNQNDIRSQRQCEQDERRLSLQRNNAYFSFVSPQIGDRAPSPSTN FT SKLLPSANDRPRSCSPSLPASAHKSWSEETASPTPLLSQRQTTVPGNCNTAITSAVTSL FT ATATTSTSSAAQLIIAVPAVNNSAALTVCNNNNARKEESKQKQKSISTVQTGMDRYIQI FT KRKLSPQNNKAGNQPKINRTNNGNENSAVNNSNRYAILADSATEQPNEKTVGEPKKTRP FT PPIFIREQSTNALVNKLVALIGDSKFHIIPLKKGNIHEIKLQIQTEADHRIVTKYLNDA FT GKNYYTYQLKSCKGLQVVLKGIEATVTPAEIIEALKAKNFSAKTAINILNKDKVPQPLF FT KIELEPELQALKKNEVHPIYNLQYLLHRRITVEEPHKRINPVQCTNCQEYGHTKAYCTL FT KSVCVVCSEPHTTANCPKNKDDKSVKKCSNCGEKHTANYRGCVVYKELKSRLNKRIATA FT HTYNKVNFYSPQPIFQPPLTVPSTTPTISFASALKSGLEVPAPPTRTAHSEHTPTNIQQ FT TQQSGIEAMMLSLQQSMKDFMTFMQNTLQELMKNQNILIQLLVSSKSP" FT SO_feature CDS ; SO:0000316:1910..4576 FT /db_xref="FLYBASE:FBgn0024790; Doc\RTase" FT /db_xref="SPTREMBL:Q04135" FT /protein_id="CAA35587.1" FT /translation="MASLRISLWNANGVSRHTQELTQFIYEKNIDVMLLSETHLTNKNN FT FHIPGYLFYGTNHPDGKAHGGTGILIRNRIKHHHLNNFDKNYLQSTSIALQLNNGSTTL FT AAVYCPPRFPISEDQFMEFFNTLGDRFIAAGDYNAKHTHWGSRLVSPKGKQLYNALTKP FT ENKLDYVSPGKPTYWPADPRKIPDLIDFAITKHVPRNMVTAEALADLSSDHSPVFLNML FT TRPHIVDPPYRLTNFRTNWPRYQKYVCSHIELTTALSTKEDIDKSTETLENILVSAAKA FT STPPVTYAKPNYIKTNREIERLVLDKRRLRRDWQSNRSPITKHMLKIATRRLTNALKQE FT EKNSQRSYIEQLSPTSTKYPLWRAHRNLKTPIAPIMPLRSPSGTWFRSDEERASAFADH FT LQNVFRPNPSTNTFILPPLIAANLDPQEPFEFRPCELAKVIKEQLNPRKSPGYDLITPR FT MLIELPKCAILHICLLFNAIAKLGYFPQKWKKSTIVMIPKPGKDKTQPSSYRPISLLTC FT LSKLFEKMLLLRISPHLRINNTLPTHQFGFREKHGTIEQVNRITSEIRTAFEHREYCTA FT IFLDVAQAFDRVWLDGLLFKIIKLLPQNTHKLLKSYLYNRVFAIRCDTSTSRDCAIEAG FT VPQGSVLGPILYTLYTADFPIDYNLTTSTFADDTAILSRSKCPIKATALLSRHLTSVER FT WLADWRISINVQKCKQVTFTLNKQTCPPLVLNNICIPQADEVTYLGVHLDRRLTWRKHI FT EAKSKHLKLKARNLHWLINARSPLSLEFKALLYNSVLKPIWTYGSELWGNASRSNIDII FT QRAQSRILRIITGAPWYLRNENIHRDLKIKLVIEVIAEKKTKYNEKLTTHTNPLARKLI FT RVCSQSRLHRNDLPAQQ" XX CC Derived from X17551 (g8821) (Rel. 29, Last updated, Version 2). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4725 BP; 1647 A; 1192 C; 822 G; 1064 T; 0 other; GACATTCGGC ATTCCACAGT CTTCGGGTGG AGACGTGTTT CTTTCAAGCT ACGAATAGCA 60 AGTTCTAAAA ACTACAACAG TATAGTGAAA GTTAAACACA AAGTGTAAAG TGCAGTTTGC 120 ACAACTAACA ATTATTGACT ATAGTAATTA TTTACTAAAA TAAATAATTA TTCCATATTG 180 TTCTGGTAAT TGTTATATGT GGACTTAGAA CAATGAATCA AAACGACATA CGTTCTCAGC 240 GACAATGTGA ACAAGACGAG CGCCGGCTCT CTTTACAACG CAACAATGCA TACTTTTCTT 300 TCGTCTCACC GCAAATCGGT GATCGAGCAC CCTCACCTTC AACTAACTCG AAACTTTTGC 360 CCTCAGCGAA CGACAGACCG CGTTCTTGCT CTCCCTCTCT GCCTGCTTCG GCTCACAAGT 420 CGTGGAGCGA AGAGACCGCC TCTCCTACCC CGCTCCTCTC GCAGCGCCAA ACGACCGTCC 480 CGGGTAACTG TAACACTGCA ATAACGAGTG CAGTGACCTC ACTGGCAACT GCCACAACAT 540 CAACTTCGTC AGCGGCCCAA CTAATTATCG CTGTGCCAGC TGTAAATAAT TCAGCAGCAC 600 TGACCGTTTG CAACAACAAT AATGCACGTA AAGAAGAATC AAAACAAAAG CAGAAGTCGA 660 TTTCGACTGT GCAGACTGGC ATGGATCGCT ACATCCAAAT CAAGAGAAAG CTCAGCCCTC 720 AAAACAATAA GGCAGGTAAT CAACCCAAAA TCAATCGAAC CAACAACGGC AATGAAAACT 780 CTGCAGTAAA TAATTCAAAC CGATATGCTA TCTTGGCTGA TTCTGCGACC GAACAACCCA 840 ACGAAAAAAC GGTAGGGGAA CCAAAAAAGA CCAGGCCTCC ACCAATTTTC ATACGAGAAC 900 AAAGTACAAA TGCACTTGTA AATAAACTCG TTGCTTTGAT TGGTGACAGC AAGTTCCACA 960 TTATCCCACT TAAAAAAGGA AATATTCATG AAATAAAACT ACAGATCCAA ACAGAAGCAG 1020 ACCACCGTAT AGTGACTAAA TACCTAAATG ATGCTGGTAA AAACTACTAC ACATACCAAT 1080 TAAAAAGTTG CAAAGGGCTA CAGGTAGTAC TTAAGGGCAT TGAAGCAACA GTGACACCAG 1140 CTGAGATAAT TGAGGCTCTG AAGGCCAAAA ACTTTTCTGC AAAGACAGCT ATTAATATTT 1200 TAAACAAAGA CAAAGTTCCG CAGCCACTAT TCAAAATAGA ACTCGAACCA GAGCTCCAGG 1260 CACTAAAGAA AAACGAAGTG CACCCAATAT ACAATTTACA GTACTTGCTA CATCGGAGGA 1320 TCACCGTGGA GGAGCCGCAC AAACGTATCA ATCCAGTTCA ATGTACTAAT TGCCAAGAAT 1380 ACGGCCACAC CAAGGCATAC TGCACCCTTA AGTCCGTATG TGTTGTCTGT AGCGAACCTC 1440 ATACTACCGC AAACTGCCCC AAAAACAAGG ACGATAAGTC TGTGAAGAAA TGCAGTAACT 1500 GCGGGGAAAA ACATACTGCA AACTACAGAG GCTGTGTGGT GTACAAAGAA TTGAAGAGCC 1560 GCCTAAACAA ACGTATTGCC ACAGCACATA CATACAACAA AGTCAATTTC TACTCTCCGC 1620 AACCGATTTT TCAACCACCC CTAACTGTCC CAAGCACTAC TCCAACAATT TCTTTCGCTA 1680 GCGCCCTAAA ATCCGGACTA GAAGTGCCCG CCCCACCGAC AAGAACTGCT CATTCCGAAC 1740 ATACACCGAC AAACATCCAA CAAACACAAC AAAGTGGCAT CGAAGCTATG ATGCTATCCC 1800 TACAGCAAAG CATGAAAGAC TTCATGACGT TCATGCAAAA TACTTTGCAA GAGCTCATGA 1860 AAAACCAAAA TATCCTGATT CAACTTCTTG TATCTTCAAA ATCCCCATAA TGGCTTCCCT 1920 ACGGATATCT CTGTGGAACG CAAATGGCGT TTCACGGCAT ACACAAGAGC TCACACAGTT 1980 CATTTACGAA AAAAACATCG ACGTAATGCT ACTATCAGAA ACGCACCTCA CAAATAAAAA 2040 CAATTTTCAT ATACCAGGAT ACTTGTTCTA TGGTACAAAT CATCCAGATG GTAAAGCTCA 2100 TGGAGGCACT GGAATACTCA TCAGAAATCG CATAAAACAC CACCACTTAA ACAATTTTGA 2160 CAAAAACTAC TTACAATCTA CGTCCATAGC CTTACAACTC AACAATGGTT CAACGACTCT 2220 AGCCGCAGTC TACTGCCCAC CGCGCTTTCC AATCTCTGAG GATCAATTCA TGGAATTCTT 2280 TAACACACTA GGTGACAGGT TCATCGCAGC GGGTGACTAT AACGCCAAGC ACACCCATTG 2340 GGGATCTCGA CTTGTGTCGC CAAAGGGTAA GCAATTGTAC AATGCGCTTA CGAAGCCAGA 2400 AAACAAGCTA GACTATGTAT CCCCGGGTAA GCCTACATAC TGGCCAGCAG ACCCAAGAAA 2460 AATCCCAGAC CTGATCGATT TTGCAATTAC TAAACATGTC CCCCGCAACA TGGTCACCGC 2520 CGAAGCACTA GCAGATTTAT CATCAGATCA CTCACCTGTT TTTCTAAATA TGCTAACTCG 2580 CCCCCACATC GTCGACCCAC CGTATAGACT CACAAATTTT AGAACAAACT GGCCAAGGTA 2640 TCAAAAGTAT GTCTGTTCAC ACATAGAACT AACGACGGCA TTATCTACAA AGGAGGATAT 2700 AGACAAGTCA ACGGAAACTC TTGAAAACAT TTTAGTTTCG GCTGCAAAGG CTTCAACCCC 2760 GCCAGTGACG TATGCAAAAC CAAACTACAT CAAAACTAAT CGCGAAATCG AGCGGCTGGT 2820 ATTAGATAAA CGACGCCTAC GAAGGGATTG GCAGTCTAAT AGATCACCAA TTACTAAGCA 2880 CATGCTTAAG ATAGCCACAC GCAGGCTTAC CAATGCTCTC AAACAAGAGG AAAAAAACAG 2940 CCAACGTTCA TATATCGAGC AACTCTCTCC CACCAGCACT AAGTACCCTC TTTGGAGAGC 3000 TCACAGAAAC CTAAAGACTC CAATAGCGCC AATTATGCCA CTCCGAAGTC CCTCTGGCAC 3060 CTGGTTTCGA AGTGATGAAG AAAGAGCCAG TGCTTTCGCT GACCATTTAC AAAATGTATT 3120 CCGACCAAAT CCCTCTACCA ACACATTTAT TCTCCCTCCT TTAATAGCAG CCAATCTAGA 3180 TCCTCAAGAA CCCTTTGAAT TCCGACCATG TGAACTAGCA AAGGTTATCA AAGAGCAACT 3240 GAACCCAAGA AAATCGCCTG GCTACGACCT AATAACTCCA AGAATGCTCA TTGAACTCCC 3300 AAAGTGTGCT ATTCTTCACA TCTGCCTGTT GTTCAACGCA ATCGCCAAGC TTGGATACTT 3360 CCCTCAAAAA TGGAAAAAGT CGACCATAGT AATGATTCCA AAGCCAGGAA AAGATAAAAC 3420 GCAGCCATCA TCATATAGAC CGATAAGCTT ACTAACATGT CTTTCAAAGC TGTTTGAAAA 3480 AATGCTACTC CTTCGGATTA GCCCTCATCT TAGAATAAAC AACACACTTC CAACACATCA 3540 ATTTGGCTTT AGAGAAAAAC ATGGAACCAT CGAACAGGTC AACCGAATCA CGTCAGAAAT 3600 TCGTACTGCT TTTGAACATC GAGAATACTG CACAGCCATT TTTCTAGACG TCGCGCAGGC 3660 ATTTGACAGA GTGTGGCTCG ATGGACTTTT GTTTAAAATA ATCAAGCTGT TGCCCCAAAA 3720 CACACATAAG CTACTGAAGT CATACCTATA TAACAGAGTG TTTGCAATAA GATGCGATAC 3780 AAGCACTTCA CGCGATTGCG CAATCGAAGC TGGAGTGCCG CAAGGCAGTG TACTGGGTCC 3840 AATCTTATAC ACCCTGTATA CGGCGGATTT CCCCATAGAC TACAATCTAA CAACCTCCAC 3900 GTTCGCTGAT GATACCGCGA TACTCAGTCG CTCGAAATGC CCAATAAAAG CCACGGCACT 3960 CCTATCCCGA CACTTAACAT CTGTAGAACG ATGGCTTGCC GACTGGAGAA TTTCAATAAA 4020 TGTTCAAAAA TGCAAGCAGG TTACCTTTAC CTTAAACAAA CAAACATGCC CACCACTGGT 4080 CTTGAATAAC ATATGCATTC CACAAGCCGA CGAGGTAACA TATCTGGGAG TTCATCTGGA 4140 CAGGCGGCTC ACTTGGCGCA AACATATAGA AGCCAAATCG AAACATCTTA AACTTAAAGC 4200 AAGGAACCTC CACTGGCTCA TAAATGCTCG CTCTCCACTT AGTCTGGAGT TCAAAGCTCT 4260 TCTATACAAC TCCGTCTTAA AACCTATCTG GACTTATGGC TCCGAGCTGT GGGGCAACGC 4320 ATCCAGAAGT AACATAGACA TTATTCAGCG AGCACAGTCA AGAATTCTGA GAATTATCAC 4380 TGGAGCGCCG TGGTACCTTC GAAACGAAAA CATACACAGA GACCTAAAAA TCAAATTAGT 4440 AATCGAAGTA ATAGCTGAGA AAAAAACGAA GTATAACGAA AAGCTGACCA CCCATACAAA 4500 TCCCCTCGCA AGAAAACTAA TCCGAGTATG CAGTCAAAGC CGGCTGCACC GCAACGACCT 4560 CCCAGCCCAG CAATAAACTT ATTAGGGCAT TAATGAAAAA AAAAAACTAT CACTAAGTGA 4620 AAGTTAATTA AGTTAGATTA AGATTTGAAC ACTTATTGTT AGTCTCTTAA CACAAAGGGA 4680 AGATTCAATA AATAATAAAA ATTAAAAAAA AAAAAAAAAA AAAAA 4725 // ID F standard; DNA; INV; 4708 BP. XX AC AC005198; XX DR FLYBASE; FBgn0000652; F-element. XX SY synonym: Jiminy XX XX FT source AC005198:38639..43358 FT SO_feature CDS ; SO:0000316:192..1880 FT /db_xref="FLYBASE:; CDS1" FT SO_feature CDS ; SO:0000316:1880..4561 FT /db_xref="FLYBASE:; CDS2" FT /translation="MATLRIATWNANGVSQRKLELAQFLHEKHIDVMLLSETHLTSKY FT NFQIRDYHFYGTNHPDGKAHGGTAILIRNRMKHHFYKEFAENHLQATSINIQLDDNTL FT LTLAAVYCPPRFTVLEAQFLDFFQALGPHFIAAGDYNAKHTHWGSRLVNPKGKQLYKT FT IIKATNKLDHVSPGSPTYWPSDLNKLPDLIDFAVTKNISRSLVKAECLPDLSSDHSPV FT LIHLRRYAENVKPPTRLTSSKTNWLRYKKYISSHIELSPKLNTESDIESCTCALQSIL FT TAAALTATPKITNNTINSKKTNVQIEQLVHVKRRLRREWQSSRSPTAKQKLKVATRKL FT ANALKQEEDDDQRRYIEQLTPTGTKQKSLWRAHSTLRPPTETVLPIKNSSGGWARSDE FT DRANTFAAHLQNVFTPNQATSTFALPSYPVNRHQQHTPIVFRPKEITKIIKDNLSPKK FT SPGYDLITPEMIIQLPHSAVRYITKLFNAITKLGYFPQRWKMMKIIMIPKPGKNHTVA FT SSYRPISLLSCISKLFEKCLLIRLNQHQTYHNIIPAHQFGFRESHGTIEQVNRITTEI FT RTAFEYREYCTAVFLDVSQAFDKVWLDGLMFKIKISLPESTHKLLKSYLYDRKFAVRC FT NTATSTVHTIEAGVPQGSVLGPTLYLIYTADIPTNSRLTVSTFADDTAILSRSRSPIQ FT ATAQLALYLIDIKKWLSDWRIKVNEQKCKHVTFTLNRQDCPPLLLNSIPLPKADEVTY FT LGVHLDRRLTWRRHIEAKKTQLKLKANNLHWLINSGSPLSLDHKVLLYNSILKPIWTY FT GSQLWGNASNSNIDIIQRAQSKILRTITGAPWYVRSENIQRDLNIPSVTNAITELKEK FT YL" XX CC K. O'Hare, Personal communication to FlyBase, 1 May 2000. CC CDS2 translation from M17214; AC005198 has a 1-bp deletion. XX SQ Sequence 4708 BP; 1618 A; 1242 C; 857 G; 991 T; 0 other; AATCAATTAA TCAATTCGAT CGCCGACGTG TGAAGACGTT TTTATCGTGC TCCGCACAAA 60 ATCGGTTGTT TTGAGTGAAG TGAACGCCAA ATAAAATAAA CTAAATAAAA AATCTGAAAG 120 CGAAAGAGAC GCTCTATGCG ATGCAAGATC GCTTAAATAC ATAGTGAATT GTTATCTTAA 180 ATAATAAAAC TATGAGTCAG AATGACACTC GCGCCCAGCG TCAGCGCGAG CATGACGAAC 240 GCCGACTCTC AATTCAGCGC AACAACGCGT ACTTCTCCTA CGTCTCACCG ACAATCCCAA 300 ACGCAGACAT CGAGCGGTCA ATAACCCATA GCCCAGGAAA CCTTCTTCTA CCAACAAATC 360 AAGAAAGAGC GCGCTCCTGC TCTCCCGCTC TATTGGCTCC GACAGAAGCC CCGCTACCTC 420 CAACAACAAC AGCTGGAGAG GGACCGGCAG CCCGCTCTGC CTCGTCATCG GCTGCACCCG 480 CTCACGGTCT GACTAAGTCA GCGAAAGCAA AACCGCTAGC AATAAACGGT ACTGCTGCAC 540 TGCCAGCAAA ACAAAACGAA AACGTAAACA AAAAAGCTGG GTCGACCTGG CAGACTGGAA 600 TGGACCGCTA CATTACAATA AAGCGAAAGC TCAGCCCGGA AAATTCAGAT TTGGGAAACA 660 AGCCGAAAAA TACACGCGAT AACTCTACCT TGATCAAAAA TGTAGCCCCT GCAAATACCA 720 ACAGATTTGC CTTGCTGGTA GATACCGCTG AGGACGTGCC GCTGGGATCC GTTGATATCG 780 AACCGAAGAA AACAAAGCCT CCGCCAATAT ACATCCGCGA GAAGAGCACA AGCCGTCTTG 840 TAAATACTTT GATTGGCCTT ATTGGGAAAG ATAGCTTTCA TATAATTCCC CTCGTAAGAG 900 GTACTATCAA CGAAATCAAA CTTCAGACGA AAACGGAGGA CGACTACAGA AAAGTCACAA 960 ACTATTTTAC CGCACAAAAA ATAGGCTTCT ACACCTACCA GCTTAAAAGC AGCAAGGGCC 1020 TGCAAGTAGT CCTGAAGGGC ATTGAGTCTG ATGTTACGCC CGAAGAGATA ACTGAGGCGC 1080 TAAAGGAAAA GGGATTTTAC GCCAAAAACG TGTTCAATAT CAAAAACAGA AACAGGCAGC 1140 CCCAACCACT CTTCAAGATT GAGCTTGAAC CAGAAAACAA GCCTCCTAGA AAAAACGAGG 1200 TTCACCCAAT TTACAAACTC CAGCTCCTTT TGCACCGTAG GATCACGGTA GAAGAGCCGC 1260 ACAAACGCAA CGCTCCTGTA CAATGTACAA ACTGCCAAGA GTATGGCCAC ACGAGGTCAT 1320 ATTGTACACT TCGCCCGGTG TGCGTAGTCT GTGGAGATCT CCACGACTCC AAACAGTGTC 1380 AAATTAACAA AGAAAATGCA TGCGAGAAAA AATGTAATAA CTGCGGGGGC AATCACACAG 1440 CAAACTACAG AGGCTGTCCA ATCTACAAAG AGCTGAAAAT CCGTCTTCAC AAAAGAATGA 1500 ACACGGCGCG GGCACACCAA GGATCAGCTA CCCTGATACC ATCAGAGACA AATCCTGAAG 1560 TAATTTTCTC GAAAGCAGCT AGTTTCGCTC CCTGGCCTAC ATTCAACACT AACAAGACAA 1620 CATTTGCTAA CGTTTTAAAA TCAGGTATGA CGCCTCCAAC CCAAAACTCC CGAACTCCAC 1680 ATGAAGTGCA CACAAAATTA GACACACAAC AAAACTATCA CCCAGCTGCG CAGCAGGAAA 1740 CAAAAACTGA AGCTATGATG CAAGCCTTAC AACAGAGCAT GATGGAATTT ATGACATTTA 1800 TGAAGACCAC CATTCAAGAC ATGATGCGTA ATCAAAACCT TTTGATACAA ATGCTTGTAG 1860 CCCAACAATC AAATAAATAA TGGCTACCTT ACGCATAGCT ACGTGGAACG CCAATGGCGT 1920 CTCACAGCGC AAACTTGAGC TAGCTCAATT CCTACATGAG AAGCATATCG ACGTAATGCT 1980 TCTTTCGGAA ACTCATCTCA CAAGCAAATA CAATTTTCAA ATAAGAGACT ACCATTTCTA 2040 CGGTACAAAT CATCCCGACG GAAAAGCACA CGGTGGCACC GCCATACTCA TAAGGAACCG 2100 TATGAAGCAC CACTTTTACA AAGAATTTGC GGAAAATCAT CTTCAGGCCA CATCTATCAA 2160 CATTCAGCTG GATGACAACA CTCTCCTTAC ACTAGCGGCC GTATACTGCC CCCCCCGTTT 2220 CACAGTATTA GAAGCTCAAT TCCTGGATTT CTTCCAAGCA CTAGGGCCAC ACTTCATTGC 2280 AGCAGGCGAC TACAACGCTA AACATACTCA CTGGGGATCG CGACTTGTGA ACCCAAAAGG 2340 AAAACAGCTT TATAAGACGA TAATAAAAGC CACTAATAAA CTTGACCATG TTTCCCCCGG 2400 GAGTCCTACA TACTGGCCAT CAGACCTCAA TAAGCTGCCA GACCTGATCG ACTTCGCAGT 2460 TACGAAAAAT ATTTCCCGCA GTTTGGTTAA AGCTGAATGT CTGCCGGATC TCTCATCTGA 2520 TCACTCGCCT GTACTAATTC ACCTCCGCCG ATACGCAGAA AACGTGAAAC CACCAACCAG 2580 ATTGACCTCT AGCAAAACAA ACTGGCTCAG GTATAAAAAA TATATAAGTT CACATATTGA 2640 GCTAAGCCCA AAACTCAATA CTGAATCTGA TATAGAGAGC TGCACGTGTG CATTGCAATC 2700 CATCCTTACT GCAGCAGCTC TTACTGCAAC ACCCAAAATA ACAAATAATA CAATTAATTC 2760 AAAAAAGACC AACGTACAAA TCGAGCAACT CGTCCACGTA AAACGTCGCT TACGCAGAGA 2820 ATGGCAATCT TCCAGATCCC CAACTGCAAA ACAAAAGCTA AAAGTAGCCA CACGGAAACT 2880 GGCCAACGCT CTGAAACAAG AAGAGGACGA CGATCAGCGC CGATACATAG AGCAACTCAC 2940 ACCAACAGGC ACAAAACAAA AGTCACTGTG GCGAGCCCAC TCAACTCTTC GCCCACCGAC 3000 TGAAACCGTT TTGCCGATAA GGAATTCATC AGGTGGCTGG GCCCGTAGTG ATGAAGACAG 3060 AGCCAACACA TTTGCCGCTC ACCTACAAAA TGTGTTCACG CCAAACCAGG CTACTAGCAC 3120 ATTCGCGCTA CCGTCCTATC CCGTAAACCG CCATCAGCAA CACACCCCAA TTGTGTTTCG 3180 TCCTAAAGAA ATAACTAAAA TAATCAAAGA CAATCTCAGC CCGAAAAAAT CCCCCGGCTA 3240 CGACCTTATA ACACCGGAAA TGATCATCCA GCTGCCACAT TCTGCAGTTC GCTACATAAC 3300 CAAGCTCTTT AATGCCATCA CCAAACTTGG TTACTTTCCA CAACGATGGA AGATGATGAA 3360 GATCATAATG ATTCCAAAGC CTGGTAAGAA CCACACAGTC GCTTCATCTT ACAGACCAAT 3420 AAGTCTACTC TCATGCATTT CGAAACTATT CGAAAAATGC CTGCTGATCC GACTTAATCA 3480 ACATCTGATA TACCACAATA TAATCCCAGC CCACCAATTT GGATTTCGCG AAAGCCACGG 3540 AACCATTGAA CAGGTGAATC GTATTACAAC GGAAATAAGA ACTGCATTTG AATATCGCGA 3600 ATACTGTACA GCAGTATTTT TAGACGTATC CCAAGCATTC GACAAAGTCT GGCTCGACGG 3660 CCTAATGTTT AAAATTAAAA CATCCCTACC CGAAAGCACA CACAAACTTC TAAAGTCTTA 3720 CCTCTATGAC AGAAAGTTTG CAGTGCGGTG CAACACTGCC ACTTCCACTG TTCATACAAT 3780 TGAGGCTGGA GTCCCCCAAG GCAGCGTTCT TGGGCCAACC TTATACCTCA TCTATACAGC 3840 CGACATCCCT ACAAATAGTC GCTTAACGGT ATCCACATTT GCCGACGATA CAGCTATCCT 3900 TAGCCGTTCA AGGTCCCCTA TCCAAGCTAC AGCACAGTTG GCACTGTACC TCATCGACAT 3960 TGAGAAGTGG CTCTCTGACT GGCGAATAAA AGTAAACGAG CAAAAATGCA AGCACGTGAC 4020 GTTTACGCTA AACAGACAAG ACTGTCCTCC GCTCTTGTTG AACAGCATAC CACTCCCGAA 4080 AGCAGACGAG GTAACGTACC TAGGAGTACA CCTAGACAGA AGACTCACAT GGCGCAGGCA 4140 CATTGAAGCC AAAAAAACCC AACTTAAACT CAAAGCCAAC AACTTACACT GGCTCATCAA 4200 CTCTGGTTCT CCGCTCAGCC TAGATCACAA GGTCTTGCTC TACAATTCTA TATTGAAACC 4260 AATCTGGACC TATGGCTCAC AGTTATGGGG CAATGCCAGC AACAGCAATA TTGACATCAT 4320 TCAGCGAGCA CAATCAAAGA TTCTGAGAAC CATCACTGGG GCACCGTGGT ACGTTCGGAG 4380 TGAAAACATC CAAAGAGACT TAAATATCCC ATCAGTTACC AACGCAATCA CGGAACTTAA 4440 GGAAAAATAC CATAGCAAGC TTCACACGCA CCCCAACCAC CTAGCGCGAG GTCTAATCCA 4500 GCTCAGCAGC CGTTCCCGTC TCCGGCGAAA GGACCTACCA ACCCAGCGAA TAAATTATTA 4560 GGGCCGTTTA AACATAGAAC AGTTGGAAAA ATAATACAAC TGTTCAAAAA ATACTTGTTA 4620 TAGTTAAGAT TTTTAAACTT ATTGTTAGTT CTTATACAAG AAGATTCAAT AAATAAAAGC 4680 AAAGTAAAAT AAAAAAAAAA AAAAAAAA 4708 // ID FB standard; DNA; ; 4347 BP. XX AC X51937; AC X15469; XX DR FLYBASE; FBgn0002949; NOF. XX FT source join(X15469:94..1010,X51937:1..3430) FT SO_feature CDS ; SO:0000316:797..3874 FT /db_xref="FLYBASE:FBgn0044029; NOF\ORF" FT /db_xref="SWISS-PROT:P16320" FT /protein_id="CAA36201.1" FT /translation="IQQLDTSANLTLNSTFPDDDPEFQITEASKNGPLPILYFNLELDL FT ELWRSIAPKKDQKTEKLQPNWTDTMAKLIYKKVPLPCAFNFRKAKLSDKVDNIWLRIEG FT YCNDCSSILKGHCLVKPDEQCGIMISVSVPDTRGIPHNKKRRCTGSRRLEIGNELILKK FT AALWRKEATDNMNDDDPEPSYIPNLPTLRKLREEATNRHLGITKDRDPVSSLYLKKYEG FT ELAGCILDIGLDEFFCIYCTGTQVKTYASRIKTIRKISIDATGSVVLPIQKPNGDSSYV FT FLYQIVMEGDDSIFPVFQMLSAKHDTASIQFWLSRFISKSGHFPLEVVSDFSLALLNGI FT SLSFNECRIATYIKKCFHSLLMEERTDLPPCYIRLDIAHLIKMICRKNVFKSKLPNLKD FT FYTRCIGLATTCETKDSFAELIKSVLIVALSQSSGEDEKGDILSSYRNEKYLLARIATF FT TAPDHKETIEDNCIPEDQEEIDEDVTDFISNIKIAAEEEALNCNSVNCRPNPYFLPELM FT PPLIKLCKYFVLWTNVMKEKFCSKYDVGSSALVEAYFKDLKNTDMSIFHRPVRADKFVV FT QHIRCIEAVCKLERAAMKRKTVKTPSFIKENAPKKMCSKETKGFLEEILEESEVEYLLQ FT EENWKVKNKTIKPTEGNDAEDNDTDDENKEMDLSEQPKEKPRGKYLKKCPNVELLYNRP FT HRRKQDEILHNGGSMGPVWIGKQLLQFKNTCPFDSLVEILSTAYIDNFYYKSLLDDFYT FT DNLTIELVKKYAVEGVSSSLYCDRGLVLKSFFDEKHQIIKCDANIGSFIEKALNGVPSA FT SSHRTHIKNNHDCRNQKYIHHRLEVIDVEKVGHLDVQEVVIPFIDEFFARTDGECKICG FT GQQILERQPGPHVILDIEFAMDAFHQIHHNGLPGTTTLLQVPEEILIQEKKYILSGAIE FT YVPAMGGEIGHYIAYCRRVIGSWEVHNDMCRQWKKFSALNTKMTLHILIYTRKN" XX CC Derived from X15469 (g7962) (Rel. 36, Last updated, Version 3). CC Derived from X51937 (g8297) (Rel. 44, Last updated, Version 6). CC Takis Benos and Michael Ashburner, 25-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4347 BP; 1461 A; 775 C; 885 G; 1226 T; 0 other; TATATTCTAT TGCCCACCAT ATAAACACGT GCCACTTTCC TAGTTTTAGG ATCTGCCTAC 60 ATAACACGTG CAGACGCACA GGTGTTTCTG GGTTTATATA GACCAAAAAT TGGTTCCGAT 120 TGCCAATCTT GTAATTTACA GTTTACCAGG TAATTACATA ATTTTCAAAC CTCACTTTAT 180 GATAGGGTCC AATTTTTTAC CTGTGACAAA GTGTTAAATT TTTTAAGAAT GGGTTTTTCA 240 TGGCAGGTCA GAATCCTCTA TAAAATCTAA AACACTTGTC GGTATTTGAA AATCGCTCTC 300 CTCCTTGATT CTCATATTAG GTGTAAAAGA TAAATCCGGA ACTCATAATT AAAATATTTT 360 TTATGTGAAA AAGTTGTGCG CGATTTTAAC TACGCTTACC CAGTGCTGGA AAAGTTAAAG 420 TTGTTTTGTT TTTCAAAGAA AGTGAAAGTT GCTAAGCACG AACTTAAGAA ATCTGAGTGA 480 TTGTGTTAAA TTTATTTGAA TCCTTGTGAA TTTTGTTGAC AGTCTTTTTA AAGACTTGCA 540 AAATTTTCAT ATTATTCGGT TCTTGCTTTT ATTTTTATAC AACGCGTTTT TCCTTTAGGC 600 ATACCTTTAT ACATTTACAG TGTAAACAAC AGTGTAAAAC GTGTAAATCA GTGCAAAATA 660 GTTTTTTTTA TTTACTCCAT AAAAAATAAG TGTTACTGTC AGGATGCCGG CCAAACCGCA 720 AGTCGATGGT CACACCTTAG TGGATGCATT TTGCTGCGCG AATATTTTTA CGGAGACTGG 780 AGCTCTTAAG CCAAGAAGCG ATAAAGTTTG GATGGATATA AGCAACCAAT TGAAAGGAGC 840 GATCAGCGCG AAGACGCTTA ATTTCTACGC CAGAATCAAT AGGAATAACA TGATAACTGT 900 GGTTAAAGAA CGATGTGGAA TTCAACAGCT GGATACTAGT GCCAATTTAA CTTTAAATAG 960 CACATTTCCT GATGATGACC CGGAGTTCCA GATCACCGAA GCTTCAAAAA ATGGACCATT 1020 GCCTATTTTG TACTTTAACC TGGAGTTGGA CCTGGAATTG TGGAGATCAA TTGCCCCCAA 1080 AAAGGATCAA AAAACTGAAA AACTGCAACC TAACTGGACG GATACTATGG CAAAGTTGAT 1140 ATACAAAAAA GTTCCTCTTC CGTGTGCATT TAATTTTAGA AAAGCTAAAC TTTCCGACAA 1200 AGTGGATAAT ATTTGGCTAC GAATTGAAGG CTATTGCAAT GACTGCAGCT CAATTTTAAA 1260 GGGACATTGC CTTGTGAAAC CCGATGAACA ATGCGGCATA ATGATATCTG TTTCAGTACC 1320 GGACACACGA GGTATACCTC ATAATAAAAA ACGACGGTGC ACTGGATCGA GACGACTTGA 1380 AATTGGGAAC GAGTTGATTT TAAAAAAAGC TGCATTGTGG AGGAAGGAAG CCACCGACAA 1440 CATGAATGAT GACGACCCAG AACCGAGTTA CATACCAAAT TTACCAACCC TTCGGAAACT 1500 TCGTGAAGAG GCAACTAACA GACACCTAGG AATTACCAAG GATCGGGATC CAGTTTCATC 1560 ATTATACCTT AAAAAGTATG AGGGTGAATT GGCTGGATGC ATTCTTGACA TTGGATTGGA 1620 TGAATTTTTC TGCATATACT GCACAGGAAC CCAAGTAAAA ACATATGCAT CAAGGATAAA 1680 AACTATTAGA AAGATTTCTA TTGACGCAAC TGGAAGCGTG GTGTTACCCA TCCAAAAACC 1740 AAACGGTGAC TCTAGTTATG TTTTTCTGTA CCAAATTGTA ATGGAGGGTG ACGACAGTAT 1800 ATTTCCAGTT TTTCAGATGC TGTCGGCTAA ACATGACACA GCCAGCATAC AGTTTTGGTT 1860 AAGCAGATTT ATATCAAAGT CGGGGCATTT TCCACTGGAG GTTGTATCTG ATTTTTCCTT 1920 GGCATTGCTA AATGGAATAA GCTTAAGCTT TAATGAGTGT AGGATTGCGA CGTATATAAA 1980 AAAATGTTTC CACAGCCTTT TGATGGAGGA ACGGACGGAT CTGCCACCCT GCTATATTCG 2040 ACTTGACATC GCCCACCTAA TTAAAATGAT ATGCCGGAAG AACGTCTTCA AAAGTAAATT 2100 ACCGAACCTC AAGGATTTTT ATACTAGATG TATTGGTCTT GCAACAACGT GTGAGACAAA 2160 GGACAGTTTT GCGGAATTAA TTAAATCAGT ACTGATTGTC GCACTGAGCC AATCCTCAGG 2220 GGAAGATGAA AAAGGAGACA TTCTTTCAAG TTACAGGAAT GAAAAGTATC TGCTCGCCAG 2280 AATAGCTACA TTTACTGCCC CGGATCACAA GGAGACCATT GAGGACAACT GCATACCAGA 2340 GGACCAGGAG GAAATTGACG AGGATGTTAC GGACTTTATC TCTAATATTA AAATCGCTGC 2400 CGAAGAAGAA GCGTTAAATT GCAATTCGGT CAACTGTCGG CCAAATCCGT ATTTCCTACC 2460 TGAGCTAATG CCACCATTAA TTAAGTTGTG CAAATATTTT GTTTTATGGA CAAACGTGAT 2520 GAAGGAAAAG TTCTGTTCCA AATATGATGT CGGCTCTTCG GCTCTTGTGG AAGCCTATTT 2580 CAAGGATTTA AAAAACACGG ACATGAGCAT ATTCCACCGA CCAGTGAGAG CGGATAAATT 2640 CGTGGTGCAA CATATCCGAT GCATCGAAGC TGTTTGCAAG CTGGAACGAG CCGCGATGAA 2700 ACGCAAGACC GTTAAAACTC CCAGCTTTAT AAAAGAAAAC GCTCCTAAGA AAATGTGCAG 2760 TAAGGAAACC AAGGGATTTC TGGAGGAAAT ACTTGAAGAA AGCGAAGTGG AATACCTTTT 2820 ACAAGAAGAA AACTGGAAGG TGAAGAATAA AACAATAAAG CCCACGGAAG GAAATGATGC 2880 TGAAGACAAC GACACTGATG ATGAAAACAA GGAAATGGAT TTAAGTGAAC AGCCCAAAGA 2940 AAAACCAAGG GGAAAATATC TCAAAAAATG CCCCAATGTG GAGTTATTAT ACAATCGACC 3000 ACATCGAAGG AAACAGGACG AAATTTTGCA TAATGGTGGA TCAATGGGAC CCGTCTGGAT 3060 TGGCAAACAA TTATTGCAAT TCAAAAATAC TTGTCCGTTT GACTCTCTAG TGGAAATATT 3120 GTCGACCGCA TACATAGACA ATTTTTATTA CAAAAGCCTA TTGGATGATT TCTACACTGA 3180 CAACTTGACG ATAGAATTGG TGAAAAAGTA TGCCGTCGAG GGAGTTTCGT CCAGTCTCTA 3240 CTGCGACAGA GGTCTGGTCC TAAAAAGTTT TTTTGATGAA AAACACCAGA TTATAAAATG 3300 CGACGCAAAT ATTGGGTCTT TTATTGAAAA AGCGCTGAAT GGAGTACCCA GTGCGTCAAG 3360 TCATCGGACC CATATAAAAA ACAACCATGA TTGCAGGAAC CAAAAATATA TCCACCATCG 3420 GCTGGAGGTT ATAGATGTCG AAAAAGTTGG CCACCTCGAC GTCCAGGAGG TAGTGATCCC 3480 CTTTATTGAT GAGTTTTTTG CAAGAACTGA TGGAGAATGT AAAATATGCG GTGGACAACA 3540 GATCCTTGAA AGGCAGCCAG GACCGCATGT CATACTTGAT ATAGAATTTG CAATGGATGC 3600 TTTTCATCAA ATTCATCATA ACGGTTTACC AGGAACGACC ACTTTACTTC AAGTGCCGGA 3660 GGAAATTTTA ATACAGGAAA AGAAATATAT TTTAAGTGGT GCCATCGAAT ATGTTCCTGC 3720 GATGGGAGGG GAAATTGGAC ATTACATTGC ATATTGCCGC AGAGTCATTG GATCTTGGGA 3780 AGTGCACAAC GATATGTGCA GGCAATGGAA AAAGTTCTCA GCTCTAAATA CCAAAATGAC 3840 ACTCCACATT TTGATATACA CCCGGAAAAA TTAATGTTTA TTTTTAAGCC TTGTTTAAAA 3900 GTGTAAAAAA TATTTGTTGT TAAAAATTAC AATCTTAAGT CCTTTGCAAA CGTTGTTTAA 3960 AAATAAAATT AAATTAATTA TTTTACAAAA CTTAACCCTT TTTCACTTTT ATACCTAATA 4020 TAAAGAGGTC CGTAAAGTAT CAAGGAGGAG AGCGATTTTC AAATACCGAC AAGTGTTTTA 4080 GATTTTATAG AGGATTCTGA CCTGCCATGA AAAACCCATT CTTAAAAAAT TTAACACTTT 4140 GTCACAGGTA AAAAATTGGA CCCTATCATA AAGTGAGGTT TGAAAATTAT GTAATTACCT 4200 GGTAAACTGT AAATTACAAG ATTGGCAATC GGAACCAATT TTTGGTCTAT ATAAACCCAG 4260 AAACACCTGT GCGTCTGCAC GTGTTATGTA GGCAGATCCT AAAACTAGGA AAGTGGCACG 4320 TGTTTATATG GTGGGCAATA GAATTTA 4347 // ID DMTNFB standard; DNA; INV; 1106 BP. XX AC V00246; J01084; XX DR FLYBASE; FBgn0000638; FB. XX FT source V00246:1..1106 XX CC Derived from V00246 (g8708) (Rel. 36, Last updated, Version 3). CC Josh Kaminker 2 Aug 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1106 BP; 340 A; 228 C; 119 G; 419 T; 0 other; AGCTCAAAGA AGCTGGGGTC GGAAAAATCG AATTTTTGAA ATTTGAAAGC TGGAATCGTT 60 TGCCCATTTT TTGCCCATGT TTGCCCACCA ATTAGTTTTT TTTGCCCACG TCCAGTTTTT 120 GAGATATGGA TTTTCGAAAA AGTTCGAAAA TGTTCGAAAA TCAAAAATTT CGCTTTTTTC 180 AAATTTTTTT TTTTTTAAAT CGCAATAACA TCGTTTGCCC ACGTTTGCCC ACCCTTTAGA 240 ATTTTGAAAA AATTTATACT TTAGAAAATA TAAGGCTTTT AAGTTTACCT CGGTCTAATC 300 AGAGAGTAAA TCGTTTGCCC ATCTCTTAAA ACCAAATATT ATCAACAAAA AACGTTTGCC 360 CAACCATTAT TATTAGTTTT TATCGTTTGC CCACCCTTTA AAAAACCTTT AACAAAATTT 420 TTTTTTCGAT TGCCCACACT TGAAATACAA CCAATTTCGT TAGCCCACCT CTTCAAAATA 480 AATATTTCCA ATAAAAAACG TTTTCCCACC ATTTAAAAAT AAATAATTTC GATTGCCCAT 540 CCTTCAAAAT TCATTTTAAC GTTTGCCCAC CCTTTAAAAT TTGTTTTTTT CGTTTGCCCA 600 CTCTTAAAAC TAAATAATTT CGATTGCCCA CCTTTTAAAA CTAAATAATT TCGTTTGCCC 660 ATCCTTTAAA ATTCATTTTT AACGTTTGCC CACCCTTTAA AAATAAATTA TTTCGTTTGC 720 CCACCCTTTA AAATTTGTTT TTTTCGTTTG CCCACTCTTA AAACTAAATA ATTTCGATTG 780 CCCACCTTTT AAAACTAAAT AATTTCGTTT GCCCATCCTT TAAAATTCAT TTTAACGTTT 840 GCCCACCCTT TAAAAATAAA TTATTTCGTT TGCCCACCCT TTAAAAGTTT TTTTTTTTCG 900 TTTGCCCACT CTTAAAACTA AATAATTTCG ATTGCCCACC TTTTAAAACT AAATAATTTC 960 GTTTGCCCAT CCTTTAAAAT TCATTTTTAA CGTTTGCCCA CCCTTTAAAA TTTGTTTTGT 1020 AAGATGTGGC GCCAATTCAG ATATTTTAGG ATCGGCGGAT AGAAGCACTT ACTTATATGA 1080 TGATGATGAA CATACATAGA CATAAT 1106 // ID DMREPG standard; DNA; INV; 4346 BP. XX AC X06950; XX DR FLYBASE; FBgn0001100; G-element. XX FT source X06950:1..4346 FT SO_feature CDS ; SO:0000316:220..951 FT /db_xref="FLYBASE:; G-element\ORF0" FT SO_feature CDS ; SO:0000316:819..1539 FT /db_xref="FLYBASE:; G-element\ORF1" FT SO_feature CDS ; SO:0000316:join(1530..1858,1866..1973,1982..2719,2727..3008, FT 3012..3800,3806..3991,4006..4095) FT /db_xref="FLYBASE:; G-element\ORF2" XX CC Derived from X06950 (g8427) (Rel. 16, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4346 BP; 1215 A; 1356 C; 959 G; 816 T; 0 other; ACAGTCGCGA TCGAACACTC AACGAGTGCA GACGTGCCTA CGGACCGACG GCAAGTTATT 60 TTCGTGCTCA AAGTCCCGCT ACTCTAAAAC CGCTACGTAG TGTCGCGAGA TTTCTTCGCG 120 CACCGTGATT GGTTCAGCCG GCGAACCTTA CGGTATCGCT ACCACTACCA ACGCACTCGT 180 GCGTGCGTGT TATCGGTATC AACAGTTACA TTCGGCTAAA GTTACTGCGA ACAACTCAGC 240 AGCAGCCACG TGCTGAGGCT GGTACACCAA CAAACGGTTC CTACCGTGCC CTCCTCCCCT 300 CCTTCCCTAC TCCGGGACAA CATGGACTGG CAAGCCCCCC CGCGACCCAC CAAGCTGACC 360 AAAGTGCCTA GAAAGAAGGC GCTCAAAGAG GCGCCAGGAG AAGGTGAAAG CAGCTGCTCA 420 AGCGATAGCA GCTCCTCGGA GTCAGAGCCT GGGGAAGTCA AGCGCAAAGC AGCGAGCAGA 480 GACGCTAAGG AAGCCGCCGA CAACGTGCCC AACACCAGCG CAGCTCTGCG CAAGAAGCTG 540 GAAAATAACT CCTTCGCCCT TCTGTCCAGC ACTGAGGACG AAGACGATGA CGACGACAAC 600 ACCGACAACG AGCAGCAAAC CCCTGTTGGG GAATCTGCTC CAAAAACCAT GAAAAAACCC 660 AACCCGACCC CGAAGACCAT CAAGCCACCC CCGATCTACA TCCCAGACGT GACCAACATC 720 TCAGCCCTTG TCAGGATGAT TACGACTCTC GTCGGTGCCC ACAAGGAATT CTCGTACAAA 780 ACTGAGAGAA ACAACAATGT ACGAGTAATG ATGCCTGACA AGGAATCCTA CTCAGCCTTT 840 CGTCAGCAGC TTGTGACCCA GAACAAAAGG CACCGCACAT TTCAACTGTC AGGGACCTGC 900 ACAACCCAAT TGGCAAAAAA TCAAAGGAAC CCCTGGGGAT CTTCTTTGTA AACCTGGAAC 960 CTGCGAGCAA CAATACAGAC ATCTACAAAC TCAAGAGAAT CTGCAGGTCG GTCGTCACCG 1020 TTGAGCCGCC TCTGAAATTC AACGATGTTC CGCAGTGCTT CAGATGTCAA GGGTTCGGAC 1080 ACACCCAGCG CTACTGCTTT TTAGAGTTTC GCTGCGTCAA GTGTGGTGGC CTCCACGACT 1140 CCAGGGCGTG TGAAAAAAAG GAAGACGAGA AAGCATGCTG CCTACACTGT CAAGCCGACC 1200 ATCCAGCGTC GTTCAAAGGG TGCCCCGCGT ATAAGAAGGC AAAGGCTCAA CAAGCTCCTA 1260 AACCCAAAGC AAGGAGCATG GAAAGCAACA ACAAGCCCTC CTTTGAGCTC CCAAATATTA 1320 CAAACGGTAT GAGCTATAGA GACGCGCTAA GTGGCACACG CAAGTCCCAA GCAAGCACTC 1380 CCCCACCGAC ACCCCCAACC CCACCTGAAG CCCCACAACC TAACCACATG GAGGCTATGT 1440 TCACTCGATT TGAGAGCCTG GTCGAAAGAA TGATGGAGAA GATGTTTGCT CAGGTGACGC 1500 AGCTTGTTGC TTCCATCCTC AACAGCAAGT CATGCAAATA AGTCTCAACA TAGTCTTCTG 1560 GAACGCGAAC GGCTTGCAGA GAAGCAAAGC CGAAGTTGAG CACACCATCA AAACCGACAA 1620 CATCGATATT TTATTGGTCT CAGAATCCCA TTTTTGCCCC AGATCCCACT TCATCATCTC 1680 CGGTTACGAC CTCATCACAG CCAACCACCC ATCAGGTAGA GCTCGAGGAG GAGCGGCCAT 1740 GCTCATCAAA AGCGGCATAC AGTTCACTGA ACTGCCTGCG ATACAGGAGG ATTGGGCACA 1800 GTGTGCAGTG GCCAGAGTCA ATAGCCTACA GGGAGATATT ACGGTTGGAG CGGTTTACTT 1860 CCACCCCCAG GCACGCGATT ACAGAGACTC ACCTGCATGA GTTCTTCGAG TCCCTCGGAA 1920 CTCGCTTCAT TGCAGCCGGA GACTTCAATG CAAAGCACTC CTGGTGGGGG TCCGCACAAA 1980 CAACCCCAAA GGCAAAACGC TCCACAAGTA CCTGATGCGC AAAAACTTGG ACTGCCACTC 2040 TACTGGAGAG CCCACACACT GGCCCTCGGA CCCTTCTAAG CAGCCGGATC TGCTGGACAT 2100 CGCGATCTGC AAAGGCATAG GTCGTGCCAA ACTCGTCTGC ACTACATACG ACAGGCTCGT 2160 ATCGGACCAC AGCGCCGTCA ACCTGCTCCT CAACATCCCT GTCCTCAGGA AGACGCCGCT 2220 CCGTAGACTC ACGGGGAATC GCACCAATGC CCCCAAGTTC ACGTTCTGGA TGCTCTCCTC 2280 CCTAAACCCA GACCCAGACC TCTCCACTCC AGGCAATATA GGCGCGGCCA TCGAAAAACT 2340 GAACAAGGAG ATGCACAACG CCGCTGAGTT TGCGAACCCT CCTCCTCCTA CAACCCCGAG 2400 AACTCCCGCA AGAGACCTGC ATTTGTGGTC CCCAGAAATC GCCGCCCTCG TGGCCGAGAA 2460 GAGACGCCTC AGACGAGTAT GGTTCCTCTC GCGTAACCCC AGGGACAAGA CAGCGCTCAA 2520 TCGCGCCTCC AAGGAACTCA AGGACAAACT AACCACCCTA CGCCAAGACT CGTTTCAACG 2580 ATTCCTTGAA GATCTGGAAC CTGGAGACCC GCAGCACAAC CTGTGGATCG TCACGCGGCA 2640 CATCAAAAGA CCCGCCAAGA AAATGGTACC AGTGCGTACA GCAGACTGCT CCTGGTGTCG 2700 GTCTGAGGCA GAAAGAGCCG AAGCTTGCTG ACCACCTTCG CTCTGCCTTC ACTCCGTTTG 2760 ACCGATGCAC AGCTGCAGAG CAAGCTGACA CCATCAGAGC TGTTGAAAGC CCATGTGCTC 2820 CAGGACCTGC AATTCAGCCC GTCGCACCAG AGGAGATCGC GCAGGAAATT GCCTCGCTCA 2880 GAAACGGCAA GTCTCCCGGC CCTGATCGCA TCGACGCTAC TGCGTTAAAA ATGTTGCCCA 2940 CATTCTGCTC ACAGCTGCTT GCCAACATTT TTAACAGCTG CTTCCGGCTA GGGTATTTCC 3000 CAAAACAATA GAAACGCGCC GAAGTGATTA CCATCCCCAA GCCCGGCAAA CCTGAAGCCA 3060 ATCTTGCCTC CTATCGTCCG ATAAGTCTGC TGGCAATCCT CTCCAAAATA CTCGAAAGAG 3120 TATTTCTGCG CAGAGTGCTG CCAGTACTGG ATGAGGCTGG TTTGATCCCC GATCACCAGT 3180 TTGGCTTCAG GCGCTCCCAC GGAACACCAG AGCAATGCCA CCGGCTTGTA GAGCAAATTT 3240 TGGAGGCCTT CGAAAGGAAG CAATACTGCT GCGCCGTCAT GCTGGATGTG AAGCAGGCCT 3300 TCGACAAAGT CTGGCACCCT GGACTCCACT ATAAAATCAA GACTCACCTT CCCGGATCCC 3360 ACTTCGCCTT CCTCAAATCA TTCACTGAGG GTAGAGAGTT CCAAGTTTGC TGCGGAACAG 3420 CGACCAGCAC GCCTAGGCCG ATAAGAGCCG GAGTACCCCA AGGCAGCGTC CTTGGACCAA 3480 TACTGTACAC ACTCTACACA GCAGACCTTC CTATCACACC CTCCCGGAGC CTAACAGTGG 3540 CCACATATGC CGATGACACC GCCTTCCTAG CCTCCGCCTC AGACCCCCAA GAAGCATCAA 3600 CCATCATTCT AAGCCAGCTG GATGCCCTCG ACCCATGGTT GAAACGATGG ACCATTGCCG 3660 TGAACGCAGA CAAATCCTCC CAAACCACTT TCTCCCTGCG CAGAGGAGAC TGCCCCCCAG 3720 TCACGCTCAA CGGGGAAACT ATTCCAACCT CAAGTTCCCC GAAATACCTT GGATTGACTC 3780 TAGACCGGAG GCTCACTTGG CACACCAGGC TGACCTGCGC CTCAAGCAAC TCCACTGGCT 3840 CATCGGGAAA AGGTCCAAAC TTAGGGAAAA CCTTAAACTC CTCCTGTACA AGGCCATCCT 3900 GAAGCCAATT TGGACTTATG GGATTCAGCT GTGGGGCACT GCCAGCATCT CAAACCGCAA 3960 CCGCATACAG CGCTTCCAGA ACAAGTGCCT GAGTCAATCG CTGACGCTCA CCCATACCAT 4020 GAAAACTCCG TTATCCACAA GGAGCTTGGA ATGCCATGGG TAGCAGAGGA GATCTCCCGC 4080 TTCAGCGAGA GATACGCTAA ACGACTGGAC AACCACCCTA ACCATCTGGC TATTAACCTC 4140 CTGGACAACA GTGAAACCAT CAGACGCCTC CAGAGGAAAC ACCCGCTTGA TCTCCACCAC 4200 CTATAACCCA CAACAATGAA CCCCCGACCA ATCTACAACT TTGTAATCCC TTAAGTTAAT 4260 GCCCCCCCCA CCCAAACATT TAATTATTGT CCACATGGAC AGATTTTAAA TTAATACATA 4320 GATCGCTAAA AAAAAAAAAA AAAAAA 4346 // ID DMGYPF1A standard; DNA; INV; 7469 BP. XX AC M12927; XX DR FLYBASE; FBgn0001167; gypsy. XX SY synonym: mdg4 XX FT source M12927:1..7469 FT SO_feature five_prime_LTR ; SO:0000425:1..482 FT SO_feature three_prime_LTR ; SO:0000426:6841..7411 FT SO_feature polyA_site ; SO:0000553:7277..7280 FT SO_feature primer_binding_site ; SO:0005850:482..492 FT /bound_moiety="tRNA:lys2" FT SO_feature CDS ; SO:0000316:1080..2435 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0014965; gypsy\gag" FT /db_xref="SWISS-PROT:P10405" FT /protein_id="AAA70218.1" FT /translation="MSWAHNYRKVKVEYESEDSWEEEQVGQALGRPLDSATVDITMDPN FT QIQALIDNAVRQALSQQQSQFQTQLNSLAARVQSLQVEAPQIKIYEKVSVNPDVRCDIP FT LDIIKSVPEFSGTQDEYVAWRQSAIYAYELFKPYNGSSAHYQAVAILRNKIRGAAGALL FT VSHNTVLNFDAILARLDCTYSDKTSLRLLRQGLEMVRQGDLPLMQYYDEVEKKLTLVTN FT KIVMTHEQEGADLLNAEVRADALHAFISGLKKALRAVVFPAQPKDLPSALALAREAEAS FT IERSMFANSYAKAVEERAHSGANGKSRFQGKPNKEEQGQDRNPHFTKRPKNNGQTNKDT FT QAQAPQPMEVDSSSRFRQRTEHYQNHPNESNAFKRRNSSERSTGPRRQRLNNVVQEAPK FT QKDPKEEYEKTAKAAVEEIDSENEYAPSDDSLNFLGGAPGCRSLNDGWLGEP" FT SO_feature CDS ; SO:0000316:2438..5470 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0014966; gypsy\pol" FT /db_xref="SWISS-PROT:P10401" FT /protein_id="AAA70219.1" FT /translation="MLIDTDAAKNYIRPVKELKNVMPVASPFSVSSIHGSTEIKHKCLM FT KVFKHISPFFLLDSLNAFDAIIGLDLLTQAGVKLNLAEDSLEYQGIAEKLHYFSCPSVN FT FTDVNDIVVPDSVKKEFKDTIIRRKKAFSTTNEALPFNTAVTATIRTVDNEPVYSRAYP FT TLMGVSDFVNNEVKQLLKDGIIRPSRSPYNSPTWVVDKKGTDAFGNPNKRLVIDFRKLN FT EKTIPDRYPMPSIPMILANLGKAKFFTTLDLKSGYHQIYLAEHDREKTSFSVNGGKYEF FT CRLPFGLRNASSIFQRALDDVLREQIGKICYVYVDDVIIFSENESDHVRHIDTVLKCLI FT DANMRVSQEKTRFFKESVEYLGFIVSKDGTKSDPEKVKAIQEYPEPDCVYKVRSFLGLA FT SYYRVFIKDFAAIARPITDILKGENGSVSKHMSKKIPVEFNETQRNAFQRLRNILASED FT VILKYPDFKKPFDLTTDASASGIGAVLSQEGRPITMISRTLKQPEQNYATNERELLAIV FT WALGKLQNFLYGSREINIFTDHQPLTFAVADRNTNAKIKRWKSYIDQHNAKVFYKPGKE FT NFVADALSRQNLNALQNEPQSDAATIHSELSLTYTVETTDKPLNCFRNQIILEAARFPL FT KRNLVLFRSKSRHLISFTDKSWLLKTLKEVVNPDVVNAIHCDLPTLASFQHDLIAHFPA FT TQFRHCKNVVLDITDKNEQIEIVTAEHNRAHRAAQENIKQVLRDYYFPKMGSLAKEVVA FT NCRVCTQAKYDRHPKKQELGETPIPSYTGEMVHIDIFSTDRKLFLTCIDKFSKYAIVQP FT VVSRTIVDITAPLLQIINLFPNIKTVYCDNEPAFNSETVTSMLKNSFGIDIVNAPPLHS FT SSNGQVERFHSTLAEIARCLKLDKKTNDTVELILRATIEYNKTVHSVTRERPIEVVHPG FT AHERCLEIKARLVKAQQDSIGRNNPSRQNRVFEVGERVFVKNNKRLGNKLTPLCTEQKV FT QADLGTSVLIKGRVVHKDNLK" FT SO_feature CDS ; SO:0000316:join(567..568,5551..7000) FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0014964; gypsy\env" FT /translation=" FT MFTLMMFIPLVVANARITDFSHANYIPVLDGDVLVFEQRDLLKHSSNLSE FT YASMIDETQKLSESFPHSHMRKLLEVDTDHLRTLLSVLKVHHRIARSLDF FT LGTALKVVAGTPDATDLFKIKITEAQLVESNSRQIAINSETQKQINKLTD FT TINKVINARKGDLVDTPHLYEALLARNRMLSTEIQNLILTITLVKSNIIN FT PTILDHADLKPLVEQDTPIVSLIEASKIRVLQSENSIHILIAYPRVKFSC FT KKVAVYPVSHQHTILRLDEDTLAECEHDTFAVTGCTDTTHFTFCERSRRE FT TCVRSLHAGNAAQCHTQPSHLREINPVDDGVVIINEAAAHVSTDGSPETL FT IEGTYLVTFERTATINGSEFVNLRKTLSKQPGIVRSPLLNIVGHDPVLSI FT PLLHRMSNENLHSIQNLMDDVESEGSPRLWFVAGVVLNFGLIGSLALYLA FT LRRRRASREIQRTIDTFNMTEDGHKLEGGVVNN" XX CC Derived from M12927 (g157583) (Rel. 44, Last updated, Version 6). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. CC [See also:AF033821; alignments in /alignments/gypsy] XX SQ Sequence 7469 BP; 2301 A; 1808 C; 1602 G; 1758 T; 0 other; AGTTAACAAC TAACAATGTA TTGCTTCGTA GCAACTAAGT AGCTTTGTAT GAACAATGCT 60 GACGCGCCAG AATTGGGTTC AACGCTCCAC GCGAAGAATG CCTGGCAGCG GAAAGCTGAC 120 ACTTCCTACC GGGAGTGTTG CTTCACGCTG CAAGAAATGC TGAGTCGGCT TGCCGACTTG 180 TGGCGGCGCG ATGCATTGCT CGAGGGTAAA CTTAGTTTTC AATATTGTCT TCTACTCAGT 240 TCAAATCTTG TGTCGAAATA AACCACAGCT TGCTCCGGCT CATTGCCGTT AAACATCATT 300 GTTCTTATTT ACAATCAAAT CGCTATCGCC ACAAGGCTAG TGATAATAAC TAAGGGGGCG 360 AAGTCAAGCC CTCCAACCTA ATCTCCATAA ACAGTGTCTA AGACGAACCT CAGCGAAAGA 420 AGGAAGATCT CTAGACCTAC TGGAAATAAC ATAACTCTGG ACCTATTGGA ACTTATATAA 480 TTGGCGCCCA ACCAACAATC TGAACCCACC AATCTAATTT AACACACTTT GTCAGGCGAC 540 AAACAGGGTA GTTAAGTTAG AAAAGCATGT AAGTTTTACA AGACACTTCT TTGACGCAAT 600 CAAGAAATTT ACGAGTGAAA AAAAAAAAAA AAAAAAGTTG TGTATCTGGC CACGTAATAA 660 GTGTGCGTTG AATTTATTCG CAAAAACATT GCATATTTTC GGCAAAGTAA AATTTTGTTG 720 CATACCTTAT CAAAAAATAA GTGCTGCATA CTTTTTAGAG AAACCAAATA ATTTTTTATT 780 GCATACCCGT TTTTAATAAA ATACATTGCA TACCCTCTTT TAATAAAAAA TATTGCATAC 840 TTTGACGAAA CAAATTTTCG TTGCATACCC AATAAAAGAT TATTATATTG CATACCCGTT 900 TTTAATAAAA TACATTGCAT ACCCTCTTTT AATAAAAAAT ATTGCATACG TTGACGAAAC 960 AAATTTTCGT TGCATACCCA ATAAAAGATT ATTATATTGC ATACCTTTTC TTGCCATACC 1020 ATTTAGCCGA TCAATTGTGC TCGGCAACAG TATATTTGTG GTGTGCCAAC CAACAACCAA 1080 TGAGTTGGGC ACATAACTAC AGAAAGGTTA AGGTCGAATA CGAAAGCGAG GATAGCTGGG 1140 AGGAGGAGCA AGTAGGCCAA GCATTAGGTC GGCCGTTAGA TAGTGCCACG GTAGATATTA 1200 CCATGGACCC CAATCAGATT CAAGCTCTTA TCGACAATGC TGTCAGACAG GCATTGTCGC 1260 AACAGCAATC CCAATTTCAG ACACAACTCA ATTCCCTAGC TGCGCGGGTA CAGAGTTTGC 1320 AGGTGGAAGC ACCGCAAATC AAGATTTACG AAAAAGTCTC TGTTAACCCC GATGTTAGGT 1380 GCGACATTCC CCTTGACATA ATAAAGTCTG TACCAGAGTT CTCCGGTACC CAAGACGAGT 1440 ATGTGGCCTG GAGACAATCG GCCATATACG CCTACGAGCT CTTCAAACCA TACAATGGCA 1500 GCAGTGCCCA TTATCAGGCT GTTGCCATAT TAAGGAATAA AATCCGTGGC GCAGCCGGGG 1560 CTTTACTGGT CTCCCACAAT ACGGTATTGA ACTTCGATGC TATTTTGGCC AGACTAGACT 1620 GCACGTACTC GGACAAAACA TCCTTACGCC TGTTGAGGCA AGGATTGGAA ATGGTTAGGC 1680 AAGGAGACCT ACCACTAATG CAATACTACG ATGAAGTTGA AAAGAAGCTA ACGCTTGTCA 1740 CTAACAAAAT CGTAATGACG CATGAACAAG AGGGTGCTGA CCTGCTTAAC GCTGAGGTCA 1800 GAGCCGACGC CCTGCATGCT TTTATTTCGG GGCTCAAAAA GGCCCTCAGA GCTGTGGTCT 1860 TCCCGGCCCA ACCAAAAGAC CTGCCATCTG CACTGGCTTT AGCTAGAGAA GCAGAGGCAA 1920 GCATAGAGAG AAGCATGTTC GCTAACTCCT ACGCCAAGGC CGTAGAGGAG CGAGCGCATT 1980 CGGGGGCAAA CGGCAAGAGC CGTTTCCAGG GGAAGCCAAA TAAAGAAGAA CAGGGACAGG 2040 ACAGGAATCC CCACTTCACC AAACGCCCCA AAAATAACGG ACAAACCAAC AAGGACACTC 2100 AGGCGCAAGC ACCCCAGCCA ATGGAGGTCG ATTCATCCTC CAGGTTTAGG CAGCGTACTG 2160 AACATTATCA GAATCATCCT AACGAGTCGA ACGCGTTTAA GAGGAGAAAT TCCTCAGAAC 2220 GCTCAACAGG ACCGAGACGA CAACGTCTGA ATAACGTTGT CCAAGAGGCC CCTAAACAAA 2280 AGGACCCCAA AGAAGAGTAT GAAAAAACAG CAAAGGCTGC AGTCGAGGAA ATCGACAGCG 2340 AAAATGAGTA CGCTCCCAGT GACGACTCGT TGAATTTTTT AGGGGGCGCT CCCGGTTGCC 2400 GTTCATTGAA CGACGGCTGG CTGGGAGAAC CTTAAAGATG CTAATCGATA CCGACGCGGC 2460 AAAAAACTAC ATTAGGCCCG TAAAGGAGCT GAAAAATGTA ATGCCGGTCG CCAGCCCTTT 2520 CTCGGTGAGC TCAATACACG GCTCCACCGA AATCAAACAC AAATGCTTGA TGAAAGTCTT 2580 CAAGCACATC TCCCCATTTT TTCTTTTGGA TTCTCTCAAT GCGTTCGACG CTATCATAGG 2640 CTTGGACCTG TTAACACAGG CCGGGGTAAA ACTCAACCTT GCAGAGGACT CCTTAGAATA 2700 CCAGGGCATC GCTGAAAAGC TTCATTATTT CAGCTGCCCC AGTGTAAATT TCACTGATGT 2760 AAACGATATT GTTGTACCTG ACTCCGTTAA AAAGGAGTTC AAGGACACAA TAATAAGGAG 2820 GAAGAAAGCT TTCTCCACAA CAAATGAAGC TCTTCCTTTT AACACCGCTG TCACTGCCAC 2880 AATTCGGACA GTTGACAATG AACCGGTGTA CTCAAGAGCG TACCCAACTC TTATGGGTGT 2940 CTCCGACTTT GTGAACAACG AGGTCAAACA ACTGCTGAAA GACGGCATTA TCAGGCCCTC 3000 AAGGTCTCCC TATAACAGCC CGACCTGGGT TGTTGACAAA AAGGGGACCG ACGCCTTCGG 3060 GAACCCAAAC AAGAGGTTGG TCATTGACTT CAGGAAGCTA AATGAGAAAA CTATTCCTGA 3120 CCGGTACCCG ATGCCTAGCA TTCCCATGAT TCTAGCGAAT CTGGGCAAGG CAAAGTTCTT 3180 CACTACCCTT GATCTTAAGT CAGGGTATCA TCAAATTTAC CTCGCGGAAC ACGACCGCGA 3240 GAAGACATCG TTCTCGGTGA ATGGTGGTAA ATACGAGTTT TGCCGTCTAC CGTTCGGCTT 3300 GAGAAATGCA AGCAGCATTT TTCAAAGAGC CCTAGACGAT GTGCTTAGAG AGCAAATCGG 3360 GAAGATATGT TACGTCTATG TAGATGACGT CATAATTTTC TCTGAAAACG AGTCCGACCA 3420 TGTCCGCCAC ATCGATACAG TACTAAAATG CCTGATCGAT GCCAACATGA GAGTAAGCCA 3480 GGAGAAAACT AGATTCTTTA AAGAGAGTGT AGAATACCTC GGCTTTATTG TCAGTAAGGA 3540 CGGAACTAAA TCCGATCCAG AGAAGGTGAA GGCCATTCAG GAGTACCCTG AACCAGACTG 3600 CGTTTACAAG GTTAGGTCCT TCCTTGGTTT AGCCAGCTAC TACAGAGTCT TCATCAAAGA 3660 CTTTGCTGCC ATAGCCCGCC CGATCACCGA TATCCTAAAA GGGGAAAATG GTTCGGTGAG 3720 CAAACACATG TCTAAAAAAA TTCCTGTTGA GTTTAATGAA ACTCAACGCA ACGCGTTCCA 3780 AAGACTGCGA AACATACTAG CATCCGAGGA TGTCATACTC AAATACCCCG ACTTTAAAAA 3840 GCCTTTTGAC CTTACTACAG ATGCTTCGGC AAGTGGTATC GGTGCAGTCC TATCCCAGGA 3900 GGGCAGGCCA ATCACCATGA TATCGCGTAC CCTTAAACAG CCCGAGCAGA ACTACGCCAC 3960 AAACGAAAGG GAATTGCTGG CGATTGTATG GGCCCTAGGT AAGTTGCAGA ACTTCCTGTA 4020 TGGCTCTAGG GAGATTAATA TATTTACCGA CCATCAACCC CTCACTTTCG CTGTTGCCGA 4080 CAGGAACACG AATGCCAAGA TAAAGAGGTG GAAATCTTAC ATAGACCAGC ATAATGCCAA 4140 GGTTTTCTAC AAACCTGGCA AAGAAAATTT CGTGGCAGAC GCCCTCTCTA GGCAGAATCT 4200 GAATGCCTTA CAAAACGAAC CCCAATCAGA CGCTGCGACC ATTCACAGTG AGCTCTCCCT 4260 GACCTACACG GTCGAGACAA CAGACAAACC GTTAAATTGC TTCAGGAACC AGATCATTCT 4320 GGAGGCAGCA CGTTTTCCGC TCAAACGAAA CCTGGTGCTC TTTCGAAGCA AATCTCGCCA 4380 CTTAATCAGC TTTACTGATA AAAGTTGGCT ATTAAAAACA CTTAAGGAGG TGGTAAACCC 4440 TGACGTCGTG AACGCTATTC ACTGCGACCT GCCCACTCTG GCAAGCTTCC AACACGACCT 4500 CATTGCCCAC TTTCCAGCCA CCCAATTTCG TCACTGTAAG AATGTCGTGT TAGACATAAC 4560 CGACAAAAAC GAACAGATCG AAATCGTCAC TGCCGAGCAC AACCGCGCTC ACAGAGCCGC 4620 ACAAGAAAAC ATTAAACAAG TCCTTCGGGA TTATTACTTT CCCAAAATGG GCAGTTTAGC 4680 TAAAGAAGTA GTAGCTAATT GTAGGGTCTG CACCCAAGCA AAGTATGACA GGCACCCGAA 4740 AAAGCAAGAG CTCGGGGAAA CGCCCATACC CAGCTATACA GGTGAGATGG TGCATATTGA 4800 CATATTCTCA ACCGACAGGA AGCTATTCCT GACGTGTATT GACAAATTTT CTAAATATGC 4860 AATAGTGCAA CCAGTGGTGT CTAGAACAAT AGTGGACATC ACAGCACCCC TGTTGCAGAT 4920 CATTAACCTG TTCCCCAATA TCAAAACGGT CTATTGTGAC AATGAGCCCG CATTTAACTC 4980 AGAAACTGTC ACCTCAATGC TCAAGAACAG CTTCGGCATT GACATAGTAA ATGCGCCCCC 5040 ACTCCACAGC TCATCCAATG GCCAAGTTGA ACGGTTCCAC AGCACATTGG CAGAAATCGC 5100 CAGGTGCCTG AAGTTGGACA AAAAAACGAA TGACACAGTA GAACTAATCT TGAGGGCGAC 5160 GATAGAATAT AACAAAACCG TGCACTCAGT TACTCGTGAG AGACCAATTG AGGTGGTTCA 5220 CCCAGGGGCC CACGAGCGCT GCCTAGAAAT CAAGGCAAGA TTAGTAAAGG CTCAGCAAGA 5280 CAGCATCGGA AGAAACAACC CTTCCCGACA AAACCGCGTG TTTGAGGTGG GAGAACGCGT 5340 GTTTGTAAAA AACAACAAGA GGTTAGGAAA TAAGCTAACT CCACTATGCA CCGAGCAAAA 5400 AGTGCAGGCA GACTTGGGAA CGTCTGTTCT TATTAAGGGG AGGGTGGTCC ACAAGGACAA 5460 CCTCAAGTAG ACATTCCCTC TACAGTTAGG TAGTAAGTTA TGTCAAGGAA AATCCGAGCA 5520 CTGTAGTATC ACCTTGTCTT TAATTTCCAG GTTCACCCTC ATGATGTTCA TACCCTTGGT 5580 AGTAGCGAAT GCTCGGATCA CCGACTTTTC GCATGCCAAC TACATTCCTG TGTTAGATGG 5640 GGATGTGCTG GTGTTTGAAC AGCGTGACCT CTTGAAACAT TCGAGTAACC TTTCCGAGTA 5700 CGCTAGTATG ATAGATGAAA CACAGAAACT GTCCGAGTCC TTTCCCCACT CACATATGCG 5760 TAAGTTGCTA GAGGTCGATA CTGACCATCT TAGAACCTTG TTGTCCGTTC TCAAAGTCCA 5820 CCATAGGATA GCTAGGAGTC TAGATTTCTT AGGTACAGCC TTAAAGGTTG TGGCGGGTAC 5880 TCCCGATGCC ACGGACCTCT TTAAAATTAA GATCACAGAG GCCCAACTAG TAGAATCTAA 5940 TTCCAGGCAG ATAGCTATAA ACTCCGAAAC CCAGAAACAG ATAAATAAGT TAACTGACAC 6000 CATCAATAAG GTGATCAATG CCCGTAAAGG CGACTTGGTT GACACTCCAC ACTTATATGA 6060 AGCACTACTA GCAAGAAATA GGATGCTGTC TACAGAAATT CAAAATTTAA TTCTCACTAT 6120 TACTTTGGTC AAATCAAACA TTATAAATCC CACAATTCTT GATCATGCCG ACTTGAAGCC 6180 TCTTGTAGAA CAGGATACCC CAATTGTCAG CTTAATAGAA GCATCTAAGA TCAGGGTCCT 6240 CCAGTCCGAG AATAGCATTC ATATTTTAAT TGCCTATCCT AGAGTCAAGT TCAGTTGCAA 6300 GAAAGTCGCC GTCTACCCTG TATCTCACCA ACACACCATC TTGCGCCTCG ACGAAGACAC 6360 TTTGGCCGAA TGCGAACATG ACACCTTTGC GGTCACCGGA TGCACAGACA CCACACACTT 6420 CACGTTCTGC GAGCGGTCTC GGCGCGAAAC TTGCGTGCGC TCACTCCATG CTGGAAACGC 6480 TGCTCAATGC CACACTCAAC CCAGCCACTT GCGAGAAATA AACCCCGTAG ATGATGGCGT 6540 TGTGATTATC AACGAAGCCG CAGCTCACGT TAGCACTGAT GGCAGCCCCG AAACACTGAT 6600 AGAGGGAACC TACCTGGTAA CCTTCGAGCG AACGGCAACC ATCAACGGCT CTGAATTCGT 6660 AAATCTAAGG AAAACACTAA GCAAGCAGCC AGGCATCGTG CGTTCACCAC TACTTAACAT 6720 CGTCGGCCAC GACCCTGTGC TCAGTATACC TCTGCTACAC CGGATGAGTA ACGAAAACCT 6780 ACATTCCATC CAAAACCTTA TGGATGACGT GGAATCTGAA GGCTCGCCCA GACTCTGGTT 6840 CGTGGCTGGT GTGGTCCTAA ACTTCGGCTT GATTGGCTCT CTCGCCCTTT ATCTGGCATT 6900 AAGGAGAAGA CGAGCCTCTA GGGAGATACA GCGCACCATC GATACTTTCA ACATGACCGA 6960 GGACGGTCAT AAACTTGAGG GGGGAGTAGT TAACAACTAA CAATGTATTG CTTCGTAGCA 7020 ACTAAGTAGC TTTGTATGAA CAATGCTGAC GCGCCAGAAT TGGGTTCAAC GCTCCACGCG 7080 AAGAATGCCT GGCAGCGGAA AGCTGACACT TCCTACCGGG AGTGTTGCTT CACGCTGCAA 7140 GAAATGCTGA GTCGGCTTGC CGACTTGTGG CGGCGCGATG CATTGCTCGA GGGTAAACTT 7200 AGTTTTCAAT ATTGTCTTCT ACTCAGTTCA AATCTTGTGT CGAAATAAAC CACAGCTTGC 7260 TCCGGCTCAT TGCCGTTAAA CATCATTGTT CTTATTTACA ATCAAATCGC TATCGCCACA 7320 AGGCTAGTGA TAATAACTAA GGGGGCGAAG TCAAGCCCTC CAACCTAATC TCCATAAACA 7380 GTGTCTAAGA CGAACCTCAG CGAAAGAAGG AAGATCTCTA GACCTACTGG AAATAACATA 7440 ACTCTGGACC TATTGGAACT TATATAATT 7469 // ID DMHFL1 standard; DNA; INV; 2959 BP. XX AC M69216; XX DR FLYBASE; FBgn0001210; hobo. XX SY synonym: H-element XX XX FT source M69216:1..2959 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..12 FT SO_feature terminal_inverted_repeat ; SO:0000481:2948..2959 FT SO_feature TATA_box ; SO:0000174:107..112 FT SO_feature polyA_signal_sequence ; SO:0000551:2382..2394 FT SO_feature CDS ; SO:0000316:316..2292 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0014191; hobo\T" FT /translation=" FT MAPYIMIVEFLCLWSSVSAVNCPFFVFYDAITSLLGFSIIWKPKEKVTIM FT AEAADFVKNKINNGTYSVANKHKGKSVIWSILCDILKEDETVLDGWLFCR FT QCQKVLKFLHKNTSNLSRHKCCLTLRRPTELKIVSENDKKVAIEKCTQWV FT VQDCRPFSAVTGAGFKNLVKFFLQIGAIYGEQVDVDDLLPDPTTLSRKAK FT SDAEEKRSLISSEIKKAVDSGRASATVDMWTDQYVQRNFLGITFHYEKEF FT KLCDMILGLKSMNFQKSTAENILMKIKGLFSEFNVENIDNVKFVTDRGAN FT IKKALEGNTRLNCSSHLLSNVLEKSFNEANELKKIVKSCKKIVKYCKKSN FT LQHTLETTLKSACPTRWNSNYKMMKSILDNWRSVDKILGEADIHVDFNKS FT SLKVVVDILGDFERIFKKLQTSSSPSICFVLPSISKILELCEPNILDLSA FT AALLKERILENIRKIWMANLSIWHKAAFFLYPPAAHLQEEDILEIKVFCI FT SQIQVPISYTLSLESTETPRTPETPETPESLESPNLFPKKNKTISSENEF FT FFPKLVTESNSNFNESPLDEIERYIRQRVPLSQNFEVIEWWKNNANLYPQ FT LSKLALKLLSIPASSAAAERVFSLAGNIITEKRNRLCPKSVDSLLFLHSY FT YKNLNNSQ" XX CC Derived from M69216 (g157606) (Rel. 41, Last updated, Version 3). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. CC CDS annotation from Lynn Crosby's annotation 'H-element.v004'. XX SQ Sequence 2959 BP; 994 A; 541 C; 571 G; 853 T; 0 other; CAGAGAACTG CAAGGGTGGC ACTTTTTTAC CACTCGACTC ACACCCTACA ATTTTGTGTG 60 CGGGTGCTAC TCGCCACGCA CATCGCGGGT ACTTACAAAC ACACAGTATA AATCTGAACA 120 TGCAGACAAG ACACCCCGTT GTGTGCGCAC CCGAATCAAT ACGGTGTTTT GCGTCGCGGG 180 TGCCGCTCAC ACAGTGCCTA AAAAGGGATG AGTGAGAAAA ACACTTGTGG GTATACCGTT 240 AAACACATGG GTGTTTCCAA AAATACTCGG GTGTTTCCAA AAATACTCGA GTGGTCTCGT 300 AGGTAGTCGA GTCAAATGGC GCCATACATA ATGATTGTTG AGTTCTTGTG TCTTTGGTCC 360 AGTGTCTCGG CTGTTAATTG CCCCTTTTTT GTTTTTTACG ATGCAATTAC TAGCTTGTTA 420 GGATTCAGTA TTATTTGGAA GCCAAAGGAA AAGGTCACAA TAATGGCAGA AGCGGCTGAT 480 TTCGTTAAAA ATAAAATTAA CAATGGAACA TACTCAGTTG CCAATAAACA TAAAGGAAAA 540 AGTGTTATTT GGAGCATTTT ATGTGACATT TTAAAGGAAG ATGAAACTGT TCTGGACGGA 600 TGGCTGTTCT GCAGGCAATG CCAGAAAGTG CTCAAATTTT TACACAAAAA CACCTCCAAT 660 TTATCCCGCC ATAAATGTTG TCTAACATTA AGACGACCAA CGGAATTAAA AATTGTTTCG 720 GAAAACGACA AGAAAGTAGC TATTGAAAAA TGCACCCAAT GGGTTGTCCA AGATTGTCGG 780 CCGTTTTCTG CAGTAACCGG AGCCGGATTT AAAAATTTGG TGAAGTTTTT CCTACAAATC 840 GGCGCTATCT ATGGGGAACA GGTAGACGTC GATGACTTAC TACCTGATCC AACAACATTA 900 AGTCGGAAGG CCAAATCGGA TGCAGAAGAG AAGAGGAGTC TAATCTCGTC CGAGATAAAA 960 AAAGCTGTGG ATAGCGGAAG AGCAAGTGCG ACCGTCGACA TGTGGACTGA CCAGTATGTC 1020 CAAAGAAACT TTTTGGGCAT CACTTTCCAT TACGAAAAAG AATTTAAACT TTGTGACATG 1080 ATTTTGGGAC TAAAATCGAT GAATTTTCAA AAATCGACTG CCGAAAACAT TTTAATGAAA 1140 ATTAAAGGTT TATTTTCGGA ATTCAATGTT GAGAACATTG ATAATGTTAA GTTTGTGACT 1200 GACAGGGGAG CAAATATAAA AAAGGCTTTA GAGGGCAATA CCCGTTTAAA TTGTAGCAGT 1260 CACCTGTTGT CAAATGTTTT AGAAAAATCG TTTAACGAGG CCAATGAACT CAAAAAAATT 1320 GTGAAATCAT GCAAAAAAAT CGTGAAGTAC TGCAAAAAAT CAAATTTGCA GCATACTCTA 1380 GAAACCACTT TGAAAAGCGC CTGTCCGACT AGATGGAACT CCAACTACAA AATGATGAAG 1440 TCCATTCTGG ATAACTGGCG TAGTGTGGAT AAAATATTAG GTGAAGCTGA TATCCATGTA 1500 GATTTTAATA AATCATCTTT AAAAGTTGTG GTAGATATTC TAGGAGACTT TGAACGAATA 1560 TTTAAGAAGT TGCAAACATC TAGCTCACCA TCTATATGCT TCGTATTGCC ATCCATCTCT 1620 AAAATTTTAG AATTATGCGA GCCGAATATT TTAGACCTTT CTGCAGCAGC ATTGCTTAAG 1680 GAAAGAATTT TGGAAAATAT TCGTAAGATT TGGATGGCAA ATCTAAGCAT ATGGCATAAG 1740 GCGGCATTTT TTTTATATCC ACCCGCAGCA CATCTTCAGG AAGAAGATAT TCTTGAAATA 1800 AAGGTGTTTT GCATTTCACA AATTCAAGTC CCAATTTCAT ACACATTAAG CTTAGAATCT 1860 ACAGAAACTC CAAGAACTCC AGAAACTCCA GAAACTCCAG AAAGTCTAGA AAGTCCAAAC 1920 TTATTTCCAA AAAAAAACAA AACAATATCT TCTGAAAACG AATTCTTCTT CCCAAAGTTA 1980 GTAACTGAGT CTAATTCCAA CTTCAATGAA TCTCCATTAG ATGAAATTGA ACGATATATT 2040 AGACAAAGAG TTCCATTGTC TCAAAATTTT GAAGTAATTG AGTGGTGGAA AAATAACGCA 2100 AACTTATACC CTCAGTTGTC AAAGTTAGCA TTAAAACTTT TATCAATACC AGCCAGTAGC 2160 GCAGCAGCTG AAAGAGTGTT TTCCCTAGCA GGTAATATAA TAACAGAAAA GCGAAATAGA 2220 TTATGCCCAA AATCTGTAGA TAGCCTCCTT TTTTTGCATT CCTATTACAA AAACCTAAAC 2280 AACTCGCAAT AGATATTCCT TCTTATAAGT ATATTTTATA TTATTAATTC TTATTATTTG 2340 CTTAATTTTT GTATAAGTGT TAAGTATTAA GTATAAGTAT TAATTAATAA TATATAAGAT 2400 TGTTATTTGT TAAGACATTA GATGCAAAAT CCTAAAAATG TGAAAGTAAT GAAGTTCCTT 2460 ATATTTAATA GATACTTTTT AAGCCCACTA TGTTTTTATT ATTTAGATTG AGACATTAAA 2520 AAACGTAAAA ATCAACAAAT GCCGTCTTTA ATTGCAATTA CTTTATGTGT TTGAAATGGG 2580 AGGCACCCAT TGAGTCCATC AAAGAGCAAA GACATGAGCA CAAAAATTTT CTTGGGTATT 2640 CCCTTTTACC CTTCATTTCT TATACCCGTC ACGCTTCCAC CCATACAAAT TTTAGGCGTA 2700 CAAAAAATGA CCAGAGAACT GCAGCCCGCA TACAAAAAAT GACCTGCGGC CGATCGTTGA 2760 CTGTGCGTCC ACTCACCCAT ACGGCTCTTG CGCAGCAGGC CTCGGGTGGT TTTTTTACTC 2820 GTAACAAAAA CACAACGTCG GTAAAACACT CGAGTATTTT GTGTTGCCGC AAGTAGGGTG 2880 TCAAAAAAAA CGGGGTGCCT AGAGTACCGA GTGTTTATCG GGTGGACGTA GAGTGCGAGT 2940 GGCGGGCTGC AGTTCTCTG 2959 // ID DMTHB1 standard; DNA; INV; 1653 BP. XX AC X01748; XX DR FLYBASE; FBgn0001181; HB. XX FT source X01748:189..1841 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..31 FT SO_feature terminal_inverted_repeat ; SO:0000481:1622..1653 FT SO_feature CDS ; SO:0000316:387..534 FT /db_xref="FLYBASE:FBgn0044055; HB\T" FT /db_xref="SPTREMBL:Q27293" FT /protein_id="CAA25884.1" FT /translation="MLILKLRKEGKTYKDIQKTLKCSAKMVSNAIKYKWKPENRGTKHK FT TTDIEDRRIVSYSKVYRFASFRDIKSELNLGISDVTIRRRLLNQNFSARSPRKVPLPSP FT RHIKARLSLAKTYLNWPVSKWRNILWTDGSKIMLFGGTGSLQYI" XX CC Derived from X01748 (g8693) (Rel. 49, Last updated, Version 3). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1653 BP; 568 A; 288 C; 302 G; 495 T; 0 other; ATGTACAGCT GTGTTCAGAA AAATAGCAGT GCGAAGGAAA CTAAGTAATA CAAAGGTATT 60 TTTCCATGTC CCTTTTCGGA ATCGACTTTT TATTCCTCTT ATTTTTGTTA AATGGAATGT 120 GTAGATAGGG AAAAAAAGAA AATCCGGTCA GTTTTTCTTG TTATCCTTTT TTTATTTACA 180 TTCTTGAGCA AAATCACAAT TTTTAGGCTG TTCATAAGAA TAGCAGTGTC TGGTTCTGAC 240 CAACGTAAAG TCCCGAAATG ATCAATATTT TCTAAAAAGT GAGTTTGGTT AAGTTAATTC 300 GTATATTTAA AAGGACAATA AATTAAAAAA ATTAAAAAAA TTTTATTTTA GTGGGTAGAG 360 GACAGCACTA CTCCCAGGGG AAAAGAATGT TAATTCTTAA GCTTAGAAAG GAAGGAAAAA 420 CATATAAGGA CATTCAAAAA ACCCTTAAAT GTTCTGCCAA AATGGTATCC AATGCCATTA 480 AATATAAATG GAAGCCCGAA AACCGTGGTA CCAAACATAA AACCACAGAT ATAGAGGATC 540 GACGCATTGT TTCTTACAGC AAAGTCTATC GTTTTGCATC CTTTAGGGAC ATAAAGTCTG 600 AGCTGAACTT GGGAATCAGC GACGTTACTA TTCGTAGACG ACTACTGAAT CAAAATTTCA 660 GTGCGAGGAG TCCACGAAAG GTTCCCCTAC CTAGCCCAAG GCATATTAAG GCAAGGTTAA 720 GCTTAGCTAA AACCTACCTA AACTGGCCAG TCTCCAAATG GCGTAATATC CTTTGGACTG 780 ATGGGTCAAA AATCATGCTA TTTGGTGGAA CTGGTTCACT ACAGTATATC TGACGACCTC 840 CAAACACGGA GTATCACCCA AAACACCCAG TGAAGACTTT CAATCACGGT GGACCTAAAA 900 TCATGGTATG GGCTTGTTTT TTTTATAATG GTATGAGTCA TGCTATGGAT TATGATTTAT 960 GGTATTATAG ACCAAAACGC ATATGTAAAT ATACTTAGTG ATGTCTTATT GTCATATTCT 1020 GAATAAAATA TACCCTTAAA ATGGACATTC CAACAGGATA ATGATCAGAA ACGCAGATGT 1080 AAATCGGCTA AGAATAGGTT CACCCAAAAT AGAATAGATG CAATGCCGTG GCAAGCACCA 1140 CCTTCCCATT TAAACCCGAT TGAAAACCTG TATGGGGACA TTAAACAGTT TGTGTCGAAG 1200 AAGTCCCCGA CGTCTAAGAC TCAGATTTGG CAAGTTGTGC AGGATACATG GGCAAAAATT 1260 CCTCCCAAAC CTTGCTAGGA CTTGGTGGAC TTCATGCCGC GTGGGTGTAA GGCTGTGCTG 1320 GCTAACAAAG GCTATCCAGC CAAGTATTAG GCCCGAATTA ACATATTAAA AAGAAAAACT 1380 AAGTTCGTTC TAGGTCAAGT TAAATTTTGT TACTATTTTT TCATAGCACT GCTATTTTAT 1440 TGAACACCAG AATTTCTGCC TATTTATTGT TTTAATCTAT ATTTTCGAAA CTATTGAAGA 1500 AATAAAAGTG AAACATTTGT TAAATTGTTT GAAATGAAAT ACCTAATGAT ATTATTAAAA 1560 AAAAATTCCC ATTAAAACTG TAAATCATAG GAATTTTTTA TCTTAAACTC TGAAGTCCAA 1620 AGCACTGCTA TTATTCTGAA CACAGCTGTA CAT 1653 // ID DM06920 standard; DNA; INV; 6083 BP. XX AC U06920; XX DR FLYBASE; FBgn0004141; HeT-A. XX FT source U06920:1015..7097 FT SO_feature five_prime_UTR ; SO:0000204:1..731 FT SO_feature three_prime_UTR ; SO:0000205:3497..6083 FT SO_feature polyA_sequence ; SO:0000610:6077..6083 FT SO_feature CDS ; SO:0000316:732..3497 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0016662; HeT-A\gag" FT /db_xref="REMTREMBL:AAC17188" FT /protein_id="AAC17188.3" FT /translation="MSMSDNLFSDDEVLSISSSPEQRSSPFYLNISPMSHGSDNSQINT FT VIINSKKLPSNQADISLKNSSGAAIKIVNSLSHKKKENTNVNNAQKDPLSLTNTTASTC FT GAKSSISEGKLSSPPSTSHTYEGKLLTKLTHTHTDFRGAKTSDAMGSFPSLSHSDNSIE FT KNLSSSTKIGPNASSPPSHAHTHTSKSTDISLESRSKHPALANTDARSIKANANDNGEI FT FSSLIQIDERKQEERPCTTINAFWSIFKPKPDVTKLSLKRKPTNPTKNTGKKCISPHKK FT SAYLCPSAQDDLNLNLNPKSSAKPTVVNLPAARILSRPAAKRDLFKSSSSRSPDEQPMS FT FSEVVAGTGSIFAAPCVPAPLTKTPGKRTNDDLDCSNFKTPNKKLCATSNFVTPSIFPP FT LITPVFKSKAAQSVYEESKARNGPPPPALACSINASARSAAAPPGIAPLPPHNTDAELP FT PWKIVPQSRRAPPILVNDVKEIVPLLEKLNYTAGVSSYTTRAIEGNGVRIQAKDMTAYN FT KIKEVLVANGLPLFTNQPKSERGFRVIIRHLHHSTPCSWIVEELLKLGFQARFVRNMTN FT PATGGPMRMFEVEIVMAKDGSHDKILSLKQIGGQRVDIERKNRTREPVQCYRCQGFRHA FT KNSCMRPPRCMKCAGEHLSSCCTKPRTTPATCVNCSGQHISAYKGCPAYKAEKQKLAAN FT NVDINKIRTIKDATNNFYKRQGPPLRNNTPRLPHSSAILSKSIAEARQEAARKSMLNPF FT RQNINDRRPRFSSHDTAIQKRLNKWRRNTNKIPKKGRIALKDNAKPRPAHRTSNPAQRH FT LEDYQDMLRRERSEENDQESEKGTPNTKQVGNDSPPTTSRAARASFKPRIIDDTTPSPK FT ICNPNSQKGLLDDPTTSLANRVDNLEKKIDILMALIIQGRNNNLDMDTSN" XX CC Derived from U06920.2 (Rel. 67, Last updated, Version 14). CC Michael Ashburner, 26-Jan-2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6083 BP; 2199 A; 1545 C; 1033 G; 1306 T; 0 other; TAAATAAATA AAATAAATTA AACAATTAAC TAAATAATTA AATAACTAAA ATTAATAATA 60 TAATCCGTTC GCTTGCCAAA GACTCTCACG CGCATAACTA ATTAAAATCG ATTTTCAAGT 120 TGACAAATAA ATGGTTTAAA ATTGTCCTCA GGCTGCAAAG AAAAGCCGCG GCAACAATAA 180 ACATTTAGTG ACACGCGAAA AGCGAACATT TGATTAGTGT AATACTTGTG CAAACCGACA 240 AGCTGCCGCC ATAACAAAAC GGAGACGAAG AATCATAAAG AACAAAAGCT AAATCCACCA 300 GCATAGCAAA AATAAATTAA CAAATAAAAT AAAAGCAAAT TTAAATAACA TAATAAATTA 360 AACTTATTTA ATAAACCAAT TAATTTTAAT TAATTCAATT AAACGCTAAA TCTACATAAT 420 ACTCCACGCG CAAATTAATT GAAATCGTCT TTCTAGTTAA TAAATTAAAA GTTTAAAAAT 480 TGTCTCCGGC CGCAAAATTT GAACCGCGAC GATAAAAACA TTTAATTGAC AAACAAAAAG 540 CGAACAATTA TTCAGTGAAC TATTTGTGCA AAATTGACAA GCAGACGCCA TAATTAAAAG 600 GAGAAGAAGC CAAAAGACGA AGAGAAGAAA GCAACCAGAA GAACTCAAAG AAGAAAAGGA 660 GGAAAGCCCA ATTAAAGAAA GCCAGGGTAT TTATACCTTA CACTTATCGT TTAATATAAC 720 AAAAACCCAA CATGTCCATG TCCGACAACC TTTTTTCTGA CGATGAGGTA CTTTCAATTT 780 CCTCAAGCCC AGAACAGCGA TCTTCTCCGT TCTACCTCAA TATATCGCCC ATGTCCCACG 840 GATCAGACAA TTCTCAGATT AATACAGTCA TCATTAATTC GAAGAAATTG CCCTCAAATC 900 AAGCAGACAT AAGTTTAAAA AACTCTTCTG GGGCTGCTAT AAAAATTGTT AATTCCCTTT 960 CACACAAGAA GAAAGAGAAC ACAAACGTTA ATAATGCCCA AAAAGACCCC CTCTCACTCA 1020 CCAATACTAC TGCAAGCACT TGTGGCGCCA AAAGCAGCAT CTCAGAGGGG AAATTGTCTT 1080 CTCCTCCGTC CACCTCACAC ACATATGAGG GGAAATTACT CACAAAACTT ACTCACACAC 1140 ACACAGACTT TAGAGGCGCC AAAACGAGCG ATGCAATGGG AAGTTTCCCC TCTCTCTCGC 1200 ACAGCGACAA TAGCATAGAG AAAAATCTGA GTTCTTCCAC CAAAATTGGA CCAAACGCTT 1260 CTTCCCCTCC TTCTCATGCA CACACTCACA CTAGCAAATC CACTGATATA AGCTTAGAAA 1320 GCCGCTCAAA ACATCCCGCG CTTGCCAATA CGGACGCACG CTCTATAAAA GCCAATGCTA 1380 ATGACAATGG GGAAATTTTC TCCTCACTTA TACAAATTGA CGAACGCAAG CAAGAGGAAA 1440 GGCCTTGCAC AACTATCAAC GCTTTTTGGT CTATTTTTAA ACCCAAGCCG GACGTTACTA 1500 AACTAAGTCT AAAGAGGAAA CCCACCAATC CCACTAAAAA CACTGGGAAA AAATGCATCT 1560 CCCCTCATAA AAAGAGCGCT TATTTATGCC CTTCCGCTCA GGATGATTTA AATTTAAATT 1620 TAAACCCCAA ATCTAGCGCC AAGCCCACTG TGGTGAATTT ACCAGCTGCC CGCATCCTAA 1680 GCCGGCCTGC AGCCAAGCGG GATTTATTTA AATCATCATC CTCCCGAAGC CCAGACGAGC 1740 AGCCTATGAG TTTTTCGGAA GTGGTCGCTG GCACGGGTTC AATTTTTGCG GCACCCTGTG 1800 TCCCGGCACC TTTAACGAAA ACTCCAGGCA AGCGGACAAA CGACGATCTG GACTGCTCCA 1860 ACTTTAAGAC GCCCAATAAA AAATTATGCG CGACTTCCAA CTTTGTAACT CCCAGCATTT 1920 TTCCGCCGCT CATCACTCCC GTTTTCAAGA GCAAGGCAGC TCAATCTGTT TACGAGGAAT 1980 CCAAAGCCAG AAATGGACCC CCCCCGCCGG CCCTCGCCTG CAGCATCAAT GCCTCTGCTC 2040 GCAGCGCAGC GGCGCCACCC GGGATCGCCC CCCTACCCCC TCATAATACA GATGCAGAGC 2100 TGCCTCCATG GAAAATCGTG CCCCAGAGCC GTAGAGCACC TCCTATACTC GTCAATGATG 2160 TAAAGGAAAT TGTACCTCTA CTGGAAAAGC TGAACTACAC AGCAGGAGTC TCCAGCTATA 2220 CTACTAGGGC TATAGAAGGA AACGGGGTCA GGATACAGGC AAAGGACATG ACCGCCTATA 2280 ACAAAATTAA AGAAGTCCTG GTGGCCAACG GACTTCCTTT ATTCACCAAC CAGCCCAAGT 2340 CCGAGAGAGG CTTCCGAGTC ATCATCAGAC ATCTCCACCA CTCCACACCA TGCTCGTGGA 2400 TAGTCGAGGA ACTGCTGAAG CTCGGATTCC AAGCGCGATT CGTCAGAAAT ATGACGAATC 2460 CGGCTACAGG TGGCCCCATG CGAATGTTTG AAGTGGAGAT CGTCATGGCC AAAGACGGCA 2520 GTCATGACAA AATACTCTCA CTCAAACAAA TCGGTGGGCA AAGGGTGGAC ATTGAAAGGA 2580 AAAACAGGAC ACGGGAGCCA GTCCAGTGCT ACAGATGCCA AGGCTTCAGG CATGCCAAAA 2640 ACTCTTGCAT GAGGCCGCCA AGATGCATGA AATGCGCTGG CGAACACCTG TCTTCCTGTT 2700 GCACCAAACC AAGAACCACC CCCGCCACCT GCGTAAATTG CTCTGGGCAG CATATTAGCG 2760 CGTACAAAGG ATGCCCTGCA TATAAGGCGG AAAAACAAAA GCTGGCGGCA AACAACGTTG 2820 ACATAAACAA AATAAGAACA ATCAAAGACG CAACAAATAA CTTTTATAAA CGTCAAGGCC 2880 CCCCTCTACG CAACAACACC CCTCGGCTAC CGCACAGCTC AGCAATCCTG AGCAAATCAA 2940 TTGCCGAAGC TCGCCAGGAG GCAGCCAGAA AGTCGATGTT AAATCCATTC CGACAAAATA 3000 TAAACGACAG AAGACCACGA TTCTCCTCCC ACGACACGGC CATTCAGAAG CGTCTGAATA 3060 AATGGCGCCG AAACACCAAC AAAATACCCA AAAAGGGTAG GATAGCCTTA AAGGATAATG 3120 CAAAGCCACG ACCGGCACAT AGGACAAGTA ACCCAGCGCA AAGACATCTG GAGGACTACC 3180 AGGACATGCT CCGAAGGGAA AGGAGTGAAG AAAACGACCA GGAATCTGAG AAGGGCACCC 3240 CCAATACCAA GCAGGTCGGC AATGACAGCC CTCCGACCAC GAGCAGAGCA GCCAGAGCCA 3300 GCTTTAAGCC AAGAATCATT GACGATACCA CGCCATCGCC AAAAATCTGC AATCCCAACT 3360 CACAAAAAGG CCTCTTGGAC GACCCCACAA CAAGCTTAGC TAATAGAGTC GACAATTTAG 3420 AAAAGAAAAT TGACATTTTA ATGGCCTTAA TCATACAAGG AAGAAATAAC AATCTTGACA 3480 TGGATACATC CAATTAATCT TACAACTACT TATATATTCT TTAATAAATA TATCCAATAG 3540 AAAAGCGCAC GTCGGTCTGC TTTTAAAATC CTTCACCGTC ATCACCTTCC TCGACGGAGC 3600 CTAATTTATT GGAAAAATAA ATCAATTATA TGTTGGCACA AAAATGTAAA CACACACTCA 3660 CCTAAACGCA CCCGGACGAA CAAGCCTATG ACAACGCACT CCAGCTGATC TGTAAGAAAC 3720 AAAAAATATG AATAGATAGA TCGATATGAA AAGGATATGT GCGGCAGAAA CATGATGAGC 3780 AAAAGGCGAC TCGCTGCAGC AACTTATGCA CAACGTCACT TACCTGAAAT TTCTTGCCGT 3840 ACGATCTCCT GTAGTATCCC TTATCACAGC TGCAATCTAC TTGCAATGCT GCACTGCAAT 3900 AAACGTACTA CAAAAGCTGC ATACGTTTTG ATCAGGACAC CTCGTGCGGA CGTGCTAAAA 3960 AAAATTTCCT TTCTGCTGCT CTTATTGACG CTAAAACCTT AAAACCTACA AACAAAACAA 4020 TTAAATAATA ACAAATCAAA TAAGACAACC AAATAATACA CTTACCTCAT TGACTGCAGC 4080 TAAATCGCTG ACCCACATTC AGTGCAGCCG ACAGCAGGAG ACGGGCCCGC AAAAGCAAAA 4140 CAAAATCGCC AATTTTGCGA TTATAAACAC GAAAAATTGA CAATTTTGCG ATGCCGTCTC 4200 CGCCTCCTGA TGCCACTGCA TTGACAAGCA TCACTAGCGA GGAGCTGACA CCACACCAAA 4260 AAGCTGTAAA ATCCGTCCAC AAATTGTATA TTTTGCCTCA GTGTCGTATC TGCAATGTTT 4320 TTCCGATAAC CTGTAAGGAA AGAAAAATTA ATAAGAAAAT TATACAAAAT TAATTAAGGA 4380 CGACAGAAAA TAGCAAACCA GACAGGCAAA TTAACAGATA CAAATATGAG ACTCCATCCT 4440 GCTGCCGACA CACAAGTAAA TCCTTCAACT CGACAACAGG AGACGGGCCT TGCAAAAGCA 4500 AAACAAAATC GCCAACTTTT GCGATTATAA ATACAAAAAA TTGACAATTT TGCAACGCCG 4560 TCTCCACCTC CTGTTGCCAC TGCATTAATA AGGATCACCA GCGCGGCGTG ACGCCACACT 4620 AAAAGGCTGC AAAATCCGTC CACAAAATGT ATACTTTTCC TCAGTACAAT ACTTTCTAAT 4680 GAACTTCCGC CAACCTGCAA TGAAAAGAAA AGAAATAGGT ATATAAAACA AAACAAACAA 4740 AAGGACAACC TAAAATTAGC AAACCAGACA GGCATACTAG TAGATGCTAA TATGCAGCTC 4800 CATCCTACTG ACGACAACCA CGCAACTCCT TTCTCCAAGA CCGCAAATAC TGAAACAAGG 4860 AAGCACAAGC TAATACTGGG AATTATTTAT TTAAACAAAA ATACTTATCT AATTGCCAAT 4920 TCGACGACTC CAAATCCGCG GCTAACCGGC GGCGATGGCC CATAAATAAA GGGCCTCCTA 4980 ATTAATTACA AAATGTACCT GAAAAACATA AAATTAACGC AACTATAATT AACGCAATTA 5040 ATAAATCAAA TAAATACAAG TATAATACTT ACCTCCAAGC AAACGTACCT GAAAAACAAA 5100 ACCAAAAAAA AAATTAATGC AATAAATAAA TCAAATAAAT ACAAACATAA TACTTACCTC 5160 CAATTTACCT CCCAGCCAAT CTACCTGAAA AACATAATCT AATACAATCT CAAAAACAAA 5220 TAACAAATGT AATACTTACC AAATTTTAAT TTTGTATTCA TTTCCATGAC CCCAACGCTG 5280 CAACTGTCCT CGGCAACAAT TCCTGTTCCG GCGGCTCCAT GCTGCCAATC CTGACGCACT 5340 GGCCACAAGA CGCGGCGCTG CTGGCAATCT CTCGATGAAC AACCGATCTA CAATTTCCAT 5400 GACGACTCCT CTGTCACGAT GAGACAGAAG ACACCACCAA CGCCAGCAGC TCCAAAACAA 5460 TACAACAACG GCCGCGCGGA ACCCATCTTC AGAATTCCCT CTTCCTGACG ACCGGCGAAC 5520 GAGTTCTGGA ATAAACAATG TATTAATTGC AAACATCTAC CGATGAGGGT AGAAGAGATA 5580 CTCACCAAAC GACTGCGGCG CGGGAACAAA CTAACTGCAA CGCCGGCCGG ACCTATTTGT 5640 TGCAAGTGGC GCGCATCCAG CGCCTGCAAC ATGCCCCAGC CCAAGTACAC AACTACTTAC 5700 CTGCAACGTC GCCAGAGGCT CCCAGCGAAT CGGTGCTTCC GTCCTTCTGG CGGGGGTACC 5760 TGAAAAGAAA CAAATTAAAC AATATTAATC CTAAATTTCA ATGTTTTTTG TAAAATAATT 5820 TAAATTGTTA AATGTAAACA AGCCTTGCAA TATGTTAATG TTACCAGTCC ATGCTACTGT 5880 CTAAAAGCCA AGAATACAAA AAATACTAAT TATAAACTAA CTCACCACGC CCAACCCCCA 5940 AACTCACCCC ATGCAATGTT AAACCTATAA ATTCAAATAA TTGTACCTAT ATATTGCACA 6000 TACTGTAATC AAAGGCAAAA TAAATCGTGG ATGCGGAACA GAATTTACTC TGTCTCCGTA 6060 CCTCCACCAG CAAAGTTAAA AAA 6083 // ID DMIFACA standard; DNA; INV; 5371 BP. XX AC M14954; XX DR FLYBASE; FBgn0001249; I-element. XX FT source M14954:100..5470 FT SO_feature CDS ; SO:0000316:187..1467 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0020417; I-element\gag" FT /db_xref="SPTREMBL:Q24362" FT /protein_id="AAA70221.1" FT /translation="MTDPPNIYKITSKTYQSQLGEPKFIIIKRNDNNSFERTSPFIIKK FT SVDFACGGEVEGCKRTRDGNLLIKTKNELQARKLLKLTKIADEDVTASEHKTLNFSKGV FT IYCNDLRHIDEDTILQELKPQKVSEVKKIMKRQNPNSNSDTNNITLVETGLIIITFESH FT KLPEIVRIGYETVRVRDYIPLPLRCKKCLRFGHPTPICKSVETCINCSETKHTNDGEKC FT TNEKNCLNCRNNPELDHQHSPIDRKCPTFIKNQELTAIKTTQKVDHKTAQHIYFERHGF FT QTKNTYAKTLTNGTTQRTTNTPSPNIHTNTTQSQQQNPHHTPKSAAQNTSAKTPTTEPA FT KTTLLSNQPHQHHHHHSYDKLEDMDTDYTPTRKPSTTYSSQLTEDLKIKIFPKDKSNNL FT SINLKASKLKAKAHKNKHTNNSDSESI" FT SO_feature CDS ; SO:0000316:1938..5195 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0020418; I-element\RTase" FT /db_xref="SPTREMBL:Q24363" FT /protein_id="AAA70222.1" FT /translation="MAPSWGSPTTNKRGKITHRFIDNMHLILLNDKSPTHFSTHNTYTH FT IDLTLCSPILAPHAKWKILNDLHGSDHFPIITTLFPTTNPQKFYRPFFKLKEANWEQFN FT ALTHQTNKKYPTSHNVNKEAALINRIILYSANLSIPQTSPNTHPYRVPWWNKHLDQLRK FT EKQLAWKKLNRTITVDNILDYRRKNAIFRYELKKRKKEASSSFTSTIHPTTPSSKIWAN FT IRRFCGLNPAKQIHAITNPVNNETTLASNEIANIFAQHFSDLSGDWNFSEEFRNNKYRN FT NIHLYTPSPIAQTIEENITYLELSSALQTLKGCAPGLNRISYQMIKNSSHTTKNRITKL FT FNEIFNSHIPQAYKTSLIIPILKPNTDKTKTSSYRPISLNCCIAKILDKIIAKRLWWLV FT TYNNLINDKQFGFKKGKSTSDCLLYVDYLITKSKMHTSLVTLDFSRAFDRVGVHSIIQQ FT LQEWKTGPKIIKYIKNFMSNRKITVRVGPHTSSPLPLFNGIPQGSPISVILFLIAFNKL FT SNIISLHKEIKFNAYADDFFLIINFNKNTNTNFNLDNLFDDIENWCSYSGASLSLSKCQ FT HLHICRKRHCTCKISCNNFQIPSVTSLKILGITLNNKYKWNTHINLLLPKLHNKLNIIK FT CLSSLKFNCNTHTLLNVAKATIIAKLEYGLFLYGHAPKSILNKIKTPFNSAIRLALGAY FT RSTPINNLLYESNTPPLEMKRDLQIAKLSQNLILSKNTPIHKFLKPKKANKKKTSTIDR FT TIKLSLELNLPYKPIKLHKNKPPWTLPNLIDTSLRIHKKEQTSPDQYRKLYEHTKNNLK FT THNFIFTDGSKINYTISFAITTETDVLKYGILPPYSSVLTSETIAILEAIELTKNRRGK FT FIICSDSLSAVDSIQNTNNNSFYPSRIRSLITQHAPKIKIMWIPGHSGIKGNELADQAA FT KSASSMPLILTPNINTTDIKKHLKADLATKQKEHIINCSPWYQSINTNTSHPCDYLKQS FT HPNWTRLDQIKIIRLRLGHTNITHQHYLNPNSIPTCPFCQGDISLNHIFNSCPSLLQTK FT QDIFNNTNPLDLLSKPNPDNIQKLILFLKKTKLYHKI" XX CC Derived from M14954 (g157749) (Rel. 44, Last updated, Version 2). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5371 BP; 2176 A; 1446 C; 606 G; 1143 T; 0 other; CATTACCACT TCAACCTCCG AAGAGATAAG TCGTGCCTCT CAGTCTAAAG CCTCGCTTCG 60 CGTAAGCCCA AAACTCTTAT CAGCAAAATC TTGATAAACA AATATCAACC ACAAAGAGAA 120 AATAAAAAAC TTAACAACAA AAACAACAAT ACCGCTAATC CGGGCTCAAG CCCTTAACCA 180 ACAATCATGA CAGACCCACC AAACATTTAC AAAATCACTT CAAAAACATA CCAATCCCAA 240 TTAGGCGAAC CTAAATTTAT AATTATTAAA AGAAATGACA ACAACTCTTT CGAAAGAACT 300 TCACCATTCA TCATAAAAAA ATCGGTGGAC TTTGCCTGTG GAGGAGAAGT TGAGGGATGC 360 AAACGTACAA GAGACGGCAA CCTGCTAATA AAAACCAAAA ATGAATTACA AGCCAGAAAA 420 CTCCTAAAAC TAACAAAAAT TGCAGATGAG GATGTAACAG CAAGTGAACA TAAAACATTA 480 AACTTCTCTA AGGGAGTTAT TTACTGTAAC GACCTTAGAC ACATCGACGA AGACACAATT 540 CTACAAGAAC TAAAACCACA AAAAGTATCT GAAGTTAAAA AAATAATGAA ACGGCAAAAC 600 CCCAACTCTA ACTCCGACAC CAACAACATC ACATTAGTTG AAACTGGACT CATAATTATA 660 ACCTTTGAAT CGCATAAGCT CCCCGAGATA GTACGAATCG GGTACGAAAC AGTCCGAGTA 720 CGAGACTATA TCCCACTCCC ACTTCGATGC AAAAAATGCC TCCGCTTCGG TCATCCAACA 780 CCCATATGCA AAAGTGTAGA AACTTGCATC AATTGCTCTG AAACAAAACA CACAAACGAC 840 GGAGAAAAAT GCACAAACGA AAAAAACTGC TTAAATTGCC GAAATAACCC AGAACTTGAC 900 CATCAACACA GCCCAATTGA CCGCAAATGC CCTACGTTCA TAAAAAACCA GGAATTAACA 960 GCAATTAAAA CCACACAAAA AGTTGACCAT AAAACGGCCC AACACATATA TTTCGAACGT 1020 CACGGCTTCC AAACGAAAAA CACCTACGCC AAAACACTTA CAAACGGCAC AACCCAGAGG 1080 ACAACAAACA CTCCATCACC TAATATTCAC ACAAACACAA CCCAATCACA ACAACAAAAT 1140 CCGCACCACA CACCCAAATC AGCAGCACAA AACACTTCAG CTAAGACACC AACAACTGAA 1200 CCAGCCAAAA CAACCTTACT ATCCAACCAA CCACACCAAC ACCACCACCA CCACAGCTAC 1260 GACAAACTAG AAGACATGGA TACCGACTAC ACACCTACCA GAAAACCATC TACGACATAC 1320 TCATCACAAC TCACAGAAGA CCTAAAAATA AAAATCTTCC CTAAAGATAA GTCCAATAAC 1380 CTATCCATAA ACCTTAAAGC ATCAAAACTA AAGGCCAAAG CCCACAAAAA CAAGCACACT 1440 AACAACAGCG ACAGCGAATC CATATAGAAC TCTACACAAA ACCCTAACCG TTAACACTAC 1500 CTTTAAGTAA GTTATAAGCT TTAATTTTCT CACAAATGTC CCTAACTATA ATCCAATGGA 1560 ATCTAAAAGG ATATCTAAAC AACTACAGCC ATCTCCTTAT TCTAATCAAA AAATACTCCC 1620 CCCACATAAT TTCCCTCCAA GAAACCCATA TACAATACAC TAATAACATT CCAACCCCAA 1680 TAAACTACAA ACTATTAACA AATATTGCCA CCAACAGATT TGGGGGGCGT ACGACTACTA 1740 GTGCATAAGT CAATACAACA CACTGTCCTC AACATAACAA TCGATATAGA AGCAATAGCC 1800 ATAAATATAG AATCTAAACT TAAATTAAAC ATATTTTCCA CATACATTTC TCCGACCAAA 1860 AACATAACTA ACCAGACACT CCATAACACA TTTAACATAC AACAAACACC CTCTCTAATT 1920 ACGGGAGATT TTAATGGATG GCACCATCCT GGGGCTCCCC AACAACAAAT AAACGAGGAA 1980 AAATAACTCA TAGATTCATT GACAACATGC ACCTTATCCT GTTAAACGAC AAATCTCCCA 2040 CACACTTTTC AACACACAAT ACATACACAC ACATAGACCT CACACTCTGC TCTCCAATCC 2100 TAGCCCCCCA CGCCAAGTGG AAAATACTAA ACGATCTTCA CGGTAGCGAC CATTTCCCTA 2160 TTATCACAAC ACTATTCCCA ACAACCAATC CACAAAAATT CTACAGACCC TTTTTTAAAC 2220 TCAAAGAAGC CAACTGGGAA CAGTTCAACG CTCTTACCCA CCAAACCAAC AAGAAATACC 2280 CCACCTCCCA CAACGTAAAC AAAGAAGCCG CTCTAATCAA TAGAATCATC CTTTATAGCG 2340 CAAACCTCTC CATCCCACAA ACCTCACCTA ACACACATCC ATACAGGGTT CCATGGTGGA 2400 ATAAACACCT CGACCAATTA CGTAAAGAAA AACAACTTGC CTGGAAAAAA TTAAACCGCA 2460 CAATTACTGT TGACAACATT CTAGACTATA GACGCAAAAA CGCAATATTT AGATACGAAC 2520 TAAAAAAGAG GAAAAAAGAA GCTTCCAGCT CTTTCACCTC AACCATCCAT CCCACTACTC 2580 CCTCATCCAA AATATGGGCC AATATAAGAC GCTTCTGCGG ACTTAACCCA GCAAAACAAA 2640 TTCATGCCAT CACAAACCCA GTAAATAACG AGACTACATT GGCTAGCAAC GAAATTGCTA 2700 ACATATTCGC ACAACATTTC TCTGACCTCT CCGGCGACTG GAACTTCTCA GAGGAGTTCC 2760 GGAACAATAA ATATAGAAAT AACATACATC TCTACACCCC CTCTCCAATA GCCCAAACCA 2820 TAGAAGAGAA CATAACGTAT CTAGAACTTA GCTCAGCACT ACAAACATTA AAAGGATGTG 2880 CTCCAGGACT AAATAGAATC TCGTATCAAA TGATCAAAAA TAGCTCCCAC ACAACAAAAA 2940 ACCGAATAAC GAAACTATTT AATGAAATAT TCAATAGCCA CATACCTCAA GCCTACAAAA 3000 CAAGCCTAAT CATCCCAATC CTTAAGCCAA ACACCGACAA AACGAAAACT TCCTCATACC 3060 GACCCATCTC CCTCAACTGC TGTATAGCAA AGATACTTGA TAAAATAATT GCGAAAAGAC 3120 TCTGGTGGCT AGTGACATAT AACAACCTAA TTAACGACAA ACAATTCGGG TTCAAAAAAG 3180 GCAAATCGAC TTCGGACTGT CTACTCTATG TAGACTATCT CATAACGAAG TCAAAAATGC 3240 ACACCTCCCT CGTCACTCTT GATTTTTCAA GAGCCTTCGA TCGAGTAGGT GTGCACTCCA 3300 TAATCCAGCA ATTGCAGGAA TGGAAAACGG GTCCCAAAAT AATAAAATAC ATTAAAAACT 3360 TCATGAGCAA CAGAAAAATA ACTGTCCGCG TCGGTCCGCA TACATCAAGC CCGTTACCCC 3420 TATTCAACGG AATCCCCCAA GGTTCACCCA TATCCGTAAT ACTTTTCCTC ATAGCATTCA 3480 ACAAATTATC CAACATCATA TCCCTACATA AAGAAATTAA ATTCAACGCA TATGCCGACG 3540 ACTTCTTCCT TATAATAAAT TTCAACAAAA ACACAAATAC AAATTTCAAC TTAGACAATC 3600 TATTCGACGA TATAGAAAAT TGGTGCTCCT ACTCAGGGGC ATCGCTTTCC CTATCCAAAT 3660 GTCAACACCT CCACATATGC AGAAAACGTC ACTGCACATG CAAGATAAGC TGCAACAACT 3720 TCCAAATTCC TAGCGTTACG TCCTTAAAAA TTCTAGGAAT AACCTTAAAC AACAAATACA 3780 AATGGAACAC ACACATAAAC CTACTTCTAC CCAAACTACA CAACAAGCTA AATATAATAA 3840 AATGCCTATC TAGTCTTAAA TTTAACTGCA ACACGCATAC ACTACTTAAT GTCGCAAAAG 3900 CAACAATTAT AGCCAAACTA GAGTATGGTT TGTTTCTGTA CGGCCATGCT CCCAAAAGCA 3960 TTTTAAACAA AATAAAAACA CCGTTTAACT CCGCTATCCG TCTAGCTCTC GGCGCATATC 4020 GCTCTACCCC AATAAATAAC TTACTTTACG AATCGAATAC TCCCCCCTTA GAAATGAAAC 4080 GAGACCTTCA AATAGCCAAA CTATCCCAAA ACCTAATCCT CTCCAAAAAC ACACCAATAC 4140 ATAAGTTCTT AAAGCCTAAA AAAGCTAATA AGAAAAAAAC ATCAACAATA GACCGAACAA 4200 TCAAACTTAG CCTAGAACTT AATCTACCCT ACAAACCAAT AAAACTCCAT AAAAACAAAC 4260 CACCATGGAC CCTCCCCAAT CTAATAGACA CGTCACTTAG AATCCATAAG AAAGAACAAA 4320 CATCTCCAGA CCAATACAGA AAATTATACG AACACACAAA GAATAACCTC AAAACACACA 4380 ATTTCATATT CACTGACGGT TCAAAAATTA ATTACACAAT ATCATTCGCC ATTACAACGG 4440 AGACAGACGT CTTGAAATAC GGCATACTGC CCCCATATTC ATCCGTCCTC ACCTCCGAAA 4500 CAATCGCCAT CCTAGAAGCA ATAGAACTTA CTAAAAACCG AAGAGGCAAA TTTATTATCT 4560 GCTCCGACTC CCTATCAGCA GTAGATTCAA TTCAAAACAC AAATAATAAC AGCTTTTACC 4620 CAAGCAGAAT ACGATCGCTA ATAACGCAAC ACGCACCTAA AATTAAAATA ATGTGGATTC 4680 CTGGCCATTC AGGAATAAAA GGAAATGAAT TAGCCGATCA AGCTGCAAAA TCAGCAAGCA 4740 GTATGCCACT TATCCTCACC CCAAACATAA ATACCACAGA TATAAAAAAA CACCTTAAAG 4800 CCGACCTTGC GACAAAACAG AAAGAACACA TAATAAACTG CAGTCCATGG TACCAATCTA 4860 TTAACACGAA CACCTCACAC CCATGCGATT ACCTTAAACA ATCCCACCCA AATTGGACCA 4920 GACTCGACCA AATAAAAATA ATACGACTTC GACTAGGACA CACAAACATA ACCCACCAAC 4980 ACTACCTAAA TCCCAATTCA ATACCAACTT GCCCGTTTTG CCAAGGTGAT ATTTCTTTAA 5040 ACCACATATT TAACTCATGC CCATCCCTCC TACAAACCAA GCAAGATATA TTTAACAACA 5100 CCAACCCTCT AGACCTTCTT AGCAAACCCA ATCCAGATAA CATACAAAAA CTCATACTTT 5160 TCCTCAAAAA AACTAAATTA TACCACAAAA TCTAAAAACA AAACAGGCAT TTGTACATAA 5220 CAAGCCAGCA ATTAGTTACC AAATTAGATA TTAACTAAAT TAAGATATAA TAACATTGTA 5280 AATAAATATA GCTGTAAGCC CCGTAGCTAA TGCTATACTA TCTAAGTTAG TCTAGTTTTG 5340 TAAACTATTC TATCTATCAT AATAATAATA A 5371 // ID DMLINEJA standard; DNA; INV; 5020 BP. XX AC M22874; XX DR FLYBASE; FBgn0001283; jockey. XX SY synonym: wallaby SY synonym: sancho XX FT source M22874:115..5134 FT SO_feature CDS ; SO:0000316:300..2051 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0020297; jockey\gag" FT /db_xref="SWISS-PROT:P21330" FT /protein_id="AAA28674.1" FT /translation="ISIALYGISIKTIDIMENSFAQSRPSNGCDKFEKMRKVAGVEPGE FT LRSQLRASCAVVSPNLEGMPTQSAVSSLMVTISSNTNASVTCTISNVQANMICTPTYTD FT CTTVTTSICPTTPYDNGLPTPLSSLPNKPSKANCPFQAHDRTVNRKRKGVSQPPLPILT FT PSPSRKTKRQATMPLNEEASTSTAAALNNNRFALLSAEAENMEQDVSDADSDIEDSAAR FT DGGGQSAKYSKPPAICVPSVSDPVTLERALNLSTGSSNYYIRISRFGVSRIYTANPDAF FT RTAVKELNKLNCQFWHHQLKEEKPYRVVLKGIHANVPSSQIEQAFSDHGYEVLNIYCPR FT KSDWKNIQVNEDDNEATKNFKTRQNLFYINLKQGPNVKESLKITRLGRYRVTVERATRR FT KELLQCQRCQIFGHSKNYCAQDPICGKCSGPHMTGFALCISDVCLCINCGGDHVSTDKS FT CPVRAEKAKKLKPRSRLPMTNNIATLKPPQRSSSGYIPAEALRTNISYADIARRNTTQS FT RARATVQAEVIPTSDNSLNNKFMTLDNSIRAINTRMDELFKLIHETVEANKAFRELVQV FT LITRIPK" FT SO_feature CDS ; SO:0000316:2048..4798 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0015952; jockey\pol" FT /db_xref="SWISS-PROT:P21328" FT /protein_id="AAA28675.1" FT /translation="MTQPTLKIGLWNARGLTRGSEELRIFLSDHDIDVMLTTETHMRVG FT QRIYLPGYLMYHAHHPSGNSRGGSAVIIKSRLCHSPLTPISTNDRQIARVHLQTSVGTV FT TVAAVYLPPAERWIVDDFKSMFAALGNKFIAGGDYNAKHAWWGNPRSCPRGKMLQEVIA FT HGQYQVLATGEPTFYSYNPLLTPSALDFFITCGYGMGRLDVQTLQELSSDHLPILAVLH FT ATPLKKPQRVRLLAHNADINIFKTHLEQLSEVNMQILEAVDIDNATSLFMSKLSEAAQL FT AAPRNRHEVEAFRPLQLPSSILALLRLKRRVRKEYARTGDPRMQQIHSRLANCLHKALA FT RRKQAQIDTFLDNLGADASTNYSLWRITKRFKAQPTPKSAIKNPSGGWCRTSLEKTEVF FT ANNLEQRFTPYNYAPESLCRQVEEYLESPFQMSLPLSAVTLEEVKNLIAKLPLKKAPGE FT DLLDNRTIRLLPDQALQFLALIFNSVLDVGYFPKAWKSASIIMIHKTGKTPTDVDSYRP FT TSLLPSLGKIMERLILNRLLTCKDVTKAIPKFQFGFRLQHGTPEQLHRVVNFALEAMEN FT KEYAVGAFLDIQQAFDRVWHPGLLYKAKRLFPPQLYLVVKSFLEERTFHVSVDGYKSSI FT KPIAAGVPQGSVLGPTLYSVFASDMPTHTPVTEVDEEDVLIATYADDTAVLTKSKSILA FT ATSGLQEYLDAFQQWAENWNVRINAEKCANVTFANRTGSCPGVSLNGRLIRHHQAYKYL FT GITLDRKLTFSRHITNIQQAFRTKVARMSWLIAPRNKLSLGCKVNIYKSILAPCLFYGL FT QVYGIAAKSHLNKIRILQAKTLRRISGAPWYMRTRDIERDLKVPKLGDKLQNIAQKYME FT RLNVHPNSLARKLGTAAVVNADPRTRVKRRLKRHHPHDLPNLVLT" XX CC Derived from M22874 (g157823) (Rel. 47, Last updated, Version 5). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5020 BP; 1527 A; 1198 C; 1055 G; 1240 T; 0 other; AAAAATCATT CACATGGGAG ATGAGCAATC GAGTGGACGT GTTCACAGAA GTCGCGAGAT 60 AAAACAAAAA CGTAATTGTG ATCCATCACA AACATCTGCG CAGATCGTGT GCTTATCTCA 120 CAAACAAAAT CTATTTTTAG TCACTGCATA ACGGTGACGG CTTCGGTTCG CGAAACTTAT 180 CAGCAACTAG CAATTTCTAA GCTGTGTTGT TTTTGCCCCT CGCCCTGCGC GCTGCGCAAG 240 CGGGAGGTTG TTACAATTTA CCTTACAAGT AAACCGGTAA ATCTTATCGT GTTTAGTAAA 300 TATCAATTGC ATTATACGGC ATAAGTATAA AGACAATTGA TATAATGGAG AATTCATTTG 360 CTCAATCGCG ACCTAGCAAT GGGTGCGATA AATTTGAGAA AATGAGGAAA GTAGCAGGTG 420 TTGAGCCAGG AGAATTACGC TCCCAACTCC GCGCCAGCTG TGCAGTTGTT TCCCCTAACC 480 TGGAAGGTAT GCCAACTCAA TCTGCGGTCT CCAGCTTAAT GGTGACAATC AGCAGCAACA 540 CCAATGCAAG TGTTACCTGC ACTATTTCTA ACGTACAGGC CAACATGATC TGTACTCCTA 600 CATACACTGA TTGCACAACC GTGACCACTA GCATTTGCCC AACTACGCCT TATGACAATG 660 GACTGCCGAC ACCTCTGTCA TCACTGCCCA ATAAGCCATC TAAAGCGAAT TGCCCCTTTC 720 AAGCACATGA TCGTACTGTC AACAGGAAAC GAAAAGGCGT GTCTCAGCCC CCATTACCTA 780 TCCTCACCCC TTCTCCAAGC CGTAAAACTA AAAGGCAGGC CACTATGCCA CTCAATGAGG 840 AGGCCTCTAC CTCCACTGCA GCAGCATTAA ATAACAATCG CTTCGCGCTT TTGTCCGCTG 900 AAGCGGAGAA TATGGAGCAA GACGTGTCGG ATGCTGATTC TGACATTGAA GACTCTGCTG 960 CCCGAGATGG TGGTGGACAA TCCGCTAAAT ATAGCAAACC CCCAGCCATA TGCGTACCAA 1020 GTGTAAGCGA TCCGGTCACC TTGGAACGGG CTCTCAATCT GAGCACCGGC TCCTCAAACT 1080 ACTACATCCG CATTTCTAGA TTTGGTGTAT CCAGAATCTA TACAGCCAAC CCTGATGCTT 1140 TCCGCACCGC TGTAAAAGAA CTAAATAAGT TAAATTGTCA ATTCTGGCAT CACCAACTTA 1200 AAGAAGAAAA ACCCTACAGA GTAGTGCTTA AAGGAATCCA TGCTAATGTT CCTAGTTCGC 1260 AGATAGAACA AGCATTTAGT GATCACGGCT ATGAGGTCCT TAATATCTAT TGCCCCAGAA 1320 AGTCTGACTG GAAGAACATT CAGGTAAACG AAGATGATAA TGAAGCTACA AAAAACTTCA 1380 AAACTAGACA AAATTTGTTT TATATTAATC TTAAACAAGG CCCGAATGTT AAAGAGTCTC 1440 TTAAGATAAC TCGACTTGGC AGATACAGAG TCACTGTTGA GCGCGCTACA CGTAGAAAAG 1500 AACTGCTACA ATGTCAAAGA TGCCAAATTT TTGGACACTC TAAGAACTAT TGCGCCCAGG 1560 ATCCTATTTG TGGTAAATGT AGTGGTCCCC ATATGACCGG GTTCGCTTTG TGCATAAGTG 1620 ACGTATGTCT GTGTATAAAT TGTGGTGGTG ATCATGTCTC GACAGACAAA AGCTGCCCTG 1680 TCAGAGCAGA GAAAGCCAAG AAGCTAAAAC CAAGGTCCAG GCTACCGATG ACTAATAATA 1740 TTGCCACACT CAAACCTCCA CAACGTTCTT CAAGCGGTTA CATACCAGCT GAGGCATTAA 1800 GAACCAACAT CTCTTATGCT GATATTGCTC GACGCAACAC GACTCAATCT AGGGCTCGTG 1860 CTACTGTGCA GGCTGAAGTT ATACCAACGT CGGACAATAG CCTTAACAAT AAATTTATGA 1920 CGTTAGACAA CTCCATTCGG GCCATCAATA CGAGAATGGA CGAACTATTT AAGCTTATAC 1980 ACGAAACTGT AGAGGCTAAT AAAGCTTTCA GAGAACTGGT TCAGGTTCTA ATTACACGTA 2040 TTCCTAAATG ACTCAACCAA CCTTAAAAAT CGGATTGTGG AACGCTCGCG GATTAACAAG 2100 GGGCTCTGAG GAGCTTCGGA TATTCCTCAG CGATCACGAT ATAGACGTAA TGCTTACCAC 2160 GGAAACACAC ATGCGAGTTG GTCAGCGCAT CTATCTCCCA GGGTATCTTA TGTATCACGC 2220 CCACCACCCC AGTGGTAACA GTAGAGGTGG CTCTGCAGTC ATCATAAAAT CTAGACTTTG 2280 TCACAGCCCT CTGACACCTA TCTCTACTAA TGACAGGCAG ATAGCGAGAG TGCACCTGCA 2340 AACATCGGTT GGGACCGTCA CTGTAGCTGC TGTTTATCTA CCTCCAGCAG AAAGATGGAT 2400 AGTAGATGAC TTCAAATCCA TGTTTGCTGC GTTAGGCAAC AAATTTATTG CTGGTGGTGA 2460 TTACAATGCC AAACATGCAT GGTGGGGGAA CCCAAGATCC TGTCCTAGAG GTAAAATGTT 2520 GCAAGAAGTC ATTGCACATG GGCAATACCA AGTTCTGGCT ACGGGCGAAC CCACTTTCTA 2580 CTCTTACAAC CCTTTGTTAA CACCATCAGC CCTTGATTTT TTTATAACCT GTGGGTACGG 2640 CATGGGCAGG CTAGATGTAC AAACTCTCCA GGAACTCTCG TCGGACCATC TTCCTATTCT 2700 GGCTGTATTG CACGCTACGC CGTTAAAGAA ACCACAACGC GTACGACTAC TTGCCCATAA 2760 TGCTGACATA AACATATTCA AAACCCATCT TGAACAGCTG AGTGAGGTAA ATATGCAAAT 2820 TCTGGAGGCG GTGGACATTG ATAATGCCAC AAGCCTTTTC ATGAGCAAAC TAAGTGAGGC 2880 TGCTCAGCTT GCTGCACCGA GAAATCGGCA TGAAGTAGAG GCCTTCAGAC CACTTCAACT 2940 TCCTTCCAGT ATATTGGCAC TGCTCAGGCT AAAACGAAGA GTTCGAAAAG AATATGCTAG 3000 AACAGGTGAT CCCCGCATGC AACAGATCCA CAGTAGACTG GCCAACTGCC TGCATAAGGC 3060 CCTTGCTCGA AGAAAGCAGG CCCAAATAGA TACCTTCTTG GATAACTTGG GTGCTGACGC 3120 GAGCACAAAT TACTCACTGT GGCGTATCAC GAAACGGTTC AAAGCTCAGC CCACCCCAAA 3180 ATCAGCAATC AAAAATCCGT CTGGTGGCTG GTGTCGCACT AGCTTGGAAA AAACTGAAGT 3240 GTTCGCTAAC AACCTTGAGC AACGTTTTAC ACCCTATAAC TATGCACCGG AAAGTCTCTG 3300 TCGTCAGGTT GAAGAATACT TGGAATCGCC CTTTCAAATG AGCCTGCCTC TGAGTGCTGT 3360 CACACTGGAA GAAGTGAAGA ATTTAATAGC CAAGCTGCCA CTTAAGAAAG CTCCTGGAGA 3420 AGATCTTCTT GATAATAGAA CCATTAGACT TCTCCCAGAT CAAGCATTGC AGTTCCTTGC 3480 CTTAATATTC AACAGCGTTC TTGATGTTGG CTACTTTCCG AAAGCTTGGA AATCGGCGAG 3540 CATAATTATG ATCCATAAGA CTGGAAAAAC ACCGACAGAC GTTGACTCGT ACAGGCCCAC 3600 CAGCTTACTC CCATCTCTGG GTAAAATTAT GGAGAGGCTG ATCCTAAACA GGCTGCTCAC 3660 ATGCAAGGAT GTTACCAAAG CGATTCCCAA ATTTCAGTTT GGCTTCCGGT TGCAGCACGG 3720 TACTCCTGAG CAACTACATA GAGTAGTGAA CTTTGCTCTG GAAGCTATGG AAAACAAGGA 3780 GTATGCAGTA GGTGCCTTTC TTGATATTCA ACAGGCATTT GACAGAGTCT GGCACCCTGG 3840 GCTCCTGTAC AAAGCGAAGA GGCTGTTCCC GCCGCAGCTA TATTTGGTTG TTAAAAGTTT 3900 CCTGGAAGAA CGCACATTCC ACGTCTCTGT TGATGGGTAC AAATCATCAA TCAAGCCAAT 3960 TGCAGCTGGA GTTCCTCAAG GAAGCGTTCT TGGCCCAACC CTATACTCAG TTTTTGCTTC 4020 GGACATGCCT ACTCACACAC CAGTCACAGA GGTAGACGAA GAAGATGTGC TCATAGCCAC 4080 CTACGCTGAC GATACTGCTG TGCTCACGAA AAGTAAAAGT ATCCTGGCTG CCACTTCTGG 4140 TCTACAGGAA TACCTGGATG CATTCCAGCA ATGGGCTGAG AACTGGAATG TGCGCATCAA 4200 CGCTGAGAAG TGTGCCAATG TGACGTTCGC CAACCGAACA GGTAGCTGTC CGGGTGTCAG 4260 TCTGAATGGA AGACTGATCA GACACCATCA GGCTTATAAA TACCTTGGTA TTACCCTCGA 4320 TAGGAAGCTC ACCTTCAGCA GGCACATCAC AAATATTCAG CAAGCGTTCA GGACCAAGGT 4380 TGCTCGGATG TCTTGGCTCA TTGCACCACG CAACAAACTG TCGCTTGGCT GCAAGGTCAA 4440 TATTTACAAG TCCATATTGG CCCCCTGCCT GTTCTACGGC CTGCAGGTAT ACGGCATTGC 4500 TGCGAAGAGT CACCTTAATA AGATCCGGAT TTTACAGGCG AAGACCTTAA GAAGAATTTC 4560 GGGGGCTCCT TGGTATATGA GAACAAGAGA CATCGAACGC GACCTCAAGG TGCCCAAATT 4620 AGGAGACAAG CTCCAGAACA TCGCCCAAAA ATATATGGAA AGGCTTAATG TACACCCCAA 4680 CAGCCTAGCA AGGAAGCTAG GAACTGCAGC TGTGGTCAAT GCTGACCCTC GGACTAGAGT 4740 CAAAAGAAGA CTCAAGCGAC ACCACCCTCA TGACCTCCCT AACCTGGTTT TGACCTAGAA 4800 AGTCTTAGTT TTAAAATTCA TTAGAATAAT CAAATAAATA ATAATTACTA TGTTATATCA 4860 ACTATTATAA TTCTCCCTAT CATTTTTAGA TTAAAAATCT GTTAGTCTTA AGTAACCAAG 4920 ACACATTGTA AAATAAAATA ATTTAAGCAG ATCAAATTAA GTTGCCGCAT GGGTAACAGT 4980 GCGTTGATCA AATAATAAAA ACATCATAAA AAAAAAAAAA 5020 // ID DMTRDNA standard; DNA; INV; 1435 BP. XX AC X80025; XX DR FLYBASE; FBgn0014967; hopper. XX SY synonym: M4 XX FT source X80025:1..1435 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..33 FT SO_feature terminal_inverted_repeat ; SO:0000481:1403..1435 XX CC Derived from X80025 (g510507) (Rel. 44, Last updated, Version 11). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1435 BP; 456 A; 265 C; 212 G; 502 T; 0 other; CACTATGGGG CATTTGGCCT GTTTTTTTTA CAAAAATTAA TACCTCCTAA ACTATTGGAG 60 ATATTTGGAT GAATTTTTTT TTATGCGTTA CACATGCCTC CAGGAATATT TTGGAAAAGT 120 GGGCGTGCCC CAACTCCGCC CCATTTTTTT TTTTTTTTTT TTTTTTTTTT AATAATATAT 180 TTTTAAAGTT TATTTTTAAT TTCAATAATG TATAATTCAT AACCGTCTTC CTCTTCACAA 240 TCAGTAGAGT CTGAAGAATT TTTATCAGGT TCAAATTCGC AAGCTAACAT TTCAATGACT 300 TCTGGTGGAA GAGATAGTCG CTTATGTTTT CGCCTCTTTA AATTTATTGA TGATATTATG 360 GGATCCGAAG TATCCATTGC TCTGTAAAAG ACATCTGCGA AGCTACTAAT AGTTTTTGCC 420 GTGGCTGGCT TCAACAAAAG AATTTTAAGT ATGGCTGCAA GATCCCGCAG GCAGCACTTC 480 CGTGCAGCTT GAACCAAAAG ACGTTCGTTG TGTTTCTGCG CCCTTACGAG TTCATCTGCT 540 TGCTGTCTTG GGCCACTCAA ATTTTTTAAA TAATATGACG TTTTCGGGAG TCCAACTAAT 600 TTCCTTTCCT ATTTATTTTT CTCCTTTACC TTCAGGACTA GGTGTTCTTC TAACCAATTT 660 GAAAAAAATT TTAAAAATTC ATATATTTTT CGATTGCATT TTCTCCAATT TCGTAAAAGA 720 TTGACTGAAA TCATTCGTTA TTATTATTAT TAGTTAATCG TTTATTAAAG TCTAGCTTGC 780 TATCAGAAAA ATGCCCACTG ATAAAAGTGC AAATAGAATT TTCCTTTTGA CGAACACCCT 840 TTTGCGTGCG CCACACTTCC AGCAGGGCAG CATTGGAAAT CGAGATATTG CTCCCTAAAA 900 AATGAAATTT CTCAAAAAAC CGCAAAAAAC GCACATAGAG ACTACCTGAT ATGAGTTAGG 960 AATTGAACAC ACTACAACAT GGATATAAAC ACTTACTGAA CAAATTTGAA CAAATTGTTG 1020 TAGCTCTATT CAAAGTTGAA AATTTTTTCA AACAACTACA TCTTGACACC ACTTGTTAAA 1080 TGTACAAATT GTTAGAAATA GGCGCACACA ATAAACAATA TATTAATAAC AACACATAAT 1140 AAGAACCTAA AGATTGATTA TCCATTTCAA ATTATACTCT CCTTCTTCTT CTTTTTAAAT 1200 TTTAACACTT TGAAAGTTAA GCTAAATTTT GTGCGCAAAG CAGCCACGTG GTATATGCTC 1260 GCAACAGCCG ACTTTAACAG CTGTTATTAT AACAGTGCAT TGTTAAATTA ACTTATGCGG 1320 GCTATATCAT AACAGTTTAA CGTATTTCCA ATGTATTAAT ACTAAAATAC TTCAAATTTG 1380 CATACTTGTG AAAAACACAT TATTGTAAAA AAAACAGGCC AAATGCCCCA TAGTG 1435 // ID DMRTMGD1 standard; DNA; INV; 7480 BP. XX AC X59545; XX DR FLYBASE; FBgn0002697; mdg1. XX FT source X59545:1..7480 FT SO_feature five_prime_LTR ; SO:0000425:1..441 FT SO_feature three_prime_LTR ; SO:0000426:7039..7480 FT SO_feature CDS ; SO:0000316:548..923 FT SO_feature CDS ; SO:0000316:1327..1557 FT SO_feature CDS ; SO:0000316:1749..3062 FT SO_feature CDS ; SO:0000316:2987..6673 XX CC Derived from X59545 (g8507) (Rel. 49, Last updated, Version 4). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7480 BP; 3067 A; 1298 C; 1235 G; 1880 T; 0 other; TGTAGTATAT ACGAATATAA TAACAATAAT AATAACAATA ATAATAATAA TATTAATAAT 60 AATTATAATA TGAATCATAA TAATAAGTCA ACTAATAAGT AAACTTAGGA CCACCCTAAT 120 TCCTTAGGGT CACCCTAGTA GATCTTTAGA TACACCCTAA TACTAAATAT GCGAATTCAG 180 GATGTACGCC TTTAGGGGTC GGACTCGACT CCCATTGGTT ATCGAGTATG AACTTCATAC 240 ATACATATTG CAGAATTTGC TAGTGTCAGC ACTTGGCTGT CACAAGAGAT CTCCCTGTAG 300 ACCACACTAA GATCAGTTAT AAATCAGGAA TAGATCTGGA ATGTACACTC GCTTAATAAA 360 AACCAAATAA AGATAAAATG ACCAACTGCG TTTTGAGACT TTATTAACTA CATCAGAAGT 420 ATTAGAATTC AAATTAACTA CATGGCGACC GTGACAAAGG ATCGTTATAA GTTGTAGCAG 480 AAGCTAAAGG AAACCGCTTG TGATAATTTT CAACTTCGAT GCTCATCCAC CAAGACGGCG 540 GCAATTATGA AGAAAAAAGC GATCTGAGTG AGTAGAGTGT CAGTGTGATG GGAAAAAACA 600 GGGGCGGAGT TCGACATAAT ATAAAAAAGA GAATAGCGCA CATAAAGTGG CTATTATATA 660 CGAACACTCC ACCACCCCAA TGGTCGAAAG CTCAAAAACT ACAAGCTGAG CTAGACCACT 720 GTGTCGAATA TCTCAAGAAA AAAATCCCCA CCACACGCGC TCACTCAGAA AATCAAATAA 780 AATCGTTAAC AATTAACAAA ACTCCAACTC CCAATCCGAA AAGCCTGCCT GTTTTCAAGA 840 AAAGATGCCC GAACGACTGC GAGGGACCAC TGTTCACACC GCATTGTGAA CATACGTGCA 900 GACATTGCAG CTCCACCACA TAACCCCTAA ATGAGGAAAT CATCATCAAC GTGGTGAGCA 960 GCCCGCTCAT TACGTCATCG AGGGAGTGTC AGCGTGCCAA CCCGGCGACG ACCAGATGAC 1020 GCAGGAGGGT CAGAGTGAAG CAAATAGGAG CTGAAAAATA AAATATTTTT TTTGTTGCCC 1080 TGCGTGGCAC ACCCTCGATG CACTGCGCTG CATATTAATA TTACACAAAA TATTGTAACA 1140 TTGAGCGGAA CTTTTTCTGC CCGATGAGAA GAATGGCCCG TAAAGCCATA CACCAACTAG 1200 GTAGGAAAAT GTAACTATAT TGAACAAAAA AAAAAAAAAT CAAAACAACA TATTTTTAAA 1260 GTAAAATAAA CCAAAACCCA AAAATAAAAA AAAAAAAAAA AAAAAAAAAA ATAAAAATAT 1320 ACAAAAATGG GTTGGTTTGG ATCTGACGAT AGTCAGACAA AAGATAATAC GGCCAATGTG 1380 GTCAATAACT TAAAAATAGT CGACCATACA GATGACATTC AGTCACTGTG GTTACTCCTT 1440 TTGATCATGA CGATCGTAAC AGTCGCTCAG TTTATATTAA CGCTATATGT TAAGCATAAC 1500 AAGATAATAA AGAGGCGTTA TATAGGCAAA GCAGAGAATA GTTTGGATAA AATTTGAAAA 1560 AAAAAAAAAA AAAAAAAAAA AAAAAAACAT CCGAGATGTA TTTTGAATTA AGATGATCTA 1620 AAATTTTTAT TTTTAATATC AGAAAACTAG AATGAAAAAA AAAAAAAAAG ATTATTGAAG 1680 AAACCTATTT GAGAGAGGCC AAAATTTATA AAGTTCGATT GCATAGATAA ATCCATAGTA 1740 TTCTTTAAAT GGACTGGCAA GAAATACAAA ACGAGCTTAA AGAAATTAAA ACAACTTTTG 1800 ACAAGTCTTA TAAATGCATG ACACCAAATA GAGAAGTGCA ACAAGACACT CTCAACAAGC 1860 ATGCGCAGAT ATTGGTAAGA TGCTTTAATG GAGCACGCCA ATTAATTTAC AGAGAAAGGA 1920 AAAGATTAAC AAAAAATCAT TTATCACAAG CAGTAAAATT TCTAAACAGG TTCCGTGAGA 1980 ACTTGTTAAA CGTCAAGTAC AGACACAACT TAAATATTAC AATCCCAACG ATTTTAAGCA 2040 CACCTATAGT GGCTGAGATC GGTGAGGATA TCGAAAGTGT AGGAGAATCA GAAATAGAAA 2100 TAAAAGAAGA GGATCTCCAC GATCTTGCAA TTCCAGCGGT AATAACATTA CCCGAATTAC 2160 TTGAAGAAGA ACTTTCAGAT TCAAATACAG GAATAAGAAT ACAGGAAACG GACAAAATGA 2220 CAGACTCTGC CGCAACAGCA AGGGAATATG TGCGACAAAT TTCGTCCACA ATACCTGAGT 2280 TTGACGGCAA AAAGTTAAAC TTGAATAGAT TCCTCACGGC TCTCCGGCTG ATAGATCTGA 2340 CAAAAGGAGA TCAGGAGATG CTAGCGGTTG AGGTAATCAA GACAAAGATA CTTGGTCCAT 2400 TATCACACAA AGTTGAAAAT GAAAAGACCA TTATCGGTAT AATAAATCTA TTAAAAGCAT 2460 CAGTTAAAGG CGAATCGCCC GATGTCATCA AAGCAAAAAT GCTTAGTACA CAACAGCGCG 2520 GCAAAACTGC AGCGCAATAT ACCACGGAGA TAGAAAACCT ACGTGGGTTG CTCGAAGCAG 2580 CCTATATAGA TGATGGTTTA GATTCCAACA ATGCAGACAA ATTCGCTACA AAGGAAGCCA 2640 TATCTGCAAT GACCAAGAAC TGTGGGCACG ATAAGCTCAA AACCATATTG GAAGCTGGAA 2700 ATTTCAACAC GATGAATAGC GTGATTGAAA AATACATACA CTGCAGTACA GAAATGACCG 2760 GCAATTCAAA TAGTGTATTA TTCTATAATA ATAGAGGACA CTATCGAGGT AATAATTACC 2820 GAGGAAATTA CCAAAACAGA GGTAATGGCC GAGGAAATTA TAACTCCTAC AATAACAACT 2880 ATAGAGGCAG AGGTGGTTAC CATGGTGGAA ACAGAGGACG AGGTGGTAAC CAAAATTATA 2940 ATAGAGGTGG AGGTTACTCA AGAGGTAACC AAAACCATAA CTATAAAACA AGTCATGCCC 3000 ACAATGTCCG AAACATACAA TCGGAAAACG AACATACCCC CTTGAGCGAC AATCTACAAT 3060 AAAATTATAC AAAATTAATC TCAATTTAAG CATTTTTATA CGATTGAAGA ATATGAGTAC 3120 CAATTCATGG GTAACTCTTT TAATAGATAC AGGTGCAGAA ATTTCCCTGC TTAAATGCAG 3180 AAACAATAAT CTTAACGATT TAAATCCAAA AAATACAACA AATATATCAG GAATAGGGCA 3240 AGGGACAATT CAGTCTCTAG GTACACTACA TTTAGAAATG TGTATTGCTA ATGCAGCAAT 3300 ACCATATGAA TTCCATATCG TACCTAACAA TTTTCCTATA CCAGGGGATG GTATAATTGG 3360 CTTGGATTTC ATTAAGAAAT ACAATTGTAT TTTGGAATTC CACGACCAAG AAGATTGGTT 3420 CACTTTGAGG CCCAAAAATT TCAGGAACAT AAACATTCCT ATTATACATA CACTAGATAA 3480 TGAAATAATT TTGCCAGCTA GATCAGAAGT GATTCGAAAG ATTCAACTAA CATCTACTGA 3540 CACACATGTT CTCATTCCCA ACCAAGAATT ACAACCTAGC ATAATAATCG CAAGTGCACT 3600 CGTAAACACT CAGAACGTTT TGATTCGAAT TATTAATACA ACTGAAAAAG ACGCTATAGT 3660 TAGTAGCGCA AATATAAAAA GCGAATCATT GGATGATTAT GATGTATACA ACGCAAATAT 3720 AGAAAATAGT GCACAAAGAA CTTCAGAAGT ATTAAAACTT CTTAAATTTC CATCGTTATT 3780 CAAAAGCGAT TTAACAAAAT TATGCACCGA ATATAGCGAT ATTTTTGGTC TTGAAACAGA 3840 AACCATATCA GCTAATAATT TTTACAAGCA AAAATTGAGA TTAAATGACA AAACTCCAGT 3900 CTATATCAAA AACTATAGAA TGCCAGAAAG TCAAAAACCA GAAATTCAAA GGCAAGTTGA 3960 CAAATTAATA AAAGATGGCA TTGTCGAACA ATCTATTTCA GAATATAATA GCCCTCTTCT 4020 CTTGGTACCC AAGAAATCAC TGCCTAACTC GGAGGAAAAG AGATGGCGAT TAGTAGTCGA 4080 TTATCGCCAA ATAAACAAGA AACTGCTAGC AGATAAATTC CCACTTCCAA GAATAGAAGA 4140 CATTCTTGAT CAATTAGGCC GAGCAAAATA TTTCTCGTGC CTAGACCTGA TGTCAGGATT 4200 TCATCAAATA GAATTAGACG AAAGGTCAAG AAATATAACA TCTTTCTCAA CTTCAACGGG 4260 AGCATACCGC TACACGCGAT TACCATTTGG TTTAAAAATA GCCCCAAATT CTTTTCAAAG 4320 AATGATGACC CTTGCATTTT CAGGTTTAAC GCCTTCGCAA GCATTTCTGT ATATGGATGA 4380 TTTAGTAGTC ATAGGCTGTT CTGAAAAGCA CATGCTTAAA AATCTAACCG ACGTTTTCAA 4440 ATTATGTAGG CAACATAATT TAAAATTACA TCCAGAAAAA TGCACTTTCT TTATGAAAGA 4500 GGTTACTTAT TTAGGTCACA AGTGTACTGA CAAAGGTATA TTGCCAGATG ACTCTAAATA 4560 TGAGGTAATA AAGAACTACC CCAAACCAGT AAACGCAGAC GAAGCTAGAC GCTTCGTGGC 4620 ATTTTGCAAT TATTACAGAA GATTTATTAA GAACTTTTCT GAGAAATCAC GCCACTTAAC 4680 GAGGCTTTGT AAAAAGAATG TTCCATTTGA ATGGACAAGC GAATGCAATG ATGTATTCGA 4740 ATATCTCAAA AGGAAATTAA TGAAACCAAC ACTCCTTCAG TACCCAGATT TCAGCAAACA 4800 ATTTTGCATA ACCACAGATG CTAGTAAACA AGCATGTGGA GCGGTACTAT CTCAAGACCA 4860 TAACGGTCAA CAGCTACCAG TGGCATACGC TTCAAGAAGC TTTACAAAAG GCGAAAGTAA 4920 TAAGTCCACT ACAGAGCAGG AGCTAGCAGC TATTCACTGG GCAATAAATC ACTTCAGACC 4980 ATACGTATAT GGTAGACATT TCTTAGTACA AAGTGACCAT AGGCCACTAT CATATCTTTT 5040 TTCAATGAGA AACCCCAGTT CAAAATTAAC CAGAATGAGA CTAGACTTGG AGGAGTTCGA 5100 ATTCACAGTA GAATATCTCA AGGGGAAAGA TAATCATGTC GCAGACGCAT TGTTCCGAAT 5160 AACAATCGGA GAACTTAAAG CAATAAATAG ACAGATACTA AAGGTAACAA CAAGATCAAC 5220 AACAAGACAG AAAAATACCT GCGCAGGTGA AAAATTGCAT GAACCAAATG AGAAAGAAAA 5280 TATAAAAATG CCCAATATCT ATCAGGTAAT CAATAACATT GATGCCAAAA AATATGTTAT 5340 ACTCAAAATA GACAAGCATA AGTGTTTGTT GAAAAGAGGA AAACAAATTA TAACACGTTT 5400 TGATATGACT AATTTTTATT CTAATGAAAT AATCGATTTA GATCAATTCT TTCAAAGGCT 5460 TAATGAAGAA GCAAGAATAA ATAGCATCAT TCAAACACAA TTGTCACCAA GTGAACAAAT 5520 CTTCGAATTT GTCACTATAA AGAACTTTAA AGAAAAGGGC AATAAAATAC TAAAAAATTT 5580 AAAAATAGCG CTATTAAACA AGGTGACTAA GATAGATAAA AATGATAAGG TTCAAATAAA 5640 AGCAATACTG TCTAAATATC ATGATGATCC ATCAGAAGGA GGCCATTCAG GAATTTCTAG 5700 AACCCTGAGG AAAATGAAAA ACTGTTGTTG TTGGCCACGA ATGACGAAGG CGATAAGTGA 5760 ATATGTTGAA ACATGTTTGA AATGTCAACA AGCCAAGACT ACGAAACATA CTAAAACACC 5820 GTTGACAATA ACAGAAACGC CAGCAACAGC ATTTGATAAA GTTTTGATAG ATACCATTGG 5880 TCCACTGCCA AGATCAGAAA ACGGAAATGA GTATGCTGTT ACTATCATTT GCGATTTAAC 5940 AAAATATTTG GTAACGGTAC CAATTCCAAA TAAAAGTGCA AAATCAGTTG CTAAGGCTAT 6000 ATTCGAAAAT TTTATTCTAA AGTACGGTCC AATGAAAACA ATCACAACGG ACATGGGAAC 6060 GGAATATAAA AACCAAATTA TAGACGACCT ATGCAAATAT ATGAAGATAA AAAACATTAC 6120 TTCAACAGCA CACCATCACC AGACATTAGG AACAGTAGAA CGAAGTCACA GAACTTTCAA 6180 CGAGTATGTT CGCTCATATA TATCTGTTGA CAAAACCGAT TGGGATATAT GGATACAATA 6240 TTTTACTTAT TGTTTCAACA CAACACCATC GGTAGTTCAT GAATATTGTC CATATGAATT 6300 AGTATTTGGA AGATTACCAA GACAGTTCAT AGATTTTAAC AGGATAGACA GAATAGATCC 6360 TATTTACAAC ATGGATGATT ATTCAAAAGA AGTTAAGCTA CGATTAGAAA TAGCATATAG 6420 AAGAGCTAAA AATATGTTAG ACAAGGCAAA AGCCGATAGA AAGATAAAAT ATGATAGAAA 6480 TATTAGTAAC TTTGAATTAA AGATAGGAGA TAAGATATTA CTTAAAAACG AAACGGGTCA 6540 TAAACTTGAC AATAATTATT TAGGACCATA TTTAGTTTCA GAAATAGGAG ATAATGACAA 6600 CATTACAATT ATAGGAAATA AAAATAAAAA ACAGATAGTC CATAAAGATA GGTTAAAAAT 6660 TTTTAATTCA TAATACATTT TGTTTGGTTG GCCAACCACA AATAAAAAAC CACAAATAAA 6720 AAACCACAAA TAAAAAACCA CAAATAAAAA ACCACAAATA AAAAACCACA AATAAAAAAC 6780 CACAAATAAA AAACCACAAA TAAAAAACCA CAAATAAAAA ACCACAAATA AAAAAACCAC 6840 AAATAAAATA AAAACCAATA AAAACATTAT AATACAAAAC TTTTACTTTG CAAAATATAA 6900 TGAAAATATA TATATTTTTT TTAATATCTC TTTAATCATT CATTTCAAAT ATTAATGTAC 6960 ATTTAAAAAA AAAAAAAAAA ATATTATATA CTTGAAAATA ACTTCATGTT ATTACGTTAT 7020 TTTTCAAAAG GAGGGAGATG TAGTATATAC GAATATAATA ACAATAATAA TAACAATAAT 7080 AATAATAATA TTAATAATAA TTATAATATG AATCATAATA ATAAGTCAAC TAATAAGTAA 7140 ACTTAGGACC ACCCTAATTC CTTAGGGTCA CCCTAGTAGA TCTTTAGATA CACCCTAATA 7200 CTAAATATGC GAATTCAGGA TGTACGCCTT TAGGGGTCGG ACTCGACTCC CATTGGTTAT 7260 CGAGTATGAA CTTCATACAT ACATATTGCA GAATTTGCTA GTGTCAGCAC TTGGCTGTCA 7320 CAAGAGATCT CCCTGTAGAC CACACTAAGA TCAGTTATAA ATCAGGAATA GATCTGGAAT 7380 GTACACTCGC TTAATAAAAA CCAAATAAAG ATAAAATGAC CAACTGCGTT TTGAGACTTT 7440 ATTAACTACA TCAGAAGTAT TAGAATTCAA ATTAACTACA 7480 // ID DMMDG3 standard; DNA; INV; 5519 BP. XX AC X95908; XX DR FLYBASE; FBgn0002698; mdg3. XX FT source X95908:1..5519 FT SO_feature five_prime_LTR ; SO:0000425:1..267 FT SO_feature three_prime_LTR ; SO:0000426:5253..5519 FT SO_feature transcription_start_site ; SO:0000315:178 FT SO_feature polyA_signal_sequence ; SO:0000551:5253..5519 FT SO_feature CDS ; SO:0000316:296..4780 FT /db_xref="FLYBASE:FBgn0043882; mdg3\ORF" FT /db_xref="SPTREMBL:Q94885" FT /protein_id="CAA65152.1" FT /translation="MDDKIILNDFSLTTLKDWLRILGQNTEGTKTELIARLQDIPTAVR FT GDCPPEHPQKNAPPGNDIFSSLDFQNCEINTDHVSVNAMNRKESTETGSERETNMFELQ FT QLRAELAEAKAMLNGTRSSLQFQEQQQPEQSKATVSSVIQTAQFTQAGATKENTTFHSP FT QRSNERAESQRFPVDALALAKETITDYDGKTCARAWITVVKNIARTFNIDDNHLRILLI FT TKLKGNAQVWLHAHPARLIEPIDNLLDQLSLTFGEQSSKAEIRRKFESRKWKTEENFCS FT YYDEKMALSNGINIDDDELLDQMIEGIPLQNFRTQARIQCFSTPSEMLRAFSNIRLPAR FT REPPVQPTDYKDAIRCANCNSRGHKADICKKPKREPGSCYACGQLGHLVAQCPTRKSVS FT SNNYVRWFKINFFENAYKPIISECLIDSGSPISIIKKSLINETMKLALVNTCYFGLNNC FT ILKTHGQTTCYVLKGSIKIYFRLIIVCDQSMRYNVILGRDFLTACNLNLDPYTLGMIAL FT RKPMEINKISMFTENDSPEKSLENEIVSPKSLENEIVSSQSLENEIVSPKSLENEIVSP FT KSFKNATISPKSLENKIVNQQHKETGPISLRDEIVNQQKNVSKSKLSEDEIVNTSKEIV FT SFKLPKDKNVYEQLNHNFDKEVLRICHVTESELEYKIGENVSNRLQLEFDRLFRNFYIN FT AKRPNEPTVRSEIQLCLKNPKPFSCSPRRLSYTEKDRLQKLLDEYLENGFIRPSDSEYA FT SPIVLVKKKTGDLRMCVDFRKLNKMTMKDNYPLPLIDDLLDRMNEKTVFTKLDLKNGFF FT HVHVKKESIKYTSFVTPLGQYEWLRMPFGLKNAPSVFQRFVNKIFADMIRENKVVVYMD FT DILLATENINEHLETLKEIFKRLVENKLELRIDKCEFMQSSIKYLGFIINKDGIMPNDK FT GIEAIKNFPIPNNVHTVQSFLGLCSYFRRFIKDFSRLAKPLHDILKKDKPFKFGSEEMI FT CFNMLKDKLIQSPVLAIYNHKHETELHCDASSSGFGAVLMQKKEDQKWHPVSFFSKRTT FT DIESKYHSFELETLAIVYSLRRFRVYLHWRTFKIVTDCNSLILTLSKKELNPRIARWAL FT EFQGYDFEIVHRAGSRMQHVDALSRCTNIMVIQTNSFEDNLVICQGKDTKLKEIRQLLE FT NTENKLYEMRNGIVYKKTNENRLLFYVPIEMEEQVLYKYHNELGHVGRDKMIEAIMKNY FT WFPNLKQKCSTHISNCLKCISFSPKTGKTEGFLHNIPKGNKPFEIIHIDHYGPVDLARP FT KKHILVIVDAFTKFVRLYATKTTNTKEVIQSLNDYFRAYSRPKCIISDRGACFTSGDFD FT SFLKECNVKHIKIATGSPQANGQVERINRSLGPMISKLIEPDQGLHWDLVLEKVEYTLN FT NTLHRSIKQYPSIMLFGLQQKGQIMDELKEKIEEIGETIEERDLESIRNKGEASQKIAQ FT AYNKEYVDKKRKRSGVFTKGTTSWLKILTQQQA" XX CC Derived from X95908 (e990667) (Rel. 49, Last updated, Version 3). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5519 BP; 2078 A; 1005 C; 1029 G; 1407 T; 0 other; TGTAGTAGGC TGCACCTTCT ACCCTCTTCC TTTACTCTTA GTCATACATA CCTAATTATA 60 CATAGCCAAT CTAGTCATAA GCTTATACAC TCATACACCC ATCCTTAACA TACAAATATT 120 ATCGAGAAAC TTATCGACTA ATCGACTCGC CACTCTGCAG AGAGCGCGGC AGTCAGTCGC 180 TGTTGAACCA AGCTAAAGGA CAGATCAAAA ATAAAAGAGA CACGTGAAAT TGTATTAGAA 240 TATTAACTTC TGTAAACGGC GGCTAAAATC TCAGAAGTGG GATTAATAAT CCAAAATGGA 300 CGATAAAATC ATCCTGAACG ACTTTTCGCT GACAACCCTA AAAGATTGGC TACGTATTCT 360 GGGCCAAAAT ACGGAGGGCA CAAAAACCGA ATTAATCGCG AGGCTGCAAG ACATCCCAAC 420 GGCAGTTCGG GGCGATTGTC CACCGGAGCA CCCCCAGAAA AACGCTCCAC CAGGAAACGA 480 CATTTTTTCT TCACTGGATT TTCAGAATTG TGAAATTAAC ACCGATCACG TAAGTGTGAA 540 TGCGATGAAC AGAAAAGAAT CAACCGAAAC TGGCAGTGAG AGGGAGACAA ACATGTTCGA 600 GCTACAGCAA CTACGCGCAG AGCTAGCAGA AGCGAAGGCA ATGCTTAACG GAACACGATC 660 GAGCTTGCAG TTCCAAGAAC AACAACAACC AGAGCAAAGC AAGGCTACAG TTAGTTCCGT 720 TATCCAGACG GCGCAGTTTA CGCAGGCTGG CGCCACAAAA GAGAACACAA CATTTCACTC 780 GCCGCAGCGA TCCAACGAGA GAGCGGAGAG CCAGCGTTTT CCAGTTGATG CTCTCGCTCT 840 CGCCAAAGAG ACGATAACCG ATTACGATGG GAAAACTTGC GCGCGTGCCT GGATAACAGT 900 GGTCAAAAAT ATCGCACGCA CTTTCAACAT CGATGACAAC CATTTACGCA TCTTACTCAT 960 CACTAAACTT AAAGGAAACG CGCAAGTCTG GTTACATGCG CACCCTGCTC GATTGATCGA 1020 ACCAATTGAC AATTTGCTTG ATCAATTGTC ATTGACTTTT GGCGAGCAAT CATCCAAGGC 1080 TGAGATCCGG CGAAAATTCG AGAGTCGCAA GTGGAAAACC GAGGAGAATT TCTGCAGTTA 1140 TTACGACGAG AAGATGGCTC TCTCAAACGG GATAAACATC GACGACGACG AACTACTGGA 1200 CCAGATGATA GAGGGCATAC CGCTACAAAA TTTCCGTACC CAAGCACGGA TTCAATGCTT 1260 CTCTACTCCA TCGGAGATGC TACGCGCATT TTCGAACATC CGTTTGCCAG CTCGGAGGGA 1320 GCCACCTGTA CAGCCAACCG ACTACAAAGA TGCCATACGA TGCGCAAACT GTAATTCAAG 1380 AGGACACAAA GCTGACATCT GCAAGAAGCC CAAACGTGAA CCAGGTTCGT GCTACGCCTG 1440 TGGACAACTT GGACACCTGG TGGCACAATG TCCCACAAGG AAGAGCGTTT CATCTAATAA 1500 TTATGTAAGA TGGTTTAAAA TTAATTTTTT TGAAAATGCT TATAAGCCCA TAATTTCAGA 1560 ATGCCTCATA GACTCTGGCA GTCCTATATC TATCATTAAA AAGTCACTTA TTAACGAGAC 1620 AATGAAGTTA GCCCTAGTTA ATACTTGCTA TTTTGGTTTA AACAACTGTA TTCTCAAAAC 1680 ACATGGACAA ACCACATGTT ATGTTTTGAA AGGATCAATA AAAATATATT TTCGTTTAAT 1740 CATTGTTTGC GACCAGTCTA TGAGGTATAA TGTTATTCTC GGCAGAGATT TTTTAACTGC 1800 ATGCAATTTA AATTTAGACC CGTACACCTT GGGAATGATT GCGTTGAGAA AACCCATGGA 1860 AATAAACAAA ATATCAATGT TTACTGAAAA TGATAGTCCT GAGAAATCTT TAGAAAATGA 1920 AATTGTTAGT CCAAAATCGT TAGAGAATGA AATTGTTAGT TCACAATCGT TAGAAAATGA 1980 AATTGTTAGC CCCAAATCGT TAGAGAATGA AATTGTTAGT CCAAAATCGT TTAAAAATGC 2040 AACTATTAGT CCGAAATCGT TAGAAAATAA AATCGTTAAT CAACAGCATA AAGAAACTGG 2100 TCCAATATCG TTAAGAGATG AAATAGTTAA TCAACAAAAG AATGTCAGTA AATCAAAATT 2160 ATCAGAAGAT GAAATTGTTA ACACTTCAAA AGAAATCGTT AGTTTTAAAT TGCCAAAAGA 2220 TAAAAACGTT TACGAACAAT TAAATCACAA CTTTGATAAG GAAGTACTAA GAATATGTCA 2280 TGTAACTGAA AGTGAGTTAG AATACAAAAT AGGAGAAAAT GTTAGCAATA GGTTACAACT 2340 AGAATTCGAT AGGTTGTTTA GAAATTTTTA TATAAATGCA AAAAGGCCAA ATGAACCGAC 2400 AGTTAGAAGT GAAATACAAT TGTGTTTGAA AAACCCGAAA CCGTTTAGCT GTTCTCCTAG 2460 GAGGCTTTCA TACACAGAAA AAGACAGGTT ACAAAAACTA TTAGACGAAT ATTTGGAAAA 2520 CGGATTTATA CGACCAAGCG ACTCGGAATA TGCATCGCCT ATTGTTTTAG TGAAAAAGAA 2580 AACTGGAGAC TTACGTATGT GCGTCGACTT TAGAAAACTT AATAAAATGA CAATGAAAGA 2640 CAACTATCCT CTACCTCTTA TAGATGACTT GTTAGATAGA ATGAATGAGA AAACTGTTTT 2700 CACCAAACTC GATCTTAAAA ACGGTTTTTT CCACGTGCAT GTTAAAAAAG AATCAATAAA 2760 ATACACCTCT TTCGTTACAC CATTAGGCCA ATACGAGTGG CTGCGAATGC CATTTGGCCT 2820 CAAAAACGCC CCGTCTGTGT TCCAAAGATT TGTTAACAAA ATTTTTGCGG ATATGATTAG 2880 AGAAAACAAA GTAGTAGTAT ATATGGACGA CATTCTATTG GCAACCGAAA ATATAAACGA 2940 ACACTTAGAA ACGTTGAAAG AAATTTTTAA AAGATTAGTT GAAAATAAAC TTGAATTAAG 3000 AATAGACAAA TGTGAGTTTA TGCAATCAAG TATAAAATAT CTTGGGTTCA TAATAAATAA 3060 AGACGGCATA ATGCCCAATG ACAAAGGAAT CGAGGCAATA AAAAATTTCC CAATACCTAA 3120 TAATGTTCAT ACAGTACAAA GTTTTTTGGG ATTATGCTCA TATTTTCGAC GGTTTATAAA 3180 AGATTTTTCT AGACTAGCTA AACCATTGCA TGACATTCTA AAAAAAGATA AACCGTTCAA 3240 ATTTGGTAGT GAAGAAATGA TTTGTTTTAA TATGTTAAAA GATAAATTAA TACAGTCACC 3300 GGTCTTAGCT ATATACAACC ATAAACACGA AACAGAATTG CATTGTGATG CAAGTTCTTC 3360 TGGATTCGGT GCTGTACTTA TGCAAAAGAA GGAGGACCAG AAATGGCACC CAGTTTCATT 3420 CTTTTCAAAA CGGACAACAG ATATTGAATC AAAATACCAC AGTTTCGAGT TAGAAACTTT 3480 AGCCATTGTT TATTCGTTAC GTAGATTTAG AGTTTATCTT CATTGGAGGA CATTTAAAAT 3540 AGTCACCGAC TGCAACTCAT TAATTTTGAC CCTAAGCAAA AAAGAGCTAA ACCCTAGGAT 3600 AGCCAGGTGG GCTTTAGAAT TCCAAGGTTA TGATTTTGAA ATTGTGCATA GGGCAGGTAG 3660 CCGCATGCAA CATGTTGACG CACTGAGTAG GTGTACAAAT ATTATGGTAA TACAAACAAA 3720 CAGTTTCGAA GATAATCTAG TTATATGTCA AGGGAAAGAT ACAAAATTAA AAGAAATCAG 3780 GCAATTGTTA GAAAACACAG AAAATAAATT GTATGAGATG AGAAATGGTA TAGTTTACAA 3840 AAAGACAAAT GAAAATAGAT TGCTGTTCTA CGTTCCGATA GAAATGGAAG AACAAGTGTT 3900 ATACAAATAT CACAACGAAC TTGGACACGT AGGAAGAGAC AAAATGATAG AAGCTATAAT 3960 GAAAAACTAT TGGTTTCCAA ATTTAAAACA GAAGTGTAGC ACACATATCA GCAACTGTTT 4020 AAAATGTATT TCATTCAGTC CCAAAACAGG AAAAACAGAA GGATTTCTAC ACAACATACC 4080 TAAGGGAAAC AAACCTTTTG AAATAATCCA TATTGACCAT TATGGTCCAG TAGACTTGGC 4140 TAGACCGAAG AAACATATTC TAGTGATAGT AGATGCATTC ACAAAGTTTG TCAGACTATA 4200 CGCAACAAAA ACTACGAACA CAAAAGAAGT CATACAATCG TTAAATGACT ACTTCAGAGC 4260 ATACAGTAGG CCTAAGTGTA TCATATCAGA TAGAGGAGCA TGTTTCACGT CTGGCGATTT 4320 TGACTCATTT TTGAAAGAAT GCAATGTTAA ACACATAAAA ATTGCAACAG GATCGCCACA 4380 AGCCAACGGT CAAGTTGAAC GTATAAACCG AAGTCTTGGT CCAATGATTA GCAAGTTAAT 4440 TGAACCTGAT CAAGGTCTAC ACTGGGACTT AGTCTTAGAA AAGGTCGAAT ATACCCTGAA 4500 CAATACACTA CACCGCAGCA TTAAACAGTA TCCTAGCATA ATGTTATTTG GGTTACAACA 4560 AAAAGGACAA ATTATGGATG AGTTAAAAGA AAAAATTGAG GAAATTGGAG AAACGATTGA 4620 AGAAAGAGAT TTAGAAAGTA TTAGAAATAA AGGCGAGGCA AGTCAGAAAA TAGCACAAGC 4680 ATACAATAAA GAATATGTTG ACAAAAAACG AAAACGATCA GGAGTGTTCA CAAAAGGCAC 4740 TACGTCATGG TTAAAAATTT TGACTCAACA ACAGGCATAG CTAAGAAGTT AATTCCAAAG 4800 CATAAAGGAC CCTATGTCAT AAGCAAAGTT CTCAAAAATG ATCGCTTCCT TCTGGAAGAT 4860 GTTGATGGAT TTCAAATTTC TCGCAATCCT TACCGGGGTG TATGGAGCAT ACAGAATATA 4920 AAACACTGGC AAAGAAAAAT TAAGAGTCTA CAAAATAGAA AGTATAATTT GAGAAACTCT 4980 GTACAAAATA GAAAGTATAA TTTGAGAAAC TCTGTACAAA ATAGAAAGTA TAATTTAAGA 5040 AACTCTGTAC GAAATCGAAA GTATAATTTA AGAAGCAATT GTAAAACAAA GAAAACAAAC 5100 AAGAAGAAAA GAAAACCAAA AAAATGTTTA AGACCGTTCA AAAGTATCTC CACTAAGAAG 5160 AATAAAATAA GAAACAGGAC CCTTAGCTTT AAGAAACGTT AATTGTTATA AAATCCTACG 5220 ATCGGGAGAT CTAGTTGTCA GGACGGCCGA GTTGTAGTAG GCTGCTCCTT CTACCCTCTT 5280 CCTTTACTCT TAGTCATACA TACCTAATTA TACATAGCCA ATCTAGTCAT AAGCTTATAC 5340 ACTCATACAC CCATCCTTAA CATACAAATA TTATCGAGAA ACTTATCGAC TAATCGACTC 5400 GCCACTCTGC AGAGAGCGCG GCAGTCAGTC GCTGTTGAAC CAAGCTAAAG GACAGATCAA 5460 AAATAAAAGA GACACGTGAA ATTGTATTAG AATATTAACT TCTGTAAACG GCGGCTAAA 5519 // ID DMDM11 standard; DNA; INV; 5461 BP. XX AC X14037; AC X15066; XX DR FLYBASE; FBgn0002745; micropia. XX FT source join(X14037:5..3664,X15066:3534..3593,X14037:3721..5461) FT SO_feature five_prime_LTR ; SO:0000425:1..476 FT SO_feature three_prime_LTR ; SO:0000426:4957..5461 FT SO_feature primer_binding_site ; SO:0005850:477..489 FT /bound_moiety="tRNA-leu" FT SO_feature primer_binding_site ; SO:0005850:4939..4956 FT SO_feature CDS ; SO:0000316:540..4415 FT /db_xref="FLYBASE:FBgn0043876; micropia\polyprotein" FT /translation=" FT MQNRNLAELVKIMQKTPAREQQPSYDVKLPKFNPDAACVEAAKWCSTTDI FT ILTEHPLKGSKLITALSNCMEGTASQWLTQISYQGMTWQEFQELFLQRFE FT TEETPAATFLNLLNSRPTAAECYAVYASRLVTQLTTKWRNMEIEEIAVTT FT VLAHMANIDSRLQRVLFTSNVRTRSKLQAELKAFTFDKKRHARDDNLGPD FT QKNRKASPVVCHFCSKPGRRIAECRSKMRQDRRAKPQREKSNVTCYRCGQ FT PGHFSNQCPKNGTAAKQDVTQQKTVNQCCVTEPKGSLHQRGEIYPICFDS FT GAECSLIKDDISSKLSGKRINNTVMIKGIGGGSVCSTLQILSEVTINENI FT MEILFHVVPNEEMRNDILIGREILKQGFYVILTSDNFKVVKSKTVNNCSV FT TERSFTLSDIDTELVDNEKAQLIELLEKHSTSFTNGIPHTRVNTGEMKIR FT LIDPTKTVQRRPYRLSPEEREVVRMQVSELIRCNIVRPSCSPFASPMLLV FT KKKNGTDRLCVDFRELNSNTISDKYPLPLISDQIARLRGANYFTCLDMAS FT GFHQIPIHPESVEYTAFVPDGLKNAPSVFQRTVINALGDLANSFVIVYMD FT DIMVVSPTKELALERLKTVLNVLTKAGFTFNLAKCSFLKTTVQYLGYEVR FT AGEIRPNVRKIASLSSLPPPQTVSGVRQFIGLASYFRKFVSGFSQLMKPL FT YSLSSGSGKITWSAELEEIRLKVVTILTNEPALVIFDPQYPIELHTDASA FT CGYGAILLHRIESKPHVIEYFSKTTTSVESRYHSYELETLAVVKAVKHFR FT HYLIGREFVVYTDCNSLKASRTKIDLTPRVHRWWAYLQSFNFEIQYREGK FT RMAHVDFLSRNPLSPEHILSINKIPEKRVNLSEISSTWLLAEQRLDLEII FT EIVNKLESDELAENLAKTYDLRKGVLYRKVQRRGRTSYLPVVPRAFKWSV FT INQVHESIMHLGWQKTLDKVYQYYWFAKMNKYVRKFVSNCITCRSVKSSS FT GKVQAELHSIPKTSIPWHTIHIDITGKLSGKSDLKEYVIVQIDAYTKFVY FT LYHTLKIDAESCVNAMKSSISLFGVPDRIIADQGRCFTSSKFSEFCVSQK FT VELHLIATGMSRANGQVERVMETLKNLLSVVESSQRSWQDALGEVQLALN FT CTISRATDASPLEMLIGKQARPLGLVPPCETECEIDLATVRAHATENMNS FT LASYDKSRFDSSRAAVDKHHVGDYVLLRNEERHQTKLDPKFRGPFLVTEV FT LEGDRYTLKSLTSNRSFKYCHESIKMPDAEIPNELNENVEQ" XX CC Sequence assembled by Lynn Crosby (FlyBase), 'micropia.v006'. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5461 BP; 1679 A; 1051 C; 1264 G; 1467 T; 0 other; TGTCGTGGCG AAAATAATGA GTATGCGTGT AGTCGCTGTT TACTTCTTCT CCATGTTCCC 60 TTTGCTATTA TGCGTGTTCC TATTTATGAA CACGTGGCGA AAATAATAAA TGCGTGTAGT 120 CGCCGTTTAC TTCTTCTCCA TGTTCCCTTT GCTATTATGC GTGTTCCTAT TTATTGTCAA 180 TGTGTGAGGA TGAATAGATG AATTATCTAT GAACGGGATT TTGCAAAAAC GACTTGCGCT 240 GCTTGGTTAG AAAGGGAAAA CTATATAATG AAAAGGGAAT GCCAAAAATT GAGAAGAGAC 300 AAAGCAGGCT GCACGAAGCT GGAGTGAGGG CATTAATCGT GGAGAAGCCA AAGCAGACGC 360 AAGTGGACTC GTTGACTGCG CACAGCTGCA TAAAATTATA TAGTAAAAAG AGATTTGAGC 420 GACGCTGATA TGGACGGACG GACGGACGCG AGGCCCCTGA TATTCTTAAC CCGACATCAG 480 AAGTGGGATC TGTGCCACAC CCTGCATTTT CTGAGGATCA GTGGCGTGCA GTAGTGGAAA 540 TGCAAAATCG GAATTTGGCT GAACTTGTAA AAATCATGCA AAAGACGCCG GCACGTGAGC 600 AGCAACCTAG TTATGATGTT AAGCTACCCA AATTTAACCC TGATGCTGCA TGCGTAGAGG 660 CAGCAAAGTG GTGTTCAACA ACCGATATAA TTCTAACTGA GCACCCCCTT AAAGGAAGTA 720 AATTGATCAC GGCACTAAGT AACTGCATGG AGGGAACTGC ATCTCAGTGG CTAACACAAA 780 TCTCGTACCA GGGTATGACT TGGCAAGAGT TCCAGGAATT ATTTCTGCAG CGCTTTGAAA 840 CCGAAGAGAC GCCGGCCGCT ACGTTTTTAA ATTTACTCAA CAGCCGCCCG ACTGCCGCCG 900 AATGTTACGC GGTGTATGCG AGTCGGCTGG TGACGCAGCT GACTACAAAG TGGCGGAATA 960 TGGAAATAGA AGAAATTGCC GTTACAACTG TTCTTGCGCA TATGGCAAAC ATTGACAGTC 1020 GTTTGCAGCG CGTCCTCTTC ACATCCAATG TGCGTACCAG AAGTAAGCTA CAGGCGGAGT 1080 TAAAAGCGTT TACGTTCGAC AAGAAGCGAC ATGCTCGAGA TGACAACCTT GGACCTGACC 1140 AGAAGAACCG TAAGGCATCG CCAGTTGTAT GCCACTTCTG TTCAAAGCCG GGACGTCGAA 1200 TTGCTGAATG CCGAAGTAAA ATGCGACAAG ATAGACGGGC GAAACCGCAG CGTGAAAAAT 1260 CAAATGTTAC GTGCTATCGG TGCGGCCAAC CGGGACATTT CTCCAACCAG TGCCCGAAAA 1320 ACGGAACTGC AGCCAAACAA GATGTGACTC AACAGAAGAC TGTTAACCAA TGTTGTGTGA 1380 CTGAGCCAAA GGGAAGCTTG CATCAACGAG GTGAGATCTA TCCAATTTGT TTCGATTCCG 1440 GTGCAGAGTG CTCCCTTATT AAAGACGACA TTAGCAGTAA GTTATCTGGT AAACGTATAA 1500 ACAATACTGT AATGATAAAA GGCATTGGTG GTGGCAGTGT GTGCAGTACA TTGCAAATCT 1560 TGAGTGAAGT CACTATAAAC GAAAATATTA TGGAAATATT ATTTCATGTA GTCCCGAACG 1620 AGGAAATGAG GAATGATATT CTGATAGGGC GAGAAATACT TAAACAAGGC TTTTATGTAA 1680 TTTTGACATC CGATAATTTT AAAGTTGTAA AATCAAAAAC TGTTAATAAT TGTTCCGTTA 1740 CTGAGCGATC GTTTACTTTG TCCGATATTG ACACCGAATT AGTCGACAAT GAGAAAGCTC 1800 AATTAATTGA GTTACTTGAA AAGCACTCGA CTTCATTTAC CAACGGGATA CCTCATACTC 1860 GAGTAAATAC AGGCGAAATG AAAATCCGTT TGATTGATCC AACTAAAACT GTTCAGCGCC 1920 GACCTTACAG ACTTAGCCCC GAAGAGAGAG AAGTAGTGCG AATGCAGGTG AGCGAATTGA 1980 TAAGATGTAA TATTGTTCGC CCAAGTTGCT CTCCCTTTGC TAGCCCCATG TTGCTCGTCA 2040 AAAAGAAGAA CGGAACCGAC CGTCTATGTG TTGATTTTAG AGAGCTAAAC TCGAACACGA 2100 TTTCGGATAA ATACCCCTTG CCGCTTATCA GCGATCAAAT TGCTAGACTT CGCGGAGCAA 2160 ATTATTTCAC ATGCCTGGAT ATGGCAAGTG GTTTCCACCA AATCCCGATT CACCCTGAAT 2220 CCGTGGAATA TACTGCATTT GTGCCCGACG GCCTCAAAAA TGCGCCATCT GTTTTCCAGC 2280 GCACAGTCAT AAATGCACTT GGTGACCTTG CTAACTCTTT TGTAATCGTT TACATGGACG 2340 ACATAATGGT AGTATCGCCA ACCAAGGAAT TGGCTTTGGA AAGGTTAAAA ACTGTTTTGA 2400 ATGTTCTTAC AAAGGCTGGT TTTACCTTTA ACCTTGCTAA ATGCAGTTTT CTCAAAACAA 2460 CGGTTCAGTA TTTAGGCTAT GAAGTGCGAG CGGGAGAAAT TCGTCCGAAT GTGCGAAAGA 2520 TAGCTTCTTT AAGCTCCTTG CCTCCTCCTC AAACTGTCTC CGGCGTTAGA CAATTCATTG 2580 GCTTGGCCTC TTACTTTCGC AAATTCGTGT CTGGATTCTC CCAACTTATG AAACCATTGT 2640 ATTCACTTTC GTCTGGTAGC GGCAAGATTA CATGGAGCGC TGAGCTGGAA GAGATCAGAC 2700 TTAAAGTTGT GACGATCCTC ACAAATGAGC CTGCTCTGGT AATCTTCGAC CCGCAATATC 2760 CTATTGAGTT GCACACTGAT GCAAGTGCCT GTGGATATGG AGCGATACTT TTGCACCGTA 2820 TAGAAAGTAA GCCCCATGTA ATCGAATACT TCAGCAAAAC AACTACCTCT GTTGAATCTA 2880 GATATCACTC CTACGAGCTG GAAACCTTGG CAGTGGTAAA AGCCGTTAAA CATTTTCGCC 2940 ATTACCTAAT TGGCCGTGAG TTCGTTGTCT ATACAGACTG CAATTCATTA AAAGCTTCTC 3000 GCACAAAAAT AGATTTAACC CCCAGAGTTC ACCGCTGGTG GGCCTACTTA CAATCGTTTA 3060 ATTTCGAAAT TCAGTATAGA GAGGGTAAGC GTATGGCTCA TGTGGATTTC CTATCAAGAA 3120 ATCCTTTATC ACCCGAACAC ATTTTGTCAA TAAACAAGAT TCCCGAAAAA CGAGTAAATC 3180 TGTCTGAAAT TTCAAGTACT TGGCTTCTTG CTGAGCAACG GTTAGACCTT GAGATAATAG 3240 AAATTGTTAA CAAATTGGAG TCAGATGAAT TAGCCGAAAA CTTGGCCAAA ACGTATGATT 3300 TGCGAAAAGG TGTATTATAT CGCAAGGTCC AAAGACGAGG TAGAACAAGT TATTTACCAG 3360 TTGTACCCAG AGCTTTCAAA TGGTCAGTAA TTAACCAGGT ACACGAGTCG ATAATGCATT 3420 TAGGGTGGCA AAAGACACTT GATAAAGTGT ACCAGTATTA TTGGTTCGCT AAAATGAACA 3480 AGTATGTTCG AAAATTTGTT TCAAACTGCA TAACTTGTAG ATCAGTGAAA TCATCTTCCG 3540 GGAAGGTTCA GGCGGAACTT CATTCCATTC CGAAGACAAG TATACCGTGG CACACCATCC 3600 ACATAGATAT AACGGGGAAA TTAAGTGGCA AGAGCGATTT GAAGGAGTAT GTCATTGTTC 3660 AGATCGATGC CTATACAAAG TTTGTTTATC TGTATCACAC CTTAAAGATA GATGCCGAAA 3720 GCTGTGTTAA TGCTATGAAA TCTTCCATAT CCTTATTTGG AGTACCAGAT CGCATTATCG 3780 CCGACCAGGG CAGATGTTTT ACTAGCTCTA AGTTTTCAGA GTTTTGCGTA TCGCAGAAAG 3840 TTGAACTTCA CTTGATTGCT ACGGGAATGA GCCGTGCAAA TGGGCAAGTG GAACGGGTGA 3900 TGGAAACACT GAAAAATTTG TTGTCAGTGG TAGAATCAAG TCAACGATCG TGGCAGGACG 3960 CACTTGGCGA AGTCCAACTT GCACTGAATT GTACAATTTC TCGTGCCACT GATGCAAGTC 4020 CGTTAGAAAT GTTAATTGGT AAACAGGCTC GACCCCTTGG ATTAGTTCCC CCATGTGAGA 4080 CCGAATGTGA AATAGATTTG GCAACTGTTA GAGCTCATGC GACAGAAAAT ATGAATTCCT 4140 TAGCGTCTTA CGACAAATCC CGATTTGATA GCAGTAGAGC AGCCGTTGAC AAACACCACG 4200 TAGGTGACTA TGTGCTATTG AGGAATGAAG AAAGACACCA AACTAAGTTA GATCCGAAAT 4260 TCAGAGGACC GTTTTTGGTA ACTGAAGTAT TAGAGGGTGA CAGGTATACA CTAAAGTCGT 4320 TGACGAGTAA CCGATCGTTC AAGTATTGCC ATGAATCAAT CAAAATGCCG GATGCAGAAA 4380 TCCCGAATGA GTTAAACGAG AATGTAGAGC AATAGCTGAA ATATAGAAAC AGTTGAATGA 4440 AAAGAAAAGC CCGCCAATGA GTTCTTTTGT GAACGAGAGA TATCCGTCTA GGTGAGACGA 4500 TGAATTGTGA GTTATCCGTC TAGGTGAGAC GATGAATTGT GAGTTATCCG TCTAGGTGAG 4560 ACGATGAATT GTCAGTTATC CGTCAGGTGA GACGATGAAT TGTGAGTTAT CCGTCCAGGA 4620 GAGACGATGA GTTTGGATTG AATTAATAAT CAAGTGTGTG TGAACTGGCG GAAGATCGAT 4680 ATATAGAAAT CGATAAATGA TAATGTTAAG ATAAGTTGTG AGCTGATGTA TTACTGATCA 4740 ATGGAACTGA ATATGAAAAT AGAATAAGTT ATCCCAGCAA CAGTGAAATA AGAGCTGTTT 4800 TGTTTCTTCA CAGAATTAAG ATTTAAGAAA TACACCTGAT AAAGTCAAAC TAATGAAATT 4860 AAATGTTATT GAATAGTGAT GAAAGTAGGT GATCTTGATA TCTTGGTATC TCGGTATCAA 4920 AAGCTTACAC GAGGACGTGA AATGTCAGAA TGGCCGTGTC GTGGCGAAAA TAATGAGTAT 4980 GCGTGTAGTC GCTGTTTACT TCTTCTCCAT GTTCCCTTTG CTATTATGCG TGTTCCTATT 5040 TATGAACACG TGGCGAAAAT AATGAATGCG CGTAGTCCGG TTTACTTCTT CTCCATGTTC 5100 CCTTTGCTAT TATGCGTGTT CCTATTTATT GTCAATGTGT GAGGATGAAT AGATGAATTA 5160 TCTATGAACG GGATTTTGCA AAAACGAGAG CGATAGAGCT GTTGCTGAAC GTGGCCACTT 5220 GCGCTGCTTG GTTAGAAAGG GAAAACTATA TAATGAAAAG GGAATGCCAA AAATTGAGAA 5280 GAGACAAAGC AGGCTGCACG AAACTGGAGT GAGGGCATTA ATCGTGGAGA AGCCAAAGCA 5340 GACGCAAGTG GACTCGTTGA CTGCGCACAG CTGCATAAAA TTATATAGTA AAAAGAGATT 5400 TGAGCGACGC TGATATGGAC GGACGGACGG ACGCGAGGCC CCTGATATTC TTAACCCGAC 5460 A 5461 // ID PPI251 standard; DNA; SYN; 2907 BP. XX AC X06779; V01520; X69493; XX DR FLYBASE; FBgn0003055; P-element. XX FT source X06779:996..3902 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..31 FT SO_feature terminal_inverted_repeat ; SO:0000481:2877..2907 FT SO_feature CDS ; SO:0000316:join(153..442,501..1168,1222..1947,2138..2709) FT /db_xref="FLYBASE:FBgn0013311; P\T" FT /translation=" FT MKYCKFCCKAVTGVKLIHVPKCAIKRKLWEQSLGCSLGENSQICDTHFND FT SQWKAAPAKGQTFKRRRLNADAVPSKVIEPEPEKIKEGYTSGSTQTESCS FT LFNENKSLREKIRTLEYEMRRLEQQLRESQQLEESLRKIFTDTQIRILKN FT GGQRATFNSDDISTAICLHTAGPRAYNHLYKKGFPLPSRTTLYRWLSDVD FT IKRGCLDVVIDLMDSDGVDDADKLCVLAFDEMKVAAAFEYDSSADIVYEP FT SDYVQLAIVRGLKKSWKQPVFFDFNTRMDPDTLNNILRKLHRKGYLVVAI FT VSDLGTGNQKLWTELGISESKTWFSHPADDHLKIFVFSDTPHLIKLVRNH FT YVDSGLTINGKKLTKKTIQEALHLCNKSDLSILFKINENHINVRSLAKQK FT VKLATQLFSNTTASSIRRCYSLGYDIENATETADFFKLMNDWFDIFNSKL FT STSNCIECSQPYGKQLDIQNDILNRMSEIMRTGILDKPKRLPFQKGIIVN FT NASLDGLYKYLQENFSMQYILTSRLNQDIVEHFFGSMRSRGGQFDHPTPL FT QFKYRLRKYIIARNTEMLRNSGNIEEDNSESWLNLDFSSKENENKSKDDE FT PVDDEPVDEMLSNIDFTEMDELTEDAMEYIAGYVIKKLRISDKVKENLTF FT TYVDEVSHGGLIKPSEKFQEKLKELECIFLHYTNNNNFEITNNVKEKLIL FT AARNVDVDKQVKSFYFKIRIYFRIKYFNKKIEIKNQKQKLIGNSKLLKIK FT L" FT SO_feature CDS ; SO:0000316:join(153..442,501..1168,1222..1994) FT /db_xref="FLYBASE:;" FT /translation=" FT MKYCKFCCKAVTGVKLIHVPKCAIKRKLWEQSLGCSLGENSQICDTHFND FT SQWKAAPAKGQTFKRRRLNADAVPSKVIEPEPEKIKEGYTSGSTQTESCS FT LFNENKSLREKIRTLEYEMRRLEQQLRESQQLEESLRKIFTDTQIRILKN FT GGQRATFNSDDISTAICLHTAGPRAYNHLYKKGFPLPSRTTLYRWLSDVD FT IKRGCLDVVIDLMDSDGVDDADKLCVLAFDEMKVAAAFEYDSSADIVYEP FT SDYVQLAIVRGLKKSWKQPVFFDFNTRMDPDTLNNILRKLHRKGYLVVAI FT VSDLGTGNQKLWTELGISESKTWFSHPADDHLKIFVFSDTPHLIKLVRNH FT YVDSGLTINGKKLTKKTIQEALHLCNKSDLSILFKINENHINVRSLAKQK FT VKLATQLFSNTTASSIRRCYSLGYDIENATETADFFKLMNDWFDIFNSKL FT STSNCIECSQPYGKQLDIQNDILNRMSEIMRTGILDKPKRLPFQKGIIVN FT NASLDGLYKYLQENFSMQYILTSRLNQDIVEHFFGSMRSRGGQFDHPTPL FT QFKYRLRKYIIGMTNLKECVNKNVIP" FT SO_feature intron ; SO:0000188:443..500 FT SO_feature intron ; SO:0000188:1169..1221 FT SO_feature intron ; SO:0000188:1948..2137 XX CC Derived from X06779 (g58305) (Rel. 49, Last updated, Version 8). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. CC CDS annotation from Lynn Crosby's annotation 'P-element.v010'. XX SQ Sequence 2907 BP; 989 A; 491 C; 582 G; 845 T; 0 other; CATGATGAAA TAACATAAGG TGGTCCCGTC GAAAGCCGAA GCTTACCGAA GTATACACTT 60 AAATTCAGTG CACGTTTGCT TGTTGAGAGG AAAGGTTGTG TGCGGACGAA TTTTTTTTTG 120 AAAACATTAA CCCTTACGTG GAATAAAAAA AAATGAAATA TTGCAAATTT TGCTGCAAAG 180 CTGTGACTGG AGTAAAATTA ATTCACGTGC CGAAGTGTGC TATTAAGAGA AAATTGTGGG 240 AGCAGAGCCT TGGGTGCAGC CTTGGTGAAA ACTCCCAAAT TTGTGATACC CACTTTAATG 300 ATTCGCAGTG GAAGGCTGCA CCTGCAAAAG GTCAGACATT TAAAAGGAGG CGACTCAACG 360 CAGATGCCGT ACCTAGTAAA GTGATAGAGC CTGAACCAGA AAAGATAAAA GAAGGCTATA 420 CCAGTGGGAG TACACAAACA GAGTAAGTTT GAATAGTAAA AAAAATCATT TATGTAAACA 480 ATAACGTGAC TGTGCGTTAG GTCCTGTTCA TTGTTTAATG AAAATAAGAG CTTGAGGGAA 540 AAAATTCGTA CTTTGGAGTA CGAAATGCGT CGTTTAGAGC AGCAGCTGAG GGAGTCTCAA 600 CAGTTGGAGG AGTCTCTACG CAAAATCTTC ACGGACACGC AGATACGGAT ACTGAAGAAT 660 GGTGGACAAA GAGCTACGTT CAATTCCGAC GACATTTCTA CAGCTATTTG TCTCCACACC 720 GCAGGCCCTC GAGCGTATAA CCATCTGTAC AAAAAAGGAT TTCCTTTGCC CAGTCGTACG 780 ACTTTGTACA GATGGTTATC AGATGTGGAC ATAAAAAGAG GATGTTTGGA TGTGGTCATA 840 GACCTAATGG ACAGTGATGG AGTTGATGAC GCCGACAAGC TTTGCGTACT CGCTTTCGAC 900 GAGATGAAGG TCGCTGCTGC CTTCGAGTAT GACAGCTCTG CTGATATTGT TTACGAGCCA 960 AGCGACTATG TCCAACTGGC TATTGTTCGT GGTCTAAAAA AATCGTGGAA GCAGCCAGTT 1020 TTTTTCGATT TTAATACCCG AATGGACCCG GATACTCTTA ACAATATATT AAGGAAACTG 1080 CATAGGAAAG GATATTTAGT AGTTGCTATT GTATCCGATT TAGGTACCGG AAACCAAAAG 1140 CTATGGACAG AGCTCGGTAT ATCAGAATGT AAGTTTCGTA TATTACAAAA ATCAGATAAT 1200 CCTTGAAATT CCATTTTTTA GCAAAAACCT GGTTTAGCCA TCCTGCAGAT GACCATTTAA 1260 AGATTTTCGT TTTTTCGGAT ACGCCACATT TAATTAAGTT AGTCCGTAAC CACTATGTGG 1320 ATTCCGGATT AACAATAAAT GGGAAAAAAT TAACAAAAAA AACAATTCAG GAGGCACTTC 1380 ATCTTTGCAA CAAGTCCGAT CTGTCTATCC TCTTTAAAAT TAATGAAAAT CACATTAATG 1440 TTCGATCGCT CGCAAAACAG AAGGTTAAAT TGGCTACCCA GCTGTTTTCG AATACCACCG 1500 CTAGCTCGAT CAGACGCTGC TATTCATTGG GGTATGACAT TGAAAATGCC ACCGAAACTG 1560 CGGACTTCTT CAAATTGATG AATGATTGGT TCGACATTTT TAATTCTAAA TTGTCCACAT 1620 CCAATTGCAT TGAGTGCTCG CAACCTTATG GCAAGCAGTT GGATATACAG AATGATATTT 1680 TGAATCGAAT GTCGGAAATT ATGCGAACAG GAATTCTGGA TAAACCCAAA AGGCTCCCAT 1740 TTCAAAAAGG TATCATTGTG AATAATGCTT CGCTTGATGG CTTGTATAAA TATTTGCAAG 1800 AAAACTTCAG TATGCAATAC ATATTAACAA GCCGTCTCAA CCAAGACATT GTGGAGCATT 1860 TTTTTGGCAG CATGCGATCG AGAGGTGGAC AATTCGACCA TCCCACTCCA CTGCAGTTTA 1920 AGTATAGGTT AAGAAAATAT ATAATAGGTA TGACAAATTT AAAAGAATGC GTAAACAAAA 1980 ATGTAATTCC ATGATTTATA ATTGTTTAAT GTTTAGCTAT ATGTTTCAGG AAAGTTTCAG 2040 TTGAGAATGT AGGTAGTTAT GTGCTGTCTA TTGTGTTTTG TCTTTTATCT GTTTCTTTTC 2100 ATTTTATTAT TTAATCATTA TCCTTTTGCT TATCCAGCCA GGAATACAGA AATGTTAAGA 2160 AATTCGGGAA ATATCGAAGA GGACAACTCT GAAAGCTGGC TTAATTTAGA TTTCAGTTCT 2220 AAAGAAAACG AAAATAAAAG TAAAGATGAT GAGCCTGTCG ATGATGAGCC TGTCGATGAG 2280 ATGTTAAGCA ATATAGATTT CACCGAAATG GATGAGTTGA CGGAGGATGC GATGGAATAT 2340 ATCGCGGGCT ATGTCATTAA AAAATTGAGA ATCAGTGACA AAGTAAAAGA AAATTTGACA 2400 TTTACATACG TCGACGAGGT GTCTCACGGC GGACTTATTA AGCCGTCCGA AAAATTTCAA 2460 GAGAAGTTAA AAGAGCTAGA ATGTATTTTT TTGCATTATA CAAATAATAA TAATTTTGAA 2520 ATTACAAATA ATGTAAAGGA AAAATTAATA TTAGCAGCGC GAAACGTCGA TGTTGATAAA 2580 CAAGTAAAAT CTTTTTATTT TAAAATTAGA ATATATTTTA GAATTAAGTA CTTCAACAAA 2640 AAAATTGAAA TTAAAAATCA AAAACAAAAG TTAATTGGAA ACTCCAAATT ATTAAAAATA 2700 AAACTTTAAA AATAATTTCG TCTAATTAAT ATTATGAGTT AATTCAAACC CCACGGACAT 2760 GCTAAGGGTT AATCAACAAT CATATCGCTG TCTCACTCAG ACTCAATACG ACACTCAGAA 2820 TACTATTCCT TTCACTCGCA CTTATTGCAA GCATACGTTA AGTGGATGTC TCTTGCCGAC 2880 GGGACCACCT TATGTTATTT CATCATG 2907 // ID DMPOGOR11 standard; DNA; INV; 2121 BP. XX AC X59837; S90749; XX DR FLYBASE; FBgn0003122; pogo. XX FT source X59837:1..2121 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..21 FT SO_feature terminal_inverted_repeat ; SO:0000481:2101..2121 FT SO_feature intron ; SO:0000188:1438..1541 XX CC Derived from X59837 (g8354) (Rel. 45, Last updated, Version 10). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. CC K. O'Hare, Personal communication to FlyBase, 1 May 2000. CC This is probably complete element. XX SQ Sequence 2121 BP; 724 A; 353 C; 399 G; 645 T; 0 other; CAGTATAATT CGCTTAGCTG CATCGATAGT TAGCTGCATC GGCAAGATAT CTGCATTATT 60 TTTCCATTTT TTTGTGTGAA TAGAAAATTT GTACGAAAAT TCATACGTTT GCTGCATCGC 120 AGATAACAGC CTTTTTAACT TAAGTGCATC ATATCAGCTG TTTTTTTTGC CAATTTCAAT 180 GAATATCATC AAAGTTAGCT GCGCCATCTA TGAATCATTT TTGCATATCT AAAAGATGCA 240 AGAATGCCAA CTCGTTTCAG TATCTGCGCA TGTCCGTTTT TGTTTTTGCT TTGATCGTGA 300 TTTTTGTGTT TTTGTTTCTT ATGGCACAAA GTTATTAAAA TGGGTAAAAC AAAGCGTGTC 360 GTTGGACTAA CACTAAAGGA AAAGCTTCAA ATAATCGAGT TAGTGACCAA CAAAGTGGAC 420 AAAAAGGAAA TTTGTGCCAA GTTCAAATGC GACAGATCCA CAGTCAACCG CATTTTACAA 480 AAAACAAATG AAATTCATGA AGCTGTGGCC GCGTCAGGTT TAAAAAGAAA GCGTCAAAGA 540 AAAGGAGCGC ACGACTTAGT AGAAGAAGCC TTATACATTT GGTTCGGACA GCAGGAATCA 600 AAGAACGTAA TTCTTGACCG GCACGTCATA TTAGCAAAAG CGAAAGAATT TTGCCAAAAA 660 TTTAACGACG CCTTTGAACC TGACGCCAGC TGGCTTTGGC GCTGGCGCAA GCGCCACAAT 720 ATAAAGTATG GCAAAATACA CGGCGAAACT GCTACAAATG ATTCCGTATC AGCAAATGAG 780 TACAAAAATG ATATTTTGCC AGGATTGCTT AAAGGTTATA ACCCAGAAGA CATTTTTAAC 840 GCTGACGAAA CTGCACTCTT TTATAAAGCA ATGCCGAATG CGACATTTTT TACTTGTGGA 900 AAGCAATTAA ATGGCCAGAA ATCTCAGAGA GTGAGACTTA CTTTGCTGTT TATATGCAAT 960 GCAACTGGGA CATACAAAAA AACTTTTGTA ATCGGCAGAT CTAAATCGCC ACGATGCTTC 1020 AAGAATGCTA ATGTGCCCAT TCCGTACTAT GCAAATAAGA AGGCCTGGAT GACTAAGGAT 1080 CTCTGGCGAA AAATAATGAC AGGATTTGAC GAAGAAATGA AAAAGCAAAA TCGAAAGATT 1140 TTACTCTTCA TCGACAATGC AACTAGTCAC ACGACTGTCA AGGACTTCGA AAACATAAAA 1200 TTGTGCTTCA TGCCACCAAA CGCAACGGCT CTACTTCAAC CTCTGGACCA AGGTATTATC 1260 CACTCATTCA AATTAGAGTA TAGGCGTATT TTGGTCAAAC AGCAGCTCAT TGCTGTTAAT 1320 TGTGGTAAAT CTACTGTGGA ATTTTTAAAA TCATTATCGT TATTGGATGC TCTATATTTT 1380 GTCAACCAAG GATGGAAGAA TGTTAAAATG TTAACTATTC AGAATTGTTT TAAAAAGGTA 1440 AGATGGGATT ATTATTGATA TGTATCTCAA ATAACGAATT TATTATTTTC AGGCTGGATT 1500 TAAGTTCAGT TTTGAAAATG AAGACACCAT TGCTGAAAAA GACAAACAAT GCGTAGAAGT 1560 TGACATTGTA TCGAATATTA ATTGGAATGA ATATGCCAAT GTTGATGCAG ATGAGGCTTG 1620 CCATGGTCAA TTAGATGATG ATGAAATCGT GCGCTCTTTA GTTCAAGATG CAAAAACCAG 1680 CGATAACGAA GAAAGCCATA GTGATGAAGA TGTGGACGAT ACTGAGCGTC CTACTTTTAA 1740 GGATGGGTTT GCAGCAATTA AGGCTTTAAA GTCCATTTTT ATGCGAAACA ATAATGATGA 1800 GTTTTTGCAA AACTTGAATT CTATGGAAGA CAAGCTGTTT AATTTACATA TAAACTCAGC 1860 TGTATTGCAA AAAAAAATTA CTGACTATTT TTAAGTTAGT TTTAAAAAGT GTTTTAATCA 1920 ATTCACCATC ACTTAAATTT ATATGTCGAT CTTACTTATC ATTAAGAATG AAATTATCAG 1980 TTCCTTTTAT GTTTAACATT GTTATAAAGA AATAAATTCT TTATTTTTCC TTAAAAAAAA 2040 AAATTAAGTT AGCTGCATTT TTAAGTTACC TGCATCGAGG CATTGTGCAA AGTACTCGAG 2100 GCAGCTAAGC GAATTATACT G 2121 // ID DMRER1DM standard; DNA; INV; 5356 BP. XX AC X51968; XX DR FLYBASE; FBgn0003908; R1A1-element. XX FT source X51968:1..5356 FT SO_feature CDS ; SO:0000316:319..1731 FT /db_xref="FLYBASE:FBgn0044825; R1A1-element\ORF1" FT /db_xref="SWISS-PROT:P16424" FT /protein_id="CAA36226.1" FT /translation="PVSASIRLLDSSKGGATIGATPMESDSSVSALSGSSASKVSRRGR FT RRSHLASKSSAPTQAKLVALASNGVPEPVGVLEEAFSSLEDARAATSNAANDAAPPAAA FT PAVDHTVAPDVSTAAKIAATTATAATAAARAGQAAMMAELSATQRMVRNSFRSLGGVDT FT EELSCAISRYDELVMALMLRCGELETRLAMPPPPPPPSKANTTAANAPQMPQVAPIAAP FT RTTKVRETWSAVVKCDDPALSGKAIAEKVRTMVAPSLGVRVHEVRELPSRWWCDHSYSS FT VGELQKVMASKRFAELGLNVARNAAEKPKVIVYDVDTAIGPEEFMQELHENNFDSEMTL FT AQFKKSVHLVTKAWSATDGATVNVTLEVDDRAMAKLDVGRVYIKWFSFRCRSQVRTYAC FT HRCVGFDHKVSECRQKESVCRQCGQQGHTAAKCQNPVDCRNCRHRGQPSGHYMLSNACP FT IYGALLARVQARH" FT SO_feature CDS ; SO:0000316:1728..4790 FT /db_xref="FLYBASE:FBgn0044824; R1A1-element\ORF2" FT /db_xref="SWISS-PROT:P16425" FT /protein_id="CAA36227.1" FT /translation="TLMFSFIQANCGRGRAATIELGVRLRRSESMFALVQEPYLGGDEM FT DVLPEGMRVFTDRRGKAAILVDHQEAICMPVETLTTDYGVCLVVKGSFGSIFLCAAYCQ FT FDAPLEPYLRYMDAVLLQASRTPAILGLDANAVSPMWLSKLSRHAEGQANYRRGELLSE FT WMLEARVAALNQSTEVYTFDNHRATSDIDVTIVNEAASMWATYEWRVDEWELSDHNIIT FT VVAEPTTARSVESIAPVPSWNFSNARWRLFKEEMVSRIAELPENFSESPLDQQVSTLRS FT IVHSVCDTALGRKLTRSPSRRARWWTADLCAARREVRRLRRLLQDGRRRDDDAAVELVV FT VELRRASAYYKKLIGRAKMDDWKRFVGDHADDPWGRVYKICRGRRKCTEIGCLRVNGEL FT ITDWGDCARVLLRNFFPVAESEAPTAIAEEVPPALEVFEVDTCVARLKSRRSPGLDGIN FT GTICKAVWRAIPEHLASLFSRCIRLGYFPAEWKCPRVVSLLKGPDKDKCEPSSYRGICL FT LPVFGKVLEAIMVNRVREVLPEGCRWQFGFRQGRCVEDAWRHVKSSVGASAAQYVLGTF FT VDFKGAFDNVEWSAALSRLADLGCREMGLWQSFFSGRRAVIRSSSGTVEVPVTRGCPQG FT SISGPFIWDILMDVLLQRLQPYCQLSAYADDLLLLVEGNSRAVLEEKGAQLMSIVETWG FT AEVGDCLSTSKTVIMLLKGALRRAPTVRFAGRNLPYVRSCRYLGITVSEGMKFLTHIAS FT LRQRMTGVVGALARVLRADWGFSPRARRTIYDGLMAPCVLFGAPVWYDTAEQVAAQRRL FT ASCQRLILLGCLSVCRTVSTVALQVLGGAPPLDLAAKLLAIKYKLKRGFPLEENDWLYG FT EDIACLSWEQRKTRLEECLIQSWQNRWDDDSEPGRVTHRFIPYVTLAYRDPSFGFSMRT FT SFLLTGHGSFNAFLHGRALSDTTACACGDPYEDWMHILCACPLYADLRDLDGLGVQRLG FT ENWIFEGILDDQEKTQRLAMFAEEVFLRRRAL" XX CC Derived from X51968 (g8429) (Rel. 23, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5356 BP; 1213 A; 1340 C; 1634 G; 1169 T; 0 other; CGGACGTGTT TTCGTTGCGC TCGTGGACAT AGTGCGAAGA ACTTTGTTTT CCGTATTTGG 60 AAGTATACGG AATAAATAAT TTAGTGTTCC GTGGAAGTGG TGCGCAAATT TTCGCGAATT 120 AAAAACAAGC GGTTTGGAAG TAATTGACAA TAAATTATTG GAAATTTTCC ACTCCGCACG 180 TGTTGAGCGG CGGAGCTTGC GGGTGAGCTT TTCGAACAGC TGAGAGAAGC TTATTGGTGG 240 TAGTCACCGC TAAGGATTGT GTCTTGGGAC AGCTTAGTGC ACTCTACCAA TAGGTGGAGC 300 TATCACCATA GCAACTAGCC CGTGTCAGCG AGCATACGAT TGCTGGACTC GTCAAAAGGA 360 GGAGCCACCA TCGGAGCAAC GCCGATGGAG AGCGACAGCA GTGTGAGTGC CTTGAGCGGA 420 AGCAGTGCCT CAAAGGTGTC AAGACGAGGC AGGCGTAGGA GCCATCTGGC CTCCAAGAGC 480 TCGGCGCCAA CGCAGGCGAA ACTGGTTGCC CTGGCCTCGA ATGGAGTGCC GGAACCCGTT 540 GGTGTGCTGG AGGAGGCGTT TTCGTCGCTG GAGGATGCCC GGGCGGCTAC GTCAAACGCT 600 GCCAACGATG CTGCCCCCCC CGCTGCTGCC CCCGCTGTTG ATCACACTGT TGCCCCTGAT 660 GTTTCCACTG CTGCTAAAAT CGCTGCCACC ACTGCCACCG CTGCCACCGC TGCCGCCCGT 720 GCTGGGCAAG CAGCCATGAT GGCAGAGCTG TCGGCCACCC AGCGCATGGT GCGAAACAGT 780 TTCCGCAGCC TAGGAGGCGT AGACACGGAA GAGCTCTCGT GTGCCATCAG CCGCTATGAT 840 GAGCTGGTGA TGGCATTAAT GCTCCGGTGT GGAGAACTGG AGACGCGGCT CGCTATGCCA 900 CCACCGCCGC CGCCGCCGTC CAAGGCGAAC ACTACTGCCG CCAATGCTCC CCAGATGCCT 960 CAGGTTGCAC CCATCGCTGC CCCGCGGACA ACCAAGGTTC GTGAGACGTG GTCAGCGGTG 1020 GTGAAGTGCG ACGACCCTGC GCTATCGGGG AAAGCCATAG CCGAAAAGGT GCGGACGATG 1080 GTTGCACCCT CCCTCGGAGT CAGAGTACAC GAGGTACGTG AGCTGCCGTC GAGGTGGTGG 1140 TGCGATCATT CGTACTCTTC GGTTGGAGAG CTGCAGAAGG TGATGGCATC GAAAAGATTC 1200 GCAGAACTTG GACTGAATGT GGCACGGAAC GCGGCCGAGA AGCCGAAGGT CATAGTCTAT 1260 GACGTCGACA CAGCCATCGG CCCAGAAGAG TTCATGCAGG AGCTTCACGA GAACAACTTC 1320 GACAGTGAAA TGACTCTGGC CCAGTTCAAA AAGTCGGTGC ACCTGGTGAC CAAGGCGTGG 1380 TCGGCTACTG ACGGTGCCAC CGTAAACGTG ACGCTAGAGG TAGACGACCG GGCGATGGCG 1440 AAACTTGATG TAGGACGTGT CTACATTAAG TGGTTTTCGT TCCGATGCCG ATCGCAAGTC 1500 CGCACCTATG CCTGCCACAG ATGTGTGGGT TTCGACCACA AGGTTAGTGA ATGCAGGCAG 1560 AAGGAGAGTG TTTGCCGCCA GTGCGGGCAA CAAGGCCACA CCGCGGCAAA GTGCCAAAAC 1620 CCGGTGGACT GCCGGAACTG CCGTCACAGA GGGCAACCTT CGGGGCATTA TATGCTCTCG 1680 AATGCTTGCC CGATATACGG AGCGTTGTTA GCGAGGGTGC AAGCTAGACA CTAATGTTTA 1740 GCTTCATCCA AGCGAACTGT GGCCGAGGCA GAGCTGCGAC CATCGAGCTC GGAGTCCGAC 1800 TCAGGAGATC GGAGTCAATG TTTGCTCTGG TGCAGGAGCC GTATCTTGGC GGGGATGAAA 1860 TGGATGTGCT GCCTGAAGGA ATGAGGGTTT TCACCGACCG GCGAGGGAAG GCAGCCATCC 1920 TAGTGGATCA TCAGGAAGCC ATCTGCATGC CAGTGGAAAC TCTCACCACA GATTATGGCG 1980 TATGTCTGGT CGTTAAAGGG AGTTTTGGCT CAATCTTCCT TTGCGCCGCA TACTGCCAGT 2040 TCGATGCACC TCTGGAACCG TACCTCCGGT ACATGGATGC GGTCCTGCTG CAGGCCAGCA 2100 GAACCCCCGC AATCCTGGGC CTCGACGCGA ATGCAGTGTC CCCCATGTGG CTTAGCAAAC 2160 TCTCTCGTCA TGCCGAGGGG CAAGCTAACT ACAGACGGGG TGAGCTGCTG TCTGAGTGGA 2220 TGCTGGAGGC AAGAGTCGCC GCCCTAAACC AGTCAACAGA GGTGTACACG TTCGATAATC 2280 ACAGAGCGAC TAGTGATATC GACGTGACAA TCGTCAATGA AGCAGCATCT ATGTGGGCCA 2340 CATATGAGTG GAGAGTGGAC GAGTGGGAAT TGAGTGACCA CAACATCATT ACTGTTGTGG 2400 CCGAACCAAC TACCGCGCGC TCAGTTGAGA GCATAGCTCC TGTGCCGTCC TGGAACTTCT 2460 CCAATGCACG TTGGCGATTG TTCAAGGAGG AAATGGTGAG TAGAATAGCC GAACTTCCGG 2520 AAAACTTTTC AGAGTCGCCG TTGGACCAGC AAGTTTCGAC CCTGCGCAGT ATAGTACATA 2580 GTGTATGTGA TACTGCGCTA GGAAGGAAGT TGACTCGATC GCCCAGCAGG AGAGCACGTT 2640 GGTGGACTGC CGACCTCTGC GCTGCAAGGC GCGAAGTCCG AAGACTTCGT CGCCTGCTCC 2700 AAGATGGAAG GCGTCGAGAT GACGATGCCG CTGTAGAGCT TGTAGTGGTC GAGCTGAGGC 2760 GTGCCTCAGC CTACTACAAG AAGCTCATTG GAAGGGCGAA GATGGATGAC TGGAAACGCT 2820 TCGTGGGAGA TCATGCCGAC GACCCATGGG GGCGCGTCTA CAAGATTTGC CGAGGTCGCA 2880 GGAAGTGCAC GGAGATTGGG TGCCTCCGCG TGAATGGCGA GCTGATCACT GATTGGGGTG 2940 ACTGCGCACG AGTGCTCCTC CGCAATTTTT TCCCAGTTGC GGAGTCCGAA GCACCGACTG 3000 CCATCGCGGA GGAAGTCCCA CCGGCCCTCG AAGTATTCGA GGTTGATACA TGTGTTGCCC 3060 GGCTGAAGAG CAGGCGCTCT CCCGGGTTGG ACGGCATCAA TGGCACTATC TGCAAGGCAG 3120 TCTGGCGCGC CATACCCGAG CACCTAGCAT CATTGTTTTC CCGATGCATC CGATTGGGAT 3180 ACTTTCCAGC CGAGTGGAAG TGCCCACGAG TTGTCTCGTT GCTCAAAGGG CCAGATAAGG 3240 ACAAGTGTGA GCCCTCCTCA TACAGAGGAA TATGCTTGCT ACCAGTCTTT GGAAAGGTGC 3300 TCGAGGCCAT CATGGTGAAT CGTGTGAGAG AAGTTCTTCC GGAAGGCTGC AGATGGCAAT 3360 TCGGATTTCG CCAAGGACGA TGTGTGGAGG ATGCTTGGAG GCACGTGAAG AGCAGTGTTG 3420 GCGCCAGCGC GGCGCAATAC GTGCTCGGCA CATTCGTGGA CTTCAAAGGA GCATTCGACA 3480 ACGTCGAATG GAGTGCTGCA CTCAGCCGAC TAGCCGACTT GGGATGCCGG GAAATGGGCT 3540 TGTGGCAGAG CTTTTTCTCC GGCCGAAGAG CAGTGATCCG AAGCAGTTCC GGTACTGTGG 3600 AGGTACCGGT AACTAGAGGC TGCCCGCAGG GATCAATCAG CGGCCCATTT ATCTGGGACA 3660 TACTGATGGA TGTACTGCTT CAGCGTCTCC AGCCGTATTG CCAGCTGAGT GCATACGCGG 3720 ATGACTTGCT GCTTCTCGTC GAGGGAAATT CCCGAGCTGT GCTAGAGGAA AAAGGAGCGC 3780 AACTAATGTC CATCGTAGAA ACGTGGGGAG CGGAAGTTGG CGATTGCCTC TCGACCAGCA 3840 AGACGGTAAT CATGCTGCTG AAAGGTGCCT TGAGACGTGC GCCTACGGTG AGGTTTGCTG 3900 GACGGAACCT TCCGTATGTG CGTAGCTGTC GGTACCTTGG CATCACGGTC AGTGAAGGAA 3960 TGAAATTCCT CACGCACATA GCTTCGCTTC GCCAGCGGAT GACAGGAGTC GTTGGAGCAT 4020 TGGCGCGTGT GCTTCGAGCC GACTGGGGCT TCAGTCCTCG AGCCAGGCGG ACCATATATG 4080 ACGGACTCAT GGCACCTTGT GTGCTGTTTG GTGCCCCGGT ATGGTATGAC ACCGCGGAAC 4140 AAGTAGCTGC CCAGAGGCGA CTAGCCTCCT GCCAGAGGCT AATCCTGCTT GGATGCCTTT 4200 CGGTATGCCG AACAGTATCC ACAGTGGCAC TGCAGGTACT TGGTGGAGCT CCCCCGCTTG 4260 ATCTGGCTGC TAAGTTATTA GCGATCAAAT ACAAGCTAAA ACGTGGATTC CCGCTGGAGG 4320 AGAACGACTG GCTTTACGGC GAGGACATTG CGTGTCTTAG CTGGGAGCAG AGGAAGACTC 4380 GCCTAGAGGA GTGTTTAATC CAGAGTTGGC AGAACAGATG GGACGATGAC AGCGAACCAG 4440 GACGGGTGAC GCATAGGTTT ATCCCATACG TCACTCTTGC CTATCGGGAT CCAAGTTTTG 4500 GATTCTCGAT GAGGACGTCT TTCCTGCTTA CAGGGCACGG GTCGTTCAAT GCATTTTTGC 4560 ACGGGAGAGC CCTCAGCGAT ACCACTGCTT GCGCATGTGG AGATCCATAT GAGGACTGGA 4620 TGCATATCTT GTGCGCTTGC CCCCTATATG CAGATCTGCG GGACCTAGAT GGACTTGGAG 4680 TGCAGCGCCT TGGCGAAAAC TGGATCTTCG AGGGAATCCT GGATGATCAA GAGAAGACTC 4740 AACGGCTGGC AATGTTTGCG GAAGAAGTGT TCCTGAGGAG GAGGGCCCTT TAGCTCAACA 4800 TCTCTGCCGT GTGGTTAGCG GGCGAGAATA CTACCACAGT CCGCTGTTGC TTGTCGTAAG 4860 AGACGACTAA TACAGCGATA GGATTCCTCT AACCCTGCTT GTCGGAGCAA AAGGGGGAGG 4920 CCCACCGAGC CTCTTTTCGG TACCACGGGT TGAGCAGCTA TCCAAGACTG CTCATTGAGG 4980 TAGGCCCCCT GGTGGGAGTA TCGTGGTGGC TGTGGTTGGT ACCCATATCG CGGGTAGAGC 5040 CTTCATGCTC GACGTTTGAG TTACGGTGCT AGTTGCGCAA AACTCGGGTG CTGTGACCCA 5100 GAGATCAGTA GAGATTTTAG GTAGATCTCG CTCCTCAGCA AGGGGGAGTG CTTGCCCGGC 5160 AAGCAAGTAC TCGAATTGCT ACCGGGGTGG TCGCTATGTA CATAGCTATA GCTTCTAGTC 5220 CGGGACGCTT GTCTGGCGTA TCCAGACACA TGCACCATAT GCTCACTTGT GGGCGTATAG 5280 GGTGCCGTGG TTGTAATCCC TTCAGTGTGG AACACGCCAC GTAAAATAAG TTCGGAGGGA 5340 TCCGAAAAGC ATACAT 5356 // ID DMRER2DM standard; DNA; INV; 3607 BP. XX AC X51967; XX DR FLYBASE; FBgn0003909; R2-element. XX FT source X51967:1..3607 FT SO_feature CDS ; SO:0000316:181..3351 FT /db_xref="FLYBASE:FBgn0016699; R2-element\ORF" FT /db_xref="SWISS-PROT:P16423" FT /protein_id="CAA36225.1" FT /translation="FERKNFSDGLVPQRKFIHIGTTSTNNEPRIPLHNLMTTRPSVDIF FT PEDQYEPNAAATLSRVPCTVCGRSFNSKRGLGVHMRSRHPDELDEERRRVDIKARWSDE FT EKWMMARKEVELTANGCKHINKQLAVYFANRSVEAIKKLRQRGDYKEKIEQIRGQSALA FT PEVANLTIRRRPSRSEQDHQVTTSETTPITPFEQSNREILRTLRGYSPVECHSKWRAQE FT LQTIIDRAHLEGKETTLQCLSLYLLGIFPAQGVRHTLTRPPRRPRNRRESRRQQYAVVQ FT RNWDKHKGRCIKSLLNGTDESVMPSQEIMVPYWREVMTQPSPSSCSGEVIQMDHSLERV FT WSAITEQDLRASRVSLSSSPGPDGITPKSAREVPSGIMLRIMNLILWCGNLPHSIRLAR FT TVFIPKTVTAKRPQDFRPISVPSVLVRQLNAILATRLNSSINWDPRQRGFLPTDGCADN FT ATIVDLVLRHSHKHFRSCYIANLDVSKAFDSLSHASIYDTLRAYGAPKGFVDYVQNTYE FT GGGTSLNGDGWSSEEFVPARGVKQGDPLSPILFNLVMDRLLRTLPSEIGAKVGNAITNA FT AAFADDLVLFAETRMGLQVLLDKTLDFLSIVGLKLNADKCFTVGIKGQPKQKCTVLEAQ FT SFYVGSSEIPSLKRTDEWKYLGINFTATGRVRCNPAEDIGPKLQRLTKAPLKPQQRLFA FT LRTVLIPQLYHKLALGSVAIGVLRKTDKLIRYYVRRWLNLPLDVPIAFVHAPPKSGGLG FT IPSLRWVAPMLRLRRLSNIKWPHLTQNEVASSFLEAEKQRARDRLLAEQNELLSRPAIE FT KYWANKLYLSVDGSGLREGGHYGPQHGWVSQPTRLLTGKEYMDGIRLRINALPTKSRTT FT RGRHELERQCRAGCDAPETTNHIMQKCYRSHGRRVARHNCVVNRIKRGLEERGCVVIVE FT PSLQCESGLNKPDLVALRQNHIDVIDTQIVTDGHSMDDAHQRKINRYDRPDIRTELRRR FT FEAAGDIEFHSATLNWRGIWSGQSVKRLIAKGLLSKYDSHIISVQVMRGSLGCFKQFMY FT LSGFSRDWT" XX CC Derived from X51967 (g8432) (Rel. 24, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 3607 BP; 1064 A; 818 C; 900 G; 825 T; 0 other; TTGGGGATCA TGGGGTATTT GAGAGCAGAG GGGGAGTATT CTTCTGTAAT TCGTAAGTCA 60 TATCATATGA TGTGCGGAAG GGGAATTTTA CTCTGTAACT CACAAGTCTC TCCTTTACTC 120 AAGTCGACTC AAAACCTCCT CGTGGTGGTC CCGGTAATGC TAAACTCGTT TAGCAGCTAA 180 TTTGAGCGGA AAAACTTTTC CGATGGGCTG GTTCCCCAGA GGAAATTTAT TCATATTGGA 240 ACTACAAGCA CAAATAACGA GCCTCGGATA CCTTTACACA ATCTGATGAC GACCCGACCC 300 TCCGTGGATA TCTTCCCGGA GGACCAATAT GAACCAAACG CAGCGGCTAC TCTATCTAGG 360 GTTCCCTGCA CAGTATGTGG CCGGTCCTTT AACAGCAAGA GAGGACTCGG TGTTCACATG 420 CGATCTCGGC ACCCAGACGA ACTTGATGAA GAACGTCGAC GTGTCGATAT AAAGGCAAGA 480 TGGAGTGATG AAGAGAAGTG GATGATGGCG AGAAAGGAGG TTGAGCTCAC AGCAAATGGA 540 TGTAAACACA TAAACAAGCA ACTAGCGGTG TATTTTGCAA ACCGCAGCGT CGAAGCCATC 600 AAAAAGCTAA GACAGAGGGG CGATTATAAG GAGAAAATAG AGCAGATAAG AGGGCAATCA 660 GCTCTCGCCC CGGAAGTTGC TAATCTAACC ATAAGGCGCC GCCCTAGTAG AAGTGAGCAA 720 GACCACCAAG TAACAACATC GGAAACAACT CCAATCACTC CCTTCGAACA GTCGAACAGG 780 GAAATTTTGC GGACACTACG CGGGTATAGC CCCGTAGAAT GCCATTCCAA ATGGAGAGCC 840 CAAGAGTTGC AAACTATCAT TGATAGGGCA CATCTCGAGG GAAAGGAAAC CACTCTCCAA 900 TGCTTATCGC TATATCTCCT GGGAATTTTT CCGGCACAGG GTGTACGACA CACACTGACG 960 AGACCTCCTC GGAGACCTCG GAACAGGAGA GAAAGCAGAA GGCAGCAGTA CGCTGTCGTC 1020 CAGCGTAACT GGGATAAGCA TAAAGGAAGA TGCATCAAGT CCTTGCTAAA TGGAACTGAT 1080 GAGTCGGTAA TGCCAAGCCA AGAAATAATG GTTCCCTACT GGAGAGAAGT AATGACTCAG 1140 CCTAGCCCAA GCTCTTGCAG TGGAGAAGTG ATACAAATGG ATCACTCGCT TGAGAGGGTA 1200 TGGTCTGCTA TTACAGAGCA GGACCTTCGG GCGTCAAGAG TCTCATTATC CTCGTCTCCG 1260 GGGCCTGACG GGATAACTCC AAAATCTGCC AGGGAGGTGC CGTCAGGTAT TATGCTGCGC 1320 ATAATGAACC TAATTCTATG GTGCGGTAAT CTACCACACT CCATACGACT GGCCAGAACC 1380 GTCTTCATCC CGAAGACGGT GACGGCGAAG CGACCGCAAG ACTTTCGTCC AATATCAGTG 1440 CCTTCAGTCC TGGTAAGACA GCTAAATGCA ATATTGGCAA CCCGGTTGAA CTCATCAATC 1500 AATTGGGACC CGCGCCAGCG GGGCTTCTTA CCAACCGACG GATGCGCCGA TAATGCGACG 1560 ATAGTCGACT TAGTCTTGAG GCATAGCCAT AAGCACTTTA GATCTTGCTA CATCGCAAAT 1620 TTAGATGTAA GCAAGGCATT TGATTCTCTA TCACATGCAT CTATATACGA CACCTTACGT 1680 GCTTATGGTG CGCCAAAGGG CTTCGTTGAC TACGTACAGA ACACGTACGA GGGCGGTGGT 1740 ACCAGTCTCA ATGGGGACGG TTGGAGTTCA GAGGAATTCG TCCCTGCTAG AGGAGTGAAG 1800 CAGGGTGACC CTTTGTCTCC TATTCTATTT AACTTGGTAA TGGACAGGTT ACTTAGAACC 1860 TTACCCAGCG AAATTGGTGC CAAAGTCGGA AATGCCATTA CTAACGCGGC CGCGTTTGCA 1920 GATGATTTGG TACTATTTGC GGAAACTCGG ATGGGGCTTC AAGTATTGTT GGACAAGACG 1980 TTGGATTTTC TATCTATCGT CGGCCTCAAA CTTAATGCCG ACAAATGTTT TACCGTTGGC 2040 ATTAAGGGCC AGCCGAAACA GAAGTGTACC GTGTTAGAGG CACAGAGCTT CTACGTAGGC 2100 TCGAGTGAGA TTCCATCACT GAAGCGCACG GACGAGTGGA AGTACTTAGG CATCAACTTC 2160 ACTGCAACCG GGAGGGTTCG ATGCAATCCG GCCGAGGACA TTGGTCCAAA GCTACAAAGA 2220 TTGACAAAGG CCCCCCTCAA ACCACAACAG AGGTTGTTCG CCCTTCGGAC TGTCCTTATC 2280 CCACAGCTCT ACCACAAGTT AGCCCTTGGG AGTGTGGCGA TAGGCGTCCT AAGAAAAACT 2340 GATAAACTTA TAAGATATTA TGTGCGAAGA TGGCTAAATC TTCCGCTGGA TGTGCCGATA 2400 GCATTTGTTC ATGCACCCCC AAAAAGTGGA GGTCTCGGAA TTCCATCACT AAGATGGGTA 2460 GCTCCAATGT TAAGGCTAAG ACGCTTGAGT AACATTAAAT GGCCTCACCT CACGCAAAAC 2520 GAGGTAGCCA GCTCTTTCCT CGAAGCCGAA AAACAACGGG CCCGAGATAG ATTATTAGCT 2580 GAACAAAATG AACTGTTATC GCGTCCGGCA ATAGAAAAAT ATTGGGCGAA CAAGTTGTAC 2640 CTCTCAGTTG ATGGTAGCGG ACTCCGTGAA GGCGGCCATT ATGGCCCGCA ACACGGGTGG 2700 GTTAGTCAAC CCACGCGTTT ATTAACAGGA AAGGAATATA TGGACGGTAT TCGTCTGCGG 2760 ATAAATGCCC TACCCACAAA GTCTCGTACT ACAAGGGGAA GGCACGAATT GGAACGACAG 2820 TGTCGTGCAG GATGTGATGC TCCCGAAACA ACAAACCACA TAATGCAAAA ATGCTACCGC 2880 TCGCATGGGA GGCGGGTAGC TAGACACAAC TGCGTAGTAA ATCGAATCAA GCGGGGACTT 2940 GAGGAGAGAG GCTGCGTGGT CATTGTTGAA CCAAGTCTGC AGTGCGAATC CGGCCTTAAT 3000 AAACCAGACC TGGTGGCACT ACGACAAAAT CACATTGATG TGATCGACAC ACAAATTGTG 3060 ACAGACGGAC ACTCTATGGA TGATGCGCAC CAGCGCAAAA TCAATAGATA CGACAGACCG 3120 GACATACGAA CTGAATTGCG TCGCAGATTC GAAGCCGCAG GTGACATTGA ATTCCATTCT 3180 GCCACCCTGA ACTGGAGGGG GATCTGGAGT GGTCAATCCG TTAAAAGATT GATAGCAAAG 3240 GGTCTCCTCA GCAAATATGA TAGTCATATC ATTAGCGTCC AGGTTATGAG AGGCAGTCTC 3300 GGTTGTTTTA AACAGTTCAT GTACCTGAGC GGGTTTTCCC GAGATTGGAC TTAGCTAAAT 3360 CGTTTGGTTC AAAACATTTG CTTGCTGTCT TGGCATAACA TCAATAAAGG CATAAACATC 3420 GCAAAATAAT GGTTATAATT AAATGGCTAT GAGGATGGTT TTAGTACGTA GGCGTTGCGG 3480 AACTTCGGTT CATATAGAGC AATGAATCGT GCATGCTAGG AAAACTGACC ACACACAGTG 3540 TTGGCAGACC TAGTATCTTT CGAAGATTTC CATACCTCCG CGATCAAAAA AAAAAAAAAA 3600 AAAAAAA 3607 // ID DM33463 standard; DNA; INV; 1736 BP. XX AC U33463; XX DR FLYBASE; FBgn0004905; S-element. XX FT source U33463:37..1772 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..234 FT SO_feature terminal_inverted_repeat ; SO:0000481:1503..1736 FT SO_feature CDS ; SO:0000316:404..1441 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044019; S-element\T" FT /db_xref="REMTREMBL:AAC47095" FT /protein_id="AAC47095.1" FT /translation="MPGKRLAFEVTQLIYYNHQLGKSIPELVEIFSVSRKTVYNILNRX FT XKEGRLEPKSGGGCKTKINKRVDRLIMRKAIANPRISVRSLAQDIREECHLTVSHETVR FT QVILRHRYSSRVARKKPLLSEINIEKRHSFAVSMMDHAEEYWDDVIFCDETKMMLFYND FT GPSRVWRKPLSALETQNIIPTIKFGKLSVMIWGCISSHGVGKLAFIESTMNAVQYLDIL FT KTNLKASAEKFGLFSNNKPNFKFYQDNDPKHKEYNVRNWLLYNCGKVIDTPPQSPDLNP FT IENLWAYLKKKVAKRGPKTRQQLMAAIIEEWEKIPLEYDLKKLIHSMKKRLQLVAKANG FT GHTKY" XX CC Derived from U33463 (g1006788) (Rel. 47, Last updated, Version 5). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1736 BP; 600 A; 287 C; 300 G; 545 T; 4 other; CAGTTTGTCA AGAAACTGTT TACACACCGC AAAATAAGTA GAATTTTTGA CTTTAAAGGC 60 CAAAATTAAG GGTTTTTTGC TTAATTAAAC GCAATTTTTT TATGAAATAT AATTAAACAA 120 TATTTATTTT ACTTATAAAT TAAAAAACAA ATTCAATATA TCAAATATAC AAGAAAATAA 180 ACAACAAATT TCTTGTTTAC ACACTTTTGA GAGTGCCAAG AAACTCTTTA CACAGTTTTG 240 GGTTCCTACT TTGTTTTGCT CTTTTTCTTA GAAACAATCT CATTTTTCCG TTATTTTTGT 300 CTTATGCATT CCTTTTTACA ACGCTTCTAT TGCAATTTTT TCACTTTGCT TGTGAAATTT 360 TGTTGATCTA ACGTGCTTAA AGCGAATTAT TAAATTTAAT GAAATGCCTG GAAAGAGATT 420 GGCTTTTGAA GTGACCCAGC TAATATACTA TAACCACCAG TTGGGAAAAT CTATTCCTGA 480 ATTAGTAGAA ATATTTTCCG TATCCCGTAA AACCGTCTAT AATATTTTAA ATCGTNNNNA 540 AAAAGAGGGC AGGCTTGAAC CTAAGAGTGG TGGTGGGTGT AAAACGAAAA TTAACAAGCG 600 AGTAGACCGC CTTATTATGC GAAAAGCGAT TGCGAACCCC CGAATCTCGG TCAGATCACT 660 TGCTCAGGAT ATCAGGGAAG AATGTCACCT AACTGTATCA CACGAAACTG TGCGCCAAGT 720 CATCCTACGC CATAGGTACT CTTCAAGAGT TGCAAGAAAA AAGCCTTTGC TATCAGAGAT 780 CAATATTGAA AAGCGTCATT CATTCGCTGT GAGCATGATG GATCATGCGG AAGAGTACTG 840 GGATGACGTC ATATTTTGTG ACGAAACAAA AATGATGCTC TTTTATAACG ATGGGCCAAG 900 CAGAGTATGG CGCAAACCGT TGAGTGCGCT AGAAACACAA AATATAATTC CAACAATCAA 960 ATTTGGAAAA TTGTCAGTGA TGATTTGGGG CTGTATTTCC AGCCATGGAG TGGGCAAACT 1020 AGCCTTTATT GAAAGCACTA TGAATGCCGT GCAATATCTA GATATTTTAA AAACAAATTT 1080 GAAGGCCAGT GCAGAAAAAT TTGGTTTGTT TAGCAACAAC AAGCCAAATT TTAAGTTTTA 1140 TCAGGACAAT GATCCCAAAC ATAAAGAGTA CAATGTACGC AACTGGCTAC TCTATAACTG 1200 TGGCAAGGTG ATCGATACGC CCCCTCAGAG TCCTGATCTA AACCCCATTG AAAATTTGTG 1260 GGCCTACTTA AAGAAGAAGG TTGCAAAAAG GGGCCCCAAA ACTCGACAAC AACTCATGGC 1320 TGCGATAATC GAAGAGTGGG AAAAGATCCC GCTTGAATAT GACCTAAAAA AACTCATACA 1380 TTCCATGAAA AAAAGGCTTC AACTTGTAGC CAAAGCCAAT GGGGGTCATA CTAAATACTA 1440 AAACTTTTCA AATATTATCA AAATAATTAA AAAATTTAGG ATTAAACTTA GGTTTAGTGT 1500 TTTGTGTAAA GAGTTTCTTG ACACTCTCAA AAGTGTGTAA ACTTGAAATT TGTTGTTTAT 1560 TTTCTTGTAT ATTTGATATA TTGAATTTGT TTTTTAATTT ATAAGTAAAA TAAATATTGT 1620 TTAATTATAT TTCATAAAAA AATTGCGTTT AATTAAGCGA AAAACCCTTA ATTTTGACCT 1680 TTAAAGTCAA AAATTCTACT TATTTTACGG TGTGTAAACA GTTTCTTGAC AAACTG 1736 // ID SPRINGER standard; DNA; INV; 7546 BP. XX AC AF364549; XX DR FLYBASE; FBgn0003490; springer. XX FT source AF364549:1..7546 FT SO_feature five_prime_LTR ; SO:0000425:1..403 FT SO_feature three_prime_LTR ; SO:0000426:7143..7546 FT SO_feature CDS ; SO:0000316:1058..2422 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044343; springer\gag" FT /db_xref="SPTREMBL:Q967T6" FT /protein_id="AAK52057.1" FT /translation="MSESFRQYRNSKKCASDSESESDDSTENSVRKNTPTNAFTAYKMS FT LETEQIKALIRALQEQALESQRREADLRKTIQDLAGQVAAIQIAPARAEAPPIKVYRPV FT EITGLVPCGETLDAVKCLPDFMGTQETYVSWRQAANAAYHMFRKYEDSSRHYQAVVIIR FT SKVKGPADAVLSSFGTILNFDAIISRLDFTYSDKRPIHVIEQELGTLRQGSLTLLQYYD FT EVEKKLTLLTNKATMSYEASAATVLCEKFRDDALRVFVSGLRRNLTDVLFAAKPKDMPS FT ALALAQEVESNHERYTFATSFARSQEDRDHKQYPKVQERQRAPPQAGSQGSAGKNPHFT FT KQHRAQVHSAPRSDRMARENMPEPMDVDPSLSRMQPSHAPAYPKSKPAASGRSVPPKRQ FT RVNHVAQASDDLDKVYNTAASSAAVKVDDDSILEYDSDTINFLGESPCYPSSDEE" FT SO_feature CDS ; SO:0000316:2434..5475 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044342; springer\pol" FT /db_xref="SPTREMBL:Q967T5" FT /protein_id="AAK52055.1" FT /translation="MKLLIDTGAAKNFIRPFKGLKGVRPVQSPFTIHSIHGVTTITKKC FT FVSIFNLKATFFLLPDLTSFDAIVGLDLLKQAGASLCLASGKLKWGSGAEQIDFHTCPD FT VNFTKVDCSDAPPLIKDAFLKMLGNRKKAFADPNEALPYNTSVVATIRTVDEEPIYAKL FT YPYPMGAADFVNGEIQELLKNGIIQKSKSPYNNPIWVVDKKGTDDAGNKKMRLVLDFRK FT LNERTVPDRYPMPNISMILGNLGKAKYFTTLDLKSGYHQITLAERDREKTAFAVNGGKY FT EFRRLPFGLRNAASIFQRTIDDILREQIGKFCYVYVDDVIIFSEDENDHVKHVDWVLKS FT LYDANMRISAEKSRFFKKSVSFLGFIVTNNGAATDPEKVKAIKEFPEPKNVFEVRSFLG FT LASYYRCFIKDFASIARPISDILKGENGSVSRHRSRSIQVEFSEAQQRAFEKLRNILAS FT EDVILRYPDYKKAFDLTTDASAYGIGAVLSQEGRPITMISRTLSDREVNYATNERELLA FT IVWALAKLRHYLYAVKEINIFTDHQPLTFAVSESNPNAKIKRWKARIDESGARIFYKPG FT RNNLVADALSRQQLNVVEEQEPESCAATIHSELSLTHTIESTDKPVNCFQNQIILEEAR FT SHWKRTFILFGNKRRHSINFSCKQALLEELANIIIPNGVNAFHCDLHTLALIQDDVVRQ FT FPATKFWHCKNRVTDIFAMQERKEILTVEHNRAHRSAQENVKQVLSEYYFPKMTKLASE FT IAANCKTCAKAKYDRHPKKQELGETPVPTHVGEILHIDIFSTDKKYFLTCVDKFSKFAM FT VQPILSRTIEDLKAPLLQLMNVFPKAKTIYCDNEPSLKSQTIVAMLENHFGVSISNAPP FT LHSVSNGQVERFHSTLIELARCLKIDKGISDTVELVLLATARYNMSIHSVINKKPAEVM FT RADPDDPHTDVQEKIKNAQILTRKRENASRQNRVFQVGDKVLVKSNRRLGNKLTPLCEE FT KTIEADLGTTVLIKGRVVHKDNLK" FT SO_feature CDS ; SO:0000316:5870..7147 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044344; springer\env" FT /db_xref="SPTREMBL:Q967T4" FT /protein_id="AAK52056.1" FT /translation="MLDMFPQSHMKKLLSVDIAHLRDMLDSLSIHHRVARSLDFLGTAL FT KVVAGTPDAEDFEKVKFTEARLVDAHNSQIEINTKTQVRINELTDTINKLLKISKSAQI FT DTGHLYETLSTRNRIIVMELQNLMLTITLAKINVVSPNFLDHADLESIWGEEPTNTPIR FT EILSVASVKVLQSLNILHFIIKFPKIIMACNKVTILPVVHHDTVLRLKDNVVAECNREI FT RTVKNCSITPGATFCQLSSVSSCAQELHAGVVAHCDAQQSDLHPITYVDEGIIVINDRP FT ALVRVDNGTAIHIRGTHLITFIESAMVNETVFFNHDMVQNRAPGVANSPVLNISMKHEV FT LSLPYLHRLSEKNLEQIRNFEKDVDGYRLSQIALVAGAIFCALICIGLTWQRTTRAKKS FT TAQLKEVLAQIGSAEGGLNLEEGIVN" XX CC Derived from BACR06P08 by Sue Celniker, 29 March 2001. CC Michael Ashburner, 9-Apr-2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7546 BP; 2131 A; 1909 C; 1712 G; 1794 T; 0 other; AGTTAACTAA GTTAACCGGA CTGATCGTCC GCACACCAGC ACCGGTCAAA CTGCTGACCA 60 AGCATTTGGC CGGAAGCTCA TGCATAGCCG GCAGAAGCTC TGCGCATTGG CAGAGGCCGC 120 TATGATGTTT TTCCCTTTGT TAGCTTATAG TCAGTTTGAT TTTGTATTCA ATAAAGAGCG 180 CATCGCGCCT TCAATCAACT CCAGCTACTG CTGTTATCAT TGAATTGGTT GGCTAGCCTT 240 AAGGGCAGTC AACAACGGAG AGACGTTCTC CCACCATATC TCCCAATCTA GGAGAAGAGG 300 TCTGCGGCAA CCGCCCTGCC TCCCAGTGAC AGAAGAACCC CCCGTTACCT GCAACCTACG 360 CCGGAGACCG CGGCGAGGGA CCTGCACCTT ATATTTAATT AATTGGCACC CAACTCCAGG 420 AACCCACACC ACTACCCTGA ATCATGTAAG TGGGATTCTC AACTTAAACA CTACTCCAAA 480 CTGCGTCTAG AATTTTAAAA TATTTGGGAT GTTTGTGCGA GTTACATAAA TAAATTAAGA 540 AAATCGGCAT TTCCACTACA ATAAACGTTT ATATGTGTTG CGAATTAAGA TTATGTTACT 600 GTTATGAAGT TTAAATATCG AATTTTGATT TGTGGTTGAC TTTGCAATCC ATATTGTGTG 660 CATTTCATTC CGCCTTCGCA CATCCGCGGG ACACTGTCGT TTCATTCGAA ATTTAATTCG 720 CTACATTGGC TTCACAGCCC TTTCAAGCTT TGTTGTTTTT GACCCACTCC ACTTCGCTAC 780 CCGATACTGG CGCATGCATT GCTGTGACAA TTTTGTGCCT TTTATTTATC TCTTTGTCTT 840 TGCTGTGGCA ATTTTTGTCT TTGGATATTT GTTTGCCTTA TTGGAGACCC GCTCCCCGCA 900 GGCCCTTCAC CTTATCGTTA CTTAGCTGGA CAGTGGCTCT GCTCGTTGAG TCTTCGTCCA 960 ATGCCTTCAA AGCGGCGACT CAGCCCCCGC GACCCCCTTG CCGTACTGTT TGGCCCCACG 1020 GGCACAACGG CCTGAGTATT CACATACATA GCTACCCATG AGCGAGTCAT TCCGACAATA 1080 TAGGAATTCT AAAAAGTGCG CTAGCGACTC AGAGTCCGAA AGCGACGATT CGACAGAAAA 1140 CTCTGTACGT AAAAACACCC CAACTAACGC ATTCACTGCA TATAAAATGT CCCTCGAAAC 1200 GGAACAAATT AAAGCTCTCA TAAGGGCATT ACAAGAGCAA GCCTTAGAGA GTCAACGCAG 1260 GGAGGCTGAC TTGCGTAAAA CAATTCAAGA TCTGGCCGGC CAGGTCGCGG CCATACAGAT 1320 TGCCCCTGCC CGGGCAGAAG CTCCCCCAAT CAAAGTTTAC AGACCAGTAG AAATCACCGG 1380 ACTGGTCCCT TGTGGGGAAA CATTGGATGC CGTTAAATGT CTTCCAGACT TTATGGGGAC 1440 ACAGGAGACA TACGTCTCCT GGCGGCAAGC GGCAAATGCC GCTTACCATA TGTTCAGGAA 1500 ATATGAGGAT AGTTCGCGGC ACTATCAAGC TGTGGTCATC ATCAGGAGCA AAGTTAAAGG 1560 CCCTGCTGAT GCAGTTCTGT CGTCCTTTGG GACTATACTG AATTTCGATG CGATCATAAG 1620 TCGCCTCGAT TTCACGTATA GTGACAAACG CCCGATACAC GTTATCGAGC AGGAGCTAGG 1680 CACCCTCAGA CAGGGAAGCC TGACGCTCCT CCAGTATTAT GATGAGGTCG AGAAAAAACT 1740 CACCTTACTC ACCAATAAGG CGACTATGTC TTATGAAGCG TCGGCAGCAA CGGTGCTGTG 1800 TGAGAAGTTC CGAGATGATG CTTTGAGAGT TTTTGTCTCG GGGCTCAGGC GCAACCTCAC 1860 AGACGTGCTA TTCGCGGCAA AGCCTAAGGA CATGCCGTCA GCGCTCGCCC TGGCGCAAGA 1920 AGTAGAGTCC AATCATGAGC GGTACACTTT TGCAACTTCA TTTGCACGAA GCCAAGAGGA 1980 TAGGGACCAC AAGCAATATC CCAAAGTGCA GGAGCGCCAA CGGGCCCCCC CACAAGCCGG 2040 CTCGCAGGGA AGTGCTGGGA AGAACCCGCA CTTTACTAAG CAGCATAGAG CACAGGTGCA 2100 CTCCGCTCCA CGTAGCGACC GAATGGCCCG AGAAAACATG CCAGAACCCA TGGACGTTGA 2160 CCCGTCGTTG TCCAGGATGC AGCCATCTCA CGCCCCGGCT TACCCGAAAT CGAAGCCGGC 2220 CGCGTCTGGC CGTTCGGTCC CACCAAAAAG GCAAAGGGTC AACCATGTTG CCCAGGCCTC 2280 TGATGATTTG GACAAGGTTT ATAACACCGC AGCCTCCAGT GCAGCAGTTA AAGTCGACGA 2340 CGATTCCATC CTAGAGTACG ACTCGGATAC CATTAATTTT TTAGGGGAAA GTCCCTGCTA 2400 CCCGTCATCA GACGAAGAGT AGCGGGGATC GACATGAAAC TACTGATTGA TACGGGCGCG 2460 GCAAAAAATT TTATCCGACC ATTTAAGGGG TTGAAAGGCG TCCGCCCGGT GCAGTCCCCA 2520 TTTACAATCC ATTCGATTCA TGGTGTGACT ACAATAACTA AGAAATGTTT CGTGTCCATT 2580 TTTAATCTTA AAGCTACCTT TTTTTTATTA CCAGACTTGA CCTCCTTTGA CGCGATCGTT 2640 GGCCTAGACC TGTTAAAACA GGCCGGCGCG TCACTTTGCC TAGCTTCCGG CAAGCTCAAA 2700 TGGGGCTCCG GAGCAGAGCA AATTGACTTT CATACTTGCC CCGATGTCAA TTTCACCAAA 2760 GTAGATTGCT CGGACGCACC GCCCTTAATT AAGGATGCTT TTTTAAAAAT GCTCGGGAAT 2820 AGGAAAAAAG CTTTTGCTGA TCCTAATGAG GCTCTTCCTT ACAATACGTC GGTGGTAGCC 2880 ACCATCCGGA CGGTTGATGA GGAGCCCATT TATGCCAAGT TATACCCATA TCCCATGGGA 2940 GCAGCTGACT TCGTCAACGG CGAAATTCAG GAACTGCTTA AAAATGGCAT AATCCAAAAG 3000 TCAAAGTCCC CCTACAATAA CCCAATATGG GTCGTAGACA AAAAGGGCAC TGACGATGCG 3060 GGCAATAAAA AAATGCGCTT GGTGCTGGAC TTTCGAAAAC TTAACGAAAG GACGGTACCA 3120 GACAGATACC CCATGCCAAA TATCTCTATG ATATTGGGGA ATCTCGGCAA GGCCAAATAC 3180 TTCACGACCC TCGATCTGAA GTCTGGCTAC CACCAAATCA CGCTCGCAGA ACGCGACCGT 3240 GAAAAGACAG CGTTCGCAGT AAACGGAGGG AAGTATGAGT TCCGAAGGCT GCCATTCGGA 3300 CTCAGGAATG CTGCAAGCAT CTTCCAAAGA ACAATTGACG ATATTCTGCG AGAGCAGATC 3360 GGAAAGTTCT GCTACGTTTA CGTCGATGAC GTCATCATCT TTTCGGAAGA TGAAAACGAC 3420 CATGTCAAGC ATGTAGATTG GGTTCTGAAG AGCCTGTACG ACGCTAACAT GAGAATATCG 3480 GCAGAAAAGT CACGTTTTTT TAAGAAAAGC GTGAGCTTCC TGGGGTTCAT CGTCACCAAC 3540 AATGGGGCGG CGACTGACCC AGAAAAGGTT AAGGCCATAA AGGAATTTCC GGAACCCAAA 3600 AACGTATTTG AGGTAAGGTC ATTCTTGGGC TTAGCCAGCT ATTATCGTTG CTTTATCAAA 3660 GACTTCGCAT CAATAGCTAG GCCCATTTCA GACATATTGA AGGGCGAGAA CGGTAGTGTT 3720 AGCCGACACA GGTCCAGGAG TATCCAGGTA GAATTTTCCG AAGCGCAACA ACGTGCCTTC 3780 GAAAAGCTAC GCAATATCCT GGCGTCTGAG GACGTCATCC TGAGATACCC TGATTACAAA 3840 AAAGCGTTTG ATCTAACGAC AGACGCTTCG GCCTACGGCA TTGGCGCAGT GCTGTCCCAG 3900 GAGGGACGTC CCATTACAAT GATCTCAAGG ACATTGTCTG ACAGAGAGGT TAACTATGCT 3960 ACCAACGAAA GGGAGCTGTT AGCCATAGTC TGGGCACTGG CTAAGTTGCG GCACTACCTG 4020 TATGCGGTTA AAGAGATAAA CATCTTTACC GATCACCAAC CTCTGACGTT CGCGGTATCG 4080 GAGTCCAATC CGAACGCCAA AATTAAGAGA TGGAAAGCAC GCATCGACGA GTCCGGCGCA 4140 CGAATTTTTT ACAAGCCTGG GAGAAACAAC CTCGTTGCAG ATGCCCTCTC GAGACAACAA 4200 CTCAACGTTG TTGAAGAGCA AGAACCGGAG TCGTGCGCGG CCACGATTCA CAGCGAACTT 4260 TCGCTTACGC ACACGATCGA GTCCACGGAC AAACCCGTGA ATTGCTTCCA GAACCAGATA 4320 ATTTTGGAAG AGGCGCGCTC CCATTGGAAA CGCACTTTTA TATTATTTGG GAATAAGAGG 4380 CGGCACTCGA TCAATTTCTC GTGCAAACAA GCTTTGCTGG AGGAACTCGC CAACATCATT 4440 ATCCCTAATG GTGTAAACGC CTTCCACTGT GATCTTCACA CGCTGGCGCT AATCCAGGAC 4500 GACGTAGTTC GACAGTTTCC AGCCACGAAA TTCTGGCATT GTAAGAATAG GGTCACCGAC 4560 ATCTTCGCGA TGCAGGAGAG AAAAGAAATC CTCACCGTCG AGCACAACAG AGCTCACAGG 4620 TCGGCCCAAG AAAACGTGAA ACAAGTACTC TCCGAGTACT ACTTCCCGAA AATGACCAAA 4680 TTGGCGAGCG AAATAGCAGC CAATTGCAAA ACTTGCGCAA AGGCGAAGTA TGACAGACAT 4740 CCGAAGAAGC AGGAGCTCGG TGAGACACCA GTCCCGACCC ACGTAGGAGA AATATTGCAC 4800 ATCGATATTT TCTCAACGGA TAAAAAATAC TTTCTCACCT GTGTTGACAA GTTTTCTAAA 4860 TTCGCCATGG TACAGCCGAT TCTGTCTAGA ACCATAGAAG ATTTGAAAGC ACCCCTTTTA 4920 CAACTTATGA ATGTTTTCCC CAAAGCCAAA ACCATCTACT GCGACAATGA ACCATCATTG 4980 AAATCGCAGA CAATAGTGGC TATGCTGGAA AACCATTTTG GCGTCAGCAT TTCGAATGCA 5040 CCGCCCCTAC ATAGCGTCTC AAACGGACAG GTGGAACGAT TCCACAGCAC GTTAATTGAG 5100 CTCGCCAGAT GCCTAAAAAT CGACAAAGGC ATAAGTGACA CAGTGGAATT GGTCTTGCTG 5160 GCCACAGCCA GATATAACAT GTCCATCCAC TCCGTCATCA ATAAAAAACC GGCCGAAGTC 5220 ATGCGGGCAG ATCCGGACGA TCCACATACC GATGTCCAAG AAAAAATCAA AAACGCCCAG 5280 ATTTTGACAA GAAAACGAGA GAACGCTTCT CGGCAGAACA GAGTGTTCCA GGTCGGCGAC 5340 AAAGTCCTAG TAAAGTCAAA CAGACGATTA GGCAACAAAC TTACTCCTTT ATGTGAGGAG 5400 AAGACCATCG AGGCAGACTT GGGGACCACA GTCCTTATTA AAGGGAGGGT GGTCCATAAA 5460 GACAACCTCA AGTGACCCAA GCAGAGCCTA GCCGCGGCTC CCTCGGAGGC ACACTTTTAT 5520 TCCTCCAATT TGTAGCCACT CGGCATAAGT TTTTTCATTG TTTTTATAGC CGCTTGGCAT 5580 AAGTTTTTTA TTTTTTAGCC ACTTGGCATA TTTTTTATAT ATTTTCGCTA TTATTGGTGG 5640 TGGGCAACTC CATTCCGAAC AAGTAATAAT TTATCACACA CGTTACAGGT CGCTCCCAAC 5700 CCTTCTTCTT TGTTTCCTGG CCACGACATC GGCCCACATT ACTGACTATT CCCGAGCGAA 5760 TTACATTCCC GTCATTGACG GTAAAGTCTT AGTCTGGGAG GAATTCGCCT ATGTCAGACA 5820 CTCGGCTAAC CTCTCCGAGT ATAGGCGGGT AATTGACGAA ACCGACAGCA TGCTCGATAT 5880 GTTCCCCCAG TCCCATATGA AGAAGCTCCT GAGCGTTGAT ATCGCTCACC TCCGTGACAT 5940 GCTTGATTCT TTGAGCATCC ATCACAGAGT GGCAAGGAGC CTAGACTTCT TGGGAACTGC 6000 GTTAAAGGTT GTCGCAGGGA CACCTGACGC GGAAGACTTC GAGAAAGTCA AGTTCACTGA 6060 AGCGCGGCTT GTTGATGCAC ACAATAGCCA AATCGAAATA AACACCAAAA CACAAGTTCG 6120 AATTAACGAA CTCACTGATA CCATAAATAA ACTTTTAAAA ATTTCCAAAA GCGCTCAGAT 6180 TGATACAGGT CACCTGTATG AAACGCTTTC TACTCGCAAC AGAATCATTG TAATGGAATT 6240 GCAAAACTTA ATGCTCACTA TAACCCTCGC TAAAATTAAC GTAGTGAGTC CAAACTTCTT 6300 GGACCACGCA GATCTGGAGA GTATTTGGGG CGAGGAGCCC ACCAACACCC CCATAAGGGA 6360 GATTTTGTCC GTTGCGTCTG TAAAAGTCCT ACAATCCCTT AACATCTTAC ACTTTATTAT 6420 TAAATTCCCC AAGATTATCA TGGCGTGCAA CAAAGTCACT ATCCTTCCAG TGGTACACCA 6480 CGATACGGTG TTAAGGTTGA AAGATAATGT GGTAGCAGAG TGCAACAGAG AAATACGCAC 6540 AGTAAAGAAT TGCTCCATAA CACCAGGGGC AACATTTTGC CAGTTATCTT CAGTGAGCTC 6600 GTGTGCGCAG GAGCTCCACG CTGGGGTCGT AGCACATTGC GACGCACAGC AGAGTGATCT 6660 ACATCCGATC ACCTACGTCG ACGAAGGAAT AATCGTCATC AATGACAGAC CAGCACTCGT 6720 GCGTGTGGAC AATGGAACGG CCATCCACAT TAGAGGCACG CACCTCATAA CATTCATTGA 6780 GAGTGCCATG GTCAACGAGA CCGTCTTCTT TAATCATGAC ATGGTCCAGA ATAGGGCGCC 6840 GGGAGTGGCT AATTCCCCAG TCCTTAATAT CTCGATGAAA CACGAGGTCC TGAGCCTCCC 6900 ATACCTTCAC CGTTTAAGTG AAAAGAACTT GGAGCAAATC AGGAACTTCG AGAAGGACGT 6960 CGACGGATAC CGACTAAGTC AGATAGCGTT AGTTGCGGGA GCAATTTTCT GCGCTCTTAT 7020 CTGCATCGGT TTAACCTGGC AGCGAACCAC TAGGGCCAAG AAATCTACAG CCCAACTGAA 7080 GGAAGTTCTC GCCCAAATAG GGTCAGCCGA GGGCGGCCTT AATCTTGAGG AGGGAATAGT 7140 TAACTAAGTT AACCGGACTG ATCGTCCGCA CACCAGCACC GGTCAAACTG CTGACCAAGC 7200 ATTTGGCCGG AAGCTCATGC ATAGCCGGCA GAAGCTCTGC GCATTGGCAG AGGCCGCTAT 7260 GATGTTTTTC CCTTTGTTAG CTTATAGTCA GTTTGATTTT GTATTCAATA AAGAGCGCAT 7320 CGCGCCTTCA ATCAACTCCA GCTACTGCTG TTATCATTGA ATTGGTTGGC TAGCCTTAAG 7380 GGCAGTCAAC AACGGAGAGA CGTTCTCCCA CCATATCTCC CAATCTAGGA GAAGAGGTCT 7440 GCGGCAACCG CCCTGCCTCC CAGTGACAGA AGAACCCCCC GTTACCTGCA ACCTACGCCG 7500 GAGACCGCGG CGAGGGACCT GCACCTTATA TTTAATTAAT TTAACT 7546 // ID TARTC standard; DNA; INV; 11124 BP. XX AC AY600955; XX DR FLYBASE; FBgn0004904; TART-C. XX FT source AY600955:1..11124 FT SO_feature non_LTR_retrotransposon ; SO:0000189 CC telomeric retrotransposon FT SO_feature direct_repeat ; SO:0000314:1..331 FT SO_feature direct_repeat ; SO:0000314:10383..10713 FT SO_feature five_prime_UTR ; SO:0000204:1..205 FT SO_feature three_prime_UTR ; SO:0000205:6629..11102 FT SO_feature non_LTR_retrotransposon_polymeric_tract ; FT SO:0000433:11103..11124 CC derived from polyA tail of RNA transposition intermediate FT SO_feature CDS ; SO:0000316:206..3349 FT SO_feature start_codon ; SO:0000318:1..3 FT /product="gag protein" FT /protein_id="AAT12844.1" FT /db_xref="GI:47231635" FT /translation="MDGHNGDINEGWATVLSISSDDSNQLSSPPSIIVSSLDTTPTSN FT ETTIVRRSLHNPKADMKSYRFENIVLNENKNTILPDPLFVDKCGNTANTTEANEKKPA FT NSPFPISIIKNLSTSSPLTHVDTPTQEDDASAFNTLKAAKTARIIFPTHTQIKPAKPS FT PPSKELSTNSAPKTLSYTDKITVTQKNLPDKTHVDRPTQDDDINATKASKTAKIISTQ FT LHLRETKPTQPAKDPSPRTQKPIANKAAETLTHTDKLIASQNLVPAKTHINSPTQYND FT TNATNALKTAKINFSSHSHQSEIKPTQSAKNISPLTQKQFTSESAGTHTHTDKHKNTA FT SQNLFSAKTHINSPTQHNDTSAATASKTAKLILSPHSHLSETKPTQPALSPSPLSQKQ FT ITSIAAKTLTHTNKHTASQNFIPAKTHINIPTQYNDTNATKALKTAKAASPSHTYSRQ FT TKPIKPAINALHAAQDTNPSPAISAVTYTDKPTATQNIFPVKTFAELIRENAKRSPTP FT IENPPQAKHDSAALGRPPTAARKNLNKTLISPKTPGKRRGDCLDEGLLQTSNKKVRIR FT DDFSDDDLGVTNLLSETPLFKSKAAIKIRQDSRRESLQKSAEMDTAPAISPSNAAADP FT DLPPWKTVPASRKPPSIFLSNIQQIIPLIEKLNYKAGVNSFTTKSELGNNIRIQAKTM FT DAYNAIQNVLLEANIPLHSHQPKSAKGFQIVIRHLHQSTPTKWIESQLQDIGIATKFI FT RAMQFRDTRNPMRIHEVEVVPKADGSHLKVLLIKSLGGQTVKVERKRVSKDPTQCHRC FT QCFGHTKNYCRNPFKCMKCGQLHASVSCTKPKNLPATCANCNGSHVSSYKGCPVFQEA FT KQRLSINKIQSLHSQPTHLQTPRNKHPYPKPTHIQTPLNKQPYTHPLPRTLVNNTKLP FT AKRIQGKKISQRNLSINKRLNRIRTLDRKPRNETSPPTTSKKVLASLEESRKNPNSAL FT NPANTHLTHFRPPPLAQNIPNDESKELSGEQYLLNRIEGMEKKLNNLLEIVTRLLSQG FT KDCPKSPKNPFRDPIFV" FT SO_feature CDS ; SO:0000316:3350..6628 FT SO_feature start_codon ; SO:0000318:1..3 FT /product="pol protein" FT /protein_id="AAT12845.1" FT /db_xref="GI:47231636" FT /translation="MLFLVTSEVTFPMTRECNRDILKIAFWNAGGINNKIDELKLFIL FT NIDAHIIIVTETRLDNNSTKLELPGYFTYLAQNPASSKRGGVATIVNSSLRHMALEPI FT EKECIQSAPIVLLPENNRRSEMIVIASVYCPPSLSWSPHHFTDVLNFAEKTMGGQTKL FT ILCGDWNAKHRQWGCIRACQRGAALYDAIQADSMAEIVATGSATHFPHDTRKSPSAID FT FSICKRLGRYEKRISSSAHLSSDHLPILLEINLDIKTISLQKQNNNILKKTTNIELFK FT NVLERKILLNTEIRVAEDINDAINIFIKNIKDSAAESTPSPRIPDNHRRRYGQANRNS FT HTLTLDENTSRLLEEKRIQSRIFKATRTNEDKTKLKAAENRLKKVIKILREKRINEQI FT EGIDTNNPDRMRKIWRLLSEGKKMNQPNFPLKLETKKGPKWTKTIKETTEAFVSHLEG FT RFKPNKIVPDYHIDKVNTGLRIIKESMLTERHNLNKNPHNQPITLNELNEEIKNLKNS FT KAPGKDLITNQLIKTLPTKATLYLILIYNSILRLGYYPEAWKHAQVKMILKPGKSSNE FT PKSYRPISLLSGLSKMFERLLLKRLFRVDLFKKAIPLHQFGFRKEHGTEQQIARVTQF FT ILEAFERKEYCSAVFLDISEAFDRVWHEGLLLKLAKILPYNLYIILESYLTNRTFEVK FT DQAGETSRTGQIGAGVPQGSNLGPLLYSIFSSDMPLPYIYRPSPTQRIMLSTYADDTI FT VLSSDTLATAATRNNENYLKTFSDWADKWGISVNAAKTGHVIFTLKNDLPTNSMNVKI FT KGQTIKKESKQSYLGVTLDSKLTLSSHVTKLLGKYSTAYRKLTWILNGRSKLPTKTKI FT LILKSVLSPIWQYAIAAWGPLVTDAQIRRVQVEENRKIRDICRAGRYTRNQTIRDLFG FT VKTVEEFYQQAMHRFSETIKSHPNIAVRRILSRHYIPNRLERSRQRYFKMTNDHITQK FT QTGLALSPKLLKIPDIDDCRTVKKRSEREKIRQMHLTELPTLLRLEEEEEELKRIKKQ FT EEREKRERENQKWPPDRWCELEINRYNKQYRKGDLTRQEVIEKFRGQPLNVQRIILPD FT YEGD" XX CC A PNTR (Perfect Non-Terminal Repeat) is a perfect direct repeat partially CC overlapping the UTRs (annotated as "direct_repeat ; SO:0000314"). CC Distinguishing characteristics of PNTR’s are that the 3’ repeat terminates CC upstream of the 3’ end of the element and that the 5’ PNTR extends a short CC distance into ORF-1. The 5’ ends of TART-C elements are variable and the CC minimal size for functionality has not been determined. Thus, this CC canonical sequence, AY600955, may be 5’ truncated within the 5’ PNTR (M-L CC Pardue, 2009). XX CC Derived from AY600955, Michael Ashburner, 6-Apr-2004; updated Jan-2009 as CC per Mary-Lou Pardue and Greg DeBaryshe. CC Any changes to original sequence record are annotated in an FT line. XX // ID AY561850 standard; DNA; INV; 13424 BP. XX AC AY561850; XX DR FLYBASE; FBgn0004904; TART-A. XX FT source AY561850:1..13424 FT SO_feature non_LTR_retrotransposon ; SO:0000189 CC telomeric retrotransposon FT SO_feature direct_repeat ; SO:0000314:1..1850 FT SO_feature direct_repeat ; SO:0000314:11179..13028 FT SO_feature five_prime_UTR ; SO:0000204:1..1759 FT SO_feature three_prime_UTR ; SO:0000205:7953..13411 FT SO_feature non_LTR_retrotransposon_polymeric_tract ; FT SO:0000433:13412..13424 CC derived from polyA tail of RNA transposition intermediate FT SO_feature CDS ; SO:0000316:1760..4672 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /db_xref=”GI:45594384” FT /protein_id="AAS68533.1" FT /translation="MDGQNVNQSGGWASVLSISSDDGNCSSSPPSAIVSSLDTTPTSN FT ETTIVRRSLYQTNADMKSYDFENIVLNENKNTILPDPLFVDKCGSTANTTEANEKQPA FT DSPFPISISKNFSTSSPLTHVDTPTQEDDASAFNTLKAAKTARIIFPTHTQIEPAKPS FT PPSKELSSNSAPKTLSYTDKITATQKNFPTKTHVDTPTQDDDTNATKASKTAQIDSSH FT SQLHETKPTQPAKNPSPLTQKLTTNKTAKTHTHTDKPTASQNLFPTKTHINSPTQYND FT TNASTASKDGKINLSSHSHLRETKPTQPAKNPSPLSQKQITSIAANTLTHTNKHTASQ FT NFIPAKTHINIPTQYNDTNATKALKTAKPASPSHTYSRQTKPIKPAINALHPAQDTNP FT SPAISAVTYTDKPTATQNIFPAKTFAELVRENAKRSQTAMQNPPHAKHDSAALGRLPS FT AARKNLTKTLSSPKTPGKRRGDCLDEGLLQTSNKKVRIRDDFSDDDLGVTNLLSETPI FT FKSKVAIKIRQDSRRESLQKSVEMDTAPAISPSNTAAEPDLPPWKTVPASRKPPSIFL FT SNIQQIIPLIEKLNYKAGVNSFTTKSELGNNIRIQAKTMDAYKAIQNVLLGANIPLHS FT HQPKSAKGFQIVIRHLHQSTPTKWIESQLQDIGIATKFIRAMQFRDTRNPMRIHEVEV FT VPKADGSHLKVLLLKSLGGQTVKVERKRVSKDPTQCHRCQCFGHTKNYCRNPFKCMKC FT GQLHATVSCTKPKNLPATCANCNGSHVSSYKGCPAFQEAKQRLSINKIQSLHSQPTHL FT QTPRNKHPYPKPTHFQTPRNKQSYTHPPPRTTVNNTKLPAKRIQGKKLSQRNISINKR FT LNRIRAFDKKPRKETSPPTTSKKVLASLEESSKNPNSVLNPANTHLTHFCPPPITQDI FT PNDEPTEPSQEQYLLNRIEGMEKKLNNLLEIVTRLLNQGRECPKSPKNPFRDPILI" FT SO_feature CDS ; SO:0000316:4710..7952 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /db_xref=”GI:45594385” FT /protein_id="AAS68534.1" FT /translation="MTRACNRDILKIAFWNAGGINNKIDELKLFILNIDAHIVIVTET FT RLDNKSTKLELPGYFTYLAQNPVSSKRGGVATIVNSSIRHMALEPIEKECIQSAPIVL FT LPENNRRSEMIVIASVYCPPSLSWSPHHFTDVLNFAEKTLGGQTKFILCGDWNAKHRQ FT WGCTRACQRGTALYEAVQADPMAEIIATGCATHFPHDTRKNPSAIDFSICKGLGRLEK FT RISSSADLSSDHLPILLEINLDTSTLFLQKQNNNILKKTTNIELFKTVLERKILLNTE FT IRVAEDINDAINIFIKNIKDSADESTPSPRIPDNLRRMHGQANRNSHTLTLDENTSRL FT LEEKRILSRIFKATRTDEDKAKLKAAENRLKKAVKILREKRINKQIEGIDTKNPDRMR FT KMWRLLDEGKKTNQPNFPLKLETKRGPKWTKTIKETTEAFVSHLEGRFKPNNNVPDYH FT INTVNSGLRTIKESMLTERYDVNKNPCNQPITLKELNDEIKNLKNSKAPGKDLITNQL FT IKTLPTKATLYLILIYNSILRIGYYPDAWKHAQVKMILKPGKSVNDPKSYRPISLLSG FT LSKMFERLLLKRLFRVDLFKKAIPLHQFGFRKEHGTEQQIARVTQFILEAFERKEYCS FT AVFLDISEAFDRVWHEGLLLKLAKILPYNLYIILESYLTNRTFEVKDQAGETSRAGQI FT GAGVPQGSNLGPILYSIFSSDMPLPHIYHPSPTERIMLSTYADDTIVLSSDILATAAT FT RNNENYLKTFSDWADKWGISVNAAKTGHVIYTLKNDIPTNLKTMKIKGQAIKKESKQS FT YLGVILDSKLTLSPHVTKVVGKYLTAYRKMSWILNERSKLPTNTKMLILKSVLSPIWQ FT YAIAAWGPLVTDAQIRRIQVEENRKMRDICRAGRYTKNQTIRDRYCVKTVEEFYQQAV FT HRFSETTKSHPNVAVRRIFSRHYIPNRLERSRQRYLKMTMDHITQKQTGLTLSPKLLK FT IPDLDDCRTLKKRSEREKIRQTHLTELPTLLRLEEEEAELKRIKKQEERERRERENQK FT WPPDRWCELEINRYNKKYRNGDLTRQEIIEKFRGQPLNVQRIILPDYEGD" XX CC A PNTR (Perfect Non-Terminal Repeat) is a perfect direct repeat partially CC overlapping the UTRs (annotated as "direct_repeat ; SO:0000314"). CC Distinguishing characteristics of PNTR’s are that the 3’ repeat terminates CC upstream of the 3’ end of the element and that the 5’ PNTR extends a short CC distance into ORF-1. The 5’ ends of TART-A elements are variable and the CC minimal size for functionality has not been determined. Thus, this CC canonical sequence, AY561850, may be 5’ truncated within the 5’ PNTR (M-L CC Pardue, 2009). XX CC Derived from AY561850, Michael Ashburner, 6-Apr-2004; updated Jan-2009 as CC per Mary-Lou Pardue and Greg DeBaryshe. CC Any changes to original sequence record are annotated in an FT line. XX // ID DM14101 standard; DNA; INV; 10654 BP. XX AC U14101; XX DR FLYBASE; FBgn0004904; TART-B. XX FT source U14101:1..10654 FT SO_feature non_LTR_retrotransposon ; SO:0000189 CC telomeric retrotransposon FT SO_feature direct_repeat ; SO:0000314:1..1046 FT SO_feature direct_repeat ; SO:0000314:9031..10076 FT SO_feature five_prime_UTR ; SO:0000204:1..961 FT SO_feature three_prime_UTR ; SO:0000205: 7386..10637 FT SO_feature polyA_signal_sequence ; SO:0000551:10479..10484 FT SO_feature polyA_signal_sequence ; SO:0000551:10600..10605 FT SO_feature non_LTR_retrotransposon_polymeric_tract ; FT SO:0000433:10638..10654 CC derived from polyA tail of RNA transposition intermediate FT SO_feature CDS ; SO:0000316:962..4093 FT SO_feature start_codon ; SO:0000318:1..3 FT /product="gag protein" FT /db_xref="FLYBASE:FBgn0014071; TART-element\ORF1" FT /db_xref="SPTREMBL:Q23999" FT /db_xref=”GI:603663” FT /protein_id="AAC46493.1" FT /translation="MDGHNGDQSEGWATVLSISSDDSNSLSSPPSIIVSSLDTTPTSHE FT TTIVRRSLYQTNADMKSYDFENIVLNENKNTILPDPLFVDKCGSTANTTEANEKKPANS FT PFPISISKNFSTSSPLTHVDTPTQEDDASAFNTLKAAKTARIIFPTHTHIKPTKPSPPS FT KELSTNSALKTLSYTDKITGTQKNLPDKTHVDTPTQDDDINATKASKTAKIISTQTHLG FT ETKPIQPAKDPSPRTQKPIAHKADETLTHTDKLTASQNLVPAKTHINTPTQYNDTNATN FT ALKTAKINFSSHSHQSEIKPTQSAKNISPLTQKQFTSESAGTHTHTDKHKNTASQNLFS FT AKTHINSPTQHNYTSAATASKTAKLILSPHSHLSETKPTQPALSPSPLSQKQITSIAAK FT TLTHTNKHTASQNFIPAKTHINIPTQYNDTNATKALKTAKAASPSHTYSRQTKPIKSAI FT NALHPAQDTNPSPAISAVTYTDKPTATQNIFPVKTFAELVRENAKRLPTPMQNSHQAKN FT DSAALGRPPTAARKNLNKTLISPKTPGKRRGDCLDEGLLQTSNKKVRIRDDFSDDDLGV FT TNLLSETPLFKSKAAIKIRQDSRRDSLQKSAEMDTAPAISPSNTAADSDLPPWKTVPAS FT RKPPSIFLSNIQQIIPLIEKLNYKAGVNSFTTKSELGNNIRIQAKTMDAHNAIQNVLLE FT ANIPLHSHQPKSAKGFQIVIRHLHQSTPTKWIESQLQDIGIATKFIRAMQFRDTRNPMR FT IHEVEVVPKADGSHLKVLLLKSLGGQTVKVERKRVSKDPTQCHRCQCFGHTKNYCRNPF FT KCMKCGQLHATVSCTKPKNLPATCANCNGSHVSSYKGCPAFQEAKQRLSINKIQSLHSQ FT PTHLQTPRNKHPYPKPTHLQTPRNKQPYTHPLPRTSVNNTKLPAKRIQGKKISQRNLSI FT NKRLHRMKKPRKETSPPTTSKKVLASLEESRKNPNSVLNPANTHLTHFRPPPLAQNIPN FT DEPKELSGEQYLLNRIEGMEKKINNLLEIVTRLLRQGKDCPKSPKNPFRDPIFV" FT SO_feature CDS ; SO:0000316:4131..7385 FT SO_feature start_codon ; SO:0000318:1..3 FT /product="pol protein" FT /db_xref="FLYBASE:FBgn0014072; TART-element\ORF2 FT /db_xref="REMTREMBL:AAC46494" FT /db_xref=”GI:603664” FT /protein_id="AAC46494.1" FT /translation="MTRADNRDILKIAFWNAGGINNKIDELKLFILNIDAHIIIVTETR FT LDNNSTKLELPGYFTYLAQNPVSSKRGGVATIVNSSIRHMALKPIEKECIQSAPIVLLP FT ENNRRSEMIVIASVYCPPSLRWLPHHFTDVLNFAEKTLGGQTKFILCGDWNAKHRQWGC FT TRACQRGTALYEAVQADSTAEIIATGCATHFPHDTRKNPSAIDFSICKGLGRFEKRISS FT GADLSSDHLPILLEINLDTNTLFLQKQNNNILKKNTNIELFKKVLERKILLNTEIRVAE FT DINDAISTFMKNIKDSAAESTPSPRIRDNPRRRHRQANRNSHTLALDENTSRLLEEKRI FT LSRVFKATKNYEDKAKLKAAENRLKKAIKILRENRINEQVEGIDTSNPDRMRKMWKLLD FT EGKRTNQPNFPLKLETQKGPKWTKTIKETTETFVSHLEGRFKPNNNVPDYHIDRVNTGL FT RIIKESMLTERHNLNKNPHNQPITLKELNDEIKNLKNSKAPGKDLITNQLIKTLPTKAT FT LYLILIYNSILRLGYYPEAWKHAQVKMILKPGKSANEPRSYRPISLLSGLSKIFERLLL FT KRLFKVDLFKKAIPLHQFGFRKEHGSEQQIARVTQFILEAFERKEYCSAVFLDISEAFD FT RVWHEGLLLKLAKILPYNLYIILESYLTNRTFEVKDQAGETSRTGQIGAGVPQGSNLGP FT LLYSIFSSDMPLPYIYRPSPTERIMLSTYADDTIVLSSDTLATAATRNNENYLKSFSDW FT ADKWGISVNAAKTGHVIFTLKNDLPTSLRTMKIKGQVIKIESKQSYLGVILDSKLTLSS FT HVTKLMGKYTTAYRKMTWILNRRSKLPTKTKMLILKSVLSPIWQYAIAAWGPLVTDAQI FT RRIQVEENRKMRDICRAGRYTSNQTIRDRYGIKTVEEFYQQALHRFSETIKSHPNIAVR FT RIFTRHYIPNRLERSRQRYLKMTNEHITQKQTGQTLSPKLLKIPDLNDCRTLKKRNERD FT KIRQTHLIELPTLLRLEEEEEELRRIKKQEERERREKETQKWPPDRWCELEINLYNKQY FT RRGDLTRQEIIQKFRGQPLNVQRIILPDYKGDQEHN" XX CC A PNTR (Perfect Non-Terminal Repeat) is a perfect direct repeat partially CC overlapping the UTRs (annotated as "direct_repeat ; SO:0000314"). CC Distinguishing characteristics of PNTR’s are that the 3’ repeat terminates CC upstream of the 3’ end of the element and that the 5’ PNTR extends a short CC distance into ORF-1. The 5’ ends of TART-B elements are variable and the CC minimal size for functionality has not been determined. Thus, this CC canonical sequence, U14101, may be 5’ truncated within the 5’ PNTR (M-L CC Pardue, 2009). XX CC Derived from U14101 (g603662) (Rel. 42, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 20-Aug-1997; updated Jan-2009 as per CC Mary-Lou Pardue and Greg DeBaryshe. CC Any changes to original sequence record are annotated in an FT line. XX // ID TIRANT standard; DNA; INV; 8526 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0004082; Tirant. XX SY synonym: prygun XX FT source X93507:1..2484 FT SO_feature five_prime_LTR ; SO:0000425:1..417 FT SO_feature three_prime_LTR ; SO:0000426:8109..8526 FT SO_feature CDS ; SO:0000316:1866..2999 FT SO_feature CDS ; SO:0000316:3239..6505 FT SO_feature CDS ; SO:0000316:6683..8146 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. CC This replaces that from X93507 in versions previous to 4.8. XX SQ Sequence 8526 BP; 2961 A; 2097 C; 1384 G; 2084 T; 0 other; GGAGTTACCA CCCCACCCCC TAAACCCCCA CGCCTCTAAA CAAATCATCG GACACTCAAC 60 CGGGAAGACG GCAACTGGAA CACCGCATCC GGCCGAATGC TGACATTCCG GCCGAATGCT 120 GACATTACAC AAAAGTCGCA CTGCAACATT GTCCCCAGCT AGCCAGCCAC ATGCCGAGTC 180 GGCATGTTCA TTATGCTTAC AATTAAGAAC CTATGTACTT ATGTATAAGA CGAAAACGGA 240 GGACTCGAGT AGCCACTCTC TGACAATAAA CTTGATACTG ATTTTGAACT TCAAGAAAGT 300 CAGTCGTATT CTTTATTGGA AATCTTCACA CTACAACTAT CTGCTGAAAC TTAAAAACCT 360 TCATACATTT ACACATCATA TCTTCACAAA AGGCTCCACC CTCGATCACG GACTTAACTG 420 GCGCAGCCGG TAGGATGTCC TACCTATTAA TAATTACCTA CCTGTAAGTA AACATGTAAG 480 AAACGAAACA AACTATATGC AAGATGTCGA CTGAAAGTGA CTAGGAACAA ATTTTTATAA 540 AACAAAATTG AAGTTGTGAA GTACCAAATG AAACTCAAAC ATATATTCAA ACACAGGAAA 600 AAAAAAGAGA GAGGAAAAAT GTAAAATAAA TAAATATACA AAAAAAAGTG CAAGTGTACC 660 GTACTGCCGC GCTGACGTGG AATCTATCGC TGATCATCAC GCCATCGGTA TGTCCATACT 720 CTGCCGAACG TCATAATTTT TTTAAAAAAG TGCAAGTGTA CCGTACTGCC GCACTGACGT 780 GGAATCTATC GCTGATCATC ACGCCATCGG TATGTCCATA CTCTGCCGAA CGTCATAATT 840 TTTATAAAAA AGTGCAAGTG TACCGTACTG CCGCGCTGAC GTGGAATCTA TCGCTGATCA 900 TCACGCCATC GGTATGTCCA TACTCTGCCA AACGTCATAA TTTTTATAAA AAAGTGCAAG 960 TGTACCGTAC TGCCGCGCTG ACGTGGAATC TATCGCTGAT CATCACGCCA TCGGTATGTC 1020 CATACTCTGC CAAACGTCAT AATTTTTATA AAAAAAGTGC AAGTGTACCG TACTGCTGCG 1080 CTGACGTGGA ATCTATCGCT GATCACCACG CCATCGGTAT GTCCATACTC TGCCAAACGT 1140 CATAAGTTTT TATAAAAAAA AAGAGTGCAA GTGTACCGTA CTGCCGCGCT GACGTGGAAT 1200 CTATCGCTGA TCATCACGTC ATCGGCACTT ACATACGCTG GCCAACGCAT CGCCAAAGCC 1260 TCTATATACA CTTATATATG TGAGCATACA ATATCAACTA CAATCCAATA CATCCACGTA 1320 CTGTACCGCC TCGTTGGCAT GGAATCAAAC GCTGATCACC ATGCCACCGT GGTAAACAAA 1380 CAAAGCACCA AAGCCTCTCT AATACATTGT ACACTCAAAA CGCACACTGC CATACGTCGG 1440 CGAAAAATCA AAACATAAGC AAAAATCATT TCAAACCAAG CGAGGCTCAT TCTGCGTACC 1500 ACAACGACAA CGACACTGCA TGTGTAGTGG CGCACCCATG TCTGGGTAGC CGAGGTAAGG 1560 GGAAAACGCT TGAGTATCGT CAAGTGTTCT TGCCTTTCAC TCTTCTACAA TGGGTTGCTA 1620 CGCTCATGTA TTGCACATTC AAAATAACCA AAACAAATGT ACTAAAGAAG TCGACATATA 1680 CAGATATATT TTGTTTCCTT TCATTGTGTA ATTTTGTATA TCAAACAAAT ACTAATACCA 1740 ATCACATTGC AGAATATAAA AGGGAAAATA TAAAGCCAAA GACAGACACC CATACACTCT 1800 AGTAAACAAG AAATTTGTTC ATTATTTTTC AATCATACAT AATATACTAA GTAACCTCAA 1860 ATTTAATGTC AAAAAAGTTC GTTTACAACC TTAGGAAAAC TACACGTTCA GTTGTTGGAG 1920 TTCCACCAAA CACTAATAGG CCCCCACATC CCGTTAGACG TCCTGACTCC CTTCTCCCGA 1980 TTTCGGAAGA ACCCAAATCA ATATCTTCCC AAACCCCCAA TATGGACTCG GGAAACGATT 2040 CTGCCCGCCC CACTCCATCC CCTCTGGCGC CCACTGTCAG TGGTATTAGC TCCTTAATTT 2100 CAACTACGTT CAAGCCTAAA GATATCATGG CATTTGTTGA GCATTTGCCA ACCTTTGATG 2160 GTACACCTCG TCTATTGGAC AGGTTTATCA CTAGCGTAGA AGAAATCCTG ATGCTCATCA 2220 GGGGAGCTGA CCAAACACCG TATGGCCTGC TTACTCTGAG GACCATCAGG AACAAAATCA 2280 TTGATAGGGC CGACGAAGCC TTGGAACTGG CAAATACCCC CTTGGTTTGG GATGAGATTA 2340 AAAGCAATCT CATCCGCCTC TACTCGAGCA AGAAAAGCGA GGCCAACTTG TTAAGCGAGC 2400 TTAACACATT TTCGGACAAC CTGACCTTGG GCCAACTGTT CTTTGGTATA TCAAAGGTGA 2460 GAAGCCAACT CTTCTCCATA CTCAAAAACA GCGAACACAA CAACACTGTT GTAGATGCAA 2520 AAAAGGTTGT CTACAACGAG GTTTGTCTCA ATGCTTTTAT GACTGGTTTG AAGGAACCTC 2580 TCAAGACTTT CGTCAGGATA AAGTCCCCTT CTACACTTGA ACAGGCGTAC GAGCAATGCC 2640 AAATAGAGCA GACCTTATAT AGGGCACAAA ACAAGCGAAC CAACAGACCA GAGCAGGGAC 2700 CCAATGGATC AGACAATAAA ACCTACCGAA ATAGCTACGA CAGCAATTAC CGCAGCGGAC 2760 GTAACGACCG AAATGACCGT AGGGGACCCT ACTCTAACTC TAACTCTAAC TCTAACTCTG 2820 GCCAAAATAG ACCATTTAAT TCACACAATC GCACACCCCA ATCCGGCACC AAGGACAACC 2880 GGGCCAATAC ATCAAACCCC TTTCGAGCAC CTTCACATAG TTTGAATAAT ATAGAGGAGA 2940 ACCCTCAACC TGATTCGAAT TTTCAGCAAA CGGCCTCGGG AAACCAACAG GGTACATAAG 3000 CCCAGCCACG CACAACCCCT CGCTTCCTTT TATAAAAATC AAACTATCCC AGACAAACCC 3060 CCTGAAGTTT TTAATTGACA CAGGCTCTAC ACACTCCTTC ATCGACCCAA AATATGTCGA 3120 CCCTAGGAAC TGTGTGACCT TAGATACGCC CATAACACTC AAAACAGCCC TGAACAGTTT 3180 TAAAATATAT CAAAACGTCT CTATACCATT TCCACCGGAA TTCCAAATCA CGGGCAAAAT 3240 GACCCTTCTA CCTTTCAAGT TCCACTCTTA TTTTGACGGA TTGATAGGAA TGGACTTATT 3300 ATCTTACCTA AAAACAGAAA TAGATTTACT TAACCTAAAT CTAAAAACCC CAAGTACCAT 3360 TATACCCTTA TGGACCCACA GTAACTCAAC TTCAAACGTA TTTAATATCT CTGGACATAC 3420 GAAAACTATT TTGCCACTAC CAGTGGAAAC CAAACAGGGC GACTTCTACA TCGATTCAAT 3480 TACAATCAAT GATGACTTAA TAATATCAGA CGGGATTTAT AATGCCCAAA ACAATATTGC 3540 TAATTTCGTT ATCACAAACT ATAGCGAGAG GGATCAGTTA TTGTACCTCG AGAGCCCGAT 3600 AAAAGGCATG CCATACTCCA CGGCCAACAA TGTTGAACTT TTCAGTATCA CTTCAGACAC 3660 CCCACAGCCC CAAAACTCCG CAGCGTCGTT ACAAGCCCTT GGCGTCGATC ACCTCTCCTC 3720 TGAAGAGAAA CAAAGCCTAC TTTCACTTTG CAAAAGTTAT CTAGATATCT TCTACAATGA 3780 AGACAAATCA TTGACCTTCA CCAACAAGAT TACACACACG ATTAAAACCA CGGACGACAC 3840 CCCCATTCAT ACAAAATCTT ATAGATATCC TTACATTCAT AAAGAGGAGG TCAAAAAACA 3900 AATAGAGGCA ATGTTAAATC AGGACATTAT CAAATCCAGT TATTCCCCGT GGAGCGCCCC 3960 CGTCTGGGTC GTCCCAAAGA AAATCACTCC TACGGGAGAG CAAAAATGGC GTCTAGTTAT 4020 CGATTATAGA AAACTCAACG AGAAGACTAT ATCCGATAGA TATCCAATAC CTAACATCGC 4080 GGATATCTTA GACAGATTGG GCAAAGCCAA ATATTTCTCC ACACTTGATC TGGCAAGTGG 4140 ATTCCATCAG ATAGAAATGA ATCCCGACGA CACACCCAAA ACTGCATTTA CAGTAGAGGG 4200 GGGCCACTAC GAGTTCATTA GAATGCCGTT TGGCCTCAAA AATGCCCCAG CCACATTCCA 4260 AAGGGTGATG GACAATATTT TTGGAGACCT TATCGGAACT ATCTGCCTAG TTTACCTAGA 4320 TGATATAATA ATTTTCTCAA CCTCCTTACA AGAACACTTC ATACACTTGA AAACTATTTT 4380 TGGAAGACTC AGATCTGCCA ACTTTAAAGT CCAACTCACA AAATCCTACT TCCTCAGGCG 4440 GGAGACAGAA TTCCTTGGCC ACATCGTTTC ACAAGAAGGT GTTAGGCCAA ATCCCAATAA 4500 GATCGAAGCT ATAAAAAACT TTCCATGTCC CCACAGTAAA AAGTCAATTA AGTCTTTCCT 4560 AGGCTTGTTG GGATATTACA GAAAATTTAT CAGAGATTTT GCGAGACTTA CCCAACCCAT 4620 GACACAAAAA TTAAGGGGAA ACAATAAATC GATCATAATA GATGATGAAT TCAAAAAGGC 4680 CTTTGAATAT TGCAAAACCT TACTGTCTAA CGACCCAATC CTCCAATACC CGGACTTTAC 4740 AAAACCTTTC ACACTAACCA CGGACGCAAG TAATTTCGCA ATAGGAGCTG TCCTATCCCA 4800 AGGTCCGGTG CATAGTGATA GGCCCGTATG TTTTGCTAGT AGAACCTTGT CGGCTGCGGA 4860 AACAAATTAT TCCACAATTG AGAAGGAAAT GCTGGCCATT ATATGGGCGG TCCAATACTT 4920 CAGACCCTAC CTCTTTGGCA GGAGATTCAC TATAATCACC GATCACAAAC CACTAACTTG 4980 GTTAATGAAT TTCAAACAAC CAAATTCTAA AATAGTTAGG TGGAGACTCC AGCTTCAGGA 5040 GTACGATTTC GAAGTCGTCT ACAAGAAAGG CTCTCAAAAT GTAATTGCTG ATGCTCTCAG 5100 TAGACCAGAG GCCTCTGTCA ACCATAACGA AGCCCTATCA ATTCCTCAAA ATGTTTGCCC 5160 CATCTCAGAG AAACCCCTTA ATGATTTTAA TATTCAGCTC CTGTTCAAAA TAACCCCAGA 5220 TACAAATAAC GCCACACTGA CCCCGTTTAA ACACAAACTT AGGAGGGAAT TCTGTAAACC 5280 CAATTTTCAG TATGACGACG TAGTTTGCAT TCTTAGGCAG TCGTTAAAAC CAAACAAGAC 5340 ATGCGCGGTA TTTGCCCCCG ACCACATTTT TCAAATGGTG GAACAAGCCT ACCAAACCTA 5400 CTTCTCAGCC CACAGTCAAT TTAAACTCAT TAGATGTTTG ATCTTCCTCC CCGAAATTAC 5460 TGATAGTACG GAGATCGAAA AAATTATAAC CGACTATCAC TATAATAGTA ACCATCGAGG 5520 GATCGATGAA ACATATTTAC ACATAAAACG ACAACAGTTC TTCCCACATA TGAAGGAGAG 5580 AATAACTCAG TTAATTCGAA AATGTGAAAC ATGTTTAAAA TTAAAATACG ACAGACAACC 5640 TCAAAAGATC ACTTACCAAA TATCCGAACT ACCTTCAAAA CCGTTGGACA TCTTACATAT 5700 AGACATTTAT ACTATTAACA AAAATTATAA CCTTACTATT ATCGATAAAT TTTCTAAATT 5760 TGCGGCTGCC TACCCTATAA CTAATAGGAA TTGCATTAAC GTAGTTAAAG CCTTAAAACA 5820 TTTCATTTCC CAATTTGGTA TTCCCAAAAA GCTGATCTAT GATCAGGGAG CAGAATTCGC 5880 TAGCGATATG TTCAATAAGT TCTGCACTCA ATTTAACATT GACCTACACG TTACGTCCTT 5940 TCAACAATCC TCTAGTAACT CTCCCGTTGA ACGGCTTCAC TCGACACTAA CTGAGATTTA 6000 CAGAATAATA CTTGACGTCA GGAAACAACA GAAACTCAGT AGCGAGCATG ACGAGATAAT 6060 GTCCGAAACC CTAATCACAT ATAATAACGC TATTCATTCT GCAACTAAAC ATACCCCCTT 6120 TGAACTATTT AACGGACGTA CTCATATATT CAACCAAACA ATCCAGTTCA ATAACGAACA 6180 CGACTACTTA ACGAAATTAA ATGAATTTCG CGAGAAGTTG TACCCCCTCA TCACGGACAA 6240 ACTTTCAAAT GACGTAGTTA GGAGAACCCT AAAATTAAAT GAAACCCGAA CAGACCCCGT 6300 AGACCTACAA CCAGACACTT TAGTCCTTAG GAAGGAAAAC AGACGTAATA AGATTACACC 6360 CAGGTTTTCG ATTCACAAAG TCAAACACGA CAAAGGTCAT ACATTGATAA CTGCTAGGAA 6420 TCAAAAACTA CACAAATCAA AAATTCGAAA AACAGTTTTG AAAAAAGACA AAAGCAACAA 6480 CGTACCCAAC ACTGATAATA ACTGACCCCA CTACCTCTTA ACTTACCATT TCAGGTTCAC 6540 CCTTGTGCCA ACTCAGGCTA TCCATGTCCA TTATTTAAAT GATAACGCCC CTATAGCCAA 6600 GATAGAACTA GGGAAAGCCT TACTAATTGA GAGGTACAAA ATAATTAGTC ATGTAATCAA 6660 CCTACAAGAC TACAGCAGAT GTATGGAACA ATTCCATCTG ACCATTAATA AATTTAACCC 6720 CGATTCCACG TTGACGGACT CCGTCACAAT TTTAAAAACC AAATTAACCC AAGCCCAAGT 6780 AAAGCTCAAA GCCCTTACAC CTTCATATAG AAACAAACGG GGTTTGATTA ACGGATTGGG 6840 GAGTCTAGTA AAGGTGGTTA CCGGCAACAT GGATGCCAAC GACAATAAAG AAATACATGA 6900 AGAACTTGAC AATATAAAGA AAAATTCCGA AGTCAGTAAC GACAATCTCC AAAAACAAGT 6960 AATGTTTAAC AACGAAATAC TTATCCGGTT CGAAAATATC ACGGACCATA TAAATAATGA 7020 ACAAATTTTG ATAAGTAAAT TCTTTGATAC CTCACAAAAC AAAATATACA AACACTTAAA 7080 CTTACAAGAT ACCCTTCTGG AAGAAATACA ATATTTAAAT AGGATTAATT ATAACATAGA 7140 ATTATTCATT AACCACCTAA ACGACATAAC AGAAAGTATG CTATTGGCGA AAATAAATAT 7200 AATTCCCAAG TTCATCCTAA ATGAACAAGA AATGGATAAA ATAAAAACAA TACTGGAAAA 7260 ACAAAATATC ACAGTCAAAA ATGAACAAAG TATATACAAT TTCCTACAAA TGAATACACT 7320 AAATTACGAA CAAAAGATTA TTTTTAATAT CAAAGTCCCA ATTTTTAAAC AACCTTTTCA 7380 TACCCTCGCC AGACTAGTTC CATTACCAAT AAATAACACA TATTTTGTAA TAACCCCAAA 7440 TTACCTAGCT TATAATATTA ATAATAAGAA ATTTCATATG ACCCGTAAAT GCCCCAAACT 7500 GGATAATACA TTCTTGTGCG ACGAGAACTT CTACGTTGAT ACACCACAGA ACAACACATG 7560 CCTGGAACAC CTTTTGAACG GAGAAAACAG TTCCTGCGAT GTACGGGAAA CCGGCCCCAT 7620 CACCGACGTG TTCGAGGCAG AGAGAGGTTA CATCTTCGCA TTCAACGTGA ACAAACTGAA 7680 GGTATCCCTA ACAAACGGCT CCGAGCTCTC AATAATGGGG TCAGCCATCA TCAGATACAT 7740 TAACGAAACA ATACAGATTA ACGGTATCGA TTACGACGGC ACGGTTGACA CGTTCCCTGA 7800 ACAGACGGAT TTTGATCTTC CCCCCATGCG AAAAGTAACT AGGAATACCA CTATTACGGT 7860 ACTAAGCCTA GAAAAACTGC ACCTCGAAGC CACCCAAACA ATGGATAAAA TCCTGGCCGT 7920 CCATCACAAT ACTATACAGC ACACCTGGAC ACTCTACACT CTGCTCGGAT TGGTAACGTT 7980 CCTAGCAGTC ATCTTATGGC TGCACCGACG AACGAAACAC ATCGTCCACA TCCACGAGGA 8040 TCATCACGTA CCAATCTACG CGTCATCCAT ACCTTCGCTA TGGCCGTCAC TTCGAACTGG 8100 GGGGGGAGGA GTTACCACCC CACCCCCTAA ACCCCCACGC CTCTAAACAA ATCATCGGAC 8160 ACTCAACCGG GAAGACGGCA ACTGGAACAC CGCATCCGGC CGAATGCTGA CATTCCGGCC 8220 GAATGCTGAC ATTACACAAA AGTCGCACTG CAACATTGTC CCCAGCTAGC CAGCCACATG 8280 CCGAGTCGGC ATGTTCATTA TGCTTACAAT TAAGAACCTA TGTACTTATG TATAAGACGA 8340 AAACGGAGGA CTCGAGTAGC CACTCTCTGA CAATAAACTT GATACTGATT TTGAACTTCA 8400 AGAAAGTCAG TCGTATTCTT TATTGGAAAT CTTCACACTA CAACTATCTG CTGAAACTTA 8460 AAAACCTTCA TACATTTACA CATCATATCT TCACAAAAGG CTCCACCCTC GATCACGGAC 8520 TTAACT 8526 // ID DMBLPP standard; DNA; INV; 5034 BP. XX AC Z27119; XX DR FLYBASE; FBgn0014947; flea. XX SY synonym: blastopia SY synonym: Kermit XX FT source Z27119:372..5405 FT SO_feature five_prime_LTR ; SO:0000425:1..276 FT SO_feature three_prime_LTR ; SO:0000426:4766..5035 FT SO_feature polyA_site ; SO:0000553:5006..5011 FT SO_feature polyA_site ; SO:0000553:5019..5024 FT SO_feature CDS ; SO:0000316:760..4761 FT /db_xref="FLYBASE:FBgn0043491; flea\polyprotein" FT /db_xref="SPTREMBL:Q24262" FT /protein_id="CAA81643.1" FT /translation="MFTRTPPTNKKLNTDQIQAILENESEDESRKEKMNEEDQKLAPVG FT EAEAKKQNKDASAKVEEKFEQMMNTLTQSMLAKSKQEGQVIIAAEKFEKVVSDCDGKSI FT PIKKWFEIFEKNAEAYELSEKQKYVQARSKMIGSAELFLESECVSGYTELKELLIEEFS FT GSYNSAVIHKKLQDRKKKREETLHDYLLQMKKIAALGEVETVALITHIVNGLDIKKEYK FT GAMLRCKTLKELKQEFEIYESLNIVDKPNIQPKPKQITQGVKADHCFNCGSREHKRKDC FT TLPTKCFSCNQEGHISSKCPEKVNSMRIHVDSARTKPVIINGIIINCLVDTGSDVTIIK FT EAIFKKMKDVDLNRTATVLRGLGNASTQPIGCFRALIKTDQVEASHNVLVVHDSKFSCD FT GIVGHDFISKFRLICSAEGYTFLDLEADKKQAVEYSQMFNICEESSFTVAPQYREDVER FT MIERTYETPPKQIKQCPVELKIIPDGVIKPFRHGHTRLSEEEAIAVKKQVEEWVEQSIV FT RKSTSNVASRIVVVRKKDGTLRVCVDYRKLNTMVLMDCFPVPIMEEVLEKLQSAKWFTT FT MDLQNGFFHVAVEEASKPYTAFVTREGLFEFNKAPFGFKNSPAAFIRFVQFIFQELINS FT NIMQLYMDDIIVYAATPEECMEKTEMVLKRAAEFGLKIKWKKCNFMQRRIHFLGHIIEG FT GQICPGKEKTSAVNSFGTPQNVKAVQGFLGLTGFFRKFIPGYAQIARPLTDLLKKDAIF FT NIGPVEQQSVNKLKEILVNEPVLRIYSREAETELHTDASKDGLGAVLLQKFEGSFHPVC FT FWSRKTTKAESNRHSYYLEVKAAYLALKKFRHYLLGVPFKLVTDCVAFKQTTKKADVPR FT EVGPWILYMQDFNFQPEHRAGERMRHVDFLSRHPQACMMITSELTARIKKSQQNDDSIR FT AILEILKDRLFQPYKLKGGLLYSMVNGNELLVVPALMEREVIQSAHEVGHLSLQKTMHS FT IQQQFFYFLIWEYKVKKLISNCIKCIIHSKKLGKQEGYLNCIDKGDAPLHTLHIDHLGP FT MDSSAKQYKYILATVDAFSKFVWLFPTKSTGQEEVVKRLTDWSNIFGFPKRIVSDKGTA FT FTSGAFEQFMSSHNVEHVCTTTGVARGNGQIERVNRLILAIISKLSSDEPSKWYKYVPE FT VQKAINCHVHSSLKLSPFEVMFGTKMYTRVEDRLLELLQEEVVCQFNEDRYEMRQLVKR FT NIEQAQKDYKRNYDKKRRAEYKYKAGDLVAIKRTQFVAGRKMASGYLGPYEVTGVKDNG FT RYDVKKAANVEGPNVTSTSCDNMKLWKYIAENADLLSSGSDDDDQEGRM" XX CC Derived from Z27119 (g415797) (Rel. 50, Last updated, Version 6). CC Takis Benos and Michael Ashburner, 20-Aug-1997. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5034 BP; 1719 A; 938 C; 1145 G; 1232 T; 0 other; TGTAACATGA GTAAGGCTGA AGGCTGGCAA CAACCCGGTT GGCAGCGCTG TTGAGCAGCA 60 ACATGATTGT CGGAAATCCA AGTTATCGAC AATCAGTCAT CGAAGGACGA TCGCAGGCAG 120 CAGTAGAGGC GAGTGGAAGT CAGCGTTGCA GTCAGTCGAG TTCTCAGCAG CAGTCGTTCG 180 GTCCACAAAC TAAGAAATAC TTTATATAAT TACCGCATTT AGAATTAAAC TAATAATTAA 240 ATTAATAATA AACAATAATA ATAAACAATC TTACATGGGG GCTCGTCCAG TCCTAAATCG 300 GTTATATGAA GGTGCAGTTG TTTAAAGAAA AAAGACATTG TTGTGTGCGT GGGTATAGTC 360 TTTAAAACGT TGTAAAGTTG TGGCTATATC TATTGCATTT AAAGTTGGAA AAATCAGTTG 420 TACAGATTTT GTTTGAACAC AAGTCGGTAA AAGTCGGGAA AGCTGCTAGA GAGAACTGAT 480 AAAGTTGAAA TTGTCGTGTG CGTGGATTTA GTCTTTAAAG TTGTAAAGTT ATGGCTACGT 540 CTACTGCATT GAAAGTTGAA AAAATCGATT GAACTCATAC AGACTCAAGT CGTTTTGCTG 600 TTGTGGAATT TAAAACAATT AAATTGCAAA GGTGGTGAAA TTCGTTTCTA ACGAAAATCA 660 AAATTTGTCT TTTAACCGGT GGCGCCGTCT GCAAAATCGA CTACCGTCGC GCCGTTAGAA 720 CATTGTCGTT GTTTGCTGGT GTTAGTGCCT TGTCGCGGAA TGTTCACACG TACACCACCT 780 ACAAATAAAA AACTTAACAC CGACCAAATA CAAGCAATTC TAGAGAACGA AAGCGAGGAC 840 GAAAGCAGAA AAGAAAAAAT GAACGAAGAA GATCAAAAGT TGGCGCCTGT AGGAGAAGCA 900 GAGGCAAAGA AGCAGAATAA AGACGCTAGT GCTAAAGTCG AAGAGAAATT TGAACAAATG 960 ATGAATACTC TAACCCAGAG CATGTTGGCA AAATCTAAAC AAGAGGGGCA AGTAATTATC 1020 GCTGCAGAAA AATTTGAAAA AGTTGTAAGT GACTGTGATG GCAAATCAAT TCCTATTAAA 1080 AAATGGTTTG AAATTTTTGA GAAAAATGCC GAGGCATATG AACTTTCGGA GAAACAAAAA 1140 TATGTTCAAG CCAGAAGTAA GATGATTGGA TCAGCAGAAC TTTTCTTAGA ATCTGAATGT 1200 GTCAGTGGAT ACACTGAACT CAAAGAGTTA CTAATTGAAG AATTTTCAGG CAGCTATAAT 1260 AGCGCCGTTA TTCACAAAAA GTTGCAAGAC AGGAAGAAGA AGAGGGAGGA AACTCTACAC 1320 GACTATTTGT TACAAATGAA GAAAATAGCA GCCTTAGGTG AAGTTGAAAC AGTTGCTTTG 1380 ATAACTCATA TCGTAAACGG CCTCGACATT AAAAAGGAGT ATAAGGGTGC TATGCTCCGT 1440 TGTAAAACTC TTAAGGAATT AAAGCAAGAA TTCGAAATCT ACGAGAGTCT GAATATTGTT 1500 GACAAGCCGA ATATTCAACC AAAACCAAAG CAAATTACAC AAGGTGTAAA AGCAGATCAC 1560 TGCTTCAACT GTGGTTCGAG GGAACACAAA CGAAAGGATT GTACACTTCC TACCAAATGT 1620 TTCAGCTGTA ATCAAGAGGG CCATATCTCA AGCAAGTGTC CGGAAAAAGT AAACAGCATG 1680 CGCATTCACG TTGATAGTGC ACGAACAAAG CCAGTAATCA TAAATGGGAT TATCATCAAC 1740 TGTCTGGTGG ACACAGGATC AGATGTGACC ATAATTAAAG AAGCTATATT CAAGAAGATG 1800 AAAGATGTTG ATTTAAACCG CACTGCAACA GTATTGCGAG GTTTGGGAAA TGCCTCAACA 1860 CAGCCGATTG GATGCTTCAG AGCATTAATC AAGACCGACC AGGTGGAAGC AAGCCACAAC 1920 GTTTTAGTCG TCCACGATTC TAAATTCAGT TGCGATGGAA TAGTGGGACA CGATTTTATC 1980 AGCAAGTTTC GTCTTATCTG TAGTGCAGAA GGCTATACTT TTCTTGACCT GGAAGCAGAT 2040 AAAAAACAAG CGGTTGAGTA TTCCCAAATG TTTAATATTT GTGAAGAATC TTCTTTTACA 2100 GTTGCACCAC AATACCGAGA AGACGTTGAA CGCATGATAG AGAGAACATA CGAAACACCA 2160 CCCAAGCAGA TAAAGCAATG TCCAGTCGAA CTCAAAATTA TTCCTGATGG CGTGATTAAA 2220 CCCTTTCGCC ATGGACACAC CCGACTATCT GAAGAAGAAG CTATAGCTGT AAAGAAGCAG 2280 GTAGAGGAAT GGGTCGAGCA GTCAATCGTC CGTAAATCTA CATCAAATGT TGCCAGTCGC 2340 ATAGTCGTTG TCAGGAAAAA GGATGGTACC CTACGCGTTT GCGTGGACTA TAGAAAATTG 2400 AACACCATGG TTCTGATGGA TTGTTTTCCG GTACCCATAA TGGAGGAGGT GCTTGAAAAA 2460 CTGCAGAGTG CCAAATGGTT TACAACCATG GACTTACAGA ACGGATTTTT TCATGTGGCC 2520 GTAGAAGAAG CCAGCAAGCC GTACACAGCA TTTGTTACCC GAGAAGGCTT ATTCGAGTTT 2580 AACAAAGCGC CCTTTGGTTT TAAGAATTCC CCAGCAGCGT TTATACGGTT CGTTCAATTT 2640 ATTTTTCAAG AACTAATCAA TTCCAATATA ATGCAGCTAT ATATGGATGA CATAATTGTA 2700 TATGCCGCTA CCCCAGAAGA ATGCATGGAA AAGACGGAAA TGGTACTTAA GAGAGCTGCA 2760 GAATTTGGTC TAAAAATAAA ATGGAAGAAG TGCAACTTTA TGCAGAGGCG AATTCATTTC 2820 CTGGGACATA TTATCGAAGG TGGACAAATA TGCCCTGGAA AAGAGAAAAC ATCAGCAGTG 2880 AATTCCTTTG GAACACCTCA GAATGTAAAA GCCGTTCAAG GATTTCTGGG TCTCACAGGA 2940 TTCTTCAGAA AATTCATACC TGGATACGCC CAAATTGCGA GACCACTGAC GGACCTATTA 3000 AAAAAAGATG CCATTTTCAA CATTGGACCA GTAGAGCAGC AGTCGGTGAA TAAGCTGAAA 3060 GAGATTCTGG TAAACGAACC AGTATTGAGG ATCTACTCAC GAGAAGCAGA AACCGAACTT 3120 CATACAGATG CCTCTAAGGA CGGGTTAGGA GCCGTTTTAT TGCAGAAGTT CGAAGGCAGT 3180 TTTCACCCAG TCTGCTTTTG GAGCAGAAAA ACTACAAAAG CCGAATCAAA TCGTCATAGT 3240 TATTACCTTG AAGTAAAAGC CGCATACTTA GCTCTGAAAA AGTTCAGACA CTATTTATTG 3300 GGAGTCCCTT TCAAGCTCGT CACGGACTGT GTCGCATTTA AACAGACAAC AAAAAAAGCA 3360 GATGTCCCAA GAGAAGTTGG CCCATGGATT CTCTATATGC AGGATTTTAA TTTTCAACCC 3420 GAACATCGTG CAGGAGAAAG AATGAGACAC GTTGATTTTT TAAGCCGCCA TCCCCAAGCA 3480 TGCATGATGA TAACATCCGA GTTGACAGCA CGTATTAAAA AGTCGCAGCA GAACGATGAT 3540 TCAATTAGAG CAATCCTGGA AATTCTAAAA GATCGTCTAT TCCAACCCTA CAAGCTAAAA 3600 GGTGGCCTGT TGTATAGTAT GGTCAATGGC AATGAACTAC TGGTTGTCCC TGCACTAATG 3660 GAGAGGGAGG TGATTCAAAG CGCACATGAA GTTGGCCATT TGTCGTTGCA AAAGACGATG 3720 CATAGCATAC AGCAGCAATT TTTTTATTTC CTCATTTGGG AATACAAGGT AAAAAAGCTA 3780 ATTTCTAACT GTATAAAATG TATCATCCAC AGCAAAAAGT TGGGAAAGCA GGAGGGATAT 3840 CTAAATTGCA TAGATAAAGG AGACGCACCG TTGCACACAC TACACATCGA TCATTTGGGG 3900 CCAATGGATT CATCGGCCAA ACAGTATAAA TACATTCTGG CAACAGTCGA TGCGTTTTCA 3960 AAGTTTGTCT GGTTATTCCC AACCAAATCA ACCGGACAGG AAGAAGTGGT CAAGAGGCTG 4020 ACCGACTGGT CAAACATTTT TGGTTTCCCT AAGCGAATTG TTAGCGACAA AGGAACGGCC 4080 TTTACGAGTG GTGCGTTCGA ACAATTTATG AGCAGCCATA ACGTGGAACA CGTCTGCACA 4140 ACTACTGGAG TGGCCAGAGG CAACGGCCAG ATAGAACGAG TAAATCGTTT AATTTTGGCA 4200 ATAATATCAA AGCTGTCTTC AGACGAACCG TCGAAGTGGT ACAAATATGT GCCTGAGGTA 4260 CAAAAGGCGA TCAACTGTCA CGTGCATTCA TCACTGAAGC TGTCACCATT TGAGGTCATG 4320 TTTGGCACCA AGATGTACAC CCGAGTTGAG GATCGGTTAC TGGAACTGCT CCAAGAAGAA 4380 GTGGTCTGTC AATTCAACGA GGACCGCTAT GAGATGAGAC AGCTGGTAAA ACGCAACATC 4440 GAGCAGGCGC AGAAGGACTA CAAGCGCAAT TACGACAAAA AGCGCCGAGC TGAATACAAA 4500 TACAAAGCAG GTGATCTGGT TGCAATTAAA AGGACCCAAT TTGTAGCTGG CCGCAAGATG 4560 GCAAGCGGGT ATTTAGGTCC ATACGAAGTC ACAGGGGTCA AAGACAATGG CAGATATGAC 4620 GTTAAAAAAG CAGCAAACGT CGAAGGACCC AATGTCACAT CCACCAGCTG TGACAACATG 4680 AAGTTGTGGA AGTACATAGC CGAAAATGCA GACCTATTGT CATCCGGGTC GGATGATGAT 4740 GATCAGGAGG GCCGAATGTA ACATGGAGTA AGGCTGAAGG CTGGCAACAA CCCGGTTGGC 4800 AGCGCTGTTG AGCAGCAACA TGATTGTCGG AAATCGAAGT TATCGACAAT CAGTCATCGA 4860 AGGAACGATC GCAAGGCAGC AGTGGAGTAG GAGTGGAAGT CAGCGTTGCA GTCAGTCGTG 4920 TTCTCAGCAG CAGTTCGTTC GGTCACAAAC TAAGAATACT TTATATAATT ACCGCATTTA 4980 GAATTAAACT AATAATTAAA TTAATAATAA ACAATAATAA TAAACAATCT TACA 5034 // ID OPUS standard; DNA; INV; 7521 BP. XX AC AY180918; XX DR FLYBASE; FBgn0003007; opus. XX SY synonym: nomad SY synonym: yoyo XX FT source AY180918:1..7521 FT SO_feature five_prime_LTR ; SO:0000425:1..518 FT SO_feature three_prime_LTR ; SO:0000426:7004..7521 FT SO_feature CDS ; SO:0000316:1578..2831 FT /protein_id="AAN87270.1" FT /translation="MEETLRALSESLNALTNVVTGIKEDIKKNNDRLAILEQERGNAD FT PTVDQPQPLVRARTEYELREISVLPDCVKELQAFEGRQEAYLSWINRAQSILTEYDLI FT KTRPLYRAIVLHIRQKIRGHADMALAAYGVQDDDWDDIKRVLALHYADKRDLRTLEHE FT LGAMCQGSRPLDRFYMDVNGHLSLILNNLKARNHPREVVNALIETYRDKALDVFIRGV FT GRDCSKHLLVRSPKNLPEAYSFCMGLQNVMSRNFTAQNYQPSGAPRFAGPYQHQARPP FT FRTPFSPGSGRFSQNSYRTQGPRQAIKMESNRSGQSYQSGYSGRQEEGSGIKRMSEGN FT NPFQKAQRLYHMELAPPPLAPAASGDNQGRSHEGYYDDESQAVERSNNYPPQKNVEGV FT TDAPHNLETEGGANFMTNASPVYRT" FT SO_feature CDS ; SO:0000316:2972..5983 FT /protein_id="AAN87271.1" FT /translation="MITHRLVGKFFKPLGNDSDITFFVLPNLHSFDGIIGDDTLKDLK FT AIVDRKNNCLIITPGIKIPLLARASINVNPLLAAEHPDGTQEILNSLLGEFPRIFEPP FT LSGMSVETAVKAEIRTNTQDPIYAKSYPYPVNMRGEVERQIDELLQDGIIRPSNSPYN FT SPIWIVPKKPKPNGEKQYRMVVDFKRLNTVTIPDTYPIPDINATLASLGNAKYFTTLD FT LTSGFHQIHMKESDIPKTAFSTLNGKYEFLRLPFGLKNAPAIFQRMIDDILREHIGKV FT CYVYIDDIIVFSEDYDTHWKNLRLVLASLSKANLQVNLEKSHFLDTQVEFLGYIVTAD FT GIKADPKKVRAISEMPPPTSVKELKRFLGMTSYYRKFIQDYAKVAKPLTNLTRGLYAN FT IKSSQSSKVPITLDETALQSFNDLKSILCSSEILAFPCFTKPFHLTTDASNWAIGAVL FT SQDDQGRDRPIAYISRSLNKTEENYATIEKEMLAIIWSLDNLRAYLYGAGTIKVYTDH FT QPLTFALGNRNFNAKLKRWKARIEEYNCELIYKPGKSNVVADALSRIPPQLNQLSTDL FT DANPEDDMQSLATAHSALHDSSRLIPHVESPINVFKNQLIFDTTRSKYLCEHPFPGYT FT RHLIPLKDGSLADLTNSLQSCLRPVIINGVKIPEAHLQRFQSICLANFLLYKIRITQR FT LVADVSGAEEICEIIEKEHRRAHRGPTEIRLQLLEKYYFPRMSSTIRLQTSSCQCCKL FT YKYERHPNKPNLQPTPIPNYPCEILHIDIFALEKRLYLSCIDKFSKFAKLFHLQSKAS FT VHLRETLVEALHYFTAPKVLVSDNERGLLCPTVLNYLRSLDIDLYYAPTQKSEVNGQV FT ERFHSTFLEIYRCLKDELPTFKPVELVHIAVDRYNTSVHSVTNRKPADVFFDRSSRVN FT YQGLTDFRRQTLEDIKGLIEYKQIRGNMARNKNRDEPKSYGPGDEVFVANKQIKTKEK FT ARFRCEKVQEDNKITVKTRSGKIFHKSDLRN" XX CC Sequence from P1 DS01219.2:861..8381 provided by Guochon Liao CC Berkeley Drosophila Genome Project. XX SQ Sequence 7521 BP; 2322 A; 1740 C; 1677 G; 1782 T; 0 other; AGTTAAGAAC CCTCTTCTTG CGCTCTTCGT CAGGACTCAC CAGCGCTCGG CTCTCGTGTT 60 TTCGGGCCCC GTCAGCAGGC GACTCGGGGC CTGTCTAGTA ACATGTTCGT GTAAGTTACG 120 AACCCTCTTC TTGCGATCTT CGTCAGGACT CACCAGCGCT CGGCTCTCGT GTTTTCGGGC 180 CCCGTCAGCA GGCGACTCGG GGCCTGTCTA GGAACATGTT TGTGTATGTG TGCATTCGGA 240 ACAAGTGCCG TTGGTCGCAC TCAGGGTGAG GGGTCAACGG GGGAAGCGGA TATAAAAGCA 300 GCGGGGCGGG AGAAGAGGTC CCAGTCTCGA ACGGACACAT AACGGAACCG CTAGCAGATC 360 GCGAACTGAA TCTTAAAATA AAGCTAATCG TAAACTCGAA CCCTCTTAAC TATCTTGACT 420 ATTATTTGGA GAACCACAGC ATGTTGGTTG TCATATCAAG GTGAGGTATG CGGCAGCGAG 480 TGCCGAGAAC CCTGATGCAA GTGGAACTTG CGTTAACTGG CGCCCGAACA GGGACCGGCA 540 ATGTCCGGCC GATAAAAGTG ATACGAAAAA ATTGTGGAAA TTTGTGCGTA AAAATAGTGG 600 TGGTGTGCAT AAGTCAGATT AAGATCTGAA ATCCATAAAT GAAAAAGAAG TGCTGCGTGA 660 GCTGTGTATA AAATGATAAA ATAGCAATTA CCCGCTGCCG GGGGGAACTA CGCCCATCCC 720 GGGGCGCAAC AAATATTGCA TAATTCAATA AAAGGTGTAA AATTTCTAAA ATAAAAATGT 780 AAACCTATGT TGCGCCAAGA CCTAATTTAA ATTAATAAAA CAACGACCCG CTACCGGAGG 840 ACGCCACGTC GCCCATGCCG AGCGCAAAAG TTGTACGATA CCTATAACAT AATTAAAACA 900 CGATCAACCC ACTGCGGCGG TACGGCTTGT GGGAAAATTT TTTTTTTTTT CTCTCCTTGC 960 CAATTCGCGA GTGCAAAAGA TTGTGTATAA TAAACCAATA ATTAACCATT GCAGCAGTTT 1020 ACCTGCGGCA GTACGAGTAA TATGAGCGCC CAGAGTGATA AGGTGGTGTG TGGCAGCTTG 1080 TTGGATACGT TAAGTGGTGT GGAATGCACC CAAAAAAAAC CGCCCAACAA GTTGTGTGGC 1140 GGCCGTACCT TAGTAGGCAA CCAGCCAAAA GGGATACTAC GGAACCACCG TGCCCAGTGC 1200 CGAAATAAAT TAGAGGTCAT CAATAAAAAA CTGTAACAGC ACGCACGCAA GGAAAAAATA 1260 TTGCAAAATG GAATAGCGCA CAAAAATTGT ATAAACACAT GCACAACACC ACAATTCAAA 1320 GGAAAACAAA ATATTCATGC TGTAGGGGTA CAACCTAAAC GACGAAAACT AATAAAGAGC 1380 ATACAAGGGT GAGTGAAATA TTTCATTAAA CTTTATTGCC ATATTTGCTA AATTTAGAGA 1440 AATAAAGAAA AAGCAAAGAA GAACAGATAT TCTTTTTTAT CGGGTTAAAA CCGTTGTCTC 1500 ACATTTCCGT AAAGTAATAA CGAATTCTGT TGCCTTGAAA GCTTCCTGCA TCTTTCCAAC 1560 GCAAACTAAA AATCAAAATG GAAGAGACCC TGCGTGCTCT TAGCGAGTCC CTCAATGCCC 1620 TGACCAACGT GGTGACAGGC ATTAAGGAAG ATATTAAGAA AAATAATGAT AGGTTGGCTA 1680 TTTTAGAACA GGAGCGCGGG AACGCTGACC CTACGGTCGA CCAACCGCAA CCCCTGGTGC 1740 GCGCACGCAC CGAGTATGAG CTGAGAGAGA TATCGGTCCT CCCTGACTGC GTCAAAGAAC 1800 TGCAGGCGTT CGAAGGACGG CAGGAGGCTT ACCTGTCTTG GATAAACAGG GCACAGTCAA 1860 TACTGACCGA ATATGACTTG ATTAAAACCA GACCCCTGTA TAGGGCAATT GTCTTGCATA 1920 TTAGACAGAA AATAAGGGGA CACGCCGACA TGGCCTTGGC GGCCTATGGC GTCCAAGACG 1980 ACGATTGGGA CGACATAAAA CGAGTCTTGG CGCTGCATTA CGCAGACAAA CGAGACTTAC 2040 GTACGCTTGA GCATGAGCTT GGCGCTATGT GCCAAGGTTC TAGACCACTA GATAGGTTCT 2100 ATATGGACGT TAATGGCCAT CTCTCGTTGA TCTTAAATAA CTTGAAGGCC AGAAACCACC 2160 CTCGTGAAGT AGTCAACGCT TTGATAGAAA CCTATAGAGA CAAGGCTTTG GATGTTTTTA 2220 TCCGAGGAGT GGGGAGAGAT TGTTCCAAAC ACTTACTTGT CCGCAGCCCG AAGAATCTAC 2280 CAGAGGCTTA CTCTTTTTGT ATGGGATTGC AGAATGTAAT GTCAAGAAAT TTCACAGCTC 2340 AGAACTATCA ACCGTCAGGT GCCCCAAGAT TCGCAGGCCC ATATCAACAT CAGGCCAGGC 2400 CACCGTTCCG AACCCCTTTT TCTCCTGGTT CAGGCAGATT TTCGCAAAAC TCCTACAGAA 2460 CTCAGGGTCC TAGACAGGCC ATAAAAATGG AATCCAATCG GTCGGGTCAA TCTTACCAAT 2520 CAGGATACAG TGGTCGCCAG GAAGAAGGCT CCGGTATTAA GAGAATGTCC GAAGGAAACA 2580 ACCCATTCCA AAAGGCACAA AGATTGTACC ACATGGAATT GGCACCACCC CCGCTAGCCC 2640 CGGCGGCTAG TGGAGATAAC CAAGGACGTT CACACGAGGG TTACTATGAT GACGAGTCTC 2700 AAGCTGTCGA GAGAAGCAAC AATTATCCTC CGCAGAAAAA CGTGGAAGGA GTTACAGATG 2760 CTCCACATAA CCTTGAGACT GAGGGAGGGG CAAATTTTAT GACCAACGCC TCTCCAGTGT 2820 ACCGTACTTA GAGTATGCTA CGGAGAGGGG AGAAAGGCTG AAGTTTTTGA TCGACACGGG 2880 GGCGAACAAA AACTTTATTA GCCGAAGACT TGCAGCCGGG TGTACCACAG TCCGTAAACC 2940 CTTCTCCGTA CTGTCCGCTG CGGGTAACAT CATGATAACG CACCGCCTAG TTGGTAAATT 3000 CTTCAAACCA CTAGGGAACG ACTCGGATAT TACCTTTTTC GTACTACCGA ATTTACATTC 3060 CTTTGATGGT ATCATTGGCG ACGATACTCT CAAAGACTTA AAAGCCATAG TGGATAGGAA 3120 AAACAATTGT TTGATAATAA CCCCAGGAAT TAAAATCCCT CTTTTGGCGA GAGCTTCAAT 3180 AAACGTTAAC CCGCTACTCG CCGCCGAACA CCCAGATGGT ACACAAGAAA TTTTGAATTC 3240 CCTTCTCGGG GAATTTCCCC GCATCTTCGA GCCCCCCTTA TCTGGAATGT CCGTGGAGAC 3300 GGCCGTCAAG GCTGAAATCC GGACAAACAC ACAAGACCCG ATCTATGCTA AAAGTTATCC 3360 TTACCCAGTC AACATGCGCG GAGAAGTCGA ACGTCAAATC GATGAACTGC TGCAGGACGG 3420 TATAATTCGA CCCTCTAATA GCCCTTACAA TTCCCCTATC TGGATAGTCC CGAAGAAACC 3480 TAAACCAAAC GGAGAAAAAC AATATCGCAT GGTAGTCGAT TTCAAGCGGT TAAATACCGT 3540 CACCATACCC GACACTTACC CCATCCCAGA TATAAACGCT ACGCTAGCCA GCCTTGGCAA 3600 TGCCAAATAC TTTACCACCC TAGATTTGAC TTCTGGATTC CATCAAATCC ACATGAAGGA 3660 AAGCGACATT CCAAAGACAG CTTTCTCTAC TCTAAATGGA AAGTACGAGT TCCTCCGTCT 3720 ACCATTCGGT TTGAAGAATG CACCTGCAAT CTTCCAAAGA ATGATCGATG ATATTTTGCG 3780 CGAGCATATT GGCAAGGTCT GCTACGTTTA TATTGACGAT ATCATCGTCT TCAGTGAAGA 3840 TTATGACACA CACTGGAAAA ATCTCCGATT GGTATTAGCG AGTTTATCAA AAGCTAACCT 3900 CCAAGTGAAC CTTGAGAAGT CGCATTTTTT AGACACGCAG GTAGAATTTT TAGGATATAT 3960 CGTCACGGCC GATGGCATTA AGGCAGATCC GAAAAAGGTC AGAGCGATTA GCGAAATGCC 4020 TCCTCCGACC TCTGTTAAGG AGTTAAAAAG ATTTCTAGGC ATGACCTCGT ACTACAGGAA 4080 GTTCATTCAG GACTATGCGA AGGTAGCAAA GCCCCTTACA AACTTGACGC GTGGATTGTA 4140 CGCTAATATA AAGTCTTCAC AATCAAGCAA AGTGCCAATT ACATTAGACG AGACGGCCCT 4200 ACAGTCTTTT AATGATTTAA AATCAATTCT CTGTTCTTCT GAAATACTGG CGTTCCCATG 4260 TTTCACTAAA CCTTTCCATC TAACCACGGA CGCTTCTAAC TGGGCCATCG GAGCTGTCCT 4320 CTCACAGGAC GACCAGGGTA GAGATAGGCC GATAGCGTAC ATTTCCCGTT CATTAAATAA 4380 GACGGAGGAA AACTACGCTA CTATCGAAAA GGAAATGCTC GCGATAATTT GGTCATTGGA 4440 CAATCTTCGG GCTTACTTAT ATGGCGCTGG TACTATTAAA GTATATACTG ACCATCAACC 4500 TCTAACGTTT GCCCTAGGCA ACAGAAATTT CAATGCGAAG CTAAAACGCT GGAAGGCTCG 4560 TATAGAGGAA TACAACTGCG AACTCATCTA CAAGCCTGGG AAATCTAATG TGGTGGCTGA 4620 CGCGCTTTCA CGCATTCCGC CTCAGCTTAA CCAGTTGAGC ACCGATTTAG ATGCTAATCC 4680 CGAGGATGAC ATGCAGTCTT TGGCTACTGC CCATAGCGCT TTACATGACA GTTCACGATT 4740 GATTCCCCAC GTTGAATCTC CAATCAACGT TTTCAAGAAT CAACTCATTT TTGACACAAC 4800 CAGGTCAAAA TACTTATGCG AGCACCCGTT CCCAGGTTAT ACTCGCCATC TGATTCCTCT 4860 CAAAGACGGA TCACTTGCCG ATTTAACCAA CTCGTTACAA TCGTGTCTAC GACCTGTAAT 4920 AATTAACGGC GTCAAAATCC CGGAAGCACA TTTGCAACGC TTTCAGTCCA TCTGCTTAGC 4980 GAATTTTCTT TTATACAAAA TTCGGATAAC GCAGCGCCTA GTGGCGGACG TGTCTGGCGC 5040 AGAGGAAATT TGTGAAATAA TTGAAAAAGA ACACCGTAGA GCACATAGGG GCCCTACGGA 5100 GATTCGTCTC CAACTTTTAG AAAAATATTA TTTCCCGCGA ATGTCCAGTA CGATCCGTCT 5160 GCAAACTTCC TCATGTCAGT GTTGCAAACT CTACAAGTAC GAGAGACACC CTAACAAACC 5220 AAACCTACAA CCTACGCCAA TTCCTAACTA CCCATGTGAA ATACTTCACA TCGACATTTT 5280 TGCGCTCGAA AAAAGGTTAT ACCTAAGTTG TATTGACAAA TTTAGCAAGT TTGCCAAACT 5340 TTTCCATCTG CAGTCAAAAG CATCTGTGCA TTTGCGAGAA ACTTTGGTGG AGGCCCTACA 5400 TTACTTCACC GCCCCTAAGG TCTTGGTTTC GGATAACGAG CGAGGGTTGT TATGCCCCAC 5460 AGTGCTCAAC TATCTTCGGT CTCTAGATAT CGATCTGTAT TATGCTCCAA CCCAGAAGAG 5520 CGAAGTAAAT GGTCAAGTCG AGAGATTCCA CTCTACGTTC CTAGAAATTT ATCGTTGCCT 5580 TAAAGATGAG CTCCCTACCT TCAAACCCGT TGAGCTGGTA CACATAGCAG TGGACCGCTA 5640 CAACACTTCC GTTCACTCGG TAACGAATCG AAAACCAGCA GACGTTTTTT TCGACCGCTC 5700 GTCAAGGGTA AACTATCAGG GTCTGACAGA TTTCCGGCGG CAGACTTTAG AGGACATCAA 5760 GGGCTTAATT GAGTATAAGC AAATTAGAGG TAATATGGCT CGGAATAAAA ATAGGGACGA 5820 GCCAAAGTCT TATGGGCCGG GAGATGAAGT TTTTGTTGCA AATAAGCAAA TAAAAACAAA 5880 GGAAAAAGCG AGGTTCAGAT GCGAAAAGGT ACAGGAAGAC AACAAGATAA CAGTTAAAAC 5940 CAGATCAGGA AAAATTTTCC ACAAATCTGA TCTAAGAAAT TGAGACGTGG CTTTCACATT 6000 TAAAAAAGAA ACGCGAAAAA GAATAACGAA AGTAATAAAA GTACGTTGTG GCAGCTAATG 6060 AAATATTCCA CCCATGCATA CCCTATATAA AAAAAACATT AATAAAAAAA AAAAAAAAAA 6120 AAAAAAAAAA AAAAAAAAAT GAGTTAAGAA ATACAAAAAG AAATACAAAA AAAACTATAA 6180 AAAAAATAAT ATAAAAAAAT ACAGATTATA AGAAATAAGA AATAAGAAAT ATAAAAAAAT 6240 AAAAATATAA GTACACAAAA TGTACCGTAC CCCCACACAC TACGTAGTCT TAGAACAACT 6300 TAGACGACCA GATATTTACG AATTGTCTTT TTGTAAGCGC GATTTCTGCA TGCGGCGCAA 6360 ATCCCGCTCA CTGGACTGGC TGGGGTCGGC TTGGAAATGG GTAGCTGGAT CTCCAGATGC 6420 TGCTGATTGG AACGCCGTCT TGGCCGCGCA AGCGACGGCT TCGAGGAACT GCAAAAACTG 6480 GAGGAGGCTA GCTGTATCCC TCGGCTACTG AAGTAACCAA CGAGTGGTTA AGCAAGTCGA 6540 CGATGGAATG CTCCTCCTGA CCAACTTCAA CGGAACTCTA AGAACGGCTG CAGAGAACTA 6600 CGACCTGATC GGCTCCTTTA TCATCCAATT CGACAATGAG ACGATAATGG TCAACGGTCA 6660 AAACTATTCC AGTTACTCGG TCAGTCATCT AATGGCGATG CCGGCCGTGT TGAGCCACAT 6720 AACGGCCAGC AACTTTCAAC TTTCTCTGGA ATACGTCCAC GACGTGAGCA TGAAGAATTT 6780 GGAAAAGATG TCCAACATGG CGAGTGAGCT ACTAGCCTCT CTTCTCACCG AGGCGGCACT 6840 CGCAATCTGC ATATTCCTAG GCTTTTATTT CCTATGGAAG AAGCTGATGT CCACCAAAGG 6900 CATGCCCGAT GTCCGCGAGA TTGCCGCAAA CTTAGAAGCA TTGGGCCAAA CCGAGCTGAA 6960 CAAGGCTCAC TAATCTGCGG GACGCAGATC TTGAGGGGGG AGGAGTTAAG AACCCTCTTC 7020 TTGCGCTCTT CGTCAGGACT CACCAGCGCT CGGCTCTCGT GTTTTCGGGC CCCGTCAGCA 7080 GGCGACTCGG GGCCTGTCTA GTAACATGTT CGTGTAAGTT ACGAACCCTC TTCTTGCGAT 7140 CTTCGTCAGG ACTCACCAGC GCTCGGCTCT CGTGTTTTCG GGCCCCGTCA GCAGGCGACT 7200 CGGGGCCTGT CTAGGAACAT GTTTGTGTAT GTGTGCATTC GGAACAAGTG CCGTTGGTCG 7260 CACTCAGGGT GAGGGGTCAA CGGGGGAAGC GGATATAAAA GCAGCGGGGC GGGAGAAGAG 7320 GTCCCAGTCT CGAACGGACA CATAACGGAA CCGCTAGCAG ATCGCGAACT GAATCTTAAA 7380 ATAAAGCTAA TCGTAAACTC GAACCCTCTT AACTATCTTG ACTATTATTT GGAGAACCAC 7440 AGCATGTTGG TTGTCATATC AAGGTGAGGT ATGCGGCAGC GAGTGCCGAG AACCCTGATG 7500 CAAGTGGAAC TTGCGTTAAC T 7521 // ID DM_ROO standard; DNA; INV; 9092 BP. XX AC AY180917; XX DR FLYBASE; FBgn0000155; roo. XX SY synonym: B104 XX FT source AY180917:1..9092 FT SO_feature five_prime_LTR ; SO:0000425:1..429 FT SO_feature three_prime_LTR ; SO:0000426:8665..9092 FT SO_feature CDS ; SO:0000316:1275..8357 FT /protein_id="AAN87269.1" FT /db_xref="FLYBASE:FBgn0043856; roo\ORF" FT /translation="MMSEKTIQFLKKQSEIILEIRKLEVKPTLTDVEILKLNELQKCF FT IANHSNLLKIGVVDHEYFNAKQYDLIMMVLEKIKNKNEKIKGESVENTFPKSNTVPKS FT NPPPTLNLEMRGHPEKEGIAQNNALKVEQAFRNNVGQFRVYLEDTSKLIDSSPDFLKI FT RKNKIEFLWHKIDNLIEQVNSRFESSLFEEEISELEFDKQNILTAINSRLSGTINKAE FT MSTVVKAEELPTLPKIQIPTFFGDSKEWDLFNELFTELIHVREDLSPSLKFNYLKSAL FT KGEARNVVTHLLLGSGENYEATWEFLTKRYENKRNIFSDHMNRLMDMPNLNLESNKQI FT KTFIDTINESIYIIKLKAQLPEDVDAIFAHIILRKFNKESLNLYESHVKKTKEIQALS FT DVMDFLEQRLNSISSFSQEVKPVKKMINNNKNKNYSDNCAYCKLPGHYLIQCHKFKIM FT NPAERSDWVRKNGICLRCLRHPFGKKCISEQLCSTCRKPHHTLLHFAGHNPEKVNTCR FT TTGQALLATALIQVKSRYGGFEQLRALIDSGSQSTIISEESAQILKLKKFRSHTEISG FT VSSTGTCISKHKAVISIRNSPKNLEIEAIILPKLMKALPVNTINVDQKKWKNFKLADP FT DFNKPGRIDLIIGADVYTHILQNGVIKIDGLLGQKTDFGWIVSGCKKSKGKETIVATT FT IEIKELDRYWEVEEEEKDDIESEICENKFIKTTKKDSDGRYIVSIPFKEDVTLGDSKK FT QAIARYMNLEKKLKRNEKLKVDYTKFMNEYMDLGHMIEVSDEGKYFLPHQAVIRDSSL FT TTKLRVVFDASAKTTNNKSLNDIMWVGPRVQKDIFDIIIKWRKWEFVVSADIEKMYRQ FT IKIDNNDQKYQYILWRNSPKEKIKTYKLTTVTYGTASAPYLATRVLVDIADKCKNQVI FT SAIIRNDFYMDDLMTGADSVEEANKLITLIPHELQKVGFNLRKWISNNSKILTTVEDT FT GDNKVLNIIENECVKTLGLKWEPQKDLFKFSVNCNDESKNINKRVVLSTLAKIFDPLG FT WLAPVTVSGKLFIQKLWINKSEWDQELSIEDKNYWEKYKENLLLLENIRIPRWINSNS FT SSVIQIHGFADASEKAYAAVVYAKVGPHVNIIASKSRVNPIKNRKTIPKLELCAAHLL FT SELIQRLKGSIDNIMEIYAWSDSTITLAWINSGQSKIKFIKRRTDDIRKLKNTEWNHV FT KSEDNPADLASRGVDSNQLINCDFWWKGPKWLADPKELWPRQQSVEEPVLINTVLNDK FT IDDPIYELIERYSSIEKLIRIIAYINRFVQMKTRNKAYSSIISVKEIRIAETVVIKKQ FT QEYQFRQEIKCLKIKKEIKTNNKILSLNPFLDKGGVLRVGGRLQNSNAEFNVKHPIIL FT EKCHLTSLLIKNAHKETLHGGINLMRNYIQRKYWIFGLKNSLKKYLRECVTCARYKQN FT TAQQIMGNLPKYRVTMTFPFLNTGIDYAGPYYVKCSKNRGQKTFKGYVAVFVCMATKA FT IHLEMVSDLTSDAFLAALRRFIARRGKCSNIYSDNGTNFVGAARKLDQELFNAIQENI FT TIAAQLEKDRIDWHFIPPAGPHFGGIWEAGVKSMKYHLKRIIGDTIFTYEEMSTLLCQ FT IEACLNSRPLYTIVSEKDQQEVLTPGHFLIGRPPLEIVEPMEDEKIGNLDRWRLIQKI FT KKDFWVKWKSEYLHTLQQRNKWKKEIPNIEEGQIVLLKDENCHPARWPLGKVEKVHKG FT NDDKVRVAKVKMQEGYITRPITKICPLEGIKSVDKNEADQEPKRRTRATSGMSKIGII FT MAMLLFVLSCQVSSALPKDIAPRYSIDKINKTSAIYLDPLGDVEIVSTSWNLVIYYKM FT DPYFKMLTKGNALIQSMRKVCERLHSFEEQCSLVLDNMQSQLSELEENNKLFMMQSRS FT RSKRAPFEFMGSLYHILFGIMDEDDREQLEENMKNLLDNQNNLDKLIQKQTSVVDSTS FT NLLKRTTEDVNSNFRSMQIRIENMTEVLKENYYVYKESIKFFMITKQLHSLIEEGEKI FT QAGIISLLIDINHGRLNTNILRPNQLKKEIAKIQQSLSENLVIPGKRSGTELKEVYTL FT LTARGLFIDDKLIISAKVPLFSRHPSKLFRLIPVPIRNEDRIIMVHTTSEYLIYNFEI FT DSYHIMTEATLNQCQKWQLNKRICKGSWPWNSANDNACEIQPLKPDKAANCIYKTVVD FT SKSYWVELEKKSSWLFKVPANSKVRLQCTGSQIELFDLPQQGVLSIAPYCTARTDDKI FT LVAHHNIQSESEELLSTPYIGEVSGVPKIIWDPLKLSILNHTEEFERLNNEIKFMKEN FT HQKLKDLHFHHISGHAGLIIALILMIVLIIYFIRKCAVQQRMQAITFAGPLPVLXX XX CC Sequence from P1 sequence DS00941:20448..29535 provided by CC Guochun Liao, Berkeley Drosophila Genome Project. XX SQ Sequence 9092 BP; 3468 A; 1442 C; 1710 G; 2472 T; 0 other; TGTTCACACA TGAACACGAA TATATTTAAA GACTTACAAT TTTGGGCTCC GTTCATATCT 60 TATGTAAATG AATCGAGAGC GATAAATTAT ATTTAGGATT TTGTTATCTA AGGCGACATG 120 GGTGCATTGC TCAAAAACAT GTAATTTAAG TGCACACTAC ATGAGTCAGT CACTTGAGAT 180 CGTTCCCCGC CTCCTAAAAT AGTCCCTTAG TGGGAGACCA CAGATAAGGT CCTCGCCGCT 240 CAAGATAGGC AGATGTGCCC GAGCGTGGGA CCTCGATAAG GCGGGGACTA TTTACGTAGG 300 CCTCTGCGTA GGCCATTTAC TTTAAGATGC GATTCTCATG TCACCTATTT AAACCGAAGA 360 TATTTCCAAA TAAAATCAGT TTTTTTACAA AAACTCAACG AGTAAAGTCT TCTTATTTGG 420 GATTTTACAT TTGGTCAATC GAGCCTTTAA TCGACTCTGC AGTTTCCCCC TACCAAAGGT 480 AAGGAACTCA GAGAAAGGCC AGCTCCTTTA AGCATCTTAC AGCTAAAGGT AGCAAAAATA 540 AGTGACTCTT GTTTCCCCCT ACCAAAGGTA AGGAACAGAG TATAAATATA AAAAGCAAAA 600 GATACAAAAG AATCTTTTAT GTTTTAAAAC AAGCACCTTA TAGTCTATAG CTAAAGGTTG 660 CTTTGTGTAC CATTATAAAT TGTGGTAAGG CGTGCTTGAG GCCATACATC AGCAATTGTG 720 AAATTAAAAA GTGCATAACA AAAGTGCCTT ATAAATGCTC TAATAGCATT AAATCAGCTC 780 ATAAATAGAG TGCAGTGTAT ATGCCATAAG AGCATAAATT AAATAAAAAG TGCCTGAAAA 840 CAGTGCCTTA TAAATGCTCT AATAGCATTA AATCAGCTCA TAAATAGAGT GCAGTGTATA 900 TGCCAAAAGA GCATAAATGC CGAAATAAAT GGCTAAAAAA CAAAAAATCT GACTGGACTA 960 CAAAAATAAT AAAACGTGCC AAAAAAAAAA AAAAAATCAT CTTTAAACAT CGACGGAGCC 1020 TTAAAGAAGA GAAGGAAGTC AAATTCAAAG GAGCCTCTAC CAGCAGCAGA AGCAGCAACA 1080 ACAGCAGCAG CAGAAGCAGC AACAGCAGTA GCAACAGCAG CAACAACAGC AGCAACAGCA 1140 GCAGCAACAA CAACGACATC AGCTAAGTCA AAACAAGAAT TTTCTGTTTA TCCAAACACA 1200 CATATATATA TAAATACATA TAAAATACAT ATACACGTAC TATATATATT AAGAAATTAC 1260 AAAAAATTTT CAAAATGATG TCAGAAAAGA CTATTCAATT CCTTAAGAAG CAGTCCGAAA 1320 TTATTTTGGA AATTAGAAAG TTGGAAGTAA AACCAACATT AACAGATGTA GAAATTCTAA 1380 AATTAAATGA GCTTCAAAAA TGTTTCATTG CTAATCATAG CAATTTGTTA AAGATCGGCG 1440 TTGTCGATCA TGAATATTTT AACGCGAAGC AGTATGATTT AATAATGATG GTGTTAGAAA 1500 AAATTAAAAA TAAAAATGAA AAAATTAAGG GCGAGTCGGT AGAAAACACT TTCCCTAAAT 1560 CAAACACTGT CCCTAAATCA AACCCTCCCC CTACATTAAA CCTTGAAATG CGTGGTCACC 1620 CTGAAAAAGA GGGTATAGCA CAAAACAACG CTTTAAAAGT AGAGCAGGCA TTTCGTAATA 1680 ATGTTGGCCA ATTTCGAGTA TATCTAGAAG ATACGTCTAA ACTAATAGAC AGTAGTCCAG 1740 ATTTCCTTAA AATAAGGAAA AATAAAATTG AATTTTTATG GCATAAAATA GATAACCTGA 1800 TTGAACAGGT GAATAGTCGT TTTGAGAGTT CGCTATTCGA AGAAGAAATT AGCGAACTTG 1860 AATTTGACAA ACAAAATATT CTTACAGCCA TTAATAGTCG ACTCAGTGGC ACAATAAATA 1920 AAGCTGAAAT GTCGACGGTT GTTAAGGCGG AGGAGTTACC AACCCTGCCT AAAATACAGA 1980 TTCCCACCTT CTTTGGTGAT TCCAAAGAAT GGGATCTTTT TAATGAACTC TTTACAGAGC 2040 TCATACATGT GAGAGAGGAT CTCAGTCCTT CTCTCAAATT TAATTATCTA AAGTCAGCAT 2100 TAAAAGGAGA AGCCAGAAAT GTGGTTACTC ATTTACTGCT CGGCTCTGGA GAAAATTATG 2160 AAGCCACTTG GGAGTTTTTG ACCAAGCGAT ATGAGAATAA AAGAAACATA TTCTCAGATC 2220 ATATGAATAG GCTTATGGAT ATGCCAAATT TAAATTTAGA ATCCAATAAG CAAATAAAGA 2280 CATTTATTGA CACGATTAAC GAGTCAATTT ATATTATAAA ATTAAAGGCA CAATTACCAG 2340 AAGATGTGGA TGCAATTTTC GCTCACATAA TTCTTCGGAA ATTCAATAAA GAATCACTCA 2400 ATTTATATGA AAGCCATGTT AAAAAGACAA AAGAAATACA GGCACTTTCT GATGTCATGG 2460 ACTTTTTAGA GCAAAGGCTC AATTCTATAT CATCATTCTC ACAGGAAGTA AAACCTGTAA 2520 AGAAAATGAT TAATAATAAC AAGAATAAAA ATTATAGTGA CAATTGTGCA TATTGCAAAC 2580 TACCAGGGCA TTATTTAATT CAATGCCATA AATTTAAAAT AATGAATCCA GCAGAACGGT 2640 CTGACTGGGT AAGAAAAAAT GGGATTTGCC TAAGATGTCT GAGGCATCCG TTTGGTAAAA 2700 AATGTATAAG CGAGCAGCTT TGTTCGACTT GTCGTAAACC TCACCACACG TTACTTCACT 2760 TTGCAGGTCA TAATCCAGAA AAAGTGAATA CGTGTAGAAC AACAGGTCAA GCCTTGTTGG 2820 CCACGGCCTT GATTCAAGTA AAGTCGAGGT ATGGAGGCTT TGAACAATTA AGAGCATTGA 2880 TTGATAGTGG CTCTCAAAGC ACAATTATTT CAGAAGAGTC TGCACAGATT CTAAAATTGA 2940 AAAAATTTCG GTCTCATACT GAAATAAGTG GAGTATCTTC CACAGGAACG TGCATCTCCA 3000 AGCACAAAGC GGTTATTTCG ATAAGAAATT CTCCGAAAAA TTTAGAAATT GAAGCAATTA 3060 TTCTCCCAAA ACTTATGAAG GCACTTCCAG TCAACACGAT TAATGTTGAT CAGAAAAAAT 3120 GGAAGAACTT TAAATTAGCC GACCCCGATT TTAATAAACC GGGTCGCATT GATTTAATCA 3180 TTGGAGCAGA CGTATATACT CACATTCTGC AAAATGGAGT TATAAAAATA GACGGTCTCC 3240 TTGGGCAAAA AACTGATTTC GGGTGGATAG TTTCTGGATG TAAAAAATCC AAAGGAAAAG 3300 AAACCATTGT AGCCACAACA ATAGAAATAA AAGAGTTAGA TCGCTACTGG GAAGTGGAAG 3360 AAGAAGAAAA AGATGATATC GAGTCTGAAA TCTGTGAAAA TAAATTTATC AAAACGACAA 3420 AAAAAGATTC AGATGGGCGA TACATTGTGT CAATTCCATT CAAGGAGGAT GTCACCTTAG 3480 GAGATTCAAA GAAACAAGCG ATAGCTCGTT ACATGAATCT GGAGAAAAAA CTAAAAAGAA 3540 ATGAAAAACT TAAGGTTGAC TACACTAAAT TCATGAATGA ATACATGGAT TTAGGACACA 3600 TGATTGAAGT GAGTGATGAA GGCAAATATT TTTTACCGCA CCAGGCAGTG ATTAGAGATT 3660 CAAGCCTTAC GACCAAATTG AGAGTAGTTT TTGATGCTTC AGCAAAAACT ACGAATAACA 3720 AAAGTTTGAA CGACATAATG TGGGTTGGGC CACGAGTTCA AAAAGATATT TTTGACATTA 3780 TTATTAAATG GAGAAAATGG GAATTTGTTG TTTCGGCAGA CATTGAAAAG ATGTACCGAC 3840 AAATTAAAAT AGATAATAAT GATCAAAAAT ATCAATATAT TTTATGGAGA AATTCTCCAA 3900 AAGAAAAAAT TAAAACATAT AAATTAACCA CAGTCACTTA CGGAACTGCA TCTGCACCAT 3960 ATTTGGCTAC CAGGGTTCTG GTAGATATTG CAGATAAATG TAAAAACCAA GTTATTAGTG 4020 CAATAATTAG GAATGATTTC TATATGGATG ACCTAATGAC TGGAGCTGAT TCGGTAGAAG 4080 AAGCTAATAA ATTAATAACA TTAATTCCCC ATGAATTGCA GAAAGTTGGA TTCAACTTAA 4140 GGAAATGGAT TTCCAACAAT TCCAAAATAT TAACCACTGT GGAGGACACA GGGGACAATA 4200 AGGTTCTCAA TATTATCGAA AATGAATGTG TTAAAACTTT AGGACTAAAA TGGGAACCTC 4260 AAAAGGATTT ATTTAAGTTC AGCGTAAATT GTAATGATGA ATCAAAAAAT ATAAATAAGC 4320 GCGTTGTGTT ATCAACGCTA GCAAAAATAT TTGATCCGTT AGGATGGTTG GCACCAGTCA 4380 CGGTTTCAGG AAAACTTTTT ATTCAAAAAC TTTGGATAAA TAAAAGTGAA TGGGATCAGG 4440 AATTATCCAT AGAAGATAAA AATTATTGGG AAAAATATAA AGAAAATTTA TTATTGTTAG 4500 AGAATATTCG AATCCCAAGG TGGATTAATT CAAACAGTTC TTCAGTCATT CAGATTCACG 4560 GATTTGCGGA CGCCTCCGAA AAAGCATATG CTGCAGTAGT CTATGCTAAA GTAGGACCTC 4620 ATGTTAATAT AATAGCTAGC AAAAGTAGAG TCAACCCTAT AAAAAATAGG AAGACAATTC 4680 CCAAACTCGA GCTGTGTGCA GCTCACCTGC TTAGTGAATT AATCCAAAGA CTAAAAGGAT 4740 CAATTGACAA TATAATGGAG ATCTATGCTT GGAGTGATTC CACGATTACC TTAGCATGGA 4800 TTAACAGTGG TCAAAGTAAG ATCAAATTTA TAAAAAGAAG AACGGATGAC ATTCGGAAAT 4860 TAAAAAATAC TGAATGGAAT CATGTTAAGT CAGAGGATAA TCCAGCAGAT TTAGCATCCA 4920 GGGGAGTGGA TTCTAACCAG TTGATCAACT GTGATTTTTG GTGGAAAGGT CCGAAATGGC 4980 TAGCAGACCC AAAAGAACTT TGGCCTCGGC AGCAGTCTGT AGAAGAACCT GTCTTAATAA 5040 ATACGGTATT AAATGACAAA ATAGATGATC CTATTTACGA ATTAATAGAA AGGTATTCCA 5100 GTATAGAAAA ACTTATACGT ATAATAGCAT ACATAAATAG ATTCGTGCAG ATGAAAACAA 5160 GAAATAAAGC CTATTCATCA ATTATTTCAG TAAAGGAGAT AAGAATAGCG GAAACAGTTG 5220 TTATTAAGAA ACAACAAGAA TACCAGTTTA GGCAAGAGAT AAAGTGCCTT AAAATCAAAA 5280 AGGAAATCAA GACAAATAAT AAAATATTGT CATTGAATCC ATTTTTGGAC AAGGGTGGGG 5340 TTCTAAGAGT TGGAGGAAGA TTGCAAAATT CCAATGCAGA ATTTAATGTT AAACATCCAA 5400 TCATTTTAGA AAAATGCCAC CTAACAAGCT TATTAATAAA AAATGCTCAT AAGGAAACAT 5460 TGCATGGAGG GATAAACCTA ATGCGAAACT ATATCCAAAG AAAGTATTGG ATTTTCGGGT 5520 TGAAAAATTC GTTGAAAAAG TATTTAAGAG AATGTGTAAC GTGTGCAAGG TATAAACAAA 5580 ATACAGCTCA GCAAATAATG GGTAACTTGC CAAAATATAG AGTGACGATG ACATTCCCGT 5640 TTCTTAATAC TGGAATAGAT TACGCAGGTC CTTATTATGT TAAATGTTCA AAAAATCGTG 5700 GCCAAAAAAC ATTTAAAGGA TACGTTGCTG TATTCGTTTG CATGGCCACC AAAGCCATAC 5760 ACTTAGAAAT GGTAAGCGAT CTAACTTCAG ACGCATTTTT AGCAGCACTC AGAAGATTTA 5820 TTGCTAGACG GGGAAAATGT TCCAATATCT ATTCAGACAA CGGAACAAAT TTTGTAGGAG 5880 CTGCAAGAAA ATTAGATCAA GAGTTATTTA ATGCAATACA AGAAAATATA ACGATTGCAG 5940 CGCAACTTGA AAAGGACAGG ATTGATTGGC ATTTTATTCC CCCGGCAGGA CCTCACTTCG 6000 GAGGTATTTG GGAAGCTGGA GTTAAGTCAA TGAAATACCA TTTAAAGCGT ATAATCGGCG 6060 ACACTATTTT TACTTATGAA GAAATGTCAA CTCTTTTATG TCAAATAGAA GCATGCTTAA 6120 ATTCAAGGCC ATTATACACT ATAGTTAGTG AGAAGGACCA ACAAGAAGTT TTAACACCAG 6180 GTCATTTTTT AATTGGAAGA CCACCTTTAG AAATAGTCGA ACCAATGGAA GATGAAAAAA 6240 TCGGAAATTT GGATAGGTGG AGACTTATCC AAAAAATAAA GAAAGATTTC TGGGTTAAGT 6300 GGAAAAGTGA ATATTTGCAT ACGCTCCAGC AAAGGAATAA ATGGAAAAAG GAAATTCCTA 6360 ATATAGAAGA AGGGCAAATA GTTTTATTAA AGGATGAGAA TTGTCATCCT GCAAGATGGC 6420 CTTTAGGAAA GGTGGAAAAG GTGCATAAGG GGAATGATGA TAAGGTCCGA GTGGCTAAAG 6480 TAAAGATGCA GGAAGGATAT ATCACTAGAC CCATTACTAA AATTTGTCCC TTGGAAGGAA 6540 TAAAGTCTGT TGACAAAAAT GAGGCTGACC AAGAGCCAAA AAGACGAACT AGAGCGACAT 6600 CGGGAATGTC CAAGATCGGA ATCATTATGG CAATGTTGTT GTTTGTGTTA AGTTGTCAAG 6660 TTTCTAGCGC ATTACCTAAA GATATAGCAC CAAGATATTC TATAGACAAA ATAAATAAAA 6720 CCTCAGCAAT ATATCTAGAC CCGCTAGGAG ATGTTGAGAT TGTGAGTACT TCTTGGAATT 6780 TGGTTATCTA TTATAAAATG GATCCATATT TTAAAATGTT AACAAAGGGT AATGCGCTTA 6840 TACAAAGTAT GAGGAAAGTT TGCGAAAGAC TTCATAGCTT TGAAGAGCAA TGTAGTCTAG 6900 TCTTAGATAA TATGCAAAGT CAGTTATCGG AACTTGAAGA AAACAATAAA TTGTTTATGA 6960 TGCAGTCTAG ATCTAGAAGC AAGCGTGCTC CTTTCGAATT TATGGGTTCC TTGTATCATA 7020 TTTTATTTGG TATAATGGAT GAAGATGATA GAGAGCAATT AGAAGAAAAT ATGAAGAATT 7080 TGTTAGATAA CCAGAACAAC CTTGATAAAC TAATTCAAAA ACAAACATCT GTGGTTGATT 7140 CAACTTCTAA TCTATTAAAG AGAACAACAG AAGATGTTAA CTCCAATTTT AGAAGTATGC 7200 AAATAAGAAT TGAGAACATG ACAGAAGTTC TTAAAGAAAA TTATTATGTT TATAAGGAAT 7260 CAATAAAATT CTTTATGATT ACGAAACAGC TACACTCATT GATTGAAGAA GGCGAAAAAA 7320 TTCAAGCAGG CATTATAAGC CTGTTGATTG ATATTAATCA CGGTAGGCTA AATACAAATA 7380 TTCTCAGGCC AAATCAGCTT AAAAAAGAAA TTGCCAAAAT TCAGCAGAGT CTTTCAGAGA 7440 ACCTAGTAAT TCCAGGAAAA CGGTCAGGTA CGGAACTTAA GGAGGTGTAT ACACTGTTAA 7500 CAGCCAGGGG TTTATTCATC GACGATAAAT TGATCATTAG TGCAAAAGTG CCTCTGTTTA 7560 GCAGGCATCC ATCCAAATTG TTCAGGCTTA TTCCGGTGCC AATTCGAAAT GAAGATCGGA 7620 TAATAATGGT GCATACAACG TCCGAATATT TAATTTATAA TTTTGAGATA GATTCCTATC 7680 ACATAATGAC GGAAGCCACA TTAAATCAAT GTCAGAAATG GCAACTAAAT AAGAGAATAT 7740 GCAAAGGAAG TTGGCCCTGG AATTCAGCGA ATGATAATGC ATGTGAGATT CAGCCTCTAA 7800 AGCCAGATAA AGCGGCGAAC TGCATCTATA AAACAGTAGT CGACTCTAAA AGTTACTGGG 7860 TAGAGTTAGA AAAGAAAAGT AGTTGGTTGT TTAAGGTTCC TGCGAATTCA AAAGTCCGTC 7920 TGCAATGTAC TGGCTCTCAA ATTGAATTGT TTGATTTGCC TCAGCAAGGA GTTTTAAGCA 7980 TTGCGCCATA TTGTACGGCA AGAACCGACG ATAAAATTCT AGTTGCCCAC CATAACATTC 8040 AGTCCGAAAG TGAAGAATTA TTATCAACAC CTTATATAGG AGAAGTTAGT GGAGTGCCGA 8100 AGATTATTTG GGATCCGCTG AAACTATCAA TATTAAATCA TACTGAGGAA TTTGAACGAT 8160 TGAATAATGA AATTAAATTT ATGAAAGAGA ACCATCAAAA ATTGAAAGAT TTACATTTCC 8220 ATCATATTTC CGGACATGCT GGATTAATTA TTGCTTTAAT ACTAATGATA GTATTAATAA 8280 TATATTTCAT ACGGAAATGT GCTGTGCAAC AAAGAATGCA AGCAATAACC TTTGCAGGTC 8340 CGTTGCCAGT ACTATAAATA TCAATAGTAA ATAAACAATA AAATAATATA ACAAATAAAA 8400 ATATACAGTC CACTAATAGA AAATGTACTT CTACATAGAA AAAGCAAAAT GTTTAAAATA 8460 AGTTAATTAA GTACAAATTG TTGAATTAAA AATAATATAA ACCATAATTG TAATCCAATA 8520 AAATTAAAAG CCAGAAAAAC TAGGCCCATT GAAATCTTAG TTGCAAAATA AATGAACATA 8580 TATCAAATAA ATACAGTCCA CTACTGTTAT AAATGCAACT AATATACTAA TGTACATCTC 8640 AGCTTTGCTG GCCCTTTGGC AGAATGTTCA CACATGAACA CGAATATATT TAAAGACTTA 8700 CAATTTTGGG CTCCGTTCAT ATCTTATGTA AATGAATCGA GAGCGATAAA TTATATTTAG 8760 GATTTTGTTA TCTAAGGCGA CATGGGTGCA TTGCTCAAAA ACATGTAATT TAAGTGCACA 8820 CTACATGAGT CAGTCACTTG AGATCGTTCC CCGCCTCCTA AAATAGTCCC TTAGTGGGAG 8880 ACCACAGATA AGGTCCTCGC CGCTCAAGAT AGGCAGATGT GCCCGAGCGT GGGACCTCGA 8940 TAAGGCGGGG ACTATTTACG TAGGCCTCTG CGTAGGCCAT TTACTTTAAG ATGCGATTCT 9000 CATGTCACCT ATTTAAACCG AAGATATTTC CAAATAAAAT CAGTTTCTTA CAAAAACTCA 9060 ACGAGTAAAG TCTTCTCATT TGGGATTTTA CA 9092 // ID BLOOD standard; DNA; INV; 7410 BP. XX AC AY180916; XX DR FLYBASE; FBgn0000199; blood. XX FT source AY180916:1..7410 FT SO_feature five_prime_LTR ; SO:0000425:1..398 FT SO_feature three_prime_LTR ; SO:0000426:7011..7410 FT SO_feature CDS ; SO:0000316:966..1271 FT /protein_id="AAN87266.1" FT /db_xref="FLYBASE:FBgn0045863; blood\sORF" FT /translation="MSTKQTFEHPAPVEQRDLPSIKEVIEVDPSAGPKPLTIQEYKAR FT TAAREQPPKKKRGGRRIKLLSARRLNIELLKTATNEEDRQRYKERLAAINQQLRGAK" FT SO_feature CDS ; SO:0000316:1863..3116 FT /protein_id="AAN87267.1" FT /db_xref="FLYBASE:FBgn0045865; blood\ORF1" FT /translation="MEWLNLTISINNIRDAFDKSYKCINKTALIKTQTLIFHIKVLIT FT QYNTLQNLIVTNKSKLTEEHKVQCFKVLSSFGKRLHNTSVRHSIIIEVPTELTKIAEF FT DESQLRDLDESQPLEDLDIESDIESIEELKFNTVQPNTRNMANALEAQRAYVKQVSAT FT VPDFDGKKLHLNRFVTALKLTDLTKGDQETLAVEVIKTKIIGPLNYKVEHATTIQAII FT TILQANVKGESPDVIKAKLINAQQRGKTASQYVTEIDSMRKQLEAAYIDGGLDADNAD FT KFATKESISAMTKNCANEALKMILTAGTFSTFNDAMEKYLHCSTEITGNSNTVLFYNG FT NNRRGNYNAYYRGRGRNNYNHNYNQNYNQGYNNNNRGRGGYRGHGNNRDGGNRRGNQS FT QNNNNNRNVRNVQSENSQTPLSDQQ" FT SO_feature CDS ; SO:0000316:3749..6733 FT /protein_id="AAN87268.1" FT /db_xref="FLYBASE:FBgn0045864; blood\ORF2" FT /translation="MKDYDIFTTPVEKENRTEEILKQLRFPKQFNNELTKLCTEFSDI FT FGLETEPISANNFYKQKLRLGEKTPVYIKNYRMADSQKPEIARQVKKLIDDGIVEPSM FT SEYNSPLLLVPKKPLPNSTEKRWRLAVDYRQINKKLLSDKFPLPRIEDILDQLGRAKY FT FSCLDLMSGFHQIELEKRYRDITSFSTANGSYRFTRLPYGLKVAPNSFQRRMTLAFSG FT LEPSQAFLYMDDLVVIGCSEKHMLKNLTNVFELCRRHNLKLHPGKCSFFMKEVTYLGH FT KCTDKGILPDDTKYEVIEKYPIPTDADSARRFVAFCNYYRRFIKNFSDHSRHLTRLCK FT KNVQFEWTAECNDAFEYLKTELMKPTLLQYPDFGKEFCITTDASKQACGAVLTQDHNG FT QQLPVAYASRMFTQGESNKSTTEQELTAIHWAINHFRPYIYGKHFMVKSDHRPLSYLF FT SMKNPSSKLTRMRLDLEEYDFTVEYLKGKDNHIADALSRITIKDLKTINREILKVTTR FT SKAKQENSCKDEAIVKIQEEKEQTIEKPKVYEVVNNNDTKKYVLIKIDKHKCLLKRGK FT TIVSRFDVDDLYSNETFDLNQFFQRLISKAGMHKITKMRISPSEQMFQFVSLNEFKIK FT GNRVLEKVELAILQKVIIIDKNDEAQIKEILTKFHDDPIEGGHTGISRTQSKIKRFYY FT WPQMTKTISKYVKTCLKCQQAKITTHTKTPLTLMPTPATAFDTVLIDTIGPLPKSEDG FT NEYAVTIICDLTKFLVTIPTPNKSAKTVAKAIFELFVLKYGPMKTFITDQGTEYKNSL FT MNELCKYMHIENLTSSAHHHQTLGTIERSHRTFNEYIRSYISVNKSDWDIWLPYFTYC FT FNTTPSIVHDYCPYELVFGRLPRQFKDFSKINKIDPIYNLDDYSKELKCRLELSYNRA FT RRMLEKAKADRKLRYDRNTNNFELKIGDKVLLRKETGHKLDKRYEGPYDVVDIGINDN FT ITIKTGSKKQQIVHKDRLKKHK" XX CC Sequence from P1 (complement)DS03023:69372..76782 provided by Guochon Liao, CC Berkeley Drosophila Genome Project. XX SQ Sequence 7410 BP; 2840 A; 1403 C; 1328 G; 1839 T; 0 other; TGTAGTATGT GCATATATCG AGGGTACACT GTACCTATAA GTACACAGCA ACACTTAGTT 60 GCATTGCATA AATAAATGTC TCAAGTGAGC GTGATATAAG ATCACCCATT TATGCTTTAA 120 GCTAAGTCAG CATCCCCACG CTGGCCGCTG GCCATATATG CGCATAAGCT CTCTCTCTCT 180 CTCTCTTATA CATATATATA TACGCTGCTC TTCTGCCGCT GTCGACGGCG GCGCAGTCGC 240 AGTATTTAGG TAAGATTAGA CACTCTGTAG AGGTTAAGCG GGCAGAACCG TTTCTGCTAC 300 TCGAAGAGAT AAGAAGAAAT AAAAAGGTGC CTGACGGCTG CACCCAACTG CAAGGAAAAC 360 ACGTGTTCTC AATTGGTGGC ATATATTGGT TTATTACATG GCGACCGTGA GGCAGGAGCC 420 TGCGATCTGA GGACTACTGA GGAAATGCTG CTAATATTGC CGATTTGATT TGGGAATTCT 480 AAACAGCGAC AACAGGTGTG AGAAGCAGGC CGCCCCTTAC ACCAGTGCGG GAGACCTAGA 540 GACGGGACAC TGATGAAAAA AAAAGAAACA AAAATACTGA GTGAGTAGAG TGTGGTAATG 600 GGCAAACGCG GATGTCAGGA AATCAAAAAT AAAGGTATAG CACATATTAA GTGGCTATGA 660 TATACAAATA AAACACCGCC CCCATGGGCA ACGGCACAGA AATTAACTGC CGAATTAGAC 720 TTTCTGAAAG AAAACCTCCA GCAAAGAAAG CCGAATACCA CAACTCACTC AGCAAAAATA 780 GAAATAATCA ATGAAGAAAT AACTGAAAAT TCAACATCAC CCAAGCCGAA AAGACCCGAC 840 GTCTGCATGA AAGACTGCCC TCGACCATTG TAAGCCGCAA CAGCAATTAG CACGGCATCC 900 TGCGAGGGTA GGATTAGGAT AAAGGATAAA GGATTCCACC GGCGCGCCGC ACATGACAAC 960 AGCGAATGTC TACCAAGCAG ACGTTCGAAC ACCCTGCTCC TGTCGAGCAA AGGGATCTGC 1020 CAAGTATCAA AGAGGTAATA GAGGTAGATC CGTCCGCGGG ACCAAAGCCC TTGACCATAC 1080 AAGAGTACAA GGCACGGACT GCAGCGAGGG AGCAGCCACC TAAAAAGAAG AGGGGTGGCC 1140 GCCGGATTAA GTTGCTCAGC GCCCGGAGGC TCAACATCGA ACTACTGAAG ACGGCAACTA 1200 ATGAGGAAGA CCGGCAGCGC TACAAAGAGC GCCTTGCAGC CATCAATCAA CAACTTCGTG 1260 GTGCGAAGTA AAGCGGCGGG CTGCGTTATA CGCCATAGCC TCAACCGCCC AAATATTATA 1320 TTAATGTTGT CGATGCGGTT TCCGCTGCAA CAAAATTACT AACTTATCAG GGACCCATTT 1380 CATAACTAAC ACATTATACT CAGTCCTAAA CTTAAAATAA GTAATAATAT TGTAAAATTG 1440 CAAATTGCAA CCGATGTAAA CTGAGTATAA TGAATTCATC TATCAAGTAA AAATATGTTT 1500 AACAACAGTT TAGACCTATT AAAATTTCGA GCTATATTTA TATCTGATCG AGATAACAAT 1560 AATTGACCAA TTCTCAAAGT TAAAATTCTA TTTGTACTTT TGATATACAA ATAAAGACTA 1620 ATTTTCCCCA TATCAAAATG GGACATAAGT CGTGGATACA ACCCCACAGT TAAATTCAAT 1680 GTACTTACTA TTTTTGATTT TAGTTATCCT ATCAGCCTTT TTACCTTGGC CTTAAAACTT 1740 TATCAGTTTC ACACAAGATC GTTGAAAAGA CTTACATGAG TCGAGCCAAT GATTTAGACA 1800 AAATCTAATA GAAACTACAC CAAAAAGGTA CAAGGTCGAT TACATCGCTA AAAGGTACAT 1860 ACATGGAATG GCTAAACTTA ACCATATCCA TAAACAATAT TAGAGATGCT TTTGATAAAT 1920 CCTATAAATG TATTAATAAA ACCGCGCTGA TCAAAACTCA GACGCTTATT TTTCACATAA 1980 AGGTATTGAT AACACAATAC AACACATTAC AAAACCTAAT AGTAACAAAC AAAAGCAAAC 2040 TCACTGAAGA ACATAAAGTC CAATGCTTCA AAGTTCTCAG TTCATTTGGT AAAAGACTAC 2100 ATAATACCAG CGTTAGACAC AGTATTATAA TAGAAGTCCC AACAGAACTA ACCAAAATAG 2160 CAGAATTCGA CGAAAGCCAG TTAAGAGACT TGGACGAGTC GCAGCCGTTA GAAGATTTAG 2220 ATATCGAAAG CGATATCGAA TCAATAGAAG AATTAAAATT TAATACCGTA CAACCAAATA 2280 CAAGAAACAT GGCCAACGCA TTAGAAGCTC AGAGAGCATA CGTTAAACAG GTATCTGCCA 2340 CAGTACCTGA TTTCGATGGT AAGAAACTCC ATTTAAACAG GTTTGTGACA GCACTTAAGT 2400 TGACGGATCT AACTAAAGGA GATCAAGAAA CTTTAGCAGT AGAGGTCATA AAGACCAAAA 2460 TTATTGGCCC ATTAAACTAT AAAGTAGAAC ATGCGACAAC GATACAGGCA ATAATTACCA 2520 TATTGCAGGC AAACGTAAAA GGCGAATCGC CTGACGTTAT AAAGGCCAAA TTAATAAATG 2580 CCCAACAAAG AGGCAAGACC GCGTCTCAGT ATGTTACAGA AATAGACAGT ATGCGTAAGC 2640 AGCTCGAGGC AGCTTACATA GACGGCGGAT TAGACGCCGA TAATGCTGAC AAATTCGCGA 2700 CTAAAGAGTC GATATCAGCA ATGACCAAAA ACTGTGCCAA CGAGGCACTT AAAATGATCT 2760 TAACTGCAGG TACATTTAGT ACATTCAACG ACGCAATGGA AAAATACCTA CATTGCAGTA 2820 CAGAAATAAC CGGCAATTCA AATACAGTCT TATTCTATAA TGGGAATAAT AGACGTGGTA 2880 ATTATAATGC CTACTATCGT GGTAGAGGCA GAAATAATTA TAACCATAAT TATAACCAGA 2940 ATTATAACCA AGGTTATAAT AATAACAACA GAGGTCGCGG AGGCTACCGC GGCCACGGTA 3000 ATAACAGAGA CGGAGGTAAC CGAAGGGGTA ACCAAAGTCA GAATAATAAT AACAACCGAA 3060 ATGTGCGTAA CGTACAATCG GAAAACAGCC AGACCCCCTT AAGCGATCAA CAGTAAAAGT 3120 GTTTAAAGTA AACCTAAATC TGAGTATTTT CATTAAGACA AAAAACCATG AAACAAACAC 3180 AGTTCTTACA TTACTAATAG ACACAGGTGC AGAAATTTCA TTGCTAAAAG CCAAAGCAAA 3240 GGAATATAAT AATATAAATT TCAGTAATAT ATCAAATATT ACAGGTATTG GGCAAGGAAC 3300 CATACAGTCT ATAGGTACAG TAGATCTTGA CATACGCATT CAGGATGTTC TAGTGCCACA 3360 TGAATTTCAT GTAGTACCTG AGAATTTTCC GATACCATGC GATGGCATAA TCGGAATAGA 3420 TTTTATCAAG AAATACAATT GCGTATTAGA GTTTCAAAAT AACAAAGACT GGTTCACAAT 3480 AAGACCCAAT AACTTCAGTA GACAGATTAG TGTACCAATT ACACATAACT TAGACTCCAA 3540 CACACTCTTA TTGCCAGCTA GATGCGAAGT AATCAGACAA GTCAAATTAC TCACTAACGA 3600 AAAAACGGTG GTAGTACCAA ATCAGGAGCT GCAACCAGGT ATAATAGTAG CAAGCACCAT 3660 TGCCGATAGC AAAAACGCAT TGATTCGCAT TATAAATACA AATAATAAAG ACGCCATAAT 3720 AGATAGCGCG AAGATCAAAT GCGAATCAAT GAAAGACTAT GACATTTTTA CAACACCAGT 3780 AGAAAAGGAA AATAGAACTG AAGAAATTTT AAAACAATTA AGATTCCCTA AACAATTCAA 3840 TAATGAACTA ACTAAGTTAT GCACCGAGTT TAGCGATATT TTTGGTCTAG AAACAGAACC 3900 AATATCGGCT AACAATTTCT ACAAACAAAA ACTCAGATTA GGGGAAAAAA CACCGGTCTA 3960 TATAAAAAAC TATCGCATGG CAGATAGCCA AAAACCAGAA ATCGCCAGAC AGGTAAAAAA 4020 ATTAATAGAT GATGGAATAG TTGAACCATC AATGTCTGAA TATAATAGTC CATTACTTTT 4080 GGTTCCAAAG AAACCACTTC CGAATTCCAC GGAAAAAAGA TGGCGATTAG CAGTTGACTA 4140 TCGTCAAATA AATAAGAAAC TATTATCAGA CAAATTTCCA CTTCCAAGAA TAGAAGATAT 4200 TCTTGATCAA TTAGGAAGAG CAAAGTATTT TTCATGTCTC GACCTAATGT CTGGATTCCA 4260 CCAGATAGAA CTAGAAAAAA GGTATAGAGA TATAACGTCA TTTTCAACAG CCAATGGCTC 4320 ATATCGCTTC ACGCGATTAC CATACGGACT GAAAGTAGCA CCAAACTCCT TCCAACGTAG 4380 GATGACACTT GCATTTTCTG GTCTTGAACC ATCGCAAGCA TTTCTATATA TGGATGACTT 4440 AGTAGTAATA GGTTGTTCAG AAAAACATAT GCTCAAAAAT TTGACTAACG TATTCGAGCT 4500 ATGTAGACGA CATAATTTGA AACTACATCC AGGGAAATGT TCTTTCTTTA TGAAAGAAGT 4560 AACATATTTG GGTCACAAAT GTACCGATAA AGGTATACTC CCAGATGACA CCAAATATGA 4620 AGTTATAGAA AAATATCCTA TACCAACAGA TGCCGACAGT GCTAGGCGTT TCGTAGCCTT 4680 CTGTAATTAT TACAGACGTT TCATTAAAAA TTTTTCTGAT CATTCACGCC ACTTAACGAG 4740 GCTTTGTAAA AAGAATGTTC AATTCGAATG GACAGCAGAA TGCAATGATG CATTCGAATA 4800 CCTTAAAACA GAATTAATGA AACCAACATT ACTACAGTAC CCAGATTTCG GTAAAGAATT 4860 TTGCATAACA ACCGATGCTA GTAAACAGGC ATGCGGAGCG GTACTTACAC AAGATCACAA 4920 TGGTCAACAA CTTCCAGTGG CATACGCTTC AAGAATGTTC ACTCAAGGTG AAAGTAATAA 4980 GTCCACTACA GAACAAGAAT TAACGGCCAT TCATTGGGCC ATAAATCATT TTCGACCATA 5040 CATATATGGC AAGCATTTCA TGGTAAAAAG CGATCATAGA CCATTGTCAT ACCTATTCTC 5100 TATGAAAAAT CCAAGTTCAA AACTCACTCG TATGAGGCTG GATTTAGAAG AGTATGACTT 5160 TACTGTAGAA TATCTTAAGG GGAAAGATAA CCATATTGCG GACGCCTTGT CTCGCATAAC 5220 AATAAAAGAT CTGAAAACAA TCAACAGAGA AATATTAAAA GTTACCACCA GATCAAAAGC 5280 TAAACAGGAA AATTCCTGTA AGGACGAAGC AATAGTCAAA ATACAAGAGG AAAAAGAGCA 5340 AACAATAGAA AAGCCCAAAG TCTATGAAGT TGTCAATAAT AATGACACAA AGAAATATGT 5400 TTTAATCAAA ATAGATAAAC ACAAGTGTTT ATTAAAACGA GGAAAAACAA TTGTTTCACG 5460 CTTTGATGTT GATGACTTGT ATTCTAATGA AACATTTGAT CTAAATCAAT TCTTTCAAAG 5520 GCTTATTTCA AAAGCCGGAA TGCATAAAAT AACAAAAATG CGAATATCAC CAAGCGAACA 5580 GATGTTCCAA TTTGTATCAC TAAATGAATT TAAAATAAAG GGCAACCGAG TACTCGAAAA 5640 AGTAGAACTA GCTATTCTAC AAAAGGTGAT AATTATAGAC AAAAATGACG AAGCTCAGAT 5700 TAAAGAAATT TTGACAAAAT TCCATGATGA TCCTATAGAA GGAGGCCACA CTGGTATTTC 5760 GCGAACCCAG TCAAAAATCA AAAGATTTTA TTATTGGCCC CAGATGACCA AGACAATCTC 5820 AAAGTATGTA AAGACTTGTT TGAAATGTCA ACAAGCCAAA ATTACAACAC ATACGAAAAC 5880 TCCATTAACA TTGATGCCAA CGCCAGCAAC AGCATTTGAT ACTGTTTTAA TTGATACCAT 5940 TGGTCCACTA CCGAAATCGG AAGACGGAAA TGAGTATGCA GTTACAATCA TATGCGATCT 6000 AACCAAGTTT TTAGTAACTA TTCCAACACC AAATAAAAGT GCTAAAACAG TTGCAAAGGC 6060 TATATTTGAA TTATTTGTAC TGAAGTACGG TCCAATGAAG ACGTTCATTA CAGATCAAGG 6120 TACGGAATAC AAAAATTCAC TTATGAATGA ATTATGCAAA TATATGCATA TAGAAAATCT 6180 AACATCTAGC GCTCACCATC ATCAAACTTT AGGAACAATA GAAAGAAGCC ACCGAACTTT 6240 TAATGAATAT ATACGTTCAT ACATATCGGT TAACAAAAGT GATTGGGACA TTTGGTTACC 6300 ATATTTCACT TATTGCTTCA ATACAACACC CTCAATAGTC CATGACTATT GCCCATACGA 6360 ACTAGTATTT GGCAGACTAC CCAGACAATT CAAAGATTTC AGTAAGATAA ACAAAATAGA 6420 CCCAATATAC AACTTAGACG ACTACTCTAA AGAGCTTAAA TGCAGACTAG AATTGTCGTA 6480 CAACAGAGCA AGAAGAATGT TAGAAAAAGC AAAAGCGGAT AGAAAATTAA GATATGATAG 6540 GAATACAAAT AATTTCGAAT TAAAAATAGG AGATAAAGTA TTACTTAGAA AAGAAACAGG 6600 TCATAAGTTA GATAAAAGAT ATGAAGGTCC TTATGACGTA GTAGATATAG GAATAAATGA 6660 CAATATAACC ATTAAAACAG GAAGTAAGAA ACAACAAATA GTACATAAAG ATAGGCTAAA 6720 AAAGCACAAA TAGAATGAAA AAAAAAAAGG GCAATCAATG CCAAACCTTT CATAATAAAA 6780 CTTAAATAAC GGCCTGATCA GCCAAAACAA TATAACAAAG ACATAGACAT AATCGAATTT 6840 TTATTAATTC AAAATACATA CATATTTTTT CTTTATTCAT TTAAAAATTC TATATCATAA 6900 ATAATGTTAA TTCATTAAAA ATAATATTTA AGTAATTTTT ATTTTATAAT GGTAATATAG 6960 TTGATAGAAA ATAACTTCAT TTCTTTACGT TATTTTAAAA AAGAGGGGAG GTGTAGTATG 7020 TGCATATATC GAGGGTACAC TGTACCTATA AGTACACAGC AACACTTAGT TGCATTGCAT 7080 AAATAAATGT CTCAAGTGAG CGTGATATAA GATCACCCAT TTATGCTTTA AGCTAAGTCA 7140 GCATCCCCAC GCTGGCCGCT GGCCATATAT GCGCATAAGC TCTCTCTCTC TCTCTCTTAT 7200 ACATATATAT ATACGCTGCT CTTCTGCCGC TGTCGACGGC GGCGCAGTCG CAGTATTTAG 7260 GTAAGATTAG ACACTCTGTA GAGGTTAAGC GGGCAGAACC GTTTCTGCTA CTCGAAGAGA 7320 TAAGAAGAAA TAAAAAGGTG GCCTGACGGC TGCACCCAAC TGCAAGGAAA ACACGTGTTC 7380 TCAATTGGTG GCATATATTG GTTTATTACA 7410 // ID DMZAM standard; DNA; INV; 8435 BP. XX AC AJ000387; XX DR FLYBASE; FBgn0023131; ZAM. XX FT source AJ000387:1..8435 FT SO_feature five_prime_LTR ; SO:0000425:1..473 FT SO_feature three_prime_LTR ; SO:0000426:7963..8435 FT SO_feature CDS ; SO:0000316:join(494..531,6387..8004) FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0024272; ZAM\env" FT /db_xref="SPTREMBL:O46113" FT /protein_id="CAA04048.1" FT /translation="MENTLLNLLLVLLSCHGAYQSIFIHNFNSTNLLAKVPVGKTLVIG FT NYKKISHIIDLSEYTNCIEKLYHTIDTLRQDETLTDSISILNAKLAQTQSKIDALTPFS FT RHKRGLINGLGSLVKVVTGNMDANDAKNIETEINHLKSQSTTISDNFEIQNSFNDEVQL FT RFKNLTRHINNEQNLIKNFFENTQNTIYTKIYNNEEEIKKLQYINRLNYNIDLLVSHLS FT DIIESTLLAKINVIPKLILDKTEITKIKQIFKTQNYTIKSEQHIYNLLKMNALNYQNKI FT IFSIKIPIFLSCNYEMARLIPLPINSTQFVIAPKYLIYNNKSNSMFSTMYKCPVIEEQF FT VCEIDSINNLKNNTCLGHLIQNKTSYCDIKETGLTTDVFEPEKGFILVFNGNNLPIISS FT NQTITSINGSAIIKYNNCTLKINEINYDNRAVSTEEHPDFFLPPMRKLIKNATINILTL FT ERLHLDTLTTSNKLLVVAAGNSRHSTTLYILFTVSLVAVILTWTLRRDTHIFHTGPDHI FT LPIVAPPIPPSMAFAPNWGGRSYRPIGTIHHPSL" FT SO_feature CDS ; SO:0000316:1789..2820 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0024271; ZAM\gag" FT /db_xref="SPTREMBL:O46114" FT /protein_id="CAA04049.1" FT /translation="MSKKLTQTIKQTTRSVLESHTFPKRVTRSVSKTNTLPVIRESTPL FT PPLQPINMDSGNASVGNSAPVTPTVSGFSSIATALSATDILAFVKELPTFDGTPGQLDK FT YITSVEEIIMLIRGTDQTPYGLLTLRAIRNKIVGRADEALNLANTKLIWDDIKSNLLRL FT YSSKKSEATLLGELQSLPDNLTLGQLFFGLSRIRSQLISITSNSGQSATIIEAKKTLYD FT EVCLNAFISRIREPLKTVIRLKDPKTIETAYELCQGERARYQNRNPYPPTQNNTERRTN FT NYNNNNNNNHRDNNNRNNVTRLTPKTTQTITQTPIPNIVNQTTATELVTRLKIIKQIMG FT YTT" FT SO_feature CDS ; SO:0000316:<2795..6448 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0024270; ZAM\pol" FT /db_xref="SPTREMBL:O46115" FT /protein_id="CAA04050.1" FT /translation="NKLWATQHRRRKTHPTLPYQLKFSGTRLRNPTGYINPTTHATSLP FT YITLNLQQKFPLSFLIDTGSNNSFIDPESANQLECTILPTSTSITTALNSFKIEEKAIF FT PMPPEFKTEGQITLLKFKFHSYFNGLIGMDLLSHLEAKVDLVNLQLVTSKSTLPIFLYT FT NQASKIFNIPAYSKVILPLPVKTNHGEFYCCTTQLNNELSLSEGLYKSNNNIANVEISN FT QSDSDKLLYLEYPLETIPYNKNDHIELFNISATPLNNDTPQAPLHILTEHLNPEEKTAL FT TTLCKQFRDIFYNPETPLTFTNKITHSIPTIDNTPIHTKSYRYPFVHKTEVKKQIESML FT DQQIIRSSHSPWSAPVWVVPKKLDGTGNRKWRLVIDYRKLNDKTISDRYPIPNINDILD FT SIGKAKYFSTLDLTSGFHQIEMNPKDIAKTAFTVEGGHYEFTRMPFGLKNAPATFQRVM FT DSVLGDLNGTICLFYLDDIIIFSPSLQKHLLDIKMVFEKLRAANFKLQPSKSEFLRKEI FT EFLGHIVTQDGVKPNPNKISAIKKFPCPTNRRAIKSFLGLLGYYRKFIRDFARITKPMT FT KQLKGKRQVTTDKDFVDAFEQCKTLLSNDPILIHPDFEKPFILTTDASNFALGAVLSQG FT SLQNDRPVCFASRTLSDTEVNYSTIEKEMLAIIWAVKYFRPYIYGVKFTIVTDHKPLIW FT LMNFKEPNSKIIRWRLQLMEYNFEIIHKKGSQNVIADALSRADPNLNYNETLTVKPCPT FT SEKPINEFNTQLILEIDTNTSYQTTTPFKQKIRKKYSQPCFDFDNIVKILKGTLKPNRI FT CAFLADDNNSALIEKAFSTYFAHKKHFKIIRCKSLLHEIVGNPEQNKFIQEYHTNSNHR FT GIDETFLHLKRETYFPNMKNKISELIRNCETCLKLKYDRQPQNIVFETPETPSKPLDII FT HIDIYTINNNFNLTIIDKFSKFAAVYPIPNRNGINCIKAIKNFFSQFGLPKKLIHDQGV FT EFCNDIFRKFCSQYNILLHVTSFQQSSSNSPVERLHSSLTEIYRIILDTRKKHKLPTDH FT EEIMSETVITYNNAIHSTTKHTPFELFNGRTHLFEKTIIPNNEHDYLNKLNTFQDKLYS FT EIKEKLSTNTQQRIEKLNTSRVEPTTVQPNSTIFRKENRRNKLTPRFSLHRTAKDKGKT FT LVTTRNQKIHKSKIRKISKPPNDLSLSTCIPDLAMGHTNLSSSTTSIAPTS" XX CC Derived from AJ000387 (e1237231) ((Rel. 54, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 19-Jan-1998. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 8435 BP; 3286 A; 2055 C; 1151 G; 1943 T; 0 other; AGTTACCGAC CCATCGGTAC CATACACCAC CCCTCCCTCT AAGCCACCAC GCCTACACAA 60 GTAGAAGACA TCGAACCGGG AAGCTTTGCG ATACAAAGTT GCAGCATAAA CATCAACAAC 120 GGGTCAGACG CCGACATCCG CCCAAAATGC TGACACCACA TCCTTTTCGC TCAGACAGAA 180 CAACGCATAC AATTCCATAT ACATACGTAT AAACATACTC ATACTTTCTG CTGTGTCAGA 240 TACTTTATTT CTAAGAACTT TAACATTGTA ATACATACAC ACATATTCAC TGTTAGCCCA 300 TTTAAGACGA AGAATAAAGA CGACCACAGT CGAGTGCAAG CAGCAAACAC TTGTAGACGT 360 ACATAATCTC CGATCAAAAT TCTCCCAAGA CGACCGTGGC TACGTTCTGG ACCCGCATAA 420 CTCCTCTATC TTTCTGAGTG ATAATACCTC CGCAAGACTC CCCGGAGGTA ACTGGCGCAG 480 CCGGAAAACT GGAATGGAAA ATACTTTATT AAACCTTCTA TTAGTTCTAT TGTAAGTAGT 540 TGTGGAAAAA GAGTGAGAAT GAAGTGCAGA AATGTCTAAA AGTGATTACA ACAAAAATCC 600 TAATACAATA CATAAACCGC CTTAACAAAC ATACAAAACA CGCATATAAA AAAAAAAAAA 660 AAAAAAAAAA GAAAAAAAAA AACCCAAAAC TTAAAAATGC CGTAACCGCG AAACATGATA 720 TGCGTTGTAC TTGTGTGAAA TCAATCGCTG ATAGTCACTG CCGAAGTTTA TTAAGGCCAA 780 GTACCATATC ATTACTTTCA TGTTTACATA CATATATATG CCCCACAATT AAAACAACAT 840 ACACACACAC AAATATTTCA AATGCAAAAA AAAAAAAAAG AATGTAGTGT ACCTGCGTGG 900 CATCAATCGC TGATAAACCA CTGCCGAAAT ATTAAAGGCC CGGTACTACA TCACAAAACA 960 CGTATATATG CAACAAAAAT ATACACAACA AAACCATATA TACAAACGTG TATGAGTGAC 1020 GTGTAATGTA CTTGTGTGAA ATCAATCGCT GATAATCACT GCCGAAGCTT AGTAAGGCCA 1080 AGTACCACAT CATTACTAAC ATGTGTACAT ATATATATAT GCAAAACAAT TAAAACAACA 1140 TACACACACA CAAATATTTC AAATGCAAAA AAAAAAAAAA AAGAGGAAAT GTTGTGTACC 1200 TGCGTGGCAT CAATCGCTGA TAAACCACTG CCGAAATATT AAAGGCCCGG TACTACATCA 1260 CAAAACACGT ATATATGCAA CAAAAATATA CACAACAAAA CCATATATAC AAACGTGTAT 1320 GAGTGACGTG TAATGTACTT GTGTGAAATC AATCGCTGAT AATCACTGCC GAAGCTTAGT 1380 AAGGCCAAGT ACCACATCAT TACTAACATG TGTACATATA TATATGCAAA CCACCAAAAC 1440 AAATACATAT ACACATACAA ACACTCCAAA AAAAAAAACA AATAATACTA TATGAACGGC 1500 GAAGCGTATG TTTTCTAAGG CTGGATACAA AACCACAAAA CCAAATATAA ATTGCACACC 1560 TTAATAAAGA AAAGAACAAA AATGATAATA AACAAAAGAA ATTTTTTTTG GAACATGCAC 1620 CCATACTCTC ACTCTTTCAA CACAAATAAA GTATTCAAAT TATACATACA TACAATAATA 1680 CCACTATATT ACAGAAATTA ACGCACAAGA AAACACACAC ACTATCCAAC AACAAACAAG 1740 TAATTAAGAG TTATTAAGTA CATTGTAAAC TACATATTTT TATCTTAAAT GTCAAAGAAA 1800 TTAACACAAA CTATTAAACA AACAACTCGC TCCGTGTTAG AATCACACAC ATTTCCCAAA 1860 AGAGTTACAC GATCAGTTTC GAAAACAAAC ACCCTCCCCG TAATAAGAGA AAGCACCCCC 1920 TTACCGCCCC TTCAACCTAT AAATATGGAT TCGGGCAACG CCTCCGTAGG TAATTCCGCC 1980 CCCGTAACAC CTACTGTCAG TGGCTTTAGC AGTATTGCTA CGGCACTTAG TGCCACCGAT 2040 ATTTTAGCCT TCGTTAAAGA ACTTCCGACC TTCGATGGTA CTCCAGGCCA ACTCGACAAA 2100 TATATAACTA GCGTTGAGGA AATAATCATG CTCATTAGGG GTACCGACCA AACTCCGTAC 2160 GGACTTCTGA CACTCAGGGC AATTAGGAAT AAAATAGTTG GAAGAGCAGA CGAAGCTCTA 2220 AACCTAGCCA ACACCAAACT TATATGGGAC GATATCAAAA GTAACCTACT ACGTTTATAC 2280 TCTAGCAAGA AAAGCGAAGC TACCCTCTTA GGCGAGCTCC AATCTCTCCC AGATAACCTA 2340 ACCCTAGGGC AATTGTTCTT CGGCTTATCG AGGATTAGGA GCCAACTTAT ATCCATTACT 2400 TCCAATAGTG GACAGTCGGC CACAATCATC GAAGCCAAGA AAACACTATA TGACGAAGTC 2460 TGTTTAAACG CCTTCATCTC AAGAATTAGA GAACCACTTA AAACAGTCAT CAGATTGAAA 2520 GACCCCAAGA CTATCGAAAC AGCTTACGAG CTATGTCAAG GAGAAAGGGC TCGTTACCAG 2580 AACAGAAACC CATATCCCCC AACACAAAAC AACACCGAAC GACGAACTAA CAACTACAAT 2640 AACAATAACA ACAACAATCA CAGAGACAAC AACAACCGCA ACAACGTAAC TCGTCTTACA 2700 CCCAAAACCA CTCAAACCAT TACTCAAACC CCAATTCCCA ATATCGTCAA TCAAACAACG 2760 GCAACAGAAC TAGTAACCCG TTTAAAGATA ATAAAACAAA TTATGGGCTA CACAACATAG 2820 AAGAAGAAAA ACTCACCCAA CACTGCCTTA CCAACTTAAA TTTTCAGGCA CCCGCCTCAG 2880 GAACCCAACA GGATACATAA ATCCTACCAC ACATGCAACA TCCCTTCCAT ACATAACTCT 2940 AAACCTCCAA CAAAAATTCC CTTTATCATT TCTTATCGAT ACAGGATCCA ATAACTCCTT 3000 CATTGACCCA GAATCTGCAA ACCAACTAGA GTGCACAATT CTACCAACAT CCACTTCAAT 3060 TACAACAGCA TTAAATAGTT TCAAAATTGA AGAAAAGGCA ATATTCCCAA TGCCACCCGA 3120 GTTCAAAACC GAAGGTCAAA TTACCCTACT TAAATTCAAA TTTCACTCTT ATTTCAATGG 3180 CCTCATAGGA ATGGACCTAT TATCACACCT AGAAGCAAAA GTAGACCTAG TAAACTTACA 3240 ACTAGTAACT TCAAAGTCTA CACTCCCAAT ATTCTTATAC ACTAACCAGG CTTCAAAAAT 3300 TTTTAACATC CCCGCCTACA GTAAAGTTAT CTTACCACTA CCAGTAAAGA CTAATCATGG 3360 GGAATTCTAT TGTTGTACTA CACAACTAAA TAATGAGTTA TCGTTGTCAG AAGGACTATA 3420 TAAATCAAAC AATAATATTG CCAATGTCGA AATCTCTAAC CAATCCGACT CAGATAAACT 3480 ATTATACCTA GAATACCCCC TAGAAACCAT TCCATACAAT AAAAACGACC ATATAGAGCT 3540 CTTTAATATA TCAGCTACAC CTCTTAATAA CGATACCCCT CAAGCCCCAT TACATATCCT 3600 CACAGAACAC CTCAATCCAG AGGAAAAAAC AGCCTTAACA ACCCTATGTA AACAATTTCG 3660 CGACATATTC TACAACCCAG AAACACCATT AACTTTTACC AACAAAATCA CACACTCCAT 3720 CCCAACCATA GATAACACTC CTATCCACAC AAAATCCTAC AGATACCCTT TTGTCCATAA 3780 AACAGAAGTC AAAAAACAAA TCGAATCCAT GTTAGACCAA CAAATTATTA GATCTAGCCA 3840 CTCCCCTTGG AGCGCCCCGG TGTGGGTGGT CCCAAAAAAA CTAGACGGGA CAGGGAACAG 3900 GAAATGGCGA CTTGTAATAG ACTACCGGAA ACTCAACGAC AAAACCATTT CGGACAGATA 3960 CCCCATCCCA AACATAAATG ACATATTAGA TAGCATAGGC AAAGCAAAAT ATTTCTCAAC 4020 GCTCGACCTA ACTAGCGGTT TTCATCAAAT CGAGATGAAT CCAAAAGATA TCGCCAAAAC 4080 AGCCTTTACA GTCGAAGGGG GTCACTACGA ATTCACACGG ATGCCCTTCG GCTTAAAAAA 4140 CGCACCGGCT ACCTTTCAAC GGGTTATGGA CAGCGTTCTT GGCGATCTCA ACGGCACCAT 4200 TTGCCTATTC TATCTTGACG ATATTATAAT TTTCTCGCCT TCCCTACAAA AACACCTGTT 4260 GGACATAAAA ATGGTATTCG AAAAACTCAG AGCGGCAAAC TTTAAACTAC AACCTTCAAA 4320 ATCAGAATTC CTAAGGAAAG AGATAGAATT TCTAGGCCAC ATAGTCACAC AAGACGGAGT 4380 TAAACCAAAC CCGAACAAAA TAAGTGCGAT CAAAAAATTT CCTTGCCCCA CCAACAGAAG 4440 AGCTATCAAA TCTTTTCTCG GGTTACTGGG TTATTATAGG AAGTTTATAA GAGACTTTGC 4500 ACGAATAACG AAGCCCATGA CTAAACAATT GAAAGGGAAA AGACAAGTTA CTACAGACAA 4560 AGACTTTGTA GACGCATTCG AACAGTGCAA AACTCTTCTG TCCAATGACC CAATACTCAT 4620 ACACCCAGAC TTCGAAAAAC CATTCATTCT TACTACGGAT GCTAGTAACT TCGCGTTAGG 4680 AGCCGTACTA TCTCAAGGCT CCTTACAAAA CGATAGACCT GTATGTTTTG CCAGCAGGAC 4740 CCTCTCCGAC ACCGAAGTCA ACTATTCAAC CATAGAAAAA GAAATGTTGG CAATAATATG 4800 GGCAGTAAAA TACTTCAGAC CATATATTTA TGGCGTAAAA TTTACTATTG TTACAGATCA 4860 CAAGCCACTA ATATGGCTTA TGAATTTCAA AGAACCCAAC TCAAAAATAA TTCGTTGGAG 4920 ACTCCAACTC ATGGAATACA ATTTTGAAAT AATTCACAAG AAAGGTTCAC AAAATGTAAT 4980 TGCAGACGCC TTAAGTAGAG CGGACCCAAA TTTAAACTAC AACGAAACAC TGACTGTTAA 5040 GCCTTGCCCC ACATCCGAAA AACCTATTAA CGAATTTAAC ACGCAACTCA TACTAGAAAT 5100 AGATACAAAT ACGTCTTACC AAACTACAAC ACCATTTAAA CAAAAGATTA GGAAAAAATA 5160 TTCACAGCCT TGCTTCGATT TCGATAATAT TGTTAAAATC TTGAAAGGAA CCCTAAAACC 5220 TAACAGGATT TGCGCATTCT TGGCGGACGA TAATAATTCC GCATTAATCG AAAAAGCATT 5280 CTCAACGTAT TTTGCACATA AAAAACACTT TAAAATTATC AGATGCAAAT CACTTCTCCA 5340 CGAAATCGTA GGAAACCCCG AACAAAACAA ATTCATTCAG GAATATCACA CTAACAGCAA 5400 CCACAGAGGC ATAGACGAAA CATTCCTTCA CCTCAAACGA GAAACCTACT TCCCCAATAT 5460 GAAAAACAAA ATCTCTGAAT TAATTAGGAA TTGCGAAACC TGTCTAAAAC TCAAATACGA 5520 CAGGCAACCA CAAAATATAG TATTTGAAAC CCCAGAAACC CCATCGAAAC CCCTCGACAT 5580 AATACACATA GACATCTATA CTATTAACAA TAATTTTAAC CTGACAATCA TAGACAAATT 5640 CTCAAAATTC GCAGCTGTCT ACCCCATCCC AAATAGAAAC GGCATCAATT GCATCAAAGC 5700 AATCAAAAAT TTTTTCAGTC AATTCGGACT ACCCAAAAAA CTAATACACG ACCAAGGAGT 5760 AGAATTTTGC AACGACATAT TTCGAAAGTT TTGCTCTCAA TATAATATAC TTCTCCATGT 5820 CACATCCTTC CAGCAATCTT CAAGTAATTC TCCAGTAGAA CGTTTACACT CCTCTTTGAC 5880 AGAGATTTAC AGAATAATAC TAGACACACG GAAAAAACAC AAATTACCTA CAGACCACGA 5940 AGAAATAATG TCAGAAACTG TAATAACATA TAACAACGCA ATCCACTCCA CCACCAAACA 6000 CACCCCTTTT GAACTTTTTA ATGGTAGGAC CCATTTATTC GAGAAAACAA TAATACCCAA 6060 TAATGAGCAT GACTATTTAA ATAAACTAAA TACGTTCCAA GACAAACTAT ACTCCGAAAT 6120 AAAAGAAAAA TTGTCCACAA ACACCCAACA AAGGATAGAA AAGCTAAACA CAAGCAGAGT 6180 AGAACCAACA ACAGTACAAC CTAACAGCAC AATTTTCAGA AAAGAAAACA GGAGAAATAA 6240 ATTAACACCA CGGTTTTCCT TACACAGAAC AGCAAAAGAC AAAGGAAAAA CTCTAGTAAC 6300 CACAAGAAAT CAAAAAATCC ACAAATCAAA AATTAGGAAA ATATCCAAAC CTCCAAATGA 6360 CTTAAGCCTT TCCACCTGCA TTCCAGATCT TGCCATGGGG CATACCAATC TATCTTCATC 6420 CACAACTTCA ATAGCACCAA CCTCCTAGCA AAAGTGCCGG TAGGGAAAAC ACTCGTGATA 6480 GGAAACTATA AAAAAATTAG CCACATAATC GATCTGTCCG AATACACCAA CTGTATTGAA 6540 AAATTATACC ACACCATCGA TACCCTAAGA CAAGATGAAA CACTCACCGA TTCTATATCA 6600 ATACTAAATG CTAAACTGGC CCAAACTCAA AGTAAAATAG ACGCACTAAC ACCCTTTTCA 6660 CGCCACAAAA GAGGTCTTAT TAACGGGTTA GGTAGTTTAG TCAAAGTCGT CACCGGCAAC 6720 ATGGACGCCA ATGATGCAAA GAATATAGAA ACAGAAATTA ACCACTTAAA AAGCCAGTCC 6780 ACCACTATCT CAGATAACTT CGAAATACAG AACTCGTTCA ATGATGAAGT TCAACTACGG 6840 TTCAAAAACT TAACAAGACA CATTAACAAT GAACAGAATT TGATTAAAAA CTTCTTCGAA 6900 AACACTCAAA ATACAATTTA CACAAAAATA TACAACAACG AAGAAGAAAT AAAGAAACTA 6960 CAATATATAA ATAGGCTTAA CTACAATATA GATTTATTAG TTAGCCACCT AAGCGACATT 7020 ATAGAAAGTA CACTGCTTGC CAAAATTAAT GTTATCCCAA AACTCATCTT AGACAAGACA 7080 GAAATAACCA AAATCAAACA AATTTTTAAA ACACAAAACT ACACAATAAA ATCCGAGCAA 7140 CACATTTATA ACCTCTTAAA AATGAACGCA CTCAATTACC AAAACAAAAT AATTTTTAGT 7200 ATCAAAATTC CTATTTTTTT AAGTTGTAAC TACGAAATGG CAAGATTAAT TCCACTTCCA 7260 ATAAATTCCA CACAATTTGT AATAGCACCT AAGTACTTAA TATATAATAA CAAAAGTAAC 7320 AGCATGTTTT CAACTATGTA TAAATGTCCT GTAATAGAAG AACAATTCGT CTGCGAAATC 7380 GACTCCATCA ATAATCTTAA AAATAATACT TGCCTGGGAC ACCTTATCCA AAATAAGACC 7440 AGCTACTGCG ACATAAAGGA AACGGGACTC ACGACTGATG TGTTCGAACC GGAAAAAGGC 7500 TTCATACTTG TATTTAACGG GAACAACCTC CCAATCATCT CCTCCAACCA GACCATAACT 7560 AGTATCAATG GATCAGCTAT AATAAAGTAT AACAATTGCA CATTAAAAAT CAATGAAATA 7620 AACTACGACA ACAGGGCGGT ATCAACAGAA GAGCACCCCG ACTTCTTCCT ACCACCAATG 7680 CGGAAACTAA TAAAAAATGC CACTATCAAC ATACTCACCT TGGAAAGACT TCACCTGGAT 7740 ACACTCACAA CATCCAATAA GCTACTGGTC GTCGCCGCAG GAAACTCTCG ACACTCGACA 7800 ACCTTGTATA TCCTCTTCAC CGTATCCCTA GTCGCCGTAA TACTCACCTG GACACTTCGA 7860 AGGGACACCC ACATCTTCCA TACCGGGCCC GACCACATTC TTCCAATCGT CGCTCCACCA 7920 ATTCCTCCGT CTATGGCCTT CGCTCCAAAC TGGGGGGGGA GGAGTTACCG ACCCATCGGT 7980 ACCATACACC ACCCCTCCCT CTAAGCCACC ACGCCTACAC AAGTAGAAGA CATCGAACCG 8040 GGAAGCTTTG CGATACAAAG TTGCAGCATA AACATCAACA ACGGGTCAGA CGCCGACATC 8100 CGCCCAAAAT GCTGACACCA CATCCTTTTC GCTCAGACAG AACAACGCAT ACAATTCCAT 8160 ATACATACGT ATAAACATAC TCATACTTTC TGCTGTGTCA GATACTTTAT TTCTAAGAAC 8220 TTTAACATTG TAATACATAC ACACATATTC ACTGTTAGCC TATTTAAGAC GAAGAATAAA 8280 GACGACCACA GTCGAGTGCA AGCAGCAAAC ACTTGTAGAC GTACATAATC TCCGATCAAA 8340 ATTCTCCCAA GACGACCGTG GCTACGTTCT GGACCCGCAT AACTCCTCTA TCTTTCTGAG 8400 TGATAATACC TCCGCAAGAC TCCCCGGAGG TAACT 8435 // ID DME010298 standard; DNA; INV; 8507 BP. XX AC AJ010298; XX DR FLYBASE; FBgn0015945; GATE. XX SY synonym: Batumi XX FT source AJ010298:1..8507 FT SO_feature five_prime_LTR ; SO:0000425:1..272 FT SO_feature three_prime_LTR ; SO:0000426:8236..8507 FT SO_feature polyA_signal_sequence ; SO:0000551:158..163 FT SO_feature polyA_signal_sequence ; SO:0000551:8383..8398 FT SO_feature primer_binding_site ; SO:0005850:276..293 FT SO_feature RR_tract ; SO:0000435:8236..8507 FT SO_feature CDS ; SO:0000316:1741..6456 FT /db_xref="FLYBASE:FBgn0044067; GATE\polyprotein" FT /db_xref="SPTREMBL:O76925" FT /protein_id="CAA09069.1" FT /translation="MPIGDDKKKLSADKPRSIFSPQGPKSPRIPSISVKTPAQISDDCA FT TPSKATVQRTAKNMAASDLALAKFISVSDAQANLRLRSTLRNPQLQPSRCLASVATKSE FT AYGTRLKRIRPLLRVPCVSRRAAASGMPILRASYSYCYSVYERCVAQLVDKIEQGHFSV FT HPKRTLRPRPTFPLAVGCLHAIQEFSQVTIFAGRFPDLLHAIYINNPRLTPFEKLFHLN FT AKTSGDAHAIVSISPLTKRGFSSAWENLIERFENKRLLVNSQLKILFNVQSIPQESGAA FT LKVMQSTVQGCLTALELSGINTENWDCLLEYLCSSKLPKITLSLWEQSLHKKADIPTWG FT ELNTFLTERHRTLEAIDDVRPSVPSQSHSKAMNSSGPSRDGKLASDLCNKENHPVRVCR FT VFSKWSVDDRSAYIKRKQLCLNYFAKGHQLRECKDRQSFTWWPASHVVAPKQPLFQQFK FT PFKSCKPNFRYSGQFRSKRASRCSKLFCHGSRAILLGSAIINSSHLGTNFKARALIDSG FT SEATFITEPLFNLIRLPFQVVQAQVSGLNQTVAAQFKNAAVSPSDLRLGRVAVGDDGLC FT PPSTSRKSAFLPNSAKFLRDLPDFPLADPKFYESAPIDVLIRSPHPASVLLSGAKTNIC FT GSLLGQETIFRWVLTGPVSASAQSRIPLFRHRSPTRTIIHWTNSSQNLGRWRIYQQSCK FT RIRFHVRERVGKCLRRHQCGKYVVTLPFRDPEHIGCGLGHSRSWALAQFLKNEQRLKKD FT EALKARYDSVIQEYLDLKHMRQVLPTHDCNAYYMPHHAVLKPESVTTKLRVVFNASSPS FT SNGTSLNDILHAGPVLQSDLTVQILKWRDFRYVFSADIQKMYRQIWVDPKHTPFQRILF FT RNNRGEIRDFELKTVTFGVNCAPLLAIRVLQQLAADEELSHPKASNVIRNFMYVDDVLA FT GADSTEEAQLMVHELRDALNSSSSRQRWLSKRPLQRQVLSQIAKLFDPAGWLAPFIVRA FT KIFMQEIWLQELGWDENVPNDLFQRWLNFLQSYSVFEQIRIPRWLSFHPDFKVEHHGFC FT DASQKAYGAAIYVRGEVGSAIMVQLLTAKTRVAPVKTVSLPRLELCGALLLSEMAAAII FT PQMPTINSKLYCWTDSTIVLAWLSKPACQWTTFVANRETKIAQATKTENWSHVQSEHNP FT ADLASRGVSLQDLADSQLWWHGPTWLQNPRNQWPTQVNAPVTDLEKRALKVHLAKAPSE FT ELLARFSKLEKALRVLAYVYRFIQRCRKQTSPSDVHLLATEIAAAERFLISNTQRREFP FT VEYHCLSEKRPVPSSSAILSMNPFLDPQGLIRACGRVAASESPQYNERHPVILPYNCLL FT SRLLAKFTHRTTLHGGNQLMVRLIRSKYWIPRIKNLMKAVVNSCKVCVIHKRRLQSQLM FT GVLPKERASFSRPFTVSAWITPVRDIKNYTGRACVITKGYVLVFVCFSTKAIHLEPTSD FT LTTEKFLAAFSRFVSRRGCPRQVQSDNGKTFVGAATLLSRDFLQAVKESVTNAYIHQEM FT QWQLFSGGTQYGRPLGSRRKKLQDAILQMHGHTKIHVRRTLHALGKNRSVP" XX CC Derived from AJ010298 (e1315889) (Rel. 56, Last updated, Version 1). CC Takis Benos and Michael Ashburner, 08-Sept-1998. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 8507 BP; 2245 A; 2251 C; 1836 G; 2175 T; 0 other; TGTTCAAGTT ACGCTCACCC GCTGTCACCC GCTGTCACCC GCTCTCCGCT CCCTCTTACG 60 CTCTCCCGCT CTTCACCTCA GAGTCTCCAA GGAGTCCTCG GGCTTGGGAT AGCCTAACTA 120 ATTAGAATAA GCATCAGTGT AAAAACTAAC CACGCTGAAT AAACATACGC CCGGTCGCCG 180 CGCAATTACG AAAAGTCTAG TGTTTGCTTT CCTTCGAGTG TTTCTTTTCA GCATATTTGA 240 ATTCAGGACA GCCATCCCCC TACATCCCAA CATTTTGGTC CTTCGAGCCG GATCACCTGG 300 ATTTTCAAGT TTGTCCACCA GCGAACAAAT TATAAGATAA GTACGAAATT TCCATTCCTT 360 TTAATTGCCG GTCTGCAGCA AAAGGTTCGA AAATCCAATT TCGTTCAATT TGCTGTAAGA 420 TTTATTGTCA AATCTAACGG ATTTCTCCGA CAAAAGGCAA TTAAAGAAAA GTACTTATCC 480 AATCTCACGG GCGCCGCATA TTACTCGCCG TTCTCCGTTC TCCTTTCACC CTCATTCGTG 540 AAAATTTCTA AAGTCCAAAT GGGCGAATAT ATTTAAATAT TAATCCAGTG CGATAATGCA 600 AAATTCCAAA TGTGAAAAAG TGAATAATTT GTGCCAAGTT CAGTGAAACT TTCTAAGTCC 660 AAAGCTCTGC CAAAATTGGC AAAAATTCTG TTCTCGTTTC ACTGTGTCAA AGCGAAGCCA 720 AACTTCTTTT CGCAACACAT TTTTGCTTTA ACTCCGCAGT CCACTTAATA CCATTTGCTT 780 TGCTATCGAA GAATACCACA ACGAAACAAA CAACACCATA CCCTCTGGCC ATTCAAATAA 840 CATATTAATT AACATTTCCG CAGTTCCATA TCTCTTACAT CAACATATAC CTACTCCATA 900 CTCTTGCATA TATTCACATC TTACACAATA TATCCTCACC ATATATTACA TATATTACAT 960 CAACATATAA TATCCACATA TATTACCGAC ATACATTGCG CATATTATCA GCATTCCTTT 1020 AACGTATACC AAAGTTTAAA TTCGATCCCG TCGGCAAATC CAACCACAAA TAAAATTTAT 1080 TCCAAGTGCC GACGCGGAAA GGCGTTTTCT TTTCCATCAA TTTTTTCCGT AAATTTCCAA 1140 ATTAATTTCC GAGCAATAAA TTAAAAGCGG TTTTTTCTTT TTTTTTAACA AATAACTTAT 1200 TGTTGAAAAC ATTTATTAAA TTATTAAAAA TTATATAAAT AATACGACCG CCAAATACAA 1260 GTCGTTCACC CGACAAATAT TTTTTCCTGT ATTGCTTGGA TATTAATTTG TGTTTGTTTT 1320 AGAAGTACTT ACAACGCGGA GAAAAGACTC CAAATCCACC ATTCCATTTT CTCCGTTTCC 1380 AGTTATAAAC AAAAAAATAA ATAAAATTTT CTTCCTTCTA ATAAACATTT TATTTTACCG 1440 TGTTCACATT CCAAGTGTTC CAACCGTAAA TAAGGTGGAC CTAATTACCA TAAATCACAG 1500 GTCATTTATA CAATTCGCTG TTCACTCCGA GTCACCTGTC CAATTAGTCT AAACTACGGC 1560 GTTTCCACTT CGCAAATTCA ACACCACTTT CTCACCCATT ACATCCTATA CGGTCCTTTT 1620 CCGCTGCTTT ATACCGTTCA CGGCAGGAAG CTTAAATTTA TTAAGTGGAA TCTGTCTACT 1680 TTTTCAAAAG TGTGACCGGG CTCCAAAAAC GCTTCCCTTC CATTTCGTAT TTCTTCGATT 1740 ATGCCCATCG GGGACGATAA GAAGAAATTG TCCGCTGACA AACCCAGGTC TATTTTTTCA 1800 CCACAAGGGC CCAAGAGTCC AAGAATCCCA AGCATTTCGG TGAAAACGCC TGCGCAGATT 1860 TCCGACGACT GTGCCACTCC ATCCAAAGCC ACAGTACAGC GCACAGCTAA AAATATGGCT 1920 GCTTCCGATC TAGCGCTAGC CAAATTCATT TCGGTTTCTG ACGCTCAAGC GAATTTGAGG 1980 CTCAGATCAA CACTCCGGAA TCCGCAGCTC CAACCGTCAC GATGCTTAGC GTCCGTCGCG 2040 ACCAAGTCCG AAGCCTATGG GACAAGGTTG AAAAGAATTC GACCTCTGCT CAGAGTGCCT 2100 TGTGTCAGCA GGCGAGCGGC AGCAAGCGGC ATGCCTATTC TCAGGGCTAG TTACAGTTAT 2160 TGCTATTCAG TCTATGAAAG GTGTGTTGCC CAGCTCGTTG ATAAAATCGA GCAGGGGCAC 2220 TTCTCAGTCC ATCCCAAGCG AACGCTGCGG CCCAGGCCTA CATTTCCTCT GGCTGTCGGT 2280 TGCCTCCATG CGATACAGGA GTTTTCGCAG GTGACTATCT TCGCTGGCCG CTTTCCGGAT 2340 CTTTTACACG CCATTTATAT TAATAATCCA CGGCTGACTC CGTTCGAAAA GTTATTCCAC 2400 TTAAATGCCA AAACAAGTGG CGACGCGCAT GCCATAGTTT CGATTTCGCC TCTCACCAAA 2460 CGAGGGTTTT CCTCTGCGTG GGAAAACCTA ATAGAGCGTT TCGAAAATAA ACGATTGTTG 2520 GTAAACAGTC AATTGAAAAT ACTGTTTAAT GTGCAGTCGA TACCACAGGA ATCTGGGGCG 2580 GCCTTGAAGG TAATGCAAAG TACTGTTCAA GGTTGCTTGA CTGCCTTAGA ACTGTCAGGC 2640 ATCAACACTG AGAACTGGGA CTGCCTGCTG GAATATCTGT GTTCATCCAA GCTCCCGAAG 2700 ATAACTCTCT CCTTATGGGA GCAGTCTCTA CATAAGAAAG CCGACATCCC GACATGGGGA 2760 GAACTGAACA CCTTCCTCAC AGAACGTCAT CGAACCCTAG AGGCCATCGA TGATGTGAGA 2820 CCGTCCGTAC CAAGTCAGTC GCACTCCAAA GCGATGAACT CAAGTGGGCC CTCTAGAGAT 2880 GGCAAGCTGG CGTCCGACTT GTGCAACAAG GAAAACCATC CTGTCCGTGT ATGTCGCGTT 2940 TTCTCCAAAT GGTCGGTTGA CGACCGGTCA GCCTACATTA AACGGAAGCA GTTATGCTTA 3000 AACTACTTTG CAAAGGGACA TCAGCTTCGT GAGTGCAAAG ATCGACAAAG TTTTACTTGG 3060 TGGCCGGCAT CACACGTTGT TGCACCGAAA CAACCTCTTT TCCAGCAATT CAAGCCCTTC 3120 AAATCCTGCA AGCCCAATTT CCGCTACTCA GGCCAATTTC GTTCCAAACG AGCAAGCCGG 3180 TGTTCAAAAT TATTTTGCCA CGGCTCAAGA GCTATCCTTC TTGGCAGTGC CATAATCAAT 3240 AGTTCCCATC TTGGCACTAA CTTTAAGGCA CGCGCCCTGA TCGACTCCGG ATCAGAGGCG 3300 ACATTCATAA CCGAGCCACT GTTCAATCTA ATTAGATTGC CATTCCAGGT GGTTCAAGCC 3360 CAAGTCTCGG GCTTAAACCA AACAGTAGCT GCTCAGTTCA AGAACGCTGC AGTTTCACCA 3420 TCCGATCTCC GACTAGGCCG CGTTGCAGTT GGAGACGACG GCCTATGTCC TCCCTCAACT 3480 AGCCGGAAAT CTGCCTTCCT ACCCAATTCC GCAAAATTTC TTCGGGATCT TCCCGATTTT 3540 CCACTGGCGG ATCCAAAATT CTATGAGAGC GCCCCAATAG ATGTACTTAT CCGGAGCCCA 3600 CATCCTGCTT CGGTGCTTCT GAGTGGAGCA AAAACCAACA TCTGTGGCTC TCTCTTGGGG 3660 CAAGAGACCA TTTTCCGCTG GGTACTAACT GGGCCAGTGT CAGCCTCAGC CCAAAGCAGG 3720 ATTCCTCTTT TTCGACACAG ATCTCCCACG CGTACGATAA TTCACTGGAC AAACTCCTCA 3780 CAAAATTTGG GGAGGTGGAG GATATACCAA CAAAGTTGCA AAAGAATCCG ATTCCATGTG 3840 CGAGAACGGG TTGGTAAATG CTTACGACGA CACCAGTGCG GCAAATATGT CGTTACTCTG 3900 CCTTTTCGCG ACCCAGAACA TATCGGTTGC GGGCTAGGGC ATTCTAGGTC TTGGGCGTTG 3960 GCTCAGTTCT TGAAGAATGA GCAGCGTCTA AAAAAAGATG AGGCCTTGAA AGCGAGATAC 4020 GATTCGGTGA TCCAGGAATA TCTCGACTTA AAGCACATGC GACAAGTTCT GCCTACCCAT 4080 GATTGCAACG CCTATTATAT GCCACATCAC GCCGTCTTAA AACCGGAGAG TGTAACTACT 4140 AAACTCCGTG TAGTATTCAA TGCCTCCAGC CCTTCATCGA ATGGTACCAG TTTAAATGAT 4200 ATCCTTCATG CTGGCCCTGT CTTGCAGTCC GACTTGACAG TGCAAATTCT GAAGTGGCGC 4260 GATTTCCGAT ACGTGTTCAG TGCCGATATT CAAAAAATGT ATCGGCAGAT CTGGGTAGAT 4320 CCGAAACACA CTCCATTCCA GCGAATACTT TTCCGTAACA ATAGAGGGGA AATCAGAGAT 4380 TTCGAATTGA AAACAGTAAC CTTTGGAGTC AATTGCGCGC CCTTGCTGGC GATCCGAGTA 4440 CTGCAGCAGC TAGCAGCTGA CGAAGAACTC AGCCATCCAA AAGCTAGCAA TGTCATTCGA 4500 AATTTCATGT ATGTGGATGA TGTTTTAGCC GGAGCGGACT CTACGGAAGA AGCTCAGCTC 4560 ATGGTGCACG AGCTCCGAGA CGCTCTGAAT TCTTCTTCGT CCCGCCAGAG ATGGCTATCG 4620 AAACGTCCTT TACAACGCCA AGTCCTGTCC CAAATTGCCA AATTGTTCGA CCCTGCAGGC 4680 TGGTTAGCAC CGTTTATCGT TCGAGCTAAA ATTTTCATGC AGGAGATTTG GCTACAGGAG 4740 CTTGGGTGGG ACGAAAACGT TCCAAATGAC CTTTTTCAGC GATGGCTTAA TTTTCTCCAA 4800 AGTTATTCGG TTTTCGAGCA GATACGCATT CCACGCTGGC TATCGTTTCA TCCAGATTTC 4860 AAGGTCGAGC ATCATGGCTT TTGCGATGCA TCGCAAAAGG CTTATGGCGC CGCAATATAT 4920 GTCCGCGGAG AAGTGGGCAG CGCCATTATG GTGCAACTCC TAACCGCCAA AACCCGGGTA 4980 GCACCAGTCA AAACGGTTTC GCTCCCAAGA CTCGAGCTCT GCGGAGCGTT ATTGCTTTCC 5040 GAAATGGCTG CAGCCATCAT TCCGCAGATG CCTACGATTA ACTCCAAACT TTACTGTTGG 5100 ACGGACTCCA CCATAGTGCT TGCATGGTTA AGCAAGCCAG CATGCCAGTG GACCACATTT 5160 GTAGCCAATA GGGAGACGAA GATCGCCCAG GCCACAAAAA CAGAGAATTG GTCTCATGTT 5220 CAATCTGAGC ATAATCCAGC AGACCTGGCA AGTAGAGGAG TTTCCCTCCA AGATCTAGCC 5280 GATAGCCAGT TATGGTGGCA CGGACCGACT TGGTTGCAAA ATCCACGCAA CCAATGGCCT 5340 ACTCAGGTCA ACGCTCCGGT GACCGACCTG GAGAAGCGTG CTCTAAAAGT CCATCTCGCG 5400 AAAGCTCCTT CTGAAGAGTT GTTGGCACGT TTCTCCAAGC TAGAGAAAGC TCTACGAGTC 5460 CTTGCCTATG TTTATCGCTT CATTCAGCGG TGCAGGAAGC AGACATCTCC ATCTGATGTT 5520 CATCTACTGG CCACTGAAAT CGCCGCCGCC GAGCGGTTCC TAATTTCGAA CACTCAACGC 5580 AGAGAATTCC CTGTGGAATA TCACTGCCTA AGTGAAAAGC GTCCAGTGCC AAGTTCAAGT 5640 GCCATCCTAA GCATGAACCC GTTTCTAGAT CCGCAAGGAC TGATCAGGGC ATGCGGCCGT 5700 GTGGCGGCTT CCGAAAGCCC TCAATACAAT GAACGCCATC CAGTGATTCT TCCGTATAAC 5760 TGCCTGCTTT CTCGCCTCCT TGCGAAGTTC ACGCATCGCA CAACTCTCCA TGGTGGTAAC 5820 CAGTTAATGG TGCGCCTCAT CCGGTCGAAA TACTGGATTC CGAGAATCAA GAACCTGATG 5880 AAAGCAGTGG TAAATTCGTG CAAAGTATGT GTGATCCACA AAAGGCGGTT GCAAAGCCAA 5940 CTGATGGGTG TCCTGCCCAA AGAAAGAGCA TCGTTCTCCC GACCATTCAC GGTATCGGCA 6000 TGGATTACGC CGGTCCGCGA TATAAAGAAC TATACGGGAA GAGCATGTGT TATTACAAAG 6060 GGGTATGTGT TAGTTTTTGT TTGTTTCTCC ACCAAGGCCA TCCACTTAGA GCCTACATCT 6120 GACTTAACGA CCGAGAAGTT TCTTGCCGCT TTCTCTCGTT TTGTATCCAG GAGAGGGTGT 6180 CCACGTCAAG TCCAGTCAGA CAATGGCAAA ACCTTTGTTG GCGCTGCCAC CCTGCTTTCC 6240 CGCGATTTCC TTCAAGCCGT AAAAGAGTCG GTGACGAATG CCTATATTCA TCAAGAGATG 6300 CAATGGCAAT TATTCTCCGG GGGCACCCAA TATGGGAGGC CTTTGGGAAG CAGGCGTAAA 6360 AAGCTTCAAG ACGCTATTTT ACAAATGCAC GGCCACACGA AAATACACGT TCGAAGAACT 6420 CTCCACGCTC TTGGCAAAAA TAGAAGCGTG CCTTAACTCC AGGCCGCTCT CTCCTATGTC 6480 TGAAGATCCG ACAGACTTGC TGGCTCTGAC GCCAGGGCAT TTCCTTGTCG GGGGACCCCT 6540 TATGTCCACG GTGGAACCCG AAGTAAAGGG GGAAACGAAA TCCCTTCTTA ATCGGTGGCA 6600 GCATTTGAAG GCTCTCCATC AGCAGTTCCG TGTGCGATGG AAAGAAGAGT ACCTCAAAGA 6660 ACTCCACAAG CGTTCTAAAT GGCAGGTCCC GTGAACTTCG AGCTAAAATA CTCGTGCATG 6720 TGGAGCAGCG TGTGGTGGGA TCGGTCGCAC TTCTTGCAAC GATCACCGCT TCGGCAGTCT 6780 CCCGTGGAAT GCTCGTGAGC GAGGCAATTG GCGCTAGTAT GTTGGTAATG AGGACTGCTC 6840 GCAAACGCTT TTCAGCGCTG AGCTTTAGGA ACCTCGCGCA CGTCCGAAGA GGATGGATTA 6900 CCGCGGCAGA CTCGGCATCG GTAGGATTTA ATACCTCGGG TACGTCTGCT CTCCACGGCA 6960 CGACCTGCGT GCGTTTTGTT GACGAGGAGC CATGTGCGCG TAGTCGAATG TCGAAAGGAG 7020 ATCGAAAACG AAATGAAAAA TAACGGATGA TTAGTGATAG TGAACTACAA CTAAGGACGA 7080 GAGGAGAGAC CTATTATTGT GGAGATTCGG AACTCCGTCG GCAAAAGCAC CTTTTTTGCC 7140 ACTGGACGTT TAATAACTCC ACGTGCAGTA CGGATGTTTA CTACACGACG TTGCCGTCAG 7200 CTCCTGGGAA AACAGACTCA ATTCTGCCGA GCCGCCACTC ATTAGAGGGC AAGTTGTCGT 7260 CCTTGATGAC GACTGTTGAG TTACGGGGTT TGGACTGTCA CCGTCTCCGC TCCCTCTTAC 7320 GTTCTCCACT CCCTCTTACG CTCTCCCGCT CTTCACCACA GAGTCTCCGA GGAGTCTCTG 7380 GCGCTTGGGA GAACCCAACG CATTAGAATA AGTTTTAGTG TAAAACTAAC CACGATCAAT 7440 AAAACATACG CCCGGTGCCC GCGCTAATTC TACAAGTCTT CGAGTGTTTT TTCGAGTGGT 7500 CTTTTTTTCA GCAAACTAGG AACTTTCCAG GACCAGCACC CCCCATCACC CCAACAACGA 7560 CCATGTCATC GATGGCGTCG CCAACCGGCG CCAACCCTGC AAGCAGCTTC GGCCTCGTCC 7620 CAACGTTCCC GCCGGCACAA TCCGCCAACT CAGCGTAGGA GTTCACCGCC ACGACGACCG 7680 GAATCGACGA CGCCAGGCCC ATCACTCTCG TCGCCGCTGC AACGCCACAG CGTGAACATC 7740 CTTCCCACAG CGCTGGTCAA GATGGAGACC GGGACGAAGA CCTTCAGACC GCAGCACTCA 7800 TCGATCCGTG CAGCCCCATG AGCTGCATCG ACGCTTCGTT GGCGTCAGCC TTTAAGCTTT 7860 CGATGACCAA TGTTGGCGAC GAGAAGGTCT GCACGACGAC GATTCGCTCC AGGATCGACG 7920 CGAACACGAA GCTCGAGGTC GTGCTCAAGA TCGAGCCCAG GGTGCGGATC CGTACACCTG 7980 TCCGGGCATT GAGCGACACC GTAGTGTCCA AGTACAGGGA CATCATGCTG GCGGATGACG 8040 GGTTCCATCG GCCTGCTACC GTATCCATGG TCTTAGGAGC AGACATTTAT CCTAAGGTTA 8100 TCCAATCCGG ATTCCTGACC TTCGACGAGG GAATGCCGGT CGCTCAAAAG ACCGTGTTTG 8160 GGTGGATCGT GTCCGGTGCC TGCAGCTTGC CTAGATGGCT ATGTTGCAAC CCCAGTGATT 8220 GCAAGGGGGG CGGAATGTTC AAGTTACGCT CACCCGCTGT CACCCGCTGT CACCCGCTCT 8280 CCGCTCCCTC TTACGCTCTC CCGCTCTTCA CCTCAGAGTC TCCAAGGAGT CCTCGGGCTT 8340 GGGATAGCCT AACTAATTAG AATAAGCATC AGTGTAAAAA CTAACCACGC TGAATAAACA 8400 TACGCCCGGT CGCCGCGCAA TTACGAAAAG TCTAGTGTTT GCTTTCCTTC GAGTGTTTCT 8460 TTTCAGCATA TTTGAATTCA GGACAGCCAT CCCCCTACAT CCCAACA 8507 // ID ROXELEMENT standard; DNA; INV; 4740 BP. XX AC AF237761; XX XX DR FLYBASE; FBgn0042231; X-element. XX SY synonym: BS2 XX FT source AF237761:1..4740 FT SO_feature CDS ; SO:0000316:322..1827 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0041613; X-element\ORF1" FT /db_xref="SPTREMBL:Q9NBX5" FT /protein_id="AAF81410.1" FT /translation="MNTLNETAAADESLDTAFLSSPQCAAPQRFQKIKRKSRASPETER FT KKPKSTIGKQGENPSATEPRYGGNSNRFGLLAHLTADKQVGNEIGDLYDQPSTSHQAAI FT AAAKRDAASAGTTSSAKRAQSKPPPIVMEGVDDVYLMMQSIENIVDLEKIEARASMSGV FT LRLYAADANTFRTIVNWLEIEEYEFHCYQLKEDRPYRVCVKGLHHSTLHHQIKDELEKI FT GHKVLDIHTPLRRNEPGTSKASPVNMFFLNIAAAANNKEILAVKALCHMRVVIEPLRKR FT NAIVQCHRCQQIGHTAKYCRKAHICVKCAGEHPAKDCTRPRIELCTCYNCGGQHPANYK FT GCSKLQAFLQRSRPRSGVAGRTEVSDRPTPRGLAGGKEIPSSRGGISYADVARGSIHHK FT QPMSLTHQQQKQKQQPYDGSPSRQRSRSRTRASRGTLQRSTDASSSIEAILQTLNENIN FT SLRSIQEKQMELMMMMMKQQQQQSHQQGQIINLLTALQARQAP" FT SO_feature CDS ; SO:0000316:1827..4553 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0041612; X-element\ORF2" FT /db_xref="SPTREMBL:Q9NBX4" FT /protein_id="AAF81411.1" FT /translation="MMPLRILVWNADGVSTKLPEVECFVRRHEIDVLLLSETHCKGAET FT PKLFGFVAYTANDPSGGNAKGGAAILIKNSLAHFPLTPIATAKVQLAPAVIETALGPIS FT FGAVYCPPRFAWTTDEFKDILEEFQTKFIVAGDWNASHWLWGAGRSNQRGIALANLVLN FT SEVDSLATGGPTRYPYGCRGSPGYIDFALTKGVLGIHANISAVVELSSDHLPLVITLDA FT GAISYPKMERLITRRTNLEVFQSQLESTLPLNTAINSGQDVDDAIELLTNNIKSAARLA FT TRSISRQPAADRIPIPREILLLIAEKRRLRTRWMRSRHPSDKTEWNRALSRLRCALVLH FT KAAWFDERLANTGVESEATHSLWKATRAIKRRCTRKAPLVDSNGTWCRTDLGQAEVFAA FT HLAERFQPFKLASLQQVEETQDQLNQALQMDMPITPFEPCEVAEVIVRQSNNKAPGHDV FT ICNATLKALPRQAILYITLVFNAIVRLQYFPYQWKLGIISMIHKPGKPEREPASYRPIS FT LLPSISKVFERLIAVRIVSIMEAQGITPEHQFGFRAGHCTVEQLHRVVEQILTAYDSKE FT YCNSLFLDIREAFDRVWHIGLQLKIKQTLPAPYFGLLKSYLEGRRFAVRFHSAISTEHN FT VAAGVPQGSVLGPLLYCLYSHDMPQPDVSLYGKSMLATFADDVCVTYRSRCEHDAADGI FT QDFAYRFSEWARRWNIGINSSKSNNVCFTLKRRTPPPVYIEEVPVPQPNAAKYLGVLLD FT RRLTFSKHVTDIRTRLRAKVAKHYWLLSSRSKLSLSNKLTIYKQILAPNWKYGCQIWGL FT ACDSHIKRIQAIQNKVARLITGCEWFVRNTTLHRDLKLATVFDEINKHSSRYHDRLERH FT RNRLASALNRSRPPRRLNRRQPRDLITRSPLTRVRRS" XX SQ Sequence 4740 BP; 1336 A; 1215 C; 1183 G; 1006 T; 0 other; AATGTTAAAT AAAGGTTCGT GTCTAACAAT ACGCACCTGA CAAAGTGGAT TAAGTGAAAT 60 TAGTTTTCGC GGTAATAAAC TTATGGACAA GACCAGAATA CTGGCACACA TAGCAAATAG 120 TGACCCCCCA AGTCACTAAC AGTGAAATAA TAGTGAAACG AAAACATTTT CATTCAAAAA 180 TACAAAGTTA AGTTTCTCGA ACTGGGGCTC CGCTGCCCAG CTGCCACGCG ATCGCACAAA 240 CAGCTGTTTG CGAGCTTAAA GCTTTCTATC CCAGGGTTCA AGTTTTGGCT AGAACCCTGG 300 TGATTTGGTG CACACTTCAA TATGAACACT TTAAATGAAA CCGCTGCGGC TGATGAATCG 360 TTGGATACTG CGTTTCTCTC GAGCCCCCAA TGTGCTGCCC CGCAGCGCTT TCAAAAAATA 420 AAGCGAAAGT CTCGTGCTTC TCCGGAGACT GAAAGGAAAA AACCCAAATC AACCATCGGC 480 AAACAAGGGG AAAACCCTTC GGCTACAGAA CCTAGATATG GCGGCAATTC AAACCGATTT 540 GGTTTACTTG CGCATCTCAC AGCTGACAAA CAAGTAGGCA ATGAAATTGG CGATCTGTAT 600 GACCAGCCCA GTACCAGTCA TCAAGCTGCA ATTGCTGCCG CTAAGCGGGA TGCAGCCTCC 660 GCTGGTACCA CTAGCTCAGC CAAAAGAGCG CAGTCCAAAC CACCTCCTAT AGTAATGGAG 720 GGAGTGGACG ACGTATACCT GATGATGCAG AGCATCGAAA ATATAGTGGA CCTAGAAAAG 780 ATTGAGGCTA GGGCGTCAAT GAGCGGTGTC CTAAGGCTTT ACGCGGCTGA CGCTAATACA 840 TTTCGCACCA TAGTGAACTG GCTCGAGATC GAAGAGTATG AGTTCCACTG CTACCAGCTT 900 AAAGAGGACA GGCCTTACAG GGTATGCGTG AAAGGCCTGC ACCACAGTAC GCTACATCAC 960 CAAATCAAGG ATGAGCTGGA AAAGATCGGG CACAAGGTTC TCGATATTCA CACACCGCTT 1020 AGGCGAAACG AACCGGGTAC CTCAAAAGCG TCGCCAGTCA ATATGTTCTT CCTAAATATT 1080 GCTGCTGCGG CAAACAATAA GGAGATCCTG GCGGTAAAGG CACTATGCCA TATGAGAGTA 1140 GTTATTGAGC CTCTCCGCAA GCGTAACGCT ATTGTCCAGT GCCATCGTTG TCAGCAGATT 1200 GGCCACACAG CCAAATACTG CCGTAAGGCC CACATTTGTG TGAAATGTGC CGGCGAACAC 1260 CCAGCCAAGG ACTGTACCAG GCCACGCATC GAGCTGTGCA CTTGCTACAA CTGTGGCGGC 1320 CAGCATCCTG CAAACTATAA AGGTTGCAGC AAGCTACAAG CGTTCCTGCA GCGATCCAGA 1380 CCCAGAAGTG GAGTGGCTGG AAGAACAGAA GTAAGCGATC GACCAACTCC ACGGGGCTTA 1440 GCTGGAGGTA AGGAGATCCC CTCTTCTCGA GGCGGAATAT CTTATGCAGA TGTGGCTAGA 1500 GGGTCCATTC ACCACAAGCA ACCAATGAGC CTGACGCACC AGCAACAGAA GCAAAAGCAA 1560 CAGCCCTATG ATGGAAGCCC CAGTCGTCAA AGGAGCCGCA GCCGGACAAG GGCGTCTAGG 1620 GGTACACTCC AGCGCTCGAC GGATGCTAGC AGCAGCATTG AAGCCATCCT GCAGACGCTT 1680 AATGAGAACA TTAATTCTTT GCGCTCGATT CAAGAGAAGC AAATGGAATT AATGATGATG 1740 ATGATGAAGC AACAGCAACA ACAGTCACAT CAGCAGGGGC AGATTATCAA TCTGCTCACT 1800 GCTCTCCAAG CGCGTCAAGC GCCATAATGA TGCCGCTGCG CATCCTAGTG TGGAACGCCG 1860 ACGGCGTATC CACGAAGTTG CCTGAAGTAG AGTGCTTCGT GCGACGTCAC GAAATCGATG 1920 TATTACTGCT CAGCGAGACA CACTGCAAGG GGGCAGAGAC GCCTAAGCTA TTCGGATTTG 1980 TAGCCTACAC TGCCAATGAT CCGAGTGGTG GCAACGCCAA AGGCGGAGCA GCTATCTTAA 2040 TCAAAAATAG CCTTGCCCAC TTTCCGCTAA CACCAATAGC CACTGCCAAG GTGCAACTTG 2100 CGCCGGCGGT TATTGAAACG GCACTTGGTC CTATAAGCTT TGGAGCGGTC TACTGCCCAC 2160 CGAGATTTGC ATGGACTACG GACGAGTTTA AGGACATTTT GGAAGAGTTC CAGACGAAGT 2220 TCATTGTTGC AGGCGATTGG AACGCGTCCC ACTGGCTCTG GGGTGCGGGA AGGAGCAACC 2280 AAAGAGGCAT TGCATTAGCG AATCTCGTCC TAAATTCGGA GGTGGACTCG CTAGCAACAG 2340 GAGGACCAAC AAGATACCCG TACGGCTGTA GAGGCTCACC AGGGTACATC GATTTTGCAC 2400 TGACAAAGGG TGTGCTGGGC ATCCACGCTA ACATAAGTGC GGTTGTTGAG CTTAGCTCCG 2460 ACCACCTGCC TCTGGTAATT ACGCTGGATG CGGGGGCAAT ATCCTACCCT AAGATGGAGC 2520 GGCTTATCAC TAGGCGTACT AACCTGGAGG TATTCCAATC GCAACTGGAG TCCACACTGC 2580 CCCTCAACAC TGCCATAAAC TCTGGACAGG ACGTTGATGA TGCTATCGAA CTGCTCACCA 2640 ACAATATCAA GTCAGCAGCT AGATTGGCAA CTCGCAGCAT ATCTCGGCAG CCCGCGGCAG 2700 ATCGAATCCC AATACCCAGG GAGATCCTGC TGCTTATAGC TGAGAAGAGG CGCTTACGCA 2760 CTAGGTGGAT GAGGTCTCGG CACCCGTCGG ACAAAACGGA ATGGAACCGA GCTCTGAGTA 2820 GGCTCCGATG CGCGTTGGTG CTGCACAAAG CCGCATGGTT CGACGAAAGG CTTGCCAATA 2880 CCGGAGTCGA AAGCGAAGCG ACGCATTCGC TGTGGAAGGC CACGCGCGCA ATCAAAAGGC 2940 GTTGCACGAG GAAGGCGCCT CTAGTCGATA GCAACGGGAC ATGGTGTCGG ACCGACTTGG 3000 GACAAGCGGA GGTATTCGCT GCGCACCTCG CCGAGCGATT TCAACCATTC AAGCTTGCCA 3060 GCCTGCAACA GGTTGAAGAA ACTCAGGACC AGCTGAACCA AGCGCTTCAA ATGGATATGC 3120 CAATCACGCC GTTTGAACCC TGCGAGGTAG CCGAAGTCAT TGTGCGCCAG AGTAACAACA 3180 AAGCACCTGG ACATGACGTC ATCTGCAACG CCACATTGAA GGCCCTGCCC AGACAAGCGA 3240 TCCTCTACAT AACGTTGGTT TTCAACGCTA TTGTGAGGTT GCAATACTTC CCTTATCAGT 3300 GGAAGCTCGG GATAATCTCC ATGATCCACA AACCTGGCAA GCCGGAAAGG GAGCCCGCCT 3360 CCTACCGGCC GATCAGTCTC CTCCCTTCAA TTTCGAAGGT GTTTGAGAGA CTGATTGCTG 3420 TCCGGATTGT AAGCATTATG GAAGCCCAGG GGATTACCCC TGAGCACCAG TTCGGTTTCC 3480 GTGCTGGCCA CTGTACTGTC GAGCAGCTCC ATCGAGTCGT CGAGCAAATT CTGACTGCCT 3540 ACGACAGTAA GGAATATTGT AACAGCCTCT TCTTGGACAT TCGAGAAGCG TTTGATCGAG 3600 TGTGGCACAT TGGACTCCAA CTGAAAATCA AGCAGACGCT GCCTGCCCCA TATTTTGGGT 3660 TGCTGAAATC GTACCTGGAA GGAAGGAGGT TCGCTGTGCG CTTTCATTCA GCAATTTCCA 3720 CCGAGCACAA CGTGGCAGCT GGTGTTCCAC AAGGTAGTGT CCTCGGCCCC CTGCTCTACT 3780 GCCTGTATAG CCACGACATG CCGCAGCCAG ATGTAAGCCT TTACGGGAAA TCTATGTTGG 3840 CCACATTTGC CGATGACGTG TGCGTCACCT ACAGGTCCCG ATGCGAGCAC GACGCAGCCG 3900 ATGGTATCCA GGACTTTGCA TACCGGTTCT CGGAATGGGC AAGACGATGG AATATTGGCA 3960 TCAATAGCAG TAAATCCAAC AACGTCTGCT TCACTTTAAA GCGGAGAACG CCACCGCCCG 4020 TCTACATCGA GGAAGTCCCC GTACCACAGC CGAACGCAGC AAAGTACCTT GGAGTGCTTC 4080 TGGATCGCAG ACTCACATTT TCCAAGCATG TGACCGACAT CAGAACGCGC CTACGTGCTA 4140 AGGTGGCGAA GCACTACTGG CTACTTTCTT CGCGCAGTAA ATTGTCGCTA TCCAACAAGC 4200 TGACAATTTA CAAACAGATC CTAGCACCAA ACTGGAAGTA TGGGTGCCAA ATCTGGGGCT 4260 TAGCCTGCGA CAGCCACATC AAAAGGATCC AGGCTATTCA AAATAAGGTA GCAAGACTCA 4320 TCACCGGCTG CGAGTGGTTT GTTCGAAACA CCACCCTGCA CAGAGACCTG AAACTCGCAA 4380 CGGTATTTGA CGAAATAAAC AAGCACTCGA GCAGATACCA TGACAGGCTG GAGCGCCACA 4440 GAAATCGGCT GGCCAGCGCT TTAAACAGAT CTCGCCCACC AAGGAGGCTC AATAGAAGGC 4500 AACCGAGGGA TCTCATTACC CGATCTCCTT TGACAAGGGT CCGCAGAAGC TGACGCTTAT 4560 CTTAAATCCT ATTTGTTATA TGTGATTGTT ATGTAATTGT AGTTAAATTA CTGTAAATTT 4620 GAAAAAGCTA ACTATAGTTA GCCGGCGAGC CCAAATGGGC TGAATTAATA GATAAGAAGG 4680 ACACAAAGGG GCTTCAAGAC TTCCCCGTAT GCCTTAATAA ATAAATTAAA TAAAAAAAAA 4740 // ID AF222049 standard; DNA; INV; 5249 BP. XX AC AF222049; XX DR FLYBASE; FBgn0040267; Transpac. XX FT source AF222049:1..5249 FT SO_feature five_prime_LTR ; SO:0000425:1..330 FT SO_feature three_prime_LTR ; SO:0000426:4920..5249 FT SO_feature CDS ; SO:0000316:668..1603 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0040266; Transpac\gag" FT /db_xref="SPTREMBL:Q9NHF8" FT /protein_id="AAF36670.1" FT /translation="MNENSLAAFNSTMNCIKLIQNFDGTDTNQLPDLITQIENILPSFD FT VFDTNTKNILFGFIKNKFVGLCRPVIHRHGNMTEWESFKRVILKNFGEKETSDVLMDML FT KLCRVESTIEEYYNKINEISNRVHNRILIHDDKTYTTLEVNRIALRVFRDNLPEPTKTL FT IFARNPNSVEDAYKIIEDARHQSYTLYGPIRRNNRYNKPNFRTNFSNDNRNVNEPVVTE FT RSLDEANRFQQNNVEQQENSRTEGTPRVNNRRQGYQYQSNNSSTSNSARVSTNSRNVQS FT SEQSRRSFEPMDINQSSVNFQIEDQDEYHI" FT SO_feature CDS ; SO:0000316:1726..4839 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0040265; Transpac\pol" FT /db_xref="SPTREMBL:Q9NHF7" FT /protein_id="AAF36671.1" FT /translation="MGNKIKIRKLCRFPCFQEFEAEGYICEFLEYNFHNYFDGLIGCNI FT LNDLHAVIDYSNKKIQLNNVYLDLLVQENHLDLGIHNLIPMNLICPQARSSINELIQKF FT EHIFYKEGQDLSFTSEIKHRIITKNDLPIYSKTYKYPEIHKDEVNRQIAEMLDQGIIRH FT SKSPYNSPLWVVQKKMDQSNKQKWRLVVDYRKLNKETIEDRFPIPNIDEIFDKLGDCKI FT FSTLDLAKGFYQIEMDNKDVHKTAFSTTSGHYEFLRMPFGLRNAPSTFQRLMNNILSPY FT TGQFCIVYMDDILIFSKNIGEHVNHLSCIFSCLSKANLKLQSDKCEFAREEIEFLGHTI FT NSEGLKPSQKKIDAICKINLPTNQKQIKSFLGITGYLRRYIKDYSKIAQPLIKYLKKNS FT KINTHDQXYVEAFQKLKILITSDPIVVYPDFNKQFTIVTDASNYALGAVLMQDNKVISY FT ASRSLKNHELNYSTIEKELLAIYWSTKFFKYYIYGRKFIIKTDHRPLVWLNNLKEPNLK FT LQRWKVQLNEFDFDITFIKGKENALADGLSRIVKASTEEYDKNAQTNIEINNLDIFMLE FT NTNTLPEVTPKTKIINEDFSKDLIHIFANDIDDLETIHSADSDDLNFISITTNCLNVFK FT IQIQIIEAESDLSTFKILHNKKIRHIFKSSGNQENMLKFMQEKLPEKGLVVIFCENLSL FT FVKFQEIYRQYFSSNKNLRILKSGTLLEDINDKEKLLKIIENEHIKNNHRGINEIFISI FT REKYYYPKMQKFIQNYINNCKICNLAKYDRQPIKYNFNITETPDKINDIIHIDIWYPKR FT NIMYVTSIDKMSKYATAQHIKDRSWISLLNAIKLRIQYLGKPKKIVTDNEFDIVVIKQF FT LLENNIDIHFTTPYKKTGNSDVERLHLTLNEHMRLYNADPNNFDTIQENVYKAIVCYNN FT TIHSTTNIRPIDLFNNILCHEDISKLSEKLVRNKQTLNAKQDNINESNFSNIFVKNNQV FT GKSNPKYKEISNYTQYGNYLINNKDNKRKIYKDQVKRKYKYQNEDCET" XX CC Derived from AF222049 (AF222049.1) (Rel. 62, Last updated, Version 1). CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5249 BP; 2045 A; 905 C; 832 G; 1461 T; 6 other; AGTCATGTAT ACCCTTACAT ACCCTCTTAT TAGATTAAGT TATGTTAGAA TACACATGGC 60 GCTAAGCTTT TTGTCTGAGC TTAGCTTTAT TTTTAGTCAG TAGATAAGAA GACCTCGATC 120 TGTACACATC ATTGAATAAA CCCTCAATTT GTTAAGCGAA ACCTTTCGTT TTGTTTTAAC 180 CCCTCTTTAA TCGACAAAGA AGTCGATGGG TGCTTAACTG AGGAGTTCAA CCTTCAACCT 240 ACAGACGACA ACGAAGTCGT TTCTGGAGGC AACGATCTTT TGGACACAAC GGTCTCTGCT 300 CGGAAGGCCC TATCAACCTT CCACATAACT GGCGACGAGG ATCAACACAG AAAAAATAAG 360 GAAACCAGCC ATACGTCAAG AAATTCATTC NGAACCACAG TNAGCTGNTN TTATGTAATT 420 TAAGTCCGGA AATCTTCAAT ACGTTAACCA ATGCACGTGG CCGAGTATAA AGAGGAACAT 480 AAAACATAAC TGGAAAGCTT TGGAACACCT GCATGTACGG AGTGACGAAA CCCAACCTGG 540 AGAAGAAGAA CTTAAGCTGA TAAAATTTTT TTTTTCTTCG ACAAAGCTCT TTATACTAGC 600 TAAGATAAGA AACTAAAAGC TAACTATTAA TACTTGCGAA CCTCGAGAGA AAACAAAGAA 660 ATCTAAAATG AACGAAAATA GCTTAGCCGC ATTTAACTCA ACAATGAACT GCATCAAGTT 720 AATACAAAAT TTTGATGGTA CTGATACAAA TCAATTGCCA GATTTAATTA CACAAATAGA 780 AAATATCCTT CCTTCTTTTG ATGTTTTCGA CACGAATACA AAAAATATTT TGTTTGGCTT 840 TATAAAAAAT AAATTTGTGG GATTATGCAG GCCAGTGATT CATAGGCATG GAAATATGAC 900 AGAGTGGGAA AGTTTTAAAC GTGTAATTCT GAAAAATTTC GGGGAAAAAG AAACAAGCGA 960 TGTCTTAATG GATATGTTAA AATTGTGTCG AGTGGAATCA ACTATAGAAG AATATTATAA 1020 CAAAATTAAT GAAATTTCGA ATAGGGTACA CAACCGGATA CTGATCCATG ACGACAAAAC 1080 ATACACCACA TTAGAAGTGA ACCGAATTGC ACTCCGCGTT TTTAGAGACA ACTTGCCTGA 1140 ACCTACAAAA ACGTTAATAT TCGCGAGAAA CCCAAATTCC GTAGAGGATG CCTATAAAAT 1200 CATTGAAGAT GCTCGGCATC AAAGCTACAC ATTATACGGA CCAATTCGAA GAAATAATAG 1260 GTACAATAAA CCCAACTTCA GAACCAATTT CAGTAATGAT AATAGAAATG TAAACGAACC 1320 AGTCGTTACA GAAAGAAGCC TTGATGAAGC CAATAGGTTT CAACAAAATA ATGTTGAACA 1380 ACAAGAAAAT TCAAGAACTG AAGGTACGCC TAGGGTAAAT AATAGAAGGC AGGGATATCA 1440 ATATCAATCG AATAATAGCT CAACTTCTAA TTCTGCTCGA GTCTCAACAA ATAGTCGAAA 1500 TGTGCAAAGC TCTGAGCAAA GCAGGCGTAG TTTCGAGCCA ATGGATATTA ATCAATCAAG 1560 TGTAAATTTT CAGATAGAGG ATCAAGACGA ATACCATATA TAGAAGTTCA AGGTAAAAAG 1620 AGGCCATTAT TATTTATTAT TGACACTGGA GCAGAATTCA GTGTAATAAA TGATAATTTG 1680 TGTCACCCGA AGTGGAAAAC AGACATTGAC ATAGAAGTAA AAGCAATGGG AAACAAAATT 1740 AAAATAAGAA AATTATGCAG GTTTCCATGT TTTCAAGAAT TTGAAGCTGA AGGTTATATT 1800 TGTGAATTTC TAGAGTATAA TTTTCACAAC TATTTCGATG GGCTAATTGG CTGTAACATA 1860 CTTAATGACT TGCATGCAGT CATCGATTAC TCAAACAAAA AGATTCAACT CAATAATGTT 1920 TATTTAGATC TTTTAGTTCA GGAAAACCAC TTGGATCTTG GCATACATAA CTTAATTCCA 1980 ATGAACTTAA TTTGTCCACA GGCCAGATCC TCAATAAATG AGCTTATACA AAAATTCGAA 2040 CATATTTTTT ACAAAGAAGG CCAAGACCTA AGTTTCACAA GTGAAATCAA ACACAGGATT 2100 ATTACAAAGA ATGATCTTCC CATTTATTCG AAAACATACA AATACCCAGA AATTCACAAA 2160 GACGAAGTTA ACCGGCAAAT AGCAGAAATG TTAGATCAAG GAATTATAAG ACACTCTAAA 2220 AGCCCATATA ATAGTCCTTT GTGGGTTGTT CAGAAAAAAA TGGACCAGTC CAATAAGCAA 2280 AAGTGGCGGC TCGTCGTAGA CTACCGTAAA CTTAATAAGG AAACCATTGA AGATCGATTC 2340 CCGATCCCAA ATATAGACGA GATTTTCGAC AAATTGGGCG ACTGTAAAAT TTTTTCAACT 2400 TTAGACCTTG CAAAAGGTTT CTATCAGATA GAAATGGACA ACAAAGATGT GCACAAAACA 2460 GCCTTTTCAA CAACCAGTGG CCATTACGAG TTCCTTCGTA TGCCTTTCGG ACTACGGAAC 2520 GCACCTTCAA CATTTCAAAG GCTAATGAAT AATATCCTTA GCCCCTATAC CGGACAATTT 2580 TGCATTGTTT ACATGGATGA CATCCTTATA TTTTCTAAGA ATATAGGGGA ACACGTCAAT 2640 CATTTAAGCT GTATCTTCAG CTGTCTATCT AAGGCAAATT TAAAACTCCA ATCAGACAAA 2700 TGCGAGTTTG CCAGAGAGGA GATTGAATTT CTCGGACATA CCATTAATTC AGAAGGTCTA 2760 AAGCCAAGTC AGAAGAAAAT TGACGCAATT TGTAAGATAA ATTTACCGAC AAACCAGAAA 2820 CAAATTAAAA GTTTTCTAGG CATCACAGGT TATTTGAGGC GCTACATAAA AGACTACTCC 2880 AAAATTGCCC AGCCCTTAAT AAAATACTTA AAGAAAAATT CTAAAATAAA CACACATGAT 2940 CAANAATACG TAGAAGCTTT TCAAAAACTA AAAATCCTAA TAACAAGCGA TCCGATAGTC 3000 GTTTACCCAG ATTTTAACAA ACAATTCACG ATAGTTACAG ACGCAAGTAA CTATGCATTA 3060 GGCGCCGTGC TCATGCAAGA TAATAAAGTA ATTTCTTATG CTAGTCGTTC TCTTAAAAAT 3120 CATGAACTTA ATTACAGTAC AATAGAAAAA GAGCTTCTTG CAATCTATTG GTCAACAAAA 3180 TTCTTTAAAT ATTATATTTA TGGAAGAAAG TTTATAATTA AAACCGACCA TAGACCCCTA 3240 GTTTGGCTTA ATAATCTAAA GGAACCCAAC CTAAAGTTAC AACGTTGGAA AGTGCAATTA 3300 AACGAGTTTG ATTTTGACAT AACATTTATA AAAGGTAAAG AAAACGCTTT AGCAGACGGA 3360 CTTAGCAGAA TTGTCAAAGC ATCAACAGAA GAGTATGATA AAAATGCCCA AACAAATATA 3420 GAAATTAATA ACTTAGATAT ATTTATGCTA GAGAACACTA ACACATTACC GGAAGTTACG 3480 CCTAAGACAA AAATTATAAA TGAAGATTTC TCGAAAGATC TAATTCATAT ATTTGCCAAC 3540 GATATTGACG ATTTGGAAAC TATTCACAGT GCGGACTCAG ACGATCTTAA TTTTATTAGT 3600 ATTACAACTA ATTGTCTAAA CGTTTTTAAA ATCCAAATAC AAATAATAGA AGCGGAAAGC 3660 GATTTAAGTA CCTTCAAAAT TTTGCATAAC AAGAAAATAC GACATATCTT TAAATCAAGC 3720 GGAAATCAGG AAAATATGCT TAAGTTCATG CAAGAGAAAC TGCCAGAAAA AGGCTTAGTT 3780 GTAATATTTT GTGAAAATTT AAGTTTATTT GTAAAATTTC AAGAAATTTA TAGGCAGTAT 3840 TTTTCAAGTA ATAAAAATTT AAGAATTCTA AAGTCAGGTA CATTGTTAGA AGACATAAAT 3900 GATAAAGAAA AACTCCTGAA GATTATCGAA AACGAACACA TAAAGAATAA TCACAGAGGC 3960 ATTAACGAAA TTTTTATATC TATAAGAGAA AAATACTATT ATCCAAAAAT GCAAAAGTTT 4020 ATTCAAAATT ATATTAACAA TTGTAAAATT TGTAATTTAG CCAAATATGA TAGACAACCT 4080 ATAAAATATA ATTTTAATAT TACAGAAACG CCAGATAAAA TAAACGATAT AATTCACATA 4140 GACATCTGGT ATCCTAAAAG AAACATTATG TACGTTACAT CAATTGACAA AATGTCTAAA 4200 TATGCCACTG CTCAACATAT TAAAGACAGA TCTTGGATTT CTTTGCTTAA CGCAATCAAA 4260 TTAAGAATTC AATACTTAGG AAAGCCTAAA AAGATTGTAA CTGATAACGA ATTCGATATT 4320 GTCGTAATTA AGCAATTTCT CCTCGAAAAT AACATAGATA TACACTTTAC AACTCCTTAT 4380 AAGAAAACTG GAAACTCTGA TGTGGAAAGA TTGCACTTAA CCCTAAATGA ACATATGCGA 4440 TTGTATAATG CTGATCCAAA CAATTTTGAC ACTATTCAAG AAAATGTTTA TAAAGCAATT 4500 GTTTGCTATA ATAACACAAT TCATTCAACC ACAAATATTA GACCGATAGA TTTATTTAAT 4560 AATATACTTT GTCACGAAGA CATCTCTAAA CTATCCGAAA AGCTAGTAAG AAACAAACAA 4620 ACTTTAAATG CAAAACAGGA TAACATAAAC GAGTCAAACT TCTCAAATAT ATTTGTAAAA 4680 AATAATCAAG TAGGAAAATC AAATCCAAAA TATAAAGAAA TAAGTAATTA CACTCAATAC 4740 GGTAACTACT TGATTAATAA TAAAGACAAT AAACGAAAAA TCTATAAGGA TCAGGTGAAA 4800 CGGAAATATA AATATCAAAA TGAAGATTGT GAAACCTAAC ATTTCTGTAC TCCTATTATT 4860 TACATATGTA TTACCTAATG AAATAATTTC AACCACATAC TTGAAACTTA NGCTGGGGGA 4920 GTCATGTATA CCCTTACATA CCCTCTTATT AGATTAAGTT ATGTTAGAAT ACACATGGCG 4980 CTAAGCTTTT TGTCTGAGCT TAGCTTTATT TTTAGTCAGT AGATAAGAAG ACCTCGATCT 5040 GTACACATCA TTGAATAAAC CCTCAATTTG TTAAGCGAAA CCTTTCGTTT TGTTTTAACC 5100 CCTCTTTAAT CGACAAAGAA GTCGATGGGT GCTTAACTGA GGAGTTCAAC CTTCAACCTA 5160 CAGACGACAA CGAAGTCGTT TCTGGAGGCA ACGATCTTTT GGACACAACG GTCTCTGCTC 5220 GGAAGGCCCT ATCAACCTTC CACATAACT 5249 // ID CIRC standard; DNA; INV; 7450 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0022937; Circe. XX FT source nnnnnnnn:1..7450 FT SO_feature five_prime_LTR ; SO:0000425:1..240 FT SO_feature three_prime_LTR ; SO:0000426:7211..7450 XX CC Sequence from A. Villesante, 20-March-2004. CC Consensus sequence. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7450 BP; 2217 A; 1543 C; 1441 G; 2249 T; 0 other; ATAGAATAAG GCCAATAACA TCTGGGCCTC ATCTTCAATA GGTACACTTT TTTGTCCGCT 60 TCTTAACCGT TTGGTTATCG TTTTTTTTTT AATCTACCGT ATCTCTGTCT ACAGTATTGT 120 AGCTGTGACA CTCTTGTTCA TCTCAACACA AAACCCACGC CCATTTTCCC TGAGTATGCC 180 GGTAAAAATA GTAGACAGAG TGTGTTGGGG TGGACAACGC TCTATGAGTG TGTTCGTTAG 240 AGTTATAATT GGCGCCCAAA CGTGGGACAG CCCGATGCCT GATTAGGCAC TTGTCGTTTG 300 ACAACCGAGC TTCCCGTTAA TAAACAATCC ATCCACTCTT AGTTTCAGTT AGGTATTTCA 360 TATTCCGTCC TGGAATTTAT GATATCTGGA GAGTCCCGGA GCCATACCGG AGGAAATGTT 420 ATGCTTTTAT ATGTTTTTTT TTTCTATCAG TTCGGTAATA ATCGAACATT ATTGTTAAAA 480 AAAAGAACAT TCTAATTTGA GTAATTGGTT CTTAGGAATT ATAGTTCTGA TTTAAAGTCG 540 GTTCTATACC AGGATTTCAC TATTCAAAAA TTACCTCCTT ATCATTTAGG ATCGAGATTT 600 TTTCTAGCTG GATAGATTCG GCTAGTTACT CTTCTAAGTG GTAGGAGTTC ATAAATAAAT 660 AAATGGCATT TATTTTACAA CTCTGTATGC ACACCCCGTT AGCTTTTCGT AGCTATTGAA 720 TTATACAACC TACTGCTTTG CTACTCAGGG TAAGACCCGC AGGTATTATT TTGGCGCGTT 780 AACTTTTATC TTTTTGTCGC TTAGCGTTGA TCGTTGGATC TTAGCGCGTA CACATTATCT 840 TTTTTCGCTT ATCTTTGATC GTTGGATCTT ACCGCTTATT TTTCTTGTTC TAACGAGAGG 900 AATTGTTAGA CTTGTTAGAC TTGTTAGACT TTATATATTC TTCTAGAAGC TTCGAAAATG 960 ATGATGCGAA GTAATGAAAA TTTAGTTGAC GTAGATACGG ATGCGTCGGT GACATGCACC 1020 ATCTGCAACT GGCCAGTACG TAATCTGGAT TTGGTTAAAA CCCGTCGTAA ACATTCTTTT 1080 CACAAACATT GCCTTGACAA TTACCTTAAG AACCATGACA AATGCCCCAT ATGTGGCCAG 1140 CCATGTTCTC AATCCGAGTT AGGATCTCAT TTGCCTTTGT CAACTAAATC TCAAGGTAAC 1200 AGGATGATGA CCCGTTCAGG TTCTCGGAAT ACAGCCAAAC AACATTAGAA GTCTGATCCA 1260 AATTCCCAAC AGGAAATTTG TAGCGGGGAT AGTTCAGATT CCAGAGTTGT GACAGAAGAC 1320 CGTGTTCAGC AAATAGTCCA AAGTTCAATG CAGGCCTTTC AAGCAAGCAT GCTACAATCA 1380 GTGCCAGAAC AAATCACTAC GGCGTTCACT CAATTGAATA AATCCACATT TGTAAATAAT 1440 GGACAGAGTG AGCTGCAGCA AGACATACAA AATAATTTCG GTCGAGACGC TCCCGAAATA 1500 AACTCAGAAA GAAATAACAA CTCACATGTT AGGACTTCAC CAGACTTTCG TAGCGATCGA 1560 AGTGATCTTT CTTTGGATAG ACCTGATAGG ATTTCCAATA TTATATCAAA TTGGCGTATT 1620 AAATTCAGCG GTTCAGCCAA CGACATTGCC ATTGAGGATT TTATTTACCG CGTAAATTGT 1680 CTCACTTCGC AAAGCCTGAA CGGAAACTTC GAACTTCTTT CCCATTTCAC TAATTTACTG 1740 TTTGCAGGAC CGGCCTTAGC ATTTTATTGG CGTGTTCAAA GATCGGTGGA TAACATGAAT 1800 TTGAACGTGT TATGCAGGCG TTTAAGGGAA AGATATCAAG ATCAGAGATC TGATCGTGAA 1860 ATTAAAATTG CTATGCGTCG CCGCAAAGAG GGTAGCACAG AAAATTTTGA TGATTTTTTG 1920 GATGCAATGC TTTCCATTGC AGATTCCCTT AGTGAACCAA TGCAGGACAG TGAGATTACT 1980 GTAGAAGTTC GTCATAATCT CAAACCGGAG ATCAAGCACG AATTACTGCA CATAGACACC 2040 CCAAACTTAG CAATATTGCG TAAGGAATGT CATCGTCACG AAGATTTCTT TCGAAGCACT 2100 CGGACAAAGC CAATCCAACG TCCGAACACA AGCAAGCGTT TTGTTAACGC AATCCTGCAT 2160 GAAGATGATT CCGAAGATCA GTCTGAGGAA GAACAGGATG TCGAGGACGA GATTTGTGTC 2220 GTTAGGACTT CTGAGAAAAT TAAGTGCTGG AACTGCGATG AATCAGGTCA CGGTTACCAG 2280 AATTGCCTTA AGACACGCCG TATTTTTTGT TATGGTTGCG GAACTCCGGA GGTGTATAAA 2340 CCCAACTGCG CAAAATGTAA ATCGACATCG GAAAACTCTC AGCAGGGCAT TCGTTATGCG 2400 AACAAAACGA ATGTCCGCGT TAGCCTCGAA AATACAAACT GAACCAAATT CTGTGGCAAA 2460 TGATGTTCAT TCCATTCTGC CCTCAACTCA ACCTGACGTA ACATTATCCC AAACTTTTAA 2520 TCCATATCAT TTGCAAGTGC TGGAGTATGC AAATCGTAAA TTGCAAATAT TTAGGGAAAA 2580 TTCACTTCAT AATCGAAAAC ACCGATCATC ACGGCGAATT CGTAAAATTT GGATTCGTAA 2640 ACGATCATTC AACCGTTTTA CGATTTCATC AATTGTTCAA AATAAGAACG ATATTCGACC 2700 TTTCACACAC ATTGACATAT TTGGTCAATC GCATTTAGCT CTCCTAGACA GTGGCGCCAA 2760 CAAAAGTGTT ATAGGTGGTG AGCTAGCACA ACAATTAATA TCAAGCAAGC CATTTAATAA 2820 ATTTAAATCG GTTGTGCGAA CTGCTGATGG CCAAACGCAA AATGTTGCAG GCACTATACA 2880 AATTCCCTTG ACTTACAACT CAGTTGATAA TAACTTCGAA TTCTTAATAG TCCCTTCCAT 2940 AAAGCAAAGC GTAATATGCG GAATGGATTT CTGGTATACG TTTGGAATTT CTATTAAGCA 3000 AACAGTTTTG ATCAACGAAA TCAACTTTGA ACCCGAAGAA GACTCTACTC GCGTACGATT 3060 GTCTGATTCA CAAAAATTAA AATTGCAGAA AGTAATAGAC TTTTTCCCAT CGTTCGAGAA 3120 TGAGGGCTTA GGATTGACTA ACCTGATAGA GCACAATATC GATACATCCA ATGCGAAACC 3180 AATCAAACAA CGATTTTATC CACTTTCGCC TGCTAAAGAA AAACTTTTGT GTATAGAAAT 3240 CGATCGCATG ATTAAGATGG ATGTCATTGA AGAGGCACCT TCATCGCCTT GGTCATCTCC 3300 AGTTACAATG CATATCAAAC CAGGCAAGGT GCGATTTTGT CTCGATGCAA GAAAATTAAA 3360 TGCAGTCACC GTAAAGGACG CATACCCAAT CCCAATTATG GATGGACTGC TAAGTCGTCT 3420 TTCACCGGTA CATTGCATTT CCAAAATTGA CCTCAAAGAT GCCTTTTGGC AGATCTATTT 3480 AGATCAAGAA TCCCGTGCCA AGACTGCTTT CACCGTTCCA AATAGGCCCC TTTATCAGTT 3540 CAAGAGAATG CCTTTTGGAT TGTCAAACGC TCCACAAACC ATGTGCCGTT TGATGCACCT 3600 CGTTATACCC TATCAATTAA AATCGCATGT ATTAGTCTAC CTCGACGATT TGCTGGTTTT 3660 GTCAAACAAT TTCGAAGACC ATCTTTTGCA TTTGTCCGAA GTAGCCACCC AATTGCGTAA 3720 AGCTGGATTG ACAATTAATG TCCAGAAAAG TCAGTTCTGC CTGAAAACAG TAGACTATCT 3780 CGGTTACCTG GTGGGCGAAG GCACACTACA AGTAAATCCG AACAAAATCG CTGCTGTGAG 3840 AGACTTTCCA GTTCCAAAGA CCCAAAAACA ACTAAGGCGG TTTCTTGGTA TGACTGGTTG 3900 GTACCAGCGA TTCATATCTA ATTACTCCAC ATTCATTTTC AATCTTACAG AGTTGTTACG 3960 TGGCAAATCC TTCAGTTGGA ACGATGTTGC TCAAGAAGCT TTTGATAATA TCAAAGACAA 4020 GTTATGCTCT GCTCCTTATC TCATTCACCC CAATTATGAC AAACCATTTA TCCTGCAGTG 4080 TGATGCTTCA CTACATGGAG TGGGTGCAGT TCTAGCTCAA TGTGATGATT CCGGTTGTGA 4140 ACGTCCGATA GCATTCATGT CTAAGAAGCT GAATAAAGCC CAACGTAACT ATACAGTTAC 4200 AGAACTCGAA TGCATGGCTG TAGTTCTGGC AATTAAGAAG TTCAGAATGT ACATCGACGG 4260 TCATAGCTTC AAAGTAGTCA CCGACCACTC AAGTCTTCGA TGGCTAATGA ACCAATCAGA 4320 TTTAAGCGGG AGATTGGCAA GATGGGCTAT CAAACTTCAG GGTTATTCCT TCGAAATCGA 4380 GCATTGCAAG GGAACAGAAA ATGTGGTTGC GGACGCTTTG TCTCGATCGG TTCGAGGATG 4440 TTGACGATAT TGCCGCAGTA GACCCTTGAA GTCCATTCCC GAATTGATTT ATCTTCCAGT 4500 GCATTTCCAT CGGAAAAGTA TTCTGCTCTC AGAGATAAAT TAGTGTCTCA AAAATTACCT 4560 GATTTTCAAG TCAGTGATGA CTATATTTAT CATCGGGCAA CATTCCCCAA TTGTACTGAC 4620 GTTTCACCAG ACGATTGCTG GAAATTGCTT GTACCTGAGT CGTTGCGTCA TAGCGTAATG 4680 AGTTCGGCTC ATGATCAACC AACATCTGCA CATTGTGGAA TGGCTAAATG CTTAGAGCGT 4740 ATTAGACGGC GTTTTTACTG GCCAAACATG GTTATCAACG TTCGTGATTA CATTCGAAAT 4800 TGCGAGACGT GTCAGACCAC TAAATATCTC AATCGCTCTA AAAAACCACC AATGGCAGCA 4860 CAAGTGCAAA GCGATACAAT TTTTCAAAGA CTTTATCTTG TTTTTTTCGG CCCATTTCCC 4920 AGATCGAAAT CCGGCAACAT TGGTATACTT ATTATTCTGG ATAACTTTTC CAAATTTACC 4980 TTTTTAAAAG CTGTGAGAAA GTTCAACACT AAAGTGATCA TCAGTATATT GCGGGATGAA 5040 ATATTTGCAT GCTTCGGTGT ACCCGAAACA GTCGTGAGTG ACAACGGAAC TCAATTTAAA 5100 AGCCGTGATT TCTCTGACTT TCTTTCGAAA TATGGCGTCC TTCATATATT TACTGGTGCA 5160 TATGCTCCAC AATCCAACGG AGCTGAGAGA GTTAACCGTT CTATAAACGC TGCTTTAAGA 5220 GCTTACATTC GCTCTGACCA TCGCGAATTG GACGTGTTCC TCAGTAGCAT CAACTGTTCT 5280 CTTCGTAACT CGATTCACCA ATCTATTGGT ATATCGCCTT ATCAGGTTGT TTTCGGTAAA 5340 CACATGATAT CCCATGGTAA TGACTACAAA CTGTTGCGCA AACTTAATTT ACTCACAGAA 5400 GGTGACGTAA AATTATCCAG AACTGACGAA TTTCAAAGAA TTCGCTCCAA CATTGCCAGA 5460 CATTTGAATA AGGCTTATGA GACAAACCAA AAGTCTTACA ATCTCCGAGC ACGACCCCGA 5520 TCTTTCGATC TTCTGAGGTC AAGAAGTTGT TAAAAGAAAT TTCGTTCTAA GTAATGCAGC 5580 GAATAATTTT AACGCGAAAC TCGCTCCTGT CGGTGTTAAG GCCGGAGTCA AAGAAAGGAT 5640 TGGTCAATCA ATATATCTCT TAGAGGACAT GAACGGCAAG GAAATTGGTA GATTTCATGC 5700 CAAGGACATT TGGTAATTGC TTTTGACTCC TTTTTAAGTT TTTATTGTTA GTCACAGCGA 5760 CTAAAAATCA AATCTGTGGT TGTGGGAGTC GAGCCAAATG GCAACCTTAT ATGTCGTACC 5820 GCACTTATTT GGTAACGAAC CATTTTAGCC AAATTCAAAT CCTGGCAAAA TTGAATATTG 5880 CCCTATAAAA ATACCACAAA ATTCCCGAGT AGGATTATAG CAAGCTACGG TCACATATGA 5940 TTTGCATTCT TCCGTTCTCT TTTCGTGAAG GCGTGATCGA GTGCAGTTTA ACTTTATGCA 6000 AAATCCATGA TTAAAAATCC ACGAGCATCG TGTGGTTTGG ATAATAGTTT TTCAAGGCAA 6060 ATACTGCCAA GTGATAGTTT CCTTTAACTG CGTCGCTGCC GGTCCCACAT TTGGAGATAG 6120 AAGTCTTAGT GGCTGCGTCG CCAAGGCGGT CTGTATTGTT TTTCGCCAAA CTTTTGACCA 6180 CGCGTGAAAC TTACTGTGGC TTCATCGATC GCAGAAAGAT TTACAACTGC TTGCATCAGG 6240 AGCACCCACC GCGTGCAGCG GAAAGCTGGG CTTGTCCTCT TCTGCTGCCG ACAGATCGCA 6300 GGTCAACCCC CCCCCCCCCC AGAGTTGGCG TTCTTTTTTT TTTCTACTCT TCATCGCCGC 6360 ACGCAGGAAG CAGATCGCCG TGCATCCATT TTATGCACAA GTCAACTTGG TGTCTTCTTC 6420 ATCCGTGCGC ATTCCAAGAA TATTTGTGCA CAACACGAGC CCCAGGCTGC TGGCCTTGTC 6480 GCTGCGTATG CTGCGAACAT GAAGCAGCCG CAACACGAAT GATCGTGTCC AGTCTTAAGT 6540 ATCTTTAGAA TAGTTTTTCA TATTTGTGTT AATTTTACCT AAAAATATTT ACTTATAAAT 6600 TTCTACGATT CTAATTTAGC TTAACTTGTG GGAGATCTGA CAAGTATATA TATGTGATTT 6660 TCCCGATTTA TATTAATACC TACAGCAAAT CATCAACAGT CATCAGGCCT GCTGACATCG 6720 ATGGAAAGTG CGTGAGTACG GCACAAAAAT AATAAAAACA CACAAGCATA TATATAAAAA 6780 GTCAGATACT CCTAAATAGT GTGCTTAAAA TACCTTGCCT TAATTAATCA CATATCTAGC 6840 TCTCTTAATT TAAAGAAGTT TAAAGTTTAA ACATACATTA ATCATACATA ATTCTAACGT 6900 AAAGTAAGCC AAACAAAGAA AAGCATTAAT GTAAACTTAA ATACCAGTAT TTGCATTCCT 6960 ATCTTGATTA TTTTTCCTTT ATTTTATCGT TGATAAGCAA AATTACTCAA TAAGAGATTG 7020 CCTTAACGTA AATAACGTAA TTTTTAGTTT AAGGATTAGG TTTAGGAAAA TGTATCGGAA 7080 TATTATCTTT ATCTTCAATT ATATGGGGTC CAAAATGAAC TAATCTGACG TCTTGTTGGG 7140 CACTAAAGGG TATTTAAAGC AGTGGTTGTA TTAGGGTGGA GTCCGAATGG ATTTCACAGT 7200 TAGGGTGGGT ATAGAATAAG GCCAATAACA TCTGGGCCTC ATCTTCAATA GGTACACTTT 7260 TTTGTCCGCT TCTTAACCGT TTGGTTATCG TTTTTTTTTT GAATCTACCG TATCACTGTC 7320 TACAGTATTG TAACTGTGAC ACTCTTGTTC ATCTCAACAC AAAACCCACG CCCATTTTCC 7380 CTGAGCATGC CGGTAAAAAT AGTAGACAGA GTGTGTTGGG GTGGACAACG CTCTATGAGT 7440 GTGTTCGTTA 7450 // ID DME278684 standard; DNA; INV; 5108 BP. XX AC AJ278684; XX DR FLYBASE; FBgn0041728; Rt1a. XX SY synonym: Waldo-B SY synonym: pilger XX FT source AJ278684:32..5180 FT SO_feature five_prime_UTR ; SO:0000204:1..436 FT SO_feature three_prime_UTR ; SO:0000205:4854..5149 FT SO_feature polyA_signal_sequence ; SO:0000551:5149..5182 FT SO_feature CDS ; SO:0000316:437..1933 FT /db_xref="FLYBASE:FBgn0041727; Rt1a\gag" FT /db_xref="SPTREMBL:Q9N9Z2" FT /protein_id="CAB99191.1" FT /translation="MDRTGGGSAPDPNDPFRRSGRLSRSPIRGVGTQIQGGGGEQAGPP FT PPKASDCSAAVEVETLATTVSTTATSLKTMDFISVPAQRAGAKSPSGSPHRSPELTTTL FT QNEDLQGILDMMKAKITAILSSFETRRHVTSEDRGVLVDLSALNKRAIELQEGINKKPP FT PRSTATQTEAEKTKRSQVAPPRQALPNVQVRKTDNHRSAAKSTGKALPTTADSKPESYA FT SVAKDANKDEEWAKVKPKRLRKKPEALILKKTGEVTYSDMLRKMKAEPSLTEFGKHVRK FT IRRTQQGELLLELEGKASEVIPSFKNELEATLKEIASVRTGAHRTALICSGLDETTTAQ FT DLHNSLVSQFQGIRLEPEDVRGLRRRRDGTQIASVLMCANDAIAVINRGVVTVGWSRCR FT IAQDVRPIRCFRCLEFGHRAPYCKSVDRSDCCLRCGEHGHKAKGCVAPPRCLICSSDVD FT KNHATGGFACPTYKANTKGANSRQNDARRN" FT SO_feature CDS ; SO:0000316:1884..4853 FT /db_xref="FLYBASE:FBgn0041726; Rt1a\pol" FT /db_xref="SPTREMBL:Q9N9Z1" FT /protein_id="CAB99192.1" FT /translation="MMPEEINIIQLNVNHCAAAQNLLTQTAKERHADVVLLSEPYLPGV FT GNSGVLLDETGKAAIKCTSRLLVEEWDTVPMRGIAYAKIRGIHFYSCYAPPSDSPEQFE FT DMLEKLVNHASGRRPTVIGGDLNAWATEWGSRISNTRGRAVIDAMNLLDLVLLNDGFKP FT TFNNDRGTSFIDVTFVSRVLVAGSNWMVHEDITLSDHNLITFGARTMRPSPKQRRCALG FT PVWDIRKLDEDMLAYQIEGMESIIGHAETMVTALMDRLRAMCDAVMPRKRNTKRKPPVY FT WWSDSLHQLRTECIKARRQAQRSRGQPHHSQCIEVYKAKRTELKNGISAAKANAFKDLI FT DSVDDDPWGLAYKVVRKKLNSAGAGSPQDPAALANIVSELFPSQHTLWQPAVDPPASDF FT PCITSCEVVEAAKRIRPNKAPGIDGIPGVIVKAAATARPEVFRDTFQQCLLDGVFPKRW FT KKMKLVLLLKGKGPANVPRSYRPLCLLDIVGKLFERILYARIELITESPTGLQGQQYGF FT RKGKSTLDALKSVTDAARKALDGNRWLGGSKKYCAIITLDVKNAFNTARWPIILGAMRN FT LGVPDYIRGVIGNYFRDRVLWYVTEDGPRSHQVSAGVPQGSVLGPILWNIMYDGILSIS FT KPRGVELHCFADDVAITAVAKTIPELQDISNVAITAAIEWLEKVGLKIAAHKTEVVLLS FT SRKSVECMRVEVKGVEIASAETLKYLGVLIDRRLSFKAHARYASKKAAMTAAALARIMP FT NVGGPRLPARRLLVAVSKATLLYAAPIWSCVSTKKTYLDSARAVSRTMALRLIRGFRTI FT SDDAAHALSGITPIDLDIKGKYLASEGYTQLEIKEWIRGVWQTRWQESQRGRWTYKLIP FT QLTEWADCEHKTVDYHMTQFLTDHGCFRGYLCRFRHVDTAQCLYCTDAVETAEHILLHC FT SRFAEERAQLVALAGSPLSPRGLVAAMMADKIVWDGAHVIIVTMMKRVRKDEMANRNYR FT " XX CC Derived from AJ278684 (AJ278684.1) (Rel. 64, Last updated, Version 1). CC Michael Ashburner. CC Any changes to original sequence record are annotated in an FT line. CC [polyA_signal annotation looks odd.] XX SQ Sequence 5108 BP; 1411 A; 1307 C; 1398 G; 992 T; 0 other; ACATCCATTG TCCCGCGTAT TTCGCACGTC TTTTTGCTGC GCTGCGCGAA TTTCGTCATA 60 TACGAGTCCG GTCAGCCGTA AACTGCAACA TAAACCTGAA ATCCACTCTG GGGTTTGCGC 120 CGCGTTTCTG AGCGTCACGG TGTCGTTCCG GAATTCAGAG TGATCCAGTA AACTGGTAAT 180 AAATCAGATC AGCTGCATCA GACAATACAG CTGATACATT GCCAACCTTT CGCAGTCGAC 240 GAGCTGGTTA GACTGGCGTT GCCAGATCAG CGGTGGATCG TCATTCGGCC GCGCGCTCGT 300 TGGGCGCGAA ATTCGAATTC AAATTCAAAT TTGACTAATT AAGTTGAACA AAAATTTGAA 360 ATATAACCTA AGAACTGAAG CTAGTTCTAT TTCCACGGAG CGCTGGGGAC CAACAAGCCA 420 AAGTCCCCCT TTTATATCTG ATGGACAGAA CAGGGGGGGC AGTGCCCCCG ACCCCAACGA 480 CCCGTTTAGG AGGAGTGGTA GGCTATCGAG ACCCCTATAA GAGGAGTCGG AACCCAAATC 540 CAGGGAGGGG GAGGTGAGCA AGCAGGCCCC CGCCGCCGAA AGCTAGCGAC TGCAGTGCAG 600 CTGTGGAGGT GGAGACTCTG GCCACTACAT CTCCACGACA GCCACAAGCC TGAAGACAAT 660 GGATTTCATC AGCGTGCCGG CCCAAAGGCG GGAGCGAAGT CCCCATCAGG ATCGCCGCAT 720 CGCTCGCCAG AACTGACCAC GACGCTCAGA ATGAGGACCT GCAGGGCATC CTGGACATGA 780 TGAAAGCCAA GATCACCGCC ATTCTACCTC GTTCGAGACT AGGCGTCACG TAACCAGCGA 840 GGATAGGGGC GTGCTAGTGG ACCTACGGCG CTCAATAAGA GAGCGATTGA GTTACAGGAG 900 GGTATCAACA AGAAGCCCCC ACCAGGAGTA CTGCCACACA GACTGAGGCA GAAAAGACAA 960 AGCGTAGCCA GGTGGCGCCA CCCGACAGGC ATTACCGAAT GTGCAGGTCC GCAAAACGGA 1020 CAACCATCGA TCTGCCGCGA AACCACCGGG AAGGCGTTAC CCACGACGGC TGACTCCAAA 1080 CCGGAAAGTT ATGCGTCTGT CCCAAGGACG CTAACAAGGA CGAGGAATGG GCTAAGGTGA 1140 AGCCCAAGCG CCTGCGCAAA AACCTGAGGC GCTAATCCTG AAAAAAACGG GTGAGGTTAC 1200 GTACTCGGAT ATGCTCCGGA GATGAAAGCA GAACCGAGTC TGACCGAATT CGGCAAGCAC 1260 GTGCGTAAAA TAAGGAGGCG CAACAGGGAG AACTACTTCT TGAATTAGAA GGTAAAGCCT 1320 CGGAGGTCAT CCCCAGCTTA AAAATGAGCT AGAAGCGACG CTCAAAGAGA TTGCTTCGGT 1380 TCGCACGGGC GCGCATGGAC TGCGCTAATC TGCAGCGGAC TAGACGAGAC AACGACGGCT 1440 CAGGACCTTC ACAATCCCTG GTCTCCCAAT TTCAGGGCAT CCGCCTGGAA CCAGAGGATG 1500 TAAGAGGCCT TCGCGGAGGC GTGACGGGAC CCAGATAGCC TCTGTGCTAA TGTGCGCGAA 1560 CGATGCCATT GCGTCATCAA CCGGGGCGTT GTAACTGTGG GATGGTCGCG TTGCCGCATA 1620 GCCCAAGACG TCGCCCAATA AGATGCTTCA GATGCCTCGA ATTCGGCCAC CGAGCTCCCT 1680 ACTGCAAGTC ATCGATCGCT CTGACTGCTG CCTACGGTGC GGCGAGCATG GGCATAAGGC 1740 AAAGGGCTGC TAGCCCCACC AAGATGCCTG ATCTGCAGCA GTGACGTGGA CAAGAACCAC 1800 GCGACGGGTG TTTTGCATGC CCCACCTACA AAGCCAACAC CAAAGGAGCT AATAGCCGTC 1860 AAAATGATCC AGAAGAAATT AACATCATCC AGCTCAACGT TAACCATTGC GCAGCAGCAC 1920 AGAACCTCTG ACTCAAACAG CGAAGGAGCG CCATGCGGAC GTAGTGCTTT TGAGTGAACC 1980 ATACCTCCTG GTGTGGGTAA CTCAGGAGTG CTACTCGATG AGACAGGCAA GGCGGCCATT 2040 AAATGACATC CAGGTTATTG GTAGAGGAGT GGGATACCGT ACCAATGCGC GGCATAGCAT 2100 ACGCAAAATT AGAGGAATCC ACTTCTACAG CTGCTATGCT CCACCTAGCG ACAGTCCCGA 2160 GCATTCGAGG ACATGCTCGA AAAGCTGGTT AACCATGCAA GTGGGCGCAG ACCAACAGTC 2220 ATGGAGGTGA CCTCAATGCC TGGGCTACAG AATGGGGCAG TCGAATCTCT AACACAAGAG 2280 GCGAGCAGTG ATCGATGCCA TGAACCTGCT AGATCTCGTA TTGCTAAACG ACGGGTTCAA 2340 CCGACGTTCA ATAATGACAG AGGTACATCT TTCATTGATG TCACTTTTGT TAGCAGAGTC 2400 TAGTGGCTGG CTCGAACTGG ATGGTCCATG AGGACATAAC GCTGAGCGAC CACAACCTAT 2460 CACGTTCGGT GCCCGAACAA TGAGGCCGTC ACCCAAACAG CGAAGATGTG CGCTAGGCCG 2520 GTATGGGACA TTAGGAAACT GGATGAAGAC ATGCTAGCAT ACCAGATCGA GGGCATGAGT 2580 CCATAATTGG ACACGCGGAG ACCATGGTGA CAGCGCTCAT GGATAGGCTT AGAGCATGTG 2640 TGATGCGGTG ATGCCGAGAA AAAGGAACAC GAAACGAAAG CCCCCTGTCT ACTGTGGAGT 2700 GACTCATTGC ACCAGCTTCG GACGGAATGC ATCAAGGCGA GGAGGCAAGC GCACGATCTA 2760 GAGGGCAACC GCACCACTCC CAGTGCATTG AGGTATATAA AGCGAAGCGA ACGAGCTTAA 2820 GAACGGCATA TCAGCAGCAA AGGCGAATGC TTTTAAGGAC CTGATTGATA GGTAGACGAC 2880 GACCCCTGGG GTCTCGCCTA CAAAGTCGTA AGGAAAAAAC TCAATTCCGC GGTGCAGGAT 2940 CTCCTCAGGA CCCAGCCGCC CTGGCCAACA TCGTGTCGGA ACTATTTCCA GCCAACACAC 3000 GTTATGGCAA CCCGCTGTTG ACCCCCCCGC CTCTGACTTC CCGTGCATAC GTCATGCGAA 3060 GTCGTCGAAG CAGCAAAGAG GATCAGACCG AACAAAGCTC CCGGCATGAT GGTATCCCGG 3120 GCGTGATCGT CAAAGCAGCC GCCACCGCAA GACCGGAGGT CTTCAGGATA CATTCCAACA 3180 GTGTCTGCTG GACGGAGTCT TCCCCAAGCG CTGGAAAAAG ATGAACTGGT CCTTCTGCTG 3240 AAAGGCAAGG GGCCCGCAAA TGTTCCACGC AGTTACCGAC CGTTTGTCTG TTGGACATTG 3300 TTGGCAAGCT CTTTGAGCGC ATACTGTACG CGCGCATTGA GCTATCACTG AAAGTCCTAC 3360 AGGCCTTCAG GGCCAACAGT ATGGCTTCCG AAAGGGTAAA AGACGCTCGA CGCTCTCAAA 3420 TCCGTAACAG ACGCTGCCAG GAAAGCACTC GACGGTAACC GTGGTTAGGC GGCAGTAAGA 3480 AGTACTGTGC CATCATCACG CTAGACGTCA AGAACGCATT AATACAGCGA GATGGCCCAT 3540 TATCCTCGGG GCTATGCGCA ACTTGGGCGT TCCCGACTAA TACGAGGCGT AATTGGCAAC 3600 TACTTCAGGG ACCGTGTGTT ATGGTACGTA ACAGAAGAGG TCCAAGAAGC CACCAAGTCT 3660 CTGCAGGCGT TCCCCAAGGG TCGGTACTAG GACCGATTTG TGGAACATTA TGTATGATGG 3720 AATACTGAGC ATTAGCAAGC CCAGAGGTGT GGAGCTCACT GCTTCGCCGA CGACGTTGCG 3780 ATAACAGCGG TCGCCAAGAC AATACCGGAG CTCCAGACAT AAGTAACGTG GCAATCACGG 3840 CGGCCATAGA ATGGCTCGAG AAAGTCGGAC TTAAATAGCT GCGCATAAGA CCGAAGTAGT 3900 CCTGCTGAGC AGCAGAAAGT CCGTTGAGTG CATCGTGTAG AAGTCAAAGG AGTTGAAATC 3960 GCCTCAGCAG AAACGTTGAA ATACCTTGGT GTCTAATAGA CCGAAGGCTC TCGTTCAAGG 4020 CTCACGCAAG GTATGCCAGC AAAAAAGCGG CATGACAGCA GCAGCCTTGG CGAGAATCAT 4080 GCCCAACGTG GGAGGACCCA GACTGCCGGC AGGAGACTGT TAGTGGCGGT TTCAAAGGCA 4140 ACGCTGCTTT ATGCTGCGCC TATCTGGAGT GCGTTTCCAC AAAGAAAACC TATCTAGATA 4200 GTGCCCGCGC AGTATCACGG ACAATGGCCT CAGGCTAATC AGAGGCTTTA GAACCATATC 4260 GGACGACGCA GCGCACGCTC TGTCAGGATT ACACCCATTG ACCTGGACAT AAAGGGCAAA 4320 TACCTTGCGA GTGAGGGATA CACTCATTAG AGATCAAAGA GTGGATTCGA GGAGTATGGC 4380 AGACCAGGTG GCAAGAGTCA CAACGGGACG CTGGACTTAC AAACTCATTC CGCAACTAAC 4440 GGAGTGGGCT GATTGCGAGC ACAAACGGTG GACTACCACA TGACCCAGTT CCTCACGGAC 4500 CATGGCTGTT TTCGAGGGTA CCTTGTAGGT TCCGCCACGT GGATACAGCC CAGTGCCTTT 4560 ATTGCACAGA CGCAGTGGAA ACGCAGAGCA CATCCTACTG CACTGCTCCA GGTTCGCCGA 4620 GGAGAGGGCG CAACTCGTGG CCTCGCTGGG TCACCTCTCA GCCCGAGAGG CTTGGTTGCT 4680 GCTATGATGG CGGACAAAAT GTTTGGGATG GGGCTCACGT GATCATCGTC ACCATGATGA 4740 AGCGTGTCCG TAAGGACGAA TGGCCAATCG GAACTATAGA TAAGGAGTAC CCCCGTATGT 4800 TGGCGGGGCA AGAACTCTCG ACTGGTACGC TCAGTTGGTC GTAAAAAGGC GCTGCTGTGC 4860 ACCGCAAAAG AAGATGGGTG CAACTTGGCA CCACATCCTG CTCACCGATG AAATACCTTG 4920 ACTGGCAGTC CCGGTGGCTT GACAAGGACA GGAGAGAGAG CGGAGGTTTT TGTTTAGTAC 4980 GTAGGCATAA GCCCTAGCTG AGGGTTATGA ATCGTGCATG CCATCCAAGG ACATTAGATG 5040 GTATCTTTAG AAGATTCATT TTCCTGCCGT ATATAATAAT AAAAAAAAAA AAAAAAAAAA 5100 AAAAAAAC 5108 // ID RT1B standard; DNA; INV; 5171 BP. XX AC AC005734; XX DR FLYBASE; FBgn0042682; Rt1b. XX SY synonym: Waldo-A XX FT source AC005734:107957..102787 FT SO_feature CDS ; SO:0000316:AC005734:107575..106079 FT /db_xref="FLYBASE:;" FT /protein_id="" FT SO_feature CDS ; SO:0000316:AC005734:106095..103132 FT /db_xref="FLYBASE:;" FT /protein_id="" XX CC Sequence from I. Busseau (personal communication to FlyBase). CC Michael Ashburner. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5150 BP; 1371 A; 1339 C; 1477 G; 984 T; 0 other; GCTGCCAACT TGGCGCAGCG GGCGATCTGG AGAGGGAGGC GTTGCCAGAT CAACCGTGTG 60 ATCAGTGTTG TGTTGCGACA GTAGAGCGCG AAATTCAAAA TTAAACTTAA AATTAAACCG 120 TTGAATTTTA ATTTAATTTT AATTTTGATT CTGGTTACTG GGAGCGCACA AGTCTTTAAG 180 AAAGGGGAAA AGGCACAAAG GTCGAGGACA GACCAAATAT CATCCTCTCA CCGTGCTGGT 240 GCCTAAGCAG AGGCTCTGGT GTCCGAGTCG AGGCGGTACA AACCTCCATC CGTAAAAGCC 300 AGGGGGAAAT CCGTGGCCTA CGTGGGAACC CCTTTCTAAG GCTGGTGCTT CGCCTAGGTA 360 ACCTTAGGCC ATCTTTCAGT TGATGGACAG TGCAGTGGGG GGCAGTCCCC CTGATCGGAA 420 CGACCCGTTC AGGAGGAGCC CCCGACTTTC GAGATCCCCC ACCAGAGGAG TCGGTAACCC 480 GACCCATGGT GACGGAATCT CGGTCCCGTC GGTGGCAAGG GGTGAGCACG CAGGCCCCCA 540 GCCGCCGTTG ACTAGCGACT GCAGCGATGC TGTAGAGGTG GAGAACTTGG CTCCTACAGC 600 ATCTACGACC GCGACAAGTC AGAAGACGAT GGACCACATC GCCGTGCCAT CGCAAAAGAC 660 GGGAGGGAAG TCCCCAGCTG GTTCCCCTCA TCGGTCTTCG GACCTGACCG CGGCTCTCCA 720 GAATGAGGAC CTGAGGGGCA TACTTACCAG GATGAATGGC AAGATCAGCA CCTTGCTATC 780 CGCATTCGAG ACCCAACGGC ACGTGACACG AGAGACAAAG GGCATAGCTT CCGAACTCTC 840 AGCCCTTAAC AACAGGGCGT TAGAGCTGTA CAAAGGTACT ACTAGCGAGC ACCGCTTGAA 900 GAGCAGTGCT ACCCAGACGG AGGTACAAAA GCCTACCAAA AAGGCTCCTC AAAATGCACA 960 GGTACCCAAG GTGCAAGTGC CAAAAATCCA GGCGCCCAAG CCGAGCAACC CTACGTTGGG 1020 TGGCAACGGT GAAGAAAAGG CAACATCCAG ACCAAACGAG TCCAAACCAG GCAGCTACGC 1080 ATCTGTCGCC AAGAATGCTA GCACGGGTGT AGAGTGGACC AAGGTGAAGC CTACGCGCCT 1140 GCGTAAAAAA CCGGAGGCAC TCATCGTAAA AAAGACAGGA GAGGCTTCGT ACGCAGAGAT 1200 GCTTCGGAAG CTAAGATCGG ACCCGAGCCT TAGCGAACTG GGCAGCCACG TGCGAAAAAT 1260 CCGGAGAACG CAGAAAGGTG AGCTGTTGCT CGAGGTAGAG GGGAAAGCTT CGGAAAGCGT 1320 CCCCAAGTTT AAGAGCGACC TGGAAGCGGC GCTCAATGAC TTGGCCTCTG TGCGCACAGG 1380 AGCGCAAAGA ATAGCTCTAT CTTGCAGCGG ATTGGACGAG GCTACGACAG CAGAGGAGCT 1440 CCACAGCTGC TTGGTCGCCC AATTCCAGGG CCTGCAGATA AATCCTGAAG ATATCAGGGG 1500 CCTTCGCAGA ATGCGGGATG GCACGCAAAT AGCCTCAGTG CTGCTGAACG CGAACGTTGC 1560 GATACCAGTC CTTAAACAGG GCACCATAAC CGTTGGATGG TCAAGATGTC GTATCACCCA 1620 GGACGTTCGA CCCACGAGAT GCTACAGGTG TCTCGGCTAT GGGCATCGAT CAGCAACCTG 1680 CAAGAACACT GACAGGGCAG ACTGCTGTCT TAGATGCGGT GAGCGTGGGC ACAAGGCAAA 1740 GGGGTGCGTT GCAGCACCAA AATGCCTGAT CTGCAGCAGC GAGGTGGACA GAAACCACTC 1800 GACGGGTAGC TTTGCGTGCC CGACCTACAG AGCGACCCTA AAAGAAGCCA AGAGCCACCT 1860 TAATGCACAC TCATATTAGC GTAGTACAGC TCAATGTCAA TCATTGCGCA GCAGCTCAGA 1920 GCCTCCTGGC CCAGACTGCG GCTGAGCGCA ATGTAGACAT CATGCTCCTA AGCGAACCCT 1980 ACGTCTCTGG TAGCGGACAA TCGTCCATGA TCCTTGACGA GACAGGTAAA GCAGCTATCA 2040 AATGCTGCAG CTCTCTCCAC GTCGAGGAAC TGGCTGCTTT ACCTATGCGG GGTATCGCTT 2100 ATGCGAAGTT AAAACACGTG CACTTGTACA GCTGCTACGC TCCGCCGAGC GACACCCCCG 2160 ATCAGTTCGA GGAGTTTCTG GAGGCGCTCG TGGACCATGC GAGAGGGCGA AGCCCGAAGG 2220 TCATTGCCGG CGACTTTAAT GCCTGGGCAG TGGAATGGGG CAGCAGGACA TCCAACACCA 2280 GAGGCCGAGC TGTGATTGAC GCCATGGGAA TGCTGGACCT TATACTGCTG AACGACGGAC 2340 GGAAGCCGAC GTTTAACAAC GATAGGGGTA CGTCCTTTAT TGACGTTACC TTTGTCAGCA 2400 GAGGGCTAGT AGACAACAAT AACTGGATGG TCCATGACGT CATGACGCTG AGCGACCACG 2460 CCCTGATCTC CTTCAGTCTC TCCCCGGAGG ACATGCCCAG GAGACGGCAG AGTAGAGCAG 2520 TCGGGAAAGC ATGGGACACC AGGAAGATCG ATGAGGCCAT GCTGGCCTAT CAGATCAATT 2580 CCCTGGAAAT CCCAAGTGGG GACGCAGAGA GTATGGCGGC AGGCCTCATG AATATGCTGG 2640 GAAGAATCTG CGACGCAATC ATGCCAAGGA AAAATAAGGC ACAGCGCAAA CCACCCGTTT 2700 ACTGGTGGAG CGCCTCCCTA AGCCAACTAC GGTCTGATTG CCTCAGGGCT AGGAGAATGG 2760 CGCAACGAGC CAGAGGCAGT ACCCACCACG CGGAACTCTT GGAGGCTTTC AGAAGGAAAC 2820 GTCTAGAGTT CAAGCACGGC ATCGCGGCTG CCAAAGCGCG GTCGTTTAAG GAGCTGCAGG 2880 ATGGCGTAGA CAGCGATACC TGGGGCCTCG CCTACAAGCT TGTTACCAAA AAGCTAAGGA 2940 GGAGAGCGGC AACCCCATCC GACCCGGGGG TCCTGGCTAA CATAGTAGGG GAGCTATTCC 3000 CAAAGCAGAC CACACTATGG AGGCCAACAG AGGCAGCCCC TGCCCCAGAT TTTCCGTGCG 3060 TCACAGAACT TGAAGTCGCC GAGGCAGCCA AGCGCATCAA ACCCAACAAA GCCCCTGGAC 3120 TAGATGGTAT TCCTGGAGCT GTTATAAAAG CAGTGGCGCT GGGTAGACCT GAAATCTTCA 3180 GGGCCACCTT CCAGCAATGC CTTCTGGACG GAATCTTCCC AACAAGGTGG AAAAGCCAGA 3240 AGCTAGTCCT GTTGCCGAAA GGCAAGGGAC CAGCACATGC TGCAAACAGC TACCGCCCTC 3300 TATGCCTACT GGATATAGTA GGAAAACTGT TCGAACGTAT CCTGTATACC AGAATAGAGG 3360 CAATCACCGA GAGCATCAAC GGCCTGGGAA GTCATCAATA TGGCTTCCGG AAAGGTAAGA 3420 GCACTCTGGA CGCTCTTTCG GCCGTTTGTA ACATCGCCAA GACCGCTATT TCTGGTGATA 3480 GATGGTTAGG GGGCAGGAAG GAATACTGCG CAATTGTGAC TCTGGACGTA AGGAACGCTT 3540 TCAACACCGC CAGATGGCCC GTAATCCTCG CGGCCATGTA CCGTATGGGG ATCCCGGAGT 3600 ACCTAAGGAT AGTCGTTGGC AGCTACTTTA GGGACCGGGT CCTATGGTAC GATACGGAAG 3660 ATGGCCCAAA AAGATACCGA GTTTCGGCAG GTGTTCCCCA AGGATCGGTA CTTGGACCAA 3720 TCCTATGGAA CATTATGTAC GATGGGATCT TGGGCATCAA CAGGCCCGTA GGAGTAGAGC 3780 TGCATTGTTT TGCTGACGAT GTGGCAATCA CAGCTGTCTC GAAAACAATC GCAGGGTTGG 3840 AAGACAAATG CAACTCTACG ATCGGTGCTG CCATCCGCTG GCTCGAGAAA GCCGGGCTAG 3900 CAATAGCGGC TCACAAGACC GAAGCAGTCC TACTAAGCAG CAGGAAAAAG GTGGAGAACA 3960 TGCTGGTCTC CGTCAAGGGT ACACAGGTGA CCTCTCAAGA GTCCCTAAAG TACCTGGGGG 4020 TAATGATAGA TCGCAGACTA TCGTTCAAGG ACCACGCGAG CCACGCCAGC AAGAAGGCAG 4080 CAATCACAGC CTCTTCGTTG GCGAGGCTTA TGCCCAACGT CGGAGGCCCA AGACACCCGG 4140 CCAGGAAACT GCTGGTGTCA GTAGCAAAGG CTTCGCTACT ATACGCTGCA CCAGTCTGGA 4200 GCAATGCCAC TGGCAGGGTC TCATACCTGA AAGGAGCTCG TTCGGTGCTA CGGTCAATGT 4260 CTCTGAGGCT CATTAGAGGT TTCAGGACCA TATCCGAAGA CGCGGCGCTA GCGCTGGCAG 4320 GCCTGCCGCC GATTGATCTG GAGATCAAGG CTCTCAGCCT AATGCGGAGT GGCGCTTCCA 4380 GGCAAGAGGC ACACGAGTGG CTATTAGGTG AATGGCAGAG TAGATGGCAA ACGTCGCGAC 4440 GGGGGAGGTG GACTTATCAG CTCATCCCAG AGATGACGGT TTGGGCAGAG TGCCAACACA 4500 AATGCTTGGA CTACCACCTA ACCCAGTTCC TCACGGACCA TGGCTGCTTC CGGGCCTATC 4560 TACTCCGGTT CCGTCACGTA GAGTCAGCCC AATGCTTGTT CTGCGTCGAC GGTGAAGAAA 4620 CAGCAGAACA TGTGCTAATG CACTGCTCCA GGTTCACGGC GGAGAGAGAG CAGCTAAAGA 4680 CGCTGTCAGG TTCCCCGTTC AGCCCTAGTG GCTTGTTCGC GGCTATGATG GCGAACAGGG 4740 GGGCTTGGGA GCGGGGACAC AGCATTATCA TTAATATGAT GAAGCGTGTC CGATCAGACG 4800 AGATGGCCAA CAGAGTGGAT GTCTAAGCCC AAACTGGTGT CCTGGGTGAC GGCGGGCGAA 4860 GAATTCATCC TCAGCGTCCC CGGCTCGTCG TAAAAGGCGA CTAAAGGGTG GAAGGAGGAG 4920 CCCCCATGGA CTACACTGAA GGAAGGGAGT GCGACCTGGC CTCACATCCT GCTCACCGAA 4980 GTCATACCTT GACTGGCAGT CCCGGTGAGC GAGCAAGGAC TGTAGAGCAC GCGGAGGTTT 5040 TTGTTTTAGT ACGTAGGCAT AATTCCAATA GGGCTTATGA ATCGTGCATG CCACCTACGG 5100 ACGGTAGGTG GTATCTTTAG AAGATTTTAA TTTTCCTACC GTAAGTCAAA TAATAAAAAA 5160 AAAAAAAAAA A 5171 // ID QUASIMODO standard; DNA; INV; 7387 BP. XX AC AF364550; XX DR FLYBASE; FBgn0044355; Quasimodo. XX SY synonym: cruiser SY synonym: antonia XX FT source AF364550:1..7387 FT SO_feature five_prime_LTR ; SO:0000425:1..658 FT SO_feature three_prime_LTR ; SO:0000426:6729..7387 FT SO_feature CDS ; SO:0000316:1074..2081 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044353; Quasimodo\gag" FT /db_xref="SPTREMBL:Q967T3" FT /protein_id="AAK52060.1" FT /translation="MAQPSEQIMILSELHLNQALGQIRHISTFDGSTRELASFVRRVDF FT VMSLYPTTDKRQHSVLYSAIERQLSQHAQEVSQLQQCNTWAELRSVLIDEFKTQIPYEE FT LLRRLYNTYWSGSIRKFVEELENKMFEISSKLSLENNYTNTTLYTAAMANTIKDVIYKR FT IPDRMFMTLARYDITTTTLLRQVAQREGLYDTIVLNTEKAKAKLNSPSTLQSSSKNNGS FT QKNKGNNNDQGIKPYYIQAQQSQNRNNYASNNSKKVESNQATYNEFKQKLEQGRAQNAL FT NFQKPTSYSTTNQPQKRQRESSSGQSKMDTSENFHQLASGSESEVEEEHTHSYK" FT SO_feature CDS ; SO:0000316:2084..5281 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044352; Quasimodo\pol" FT /db_xref="SPTREMBL:Q967T2" FT /protein_id="AAK52058.1" FT /translation="MNNIKLKCLIDTGSSINLMSKNFFNCPINSKAPLDVHTINGQIIL FT KSKITLKPSKLCPTKQTFYLHKFSEKYDILLGREYLEDSKATICYTSNTVTLNKYTFKL FT RLEDINEEEETPKESLKENLLDSPNIEYVNYVIDNELNETNEFRLDHLNDEERKALTSV FT LYEFSDIQYKEGENLTFTSTTKHVITTKHEDPIYRRPYKYPQSFDEEVENQMKDMIRQG FT IIRKSNSPYCSPVWVVPKKPDASGKSKFRIVIDYRNLNEITVDDKYPIPVMDEILDKLG FT NCQYFTTIDLAKGFHQIQMDENSIAKTAFSTKNGHFEYTRMPFGLKNAPATFQRCMNNL FT LADLIYKNCLVYLDDIIVYSTSLEEHIMSLRKVFLKLREANLKLQLDKCEFLKKETEFL FT GHIITTDGIKPNPAKIKAVVNFPIPKSTKEIKSFLGLCGFYRKFIPNFAKIAKPMTLRL FT KKGSIINIKDSDYYLAFEKLKVLITSDPILIHPDFKKSFSLTTDASNFAIGAVLSQEHK FT PICYASRTLNEHEVNYSAIEKELLAIVWATKYFRSYLFGRKFEIHSDHRPLVWLDSIKE FT PNMKLQRWKIKLNEFDYHIKYLPGKENHVADALSRVKIEENFLGETSSNISLPTQATIH FT SAQEDNQSYISLTERPINYYNRQIEFIKDDNNNVETKRYFHKTKIKIHYIEMTNVHAKE FT LIKEYLCTKRSVIFFHNEIDFLTFQNAYIEIVSPNNVTKVMKSNIKLKDIETYSEFKEI FT IIKSHRELLHPGIEKQTNLFKEKYYYPDYQKLIQNIINECEVCNISKTEHRDTKLRYEL FT TPETFNPREKYVIDFYLINNKTFLSCIDIYSKYAALIEVSSRDWLEAKRALLKIFNEMG FT KPSEIKADKDSAFICSALHLWLRSENVNINITTSKNGISDIERFHKTVNEKLRIINSES FT DPENKITQFETILYVYNHKTKHNTTGRTPADIFLYAGTPAYDTQKEKQSKIDKLNKDRH FT SYEVDTRYKQAPLVKSKTTNPFKKTGHVEQIDEKHYEETNRGRKVEHYKSKFKKQKRIN FT KSKYNSSSTPENQVGSD" FT SO_feature CDS ; SO:0000316:5145..6701 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044354; Quasimodo\env" FT /db_xref="SPTREMBL:Q967T1" FT /protein_id="AAK52059.1" FT /translation="MRNITKKQIEEEKLNIINLNLKNKRELIKVNTIPAQLPKIKWGPI FT KQLFIITLIICFIRAVRGQSLEVNPIQAKNGYLIFKTGSINIPINYEYHYLSVNLTKTE FT QTFANLIKQAEEYGTIAQIQYLTEKLDREMNGIRIIKRNKRGLINIIGTAYKYLFGTLD FT QNDKEELEQKIYDLSQHSIQINELNEVIEVVNRGIEVINHLSAISEGDRRLELLVFNLQ FT QFTEYIEDIELGMQLTRLGIFNPKLLRHDPLSHVNSEKLLNIKTSTWLKSDANEILIIS FT HIPRDIIKTALFNIVPYPDKDNNILIENVNDKYYIQDNQVYKQNSGKPIINKCITGILN FT QIPTECRYSKTHNNIGITYVEPNIILTWKLSKIVLNQNCIINREIIIEGDNIIKAFNCS FT VQIENILITSTTLDYTQTVYINNNVTKLEPLSYLNAKEIIKEHTNTYNTLQIITLTILA FT IIILTLILYFIYKYKGIPKKLIVKYKKENPKQIEQQNNTTTENINTVLDTNPVLYPRIS FT A" XX CC Derived from P1 clones DS08479 & DS07153 by Sue Celniker, 29 March 2001. CC Michael Ashburner, 9-Apr-2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7387 BP; 3008 A; 1350 C; 1091 G; 1938 T; 0 other; AGTAGCATAT GTGCCATAGT ATAAGACCAC ATATACTTAC ACACACACAC ACATACATAA 60 CCACATAGGA ACAGCCATAC ACAAAATCAT AGAATGCCTT AGGATATAAG TCATAAAATC 120 ATGTATCCAA TAAGAACCCA AATTCTGGTG ACAATAACAA ATCCAGCCTA CCCTAATGTA 180 AACATTAACA TCCGCTGCCG AATCAGTTAT TCCCACCAAA GGGTCAAAAC CGCTTACGCA 240 GCTCAAGTTC TCGGCTGATC AGTACCCATG AGTGATCCAC TCAAGAAGGC ACCCAATCCA 300 ACAGACATAA ACATCCGCAG CCGCATTCGC GACTCCCACC AGAGGGTCAA AACTGCTAAC 360 ACAGCTTTAA GGGTCAAAAC TTTCCACGTA GCTCAAACTC GGCTGATCTA TATCCATGCG 420 CAATCCACTC AAGAGAGCGC CTAATTAACG TAAACATAAA TGAGAGCCTA TGTACGCTCA 480 ATATGAAGAA GTAAATAAGC TCAGCCAAAG GGTTGCTAAG CGTTCGCTTT GCAAATTAGA 540 TTTAGATTTT ATTCATACTT CAATTGTGTA AGACGTGAAT TTGTCTCCGA CATTCGCTTA 600 AGTTTTTGAT CAATAAAAAT ACTTTTTTTT ATAAACCAAC AAACCGCGAA GTTGTAATTG 660 GCGCAGCCGG TAGGATCCGC GTGTGAACAT AATCTCTTAG AGCTAATAGA TTAGTTCGAG 720 ATTAATAAAG CGCAACTCGT GACGAACAAA GTTTCGCTTG TGCTTCAGTT CATACAAAGC 780 AGATCCGCAA AAGAACTTAT GTGACAAAAT ATTTGTGAAC AGTTATCTAA CTCTCGATGT 840 TGGATACTAT TAATACAAAT ATAAATTAAA AAACTGCAAG TGACCATTAA AGTGTGACCC 900 ACCAAAATAA AAGGTGTGAC TGTTACCAGG TCAGTGTTGT GTAGCCCATC TAGCATTGAT 960 TGGACTGCTA CAAAGGTGTG CGAATTTTAA ATAAAAAAAA AAAAATAAAT AAAATAAAAC 1020 CCCGGTATCA ATCTACCAAG TGATTCAAAG AAAAATCAAC TATAGCCAAC AACATGGCAC 1080 AACCCAGCGA ACAAATTATG ATTCTATCTG AATTACATTT GAATCAGGCT TTGGGCCAGA 1140 TTCGTCACAT CTCAACATTT GACGGTTCCA CAAGAGAACT GGCATCATTT GTGAGACGTG 1200 TGGACTTCGT GATGTCCCTA TACCCGACAA CAGACAAGAG GCAACATAGT GTACTCTACA 1260 GTGCCATCGA GAGGCAGCTG TCCCAACATG CACAAGAAGT ATCCCAGCTA CAACAGTGCA 1320 ACACTTGGGC TGAACTACGA TCAGTCCTCA TTGATGAATT TAAAACCCAA ATCCCATACG 1380 AAGAGTTGCT AAGAAGACTT TACAACACCT ACTGGAGTGG TAGTATTCGT AAGTTTGTAG 1440 AAGAACTAGA AAATAAGATG TTCGAAATTT CAAGTAAATT ATCATTAGAA AATAATTACA 1500 CAAATACAAC TCTTTATACC GCCGCAATGG CTAATACTAT TAAAGATGTA ATTTATAAAA 1560 GAATTCCAGA CAGAATGTTC ATGACATTAG CAAGATATGA TATTACAACA ACAACATTAT 1620 TAAGACAAGT AGCACAAAGG GAAGGCCTTT ACGATACAAT TGTATTAAAT ACAGAAAAAG 1680 CTAAAGCCAA ATTAAACAGT CCATCTACAC TACAAAGCAG TAGTAAAAAT AATGGTTCTC 1740 AGAAGAATAA AGGAAATAAT AATGACCAAG GTATAAAACC TTATTATATT CAAGCACAGC 1800 AAAGTCAAAA TCGAAATAAT TATGCAAGTA ATAATTCAAA AAAAGTTGAA AGCAACCAAG 1860 CGACATACAA TGAGTTCAAA CAAAAGTTAG AACAAGGTAG AGCGCAAAAC GCTCTAAATT 1920 TTCAAAAACC CACATCTTAT TCTACTACTA ACCAGCCCCA AAAAAGGCAG CGTGAAAGTT 1980 CCAGTGGTCA GTCTAAAATG GATACCAGTG AAAATTTTCA TCAACTTGCC TCGGGATCGG 2040 AATCAGAAGT AGAAGAAGAA CATACCCATT CATACAAATA ACCATGAACA ATATCAAACT 2100 CAAATGTTTG ATTGACACAG GTTCATCAAT AAATTTGATG AGTAAAAATT TTTTCAATTG 2160 TCCGATCAAC TCAAAAGCTC CATTAGATGT ACACACAATA AACGGTCAAA TTATTTTAAA 2220 ATCAAAAATT ACATTAAAAC CTAGTAAATT ATGTCCGACA AAGCAAACAT TTTATTTACA 2280 TAAATTCTCA GAAAAATACG ATATTTTGTT AGGCAGGGAA TATTTGGAAG ATTCCAAAGC 2340 AACTATATGC TATACATCCA ATACAGTCAC TTTAAATAAA TACACATTTA AATTGCGACT 2400 TGAAGATATT AACGAAGAAG AGGAAACACC TAAAGAGAGT TTAAAAGAAA ACTTATTAGA 2460 TTCCCCAAAT ATTGAGTATG TCAATTATGT AATTGACAAC GAACTCAATG AAACAAATGA 2520 ATTTCGACTT GATCATCTAA ACGATGAAGA GAGGAAAGCC CTCACCAGTG TCTTATACGA 2580 GTTTAGTGAC ATACAGTACA AAGAAGGTGA AAATTTGACC TTCACAAGTA CAACTAAACA 2640 TGTCATAACA ACTAAACACG AAGACCCAAT TTACAGACGT CCATATAAAT ACCCACAAAG 2700 CTTCGACGAA GAAGTCGAAA ACCAAATGAA AGACATGATT CGACAAGGAA TCATAAGAAA 2760 ATCGAATTCT CCATATTGCT CTCCTGTTTG GGTAGTACCC AAAAAGCCAG ATGCATCGGG 2820 AAAATCGAAA TTTCGCATAG TCATTGACTA TCGCAACCTC AACGAAATAA CCGTTGATGA 2880 CAAATACCCA ATACCAGTAA TGGATGAAAT ATTGGATAAG CTTGGAAATT GCCAATACTT 2940 TACAACCATT GACCTCGCAA AAGGTTTTCA TCAAATACAA ATGGATGAAA ATTCTATAGC 3000 GAAGACAGCT TTTTCAACCA AAAATGGTCA TTTTGAATAT ACTCGAATGC CATTTGGTTT 3060 AAAAAATGCA CCCGCAACTT TTCAACGTTG CATGAATAAT CTCTTAGCAG ATTTAATATA 3120 CAAAAACTGT CTCGTATATC TGGATGACAT AATTGTGTAT TCCACTTCAT TGGAAGAACA 3180 CATAATGTCT TTGCGAAAGG TATTCTTAAA GCTCAGAGAA GCAAATTTAA AATTACAGCT 3240 AGATAAATGC GAATTCCTAA AAAAGGAAAC TGAATTCTTA GGGCACATAA TTACAACAGA 3300 TGGTATTAAA CCAAATCCCG CAAAAATTAA AGCCGTAGTG AATTTCCCTA TACCAAAGTC 3360 CACTAAGGAA ATAAAATCAT TTCTTGGTCT TTGCGGTTTT TACCGCAAAT TTATTCCAAA 3420 CTTCGCTAAA ATAGCAAAAC CAATGACATT ACGACTAAAA AAAGGCTCAA TAATAAATAT 3480 CAAAGATTCA GACTATTACT TAGCTTTTGA AAAATTAAAA GTGTTAATAA CGTCCGATCC 3540 CATATTAATA CACCCAGATT TTAAAAAGTC TTTTTCATTA ACTACTGATG CTAGCAATTT 3600 TGCTATAGGC GCGGTATTAT CGCAAGAGCA TAAACCTATA TGCTATGCTA GTAGAACTCT 3660 GAACGAACAT GAAGTTAATT ATTCCGCAAT AGAAAAGGAA CTCCTTGCTA TTGTTTGGGC 3720 CACTAAATAT TTCAGGTCCT ATTTATTTGG TAGAAAATTC GAGATCCATA GTGATCACAG 3780 GCCATTGGTT TGGCTTGATT CTATTAAAGA GCCAAACATG AAACTTCAGA GATGGAAAAT 3840 CAAATTAAAT GAATTTGACT ATCATATCAA ATATCTCCCT GGTAAAGAAA ACCATGTAGC 3900 AGATGCTCTA TCGAGAGTAA AAATTGAAGA AAACTTCCTA GGAGAAACCT CCAGTAACAT 3960 TTCGTTACCA ACACAAGCTA CTATTCACAG TGCTCAAGAA GATAACCAAT CTTATATTTC 4020 CTTAACGGAA AGACCAATTA ATTACTACAA CAGGCAGATA GAATTTATAA AAGACGATAA 4080 TAATAACGTA GAAACTAAAA GATACTTTCA TAAAACAAAA ATTAAAATTC ATTACATAGA 4140 AATGACAAAT GTCCATGCCA AGGAATTAAT TAAAGAATAT TTATGTACCA AAAGAAGCGT 4200 AATTTTCTTT CATAACGAAA TAGATTTCCT TACATTTCAA AACGCTTACA TAGAAATTGT 4260 CAGTCCGAAT AACGTAACTA AAGTTATGAA ATCAAACATC AAATTAAAAG ACATTGAAAC 4320 ATATTCCGAA TTTAAAGAAA TAATTATAAA AAGTCACAGA GAACTTTTAC ACCCAGGAAT 4380 AGAGAAACAA ACTAACCTTT TTAAAGAAAA ATATTATTAC CCAGACTACC AAAAGTTAAT 4440 ACAGAACATT ATTAATGAAT GCGAAGTATG TAACATATCC AAAACAGAAC ATAGAGATAC 4500 TAAACTAAGA TATGAGTTAA CACCAGAAAC TTTTAATCCT AGAGAAAAAT ATGTTATAGA 4560 TTTTTATTTA ATCAATAATA AAACTTTCCT GTCATGCATT GACATTTATT CAAAATATGC 4620 CGCTTTGATT GAAGTCAGTA GTAGAGACTG GCTGGAAGCA AAAAGAGCTC TTCTTAAAAT 4680 TTTCAATGAA ATGGGAAAAC CGTCCGAAAT AAAGGCCGAT AAAGATTCAG CTTTTATATG 4740 TTCCGCGTTA CATTTATGGC TTAGGTCCGA GAATGTGAAT ATAAATATCA CAACAAGTAA 4800 AAATGGAATT TCTGACATTG AAAGATTCCA TAAAACAGTC AATGAAAAGT TAAGAATCAT 4860 TAATAGTGAG TCCGACCCAG AAAATAAGAT AACACAATTT GAAACTATTC TCTATGTATA 4920 TAACCACAAA ACTAAACACA ATACAACTGG GAGAACACCA GCTGATATTT TCCTTTACGC 4980 AGGAACACCT GCATATGACA CCCAAAAAGA GAAACAATCA AAAATAGATA AGTTGAATAA 5040 AGATAGACAT AGCTATGAAG TAGATACTAG ATATAAACAA GCACCATTGG TAAAAAGTAA 5100 AACAACTAAC CCATTTAAGA AAACAGGACA CGTAGAACAA ATAGATGAGA AACATTACGA 5160 AGAAACAAAT AGAGGAAGAA AAGTTGAACA TTATAAATCT AAATTTAAAA AACAAAAGAG 5220 AATTAATAAA AGTAAATACA ATTCCAGCTC AACTCCCGAA AATCAAGTGG GGTCCGATTA 5280 AACAGTTATT CATAATAACA CTTATAATAT GTTTCATCCG TGCCGTCCGT GGTCAAAGCC 5340 TAGAGGTGAA CCCGATACAA GCTAAAAATG GATATCTTAT ATTTAAAACA GGATCAATTA 5400 ATATACCTAT AAACTATGAA TATCACTATT TGTCCGTAAA CTTAACAAAG ACCGAACAAA 5460 CTTTTGCAAA TTTAATTAAA CAAGCTGAAG AATATGGTAC TATAGCCCAA ATCCAATATT 5520 TAACTGAAAA ATTAGACAGA GAAATGAATG GAATAAGAAT AATAAAACGC AATAAACGTG 5580 GTTTAATAAA TATTATAGGT ACCGCATATA AATACCTTTT TGGCACTTTA GATCAAAATG 5640 ACAAAGAAGA ATTAGAACAG AAAATATACG ATCTTTCTCA ACATAGTATT CAAATTAATG 5700 AATTGAATGA AGTTATAGAA GTAGTCAATA GAGGCATTGA AGTAATCAAC CATTTAAGCG 5760 CAATAAGTGA AGGTGACAGA AGACTAGAAC TTTTAGTATT TAACTTACAA CAGTTTACCG 5820 AATACATTGA AGATATAGAA CTTGGTATGC AATTAACAAG ACTTGGTATT TTTAATCCCA 5880 AATTATTAAG ACATGATCCA TTATCACATG TAAATTCCGA AAAATTGCTA AATATAAAAA 5940 CTTCTACGTG GTTAAAATCA GATGCAAATG AAATACTTAT TATTTCCCAT ATTCCTAGAG 6000 ACATTATAAA AACTGCACTT TTTAACATCG TACCATATCC AGACAAAGAT AATAATATAC 6060 TTATAGAAAA TGTAAATGAT AAATATTATA TTCAAGATAA TCAAGTTTAC AAACAAAACT 6120 CAGGAAAACC AATAATAAAC AAATGTATTA CAGGAATATT AAATCAAATA CCAACAGAAT 6180 GCAGATATTC AAAAACACAT AACAATATTG GAATAACTTA TGTAGAACCA AACATAATTT 6240 TAACCTGGAA ATTATCAAAA ATTGTATTAA ATCAAAACTG TATAATTAAT AGAGAAATTA 6300 TAATAGAAGG AGACAATATA ATAAAAGCTT TTAATTGTTC TGTTCAAATA GAAAACATAT 6360 TAATTACAAG CACAACACTA GACTACACAC AAACGGTCTA TATCAATAAC AATGTAACAA 6420 AATTAGAACC ATTATCATAT CTAAATGCTA AAGAAATAAT TAAAGAACAC ACTAATACAT 6480 ACAACACATT ACAAATTATT ACATTAACTA TACTTGCTAT TATAATACTT ACACTTATAC 6540 TATATTTTAT TTATAAATAT AAAGGCATTC CTAAAAAACT AATTGTTAAA TATAAAAAAG 6600 AGAACCCAAA ACAAATAGAA CAACAAAATA ACACAACAAC AGAAAATATA AATACAGTAC 6660 TTGATACTAA TCCTGTATTG TATCCTAGAA TATCCGCCTG AGGACAGGCT AAATTTAAAG 6720 GATGGGGAAG TAGCATATGT GCCATAGTAT AAGACCACAT ATACTTACAC ACACACACAC 6780 ATACATAACC ACATAGGAAC AGCCATACAC AAAATCATAG AATGCCTTAG GATATAAGTC 6840 ATAAAATCAT GTATCCAATA AGAACCCAAA TTCTGGTGAC AATAACAAAT CCAGCCTACC 6900 CTAATGTAAA CATTAACATC CGCTGCCGAA TCAGTTATTC CCACCAAAGG GTCAAAACCG 6960 CTTACGCAGC TCAAGTTCTC GGCTGATCAG TACCCATGAG TGATCCACTC AAGAAGGCAC 7020 CCAATCCAAC AGACATAAAC ATCCGCAGCC GCATTCGCGA CTCCCACCAG AGGGTCAAAA 7080 CTGCTAACAC AGCTTTAAGG GTCAAAACTT TCCACGTAGC TCAAACTCGG CTGATCTATA 7140 TCCATGCGCA ATCCACTCAA GAGAGCGCCT AATTAACGTA AACATAAATG AGAGCCTATG 7200 TACGCTCAAT ATGAAGAAGT AAATAAGCTC AGCCAAAGGG TTGCTAAGCG TTCGCTTTGC 7260 AAATTAGATT TAGATTTTAT TCATACTTCA ATTGTGTAAG ACGTGAATTT GTCTCCGACA 7320 TTCGCTTAAG TTTTTGATCA ATAAAAATAC TTTTTTTTAT AAACCAACAA ACCGCGAAGT 7380 TGTAATT 7387 // ID Beagle standard; DNA; INV; 7062 BP. XX AC AF365402; XX DR FLYBASE; FBgn0001207; HMS-Beagle. XX SY synonym: midline XX FT source AF365402:1..7062 FT SO_feature five_prime_LTR ; SO:0000425:1..266 FT SO_feature three_prime_LTR ; SO:0000426:6796..7062 FT SO_feature CDS ; SO:0000316:1527..2930 FT /db_xref="FLYBASE:FBgn0044442; HMS-Beagle\gag" FT /db_xref="SPTREMBL:Q967S7" FT /protein_id="AAK53387.1" FT /translation="MANPNNIIRPSAFSSNERPVPSERHDLPEQHRVDMSVEQLTALIG FT QTVAQVLPGLIKQMNNNTLDFTDVSDQVVEPEYRNNLADFDRVPDIVKSIREFSGNPAE FT FGSWKKSVDRIMETYTPFVGTPKYYGILHTIRNKIVGSADVALESYSIPLDWKAMSRCL FT TLHYADKRDITTLEYQMSTLVQGHQQSVEDFHQDVYKNLSLILNKVGCMQMSRESEHFV FT TKMYREKALDTFIRGLRGDLPRLLAIKEPADLPSALHYCLKLENQTFRSNHAANKGQQS FT SFRGNEKPIPAPRNFQPSNAFGHRQPPPVPPRHPMRRPPQYLGQPFPAPRQHVSNFGNA FT PFIPPRQNYQQQWQQYNAPPRPFATKPQPRPEPMDVDGSVQTRNINYMNRPHSDINKRQ FT KNYNIQTVNMRQPSIEVTGSSSSMTGYQRSMQDYETQNNINSTLNEYCDGQIDNLDSEQ FT RDHVLHFLE" FT SO_feature CDS ; SO:0000316:<2870..6049 FT /db_xref="FLYBASE:FBgn0044441; HMS-Beagle\pol" FT /db_xref="SPTREMBL:Q967S6" FT /protein_id="AAK53386.1" FT /translation="WTNRQFGQRTTRPRVAFFRVEDSSLPYFECRVGSGKVLRVLIDTG FT SNKNYIQPNLVTNAIPNNKPYIADTPGGDVKISHYKRASLFDFEIKFFLLPTLKTFDAI FT LGKDTMKGMGAQIDLKNLTMTLENGKRVVLKEKQFEAVSTISPRIKHLEVEQRMILNKI FT IDSYPGLFADPNQKLTYTTSVRAAIRTISDTPIYSKFYQYPMSLKDEVHKQISELLHDG FT IIRPSRSPYNSPVWIVPKKLDSSGKKKYRVVIDYRKLNMVTVADRYPIPDINEVLAQLG FT DNKIFSVLDLKSGFHQILLKESDIEKTAFSINNGKYEFTRLPFGLKNAPSIFQRALDDI FT LHEHIGKICFIYIDDIIIFSKDDETHYQNLDTIFRTLQQANMKCQLDKCEFMKRKVEFL FT GFVVSDKGIETSPTKVQAISDFPIPRTLKELRSFLGLSGYYRRFIPNYAKLAKPLSSLL FT RGEDGRISRTLSSKKSVSLNREAIEAFKKLKSSLVSPDVILHYPDFKKEFHLTTDASNF FT AVGAVLSQENRPISFLSRTLSKAEENYATNEKEMLAIIWALKKLKIYLYGKAKVKIFTD FT HQPLTHSLSSWNGNARIKRWKAYLEEYDYEIFYKPGRENTVADALSRGPTVEQINTVAS FT TMHSSDSSSHGLIPSVEAPINAFKNQIFFRESESENYSFSIPFPTFQRHLVDRNLFTPD FT SLLSDLKEYLNPSVINGIFTSEDVMGKIQILYPIHFQGFKIRFCQSKVKDLVNEAEQEE FT EILRTHNRAHRNAVENKAQLSERVYFPKMRKKVSAIVNQCLVCKTAKYDRHPTHPEIRQ FT TPLPEYPGQIIHIDIYSTERHLVLTAIDKFSKLAMGRVIKSKAVEDIRKALRDIVFYYG FT VPKLIVMDNEKSLNSASIKFMLTDQLGIELYKAPPYKSTVNGQIERFHSTLSEIMRCLK FT GDGTHRGFEELLDRAIYEYNYTVHSVTKKRPLEVFFGRIATVAPEKYEQARLDNIDRLR FT QKQETDIEYHNRTRKPIKTYIKGQEIFVRVNTRLGSKLSSRFRKELVKEDRSTTILTES FT GKLVHKSNIRS" XX CC Michael Ashburner, 18-Apr-2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7062 BP; 2493 A; 1473 C; 1356 G; 1740 T; 0 other; AGTTATTGCC CTGCAATTGA TTCTCTAACA TCTTGTGGTT CCACATAGTC TCCGCTGCCA 60 TCAACGCCAA CGAACGGTTA AGCGCGACAT CGACACTTCT GCGCTGCGCC GCGGCCGACG 120 CCTGCTGCGC CGCTGCCGAC GACTTCACTT GATTGCTAGG GACTTAGGGA AACATTTTGT 180 ACGCTAGATT TAGTTTCAAA TGATAAATTG CAATAAACGG TCGCTTGCGA TCTTCAAAAT 240 CAAATCGATA ACTGTAATTA TTAACTGGCG CCCGAACAGG GACCAGCGAA TAAACGCGAA 300 CGAAAGACAA AATTCTAAGT CGCGGAGCAA AATCAAAATT TTGCTAAAAA ATATTCGTTG 360 GTTAAATTGT GCCGAAGAAA CTCCCGCGAG TTATTAACAA ACAAAATTCA CGGTCGGCTA 420 TAAAATAAAA TTGTTTGAGA AAAAAACAGA TTTTCCGGAA GAGGAAAATA CTTATCGGCA 480 TAGCATTGCC CATTGGTGGA TCACAGTTTC TGTCAGGCCA GCCCGCAGAA AACCTTCTTT 540 GGAGTGCTGA CGTGGATTCC CCAGTAGCCC AGGCAAAAAG GACAACATTC ACCCAGCCAG 600 GAGGAGTGAC CGCAACAATT CTATGTTAAA TAAATGCAAA ACGAAGAACA GCAATTTCAT 660 CGAAAAGGAG AAGGAAAAGT AGAAGTAGAA CGCCGAGAAT AACGGAACTA CAGAAGAGCA 720 ACACGGATAA CAGTTAACAG TGCAACCCAA AACGTAACGG CAGAGGTAAA GACGACGAAC 780 GGCAGAGAAC GGCTGCGTGA ACCAAAGTGC AGACAAGAAG GAAAAAAAAA GAACAACAAC 840 AGCACAGCCG TAGCAGCAGC AGCAGCCCGC CAAAAAGAAC AGGAATAAGA ACACGTTGAC 900 GACGAAGAAG CTGAAGCTGA AACAACCGAA GAGGAGGCAG AGAGACTACA AGAACCTGGA 960 AAGTACACTG GAAAAAACAT CAGCAATTTG AAATTCAGGC ATCCACCCAT AATAAAACAT 1020 AAGTATACCA AAAAAAAAAA AAAAAAAAAA AAAAATTTAA GTATATTATT ATTATTATAT 1080 TATTATTATA ATATTATTAT TATTATTATA TTTTTATTAT TATATTATTA GTATTATATT 1140 ATTATTATTA TATTATTATT ATTATATTAT TATTATTACA TTATTATTAT TATATTATTA 1200 TTATTATTTT ATTATTATTA TTGATATTAT TATTAATACT ATATTTTCAA CCCAGTTCCT 1260 AGAGATCTTC TGAAAGGAAA ATTTTCCTAT TTACTGTTCC TTTCTGGTAC ACTGTTCTCA 1320 AAGCAAAATA ACCGCGGTGA GCTAAAATTT ATTGCATGCA AAAATAAAAA AAAAAAAATA 1380 TAAAAATAAA AAATAAAAAA ACAAAAACAA AAGATAAATA AGCAAACACA TACACACGCA 1440 TTCCATATTT TCTGCCCACA ACTTTTGTTA AGTTCAAATT GGTTTAGGCT TGTTTTGTGC 1500 AGGTGTTGTC CGAATTAAAA ATCAGTATGG CAAATCCTAA TAATATAATA AGACCATCAG 1560 CTTTTAGTTC AAACGAAAGA CCGGTGCCCT CAGAAAGACA CGACCTGCCT GAACAGCATC 1620 GCGTAGACAT GAGTGTCGAA CAGTTGACAG CATTAATTGG CCAAACGGTG GCCCAAGTTT 1680 TACCTGGATT GATAAAACAA ATGAACAACA ATACTCTTGA CTTTACTGAC GTGAGCGACC 1740 AGGTCGTCGA GCCCGAATAC AGAAATAATT TAGCGGACTT TGACAGGGTG CCTGATATCG 1800 TAAAATCGAT CAGGGAATTC TCTGGAAATC CAGCAGAATT CGGGTCTTGG AAAAAGAGCG 1860 TTGATAGAAT CATGGAGACT TATACCCCAT TTGTGGGTAC TCCAAAATAT TATGGTATAC 1920 TTCATACCAT AAGAAATAAA ATTGTTGGAA GTGCTGATGT GGCACTCGAG TCCTACAGCA 1980 TTCCGCTGGA CTGGAAGGCC ATGTCGAGAT GTCTCACTCT GCATTACGCA GATAAACGGG 2040 ACATAACTAC CTTGGAATAC CAAATGTCAA CTTTGGTTCA AGGCCACCAG CAGTCGGTGG 2100 AAGACTTCCA CCAAGACGTC TACAAAAATC TGTCGCTTAT TCTAAATAAA GTCGGCTGCA 2160 TGCAAATGAG CCGAGAATCC GAGCACTTTG TTACAAAAAT GTACAGAGAA AAAGCTCTGG 2220 ATACTTTTAT CAGAGGCCTT CGAGGAGATT TACCTCGCCT TTTGGCAATA AAAGAGCCAG 2280 CTGATCTCCC CTCAGCTTTA CATTATTGCC TTAAACTTGA AAATCAAACT TTCAGGTCAA 2340 ACCATGCGGC GAATAAGGGT CAACAGTCGA GTTTCAGAGG AAACGAAAAA CCCATTCCGG 2400 CACCCCGCAA TTTTCAGCCG TCGAATGCCT TTGGCCATCG ACAGCCTCCA CCGGTCCCTC 2460 CACGACATCC CATGAGGAGA CCTCCACAAT ATTTAGGCCA GCCATTCCCG GCGCCAAGGC 2520 AACACGTGTC TAACTTCGGT AATGCACCGT TCATTCCACC CCGACAAAAC TATCAACAGC 2580 AATGGCAACA ATACAATGCC CCACCCCGTC CCTTTGCGAC CAAACCACAA CCAAGACCAG 2640 AACCAATGGA CGTGGACGGC AGTGTACAGA CGCGAAACAT TAATTATATG AATAGACCAC 2700 ACTCGGACAT AAACAAACGA CAAAAGAACT ACAACATCCA AACAGTGAAC ATGAGACAGC 2760 CAAGTATTGA AGTAACTGGG TCCAGTTCAA GCATGACAGG CTATCAACGG TCGATGCAAG 2820 ATTATGAAAC CCAGAACAAT ATAAACAGCA CACTAAATGA ATACTGTGAT GGACAAATAG 2880 ACAATTTGGA CAGCGAACAA CGAGACCACG TGTTGCATTT TTTAGAGTAG AAGACTCCTC 2940 ACTACCATAC TTCGAATGTA GAGTGGGGAG TGGAAAGGTT TTAAGGGTGT TGATTGACAC 3000 AGGCTCTAAT AAAAATTACA TCCAGCCCAA TTTGGTGACG AACGCGATAC CAAATAACAA 3060 GCCTTACATA GCTGATACTC CAGGCGGTGA TGTAAAAATA TCGCATTACA AAAGAGCAAG 3120 CCTTTTCGAT TTCGAAATAA AATTTTTTTT GTTACCAACG CTGAAGACTT TTGACGCCAT 3180 ACTTGGCAAA GACACAATGA AGGGAATGGG AGCGCAAATT GATTTGAAGA ACCTAACCAT 3240 GACACTGGAA AACGGGAAAA GAGTTGTTCT CAAAGAAAAA CAGTTCGAGG CTGTTAGCAC 3300 AATTAGTCCG AGAATAAAAC ACTTAGAGGT GGAACAGAGA ATGATATTGA ATAAAATAAT 3360 TGATTCGTAT CCAGGTCTCT TCGCAGACCC GAATCAAAAA CTAACCTACA CAACAAGTGT 3420 AAGGGCAGCA ATCCGAACTA TATCGGATAC GCCAATATAC TCAAAATTCT ATCAGTACCC 3480 AATGTCTCTT AAAGACGAGG TACACAAACA AATTTCCGAA CTTTTACACG ATGGAATCAT 3540 TCGACCCTCA AGGTCACCTT ACAATTCACC AGTGTGGATT GTACCAAAAA AACTCGACTC 3600 CTCTGGCAAG AAAAAATACA GGGTGGTAAT CGACTATCGA AAACTTAACA TGGTAACGGT 3660 AGCGGACAGA TACCCTATCC CTGACATTAA TGAAGTGTTG GCCCAATTGG GAGACAACAA 3720 GATTTTCTCA GTGCTCGATC TGAAAAGTGG GTTTCATCAG ATTCTTTTAA AGGAATCTGA 3780 TATCGAAAAG ACCGCCTTCT CCATCAATAA TGGAAAATAT GAGTTTACAC GACTCCCATT 3840 CGGTCTGAAA AATGCACCGT CAATTTTCCA GCGCGCACTG GACGATATTC TTCACGAACA 3900 TATCGGTAAG ATATGTTTCA TTTATATTGA CGACATCATC ATCTTTAGTA AAGATGATGA 3960 AACACATTAC CAGAACCTTG ACACTATTTT CAGAACTCTT CAGCAAGCCA ACATGAAATG 4020 TCAGTTGGAT AAATGCGAGT TCATGAAGAG AAAAGTGGAG TTCCTAGGCT TCGTCGTGTC 4080 CGACAAGGGC ATTGAAACCA GCCCAACCAA GGTACAGGCA ATCTCAGACT TTCCAATTCC 4140 AAGGACACTC AAAGAACTGA GATCATTCTT GGGATTATCC GGATATTACA GGCGATTTAT 4200 TCCTAACTAC GCTAAGTTAG CAAAACCACT TAGCTCGCTT TTGAGAGGGG AGGATGGTCG 4260 AATTTCCAGG ACATTATCGT CAAAAAAATC CGTCTCCCTT AATCGCGAAG CAATAGAAGC 4320 CTTCAAGAAA TTGAAGAGCA GCTTGGTTTC TCCAGACGTA ATACTCCACT ACCCGGACTT 4380 TAAGAAAGAA TTTCACCTAA CAACGGATGC TTCCAATTTC GCAGTAGGTG CTGTTCTTTC 4440 ACAAGAGAAC AGACCCATCT CATTTTTATC GAGAACACTC TCGAAGGCGG AAGAAAATTA 4500 TGCCACGAAT GAGAAGGAAA TGTTAGCCAT TATCTGGGCT CTAAAAAAGC TAAAAATTTA 4560 CCTTTACGGT AAAGCAAAGG TGAAAATCTT TACTGACCAT CAGCCTTTGA CCCATTCTCT 4620 CAGTAGTTGG AATGGAAATG CGAGGATTAA GAGATGGAAA GCATACCTTG AGGAATATGA 4680 CTACGAAATT TTCTACAAGC CAGGCAGAGA AAATACTGTA GCCGACGCTC TGTCCAGAGG 4740 ACCGACAGTC GAACAAATTA ACACAGTAGC CTCAACAATG CACAGCTCTG ACAGTTCGAG 4800 CCATGGGCTG ATACCTAGCG TTGAAGCCCC GATAAACGCA TTCAAGAATC AAATTTTCTT 4860 CAGGGAGTCC GAGTCAGAGA ACTACTCATT TAGCATTCCA TTCCCGACAT TTCAAAGGCA 4920 TTTAGTAGAT CGCAACTTGT TCACACCCGA TAGTCTCTTG TCAGATTTGA AAGAATATCT 4980 TAACCCATCC GTGATTAATG GAATTTTCAC ATCCGAGGAT GTAATGGGGA AAATTCAAAT 5040 TCTCTACCCC ATCCATTTTC AGGGTTTCAA GATTAGATTC TGCCAAAGCA AAGTCAAAGA 5100 CCTTGTTAAC GAAGCCGAAC AAGAGGAAGA AATACTTAGG ACACATAACA GAGCACACAG 5160 AAATGCTGTG GAAAATAAAG CTCAGTTGTC CGAAAGAGTA TACTTTCCTA AAATGAGGAA 5220 AAAAGTTTCG GCTATCGTGA ATCAGTGTTT GGTGTGTAAG ACTGCTAAAT ATGACAGACA 5280 TCCCACGCAT CCAGAAATAA GACAAACTCC CTTGCCAGAA TACCCCGGAC AAATTATTCA 5340 TATTGACATC TACTCGACAG AACGACATCT GGTGCTCACG GCGATTGATA AATTTTCCAA 5400 ACTGGCCATG GGAAGAGTTA TCAAATCCAA AGCTGTAGAA GACATTAGGA AAGCCCTAAG 5460 AGATATCGTG TTTTATTATG GAGTGCCTAA ATTAATAGTA ATGGACAATG AAAAGTCCCT 5520 CAACTCAGCC TCTATCAAAT TCATGTTGAC AGACCAGCTG GGTATTGAGC TCTACAAAGC 5580 ACCTCCGTAT AAGAGTACAG TAAATGGACA GATAGAAAGA TTTCACTCCA CACTCTCTGA 5640 AATAATGAGA TGTTTGAAAG GAGATGGAAC ACATAGGGGC TTCGAGGAAC TTCTCGATAG 5700 GGCCATCTAT GAATATAACT ACACTGTCCA TTCTGTCACA AAAAAACGAC CTCTAGAGGT 5760 GTTCTTCGGC AGGATAGCTA CCGTAGCTCC AGAAAAATAT GAACAAGCTA GACTGGACAA 5820 TATAGACCGA CTTAGGCAAA AACAGGAAAC CGACATAGAA TACCACAACC GGACAAGAAA 5880 GCCCATAAAA ACCTATATCA AAGGGCAAGA AATTTTCGTT AGGGTTAATA CAAGATTAGG 5940 TTCTAAGTTA TCAAGTAGAT TTAGGAAGGA ACTAGTTAAG GAAGACCGAA GTACCACAAT 6000 ATTAACAGAA TCAGGGAAAT TAGTACATAA AAGTAACATA AGATCATGAT TGCTTCAGAA 6060 TAATTACTGT GGCCACATCG ATAGACAAGT AGAACATAGA AACATTAAAC ATACTGATTA 6120 ACGTAACCAC GAGAATTAAA TTTGAAACGA TTCGACTTTG ACAGACGAGT TGGCCATAGG 6180 TTTGCAGAAT TAGATTGGGC TCGTGAAAGA AGAGATTGCG TTGTGAATGC GTTTCTGCGG 6240 AGCGAGTTTA AATTGAATGA GACCAGTAGC CTACTTGAAA ACAATATACC GATTTCTGCC 6300 TTGACGGTCA AATAAATGTT AAAACTTTCA CACGTTCTCG CTTTACACAA CGGAACTACA 6360 ATTTGAATGC CATTGAAATC TCCATATTGA ATCAGTTAAT TCCTTATTTC TTATATTTTA 6420 AAACATGATC AGTAAAGGAA ACTAATAATA ATAAATAAAT AATAATTTAA TTTGAATAAT 6480 AATAAAATTT ACCAATTAAA GACGAAATGT ACGGAATCAA TGCCTTCTGC AAAGGTTCCA 6540 ATGGTCTAAA AATATGTAGA TATAAATTCA TTAGTAATTT AGTAAAAACC AGAATTTTAA 6600 TAGAAGCAAT CAAGTTGTCA ATGCAGAACA CATCCGAGTT GCACAAACCT CGAACGCACG 6660 CAACAATAAA CACCTGGCAT TCAAACCATT AATGCTGTGC CCAAACTACC ACGAATCACC 6720 CCACCAACGA TGCGACTCAA AAACATCCCA TCCTTATGAT GACCGCTCGA GGACAAGCGT 6780 CAATTAAGAG GGGAGGAGTT ATTGCCCTGC AATTGATTCT CTAACATCTT GTGGTTCCAC 6840 ATAGTCTCCG CTGCCATCAA CGCCAACGAA CGGTTAAGCG CGACATCGAC ACTTCTGCGC 6900 TGCGCCGCGG CCGACGCCTG CTGCGCCGCT GCCGACGACT TCACTTGATT GCTAGGGACT 6960 TAGGGAAACA TTTTGTACGC TAGATTTAGT TTCAAATGAT AAATTGCAAT AAACGGTCGC 7020 TTGCGATCTT CAAAATCAAA TCGATAACTG TAATTATTAA CT 7062 // ID Tinker standard; DNA; INV; 6112 BP. XX AC AC004377; XX DR FLYBASE; FBgn0043969; diver. XX SY synonym: mazi SY synonym: Nuria SY synonym: Tinker XX FT source complement(AC004377:71947..77835) FT SO_feature five_prime_LTR ; SO:0000425:1..224 FT SO_feature three_prime_LTR ; SO:0000426:5889..6112 FT SO_feature CDS ; SO:0000316:577..5820 XX CC Michael Ashburner, 5-May-2001. CC Any changes to original sequence record are annotated in an FT line. CC Sequence coordinates from M. Butler, Personal communication to CC FlyBase, 30 April 2001. XX SQ Sequence 6112 BP; 1853 A; 1502 C; 1247 G; 1510 T; 0 other; TGTACGGATT ATGAGCGGAG TCACCCGGTG CTTCAGCTGT CTTAACAGCG GATTAACATT 60 TTATATATCG ATGCTAATTG AACTGAACTG TATTCTTTCT TTCGCTCTTG TAATTTTGCG 120 GTCATAGCGA TCAGACGTGA TTTTTGTATA TGAAGAAATA AATGAAGTTA AATAGTGTAT 180 ATATGATTCT TATTTGCGAA CCCCATTACA ATGATGTCAA AGCAGTGAGG CATCACAACT 240 ATTCATTGGT GGCCCATGAG GGGAACCTTG CTCATAAGAA TCATAATTAA TAATTGCTAA 300 AAGCTGTTTG GCAATTTTAT CTTCGTGGTC GGCTTCATCC AGATAAATCA CCGTGCCTAC 360 GTTTTCTATC GCGCTCGCTC TCATCACGAC AGAGCAGTAT ACGCAAACAT TCGCGCTCGC 420 TCTCATCACG ACAGAGCTAA GTCCCGTTGC CCTTCGATTC CGTTAGATTT GCCGGAATCT 480 CTGCCAACTC AAAAAAACAA GACGTCAAAG AAGAACACCG ATAACATACG AGCATTTGTG 540 CATATTACAT ACAGTGGAGC AACGTCAGTA TAAACAATGT CGAAATTTGA TCAACTGGTG 600 CGAGTTCAAA CTGACCGCAT TGAGTCTTTG AAGCGCTTGT TCAGTAATGT GAAAAAAGAC 660 TCGAGTGCGC GGAAAACGGA AATATATTTC GAAAAACGTT TGAATCAAAT CGACGAGTTT 720 AACAAAGAAT TTCATCAGGC GCATCACACA CTTATATGCA TGGCAGACTA CGAAGAAAGT 780 GTGTATAAGC AACAAAATAC AATCAGCCAA TTCGAAGACC TGGGCATGGA AGTTTACTGT 840 TTCGTGGCCG AAGAAAAGAA ACGAATTTAT CCAGGCACAG TGACGCACAA CGAATCAGCT 900 AACTCTACAA TAACAACACA TGTAGAGGAA GTACGGATAC CACTGCCAAA ATTACCAGTT 960 CCGAAATTCT CTGGCAACTG CGCGGACTGG CCAAGTTTTC ATGATGCATT TTTACGCTTA 1020 ATTCACAATA ATGAGCGTCT GGATAAGATT CAAAGGTTCC ATTTTTTAAA GGAAGCATTA 1080 CCAGTAGGCC TGGACAACGA CATTCGCCAA ATCGCTTTAA CTGAAGCAAA CTATGAAGTC 1140 GCTTGGACCA CACTTCTACA ACGATATAAC AACCCACGAA TTGTGTTCGC CAGCCACATG 1200 AACATGCTCT ACAACTTACC GAATCTTTCG AAAGAAAAAT CTGCTGACAT ACGGTCTATG 1260 GTTAGTACAG TCAACGTCTG CATTGCCGCT TGCAATACCG TCAAGGCGCC ACTACAGGGA 1320 GGAGATTTTT GGTTGACTCA TTATCTAACA ACCAAACTAC CCAAAGACAC TCACACAGCT 1380 TGGGAGCATC ATCTGGGCAG CAAGATTGAC GTTCCTTCAT ACAAAGATCT GCAACAGTTC 1440 CTCAATGATC GACTTGTTAC GTTAGACGCT ATTGAAAGCC GTAACGCGTG CAGCGGCATG 1500 AAACAGTCAA ATGAAACTTC AGACGGTACT AAACGCGTGC GTGTGCACAG CGCCCACACC 1560 AGGTCGGGTG CTTCCGCGTC CGCTTGCTAT CATTGTGGCA ACTTACACAT ACTTCGAAGA 1620 TGTCCGCAAT TTCTTTCAAT GGATTGCTAC CAACGGAAGG AAGTGGCCAG CAAGGCAAAA 1680 TTGTGTCTCA ATTGCCTGGG AAAATCGCAC ACACAAGCAA GCTGCCCCAG CAACAAGAAT 1740 TGTCTTCATT GTGGTCAACG TCATCACACG ATGTTACATT TTCCAGCAAC ACAGCCAACG 1800 CTGATACCCT CCTCGACATC ATGCCAAAGT TCAGCAGCCA GTTCAGATGC CAAGCCGACT 1860 CTACAGTGCA TGTCAACTAC AACTTCATCT ATGACTCATC GAAAGGTACT ACTAGCAACA 1920 GCTCGGGTTG TACTCAGTAA CACACAGACA GGATGCCAGG CCACGGTAAA CGCGCTTCTT 1980 GACCAAGGAT CGGAAGCAAC TATCATTTCG GAGCATGCTG TACAGTCTCT CCAGCTCTCC 2040 CGAAGCACAA CTCGCACTGC AATCACCGGA GTCGGTCAAG ATTCAGGACG ACGCTGCAAA 2100 TTCATCGTAA GTTGTTCGGT GCAAACAGCA ACTAACCCAA ATTTCTCGTT AAAGGTCGAC 2160 GATGCTTATG TTTTGAACAC GTTGACATCA CACATGCCAA GTCAAAGTTT TCCAGCAGGA 2220 AACTGGAGTC ATATCCATGG TCTCATGCTC GCAGACCCCT ATTATTATAG ATCCAAGCGA 2280 ATCGATATCA TTTTTGGAGC CGACCTTATG GCACAACTCT TGTTACCTGG AACTAAGATT 2340 GGCTTGCCAA ACGAACCCAT AGCGCAGAAT ACTCAACTCG GATGGGTTCT GTTAGGCAAT 2400 GTTGGCAACA CGCATATTAC ACGCCACATT CGATGTAATC ATGCCATAAT AAACTCCGAA 2460 GAGCTGCTTA AGGTATTTTG CGAGGTAGAA TCAGTTCCAG AACGCCCAAA GCTCTCCAAA 2520 GAAGATCAAT GGTGCGAGTC TTTCTTTAAG CAGACCCATC AACGTCAACC AGACGGCAGT 2580 TATCAAGTCC GTTTGCCATT CAAGAGGAAC TTTGACCCCA GTATGACGCT CGGAAAATCT 2640 CATCAGATCG CCCTGAACAG ATATCTTCAA CTCGAAAGGC GCCTTCAAAG GGACCCAGAC 2700 AAATGGATCA GATACTGCAA AGGAATTGAA GAATATTTTC AACTAGGTCA AATCACGTTG 2760 GCAGAGACGA GCGAAAACTC AACCATAACC ACGGATTCCT ACGGTCGGCA TGTTGCATCA 2820 TGCGTGCTAC CACATCATGC AGTTTTCAAA GAAGAAAGCC TCACCACAAA ACAACGTATC 2880 GTTTTTGACG CATCGGCCCG GACCTCAAAT GGCAGATCGT TAAACGATGT ACTATGTGTA 2940 GGTCCCACGC TGCAAAATGA TCTGCCAGCC GTTCTCTTGA ATTGGAGACA ATATCAGTTT 3000 GTTTTCACTG CCGATATACA ACGAATGTAT CGCTGTATCA ATGTCCACCC TGATGACACG 3060 CAGTACCAGA GGATTTTATG GCGGGCGGCG GATGGAGTCA TCAAACAGCA TTGCTTAACT 3120 ACCGTCACGT TTGGAACAGC GTCTGCACCT TATACAGCTA TAAGAGTCAT CCATCAAATA 3180 GCTGAAGACA CACAGACAAA ATATCCTATG GCATCCAACG TTCTCAAAAA TGGGATATAT 3240 GTCGACGACA TTCTCTCAGG CGAGCATTCA CAAGAGGCAG CAATACGGAA AAGTTTGCAA 3300 ACTATGCTAG CTTTAAAATC CTCTGGCATG GAGCTACGAA AATGGGCTAG TAATGACCAG 3360 GATCTGATGG CAACGATACC TCTCGAGCAT CGATGCAAGC AGACATCCCT CAGTTGGGAC 3420 AATGCGGACA CCATCAAAAC ACTGGGTATG TACTGGTTGC CCAAGCAAGA TTGCTTCACT 3480 TATAAATTAC TAGCAAATAC TCCAGCCGGT ATAACAAAAC GAGAAATCCT ATCGACCATA 3540 GCACGTTTAT TTGATCCTCT TGGATTAATC GCTCCAGTTG TAATTTCAGC AAAGATTATA 3600 TTGAAAGAAA TCACACTAGC GAAGCAATAT CGCGAGGACG GATCGAGTAC CTCACTGGAT 3660 TGGGATGAGC CAGTTCCCAA CACCATTGCC GTCAAATGGC AACAATTTCG ACAGCAACTA 3720 ATGAAGGTTA AGACGATCAA AATACCACGC AGCGTCAAAT TTACGCCACT ATTTAGTAGT 3780 GAGATACAAC TGCACACCTT TTGCGATGGA TCCTCCAGTG CTTACGCAGC AGCAGTGTAT 3840 GCACGCACCC AACAGTCTGA TGGGACTTTC TATACAACGC TCATCGTCGC AAAATCAAAA 3900 ATTTCGCCAA CCAAGCCGTT AACAATACCG CGCACTGAGC TATGCGGTGC AGTATTAGCT 3960 ACCAAACTCA CCAAATGGGT GCTGGAGAAT AACCGATGGA CCAATGCACA TATATCTACC 4020 TTTTACTGGA CCGATGCTAC CATTGTTCTG CACTGGATTA AAGGAGACAT TACTAGGTGG 4080 AAAACGTTTG TAGCCAACCG AGTGTCTTAC ATTCTCGACC ACACCTCAGC GGCTCAGTGG 4140 CACCACATAG ACACATCGGA AAACCCAGCA GATTGCGCAA CCAGAGGCTT ACCACCGAGC 4200 CAAATACCTG ATATTTGGTG GCATGGCCCA TCCTGGCTAT GCAAACCACA CAATATTTGG 4260 CCAAACACGC AATCACAATT GCTCAATCCA GAAGAACGGG ATCTGGAGGC CAAATCCATA 4320 AAAATTAGAG CCTTCACTAC CTTGTCAGAC ACAAAGGATT CTATTATTGA CCGATTCTCG 4380 TCGTATACAA AACTGCTACG TGTCACGGCA TACATGTTAC GATTCTGCCA CAATGCTCAT 4440 GCTCGAGCAC AACGAAGTCA CGGATCACTC TCTCCTGACG AGCTGGATGA GGCCCTGTGC 4500 TGCATAGCTC GCCTTGCACA ATCCGATACA TTTCACGCCG ACATCCAAGC GCTTAAAAGG 4560 AACAAGCCGT TACCACCCCG TAGCACACTT TCAAATCTTA CACCGTTTCT TGACAACAAT 4620 ATCTTGAGGA TTCGTGGCCG TCTCAAGCAT TCAAATCTCT CTTTTTCTCG AAAACATCCA 4680 ATTATATTAC CTCATTGTCA CCTATTTACA GATTTGGTAA TTCAACACTC TCATCAGCTC 4740 ACTCTACATG GCGGCGCTCA ATTAACACTG GCTCACATAC GCTACAAATT CTGGATTCCC 4800 AGAGGCAGAC AAGCAGTCAG GCGAATCATC CGGAAATGCG TCACATGTTT TAAGGTAGCT 4860 CCAGTCGTAG CGAAGCAATT GATGGGTGAC TTGCCATTAC ATCGAGTCAA CCCCCCAACT 4920 CGTCCGTTCA TTACAACAGG AGTTGACTAC ACTGGTGCGA TTGAACTTCA AGCCGCGCGT 4980 GTACGAGGAT CAACCACCTA CAAAGGCTAT GTAGCCATTT TTATATGCTT AGCAACCAAG 5040 GCGGTCCACT TGGAAGCCGT CACTGGACTT TCAACAGAGC ACTTCCTGCA AGCATTTACG 5100 CGATTCACCG GACGTCGAGG ACAAGTTCAA CATATGTATA GTGATAACGG CACAAATTTT 5160 GTTGGCGCGA GTACATCGCT TAACCAGCCC ATCACTTGCA AGGCAGCCCT AAACGAATCG 5220 ACATGTCAAC ATGGGACAAG GTGGCATTTC ACACCTCCAT ATTCTCCCAA TTTTGGTGGC 5280 ATCTGGGAGG CAAACGTGAA AGCAATGAAG CATCATCTTA AACGGATCGT CGGCAGCCAC 5340 AAGCAGACGT ATGAGGAGCT TACTACAGTT TTGATCAGGA TTGAAGCATG TTTGAACTCA 5400 CGTCCACTTT GCCCGCTAAC CGCCGACCCT GACGATTTAG AAGTACTAAC TCCAGCACAT 5460 TTCTTAATTG GTGACGCACT ACTGGCACCG CCACAAGGTC GGCCGAATAA TAAGCCTTTG 5520 CGTGAACTAT TTCTCGCACA ACAACACATG ACGCGACAAT TCTGGTCTCA ATGGTCTCGT 5580 GATTGGTTAT CACACTTGCA AACTCGGCCA AAGTGGTGCC AAATCAAAGA TAATCTTAGC 5640 ATCAACGACT TAGTCATTAT AAAGGATGAT AATCTACCAC CAGCTAAATG GACTATAGGC 5700 CGAGTCGTCG AACTGCACCC TGGATCGGAC TCGCTAGTCA GAGTGGTTAC ATTAAAGACG 5760 AAATCTGGCA TTCAAAAGAG GTCAATCACA AAGCTTTGTC CACTTCCGAT TTCAACATAA 5820 TTATGATCAA CACACGGATC AAGAATCACA AGGAGCCTTC ATGCTGGACG ACGCATTGGC 5880 GGGCGGCATG TACGGATTAT GAGCGGAGTC ACCCGGTGCT TCAGCTGTCT TAACAGCGGA 5940 TTAACATTTT ATATATCGAT GCTAATTGAA CTGAACTGTA TTCTTTCTTT CGCTCTTGTA 6000 ATTTTGCGGT CATAGCGATC AGACGTGATT TTTGTATATG AAGAAATAAA TGAAGTTAAA 6060 TAGTGTATAT ATGATTCTTA TTTGCGAACC CCATTACAAT GATGTCAAAG CA 6112 // ID TABOR standard; DNA; INV; 7345 BP. XX AC AC007146; XX DR FLYBASE; FBgn0045970; Tabor. XX SY synonym: wolfman SY synonym: Pilgrim XX FT source AC007146:115349..122693 FT SO_feature five_prime_LTR ; SO:0000425:1..506 FT SO_feature three_prime_LTR ; SO:0000426:6840..7345 FT SO_feature CDS ; SO:0000316:1229..145 FT /db_xref="FLYBASE:;" FT SO_feature CDS ; SO:0000316:1531..2851 FT /db_xref="FLYBASE:;" FT SO_feature CDS ; SO:0000316:2787..6479 FT /db_xref="FLYBASE:;" XX CC Sequence & annotation from Horacio F. Naveira, September 21 2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7345 BP; 2928 A; 1303 C; 1286 G; 1828 T; 0 other; TGTAATGTGC ACACATATCG AATAAGCACT GTATCAAATC AGAACATTGG GAACAATCAC 60 ATTGTAGCAC TTTTGCAACA ATGTTTATGT AAGACTTTTC AGTTCCCAGG CATATCTTGA 120 GTGCTCAGCA ATCTGACCAC ACAAGAATGC TATTCAGACC AGAAGTGCAG AGTCAGCATA 180 ATTAGCATGC GCGACTGCTC GATCTTTATG TCCACATGAC TCTCTTAGAC AGCGGCGCTC 240 TCGCTGCCAA AGTTAAGTAC ATAAGAGCAA AGCAACGTCT CTGCCGACGG CCTCTCTGCC 300 GGCGCAGACG GATTGCATTT GGCTAGCTTG GACTCTTCTA GACCAAGTAC AAGGCAGTCG 360 TAAAGGAGTC GTCAAAGAGC CTTCAACATG TCCTAATTGA ATATTAATGA GTCTTAACAG 420 AAGTTACAAT TTTACTGATA TCATACTGAA TCTCTATTTC AATAAAAGTA ATATAAAGAA 480 CACAAAAATG CATTACAATT ATTACATGGC GACCGTGACA TGGTCACTTA AGCCTGTAAA 540 ACAATAATAA AAAGAAATAT ATAAAAGTTA AAATGCGAAC AGTGACCATT AAAAAATAAA 600 ATTGTGACAA TTGATCAAAC CCAGACGACA ACAGAAGCCG CAAACTGGAG GACTCTGCTC 660 CTATCGAGCA AAGGGACGCA CCTACTCTAC AAGAGGCACT AGAGGTGTGT CCAAACGATG 720 GACCACGCCC ACTCACAATA GCTGAGTATA GGGCAAGAAG ACAGCCGAAA CCAAAAATAA 780 AGAAGAAAAG AAGCGGCAAA AGGATCAAGC TTCTTCAACA ACGGAGATTA ATAAAGGACC 840 TAATAAAGAC GGCAACTACG GAGGAAGAGA AAACTAACCA AGCCAAAAAT CTGGAAGCGA 900 TTGAAGCCAA ATTGTGCAGT GGTGCGCAAT AACGCAGTTG AGGCTGTATT TAATACCAAT 960 GCCCCAATCT GCCTTAATAT TAAAAATACA TACCTCAAGC CGATGCGTGA AAACGCTGCT 1020 TGAAAAATAC TAATCTCTGT TAGGCTTTTA CTAAAAACAT GTGTACGGTA TTGAATTATA 1080 TAGAGCAAAA CATTGTAAAC CGTACACATG CCCTCTAACT AACTTTTTCT TGGACAGACA 1140 TACGTGTAGG GAGAAATTAT AGCAATATAC AATTTTTAAT CAAAATTTAA TAAATAAATA 1200 AATAATAATA ATCGTTCAAT TAACAACTAT GGGTTGGTTC GGATCTGACG ATAGTCAGAC 1260 CAAAGACAAC ACGGCTAATG TCGTGAACAA CGTCAAAATT GTAGATCACA CAGAAGATAT 1320 TCAAGCACTG TGGGTTTTAC TCCTAATTTT GACAATTACA AGTGTGGCCC AGTTCCTTCT 1380 GACGTTGTAC ATAAAACACA ACAAAATATT AAAAAGAAGA TATATGAGTA GGGCGAATAA 1440 TCTAGACAGG GTTTAAAAGA AAAAAAAAAA ATATATTTTA AAGAATGAGT ATATAAATAT 1500 ATACTAACAA GTAGGAATAT ATAAAATAGT ATGGAATGGA ACGAGTTATG TAAAAACATA 1560 ACAAGAATAA GAAACGAATT TGAAAAATCT CATAAATGTT TGTCGCAAAA TAGACCTATT 1620 TTAGGACCGA CAACAAAGAA ACATGCAAAT ATTCTGGTAA ATTCCTTCAA CGAAGCACGA 1680 ATACTAGTGT ATGATAACAA GGAAAGACTA AATCCAGATC ATTGGTCTCA GGTATCGAAA 1740 GTTCTCATAA AACTTAGATC GAACTTGTTA TCTGTAAAAT TGAAACTTGG TTTGGATATA 1800 TCAATACCAA CCATTTTAAA TTCGCCAATA AAAATAGAAT CGGACGAACA AACAGAAACA 1860 GAAATAGAAG ACGAAGATTT AAACAACTTA ACAATTCCAG CTATATTAAC ACTAGCTGAA 1920 TTGACGGAGG AAGAATTGGC TGAGTCAGAC ATAGAAGAAA CAGAAACAAA ATCTGTCATC 1980 ATGGTGGACG AAGCAGCTGC CCAGAGGGCA TACATAAAAG ACATTTCAAC GGCGATTCCG 2040 GAATTTGACG GTAAAAAGAT CAATTTGCGT AGATTTATCA CGGCAATCAA GTTGGTTAAC 2100 CTGACTAAGG GACCACATGA AGCTATTGCC ATTGAAGTGA TTAAATCAAA AATAATCGGA 2160 ACAACACTCT ATAGAGTACA GAATGAAGTT ACAATTGATG CCATAATCAG GAAATTAGAA 2220 GAAGTGGTCG TAGGAGAAAC AACGGACGTA GTAAGAGCAA AAATGGCAAA TGTTTACCAA 2280 AAAGGTAAAA CAGCCACACA ATTTACAAAT GAAATTGAAA ACCTACGCAA GTCTCTTGAA 2340 TCTTCATATA TAGATGAAGG ATTGCAACCA GAACATGCTA TCAAATTTAG CACAAAGGAA 2400 GCAATCAATA CAATGACAAA GAATTGCGAT CATGGAAAAC TGAAAGCAAT CCTAGAGGCA 2460 GGAACATTTA AGACAATGGA CGAAGCAATA AGTAAATACA TCCATTGCAG CACTGAGATG 2520 ACAGGAAGTG CAAGTTCCGT TTTGTTTTAC AAGAGAGGAC AAGGAAGTTA CACCAGAGGT 2580 AACTACCGAG GACGAGGAAA TGGTCGAGGT GGTAATAATA GAAATAATTA TAACCAGAAC 2640 ACCGGCCAAT ACAACAACTT TAATAATTAT AACAATAGCA ATGGCAGAGG ACGAGGAGGA 2700 TATAGAGAAT ATAACTACCA GAACAGAGGC GGTGGTAACT ATAACAACCA AAACCGCAAT 2760 TTCTCAAGTT ATAGCCAAAA TGGTAATGTC AGACATGCAC AAGGCACGTC GGAAAACCAG 2820 CAGGCTCCCT TAGGGCATCA AGAACAGTAA AACAAAAGGT TCATACCATC AATCTTAATC 2880 TAAATATTTT CATTAAAGTA AAAAATGAAC ACACAAATAA GATACTTACA TTTCTAGTAG 2940 ACACAGGCGC CGATATATCA GTCATTAAAG AAAACTCAGA TGAACTTTTG AATCTTGATC 3000 ATAATAATAT AACACAAATT ACAGGAATTG GAAAGGGTTC AATAAATTCA ATAGGTTTAA 3060 CACTTCTCGA GATGAGAACC GGTAACTATA TAGTACCGCA CAATTTTCAT GTTGTGGACG 3120 ACAATTTTCC GATCCCTGGT GACGGAATAG TTGGAATCGA CTTCATAAAA ACATTCAATT 3180 GCCAATTAGA TTTTACTACA GAAAGTGATT TTTTCATTCT AAGACCAAAC AATATTAAAC 3240 AAGCGATACA AATACCAATT TTTCATAATA TCGATAACAC TGAGATAACC ATACCATCTC 3300 GTTGTGAGGT TATTAGGCAA ATTACCGTAA GTTCGGTGGA TAACCAAATT CTAATACCCA 3360 ATCAGGAAAT AGAAGATGGT GTATTTGTCG GAAATTCGAT ATCAGATTCA AAAAACACGT 3420 ACATCCGAAT ACTAAACACA ACTAACAGCA ATAAAATTTT AAATGTAAAC AAAATCAATT 3480 TTGAACCATT AACTAGCTAC AAAATAGCAG ACCTAAACGA CTCCATAAGA GCCGAATCAA 3540 TTTTAAGCAG ATTAAAGAAA AACTTTCCAT CTGCACATAA GAAAATGTTG ACTGAACTGT 3600 GTTCACAATA TACTGATATT TTTGGATTGG AGACAGAGCC AATCACTACA AATAAATTTT 3660 ACAAACAAAA GATAAGATTG AGAGATGATG AACCAAGCTA TATTAAGAAC TATAGAACAC 3720 CCCATTCACA ACAAGCCGAA ATATCGAGAC AGGTAACTAA ATTAATAGAA GACAAGATAG 3780 TAGAACCAGC TGTATCTGAA TACAATAGCC CATTATTGCT AGTTCCAAAG AAATCCTTAC 3840 CGGATTCAAA AGAAAAAAAA TGGCGTTTAG TGATAGACTA TCGCCAAATA AATAAAAAGT 3900 TGTTAGCTGA CAAGTTCCCC TTGCCAAGAA TAGATGACAT ACTTGATCAA TTAGGCCGAG 3960 CAAAATATTT CTCGTGCCTG GATCTTATGT CGGGATTCCA TCAAATAGAG TTAGAAGAAG 4020 ATTCTCGAAA TATAACATCC TTTTCTACGA GCAGTGGCTC TTATCGCTTC ACGCGACTAC 4080 CATACGGATT AAAAATAGCT CCAAATTCCT TTCAACGAAT GATGACAATG GCATTTACAG 4140 GTCTGGAACC ATCTCAAGCA TTTCTGTACA TGGATGACTT AATTGTCATA GGCTGTTCTG 4200 AAAAACACAT GACCAAGAAT CTTACAAATG TTTTTGAACT ATGCAGGAAA AACAACCTCA 4260 AACTGCATCC AGATAAGTGC TCATTTTTCA TGAGTGAAGT CACTTTTCTT GGACACAAAT 4320 GTACTGATAA AGGCATATTG CCTGACGATA CTAAATATGA TGCTATACAG AGATATCCAG 4380 TTCCTACTGA CGCAGACAGT GCCAGAAGAT TCGTAGCATT TTGTAACTAT TATAGACGTT 4440 TTATACAAAA TTTTGCAGAT TATTCCCGTC ACATAACAAG ATTATGTAAG AAAAATGTAA 4500 AGTTCGAATG GACAGCTGAT TGCCAGCATG CCTTCGAACA CCTGAAAAAA CAACTAATGA 4560 ATCCAACTTT GCTGAAATAC CCCGACTTTA GCAAAGAGTT CTGTATTATA ACTGATGCAA 4620 GCAAAGAGGC ATGCGGAGCG GTACTGACCC AGAACTATAA TGGTATCCAT TTGCCAGTAG 4680 CTTATGCATC CCGTAGCTTC ACTAAGGGTG AGAGCAACAA GTCAACAACA GAGCAAGAGT 4740 TGTCAGCAAT ACATTGGGCG ATAAATCATT TCAAACCATA TATATATGGT AGACACTTCA 4800 CCGTCAAAAC AGATCACAGA CCGTTAACAT ACTTGTTCTC GATGGTTAAT CCAAGCTCAA 4860 AACTGACCAG AATGAGGCTT GATTTAGAAG AATATGAATT TACTGTGGAA TATCTTAAGG 4920 GAAAAGACAA TTACGTAGCA GATGCGTTAT CCAGAATAAC CATCGACCAA CTAAAAAATA 4980 TATCCAAACA AGTACTTAAA GTCACTACAA GAAATCAAAG TAGACAGGAA TCCTGCGCAG 5040 GAAAAGGAAA TAAAAATGAT AAAGTAGAAT TGCCTAAGCA AACTACTCAA GAAGCTTCTA 5100 AGCCCAAAGT ATACGGAGTC ATTAATAATG ACGAAGTACG CAAAGTAGTG ACATTGCATG 5160 TAAATAATAT GATATGTTTT TTTAAACATG AAAAAAAAAT TACTGCAAGA TACAACGTTG 5220 AAGATTTGTA TATTAATGGA ACTCTCGACT TAGGTCAATT TTTCCACAGG CTTGAAAAGC 5280 AGGCCGGTAT GCATAATATC AATCAACTTA AAATGGCACC GTGGGAAAAT ATCTTTGATA 5340 ACATTTCAAT AGATACATTT AAGAAAATGG GCAATAAAAC ATTAAAACTA TTAAGAGTAG 5400 CGCTACTCAA CCCGGTGACC TTAGTGAATA CAAAGAAAGA AAAAGAAGCA ATCCTGTCTA 5460 CACACCACGA CGATCCAACA CAAGGAGGAC ACGCAGGCAT TACAAAAACC CTGGCCAGGA 5520 TTAAAAGACA TTACTTTTGG AAAGGTATGA CTCGGGAAAT AACAGAGTAC ATACGGAAAT 5580 GTCCAAAATG CCAAAAAGCT AAAATTATGA AACACACAAA AACTCCTTTA TCAATTACAG 5640 AGACACCAAT AAGCGCATTT GACAGAGTCA TAGTGGATAC GATAGGTCCA CTACCAAAGT 5700 CAGAAAACGG TAATGAATAT GCTGTTACAC TTATATGCGA CCTGACAAAA TATTTGGTAA 5760 CCATTCCAAT CGCAAATAAA AGCGCAAATA CAGTAGCGAA AGCAATATTT GAATCCTTTG 5820 TACTAAAGTT CGGTCCAATG AAGACGTTCA TTTCGGACAT GGGAACTGAA TACAGAAATT 5880 CAATTATCAA AGACTTATGC AAATACCTGA AAGTAGAAAA TATAACTTCT TCAGCACACC 5940 ATCATCAGAC GGTAGGAACA GTGGAAAGAA GTCATAGAAC TTTTAACGAA TACATCCGGT 6000 CATACATATC GGTTGATAAA ACTGACTGGG ACGTATGGAT AAAATATTTC GAATTTTGTT 6060 TTAATACTAC ACCATCTATG GCACATAATT ATTGTCCATA TGAGTTGATT TTCGGTAAAA 6120 CATGCAATTT ACCAAAGCAT TTTAATAGCA CGGATAGAAT AGAACCCATT TATAATATAG 6180 AAGACTATCC TAAAGAAAGT AAATATAGGT TAGAAGTAGC ATATAACAGA GCAAGAATTA 6240 TGCTCGAAGA ACAGAAGAAA AAGAATAAAG AATTATATGA CTTAAAATTA AACGATATAA 6300 GTATATCAAT AGGAGATAAG GTGTTATTAA AAAACGAAAC CGGACATAAA TTAGATTTCA 6360 AATATACTGG ACCATATACG GTAGTAAAGA TTGAAGAAAG GGATAATATA GTAATATCAA 6420 ATGATAAGAA AAAACCACAA ACGGTACATA AGGATAGGTT AAAATTGTTT AGTTCTTAAA 6480 AAAAAAAAAT AAATAATATG GTGGCCACCA GCAAAAAAAA AAAATATATA TAGAGAAAAG 6540 AGTTAACCCC ACATAAGTGC ATTGAATGTA AGAAAACATT TTTCTTTTAT CATCTTTGAT 6600 GGTTGAACGA TTATAAACTA AAAATTTGAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 6660 ACCAAAAAAA AACCCCCAAG AACACGTATG TTTGTACAAA ATGTTCCTTA AATTTCCTTA 6720 ACATTAAAAT TACTTCCTTA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAT ATATATATAT 6780 AAAATTTTTA ATTATAAAAT AACTTCATAA AATTACGTTA TTTTCCAAAA GGAGGGAGAT 6840 GTAATGTGCA CACATATCGA ATAAGCACTG TATCAAATCA GAACATTGGG AACAATCACA 6900 TTGTAGCACT TTTGCAACAA TGTTTATGTA AGACTTTTCA GTTCCCAGGC ATATCTTGAG 6960 TGCTCAGCAA TCTGACCACA CAAGAATGCT ATTCAGACCA GAAGTGCAGA GTCAGCATAA 7020 TTAGCATGCG CGACTGCTCG ATCTTTATGT CCACATGACT CTCTTAGACA GCGGCGCTCT 7080 CGCTGCCAAA GTTAAGTACA TAAGAGCAAA GCAACGTCTC TGCCGACGGC CTCTCTGCCG 7140 GCGCAGACGG ATTGCATTTG GCTAGCTTGG ACTCTTCTAG ACCAAGTACA AGGCAGTCGT 7200 AAAGGAGTCG TCAAAGAGCC TTCAACATGT CCTAATTGAA TATTAATGAG TCTTAACAGA 7260 AGTTACAATT TTACTGATAT CATACTGAAT CTCTATTTCA ATAAAAGTAA TATAAAGAAC 7320 ACAAAAATGC ATTACAATTA TTACA 7345 // ID STALKER standard; DNA; INV; 7256 BP. XX AC AF420242; XX DR FLYBASE; FBgn0003519; Stalker. XX FT source AF420242:.. FT SO_feature five_prime_LTR ; SO:0000425:1..406 FT SO_feature three_prime_LTR ; SO:0000426:6849..7256 XX CC Sequence from BDGP, September 2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7256 BP; 2795 A; 1329 C; 1243 G; 1889 T; 0 other; TGTAGCATAT TGGACTAATC TACCCTAAGA ATACAATAGA TGATTGGGTA TAACATAGCG 60 TCAATACATT GTGACACTTT GTCATAATAA ATATAAATAT ACAAATATAC AAAAAGACCA 120 CCAAAAACTA CGTAAGCACT CCAGCGCCCC AGTAATACGA TCTAACGCTT ATACATAAGC 180 CGATCGCGGA GCGTGGGAAT GCTGAGCATG CACTTTGCAG CTCAAGTGGT CAATGCCTTC 240 TGCATGCATA TGTATATGTA TAAATGTAAG TAAGAATACA TAGATATAAG CAATGTATGT 300 GCGGGTTAGC TGAACCCAAC TTCAGCACAC TTTGATCATT CGAATAAACA GATTCAAACA 360 GAGCAGAGGT TCTGAGCTCG GAAAGCAAAT CTATTACATC TATTACATGG CGACCGTGAC 420 TCGGTCTCGA GTCTGTCTCT GTGTTGTGTG TGTCCGTGTA TGTGCGAGGG TCTTTAAACG 480 GTGCTAACTG TGCTGCAAGC GATGTTCGCG CTGCCTGCTG CATTTTCTAA TAAAATGCGT 540 TTGAGTAAAA CAAGATAACT GTGCACCAAA CCAGATTGTG TGGCATGCCA CGTGTAGCAG 600 CTGATGACCA GACAGCAAAA CGATCAGCAA GCACCAAACG CACAAAAACA CACAGACACA 660 AGCGGCAATT GATGGCAGAC GAGCCACAAT TGACCGAGGC GCAACCCGAG GTACAGAGGG 720 ACCAGCCAAC GTTGGGAGAA GCACTGGAAT TAAACCCCGC CGATGGACCG CGCCCACTCA 780 CAATAGCTGA GTACCGGGCA CGTCAGGAGA AGCGCCAACC CAGGAAACAT AAACGCTCTG 840 GACGTAGAGT GAAATTACTA CAACAACGCC GACTGGTCAA GGAAATGACC CAGTTGGCCA 900 AGGAGGAATC ATCCCGACAA CGCTACAAAG AGCGTCTTGA GGACATCGAA AGCAAGATTT 960 CGCAAGGTGC GAAACAACGC AAACGGGCTG CATAAATAAA TGCCAATGCC CCAATTTGCT 1020 TTAGATTTAA TTTCTACCCC AAGCCGATGC GGGAAACCGC TGCTTGGAAA ATACTAAAAT 1080 CTGTTAGGCT TTTTCACTAA CAATGTGGTT GGAGAATTTT TTTTTATTAT TGTAAACGAC 1140 TATGTGAGCC AACCACATAT ATTAACCATT AATATTTCCA CGTCTCTGGC ACTGAGTATA 1200 CATATATACT CAGCTGCAAA ATGTTATTTG TGTAAAATGA CAACAATGAA AAAAGTTCTT 1260 ATTTTGACTA ATATAAAAGA AAATATTAAT TTCATTTTTC CGATTTTCAA GAAGAAAATA 1320 TTTTTCCTTC CTTTAAGCTC CAAATAGAAT ATCTTATTTT TTTTCCTTTT TTTTAAAGAT 1380 CCGTTTCATT GAATATAGAT AAAATGGGAT GGTTTAGCGA TTCTAGTGAG GCAAAGGACA 1440 ATACTGCCAA CGTAGTTAAT AACGTAAAAA TTATAGATCA CACAGACGAT ATAAATGCGT 1500 TGTGGATCTT ATTGCTGATC ATTACAATAG TACTACTTCT ACAATTTCTG CTTACAATTT 1560 ATGTTAAGCA TAACAAGATC ATCAAAAGAC GTTATATAAA TAGGGCAAAT CGTTTAGACC 1620 AGATTTAAAA AAAAAATAAT ATGGATTAGA AGAAAGCTTA ATAAAAACTT TTTTTTTTAC 1680 GACAAGAATG GAATGGAGCG AAATAGCGAT ACGAATAGAC GAATTTCGCT TCAGGTTCGA 1740 TAAGTCTTAT AAATGTATCA ATAGAGACGC AGTAATAAAA TCCGAAACTT TGAAAAATCA 1800 TATAGAGATA TTAGTAGGAG AATATAATAA TATAGTTACA TTAGTAAATA AATATGCAAA 1860 TAGGCTCACA TCTGAACATA ATAACAAATG TTTGAGGGTT ATAAAATCCC TAAACACAAG 1920 ATTAAATAAC ATCAGAAAAA GAAGGCATAT TCTGATAGAT GTACCAGAAA GTCTAAGTCA 1980 ATTGGTTGAA TTCAACACAG ACCAGTTCAA AGAACTAGAC GAATCTGTTC AATCAAGCGG 2040 CGCTGAGTCC GATAGTGACA TTGAAACGCT AGAAGGAAGC GACCGAATTG AATTTAAATC 2100 TGAACCAATA AAAATTTCTG AGATGGCACA GACATTGATA GAATTTATCA GGCTAGCCAC 2160 ATCTCTGATA CCAGAGTTTG ATGGTAAACC AGAAAATCTA CAAAGTTTTT TGGATGCTCT 2220 AGGTCTACTA GATAGCTTAA AGAGCACACA TGAAACGACA GCAGTAAGCC TAATAAAAAC 2280 TAAACTTAAA GGCCATGTAA GAAACCTTAT AAGTAATGAG CAGACGATTG CTGCAATCAT 2340 TACCCAACTG TCAAGTGCAG TAAAAGGAGA ATCGGTAGAA GTGATATCTG CCAAGCTTCT 2400 GAATCTACAA CAGAGAAATA AAACGGCTAA CCAATACACC CAAGAGGTGG AGAAACTGAC 2460 AAAGGCCCTT GAAGGTGCCT ATATCAGTGA AGGTCTCAGC CAGTCCTTAG CAAATAAATA 2520 CAGCACTACA ACAGCTGTAA AAGCAATGAC ACAGAATTGC TCCATTGATA AGGTAAAACT 2580 TATCATGCAA GCAGGCACAT TCACAAACAT GAATGATGCC ATCTCCAAAT TTGTAAACAG 2640 TTGCACAGAG ATAACAGGTC AAAGTAACAC TGTACTCTAT TATCGACGAG GTGCAAATAA 2700 TTATAATAGA GGCGCCCGGG GTTATAATCG TGGTAGAAAT ATCAACCACA ACAATTACAA 2760 CCGGGGTAGC AATAACAACA ATAATAATAA CTATAATAAC CGTGGAGGTA GGCGAGGCCA 2820 AAACCAAGGG AGAGGCCGCG GAAACTACAA CCATGGTAAT AATAATAATA GCAGTGTGAG 2880 AATCGCGCAA AATACGTCGG AAAACTAACA GAACCCTTTA GGAAACAACC AATAAATGTA 2940 AAAGTTCATT CCATCAATTA TAGTCTTAAT ATATTCGTAA CCTTCTATAA TCATTCAACT 3000 GAAAATAAAC TAACATTTCT CATAGATACT GGTGCAGATA TCTCACTTTT GAAAGTAAAT 3060 TCTGATAACT TCGTAATTCA AAATGAAAAA ATAATAAACA TCGAAGGCAT AGGCCAAGGT 3120 GTGATAAAGT CTCAAGGAAC AACCTTAATA GAACTCCAAT CAAAAAAATA TATTATCCCA 3180 CATGAATTTC ATTTGGTAAA CCCAAATTTT GCAATACCAT GTGATGGAAT AATAGGCATT 3240 GATTTTATAA AGAAATTCAA TTGTCAACTA GATTTCAAAC CAAGTGAAGA CTGGTTTATA 3300 ATTAGACCCC AAAATTTAAA TTATCCAATA TATGTCCCGA TAACATATAG CGCTGGCAAC 3360 AATACAGTTC TTCTGCCAGC CAGATCACAA GTTATTCGGA AAATAGACAT TAATGTTGTA 3420 AATGATTTCA TATTTGTTCC TAATCAAGAA ATACACAATG GGATTTATGT TGCAAATACA 3480 ATAGCAGCAT CCAAACATGT ATACGTTCGA CTTCTAAATA CAACTAATTT CGACCAAGTG 3540 GTCAAAGTAA ATAAAATACA ATATGAAAAT CTAAAAGATT ATGACATTCA TAATACCGAC 3600 ACTGGAAATA GAAGCGAACA AATACTTTCA AAACTAAAGA AAAATTTTCC AGACCAATTT 3660 AAAAATCAAT TAACAGAATT ATGCACACAG TATAGTGATG TGTTCGGACT GGAAACCGAA 3720 CCTATATCAA CAAATAATTT TTATAAACAA ACATTAAGAC TTAAAGATGA TGAACCCATT 3780 TATATAAAAA ACTATAGAAG CCCGCATAGC CATATTGAGG AAATTCAAAA ACAAGTAGGG 3840 AAATTAATAA GCGACAAAAT CGTCGAACCG TCTGTATCTG AGTATAACAG CCCACTCTTG 3900 CTAGTTCCAA AAAAATCATT ACCAAATTCA CAAGAGAAAA AATGGCGATT AGTAATTGAC 3960 TATCGTCAAA TAAACAAAAA ACTTCTTTCG GACAAATTTC CACTCCCTAG AATTGATGAC 4020 ATTTTAGATC AACTAGGTCG AGCTAAATAC TTTTCATGCC TTGACTTGAT GTCAGGTTTC 4080 CATCAAATAG AACTTGAGGA AAACTCTAGG AATATAACAT CTTTTTCAAC GAGCAATGGC 4140 TCATATCGCT TCACGCGATT ACCATTTGGT CTTAAAATAG CACCAAATTC ATTTCAGAGG 4200 ATGATGACTA TATCATTCTC GGGATTAGAA CCCTCTCAGG CATTCCTTTA CATGGATGAC 4260 TTAATGGTGA TAGGATGTTC CGAAAAACAC ATGATTAAAA ACTTAACTGA CGTTTTTAAT 4320 GTATGTAGGA AATATAACCT AAAGTTGCAT CCAGAAAAAT GTTCATTTTT CATGCACGAA 4380 GTGACATTCC TAGGTCACAA ATGCACAGAC AAAGGAGTAT TGCCAGATGA CAAGAAATAT 4440 GACGTCATCA AAAATTATCC TGTCCCTCAT GATGCGGACA GTGCAAGACG ATTTGTAGCA 4500 TTCTGCAACT ATTATCGTCG ATTTATAAGG AACTTCGCCG ACTATTCACG GCACATAACT 4560 AGATTATGTA AAAAGAATGT CCCTTTTGAA TGGTCAAGCG AATACCAGAA CGCATTCGAA 4620 TACCTAAAAG AAAATCTTAT GTACCCCACA CTATTACAAT ATCCTGATTT TCGCAAAGAA 4680 TTTTGCATTA TAACGGATGC TAGTAAACAA GCTTGCGGAG CGGTTCTAAC TCAGAACCGA 4740 AACGGGATTC AGCTCCCAAT AGCTTATGCA TCACGTTCAT TTACAAAAGG AGAAAGCAAT 4800 AAGAGTACAA CGGAACAAGA ACTAGCGGCA ATCCATTGGG CAATTACCCA TTTTAGACCA 4860 TACATTTATG GCAAGCATTT CACCATTAAA ACGGACCACA GACCATTAAC GTACCTATTT 4920 TCTATGACTA ATCCCAGTTC TAAATTAACT CGCATGCGGC TAGAACTAGA AGAATACGAC 4980 TTCACAGTAG AATACCTAAG GGGGAAAGAT AATTTTGTAG CAGACGCACT CTCACGTATA 5040 AATATAAAGG AACTCAAAGA CATGCAACAT AAAGTCCTGA AAGTCACTAC TAGGCAACAA 5100 AGTAGACAAG AAAACTGTAC AGTAACAAAC AAGGAACTAT TGCCTAGGCA AAGTATCCAA 5160 AATGTATCTA AGCCCAACGT ACACGAAGTC ATAACAAATG ATGAAGTACG AAAAGTAGTG 5220 ACCTTGCGAA TAACTGAATC TATTTGTTTA CTAAAACGAG GAAATAAAGT TATTGCAAGA 5280 ATTGATGTTG ACGATTTATA TACCAATGGA ATTTTTGATT TAGGTCAGTT CTTCCAAAGG 5340 CTTGAAATGC AAGCCGGTAT ACTAAAAATC AGCCAACTCA AATTGGCACC GAGTGAAAAA 5400 ATCTTTGAAA CCATTTCAAT AGATAATTTC AAAAATATGG GCAATATAAA ATTGAAAACA 5460 TTAAGAGTAG CGCTACTCCA GCCGGTGACC ATTATAAAAA CTGAAAAAGA GATACAATCG 5520 ATACTGTCTA CATATCATGA CGATCCAATT CAAGGAGGTC ATACAGGCAT TACAAGAACG 5580 CTAGCGAAAA TAAAAAGACA CTATTATTGG AAAAATATGA CTCGTCATAT AAAAGAGTAC 5640 ATACGTAGAT GTCATAAATG CCAAATGTCA AAAACAACGA CACATACAAA GACCCCATTG 5700 ACTTACACAG AAACCCCAAC AAATGCTTTT GATATAGTGA TAGTGGACAC AGTTGGTCCA 5760 CTACCGAAAT CAGAATATGG CAACGAATAC ATCGTCACAC TAATATGTGA TTTGACGAAG 5820 TATCTAGTAA CCATACCTGT TGCGAATAAG AGCGCAAATA CTGTCGCAAA AGCTATATTC 5880 GAAAATTTTA TACTAAAGTA CGGTCCAATG AAGACGTTCA TTTCGGACAT GGGTACCGAG 5940 TATAAAAACA ATGTAATTCA AGATATGTGT AAATATATGA AAATTGAAAA TCTTACATCC 6000 ACTGCATATC ACCACCAGAC TTTAGGGACA ATCGAACGAA GTCATAGAAC ATTCAATGAA 6060 TACATTCGTT CATACATCTC TGCAGATAAA ACTGATTGGG ACGTTTGGAT ACAATACTTT 6120 ACATATTGTT TCAACACAAC ACCATCAGTC ATGCATAATT ACTGTCCATA TGAACTAGTC 6180 TTTGGAAGAT TACCAAGGCA GTTCGCAAAT TTTAATAAAA CAGATAGAAT AGAACCACTG 6240 TATAATATAG AAGATTACTC AAAGGAAATA AAATTTAGAT TAGAAATAGC ATATAAAAGA 6300 GCTAGACTTT TGTTAGAAAA AGCTAAGTCT TATAGAAAAC AATTTTATGA TAAGAAAACT 6360 TCAGATTTTC AATTAAAAAT AGGAGATAAA GTTATACTAA GGAACGAATC GGGTCATAAG 6420 TTAGATCCAG TATATATAGG CCCTTATACT GTAGAAACCA TAGAAGACAG AGATAACATA 6480 GTAATTAGAG ATACAAAACA AAAGAAGCAA AAAGTACATA AGGATAGACT AAAAATATAT 6540 AATCAATGAA ACGTTTCATT TCACTTAAGA AAAGGTCTGA TCAACCTCAA AACAAAAAAA 6600 AAAACACAAA AAAAAATTTA ATTATTATTT TTCCTTCTAA GAAAGTTAAA CATAAATCCA 6660 AAAACATCGT AATTCAACAT ACATTTTTTG TATTATTCTG TCATTATACA AAAATGCTTT 6720 GAGACAAAAC ATTGCTAATA ATTAATAAGA AAAATCAATT TCAAAAAAAA TTTTTTCCTT 6780 TCTAAACACA ATATTAATAT TGAGAACTCA ATGACTACAT ATATTACGTC ATTTCTTTAA 6840 AAAGGGAGGT GTAGCATATT GGACTAATCT ACCCTAAGAA TACAATAGAT GATTGGGTAT 6900 AACATAGCGT CAATACATTG TGACACTTTG TCATAATAAA TATAAATATA CAAATATACA 6960 AAAAGACCAC CAAAAACTAC GTAAGCACTC CAGCGCCCCA GTAATACGAT CTAACGCTTA 7020 TACATAAGCC GATCGCGGAG CGTGGGAATG CTGAGCATGC ACTTTGCAGC TCAAGTGGTC 7080 AATGCCTTCT GCATGCATAT GTATATGTAT AAATGTAAGT AAGAATACAT AGATATAAGC 7140 AATGTATGTG CGGGTTAGCT GAACCCAACT TCAGCACACT TTGATCATTC GAATAAACAG 7200 ATTCAAACAG AGCAGAGGTT CTGAGCTCGG AAAGCAAATC TATTACATCT ATTACA 7256 // ID INE1 standard; DNA; INV; 611 BP. XX AC U66884; XX DR FLYBASE; FBgn0026416; INE-1. XX SY synonym: mini-me SY synonym: DINE SY synonym: narep1 SY synonym: Dr. D XX FT source U66884:4880..15490 XX CC This is presumably a dead element. CC Derived from U66884 (e1371475) (Rel. 52, Last updated, Version 6). CC Michael Ashburner, 28-Sep-2001. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 611 BP; 193 A; 123 C; 93 G; 202 T; 0 other; TATACCCGTT ACTAGATTCG TTGAAATGAA TGTAACAGGC AGAAGGAAGC GTCTTAGACC 60 ATATATAGTA TATACATACA TGTATATTCT TGATCAGGAT CAATAGCCGA GTCGATCTTG 120 CCATATCCGT CTGTCCGTAT GAACGTCGAG ATCTCAGGAA CTATAAAAGC TAGAAGGTTT 180 AGATTCAGCA TACAGAGACA AAGACGCAAG TAGCCATGCC CACTCTAACG TCCACAAACA 240 GCGCAAAACT ATCACGCCCA CACTTTTGAA AAATGTGTTG TTCTTTTCAC ATTCTGATTA 300 GTCTTTTACA TTTCTATCGA TTTCCAAAAA AAAACTTTTT GCCAACGCCC TAAAACCGCC 360 CAAAACTCCG ACACCCACAT TTGTAAAAAA TTGTTGGGAA TTTTTTTCAT AAATTTATTA 420 GTTTATTATT TATTATAAAT TTAAGTTTAT ATCGATTTGC CGACAACATA TTTTAATTTT 480 TTTTCTCATT TTATCTTTTA TCTATCGATA TCCCAGAAAA ATTGTGCAAT TTCGCATTCA 540 CACTAGCTGA GTAACGGGTA TCTGATAGTC GGGAAACTCG ACTATAGCAT TCTCTCTTTT 600 TGAAATTGCG G 611 // ID GTWIN standard; DNA; INV; 7411 BP. XX AC AC006215; XX DR FLYBASE; FBgn0063436; gtwin. XX SY synonym: hamilton XX FT source AC006215:36795..44204 FT SO_feature five_prime_LTR ; SO:0000425:1..494 FT SO_feature three_prime_LTR ; SO:0000426:6917..7410 FT SO_feature CDS ; SO:0000316:981..2357 FT SO_feature CDS ; SO:0000316:2729..5392 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 7411 BP; 2298 A; 1753 C; 1619 G; 1741 T; 0 other; AGTTAACAAC TAAGAGCACA CACTATTAAG AGCACACATA TAACAAATTA ATTAATATAA 60 ACAATGCTGA CGCGCCCAAA CTGAGTTCAG CGCTCTGCGC CACGAACGGT CAGCAACAGC 120 AATCGGACAC CCCTTATCCG GGGTACCGAG CTATGTTGCA TAAATGCTGA GTCGGCTTGC 180 CGACCATGGC TTTATGGCGT GATGCATTGG AGCCACAGTA GATTTACATT TTCATATTCT 240 TTTGTATTCA GTCTTAAGCC AGAGCTTTAA TAAAGAGCAG CTATTCACTC CGGCTCGCAG 300 CCGTAAAATA TCATTTTATT ATTTGAACAT TCTCGCTGAG CCAAAAGGCC AGCGATCGAC 360 AAAGAGGGCA AAGTCACGCC CTCCGACATA AACTTCATAA GAAGATATTA TCCTATTGTC 420 CCTGGTAGCA GCCAGCACCA GATCTCCACC GCCTGGAGAC CACCGGACAC ATCCCGAAGG 480 AGGAACACAT TAATTGGCGC CCAACGTGCA TTTGAACCTG CACCTAGTTA TATCTTAACA 540 TAACACTTAG TTGAGAATAC AAGCCATGTA AGTTTGATTA ATGAAGTCAC TCGGCAACCT 600 TAAGAATTTA GTGTGGAATA TTAAATAAAT AAAAAAAGCT AACAGCCATT TCTTTGCATG 660 CTTTTAGGGA GCATTATTTT CCACAGCATA CTTTTACCAA GTTACAATTC TGTAGCATAC 720 CCCAAGTTAC CCGCTCGTTT CAATTGCATA CTTTTACCAA GCGTAAATTC TACAGCATAC 780 CCTAGGTTAC TTTGAACTAC TGCCCAAACG AATTAATAAG AAATAATTAT TTCTTCTGCG 840 CATACTTTTT CCTTGTTTGT TTGTAGGGAT TTCCTTTTCC TTTGTGTCGC CTTCTATTTC 900 TCGCGTGTGC AAATATCGCG CCTCGTATGG CCGATCAACC GTGTTCGGAA AGAATTAAAT 960 TGCGTGGTGT CAACCAATAA ATGAGTTGGA CGGCATATAG AGACATAAAA GTCGAGCGCA 1020 ATAGTGACGG CGAGTTTGAT TTAGAGCCCG TAGGTTTTTG TCCGAATACT ACAAATAGCA 1080 CTGCAGCCAT GGACGCCGCA CAGCTTCAAG CTATTATAGC GGGAGCCGTT AACCAGGCTT 1140 TGACTGAGCA GGAAAACAGA CTTAAGAGGG ATTTCCAGGC GCAATTAGAT GAGGTAAAAC 1200 AACAGATGCA ACACTTGCGT GTGGAAGCAC CACAAGTAGA GACATACCAA AAAATAACCG 1260 CAGGCCCGGA AGTACGGTGC GACATCAAGC TTGACATTGT GAAGACAATG CCAGACTTTT 1320 CGGGAGAGCA GGATGACTAT GTGTCATGGC GGCAGTCAGC CGTGGACGCA TACGAGATCT 1380 TCAAACGATA TAATGGTAGC GAGGCGCATT ACCAGGCCGT TTCGATTATC AGAAACAAAA 1440 TTAGGGGGCC AGCAAGAGGG CTGCTAGTTG CCCATAACAC AGTCCTAAAT TTCGATGCCA 1500 TTATTGCTCG ACTAGACTGC ACCTACGCGG ACAAAACGTC GCTGCGTGTA CTTAGGCAAG 1560 GACTCGACAC AGTTAGGCAA GGGGAGCTTA GCCTCATGCA ATATTACGAC GATGTAGAGA 1620 AACGGTTGAC ATTAATAACA AACAAGATAG TTATGACTCA CGAACCGGAC AGTGCTATTC 1680 TTTTTAATAA TGAAGTTAGA GAGGATGCTC TCCACGCGTT CATTGCCGGT CTAAGGAAGC 1740 CCTTGAGGGC CATAGTCTTG CCGGCGCAAC CCAAGGATTT GCCCTCAGCT TTAGCTTTGG 1800 CTAGGGAATC CGAGGCTACC ATAGAACGCA CTAACTTTGC CGCTACATAC GCAAAGGCTC 1860 TAGAGGACAA GGCAATCACC TATGAGCACC GAAAAGATCG GAACCACTCA AGGGAACCGC 1920 ATGGGAGATA TAGCAGGGCC GAGGACAGCC AAGGGAAAAA CCCTCATTTT TACAAGAAAC 1980 AGGGTAGACA AAATAATTCT GCCCAGAATA ATAACAGCTA CCGGAATCAG TCACCTGAGC 2040 CTATGGAGTT AGGCTCCACT GTAACCCAAC ATAGGCAACC GACCTCCTTC GGAAGCGGTC 2100 AGCCTGCTAG CGTCAAGGCC GAGGGTGCTC GTGGCCAGAA GAGATTCGGC TCCGCACGCA 2160 TGACTGGTCA AAGGCGTCAG AGGGTGCAGA ACACCATCCA ACAGGCTGAC GATAAAAACG 2220 ACAGCGAATA TGAGGGCGCG GCCGCAGCGG CCGTTACCGA GATAGATGAC GAGTCTGATG 2280 AAAATGATCA GATCAATTTT TTAGGGAGCG CTCCCGACTG CCGTTCATTG AACGCCAGTT 2340 CCATGGGAGG AAATTGAGAA TTTTAATAGA CACAGGGGCG GCGAAAAACT ACATCAAGCC 2400 CGTAAAGGAG CTAAAAAATG TGGTGCCGGT CAATTCCCCT TTCACAGTTA GTTCCATCCA 2460 CGGTTCTAAC ACAGTCAAAC AGAAATGTTT AGTTAACATT TTCGGGAAAA CAGCGGATTT 2520 TTTCCTTTTG GACACGTTAA CACTTTTCGA TGGCATTATT GGATTTGATT TTCTAGCTCA 2580 AGTTGGAGCA AAGCTAGACA CAGAAAAAGG CACAATCAAC TATGGTTCTG TTGTTGAAGA 2640 GCTACAGCAT CATAGTTGCG ACGAGGTAAA TTTCACTAAC GTGAATGACG CTACGGTGCC 2700 AGAATCTGTA AAAACTGAGT TTACAGCCAT GATCGCAAAC AGATCTAAGG CATTTTCAGC 2760 CTCTAACGAG GCACTGCCGT TTAATACTTC TGTTGTTGCC ACAATTCGCA CAAGCGATGA 2820 TAAGCCAGTA TATTCCAAAT TATACCCGCA CCCCATGGGT GTCTCTGAAT TTGTTAAGCG 2880 AGAAATCGCA GACTTACTGC AGAAAGGCAT TATTAGAACC TCAAGGTCGC CGTACAACAA 2940 TCCAACATGG GTTGTTGACA AAAAAGGCCA TGACGAACAG GGGAACAAAA ATAAGCGTTT 3000 GGTTATAGAC TTTAGGAAAC TTAACGAAAA AACCATCGCT GATAAGTACC CAATGCCAAA 3060 CATCCCCATG ATACTGGCAA ATTTAGGAAA AGCCAAGTAC TTCACAACAT TAGATTTGAA 3120 GTCCGGCTAC CACCAGATAT ACTTGGCCGA ACATGACCGT GAAAAGACCT CCTTTTCGGT 3180 GAACGGCGGA AAGTATGAAT TTTGTCGATT ACCGTTCGGA TTGAAGAATG CGGGAAGCAT 3240 CTTCCAAAGA GCGATCGACG ATATCCTACG GGAACAAATT GGCAAGTCAT GCTACGTTTA 3300 TGTAGATGAC GTCATTATTT TTTCTGAAAA TGAAAACGAC CATGTTAAGC ACATAGATTG 3360 GGTCTTAAAA AGCCTGTGTG ATGCCAACAT GAAGGTGTCC AACGAGAAGA CGCACTTTTT 3420 TAAGCAAAGC GTTGAGTACC TTGGATTTAT TGTCACCAAT GGAGGTGCAA AAACCGACCC 3480 AGAAAAGGTA AAGGCCATAA AGGAATACCC AGAGCCTACA AATTTGTATG AGCTAAGGTC 3540 ATTTTTGGGT CTGGCCAGTT ATTACCGCTG CTTCGTTAAG GACTTTGCGG CGATCGCTAG 3600 ACCGTTGACG AGCTTGATGA AAGGAGAAAA CGGCTCCATC AGCAAGCACA TGTCCAGGAA 3660 GACTCCCATC GAGTTCGGTG ATTTGCAGAG AGATGCATTC GAGAGGCTGA GAAATGTCTT 3720 GGCTTCTGAA GATGTAATTC TCAGATACCC TGATTTCAGG AAGCCATTCG ATCTAACGAC 3780 GGACGCTTCC GCAAACGGTA TCGGTGCAGT CCTATCGCAA GATAAAAGAC CCATCACTAT 3840 GATCTCTAGG ACCCTCAAAG AAAGCGAGTC ACACTACGCC ACGAATGAAA GAGAATTGTT 3900 GGCCATAGTG TGGGCCTTAG GCAAGTTGCA ACACTACCTG TATGGCACTC GCGATATTAA 3960 CATTTATACG GACCACCAAC CGTTGACATT CGCGGTATCC GATCGGAATC CAAATCCGAA 4020 AATAAAAAGA TGGAAAGCGT ATATCGATGA TCATAACGCA AAAATCCACT ACAAACCGGG 4080 AAAAGACAAC CATGTGGCAG ACGCTCTTTC CAGGCAGAAC ATTAATGCCT TACAAAGTGA 4140 GCCTCCGTCA GACGCTGCGA CTATTCACAG CGAGCTGTCA CTGACCTACA CGGTCGAAGC 4200 AACAGACCAG CCAGTGAACT GCTTCAGAAA CCAAATTGTT CTAGAAGAAG CACGTTTCCG 4260 ACTAAAGCGA AGCTTGGTGT TGTTTCGTAG TAAAACTCGC CACTTAATCA ATTTCGCCGA 4320 CAAAAGCACC ATTTTGGAAA TGCTAAAGGA GGTCGTGAAC CCCGATGTCG TGAATGCGAT 4380 ACACTGCGAT CTACCCACCC TGGCAAGTTT TCAACACGAC CTAATTGTTC ATTTCCCGGC 4440 TACGCAATTT CGATACTGTA AAAATATTGT AATAGACGTT ACCAATCGAA ATGAGCAGTT 4500 GGAAATTGTC ACGACAGAGC ACAATCGTGC ACACAGGGCA GCGCAGGAGA ATACCAAGCA 4560 AATCCTCCGC GATTACTATT TCCCCAAAAT GAGCAGCCTG GCAAAGGAAG TAGTTGCAAA 4620 TTGTAAGATA TGCACCAAAG CCAAATATGA CAGGCACCCT AAGAAACAGG AGCTCGGGGA 4680 AACACCTATT CCTAGCTATA CCGGCGAAAT GCTGCACATT GATATCTTTT CGACCAATAA 4740 GAAACAATTC TTAACGTGCA TTGACAAGTT CTCAAAGTTT GCAATAGTGC AACCAGTGCT 4800 GTCCAGAACA ATAGTGGACG TCACAGGGCC CCTACTTCAA CTCGTAAATT TGTTCCCCAA 4860 AATCAAGACA ATATATTGCG ACAACGAAGC CGCCTTCAAT TCAGAGACTA TTACCTCGTT 4920 ACTTAAGAAC AGCTACCACA TTGACATTGT CAACGCGGCC CCGCTGCATA GCTCGTCAAA 4980 CGGCCAGGTG GAACGATTCC ACAGCACCTT GTCAGAGATC GCCAGGTGTC TTAAACTAGA 5040 CAAAAAGATT AGCGATACGA CAGAATTAAT TTTGAGGGCA ACGATAGAAT ATAATAAGAC 5100 CCTACATTCT GTCACTCAGG AGAAACCGGT CGAGATCCTC CACTCGGGTC CCGATGATCG 5160 CTGCCTAGGC ATCAAAGACA AGCTGGTAAA GGCCCAGAAA AATAACATCG AAAGATGCAA 5220 CCCAGCTAGG CAGAACCGTG TTTTTGAGGT AGGAGACGAG GTTTTCGTTA AAAACAACAG 5280 AAGGTTAGGA AACAAGCTAA CCCCGTTATG CACAGAACAA AAAGTGCAAG CAGACCTGGG 5340 AACGTCTGTT CTTATTAAGG GGAGGGTGGT CCACAAGGAC AACCTCAAAT AAAAAATTCC 5400 CACTTTTATT TTCCTATTCC TAATTGTAAG CCAGATTTAA GTAAATATGC CGTGGTCTCA 5460 ATCTACGTTT GGTTTGCAGG TTCGCCCTCG TAACACTCAT CATCCTGGCA GTGGCAAATG 5520 CACGGATTAC CGACTTTTCC CACGCCAAGT ACATTCCCGT CGTAGATGGA GATGTACTGG 5580 TGTTCGAACA TCGTAACTGC CTGAGGCATT CGAGCAACCT GTCTGATTAT ATTTACATGG 5640 TAGATGAAAC AAAGAAATTG TCCGCTTCTT TTCCACAGTC GCATATGCGC AAGTTGTTAG 5700 ACGTTGATAC AGATCACCTG GTAAACTTGT TGTCCGTTTT AAAAATACAC CACCGTATCG 5760 CTAGAAGCTT AGACTTCCTG GGTACAGCTC TTAAGGTAGT TGCGGGAACT CCCGACGCCT 5820 CAGATTTTCT AAAGATCAAA ATGACCGAAG CTCAGCTGGT AGATTCCAAC TCCAGGCAAA 5880 TAAATATAAA TTCCGAAACC CAAATACAAA TAAACAAACT TACCGACACC GTTAACAGAA 5940 TTATTAAAGC CCGAAACAAC GACTTGGTCG ACACCCCGCA TCTGTACGAG GCATTGCTAG 6000 CGAGGAATAG AATGCTGGCA ACAGAAATCC AAAATTTAAT TCTAACAATA ACATTGGCTA 6060 AAGCTAACAT AGTAAATCCC ACAATCCTTA ATCATGCCGA TTTGAGCTCA TTAATTTAAC 6120 AAAACACTCC AATAGTTAGC TTATTAGAAG CCTCTAAAAT TAGGGTTCTT CAGTCCGACA 6180 GCATTATTCA CATACTAATA GCCTATCCCA GGGTGAAGGT AAAATGTAAG AAGGTCCTCG 6240 TATACCCAGT ATCACACTAT CAGACAATCC TGCGACTCGA TGAAGACACC TTAGCGGAGT 6300 GTGAGGAAGA CACCTTTTCG GTCACCGAGT GCGTGGAGAC CACGCACGAC ACTTTCTGCG 6360 AGCGGTCGCG ACGCGATACC TGCGCCCGCT CACTCCATGC GGGAAATACT GCCCAATGCC 6420 ACACGCAATC CAACCACCTT AGGGCAGTAA TGCCTATAGA TGACGGCATA GTTATCATCA 6480 ACGAGGCAAC AGCCCGCGTC AGCACGGATG GAGGCCCAGA GGTGCTCGTC AAGGGGACGC 6540 ATCTAATTAC ATTCGAACAA TCAGCTACTA TCAACGGTAC GGAGTTCGTA AACCTCCGTA 6600 AGGCAATAAA CAAGCAGCCT GGCGTAGCAA GATCGCCGCT ACTGAACATC GTCGGCCACG 6660 ACCCGGAACT TAGCATGCCC TTGCTTCACC GCATGAACAA CGACAACCTG CGCTTTATCC 6720 AAGGATTCAA AGACGAGGTT GACGCCGCGG GTTCCCCCAA ACTTTGGTTC GTGGCTGGAG 6780 TAGTCCTCAA CGTTGGACTG ATTGGTTCGC TTATCCTTTT TCTGGCATTA AGGAAACGGC 6840 GAGCCTCCGC TGAGATTCAA CAGACCATCG ATAAACTCAA CATAACCGAG GACGGTCATA 6900 ATCTTAGGGG GGGAGTAGTT AACAACTAAG AGCACACACT ATTAAGAGCA CACATATAAC 6960 AAATTAATTA ATATAAACAA TGCTGACGCG CCCAAACTGA GTTCAGCGCT CTGCGCCACG 7020 AACGGTCAGC AACAGCAATC GGACACCCCT TATCCGGGGT ACCGAGCTAT GTTGCATAAA 7080 TGCTGAGTCG GCTTGCCGAC CATGGCTTTA TGGCGTGATG CATTGGAGCC ACAGTAGATT 7140 TACATTTTCA TATTCTTTTG TATTCAGTCT TAAGCCAGAG CTTTAATAAA GAGCAGCTAT 7200 TCACTCCGGC TCGCAGCCGT AAAATATCAT TTTATTATTT GAACATTCTC GCTGAGCCAA 7260 AAGGCCAGCG ATCGACAAAG AGGGCAAAGT CACGCCCTCC GACATAAACT TCATAAGAAG 7320 ATATTATCCT ATTGTCCCTG GTAGCAGCCA GCACCAGATC TCCACCGCCT GGAGACCACC 7380 GGACACATCC CGAAGGAGGA ACACATTAAT T 7411 // ID GYPSY2 standard; DNA; INV; 6841 BP. XX AC AL035631; XX DR FLYBASE; FBgn0063435; gypsy2. XX FT source AL035631:complement(24724..17886) FT SO_feature five_prime_LTR ; SO:0000425:1..351 FT SO_feature three_prime_LTR ; SO:0000426:6489..6840 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 6841 BP; 2002 A; 1658 C; 1549 G; 1632 T; 0 other; AGTTAACCAA AGCTAACGTC GCTCCCACGG GCGTATGAAA TACGACAACA TCGGCATACT 60 GCAACGTAAG CATTATGCCG GATGTCGATA CGTAGCCGGC AGACACTGCA GATTTGCGTG 120 GCCGCTACGA ATATGAGATC GCAACAATGT GTTTCGTTTT CATGCTAGCT CCAAAGTTTG 180 AATTCTTGTA TTTTGTATTC AGTTCTGATT TTGTCTCCGA CGCTACAGCT CCGCTTAGCT 240 CGGTATAATA AAACATTAAA AGTAACTCCT AATCTTTTAT TGTCACGGCT CCGCCTGACA 300 CTTGGTTAGT AAAAGCCGCG CGCGTTAACC GCCGGCCAAC CCTTTTTCTA ATTGGCGCCC 360 AACCAGTGGT ATTTGACAGT GCGTTAGTCA TTACCCACGA ACAAAAAAAA AAAAAAGAAG 420 TGCAAAACAA ACTTAAAAAA AAAAAATCTA ACTAAAAGTG CAAAACAAAC TAAAAACCTA 480 ACTACAAGTT AATCAGACTC AGCCGCATTA ACAGGGCTGT AGCAACAAAA CAAGTGCTTG 540 TGACCACTGT TGTTTGTAAC AGAACTATTT AACAAGTGGC TGCTGCTGAC CTACGTCAAC 600 ACAGGCAGCA CAGCTCCGTC ATTAACCTCG CCTCTTGAGT GCAGTTGGCT GAGGTGCGCT 660 AAAGCGCAAA CTTAATTTCA CGTCACACCG AAGCCGCGGG TGCTCTTCGG GCAGCGACTT 720 ATCGCCAGAT TAGCGCGCAA GCTATCTGGA CAGCGGAGAC GGCAGCGCGG GAGGCAGAGA 780 CAAGCAGTGT ACAAAAAAAC AAGAAACCAA CTCCAGTGTA GAACACAGCA CAACACCAAA 840 AAAAATGTAA GTGGGCACCT TTTTCTAAAA CTGCTCTTAA ACCTGCGAAA TAAAATTAAA 900 CATATAAACA CATTTTTGCA AGTGTCTTCT TAAATGAAAT TAAATAATAA ATAGTTTTAA 960 CATACATGAA ACTAGTAATG GATTAGGTTG ACGGAAAACT ATTGGCAGCA AAAATATATA 1020 AACATTTGTT GAATAATTAC TAAAAGCGGC GCATCAAATG CATGCACGCA CAATTTGTTT 1080 TTTCCGTCTG CCTTCATCTT GTTCGTTTTG TTTGGTCGTT CAACGTAATA CGATCCATGG 1140 AATTTTGCAA CTTTGTTGCC TCCGTCCTCA AGCGTTGTGT TCGAGTTTTC GTTGCTTACA 1200 TTAGAGGCAT ATGACTTCAC CCTTCCTTTT CTATAGAAAC ACTCGCCGAA AGTGTAGTGA 1260 TTCTGACTCT GATTGCGACG ACCATCCCAT TCGCTGTTCA GTGCCAAGAC GCACACCCTC 1320 ACCCCCACCC TCACCACGTC CTGCAGACAG CGTCATGGAC CCAGACCAGT TGAAATTCGT 1380 AATACAGGCA GCCGTCACCG CTGCCTTAGC TGAGCAAGCC GCCGCAAATA AGGTCTTGCT 1440 TGACAAAGTC AACTCAATGT CCCAGCAGCT GGCCGCAGCG CATATTACGC CACCAGAGGT 1500 GCAAGCCTAT GCACCCATAG AGATAAGAAA TGACATCCGC TGCGACGAGC CACTAGATGC 1560 CGTCAAATGC TTGCCCGAGT TTGCAGGCGC ACATGAATCG TATGTTTCAT GGAGACAGGC 1620 GGCTCTCGCA GCCTATCGTA TTTTTAGACC GTACGACGGC AGTTCACGCC ATTATCAGGC 1680 GGTCATAATT ATAAGAAACA AAATCAGGGG GGCCGCGGAC GTTGTTCTAG CCTCTTTCGG 1740 CACAGTTCTT AACTTTGACG CGATTATAAG TCGCCTCGAC TTCACGTACA GCGACAAACG 1800 TCCAGTTCAG GTAATCGAGC AAGAGCTGGC CACGCTGAGA CAAGGCAAAA TGTCCCTGCT 1860 ACAATACTAT GATGAAGTCG GCAAAAAGCT TACATTGTTG ACTAATAAAG TCAACATGTC 1920 ATACGAGCCG GTCCTAGCAA AGGGCCTCTG CGAAAAGTTC CGCGAAGATG CACTACGTGT 1980 GTTTGTTTCG GGACTCAAGC GTAGTCTCAC AGATGTGCTG TTCTCAGCAA AGCCAAGGGA 2040 CTTGCCGTCA GCTCTGGCGC TCGCGCATTT TTCATTTTAC CAGACTTGTC AACCTTCGAT 2100 GGGATAATCG GTCTCGATTT GTTAGCTCAG GCTGGGGCGT CACTCTGTTT GGCCTCCGGT 2160 CAGCTCAAAT GGGGTACGGA AGTTGAGAAA ATCTCCTTCC ACAAATGCAC TGATGTCAAT 2220 TTCACCGATG TGGATTGCTC AGATGCACCC GCTTCAGTGC GGGAGACTTT TCGGAAATTA 2280 TTAAAGGCCA GAAAAAAGGC CTTTGCAGAC CCAAACGAGG CTCTACCGTA CAATACTTCG 2340 GTGGTTGCCA CCATCAGAAC GGTGAGCGAA GAGCCCATCT ATGCCAAGCT GTACCCATAT 2400 CCCATGGGCG TAGCTGACTT CGTTAATAAG GAAATCCAGG ATCTTCTAAG AAACGATATA 2460 ATTCAGAAAT CGGCATCCCC CTACAACCCC ATATGGGTGG TAGATAAAAA GGGCACCGAT 2520 GATACGGGAA ACCGCAAAAA GCGCTTGATG ATAGACTTTC GCAAGCTTAA CGAGCACACC 2580 ATTCCCGATA AATACCCCAT GCCAAATATA TCAATGATAT TAGGCAATTT GGGCAAAGAA 2640 CGCTATTTTT CGACACTGGA TCTTAAATCA GGATACCATC AAGTCGTACT AGCGGAGCGC 2700 GATCGGGAAA AAACTTCTTT CTCGGTAAAC GGAGGAAAGT ATGAGTTTAA GAGATTGCCA 2760 TTCGGCCTCA GGAACGCCGC CGGCATCTTC CAAAGAACGA TCGATGACAT CCTACTGGAA 2820 CAAATAGGCA AATTTTGCTA TGTTTATGTT GACGATGTGA TTATCTTCTC GCAAGACGAG 2880 GAGGCTCACA TCAAACATGT AGATTGGGTG TTAAAGAGCT TACAAGAAGC TAACATGAGA 2940 GTATCGATCG AGAAATCGTG TTTTTTTTAA GAAAAGCGTG AGCTTTCTCG GGTTCATTGT 3000 CACCAGTAAC GGTGCAACAA CGGACCGGTT ACCGGAAAAA GGTAAAGGCC ATAAAAAAAT 3060 TTCCAGAGCC TAAGACGGTA TTCGAAGTCA GATCGTTTCT GGGCCTCCCA AGCTACTATA 3120 GGTGTTTCAT CAGAGACTTC GCTGCTATAG CAAGGCCCAT TTCAAACATA TTAAAGGGCG 3180 AAAATGGAAT AGTTAGTAGG CATAGATCGA GGAACATTCA GGTGCATTTC TCTGAGTCCC 3240 AGCGAGAGGC GTTCCAAAAG CTGCGCAATA TATTGGCATC AGAAGATGTC ATGCTCAGCT 3300 ACCCGGACTA TAAGAAGCCA TTTGATCTAA CGACAGATGC TTCAGCCTAT GGTATAGGTG 3360 CGGTGTTGTC TCAAGAGGGC CGCCCTATAA CAATGATTTC AAGAACTCTT AAAGGCAGTG 3420 AAGCCAACTA CGCGACCAAC GAGCGTGAGT TATTGGCCAT CGTTTGGGCC CTGGTTAAAC 3480 TGCGGCATTA CTTGTATGGA GTGAAAGATA TAAATATCTT CACTGACCAT CAGCCGTTAA 3540 CATTTTCTGT GTCGGAATCA AACCCGAACG CTAAAATTAA GAGGTGGAAG GCCCGCATAG 3600 ACGAGTTCAA TGCTCGTCTA TTTTACAAGC CCGGTAAAGA GAACCTGGTA GCGGACGCCT 3660 TGTCCAGACA ACAGCTTAAT GTGCTGGAAC AAGAAGAGCC CGAATCTTGT GCAGCAACGA 3720 TTCACAGTGA GGTGTCTCTC ACCCACACAA TCGAGTCAAC GGACAAGCCA TTGAACTGCT 3780 TCCAGAACCA GATAATACTG GAAGAAGCAC GTTTCCCGTC TAAAAATACC TTGATTTTAT 3840 TCGGGAATAA AAGGCGCCAC ACGGTTAACT TCGTCTGCAG GGGATCTTTA TTAGACGAAC 3900 TGGCAGACAT AATCGTCCCA AGGGCCGTAA ACGCCTTCCA TTGCGATTTG CACACGCTCG 3960 CAATGATACA AGATGAGATA GTCCGGAGGT TCCCAGCCAC AAAGTTCAGG CATTGCAGGA 4020 ACCGTGTCGT GGACGTACTC AGAATCGAAG AAAGAAGGGA AATCTTAACT GCTGAACATA 4080 ATAGGGCGCA CAGAGCAGCA CATGAAAATG TAAAGCAGGT ATTGTCGGAA TACTATTTCC 4140 CCAAAATGGC CAAGCTGGCC AACGAGATCG TGCAGAATTG CAGGACTTGT GCAAGGGCAA 4200 AGTATGACAG GCACCCGAAA AAACAAGAAC TCGGTGAAAC GCCAATACCG TCACATACAG 4260 GGGAAATGTT GCACATCGAC ATTTTCTCTA CCGACAAAAA GTTTTTCCTC ACTTGCGTCG 4320 ACAAGTTTTC TAAATTTGCC GTCGTACAAC CAATCGCTTC GAGAACTATT GAAGACCTGA 4380 AGCCAGCGCT GCTTCAGCTC ATGAACTTTT TCCCAAGGGC AAAGACCATT TATTGCGACA 4440 ACGAACCGTT AAGCGTTACA AACGCACCAC CACTGCACAG CACCTCCAAC GGGCAGGTGG 4500 AGCGTTTCCA TAGCACGTTT CTGGAGCTAG CCAGATGCAC AAAGATAGAC AAGGGCTTGA 4560 GTGATACGGT CGAAATAATA ATGTTGGCCA CTACCCAGTA TAATAAGTCA ATTCACTCGG 4620 TCATCGACAG GAGACCGGCT GACATCGTCC TAACACACCC CGAGGAGCCA CAGCTAGAGA 4680 TCCACAATAG GATCCAGAAG GCTCAGACCG CGCTGAGGGC CAGAGAAAAC GCCTCGCGAC 4740 AGAATAGGAC ATTCGATGTT GGCGAGAAAG TGTTGGTAAA ATCCAACCGA AGACTCGGCA 4800 ACAAGCTCAC GCCGTTGTGC GAAGAAAAAG CTGTAGAAGC GGACATGGGG ACCACGGTCC 4860 TTATTGAGGG GAGGGTGGTC CACAAGGACA ATTTAAAATG ACGCGCTCAG CAGCGAGGTC 4920 AGTGTTTCGC TTAATTTTAA ATTTTTTCTA GCCACTTGGC GTAATTTTTA ATAGATCTAA 4980 GCGTAGCGCC ATCCGCGCAC ATTATATTCT TAAGCATTTT TATTATTATT GGTGTTGGGT 5040 TCCCCTTGTC GACAAAATAA AAAAATCAAC CATTTAAACT TTCACCCACA GGACCAATCT 5100 TATCCTTCTC GTGTCTCTAT CATTGGCATC GGCTCACATC ACCGATTATT CGCGCGCCAA 5160 ATACATTCCC ATAGTCGATG GCCAGATCTT GGTGTGGGAG AATTTCGCCT ACGTGAGGCA 5220 CTCCGCGAAT CTTTCGGAGT ACGCACGGGT AGTAGAGGAG ACGGTCGGTC TGCTTAGCCA 5280 CTTCCCGCAG TCACACATGA GAAATTTGCT GAATGTAGAT TCAGCACACC TCCGGGACTT 5340 GCTGGATGCG CTGGGCGCGC ACCATCGAGA AGCTTGGACT TCCTGGGCTC CATACTAAAG 5400 GTTGTAGCCG GTACACCTGA TGCCAGCGAT TTGCAAAACA GTAGGGTCAT AGAGGCGCAA 5460 TTGATAGACG CGAACAATAG GCAGATAGAA ATCAATACAA AAATCCAGAG TCAAATTAAT 5520 AAATTAACTG CCACCGTCAA CTTAATCCTC AAAACAGCAA AAGCGTCGCA AATTGACTCC 5580 GGTCATTTGT ATGAAACGTT GCTCGCGAGA AATAGAATGC TCATGATGGA GCTTCAAAAT 5640 TTAATGTTGG CCGTGACACT TGCAAAAATG AACATTGTAA GCCCAAATAT TCTTGATCAC 5700 GCAGATTTGA GTTCAGTTTG GCTAGAGGAG CCCACCAACA CCCCCATAGG GGACCTCATG 5760 TCCGTATCGT CCGTAAAGGT TCTGCAGTCC AATAACGCAT AACACTTTAT CATCAAATTT 5820 CCGATAATTA AGTTCGCCTG TAAAAAATTA CTATTTTCCC CGTCAGTCAC GAAGGAGCCA 5880 TGCTACGTTT AGATGATGGC ATTATCGCAG AATGCGACAA CGAGATCCAC ACCGTCAAAG 5940 CTTGCACCGC ATCAACTAGC GCCACCTTCT GCCAGCTATC ATCAGGCAGC TCTTGTGCGA 6000 AAGAACTCCA TGCAGGCGGC GAGGCACACT GCGAAATACA ACCCAGCAGC CTGGACCCCA 6060 TCAGCTATGT AGATGAGGGG ATCGCATCAT CAACGACAGG GCCGCCAAGG TACAAGTGGA 6120 CAACGGCTCT GAAAACTGGG TGCATAGTAC GCATCTTGTT ACCTTCGCCC GACGTGCCGT 6180 CATTAACGGA ACCCTGTACG TCAACCAGAA CGGTATCCAG AACAGAGTAC CAGGGGTACC 6240 AAAATTCCCT CTGCTGAACA TCACCTCCCA CCAAAGCGTG CTCAGTCTCC CGTACCTTCG 6300 TCGTCTAAAT GAAATGACCC GAAAGACCCC AGCGAGTGGC ACTGACCGTA GCAGCATCAT 6360 GTTGCCTCCT GTTATGTGTT GCCATTGTGG GCTGGCGCGC ATGGAAGGCC AAGAGGTCCG 6420 CAAGACAACT GAACGTGGCG ATTGCGGAGT TAAGGTCGGC CTCCGTCTTG AGGGGGGGGG 6480 GGGGAGGTAG TTAACCAAAG CTAACGTCGC TCCCACGGGC GTATGAAATA CGACAACATC 6540 GGCATACTGC AACGTAAGCA TTATGCCGGA TGTCGATACG TAGCCGGCAG ACACTGCAGA 6600 TTTGCGTGGC CGCTACGAAT ATGAGATCGC AACAATGTGT TTCGTTTTCA TGCTAGCTCC 6660 AAAGTTTGAA TTCTTGTATT TTGTATTCAG TTCTGATTTT GTCTCCGACG CTACAGCTCC 6720 GCTTAGCTCG GTATAATAAA ACATTAAAAG TAACTCCTAA TCTTTTATTG TCACGGCTCC 6780 GCCTGACACT TGGTTAGTAA AAGCCGCGCG CGTTAACCGC CGGCCAACCC TTTTTCTAAT 6840 T 6841 // ID ACCORD standard; DNA; INV; 7404 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063447; accord. XX FT source nnnnnnnn:1..7404 FT SO_feature five_prime_LTR ; SO:0000425:1..557 FT SO_feature three_prime_LTR ; SO:0000426:6482..6910 FT SO_feature CDS ; SO:0000316:1552..2604 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 7404 BP; 2750 A; 1738 C; 1139 G; 1777 T; 0 other; AGTTACCATG CCCAGCATTA ACCCCCCTCA ACAACCACCT CCGCCTATGA AGCCCGCCCG 60 GTAGGCGACA TCAGCAAAGT GCCAACGCTG TATATATATA TATTGATCAC GAGCTACCAT 120 GCCAGCATAG CCTCGTCCCC CGCTACCTGA AACTCTGTTG CACCCGATGA TGAAATCGGC 180 ATGAACACAC ACACACACAC ATATTCACAC ATACTGCGTG GTGGCGACGC TCTCGACGTT 240 GAAATTAACA TGAACCTACA CACACACATA CACACTGCGT AATCGCGACG TCCTCGTCTA 300 GACTCGCTAT CTAGAACTAA CGGGATCAAA AGCACTGCTG CTTGCCCGTG CGTATACATT 360 AAGAATAAAG CTTTCATCAT TCTTGATCTT GACACCAAAC CGAGCAGTTG ATTTATTTAA 420 AGTGGCAAAT ATATATAACC TACATATATC ATAAGTACAC AATAAAGTCA TTATTGACTC 480 CCATCACATC AGCCTGGGCA GCAACTAACT AAGAGTAGAG AAAGGAGGAC CCCCGATCCA 540 ACGGAGGCAC CCGTAACTGG CGCAGCCGGA CTAGAACTAA AAGGATTAAC AACGCCTTTC 600 ACGTTCTTCG TTCCACTTCT GTAAACGGGC AAGTGAGTGA ACAAGTGGCA GAAAAAAAAA 660 AACACAAGTT TGAGGAGCCA AGTGTAAATA TGTGGTTCTC AAAAAAAAGA GAAGAAAAGG 720 AAAAAACTTC ACACAAACAA GTTCAACAGT GACATACAGT ATAATTACAA ACTATTGTGA 780 CCGAATATGT GCACATCAAT CGCTGAAAGG GCACCGGGAT CACACATAAA AAAAATGTCT 840 TTACATTCAA AGTGACCGAA TATGTGCACA TCAATCGCTG AAAGGGCACC GGGATCACTT 900 ATGACATACA TAAAAAGTTA ACATCATTCA GTGCTATGGC CAAATATGTG TACATCAATC 960 GCTGATGAGA CACCGGGACC ATACATAAAT AAGCAAAACA AATAAATCTA GTTCAATGTT 1020 GTGGCCGAAT ATGTGTACAT CAATCGCTGA AAAGGCACCG GGACCACAAC ATTGAAAGAA 1080 ATTAAACAAT AAATCAGTAA TAAAATCACT ATGTGTGTCA GGTTGAACCT TTCCTTGAAA 1140 GCCATATACT TGTTCGCGAA TGCTCCCAAT TGAGAAACAT TTGTCAAATT ACAAATACAC 1200 TCAACTTACT ACTTCGGATT GTGTTACTGC CATGTGTTCA TTTACTTATA CAAACTGAAT 1260 ATAAATTTTT TATACTTACT AATGTACATA ATATATTACA GTCAAACATT GGAGCCAACA 1320 CTTACTTAGA AAGAAACAAA CGGATTCATA TTATAAATAC ACACTACATT TTAAGATTAT 1380 CTATATATAA GATTACAATT ACTTAATTAC ACCTTATTAT TTAAGCTAAA ACATTACACT 1440 AAACATATAC ACAACATATA GGTACACGTA TACAAAAGTA TATAAGCACA ATAACACATA 1500 TACCATATTT TTAAGTACAA CACACAGACT ACACAAATCA AAAACACCTA CATGCCAAAG 1560 AGAATTCCGT TTAATACCCA AAGTAAAAGA AGACTTGGAC TTAGAAGTGG AGGCAGCCTT 1620 CCTCAAATTA GGGAACAAGT AGAACAATCC ACCAACAGCA ACAACATGGA CTCGGGAAAC 1680 GCATCAGCCC CTTTAAGTCC GACTACTTCC GCTACTTCCA CAGTTATTAA CACCAACCTT 1740 AACCCAACCG ACATTTTGGC TTTTATCGAA CAACTCCCCA CTTTTGAAGG TCATCCTAGC 1800 AATCTCGACA AGTTCATAAC TAGCGCTGAG GAACTGCTGT TCCTCATTAG ATCAGTAGAC 1860 AAGACCCCTT ATGGACAACT ACTTCTAAGG GCCGTCCGCA ACAAAATCGT AGGAAAAGCA 1920 GACGATGCCT TGACCCTTTG CGACACTCGA TTGAACTGGG ACGACATCAA GACTAATCTC 1980 AAACGACTCT ACACCAGCAA GAGAACCGAA GCGATGATAC TTAGGGAAAT ACAGACCCTC 2040 CCAAATGACC TTACAATGGG AAAACCTTTT TACAGTATAA TTAAAATGAG AAACGAACTT 2100 ATAACAATAG CTAAAGACAT GGATACCACG GGTAATGCAC TCGCAACTAA ACGTACCCTC 2160 TACGATGATA TATGTCTCAA CGCATTTATC ATCGGATTAA AGGATCCACT AAGAACAATC 2220 ATAAGAATTA GGAACCCGGA CACTATCGAA AAAGCTTACG AATACGGACA AATGGAACAA 2280 AGTTTTTTCT TCCAAAACTC AAACAGACAA GTAGAGGGCC GTCGCCGTGA CAATCCAACG 2340 GACAGAGGAC GCCCATACCC AAAAAATAAC CAACAAGAAC CGAAACTATC AAATAATAAT 2400 GGTCGCAACA TTTCGCACCA ACAAACCAGT CATCAGCAAG GGGGACAATC GATGAGAAAT 2460 CAAAACACAT CCCACAACTA CGAAAAACCA ACTTCAACAC ACCAACAATA CAGAATCAAT 2520 ACTAACACAA ATGTAAATTT GTGCAACATA AATGACGACA CAAATTTTCC GCAAAGAGCC 2580 TCGGAAGACC AGTCGGATTC ATAACTAAAA AAGCCCACGG GTCATCATTA CCTTATGTAA 2640 TGCTATCCCC AACATTTGTC CCTTCACAAC CACTTAAATT CCTAATAGAC ACTGGGTCCA 2700 CTTATTCATT TATAAACCCG GCCCTCATTA CAGAAGACAA TATAAAAAAA CTTAATTCAA 2760 GCATAGCCAT TCATACAGTA CTTAATACAT TCCAAATTAC GGAATTCACA GACACTATTA 2820 AATTTAAACA ATTCAAAGAA TTGTCAAATC AGAAGTTTCT TTTGTTCAAT TTTCATCCAC 2880 ACTTCAATGG CCTCTTAGGC ATGGATCTTC TATCCACTCT GAATGCAAAA ATAAACATCG 2940 CGGATTCCAT TTTGGAAACA CCTGAAACCG CAGTTCCAAT TTTGACACGA CCAAACCCTG 3000 TAGACATTTT GCATACTGTA CCATCAAATA GCAAGGTAAG GCTTCCACTA CCAGTAAACT 3060 GCATGCAAGG AGACTTTATA TACGGGACCA CAGATTTTGA CAACCAACTA TCAATAACAG 3120 GTGGACTGTA TACTGCAACC GCAGGTATAG CCTATTTCGA AGTCTGTAAC AGTTCAGACT 3180 ACAACCAAAC ACTTTATTTA GAAGAACCAT TAATAGCAGA AGAACTCTCT TTACAACACT 3240 ATGGACTTTT ACACTGCATG TCTACAATGG CGGATACTAC TCAGACGGAC ACCCAACAAG 3300 AACTTAATAT TAAGACAGAA CATTTAAACC AAGAAGAAAA ATTTAATCTA ATCAATCTAT 3360 GCAAAACCTT TAGGAAACTT TTCCATTATG AAGAGAGCCA TTTGACCTTT TCTAATGTCG 3420 TTAAACACTC CATACCAACA GTAGACAATG TTCCGATATT CACTAAATCA TACCGTTACC 3480 CATATGTACA CAAAGAAGAA GTTCGTAAAC AAATATCAGA AATGCTTCGA CAAAATATTA 3540 TTAAAAACAG TCACTCTCCT TGGAGTGCAC CCGTAGGGAT CGTACCCAAA AAAGCCGACA 3600 CTACAGGCAA AGAAAAATGG CGGTTGGTAA TAGATTTTCG AAAATTAAAC GAAAAAACCA 3660 TCTCCGACCG TTATCCGATA CCTAACATAG CAGACATACT GGACCGAATA GGTGGAACAA 3720 AATACTTCAC AACAATAGAC TTGGCAAGTG GATTCCATCA AATTGAGATG AATCCACAAG 3780 ACGCGAACAA AATAGCTTTC ACAGTCGAAA ATGGGCATTA CGAATTCACA AGAATGCCCT 3840 TCGGTCTGAA GAACGCACCA GCCACTTTTC AACGTGTGAT GGACAACGTT CTAGGAGACC 3900 TTATAGGTAA TGTTTGTCTG GTTTACCTAG ACGACATTAT TATATTCTCA CCCTCACTGC 3960 AGAAACACAT TAGCGACATT AAACTTGTTT TCTCTAAGCT ACTGAATGCG AATCTAAAAA 4020 TCCAACCTGC TAAGTGTAAC TTCTTAAGGA AAGAAATTGA CTTTTTGGGA CACATTGTTA 4080 CTCAGGAGGG AGTCAAGCCC AACCCAAACA AAATACAAGC TATTAAAGAC TTTCCCTGCC 4140 CCAAAACCAT CAGGGAAATC AAGTCATTTT TGGGATTACT TGGGTACTAT AGGAAGTTTA 4200 TCAAAGATTT CGCAAAAATA ACAAAACCTA TAACAAGACA ATTAAAAGGA AAAAAATCAA 4260 TAGTAATAGA CGACGAATTT AGGAAAGCAG TCGAAATTTC AAAGAATTTA TTATGCAACG 4320 ACCCAATACT TATATTCCCC GACTTTACAA AACCATTCAC GTTGACGACC GACGCAAGCA 4380 ACTACGCCAT AGGGTCAGTG CTATCACAAG GCCCCGATAC CAACGACAGA CCCATATCCT 4440 TCGCTAGTAG AACGCTATCA GACACAGAGG TTCGATATTC CACCATAGAA AAAGAAATGC 4500 TAGCTATTAT TTGGTCAGTA AGTCATTTTA GGCCATACCT TTTTGGCCAG AAATTTAAAA 4560 TAGTCACAGA TCACAAACCA TTAGTCTGGT TAGAAAGCTT TAATGGTCAA AACCCTAAAC 4620 TACTTCGGTG GAAAACTACC CTAGCCGCAT ACGATTACGA AGTAGTGTAT AAAAAAGGCA 4680 AACAAAACGT GGTCGCTGAT GCGCTTAGTC GAATAGAACC AAATTTAAAC ATAAACGAAG 4740 ACCCACAAAA AATCCCAGTT GTCCAACAAC CATTAAATCA TTTCAATACA CAAATTGTGT 4800 TCCACATTGG GGAAAAGTCT TCAGTACAAA TCACAACACC ATTTACGTAC AAAACAAGAC 4860 AAGTCATTTC AGAACCTACA TATACCTTTG ACACTATTTC TAACATCTTG CAATCAATTC 4920 TTAAACCAAA CAAAATGACT GCAATATTTG CCCCAGATCA AATCTTTTTG CTCATAGAGG 4980 AAGCGTATAA CAACTACTTC TCGGTGAACG ACTCATATAA AATCTCCCGT TGCAAACTGT 5040 TCTTACCCGA AATAACCGAA ACGGAAGACC AAAAAACAAG AATACTTACA TACCACTTAA 5100 AGAACAACCA CAGAGGTATC GACGAAACCT TTCAACACCT AAAAAGAGAT ATTTATTTCC 5160 CCCGCATGAA AGATATTATT ACGCAAGTAA TAAAAGACTG TGACATTTGC CTTACACTTA 5220 AATACGACAG GCAACCCCAT AAACCAGCTA TGCAATCACC ACCTGCCCCA CCAGGACCTT 5280 TAGAGGTGAT CCACATAGAT ATTTACTTCG TCAATGGTAC TTATAATCTA ACTGTCATTG 5340 ACAAATTTTC AAAGTTCGCA CAAGCATACC CGCTTGATAA CAGAAACTCC ATAAAAATCA 5400 TTGCCGCCTT GTCTCAGTTC ATGAGCAACT TTGGCATACC CAAAAAACTT GTTTTCGATC 5460 AGGGAACAGA ATTTTCCGGA AACCTGTTTA ACGATTTCCT TGCTCAATAC GATATACTCC 5520 AACATACTAC TTCATTCCAA CAATCGACTG GTAATTCACC CGTCGAAAGA CTTCACTCTA 5580 CGTTAACAGA ATTGTACAGA ATAGTTATGA ACCGTCGAAA AGAATTACAC CTACAGTGCG 5640 AACATACCGA AATACTTAGC GAAGTATTAA CAACCTACAA TAACGCGATA CATTCTAGCA 5700 CAAATCACAC CCCTTTCGAA CTCTTTCACG GCAGGACACA CATATTTGGA AAAACCATAA 5760 CTTATGACAA TCATCATGAC TACCTCTCAA AACTAAATCA ATTCAGACAA ACACTATACC 5820 CCCAAATACA ACAACACCTA CAAAACACCA CCGATAAGAG ATTGGCTAAG TTAAATAAGG 5880 ACAGGGAACC GCCAATTCTA GTTGAGGAAA ATAATACTAT ATATAGGAAG GAAAACAGAA 5940 GAAACAAAAT AACACCTAGA TTTTCTCTAC ACAAGGTAGA AAGAGACAGA GGGATAACAC 6000 TAATAACTAC CAGAGGTCAG AAATTACATA AACAAAAAAT TAGGAAACAC ATTAAAAGAA 6060 CTGTGAACAC CCAGTCCAAA TCAAACACTA ACACTTAATG AATTCAACTA AACATGAAAA 6120 TGTAACAATT AAAAGACAAA TTTGATAAAC AGACAGGGAC ATTATTATCA GGATCATATA 6180 TAAAATAGAA CACGTAAATA ACAAACAAGT TTCGATATAG AAATTTTTCG AAAATTACCA 6240 GAACCAAATT TTTAAAGAGC AATAACGTGG CCGAAAGCTT ACTTGCTAGT AAGTTAGGCC 6300 TCATTTTCAA ATTTATGTTT AATACACAAC AATTAGAAGA CATACGAATT TTATTTAATA 6360 AGCAAAATAT AACCATAGAA TTGGATGAAC ACATTAACAA CTTTTGTAAC CATCCTGTTA 6420 AACTCTTACT TGCAACACCA AACTATTATT TAAGGTTAAA AATCCATTTT TCAACTAGCA 6480 AATGTATCAT TTCATTCAAA CAATTCCCTT ACCTATACAT TGTTCATACT TCATTGTCTC 6540 CCACGACAAA CTTACCATCG AGATACCACC AATGGAACGA ATAGAAAATA ACAAAACATT 6600 GAAAGTCCTA TCACTCGAAA AACTGCACCT GGAGGCATTG GCAACAACCG GGAAAATGGA 6660 CTACATAACT CAACAATACT TTACCGCTAC GTACCTTGCA TTGAGCATAG TACTAATACT 6720 TGTGATCTGT CTCCGAACCT GTAAAAGGCA GAAAACGATT TTTACCCAGC AACCCACCCA 6780 CACACCTCAC GTTGTCGAGT CGGCTATCCC TTCGTTATGG CCATCTCTCC GCACTAGGGG 6840 GGGAGGAGTT ACCATGCCCA GCATTAACCC CCCTCAACAA CCACCTCCGC CTATGAAGCC 6900 CGCCCGGTAG GCGACATCAG CAAAGTGCCA ACGCTGTATA TATATATATT GATCACGAGC 6960 TACCATGCCA GCATAGCCTC GTCCCCCGCT ACCTGAAACT CTGTTGCACC CGATGATGAA 7020 ATCGGCATGA ACACACACAC ACACACATAT TCACACATAC TGCGTGGTGG CGACGCTCTC 7080 GACGTTGAAA TTAACATGAA CCTACACACA CACATACACA CTGCGTAATC GCGACGTCCT 7140 CGTCTAGACT CGCTATCTAG AACTAACGGG ATCAAAAGCA CTGCTGCTTG CCCGTGCGTA 7200 TACATTAAGA ATAAAGCTTT CATCATTCTT GATCTTGACA CCAAACCGAG CAGTTGATTT 7260 ATTTAAAGTG GCAAATATAT ATAACCTACA TATATCATAA GTACACAATA AAGTCATTAT 7320 TGACTCCCAT CACATCAGCC TGGGCAGCAA CTAACTAAGA GTAGAGAAAG GAGGACCCCC 7380 GATCCAACGG AGGCACCCGT AACT 7404 // ID 1360 standard; DNA; INV; 3409 BP. XX AC AC005453; XX DR FLYBASE; FBgn0005673; 1360. XX SY synonym: Hoppel SY synonym: protop XX FT source AC005453:634..4052 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..30 FT SO_feature terminal_inverted_repeat ; SO:0000481:3379..3408 FT SO_feature CDS ; SO:0000316:970..2718 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0066320; 1360\T" FT /db_xref="SPTREMBL:Q86BW1" FT /protein_id="AAN39288.1" FT /translation="MNASDKECILACDEVAIKKNLTYNVSVDIIDGIEHLLDRSNKIGS FT HICVFVLRGILKKWKFILNYFVAETNIKGDCLKSLIYKNIIIAETIGFKVRGVVYDQGG FT NNRKCTSLLEVTNEKPYFTLNNKKYMFYDIPHLFKSVRNNFLRANFETPDGLVDFDVIR FT EVYELDHGSVTRMTKLTRSHVNPTRFELMRVCLATQTLSHTVAAAIKTCNQNKQLHRNS FT SEVAASTAAFVQKDNDYFDCLNSRVLTDKNPMKCALQVNNGVWNKLKEMQEYLKSVKYH FT GNKIYCVDGLIQTTEAIFGLVEDLFKDHTDHFFFLTSRVNQDPLENIFACVRAKGGNCR FT NPSVNEFNIIIAKLISLHIFKFSQKSNCESDDDVMLPIEFDSIIYQPFVEKKEIQQQEY FT SVSFSKIVQDNERYFDQNIDNFLCNDVPIELTSSRYFVGYIAKGSSCDKCRSVILKETE FT HLTAPSELFIHEKNYSIESDFGKLRAPSDLFFNIYKIHIKAFENIFKNNKKQMCIKKFI FT VEQCIKCTNESSAFPLWFYENNECYAHRTDLLNKLIKVLLFKHCKWTVIADRQKKQAKL FT SILSHE" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC Translation is of the sequence from AF533772; AC005453 is mutant. XX SQ Sequence 3409 BP; 1264 A; 533 C; 575 G; 1037 T; 0 other; CAAAGACACT AGAATAACAA GATGCGTAAC GCCATACGAT TTTTTGGCAC ACGATTTTTT 60 TGCCGTGGCT CTAGAGGTGG CTCCAGGCTC TCTCGAATTT TTGTTAGAGA GCGAGAGAGC 120 GAAGAGCGCT ACAGCGAACA GCTCTTTTCT ACGCATACAG TGATAGCAGA CAACTTTATG 180 TGTGCACACG TATGCTCATG CATTGTAAAT TTGACAAAAT ATGCCCTTCA CCGTAGAAGT 240 TCTTAGACTT TAAATCTATA TTATTTTTGA TCAATTGGCA TCATGCGAAA AATTCTTGTT 300 TGCATTGCCT TAACGTTATT ATTATTTGAA AACAGATTAG AAATAGCCAA ATCCATGTAC 360 ATATCATAAC AAAATAAATT TCAAAAATTA CTTTATATTA GAATATTTGT CATTAGAGTA 420 TTCAGCTTGC GGCGTGTGAA AAATTAATAA GGCAATGATT GTTGAGTGCT TGTGTCCGCA 480 CTTCGTGCCT CAAGATATGA CCAAAACAAA GACACTAGAA TAATTCTAGT GTCTTTGATG 540 TGACTTTTGC AATAAACAGT TTTCATATTC ATATTTATTT TACTAAAGAC AAATTAGAGA 600 GCCAATTTAG AATTAAAATG CTGAGGAAAG AAAACTCAAA CCTAAAAAAA AATAAGTTAG 660 GTTGGAATCT AAGACCGAAA AAAACATATA TTCTGAAATA GATAAAATTA ATTTAACAGG 720 AAATGAAAAA ATATTAACGA AAATGCTTTT TAAGGAAAAA AGGGCACAAA TACAAGATGG 780 AATGATTGCG AAAAATTGTT TGCTCAGAGC ATATATTACA GGTCCACTTT GACATATACG 840 TTCTTAAGAG ACTCGCTGAA ATTAAATTTT CCAAGTCCAT CATCTTTGCA AAAGTGGAAC 900 AGCATAAAAA AGTTACAGCC AGGAGACAAC GAATGTTTGT ACTCAGCCCT TAAAGAATCT 960 ATAAAAGAAA TGAATGCATC TGATAAAGAG TGCATATTGG CTTGTGATGA AGTAGCGATA 1020 AAAAAAAACC TAACTTACAA TGTGTCAGTT GATATAATAG ATGGAATAGA GCATTTACTT 1080 GATCGCTCTA ATAAAATGGG AAGCCATATT TGCGTATTTG TTGTTCGGGG TATTTAAAAA 1140 AAATGGAAAT TTATATTAAA CTATTTTGTG GCCGAAACGA ATATAAAAGG CGATTGCGTA 1200 AAAAGTCTTA TTTATAAAAA TATTATTATT GCTGAAACAA TTGGGTTCAA AGTAAGGGGT 1260 GTTGTGTACG ATCAAGGAGG TAACAATAGA AAATGTACCT CACTACTTGA AGTCACCAAT 1320 GAGAAATCTT ACTTTACCTT AAATAATAAA AAATATTATA TGTTTTATGA CATTCCGCAT 1380 CTTTTCAAGT CTGTAAGGAA TAATTTTCTA AGAGCTAATT TTGAAACCCC TGACGGACTA 1440 GTTGACTTTG ATGTCATTCG AGAGGTTTAC GAATTAGATC ATGGATCAGT AACGAGAATG 1500 ACTAAATTAA CAAGGAGTCA CGTTAATCCA ACCAGATTTG ATCTAATGCG CGTATGTTTG 1560 GCAACTCAAA CACTTAGCCA CACTGTTGCA GCGGCAATTA AGACTTGTAA CCAAAACAAG 1620 CAATTACATC GAAATAGTTC CGAAGTTGCA GCTTCTACGG CTGCATTTGT CCAAAAAGTT 1680 AATGATTACT TTGATTGCCT GAATAGCAGA GTATTAACTG ACAAAAACCC CATGAAATTC 1740 GCGCTCCAAG TTAACAATGG AGTTTGGAAC AAACTAAAGG AAATGCAGGA ATACCTAAAG 1800 AGCGTTAAAT ATCACGGGAA TAAAATATAT TGTGTTGATG GCTTGATCCA AACTACTGAA 1860 GCTATATTTG GACTCCTCGA AGATCTCTTT AAGGATCACA CAGACCACTT TTTCTTTTTA 1920 ACTAGTAGAT TAAACCAGGA TGCCCTTGAA AATATATTTG CGTGCGTTCG AGCAAAAGGT 1980 GATAACTGCA GAAATCCTTC AGTTAACGAA TTTAATATAA TAATTGCCAA GCTCATATCT 2040 CTACACATTT TTAAATTTTC ACAAAAGTCG AATTGCGAAT CAGATGATGA TGTTATGTTG 2100 CCAATAGAGT TCGATTCGAT AATATATCAG CCTTTCGTTG AAAAAAAGAA ATACAGCAAC 2160 AAGAATACTC TGTATCATTT TCCAAAATCG TGCAAGACAA TGAGAGGTAC TTTGACCAAA 2220 ATATAGATAA CTTTGTGCAA TGATGTACCC ATTGAATTAA CTTCCAGTAG ATATTTTGTT 2280 GGATATATTG CTAAGGAATC CAGTTGCGAT AAATGCAGAT CGGTAATTTT AAAAGAAACC 2340 GAGCATTTAA CAGCTCCATC AGAGCTTTTT ATACATGAAA AAAACTACTC AATAGAATCG 2400 GACTTCGGGA AATTAAAGGC ACCATCTGAT TTATTTTTTA ATATTTGCAA AATACACATT 2460 AAAGCTTTCG AAAATATTTT TAAAAATAAT AAAAAACAAA TGTGTATAAA AAAATGTATT 2520 ATTGAGCAAT GCATTAAGTG CACAAATGAA AGTAGTGCCT TCTCATTATG GTTTTATGAA 2580 AATAATGAAT GTTATGCACA TAGAACAGAT CTTTTAAACA AACTTATAAA AGTTTTGCTT 2640 TTTAAGCATT GTAAGTGGAC TGTGATAGCT GATAGACAAA AAAAGCAAGC TAAGCTCAGC 2700 ATTCTATCTC ATAAATAAAA AAAATGAGCG TTATTTAAAT AACTGACGAC TGAAAAATCG 2760 AGCAATATAT AATAAAAAAC AATAGTTAGA AAAATAACTT AATTGGGGAA TATGGAAATG 2820 TTAATGTTAA ACATTAAAAT ATTTCAAGTC GACTTGAAGG TCATATACGT AAATATAAGA 2880 CACCATTACA ATTGTAATGG CCTCCCCGTG GTGTTCCCTG GGTACCGATT ATTACACATA 2940 TTACACATAA ATAATTATGA ACATAAATAT AAATATGTAA ACGGTAGCTA ATTCGAGCGG 3000 CGATTTTAAC AAACGAATTT AAAAAGCTTT AAAATTAAAA AATTTTAAAA TTTAAATTAA 3060 AATTTTAAAA TTATAATAAC AAGGGCGCGA ATTTTTTAAA ATTATTTTAT TTTATCATAT 3120 TGCTACGAAA TTGGCAAAAA AACTACCCTA ATATGTACAA TGTAAATTCA TTTCTTAGAT 3180 CAGAATTGAT TTCGGCCCGA AAATCGTCTT CTAGCACAAC ACGCACACTT ATACGCGTTC 3240 TCGTCTCTTG TTTTTACTCA CACAAGCAAG CAAATTCTAT TTTTAGATTT CTTACGTTCT 3300 CAGCGTGAGC GAGCGGAAAG ACAGCAATTT TGGCCGTCAC CAAAAAAGTG GCTGCATAGT 3360 GCCAAACCAA TGTATGGCCG TTACGCATCT TGTTATTCTA GTGTCTTTG 3409 // ID GYPSY3 standard; DNA; INV; 6973 BP. XX AC AC007477; XX DR FLYBASE; FBgn0063434; gypsy3. XX FT source AC007477:50788..57759 FT SO_feature five_prime_LTR ; SO:0000425:1..397 FT SO_feature three_prime_LTR ; SO:0000426:6576..6972 FT SO_feature CDS ; SO:0000316:1046..2326 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 6973 BP; 2016 A; 1760 C; 1499 G; 1698 T; 0 other; AGTTAACTAA GTTAACCGGA CTGATCGTCC GCTGACCAGC ATTTGGCCGG AAGCTCGAGC 60 ATAGCCGGCA AAGCTCTGCA CTTTGGCAGA GGCCGCTATG AAGCTCTTCT CTTTGTAAGC 120 TTTAAGCAGT TCGTTTTGGA CACTCATAAA ATAAAGAGCC ATCGCTCATT AACCAACAAC 180 TCCGGCATTC TGTCGTTAAT TTTATATTTG ATCATAGAAA TCTGAGTCCC TAGGCCAATT 240 GCATTAAGGA AGACAGTCAT CTTCCCAACG TAATCATCGT AATTCTCCAA CTGGATTTAT 300 CACCCTCTTC CCCTGTAAGA GAGACACCAT CAGCCAACCG GCAAATCATG GGAGACTGCA 360 GCGAGGGACC CCCTGCAGCA CCCCCACATT TAATAATTGG CGCCCCACTA AGGAAACACA 420 CAACAACACA CACAAACACA CAGATTATGT AAGTGAGCGC CTATCTCCAA ACCCCAACTG 480 CATTTAAACA AAGTTTATAA CATTTAAAAT ATTTCTGCGA TAAATAAATA ATCTGTATTT 540 CCATGACAAT CCTAATCTTT GTATGTGTTG TGAATTTAAA TTATATTAAT GTATGAAATT 600 TAAACATGGA ATTTTAAATT TTGATTGACT GTGCGGATTA AATTATTCTT TGGCACTGTC 660 GCAGAGATTT ATTGATTTAC AATTTACATC ATGTGCATTA TTTCGTTCCA CTTCTGCATA 720 TGTCTGCACA CTGTCGTTCC ACTTGAACTT GAATTCTCTC CCCACAGCCG TCTCGGGCCC 780 TCCTATCGCT GCCCCGACAC CCGATACAGC AGCGCATGCT TTGCTGTAGC AATTTTTTGC 840 CTTTTATATA TCTCTTTGTC TGTGCTGCTA CGATTTGCTT GTCATTTATA TATTTGTTTG 900 GCTTAGCGGA TGCGAGCTCC CCGCGGCTCC GCTCAACGAG CCCTCGGCCA ATGTTATTAA 960 ACGCCGACTC AGCCCCCGCG TCCCCTCGCC GAACTAATAC TTCCCTTGGG CAATACGACT 1020 TGAGTATTTA TATACTCAGC TTCTCATGAG CAACTCATTC CGACAATATA GGAATTCTAA 1080 ACCATTCGCT AGCGACTCAG AGTCAGAAGG CGACGATTCT ACAGAAAACT CTGTACATAA 1140 GAACACCGCA GTTAACGCAT TTGTTGCATA TAAAATGTCG CTCGAAACGG AACAAATTAA 1200 AGTCCTCATA AGGGCGTTAC AAGAGCAAGC CCTAGAAGGT CAACGCAGGG AGGCTGACCT 1260 GCGTAATACA GTTCAGGAAC TTGCCAGCCA GCTCGCGGCT ATGCAGGTTG CCCCTGTGCG 1320 GGCAGAAACT CCACAAATTA AGGTTTACCA ACCGGTAGAT ATCACCGGAG AAGTCCGTTG 1380 CGAAGAAACA TTGGACGCTG TAAAACGCCT CCCAGATTTT ATGGGCACAC AGGAGACATA 1440 CGTCTCCTGG CGGCAGGCGG CGAAGGCCGC TTACCACATG TTCAGAAACT ATGAAAATAG 1500 TTCGCGACAC TATCAGGCTG TAGTTATCAT CAGGAGCAAA GTAAAAGGCC CTGCTGATGT 1560 AGTTCTGTCA TCCTTCGGAA CTGTGCTAAA TTTCGATGCA ATTATAAATC GCCTCGATTT 1620 TACGTACAGC GACAAGCGCC CGATACACGT TATCGAACAA GAGTTAGGCA CTCTCAGACA 1680 GGGAAGCATG ACGCTCCTTC AGTATTATGA CGAGGTTGAG AAAAAACTCA CCTTGCTCAC 1740 CAATAAGGCA ACCATGTCTT ATGAAGCCTC GGCAGCAAAG GTGCTGTGTG AAAAGTTCCG 1800 AGATGATGCT TTGCGAGTTT TCGTCTCGGG ACTCAGGCGC AACCTCACGG ACGTGCTGTT 1860 CGCGGCAAAA CCGAAGGACA TGCCCTCAGC ACTCGCCTTG GCACAGGAGG TGGAATTTAA 1920 TCACGAGCGA TACACCTTCG CAACTTCGTT TGCAAGAAGC CAAGAAGATA GGGACCGCAC 1980 GCAATATCCC AAAGTGCAGG AGCGCCAGCA GGCCCCTCCA CAAGCCAACC CGCAGGGAAG 2040 TGCCGGAAAG AACCTAGGCA GCACAGAGCA CAAGTGCACT CTGCTCCACG TAGCGACCGG 2100 ATGGCTCGAG AGAACACACC GGAACCTATG GAAGTTGACC CTTCATTGTC CAGGATGCAA 2160 CCATCTCACG CCCCGGCTTA CCCGAAATCG AAGCCGGCCA CATCTGGCCG CTCGATCCCG 2220 CCTAAAAGGC AAAGGGTCAA CCATGTTGCC CAGGCCTCAG ATTACACGGC TTATGCCACC 2280 GCAGCCTCCA GTGCAGCGGT TAAAGTCGAC GATGATGCCA TCCTAGAGTA TGACTCGGAT 2340 GCCATTAATT TTTTAGGGGT AAGTCCCTGC TACCCGTCAT CAGACGAAGA GTAGCGGGGA 2400 TTGACATGAA ACTGCTAATT GCATGGCCAA AAATTTTATC TGACCTTTTA TGGGGTTGAA 2460 AGGCGTCCGC CCGGTGGAGT CCCCATTCGC AGTCCATTCG ATTCATGGCG TGACTACAAT 2520 CGCAAAAAAA TGTTTTATCT CTATTTTTAA CCTTAAAGCT ACATTTTCCT TACCACCAGA 2580 TTTGACCTCC TTCGACGCGA TCATTGGCCT AGACCTATTA AAACAGGCCG GCGCGTCGCT 2640 TTGCCGGCCA GCTCAAATGG GGCTCCGAAG CAGAGCAAAT TGACTTTCAC ACTGACTTCG 2700 TCAACGGCGA AATTCAAGAG CTGCTGAAAA AGGGCATAAT CCAAAAGTCG AAGTCCCCCT 2760 ACAATAACCC AATATGGGTC GTAGACAAAA AGGGCACCGA CGATGCTGGC AACAAAAAAA 2820 TGCGTTTGGT ACTGGACTTT CGCAAACTGA ACTTGAGGAC TGTACCAGAC AGATACCCCA 2880 TGCCAAATAT CTCAATGATA TTGGGGAATC TCGGCAAGGC CAGGTACTTC ACGACACTCG 2940 ATCTGAAGTC TGGCTATCAT CAAATCACGC TCGCTGAACG CGACCGTGAA AAGACATCAT 3000 TCTCAGTAAA CGGAGGGAAG TATGAGTTCC GAAGATTGTC ATTCGGACTC AGGAATGCTG 3060 CAAGCATCTT CCAGAGGACA ATTGACGATA TTCTGCGAGA GCAGATCGGC AAGTTCTGCT 3120 ACGTTTACGT TGATGACGTC ATCATCTTTT CAGAAGATGA AAATGCGCAT GTCAAGCACG 3180 TAGATTTGGT TCTGAAGAGC CTGTACGATG CTAACATAAG AGTATCTGCA GAAAAGTCAC 3240 GCTCCTTTAA GAAAAGCGTG AGCTTCCTAG GGTTCATCGT CACCAACAAT GGCGCGGCGA 3300 CTGACCCAGA AAAGGTTAAG GCCATAAAGG AATTTCCAGA ACCCAAAAAC GTATTTGAGG 3360 TGAGGTCATT CTTGGGCTTA GCCAGCTATT ATCGATGCTT CATCAAAGAT TTCGCATCAA 3420 TACCAAGGCC CATTTCGGAC ATACTGAAGG GCGAGAACGG AAGTGTTAGC CGACACAGGT 3480 CCAGGGGTAT CAAGGTAGAA TTTTCCGAAG CGCAGCAACG TGCCTTCGAG AAACTTCGCA 3540 ACATTTTGGC GTCTGAGGAC GTCACCCTTA GATACCCTGA TTACAAAAAA GCGTTTGATC 3600 TAACGACAGA CGCTTCGGCC TACGGCATTG GCGCAGTGCT GTCCCAAGAG GGACGCCCCA 3660 TTACAATGAT CTCTAGGACA TTGTGTGACA GAGAGGTTAA CTACGCTACC AATGAAAGGG 3720 AGCTATTAGC CATAGTCTGG GCGCTAGTCA AGTTGCGACA CTTTATACGG TAAAAGAAAT 3780 AAACATCTTT ACCGACCACC AACCTTTGAC GTTTGCGGTA TCGGAATCCA ATCCGAACGC 3840 CAAGATCAAG AGGTGGAAGG CACGCATCGA TGAGTCCGGC GCAAGAATGT TTTACAAGCC 3900 CGGAAAAAAT AATCTCGTTG CAGATGCCCT TTCGAGGCAA CAACTCCACG TTGTTAAAAA 3960 CCAGGAACCT GAGTCGTGCG CGTCCACGGT TCACAGTGAG CTTTCGCTCA CATACACAAT 4020 TGAGTCTACG GACAAACCCG TAAATTGCTT CCAGAACCAA ATAAATTTAG AAGAGGCGCG 4080 CTCCCCTTGG AAACGCACTT TTATACTATT TGGAAATAAG AAGCGTCACT TGATTAACTT 4140 CTCGTGCAAA CAGACTTTGC TAGAGGAACT AGCCAACACC ATTATCCCTA ATGGTGTGAA 4200 TGCCTTCCAC TGTGATCTTC ACACGTTGGC AAAAATCCAG AACGAGGTGG TCCGACAATT 4260 TCCGGCCACA GAAGCTCGGT GAAACACCAA TCCCGGGCCA CGTAGGAGAA ATATTGCACA 4320 TCGATATTTT CTCGACGGAT AGGAAATACT TCCTCACCTG TGTGGACAAA TTTTCGAAAT 4380 TCGCCATGGT GCAGCCGATC CCGTCTAGAA CCATTGAGGA TCTAAAGCCA CCACTGCTTC 4440 AACTTATGAA TGTTTTCCCC AAAGCCAAAG CCATCTACTG CGACATTGAA CCATCACTGA 4500 AATCGCACAC AATAGTGGCC ATGCTGGAAA ACCATTTCGG CGTCAGCATC TCGAATGCAC 4560 CGCCCCTCCA TAGCGTCTCC AACGGACAGG TGGAACGCTT CCACAGCACG TTGGTCGAGC 4620 TCGCTAGGTG CCTAAAAATC GATAAAGGCA TAAATGACAC GGTGGAGTTG ATCTTGCTGG 4680 CCACAGCCAG ATATAACAAG TCCATCCACT CCGTCATTAA CAAAAAACCG GCCGAAGTCA 4740 TGCGGGCAGA TCCGGACGAT CCACAAAGTG ATGTCCAGGA AAAAATCAAA AATGCCCAAA 4800 ACGTGACACG AAATCGAGAA AACGCCTCTC GGCGGAACAG AGTCTTCCAG GTCGGCGATA 4860 AAGTCCTAGT AAAGTCAAAC AGACGACTAG GCGAGGAGAG AACTATCGAG GCAGAATTGG 4920 GGACCACAGT CCCAATTAAA GGGAGGGTGG TCCACAAAGA CAACCTCAGG TGACCCAAAC 4980 AGAGCCCAGT TGCGTTTCCC CCGGATTTTC TTTTCATTCT TTTCATTTAT AGCCACTTGG 5040 CATAAGTTTT TTATTTGTTT TTCATTCGTA GCCAATTGGC ATAAGTTTTC ATTATTTTTC 5100 GCTTCCCATA GCCACTTCGC GTAAGCCTTT CTTATATCCT TCATTACTCA TAGCCACTTG 5160 GCTTAATTTT TTAACGTTTA ATTTACTTCC TCTTATTCGT CTATTGGTGG TGGGCAACTC 5220 CATTCCGAGT ATTTATTAAT CAACCACGCA TATTACAGGT CGTTTCCAAC GCTTCTACTT 5280 TGTCTCCTGG CGGTGGCATC GGCCCACGTC ACAGATTACA CCCACGCAAA TTACATCCCC 5340 GTCATTGATG GCCAAGTCTT AGTAGAATAA GAATATGCCT ACGTCAGACA CTCTGCTAAC 5400 CTCTCTGAGT ATAGGCGAGT GATTGACGAA ACCAACGGTA TGGTTGATAT GTTCCCCGAG 5460 TCCCATATGA AGAAGCTCCT GAGCATTGAC ATCGCTCACC TTCGTGACAT GCTCGACTCG 5520 TTGAGCGTCC ATCACAGAGT GGCTAGGAGC CTAGACTTCT TGGGAACTGC GCTAAAGGTT 5580 GTCGCAGGGA CACCTGACGC AGAAGACTTT GAGAATATAA AGTTCACCGA AGCAAGACTA 5640 ATTGATGCAC ACAAGAGCCA AATCGAAATT AACACCAAAA CACAAGTTCG AATCAACGAA 5700 CTTACCGACA CCATTAATCA ACTTTTAAAG ATTTCAAAAG ATAAGCAAAT CGACACAGGC 5760 CACCTTTACG AAATGCTATT AACACGCAAT AGAATCATTG TCATGGAACA TAGAACCTTT 5820 TGCTTACAAT CACACTCGCA AAAATTAACG TAGTTAGTCC AGTCTTTCTG GACCATACAG 5880 ACTTGGAGAA TGTTTGGGGC GAGGAGCACA CCAACACCCC CATAAGGGAA ATTTTGTCCG 5940 TTGCGTCCGT AAAGGTTTTA CAATCCCTTA ACGTTTTACA TTTTATTATA AAATTCCCCA 6000 AGATTGTTAT GGCCTGTAAT AAATTCACCG TCTTCCCAGT GGCTCACCAT AACACGGTAT 6060 TGAGGTTAGA AGACAACATG GTGGCAGAAT GCAATGGAGA AATCCGTACC GTGAAGAACT 6120 GCTTCAGATC ACTCGGGGCG ACATTTTGCC AGTTATCTTC GGTGAGCTCG TGTGCCCAAG 6180 AACTCCACGC TGGAGGCATG GCACATTGCA ACACGCAGCA GAGCGACTTA CATCCTATCA 6240 CATACGTCGA TGAAGGAAAT ATTATCATCA ATAATAATAA GTGACAGGCA CTCTCCTGTC 6300 CTTAATATCT CGATGAAACA CGAGATCCTG AGCCTCCCAT ACCTTCACCG CTTGAGTGAA 6360 AGGAATTTGG AGCAGATCAG GAAATTCGAA CAAGACGTCG ACGGATACCG ACTAGGTCAG 6420 ATAGCGCTAA TTGCAGGAGC AATTTTCTGC GCTCTCATTT GCATCGGTCT AACCTGCCAA 6480 CGAGCCATTA GGGTCAAGAA GTCCACAGCC CAACTTAAGG AAGTACTCTC CAAAATAGGG 6540 TCGGCCGAGG ACGGCCTCAA TCTTGAGGGG GGAGTAGTTA ACTAAGTTAA CCGGACTGAT 6600 CGTCCGCTGA CCAGCATTTG GCCGGAAGCT CGAGCATAGC CGGCAAAGCT CTGCACTTTG 6660 GCAGAGGCCG CTATGAAGCT CTTCTCTTTG TAAGCTTTAA GCAGTTCGTT TTGGACACTC 6720 ATAAAATAAA GAGCCATCGC TCATTAACCA ACAACTCCGG CATTCTGTCG TTAATTTTAT 6780 ATTTGATCAT AGAAATCTGA GTCCCTAGGC CAATTGCATT AAGGAAGACA GTCATCTTCC 6840 CAACGTAATC ATCGTAATTC TCCAACTGGA TTTATCACCC TCTTCCCCTG TAAGAGAGAC 6900 ACCATCAGCC AACCGGCAAA TCATGGGAGA CTGCAGCGAG GGACCCCCTG CAGCACCCCC 6960 ACATTTAATA ATT 6973 // ID INVADER standard; DNA; INV; 4032 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063430; invader1. XX FT source nnnnnnnn:1..4032 FT SO_feature five_prime_LTR ; SO:0000425:1..423 FT SO_feature three_prime_LTR ; SO:0000426:3609..4032 FT SO_feature CDS ; SO:0000316:456..1321 FT SO_feature CDS ; SO:0000316:1919..2320 FT SO_feature CDS ; SO:0000316:2390..2923 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4032 BP; 1201 A; 774 C; 956 G; 1101 T; 0 other; TGTCGCAGAT CATTTAAATG AAACGAAATT TCGTGTTTCT GCTTGGCACG CGCCATGCAG 60 ACGCCTCTTT TTATTTTTCT GATGCGCGGC AGACAACCGT TAGAGTTTCT GCCGAACGTA 120 GTCTGGTCGC GGGTAGGAGC GGGGGGGGAA GTAGATGTCT GTACGAAAAC GAGAAGCATA 180 CAGAAAAATG CGGTGTGCAT AAGTATTGGT GTATGCGGAC TAGAACAACT GTCATAATTG 240 TGTTGGTATT GCATGTAAAG TCAAGAACTA CGCATAATTC TGATTTTGTG AAGAAGAGAT 300 CAGTCAGTCA GTTTTCGATC GTTACGCATA ACAGACATGC CTCGCTCAAG CGCCAGAAAA 360 CGTCGCGCGA ACCATAACGA GAGTAGTGAA GAGGAAGAAG GGCATCCTGA TGACTCTTCG 420 ACATCAGAAG TGGGATCGCC TATACCTACT TTGAACAATG AAAATAGTTT GCTGACAATT 480 TTGGAGTCGC AAAATCGTCA TTTTTCCGAA CTTTTGAAAC AAGTGCAACA CTCACAAAGA 540 GAGTCAAGAG AATCGTTTGC TGTGGTATTG CCCAGGTTTA ATCCTGAACG TGCGGGATCA 600 GATGCGCAAT CCTGGTGTTC AACGGTGGAC TACATCTTGT CCGAAAACCA ACTGGAAGGC 660 AGCGCTCTTG TGATGACCCT TAGCAAATCA CTGGAAGGAA GCGCGTCTCA TTGGCTTTCA 720 CAGACTTGCT TTCCTGGTAT ATCATGGCCA CAGTTCCGAG AATTGTTTTT GCTACATTTT 780 GCGGGAACAG AAACATTAGC GGCAGCCGTT ATGAATCTGC TAAATGGACG CCCTGCGGAA 840 GGCGAGTGCC TGTCGTTATA CGGAAGCCGA ATGGTTACTT CGCTTATGGC CAAATGGAAG 900 ACGCTGAGCG TGGAGCAAAT CGCAGTCTCC ATTACACTGG CGCATGCAGC TGGAATTGAC 960 AGGCGTTTGC GGCGTTTAGT TTTCACTTCC GACATCAACA CTCGGAACGA GCTGAACCAA 1020 GAACTAAAGG CCTATTCATT CGACGCTAGA CCGAGCCACC TTCCAAACGA TGGTACATCA 1080 GAGCCGCCAG GAAAACGAGC TAAGCCCTAC TTCAAGTGTC ACAACTGCGG AAAGCCTGGT 1140 CATAAAAAGG CGGATTGTCG ATCAAAAACA GTGGCAGTGC ACGTTCAGCA ACCCGACAGG 1200 CAACCATCAG GGAAAGACCG GTCTACTTTG ACTTGTTTCA ACTGCGGAAA AGCTGGACAC 1260 ATCTCCACCG TATGTCCTGA TCGTCAGGAG CGTTCATCGT CTAGCAAAAA TAACATTGTT 1320 TAAGATGTCA ACATCTGTAC CGTTTTTGAG CCCGTACGGA CGTTTTCGCA ATCTGGTGAG 1380 TCGTTCCAAT TTTGTTTTGA TTCTGGAGCC GAGTGTTCAC TCATCAAGGA GGCTACTTCG 1440 CAGAGACTAT CTGGATCCAG AATGGGCAAC GTAGTTATTT TAAGGGGTCT TGGTAGCAAT 1500 ACTATTTGTA GCACTTTGCA GATATTGGCT AATGTTTTGA TAAGTGAACG CTCATATGAA 1560 ATATTGTTTC ATGTAGTGTT AGACAATTAT ATTAAGTATG ACGCTCTGAT TGGTCGAGAT 1620 ATTTTGAGCC AGGGAGTCGG TGTGACAATA ACATCAAAAT CTCTTACTAT GTTTAACGAA 1680 AAATCTATAT TGTCTGTTGA AGTTTCCGAA AAGAATCCCG ATTTGGCTCA TGTTGACTTT 1740 CTGTCTCGGA ATCCGGTTCC AAAAATTGTC AAGAATTCCA TAGGCGTAGT CGAAAGGCAT 1800 ATCAATCTGA CAGAAATATC CGACAACTGG TTATTGGCTG AACAGCAAAG AGACGAAGAA 1860 ACTTCTTCCA TTATCTCTAA ATTGCGTAAT AATGAATTGT CCGAGGATTT GGCCAAGACT 1920 TATGAACTAC GGTCTGGAAC TATGTACCGA AAAATTCAAA GAAATGGAAA GACTCGATGT 1980 CTGCCCATAA TCCCAAAGCA GTTTAGGTGG TCCGCTGTCA ACAACGTACA CGAGTCAGTT 2040 ATGCATCTCG GGTGGGAGAA AACTCTTGAG AAAATGTACG AGTTCTATTG GTTCGATAAA 2100 ATGTCTAAGT ATGTTCGCCA ATTTGTGGAC AACTGTATCA CTTGTAGATT ATCAAAACCT 2160 CCGTCTGGCA AAATACAATC CGAACTCCAT CCCATTCCTA AAATTGACAT CCCGTGGCAT 2220 ACGGTTCATG TGGATATTAC AGGTAAACTT AGCGGAAAAA GCGACCAGAA GGAGTATATA 2280 ATTGTGATGA TAGACGCATT TACCAAATTT GTCTATCTCT CTCATACTAC GAAGCTAGAC 2340 ACCGATAGTT GCATTAAGGC AGTGAGGTCT GTTATATCAT TATTCGGTGT CCCCTCCCGA 2400 CTTATTGCTG ATCAAGGTCG TAGTTTTGCT AGTTCAGCTT TCCGTGATTT CTGCTCAGCA 2460 CAGAAAATGG ATTTACATTT AATTGCCGCA GGTGCTAGTC GTGACAATGG CCAAGTGGAG 2520 CGAGTCATGA GTACTCTAAA GTCCATGCTG ACTGCGGTTG AAACTGGCCC AGGATCTTGG 2580 CAGGATTCTT TATATGAGAT TCAGCTGGCC CTTAATAGCA CGCCCAATAG AGTTACTAAA 2640 GTCAGTCCAT TAGAGATACT TATTGGCAGA GAAGCAAGGC CATTTGGGCT TACACCTGTT 2700 GTAAAAGAAC AAAATATTGT CGATGTTGGC TTAGTTAGAG AAATAGCAAA ACGCAACATG 2760 GAAAAGAATG CCAAAATTGA TAAAGCTAGA TTCGATAAAA ACAAAGCCAC TATTGTAAGA 2820 CACAAGCTAG GAGACCATGT TTTACTTAAA AATGAAGAGC GTCACCAGAC CAAATTAGAT 2880 CCTAAGTTTA AAGGTCCATT TGTAGTGGCC GAAGTATTAC CAGGAGATCG ATATAAACTC 2940 AAAGCATTAA ATAATAACCG AACTTATAAA TATTCTCATG AATCTTTGCG ATGTATGCCT 3000 GAGAAGAGGA TTACCACAGT ATTTGAAGAT GAAGGCACAG AAGATGCCAG TGATCAAGCC 3060 AGTGAGGGCT GTGATCTTGA GAATGACTAA AAATTTGTAA ACCTCCGGAC CTATTCCAAA 3120 TGCGACTTTC TCTACCTCAT GGGTGCCCGA GTTAAGTGGT TCGCCATTTT GTTGCAGTTA 3180 AGCTTAAATT TGGTAACGGT TGGTGGTGTG AGTTAACTGG CCGGCTTATA GACTGCGTCA 3240 TCTGGGTGCC ACGTATGTTG GCCGGATTGT CGAGTTGTAA GCTGTGAGCT AGGCTAACAA 3300 AAGATGGACT GGCCGGTTTG AAAAACTGTG CGCTAGTAGA AAAATGTATT GAAAATCAAG 3360 AACGTTAACG ATTTATGTTT GAAGAAGACA AAATTATGAT TGTTTGATTG AAAGCTGATG 3420 GTGGTCCGGT TCTCATGGCT TAAATGTAAT AAATGAAATG TTTTAAATAT TTGTATTGGT 3480 TTGATTTAAA GTAATAAAGA ATGAAGAATA ACTTTATATT TTGAAAAAAA AAATAAGCCT 3540 TTGATAGATT CTGTTTAATT ATTGAGAGAC TGTTAGCACA CGAGGACGTG TGATAGGTCA 3600 GGAAGGCCGT GTCGCAGATC ATTTAAATGA AACGAAATTT CGTGTTTCTG CTTGGCACGC 3660 GCCATGCAGA CGCCTCTTTT TATTTTTCTG ATGCGCGGCA GACAACCGTT AGAGTTTCTG 3720 CCGAACGTAG TCTGGTCGCG GGTAGGAGCG GGGGGGGAAG TAGATGTCTG TACGAAAACG 3780 AGAAGCATAC AGAAAAATGC GGTGTGCATA AGTATTGGTG TATGCGGACT AGAACAACTG 3840 TCATAATTGT GTTGGTATTG CATGTAAAGT CAAGAACTAC GCATAATTCT GATTTTGTGA 3900 AGAAGAGATC AGTCAGTCAG TTTTCGATCG TTACGCATAA CAGACATGCC TCGCTCAAGC 3960 GCCAGAAAAC GTCGCGCGAA CCATAACGAG AGTAGTGAAG AGGAAGAAGG GCATCCTGAT 4020 GACTCTTCGA CA 4032 // ID INVADER2 standard; DNA; INV; 5124 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063429; invader2. XX FT source nnnnnnnn:1..5124 FT SO_feature five_prime_LTR ; SO:0000425:1..265 FT SO_feature three_prime_LTR ; SO:0000426:4859..5124 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 5124 BP; 1977 A; 662 C; 1178 G; 1307 T; 0 other; TGTAGTAGGC TAGAGGTACA CATACATAAG TTCAGTTCAT AATAGCAACA TAGTATATAA 60 ACATTAAGTA CTTGTTCGCT TTCGGGTGGC ATTCTCTCTG TTTCGCAACA TCGGCCCTGT 120 AGCAACAAGC ATAGAGGCAG ATCGCAAACG GAGAGGGAGT CAGTCGAATG CTAAAGCTGA 180 ACACGGAAAG TAACAAATAA ATCAAAGAGT CCAACTCCTG CCTTTGTGTG TTTTAGTTAA 240 TTAAGAGGAT ATATAATACC CCTACAATAT CAGAAGTGGG CCGATCACGA GCCAAAGAGA 300 AAAAAAAAAT GAATATAAGC AATGCATCGA CGGCGCAATT AAAAGGATGG TTGAGTGAAG 360 TAAAAGAACC AACCACGGGA ACAAAAAGAG AATTGATTTT GCGTTTGAAT AATTTAACAA 420 ACGAAAAGCG AAATCAATTG AAATTATTAT TAAAAAGCGA GGAAGAAGAT TTCGACGGAA 480 GCAGCAATAA TAATGTTCAA AAAGGAAAAA GAAATAACGG AAAAGACGGA GAGCATAAAG 540 ACGACGAAAG TGACAGCGAA GCTGGAGATA GCAACGAAGA CGGCGATGAG AAGAGCACTG 600 TGCGCAACGA CAAAAGTAAC GGCGAACATG ACGACAAAAG CGGGGTACGC AGTTACAAGA 660 ACAACAGTGC AGACGGCGCC CGCAACAACA ACAGCGGCGA CGGGAATAGC GACGAAAACG 720 AAGTTTGCGA CAGCGAAGAC GGCGACGAAG ATGGCGGCGG TGAGAAAAGC AACAAAGAAT 780 GTGGCAAGAA TAAACGAAAT ACAGTGGGAC AAGCTGTGCA AAATATGGAT TTATTATTGC 840 GTATGGCAAC CGAGGCAGTA AATGAATTTT CGGGAGATAC ATGTGCGCGC AAATGGATCT 900 TACAGGTGAA AAACATAGCC AGCGTTTATG GAATAAAGGA ACCGTATATA AAGATGTTGA 960 TTGTCAGTAA GATAAAAGGA AAAGCATGCA TGTGGTTGCA TGCAGATCCA GAACGTGTAT 1020 TGCTACCAAC GGAGCAGTTG GCGGCTGAGC TAATATCAAT GTTTGGTGAG AGGAAGTCGA 1080 AGTTAGAAAC AAGGCGAAAA TTTGAGGAAC GAAAGTGGAC AGCGAGCGAA AGCTTTGTTG 1140 CTTATGCAGA CGATAAGGTG ATGCTTGCAC ATGGAATAAA CATGGATAAG GAAGAATTGG 1200 TGGCGCTCCT CATTGAAGGT ATTCCGAATC AAATGCTACG TAACCAAGCC CGTATCCAAT 1260 GCTTTGAAGA TATCCAGCAC ATAAAGAAGG CATTTGCGGA AGTGAAACTA CCGAAAGTGG 1320 ATGAAGCAGA CAAGAAGGTG GCATCGGTGA ACAATAATAG CAGCACACTC TTGCGTTGTT 1380 TCAATTGTAA TTCGAAGGGA CACTGGGCCA AAGAGTGCAG GAAGCCAAAG CGCGAAAAAG 1440 GGTCATGCTA TGCCTGTGGT GAGATGGGGC ACTTTGCAGC AAAGTGCCTG AAAAACAAGA 1500 ATGGGGATGA AAACAATTAC GTAAGATATT TTGAAATTAA TTTTATGAAC AATGCTAAAT 1560 CAAAATTTAT TACAGCATGC CTCATAGACA CGGGAAGCCC CATTTCGTTC ATAAAAATTA 1620 GTAAAGTACC CAAAGATGTT ATTGGAGTAC CAGTTTTAAG TTCATTTTAC GGATTAAACA 1680 AAAGTCCCTT GAAAACATAT GGAAAAATAT TGTGTTACAT TATGAAAAAT TTGAATAAAA 1740 TACACTTTAA TTTAATAATG GTGGCAAACG AGTCTATGAA TCATGATGTA GTATTGGGAA 1800 GAGATTTTAT GGAAGCATGT AATTTAATCT TGAATCTGGA CACCTTGAAA ATGATTACAG 1860 TGGATAAAAT TGAAAGTCGA TGTAACAAGA AAAGCGATAA AATGTTAAAG GAAGAAAATA 1920 TGTACAACAA TAATAAAACA GTTAGCGATA GTGTTAGTCA AGGTAGTAGT AAAAGTCAAA 1980 TGAGTAGTGA CACGGTTAAA AATAAATTAG AGAAAGCTGT CAATGCTGAA ATAAGCAAAG 2040 TGGAGAGGGA AATGTTTGAA ATTAATGTAA TAGATGACTC AATTGATTAC AAAATAGGTG 2100 ATCAGGTGGA TCATAGTATA AAATGTCAAT TTATTGAGTT TGTAGAAAAC TGTTATGTTA 2160 ATAAAGAAAG ACCGAATGAA CCGGAGATTC GGTGTGAAAT ACAATTAAGA TTAAATGACC 2220 TGAAACCGTT TAGTTGTTCT CCTAGAAGAT TAGCATATAC TGAAAAAGAA AAGCTGCAGA 2280 TTATCTTAGA TGAATATCTA AAAAACGGAA TTATTAAACC GAGTGATTCA GAGTATGCAT 2340 CTCCGATGGT GTTAGTGAAA AAGAAAACAG GAGACCTAAG ATTATGTGTT GATTACAGAA 2400 AACTTAACAA AACAATGGTC AAAGATAATT ATCCTTTACC ATTGATTGAT GATCTGTTGG 2460 ATAGATTAGT AAATAAAACG ATATTCTCGA AGTTAGATCT TAAACATGGA TACTTTCATG 2520 TTTTTGTTAA TAAGGAATCA ATGAAATATA CATCTTTTAT GACGCCTTTA GGACAGTTTG 2580 AATTTTTAAG GATGCCAATG GGGCTGAAAA ATGCACCGGC GGTCTTTCAG AGATTTATTA 2640 ATAGAATTTT CGAGGATATG ATAAGGCAGG ATAAAGTTAT TATATATATG GATGACATTA 2700 TGATAGCTAG CAAAGGAATT AAGGAACATA TGGAAGTACT TAGGGAAGTT TTTGACAGGT 2760 TGACGAGGAA CAAATTAGAG CTGAGAATGG ATAAATGTGA GTTTCTGCAA TCGAGTGTAC 2820 AATATTTAGG ATTTTTAATA ACAAGTAAGG GAATACAGGC CAATGACAAG GGAATAGAAG 2880 CAATTAGAGA TTTTCCAGTA CCAGATAAGG AGCATGGTGT CAGAAGCTTT TTAGGATTGT 2940 GTTCATATTT TAGAAGATTT ATTAAGGGTT TTTCAACGAT TGCAAAACCT TTATATGACT 3000 TATTAAAGAA AGACAAAGAA TTTTCGTTTG AAGAAAAAGA GTTAGAATGT TTTGAGACAC 3060 TTAAGAAAAA GTTAGTGGAA GCTCCAGTGT TAGGGTTATA TTGTTATAAG GATGAAATAG 3120 AGTTACATAC AGATGCAAGT GCACAAGGTT TTGGTGCAGT GTTATTGCAA AAGAAAGAGG 3180 ATATGAAATG GCACCCTATA TTTTATTACT CGAAAGCCAC AACAAAGGAT GAGGCCAAAT 3240 ATCATAGTTT TGAACTTGAA ACATTGGCCA TCGTGTATGC ATTAAGAAGA TTTAGAATCT 3300 ATGTACAGGG AAAGAGATTT AAAATTGTTA CGGATTGCAA CGCATTAAGT CTAGCGTTAA 3360 ATAAAGTAGA ATTAAACCCT AGGATTGCGA GGTGGGCATT AGAACTATTA GAGTACGATT 3420 TCGAGGTAGT TCATAAAGCA GGAAAGCATA TGCAACATGT GGATGCATTG AGCAGAAATA 3480 CGAACATATT AGTGATAGAA ACTAATACTT TTGAAGATAA TTTAATTATT TGTCAAGCAA 3540 AAGATGAAAA GTTGAAAGAT ATTAGGAAAA AGTTAGAGAA AACGGAAGAT AAAATTTTTG 3600 AAATGAGAAA TGGTGTACTT TATAGGAAGG CAAACGGTGG AAGATTGTTG TTTTGTGTAC 3660 CAGAAGAAAT GGAAGAAAAA ATTTTATATA AATATCACAA TGAATTAGGT CACTTAGGGA 3720 GAGATAAGGT TATAGATGCT ATTGGTAAGA CTTATTGGTT TTCGAATATG AAAGAAAAAG 3780 TTGTTAGGCA TATAGGAAAT TGTTTGAGAT GTGTGGCATT TTCACCGAAG TCGGGAAAAG 3840 AAGAAGGAAT GCTGCATAGC ATCCCAAAGG GAAATGTGCC GTTTGAGATA ATACACATCG 3900 ATCATTACGG ACCAGTAGAT AACGGTAGAA TAAAGAAACA TCTGTTTGTA ATAATAGATG 3960 GATTCACTAA ATATGTTAGG TTATACGCAA CAAAAACAAC GAACACAAAG GAAGTGATTC 4020 TAGCGTTAAA GGATTATTTT AGATCATATA GCAGGCCGAA ATTTATAGTA TCAGATAGGG 4080 GGACCTGCTT TACATCCGAA GATTTTGCAA AATTTATGGG AGAGTATAAT ATTAAACACA 4140 TAAAGATAGC GACAGGTTCA CCCCAGGCCA ATGGACAGGT AGAAAGAGTC AATAGAATAG 4200 TAGGGCCTAT GATAGCTAAA TTAACAGACA TTGAAAAGGG ATTGCATTGG GATGCAGTTA 4260 TTGAAGACGT AGAATTTTCT CTGAATAATA CCAAACAAAG AAGTGTAGCG CAATCTCCCA 4320 GCAAGATGTT GTTTGGAATT GAACAAAAGG GAAAAGTTGT TGATGAATTA AGACAAAAAT 4380 TAGAAGAGTT AGATAATGTT AGTGAAATTA GAGACTTGAA AGAAATTAGA AGAAAAGGAT 4440 CGGAATCGCA ATTAAAAGCA CAGAGGGATA ATGAAAAAAG ATATAATAAT AAGAAAAAAG 4500 AAAGTACGCA GTACAAAGTA GGAGATTACA TTATGGTTAA GAATTTCGAT AGTACAGGAG 4560 GAGTTGCTAG AAAGCTAATA CCTAAACATA AGGGTCCATA TGAAATAGAG AAGGTATTGA 4620 AAAATGATAG GTTTTTAATT AAAGATGTTG AAGGTTTTCA ATTGGCGCGA AATCCATATC 4680 AAGGTGTCTG GAGTAGTCAA AATATTAAAC ATTGGATAGG GAGTAGAAAG TAAATTTGTA 4740 TAATTAAAGA TAAGAAAAGT AAAAAAAATG TAACAAATAG CGGCTAAGAA CAATGTAATA 4800 GTTATAAGTT GGAAATTTCG ACAAGATCAG GTGATCTTAA ATGTCAGGAT GGTCGAATTG 4860 TAGTAGGCTA GAGGTACACA TACATAAGTT CAGTTCATAA TAGCAACATA GTATATAAAC 4920 ATTAAGTACT TGTTCGCTTT CGGGTGGCAT TCTCTCTGTT TCGCAACATC GGCCCTGTAG 4980 CAACAAGCAT AGAGGCAGAT CGCAAACGGA GAGGGAGTCA GTCGAATGCT AAAGCTGAAC 5040 ACGGAAAGTA ACAAATAAAT CAAAGAGTCC AACTCCTGCC TTTGTGTGTT TTAGTTAATT 5100 AAGAGGATAT ATAATACCCC TACA 5124 // ID INVADER3 standard; DNA; INV; 5484 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063428; invader3. XX FT source nnnnnnnn:1..5484 FT SO_feature five_prime_LTR ; SO:0000425:1..298 FT SO_feature three_prime_LTR ; SO:0000426:5186..5484 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 5484 BP; 1941 A; 1014 C; 1192 G; 1337 T; 0 other; TGTGGTAGGA TGAACACTCA TAAGGCCCAC GAAGCATACG AGGAAGCATA CGAAGAAGCA 60 TACGATCATA TCCCCGTTAG GAATAAGTAT ATCATTGCTA TCTCGCTCTT ACTTATAAGC 120 TAGCATTAAG AGATAAGAAT GTACACCTAG CTCACTCTCT CTTTTGAAAG AGCATAAGAA 180 AAGAGGAATA CGCGAGACGC GGCATTCGGT GGCGTGAGCT TGAAGTGAGA AGTCGAATAA 240 AGAGATTGTG AAGTTATTTA AAGGAAGGAG CACGTGTATT CATTTTTCGG CGTCCACAAC 300 ATCAGAAGTG GTCATTACCC ACTATGCCTG ACACTAAATG GAATCAATTC TCAGTGCCCC 360 AACTAAAGAA GTGGTTGGAA GCTCTTGGAT GCCAGACTTC CGGCACAAAA GCGGAACTAA 420 TAGTGCGTCT GCAGGCCATT TCCCCAGAAA CGCGTGGCGA TTCACCGCCA AATAACGGAT 480 CAGCAGCAGA AGAGTTGGAG GAATTGGATG GAGATGCAGC CTGGCCAAAG CAGAGAGAGC 540 CACAGGGTGG AGACACACGC ACGATCCCGT TCGACGAGCA GTCATTGCAG TCAGAACCCC 600 CGGAGAACCT CGATCTGGAT CAGCTATCGT TGGTGGATGC AGAGAGAGCC ACTCTAAAAG 660 AGCTTCGGGA CGAGATAGAG GCAAACATGG CTGTGCTGCA GAGGATCCAA GCGGATATCG 720 ACGACGCGAA CAAGAGCAAG TCCGCTCATG TTCGCCAGAT CAATGACGTC ATCGAGAACA 780 ACAGCAACGG AAACCACGGC AATGTTAACA TATGCGCTAA CAGCACCGAC GCCAATGTTA 840 ACAGCACCAA GGTGACTATC GCCGCCGATA ACATTGATAC GGGACATACC AGTTTTGGAA 900 AAGAAACCGC TGGTAGTGTT GTTCAGCACC AGAGTCGAAA TGTAAACAAG CTTCTGGAAT 960 CTCCTGCAAC ATCGCTTGCC TTGGCAAAAG AGGTAACGAT GGAGTTTGAC GGCAGCTTAT 1020 GTGCGCGCAA TTGGGTCACG CAGTTTCAAA ACATCGGAAA GATTTACAAT TTGGACGACG 1080 GATGCATGCA CATGCTGCTA ATTGCCAAAC TAAAAGGAAA CGCACAGCGC TGGCTGCACG 1140 CAAGCGCCAC ACGCATCCTA GAATCGACCG ACCAGCTATG CGAACAGTTA ATCTTGACAT 1200 TCGAGATCAA GATGTCAAAA GGGGAACTGA GGAGCGCATT TCAAAAACGC GAATGGCATC 1260 CGGATGAGAA ATTTGCTGTT TACTTCGAGG ACAAGGTGAT GCTGGCCAAC GACATCAACA 1320 TCGATCTAGA GGAGCTCCTG GAAAACATCA TTGAAGGAAT CCCAGCACCA GCGTTGCGCA 1380 ACCAGGCGCG CATACAGCGT TTCTCCGAGC CGATGCAAAT TCTGCGGGCT TTCTCGGAAG 1440 TCCGTCTGCC GAAGCACAAA ACAGGGAGCA GTTCATCAAA GCGCCTTACT GGAGGAGGTG 1500 CAGCCAATAA GGACTTACGT TGTGCCAATT GCAACTCCAA AGGGCACTTC GCCAGGGAGT 1560 GTCTCAAGCC AAAGAGGGAG CCCGGATCCT GCTATGGCTG TGGGGCATTT GGGCACTTCG 1620 TCGGACAATG CCCGGAGCGC AAGAGCGCCA ATATCAACAA TTATGTAAGA CTTGTCAAGA 1680 TTTTTTTTCA CAATAACTAT AAAACCCCAT TAATTACAGA ATGCCTCATA GACACAGGGA 1740 GTCCAATATC ATTTACAAAG TTAAGCAGTG TATCCAAAGA AATTAATTTA GAATCCGCTT 1800 CAGGAACCTA TTTCGGGCTG AATAAAAGCC AACTGACATT ACATGGAAAA AGCTTATGTT 1860 ACATTATAAA AGAAAAAGTT AAATGTTATT TTTATTTGTA CGTGGTCCCA GACGAGTCTA 1920 TGAGTCATAA TGTAGTTCTT GGACGAAATT TCATGGAAGC GTGCCAGTTG AAACTGGATC 1980 CAGACGCCTT GGGAAGGATA ACGGTCAACC CAGTCGATAG TAAAAGATAT AAAAAAGCAA 2040 ATAGTAAAAC AATCAGTTCT ATTGTTAGTG AAACAGTTAG TGAAACAGTT AGTGAAACAG 2100 TTAGTGAAAC AGTTAGTGAA ACAGTTAGTG AAACAGTTAG TGAAACAGTT AGTCAAATGG 2160 ATAATGAAAC AGTTAATAAG ACGATTAGTA AAACAGCCAA CGAAGCGAGT AGTATAGTTA 2220 GTCAAACTGT TAACGAAGCA GGTAGCAAAA CAGTCATCGA AACAGTTATG CCTTTAGCCA 2280 ACTTCACGAC GGATTCCGGA CCTTATGCTG CGGTTGTGCC CAGTTATGAA GAAGCTGTTA 2340 GAGAAATTTG TAGCATTGAA CACTTAGAGG ATGAGCAAAT GAACTTCATA TGCGGCGAAA 2400 ACAATAACCA AGCCTTAAAC AGTCAGTTAA AGGAGATTAT TAAACGCAAC TACTTAAGCG 2460 TTAATAAGCC TACCGAGCCT TTAGTCAAAT GCGAAATAAA ACTTTGTTTA GAAACGTCAA 2520 AACCATTTAG CTGCGCCCCA AGAAGACTTG CCTACTCAGA AAAGGAAAAA TTACAAAAAC 2580 TATTAGATGA CTATTTAAAA GAGGGAATTA TTCAACCTAG TGTATCAGAA TATGCTTCAC 2640 CAATTGTTTT AGTCAAAAAG AAAACAGGAG ACATACGGCT ATGTATAGAC TTTAGAAAAC 2700 TGAACAAAGT TTTAATTAAA GACAATTATC CGATACCGCT TATCGATGAC CTTCTAGACA 2760 AGCTAGTCAA CAAGAAAGTT TTTTCGAAAC TTGATTTAAG AAACGGTTAT TTTCACGTAT 2820 TCGTTAATAA GGAATCAGTC AAATACACCT CATTTGTTAC ACCGTTAGGG CAATTCGAAT 2880 TCCTCAGAAT GCCAATGGGC CTAAAAACTG CATCGGCGGT GTTCCAAAGG TTCGTAAACA 2940 AAATTTTCGA GGATCTTATA AGGGATAACA AAGTAATTGT ATACATGGAC GACATCATGA 3000 TTGCTAGCAA GAACTCGGAG GAGCATTTAG AAACTTTAAA GGAAGTGTTT ATCAGATTAG 3060 CTCAAAATAA ATTGGAATTA CGAATGGATA AATGTGAGTT CCTTCAGACA AATATTAAAT 3120 ATTTAGGATT CATTATAAAT GGTGAAGGCA TTAGGGCTGA TGACAAAGGA ATAGATGCCA 3180 TACAGCATTT TCCAGTTCCA GACAAAATAC AAGCCGTTCA AAGTTTTCTG GGCTTATGTT 3240 CATACTTCCG TAGATTTATA AAAGATTTCT CAACATTAGC GAAACCTTTA TATGACCTTT 3300 TGAGAAAAGA TGAAAAATTT AAATTTGGAG AAAAGGAAAT GAGCTGTTTC TCAACTCTTA 3360 AGGAAAAGCT GGTGCAAGCA CCGGTCTTAG CAATTTATAA TTTCAAAGAC GAAATAGAAC 3420 TGCACTGTGA TGCAAGCGCT CTTGGTTTCG GTGCGGTGTT ATGTCAAAAA AAAGAGGATG 3480 GAAAATTACA CCCAATATTT TATTTTTCAA AGCGAACAAC AGTCGCTGAG GCAAAATATC 3540 ACAGTTTCGA ATTGGAAACA ATGGCAATTA TTCACGCACT GCGTAGATTT AGAATATATT 3600 TGTACGGAAA ACGCTTTAAA ATTATAACTG ATTGTAATTC TCTCACTTTG ACGCTTAACA 3660 AGACTGAACT TAATCCGCGA ATAGCTAGAT GGGCGTTAGA GCTCCAGAAC TACGATTATG 3720 AGCTCATACA TAGAGAGGGT AAAAGGATGC AGCATGTTGA TGCTCTTAGT AGATGCACGA 3780 ATATTTTGAT AGTGGAATCA AATACGTTTG AAGACAATTT AGTAATCTGC CAAGGCAAGG 3840 ATCAAAAAGT TCAAGAAATC AAGAAACTGC TGGAAAAAAA TGAACACAAG TTGTTCGAAA 3900 TGAGAAACGG AATAATTTAT AGAAAATGTA ACGACGGCAG ACTTTTATTT TATGTTCCCA 3960 CAGACATGGA AGATCATATT CTATATAAAT ATCACAATGA GATCGGACAC ATAGGTAGAG 4020 ACAAAATGTT AGATGTTATT TCTAAATCTT ATTGGTTTCC CAATGCCAAA GACAAATGTT 4080 TGAGCCATAT AGAAAATTGT TTAAAATGTG TTGCGTTCTC TCCTAAGACA GGCAAGGAAG 4140 GATTTCTGCA CTGCATTCCG AAGGGCAGCA AACCGTTTGA ACTAATACAT ATCGATCATT 4200 ACGGCCCAGT CGACTCAGGC AGATCCAAAA AACATATTTT TGTTGTCATC GATGGTTTTA 4260 CGAAGTTCGT ACGTTTATAC GCCACAAAGA CCACTAATAC AAAGGAAGTC ATTACAGCCC 4320 TGAAGGACTA TTTTAGAGCA TACAGCAAGC CTAAATGCAT CATTTCAGAT AGGGGAAGTT 4380 GTTTTACATC CAAGGAATTT GATGACTTTA TGTTAGAGTT CGATGTTAAG CATATGAAGA 4440 TTGCGACTGG TTCACCCCAA GCAAACGGAC AGGTTGAAAG AGTCAACAGA AGCCTAGGTC 4500 CAATGATAGC GAAACTGGTA CAACCAGAGA AAGGCATATA TTGGGACACA GTAATAGATA 4560 AAGTTGAGCA CGTCTTAAAT AATACACAGC ACCGCAGCAT CAAACAAACT CCAAGCAGAG 4620 TTCTTTTTGG TTTAGAGCAA AGAGGAAAAA TAGTAGACGA GTTAAAAGAA AAACTAGAGG 4680 AATTAGAAAA TACTGAAGAA GATAGGGATT TGGTAAAAAT TAGAAATGAA GTTGAAAATA 4740 ATCAAAAAAA GGCTCAAGCG TATAATAAAA CACATTATGA CAAGACCGCC AGACAGCCAT 4800 CAGAATACAA AAAAGGAGAT TATGTGATGG TTAAAAATTT CGACAGTACA GCAGGCATCT 4860 CTAGAAAATT GATTCCAAAG TGTAAAGGCC CATATGTGAT ATCTAAAGTA TTAAGAAACG 4920 ACAGATTCTT GTTAAAAGAC GTAGAAGGTT TTCAGATATC TCGTAACCCA TACCAAGGTG 4980 TCTGGAGCAT CCAAAATATT AAGCATTGGA TAGGCAACAA GAAAAATAAA CGAAGACACA 5040 ATTAAACAAT AATAATAATA TAATAATAAT AATAACAGAT TAATATTGTA TGTACATATG 5100 TATAAATAAG GTAAAAGAAT TCATCATACA ACTAAGCAAT CAATGTAAAC CATGATCAGG 5160 GGATCAGCTA GTCAGGATGG CCGAGTTGTG GTAGGATGAA CACTCATAAG GCCCACGAAG 5220 CATACGAGGA AGCATACGAA GAAGCATACG ATCATATCCC CGTTAGGAAT AAGTATATCA 5280 TTGCTATCTC GCTCTTACTT ATAAGCTAGC ATTAAGAGAT AAGAATGTAC ACCTAGCTCA 5340 CTCTCTCTTT TGAAAGAGCA TAAGAAAAGA GGAATACGCG AGACGCGGCA TTCGGTGGCG 5400 TGAGCTTGAA GTGAGAAGTC GAATAAAGAG ATTGTGAAGT TATTTAAAGG AAGGAGCACG 5460 TGTATTCATT TTTCGGCGTC CACA 5484 // ID G2 standard; DNA; INV; 3102 BP. XX AC AC006215; XX DR FLYBASE; FBgn0063507; G2. XX FT source AC006215:18012..21113 FT SO_feature CDS ; SO:0000316:264..2960 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 3102 BP; 877 A; 949 C; 642 G; 634 T; 0 other; CGAAGACCGG CCACCCCTTT CAGCAAACCC TGCACCATCA AGCTCCCAAG TATCTCCAAT 60 ATGAGCTACA GAGATGCGCT CATGTCTACC CAAGTGCCTC AGCAAACCAA TCCCAACCAA 120 ACCCGTCAAA CCCCACCTGA ATCTCCGCAA CCCAGTAACA TGGAACAAAT GTTTGCTCGC 180 TTCGAAGGAA TGGTCGAAAG GATGATGGAG AAGATGTTCT CGCAAATGAC GCAGCTTGTT 240 GCTTCCATCC TCAACGCAAA AGTATGCAAA GCCTAAAAAT AGTTATTTGG AATGCTAATG 300 GCCTGCAGAG GAGCAGAGCC GAGGTGGAAC ACCTCCTAAA AACAGATGCC ATCGATGTAC 360 TACTGGTATC AGAAACCCAC TTTTGTCCCA GATCCCATTT TAGCATTAGC GGCTACGATG 420 TCATCCCTGC CAACCATCCA TCTGGCAGAG CGCGCGGAGG AGCAGCCATG CTCATCAGGA 480 GTGGCATCCA GTACACGGAG CTACCTGCGT TTCAGGAGGA ATGGGCCCAG TGTGCCCTGG 540 CCAGAATCTC CAGCCTACAT GGAGACATCA CCGTTGGAGC GGTCTACTTC CCACCTAGAC 600 ACTCGATCAC CGAGACTCAC CTACAAGATT TTTTTGAGTC CTTTGGACCT CGCTTTATAG 660 CAGCTGGTGA CTTCAACGCC AAACACTCCT GGTGGGGGTC CCGCTCCATC AACCCCAAAG 720 GCAGAACGCT CCACAGATTC CTGCAGAGCA GAAGACTGGA TTGCCACTCC TCTGGTGAGC 780 CTACCCACTG GCCTTCAAAC CCTTCCTTGC TGCCCGACCT CCTGGATTTT GCTATCTCCA 840 AAGGCATAGG GCAAGACAGA CTAACTTGCT CCAACTATAA CAAGCTGATA TCCGACCACA 900 GCGCTATAAA AATGCTTCTC AACATCCCCG TCCTCAAAAA AGAGTTGCCC AGAAGACTCA 960 CCGGGAAGCA CACTGATGCC TCCAAGTTCA CACTCTGGAT GCTGTCCTCC CTGCACCCGG 1020 ATCCTCCACT CTCCACTCCA AGCAACATCG ATGCGGCAAT CAAAACTCTC ACGGATGAGA 1080 TGTACAATGC TGCCGAGTTT GCCAATCCTC CTCCTTCAAC GTCCCCAAGA ACTCCCGGTA 1140 GAGACCTTCA CCTCTGGTCT CCAGAAATTG CGGCCTTCGT AGCGGAAAAA AGGCGCCTCA 1200 GGAGAATCTG GTTCTTCTCG CGCAACCCAA GGGACAAGGC AGCACTCAAC AGAGCCATCA 1260 AGGAACTCAA GGACAAGCTG TGTACCCTAC GGCAAGACTC CTTCGACAGA TTCCTTGAGG 1320 AGCTGGAACC TGGTGACCCG CAACATAATC TGTGGCAAGT CACGCGTCAT ATCAGGAGGC 1380 CTGCAAAGAA GGCGTCGCCG GTGCGCAAAG CGGACGGCTC TTGGTGCCGC TCTGAAGCCG 1440 AAAGAGCTGA AGCGTTTGCA GATCTCCAGA ATGCATTCAC ACCATTTGAC AGATGCACTG 1500 GCGAAGAGCG TGCTGCAACC ACCAGGTTCC TAGAGAGTCC ATGTCCTCCT AGCCTGCCCA 1560 TAGAGCCCGT CACCCCAGAA GAGGTTGCGC AAGAGATCGC CTCCCTAAAG GCTAGCAAAT 1620 CCCCAGGACT GGATCGCATC GACGCCACAT CCCTTAAAAT GCTGCCACCT CCCTGTTCCC 1680 AGTTGTTGGC CAACATATAC AACAGATGCT TCTCACTAGG GTACTTCCCG AGATCATGGA 1740 AACGTGCAGA AGTCATTCTC ATCCTCAAAC CTGGAAAACC TGAAGCCAAT CTTGCCTCAT 1800 ATAGACCGAT TAGTCTGCTG ACAATCCTCT CCAAAATACT CGAAAGAGTA TTTCTGCGCA 1860 GAGTCATGCC AGTACTGGAC GAGGCTGGAC TGATCCCTGA TCACCAGTTT GGCTTCAGGC 1920 GATCCCACGG AACACCCGAG CAATGCCACC GGCTCGTAGC ACGCATCCTA GATGCATTCG 1980 AGAACAAACG ATACTGTTCG GCCGTATTCC TGGATGTCAA GCAGGCGTTC GACAGAGTGT 2040 GGCATCCTGG ACTCCTCTAC AAACTCAAGT CCCACCTTCC CAGTTCCCAC TATGCCCTAC 2100 TCAAATCGTA TACTGAAGGA AGAGAGTTCC AAGTGCGATG CGGTTCCTCA ACAAGCACGA 2160 CAAGGCCTAT ACGAGCCGGA GTACCTCAAG GCAGCGTCCT TGGTCCCATC CTCTACACCC 2220 TGTTTACAGC AGACCTCCCT ATCATACCCT CCCGTTACCT CACAGCAGCC ACCTATGCAG 2280 ATGACACGGC GTTCCTTGCC ACCGCAACAA ACCCTCAATT AGCATCAGCC ATCATCCAGA 2340 GGCAACTGGA TGCACTGGAT CCATGGCTGA AACGCTGGAA CATCATGATC AACGCTGATA 2400 AATCCTCCCA CACCACCTTC TCTCTGCGCA GAGGAGAATG CCCCCCGGTC TCACTCGACG 2460 GCGACACAAT CCCTACCTCC AGCACCCCCA AATATTTAGG GCTGACCCTA GACAGAAGGC 2520 TGACTTGGGG CCCCCACATC AACAATAAGC GTATCCAGGC CAACATACGC CTAAAGCAAC 2580 TCCACTGGCT CATCGGTAAA AAGTCCAAGC TGCGAGAGAA ACTAAAGATT CTCGTCTACA 2640 AGACTATTCT CAAGCCAATC TGGACGTACG GAATTCAGCT GTGGGGCACT GCTTGCACAT 2700 CACATAGAAG GAAGATCCAG CGATTTCAAA ACAGATGTTT GAGAATAGTC TCCAACGCCC 2760 ATCCCTACCA CGAAAATTCC GCCATCCACG AGGAGCTCGG GATTCCATGG GTAGACGACG 2820 AAATCTACAG ACACAGCGTG AGATATGCTA GCAGACTGGA GAACCACCAC AACCACCTGG 2880 CCGTCAACCT TCTAGACCAC AGCCAATCCC TAAGACGCCT GCAGAGAACG CACCCGCTTG 2940 ACCTTACTCA ACATACTTAA TCATACTTAA CCCCTACCCA AGTACACTCG ATGTACTCCC 3000 CTTAAGTTAA TGTTTCCCTC CAAAAAATTT AATTATTGTC CACTAGGACA GATTTTAAAT 3060 AATAAATAAA AGGCTACAAA AAAAAAAAAA AAAAAAAAAA AA 3102 // ID DMCR1A standard; DNA; INV; 4470 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063594; Cr1a. XX FT source nnnnnnnn:1..4470 FT SO_feature CDS ; SO:0000316:435..1466 FT SO_feature CDS ; SO:0000316:1622..4294 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4470 BP; 1225 A; 982 C; 828 G; 1435 T; 0 other; TTGGCTTTCA AAGCGAACGG TCGTTTTTTT TTGCGCCCCG ATTTTTATTT TGTGTTTGCC 60 AAACCAGCCG TGGTTTTTAA TAATCTTCTT TAATTATCAT ATATTTTAAA CATAATTATT 120 AAAGATCCGT TCGCTTCGCG AGTGATTCTT TGTGATTTGT GCAGTAGTTT AAACAAATAT 180 ACAAAGTATT TTGCATTTTT CGTCTTTGTG TTTTGTTTAC TCTCCCTTTT GTGCGTTGTT 240 CGCAGTGCTG TTTGGTGTTG TTTTTTGCAT GCACATAAAC TGCACTTTTC GCTCTCACTC 300 TCTCGCTCTT TCGCTCTCCC ACACAAAATC TAAATTTCTC AGCGTGCAGA CTGCTCTCGT 360 GCGGTTCTTT TAACAGCTCG CTTGAAAGTT TTGATCTTGT GTTTTCGTGC ACAGTGGTCT 420 GATTTTAGAT AAACATGGAA ACCGTAAATG TTGTCTGCTT TTATAAAAAT TGTTGCCAGG 480 TTATAAAACC TGAGCAGCCA AAAATGACTT GTTGGCTATG TGATAGAATG GTGCATACTA 540 AGTGCGCTGG TTTTAATGGC CGTACAAGTG ATGACCTAGC AAAGGGTAAA AATCTAAATT 600 ACTGTTGTGA TGCTTGCCTT GTGGTTGCGA ACGAGATGAA ATCGTTTATG CGCCAAACTA 660 AAGGTGGCTT GAAAGAACTG ATCAACAGTT TTGGCGTAGC TAGAGATAGT TTTCGTCGAG 720 CCGATGATCT TCTTTCCGCG CTTAATTCAC AGTTTAATGG CCTTAAGCTA TTAGATGAAT 780 CTCCAAAGCG CAAAAAAGCC GCTGGTGGTA GGCTGCCTAA GGCCCCGCAA CCACGGGCAA 840 ATGATCAACA GGACTCCACA ACTGATCAGC TGTCAATCGC AGCAAATCCG CATATGTTGC 900 TAAGATCGGC AGCTAAAAAA GATTCGGAGC AATCCGATCA ATCCGCTGTG GAGGGAGCCC 960 ACCCTGTGAT TGCTAAAGAT TCCGAATTGT CTGCACTGGT TTCTCAACCA GTGATTCCAC 1020 CTGTCTCGAT TAGTTTACCA CCACAAGGGA ACATTGAGTC CGGACTACCG GCACAAGTTG 1080 GCACTTCAGA AATACAATCT GCAGGGCCAA AACCCCTCGC GGTGGTTCCA CTAAAGAAAC 1140 AAATTTTCGT TTCTCGGCTT TCCCCTGATC AAACATCGTC TGATGTATTG TCTTATATAC 1200 AAGACAAAAC AAAAGCCGAT AATATAAAAG TGGAAAAATT TAACTTCTCT TACGCTAGGG 1260 ACATATCATC TTTCAAGATA ACTGTCCCAA ACGAGCTTTT TTCGACCATA TGTTCGGGTG 1320 ATTTTTGGCC GGATAGTATG GTGGTAAAAG AGTTCGAAGC TAAGATCAAA AATAGGAAAA 1380 AGGGGCCCGT GGGAATTAAA CTTCCCTCAC GTGGCCAGAC CACTGCACCA TCCGCCTCTT 1440 CATCTTCTTC CTCTTCAAAA AACTAACAAC GAATCTTACT ATCTCCTATC AAAATGTTAG 1500 AGGCTTGCGT AGCAAATTAA CCAAATTGTA TTGTGATAGT CTTTCATTTG CATCCCATAT 1560 AATAGTGTTC ACAGAGACTT GGTTAAAGCC GGAGATACTT AACTCCGAAG TCTTCCCAGG 1620 TATGTACACT ATATATAGGT ATGACCGTCC CTCCAGACGG GGAGGTGGAG TCTTAATTGC 1680 GGTTAGATCT ACCCTCGCGT CGGAGGAGTT ACTTTTGGAC GAATCCCGTA ACTCGGAATT 1740 CTTATGCGTA AAGCTGTCTT TTTCCGACAG ATCAGTTTAT ATTACGTGCT CGTATATTCC 1800 GCCATCTTCT GAATTCCCAG AATATCAGAA TCATCTGTCC GCTATCCAGT CCATCTTGAA 1860 TAAACTGTCT GACAGGGACC AATTGGTAGT TCTAGGTGAT TTTAATATAC CTGGCACAAT 1920 GTGGTCCCCA GAAAAACAAT CGAATATCCT TCTCCCTTTA GCACAACATG ACTTTATTGA 1980 CGGCTTGCTC GACTTATCGT TGTCGCAAGT TAATTATATT CCAAACTCTC TTGGTCGACT 2040 GTTGGATCTA TGTTTTGTAA CAAGTCCTGA AAGTGTGTTT CTGTCCAGGG TAGCACCTCT 2100 TACTCAACCT GAAGATCCAT ATCATCCTAC TTTCGAGGTG GCAATCGACA TAGGTACGGT 2160 ATTAAAAGTA AATTCAGAGA AGTCAACAAA GCGAATTCCC TGTTTTCGCA AGGCAAATTT 2220 TCGGAGGCTA AACAATTTTA TAGCTGGCTT TAATTGGTCC GATCTTTACT CCTGCAACAT 2280 AATGGCGGAT GCGATTAATA TGTTTTATAC CGCAATTAAA TCATTTTTTG ACTCTTGTGT 2340 TCCGATGTAC TATCCGTCAA TCTCTAAACC TCCTTGGTTT AATAAAGAGT TGACACACCT 2400 AAGAAATGTA AAGTCCAGAC TCTATATAAA ATTTAAAAAT ACCGGTTCTC AGTCCATCCT 2460 TTGTAAATAT TTAAGCGCTC GGTTAAACTT TACCGTGCTT AATGCTCAGT GCTACAAGAA 2520 TTATTTAAAC CGTTGTAAGT TCCAGTTTGC ACAGGATCCG AAACAGTTTT ATAATTTCGT 2580 TAGCACTAAG CGAAAAACAA GTTCTTACCC TTCCTCGCTA TTTTTTGAAA ACACTACGGT 2640 AACATCGGAT CAGGCAATAG CCGATCTATT CGCCAAGTTT TTTCAAACAA CATATTCAAC 2700 TTTACCTCAT TCCGAACAGC CATACTCTTA TGCGGTATCT AAGTCGAATC TAATATTTTG 2760 CCCCACTATA AACGAAAGCT CACTGCTTAA CGATCTTCAG CGTGTTAAAC CGGTCTATTC 2820 GCCAGGTCCC GATGGAATCC CTGGCTGTGT GCTCAGATTC TGTGCGGAAG CCTTGTGCAA 2880 GCCTCTACTG AAACTGTTTA CCTTATCTCT GGAATCTTCA CAATTCCCTC ATATATGGAA 2940 GGAGTCCTTT GTGATTCCTC TTCATAAAAA AGGTAGCAAA CTGGATGCAA GCAATTACAG 3000 AGGAATCTCT AAATTGTCGG CTATCCCAAA ACTTTTTGAA AATGTTATCA CTCCTCATTT 3060 GCAGCACCTT TGTAGATCAA TCATATCACC GTGTCAACAC GGTTTTATGA AACGCAGATC 3120 AACAACCACT AACCTCTTGG AGCTAACTTC TTTCGTAATA CAGGGTTTCA AAAATAATCT 3180 TCAAACAGAT GTCATCTACA CTGATTTTAG TAAAGCATTT GACTCTGTTA ATCATTACCT 3240 TCTAATAAGA AAACTTGATC TTTTAGGTTT CCCTGTTGAT CTTCTAAATT GGATTTCAAG 3300 CTATCTGAAT GGCAGGACAC AACAAGTCCT CTTTAAAAAT TCTTTATCTT GTATTCTCCG 3360 AGTCACATCC GGTGTCCCAC AAGGGAGCCA TCTTGGTCCG CTTCTTTTTA CCTTATTCAT 3420 TAACGACCTC CCCTTAATAA TAAAACATTC GCGTGTACTT ATGTACGCAG ACGATGTTAA 3480 ATTATGCCTC CAGCACAAGG ACACTTCGTG CCATTTGGAC TTACAATCCG ATCTAGACCG 3540 ATTTCAAATA TGGTGCCGTG ATAACGTATT AGACTTAAAT GGCTCCAAGT GTAAAGTTAT 3600 GACCTTTTGT CGTGCCAACC CAATACGCAC GACTTACACT CTAAGTGGGT GCTCTTTGGA 3660 CAGAATAACA CGAGTTGATG ATCTTGGTGT TCTTCTGGAC CCTAAACTAA AATTTTCTGA 3720 CCATATTTCG TCTATTGTCA ATAAGGCCAG GGGTGTGCTT GGTTTTATAA AAAGGTGGTC 3780 TAAGGAATTT GATGATCCTT ACTTGACTAA AACCTTATTT ATTTCGTTAG TCCGTCCGAT 3840 TCTCGAGTAC GGATCACCTG TTTGGAGTCC ACAATACGCA GTCCACTCGG ACCGCATTGA 3900 ATCGGTCCAA AAAAACTTCT TACTTTTTGC CTTGCGGCGC CTAAATTGGG ATGCAAACCG 3960 TATATTACCT CCTTATTCCA GTAGACTACT TTTAATTAAT TTACCGTCCC TAGCTAACCG 4020 TAGAACTATG CTTGGAACAG TCTTTATTTG TAAGCTTATT CGTGGGGATG TTGAGAGTCC 4080 CGACTTGATT AGTCGGCTTA ACTTCTCGGT TCCAAGTAGA TTCACTAGAA ACTATATACC 4140 CCTTATCTTA AATCATTGTA GATCTAACTA TGAGTTGCAT GACCCTTACA GAGTTTTATG 4200 TTCTGACTAT AATAGACTTT ATCCTATTAT CTGTAATCTT GACTCTCTGC CGCTCTTAAA 4260 GCAATCGATT TTAACTTTTT TATTACATAA TTAGATCCTA CTAACACTAA TATTTAATAT 4320 AGTTATTTTA CATTATATTC ATTTCCTCGT TCCTGTTCTC TTTTTTATAT CGCGTCTATA 4380 TCTTCTCGCG AATCGAGCCG TACGATACAC GGCAGCGCCC CTCGGTCGGT TGGGCGGGAG 4440 GTGTGGCCGT GGGACCCGTG CGAAAAAAAA 4470 // ID TC1 standard; DNA; INV; 1666 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0026410; Tc1. XX SY synonym: Mercurio XX FT source nnnnnnnn:1..1666 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..26 FT SO_feature terminal_inverted_repeat ; SO:0000481:1640..1666 FT SO_feature CDS ; SO:0000316:359..1354 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1666 BP; 535 A; 288 C; 369 G; 474 T; 0 other; TACAGCTGCG GTTAAAATAA TAGCACTACT GCAGGTGGAA AGTTGATTTC CTAAAAAAAA 60 ATTATTAAAT GTTTATATTT TTTAAAGTCA GATTGCATGA ATAATAAGTA CCATATGTTG 120 GCTCTCTGAG CAAGAAATTT TTACTCTCTC AATGTAACGG TTCTTTTTGT TTTTGGGCAC 180 TTGCTGCAAA AGTGCGCGAA ATGAGGCGGT AACAAAAATA GCACTGACCA CGTATTTGCT 240 GAATAAAATT AATAGGAGTG ATTGCTTTGG GTTTTTTTTC GACAAATTTT GAAAAAAGGA 300 GTTGTATTAA AGGTTTTAAT TGAATTTTTT CCAAACGAAG ACCAAAAATT CTCTAGTTAT 360 GGGTCGCGGA AAGCATTGTA CCGTCGAAAA AAGAAATTTG ATTAAAAACA TGATCTCTGA 420 AGGTAAAACC TACGCTGAAA TTGGTCGCAT TGTCGGTTGT TCAAACAAAA TGATCCGCAA 480 AGCACTGCTG TTTGTCGAAA AGAACGAAAC ACGGCGAAGA AAGCCCTCAA TGTCCAACGT 540 GGAGATCAAG CGCTTGGTTC GGCAAGGCAA GAAGGAGCCT TTTAAGCCGG CGACGGAACT 600 GAAGAAGGAG CTTCAGATAG CTGAAAGCGT GGAAACTGTT CGCAAACGCT TAAGACAAAA 660 CAACCTTAAT GCGTGCAGCC CAAGAAAAGT CCCGCTTTTG ACTGTTAAGC ATGTGGCAAA 720 GCGAATCGAA TATGCCAAGA TACACAAGGA CTGGCCTGTG GAGAAGTGGC GCAACATTTT 780 GTGGTCAGAT GAGAGCAAAA TTGTGTTGTT TGGTGGGAAA GGCTCTCGGT CTTATGTTCG 840 GCGTCCACCA CGAACTGAAT ATAATCCTCG CTTCACCTTT AAGGCGGTAA AGCACGGGGG 900 ATCAAACATC ATGGTATGGG CGTGCTTTTC ATACTATGGA GTAGGTCCGA TCCATTGGAT 960 TCAAGGCATC ATGGATCAGC ACATTTACAC AGATATCCTG GAAAATGTGA TGCTGCCATA 1020 TGCCGAGGAT GAAATGCCGT TGGTTTGGAC ATTTCAACAG GATAACGACC CAAAACACAC 1080 GAGCAAGAAA GCTCGAAAGT GGTTTGAGCA GAAATCGATC CGAGTAATGA AATGGCCTGC 1140 TCAGTCACCC GACTTGAATC CAATCGAAAA CCTTTGGGCG GACGTGAAAA AAAAGGTTTC 1200 TGAAGCCAAA CCCAATAATA ACGAGGATCT TTGGGGTGTG GTCAAGGATT CATGGAGCAA 1260 AATTCCTCAA AAACGGTGCC AGGACTTGGT GGACTCCATG CCAAGACGAT GCGCAGCTGT 1320 CATTGCCAAC AAAGGTCACG CAACCAAATA TTAAGATTCT TTAAACATAG TTCTTAAGAT 1380 ATAATCCATT TGTTGAATTA TTTTTTTATT TTTTTGGTTA GTTTTAGCAA ACTACGAGAA 1440 ACAGTGCTAT TTTTGTTACC GCCTAAATTT GAAGGTTTTT TTTTTTGTTT TACCAATTAT 1500 TTTTAAAATA TCCATTAGAT CTGTTACTAA TTTTTTTATT TCGATTGAAA ATCATTGTAG 1560 TATTTAAGTT TAGTGAGTAA AATGATGAAA AAGTGTCAGA AAATAGAGAA ACGCTGGGAC 1620 AAACACGAAA TGTGCTTATG GTGCTATTAT TGTTACCGCA ACTGTA 1666 // ID DOC2 standard; DNA; INV; 4789 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063534; Doc2-element. XX FT source nnnnnnnn:1..4789 FT SO_feature CDS ; SO:0000316:1315..1989 FT SO_feature CDS ; SO:0000316:2023..4674 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4789 BP; 1666 A; 1225 C; 873 G; 1025 T; 0 other; AACGCTCGGT CGCTGACGTG TGAAGACGTG TTTTCATCGA GCTCCGTATA AAATCGGTTG 60 TTTTTAGTGA AGTGAATGGT TATTAATATA ATACAAACAA ATAAATCGAA AGCGAAAGAG 120 ACGCTCTGTG CGATGCACCA TCGCTCGAAT ACCTATTCTG TGTAGTATAT AAGTATACTA 180 AGTTATAAAT AGTATAAGCA AGTAGTTTTA CACCTTCAAA AGAAAAAGAA AAAAAAAAAA 240 AAGGAAAAAG AAAAAAAAAA AGGAAAATAA AAAAAGACAC TATGATCCGG AACGACACTC 300 GCGTTCAGCG ACAACGCGAG CAAGACGATC GCCGGCTCTC ATTGCAACGA AACAACGCGT 360 ACTTCTCTTA CGTCTCCGCC ACTTCTGCAG ACAGCGAGCG GTCAATCACC CATAACCCGA 420 CAAGTTTACC ATTGCCAACA ACTCAAGAAA GAGCGCGTTC TTGCTCTCCC TCACTACTGT 480 CACAAAACCC CCAATTTCCA AACACTTGCG AAATTTCTTT ACGCTTACCT ACGGTGACTA 540 GTACTGTGAC AACAACAACC ACTGCAACTG CAATCACAAC AACAACTGGA ACGGTATCGT 600 CAGCTCGCTC AATTATGTCG ACGCAAGCAT TTGCTCCTAC TAAGCCAGCG ATCAAAACAA 660 AAACACAATT TAACGGTTCT GCTGCGCTAT CAGCAAGCCA AAACGTAAAC AAGAACGCCG 720 GGTCCACCAT ACAGACTGGA ATGGATCGCT ACATAACAAT AAAAAGAAAA CTTAGCCCTC 780 AAAATTACAA AAGCCAACAA ACAGAAAACA AATCAAATAT TACACCGCGA CAATGCTAAC 840 ATGCTCACTA ACAAAACGCC AGAAAACACC AATAGATTTG AGATTTTGAC CGGCAAAGTC 900 ACCTAAGAAG GTCAAGCCTC CTCCAATATA CATTCGCGAA AAAAGCTCAA GTGCATTAGT 960 GAACAAAATA GCCACACTTA TAGGAGAAAA CTCATTTTAC GTAATACCCT TAAGAAAAGG 1020 GAACATAAAT GAAACAAAGG TTCAAACTGA AACCGAGGAT AACCATCGTA TACCAACCAA 1080 ATTCTTGGAT GAAGCAGGAA AAAACTTTTA TACATACTAA CTAAAAAGCG CAAGGGGCCT 1140 ACAGGTTGTT CTCAAAGGGA TTGAGGCAAC TGTTTCGACC ACTGAAATCG TAGAAGCGCT 1200 CAAGGAAAAG AACTTTAGTG CAAAAATGGT CATGAACATA TTAAACAAAA ATAAGGAACC 1260 ACAACCAATG TTCAAAATCG AGTTGGAGCC AGAGCGCCAG ACACTGAAAA AAAAATGAAG 1320 TTCATCCAAT CTACAAGTTA CAGCTCCTAC TGCATTCGGC GTATCACAGT GGAAGAGCCA 1380 CACAAGCGCA ATCGCCCAGT GCAGTGCACT AACTGCCAAG AATATGGCCA CACTAAGGCG 1440 TACTGTACGC TGAAATCTAT TTTGCGTAGT CTGTAGTGAA GCTCATAGCA CAGCGAACTG 1500 CCATAAGAAC AAGGAAGACA TCTCTGTTAA AATGTGCAGC AACTGCGGTG AAAGTCACAC 1560 AGCGAACTGG CGTGGCTGTG TAGTATACAA GGAACTAAAG AACCGCCTTA ACAAACGCGG 1620 AGAATCAATA CGCGCTCAGA CCACCCACAT CGCTCACACT CCGCCACAAT CGGGCCCGTC 1680 CTACACTGCA CCCCCCCCCC CCCCCCTACA CATCGTACCA GTCCTAGCGT CTCCTTCGCT 1740 AGTGCCTTAA AATCGGGAAT TAAATCTGTG AATCCGCTAA CACAAAAGTC GACTTCTACT 1800 CGGGAAGAAC AGAAAATAAC TACCTTGAAC GAACAAACGC AGCATATATC AACTGGCATT 1860 GAAACGATGA TGATCTCACT CCAGCAAACC CTAAAAGAGT TTATGACATT CATGCAAACC 1920 ACAATGCAAG AGCTTATGCG AAACCAAAAC ATGCTGATAC AGCTTCTTCT GTCATCCAAA 1980 TCAATATAAT GTCTCCACTT CAAATATCTA TGTGGAACGC GAATGGCGTT TCACGGGCAT 2040 AAAAACGAAC TTGCCCAGTT CTTATTCGAA AAAAATATTG ACGTCATGCT CCTCTCCGAG 2100 ACACATCTAA CAGACAAGCA CAACTTCCAT ATCCCAGGAT ACTTATTCTA CGCCACAAAC 2160 CATCCGGACG GCAAAGCCCA TGGGGGCACT GGTATACTTA TCAGAAGTCG CATCAAACAC 2220 CACCTTTATA ACAGAACTGA AATGGACTAC CTGCAATCCA CTTCCATAAG CATGCAATCC 2280 AGCAGCGGCC CCGTCACTCT GGCCGCAGTC TACTGCCCAC CTCGTTTTGT AGTTTCTGAG 2340 GACCAATTCT CGGAATTTTT CAACTCACTC GGTGAGCGAT TCATAGCGGC CGGTGACTAC 2400 AATGCCAAGC ATACGCACTT GGGGATCACG TCTTTTGACC CCAAAGGGCA AACAACTTTA 2460 CAACGCGCTC ATTAAGCCGC GAAACAAGCT CGATCATGTG ACTCCGGGTA GGCCTACATA 2520 CTGGCCAGCA GATCCAAATA AGCTACCGGA TCTCATTGAC TTCGCAATAA CAAGAAAGAT 2580 TCCAAGAAAT AATATTACGG CCGAGTCACT TGCAGAACTA TCATCTGATC ACTCTCCAGT 2640 TCTACTTACT CTCTGGCACC GACCACATAT AACTGAGCGG CCCTACAGGC TGACAGGCAA 2700 CAGGACCAAC TGGCAAGGTA CGCGAAATAT GTTTGTACTC ACATGGAACC AACTCAAACA 2760 GTATTCACCG ACGAGGATGT TGATCGCCTG GTCAAATCGA TAGAGGAAAC ACTTGTTGCT 2820 GCAGCCAAGG CCTCTACACC TCCAGACACA CACAAAATGA CAAATCATAG CAAGACAAAT 2880 CGTGAAATCG AACAGCTAGT ACTTGAAAAA CGAAGACTAA GAAGAGAATG GCAGAATCAT 2940 AGATCCCCAA CGGCTAAAGA ACATCTAAAA ATAGCAACAC GTAAACTAAC CAGGGCACTT 3000 AAGCTTGAGG AAGCCAACGC CCAGCATCTA TATATCAAAC AACTATCGCC CACCAGCAAA 3060 AGGAACCCTT TGTGGAAAGC ACACAAAAGC ATACAGCCAC CGGCAGAAAC AGTCGTTCCA 3120 CTGCGAGGTC CCTCTGGAAG CTGGATACGC AGCGACGAAG ATAGAGCTAA TGCGTTCGCC 3180 GATCACCTTC AGAAAGTATT CCAACCAAAT CCTGCTAGCA ACTCATTTGT CCTCCCTGTA 3240 GTAACTAAAA GCTTGCCCCC TATAAATCCA GTAACATTCA GCCAAAGTGA GATAGCATCT 3300 ATCATAAAAG ATCTTAAGCC CAAAAAATCA CCTGGACATG ACTTGGTGAC ACCAAAAATG 3360 ATCATAGAAC TCCCACCGTG TGCGGTTCTG ACCTTATGCC ATCTCTTCAA CGCTATTACG 3420 AAACTTGGAT ACTATCCCCA AAGGTGGAAA AAGGCGGTTA TAGTAATGAT TCCCAAGCCA 3480 GGCAAAGACA AAACGCAACC CTCATCCTAC AGACCAATCA GTCTCCTAAC GTGTCTCTCG 3540 AAACTGTTTG AAAAGGCATT TTTAAAACAA CTAACTCCCT ACTTAAAAAG GCGAAATACA 3600 ATACCATCGC ATCAGTTTGG CTTTCGCAAA AATCACGGAA CGATCGAACA GGTCAATCGC 3660 ATCACAAACG AAATTCGTAC CGCTTTTGAG CACCGGGAAT ACTGTTCAGC TATATTCTTA 3720 GATGTTGCAC AAGCATTTGA CCGAGTCTGG CTGGAAGGAC TAATGTACAA GATAACAAAA 3780 CTGCTTCCCC AGAACACGCA CAAAGTGTTC GAATCTTATC TCTTCAAAAG AGTGTTCTCA 3840 ACCAGGTGTA ACAGTTCGAC TTCACACGAC CGCGTTATCA ATGCTGGAGT CCCCCAAGGT 3900 AGTGTACTGG GCCCCGTTCT TTACACCCTA TTCACAGCAG ACATGCCTAC AAACTATCAG 3960 CTCACAACCT CTACGTTTGC TGACGATACC GCAATACTTA GCCGATCTAG ATGCCCAGCA 4020 AAGGCAACAG AGCAACTTGC CTGCCATCTT AAAATAGTGG AAAGATGGTT TGCGGACTGG 4080 CGCATTAAGA TAAACGAGAC CAAAAGCAGA CATGTAACCT TCACACTGAA CAGACAAACC 4140 TGCCCACCCT GTACCCTGAA CAATACATTT ATCCCACAAG CAGACGTAGT AACATATCTC 4200 GGCGTTCACC TAGATAGACG CCTCACCTGG CGGCGACATA TAGAATCTAA GAGGACTCAT 4260 ATGAAGCTTA AAGCAGCCAA TCTTCGCTGG TTTATTAATT ACAACTCTCC CCTTAGTCTG 4320 GAATACAAAG TACTCCTCTA CAATACCGTG CTGAAACCCC TCTGGACATA CGGCTGTGAA 4380 CTATGGGGAA ACGCTTCAAA AACCAACATT GAAATCATAC AGCGAGCTCA GTCCACGATT 4440 CTGCGAACTA TCACTGGGGC ACCATGGTAC CTACGCAGCG AAAGCATCCA TAGAGACCTC 4500 CACATAAATC TTGTCAATGA GGAAATTCAG ATGAAAAAAA GTAAACATCA AGCAAAGCTC 4560 GCCGCACACG AAAATCCTCT CGCGAAGTCG CTTACGCGAG TCTACAGTCA GAGCCGACTG 4620 AAGCGCAAAG ATTCTCCAGC CCAGCAAAGA AATCCTAGGG CCGTCTCTAA CTAATACAAT 4680 TACTTCATAT TGTAATTAAT ATAAGTTATA TAATAAGATT TGAATAATTA TTGTTAGTCT 4740 CACAAAAAGA GAAGATCCAA TAAATAACGC AATAAGTTAA AAAAAAAAA 4789 // ID DOC3 standard; DNA; INV; 4740 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063533; Doc3-element. XX FT source nnnnnnnn:1..4740 FT SO_feature CDS ; SO:0000316:221..1939 FT SO_feature CDS ; SO:0000316:1939..4608 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4740 BP; 1639 A; 1175 C; 906 G; 1020 T; 0 other; TGGCATTTCA AGTGAACAGA CGTGTTTTCA TCGAGCTCCG TATAAAATCA CTTGATTTTA 60 GTGAAGTAAA TAGTTAATAA CATAAAATAA ATAAATAAAT CGAAAGCGAA AGAGACGCTA 120 TTGGCAATGT TAAATTGCTA TACTACAAAA TTTTCTGTGT GTGGAATAAT AAAGTAAAAC 180 TAAAAAACAA ATAGACAACG AACGAATCGA TCGTTCTAAA ATGAGCCAAC GCGAATTGCG 240 CGCTCAGCGG CAAAAAGAGC AAGACGAGCG CCGACTCTCT TCGCAGCGGA ACAATGCATA 300 CTTCACCGTC GTGTCTGACG CCCGCGAGTG CCAAATCATC CACCCGGTCA ATGACCTAGC 360 ACCTGCAGAA TGCAAACGAT CGCGGTCTTG CTCACCAATG CTTATGACTG CACGAAATGA 420 ACCCGAACGC ACTGAGAGCG TCTCTCTCAA GCTCTCCCCA TTCCAAACGG TGTCAAGTTC 480 TTTAGCTACC AGTACAGCAA TCACGATATC TACTGCAACT GCACCTTTAG CAACAACAAC 540 TGCAACAACA ACTGCAACAG CATCAACATC AACTCTATGC TCAATACCGA CAGCGGTGTA 600 CAATTCCGCT GTACCAACAA CAACGAAAGC AAGAGCACTC ACACCGACTC TTGAAAGCGG 660 AAGCGGACAA GAACCAAAAC AAAACAAAAA ACTTTGTCAA ACAGGGCTGG ATCGCTACAT 720 CCAAATAAAG CGAAAGCGAA GCCCACAACA CAAGCAGGCT GGTAATCAGC CGAAAATAAA 780 TCGCGCCAAC GCCTCCAATG ACTCAAGTTC AAATAGGTTT GCACTATTGG ATGACTGCGA 840 AGTGCAGGAG CAGGAGCAGA AGAAGATAAA GCCGCCACCA ATATATATAC GTGAGAAAAC 900 CTCAAGTGCG CTGGTTAACA AACTAGCTGC AATTATCGGA GAAAACAAGT TCCATGTTAT 960 ACCTCTTACT AAGGGCAACA TACAAGAAAC GAAAGTGCAG ACTCAGGATG AGTCTAGCCA 1020 CCGATCAGTG ACCAAGTACC TGGACGAAGC TGGGAAGAGT TACTATACCT ACCAGCTGAA 1080 AAGTGCCAAA GGATTGCAGG TTGTCATTAA AGGAATAGAG TCATCAGTGA CCTCATCCGA 1140 AATTATCGCG GCACTAAAGG AGAAGAACTT CAACGCCAAA ACGGTGATCA ACATTCTCAA 1200 CAGGAACAAA GAGCCGCAGC CACTCTTTAA AGTCGAACTG GAGCCATCGA GTCAGAAAAA 1260 CAACAAAAAT GAAGTACATC CTATATATAA GCTACAGTAT TTACTGCACC GAAGGATAAC 1320 CGTAGAAGAG CCACACAAGC GCAGGCAACC TGTGCAGTGC ACCAACTGCC AAGAATATGG 1380 ACACACCAAA GCGTACTGCA CCCTAAAATC CATCTGCGTC GTTTGCAGCG ATGCCCACAG 1440 TACTGTAAAC TGTCCTAAGA ATAAAAACGA CAGTTCAATT AAGTTATGCA CCAACTGTGG 1500 CGAAAAACAC ACAGCCAACT GGCGAGGATG CATTGTTTAC AAGGAACTCA AGAGTCGCCT 1560 AAACAAACGA GTGGATTCAG CTCGAGATCG TATCACACCA ACAATTCTCA AAGCAATGGA 1620 ACCGACTACG ACTAACATCC CTTCGTCTGC CAGCCACCGA ACAATGCCAA ATTCTTCCTT 1680 TGCAAGTGTA CTAAAATCAG GGATCCAAGC ACCAACAGCA GTCACCTCCA GCGTCCAGCA 1740 TAAAACTCAC CAAATAAACA CAAATAAACA AATAAACACA CAGAACATGC AACAGCAGCC 1800 AATTGGCGTT GAAGCGATTA TGCTTTCGAT GCAGCAAAGT ATGAAAGACT TCATGACCTT 1860 CATGCAAGAA ACTATGCAAG AGCTTATGAG GAACCAAAAC ATCCTTATTC AAATGCTCGT 1920 TTCTTCCAAA ATAGGATAAT GACTCCCTTA ACAATAGCTA CCTGGAACGC TAATGGCGTT 1980 TCGCGGCACA AGCTAGAACT TGCTCAGTTC TTAGCCGACA GAAACATTGA CGTTATGCTA 2040 CTCTCAGAGA CTCACCTTAC CGAAAAGTAC AACTTCCAAA TACCAGGATA TAAATTTTAC 2100 TTTACGAATC ACCCGGATGG CAAGGCCCAT GGAGGCACTG GCATACTAAT ACGATTGCGA 2160 ATTAAGCACC ACTTTCTTGG TAATTGGCAA GAAGACTGCT TACAATGTAC CTCTATAAAC 2220 CTCCAATGCC ATAACGGTGC TCTCACTCTG GCTGCTGTCT ATTGCCCACC TCGCTTCACA 2280 ATCTCTGAGA AAAAATTCAC AACACTTTTT GACTCGTTTG GTGAAAGGTT TGTTGCAGCT 2340 GGCGATTGGA ATGCCAAGCA CATGTACTGG GGATCTCGGA TAGCGAACCC TAAAGGAAAA 2400 CAGCTGTACA ACGCAATCAT CAAACCGCAA CACAAACTTA ACTATGTTTC CCCGGGAGCG 2460 CCTACATACT GGCCAGCAGA TCCCATGAAG CTTCCAGATT TGATTGACTT TGCCATCACA 2520 AAAAGGATAT CCCAATCAAT GATTACAGCT GAGGCACTTC CAGAACTCTC ATCTGATCAC 2580 TGTCCGGTAA TCTTTCATCT TTTGCACCAA CTACAACACA TCGAGCGACC GTGTAGGCTT 2640 ACAAGCAATA GGACCAACTG GGCGAGGTAC AAAAAATATG TTTGTTCCCA TACCGGCTTT 2700 TCTAGCCCCC TGGAAACCGA GCAAGACATT GATCAGTTTG CTGGTGATAT AGAATCAATA 2760 CTTGTCGCAG CAGCAAAAGC GTCAACTCCA CAAGACAACC ATGTATGCAG CACAAAGTTT 2820 AACAAGACAA GTAGAGATAT TGAACTGCTT GTGCTTGAAA AACGACGACA TCGAAGGGAG 2880 TGGCAGGAGC ACAGATCACC TGGCGCAAAG AGTAAATTAA AAGCAGCTTC TCGCAGACTT 2940 ACCAAAGCGC TTAAGAAAGA AGAAGCGGAC GCACAACTAA GATACATCGA GAATCTCACG 3000 CCCACAGGCA CAAAGAACTC GTTGTGGAAA GACCACAGGA ACCTACAGGC ACCAACAGAA 3060 ACGGTTGTAC CTCTCCGTAA CTCTACCGGA AACTGGTCTC GCAGTGACAC GGACAGAGCA 3120 AGAGTCTTCG CTGATCATCT TAAAAAAGTA TTTCAACCAA ATCCAGCCAC TAACTCATTT 3180 ACTCTCCCGC CTTTAACTGC ATCTGACTTA GCACCACAAG ATCCTGTAGA ATTTCGGCCA 3240 AGCGAGATAA CGAAAGTCAT TAAAGAGCAA CTGAAAACTG GAAAATCGCC TGGCTACGAC 3300 TTAATAACCC CGAAAATGAT CATCGAGCTC CCAATGTGTG CAGTTCTGCG AATCTGCTTG 3360 CTCTTCAACG CAATTACTAA AATTGGATAC TTCCCTCAAA AGTGGAAGAA ATCAATTATA 3420 ATCATGATCC CTAAGCCTGG GAAAGACAAA ACACAGCCAT CATCATACAG GCCAATAAGC 3480 TTACTTACTT GTCTATCCAA GTTATTTGAA AAAGTGCTGC TGCTACGCAT CAGCCCCCAC 3540 CTCAAAACAC ACAACACACT CCCATCGCAC CAATTTGGGT TTCGGGCAAA ACATGGAGCC 3600 ATTGAACAGG TTAACCGCAT CACAACAGAA ATTCGCACTG CCTTTGAACA CCGTGAATAC 3660 TGCACAGCTC TCTTCTTAGA CGTTGCACAA GCATTCGATA GAGTATGGCT GGATGGACTT 3720 ATGTTTAAAA TAATCAAACT GCTGCCTCAA AACATACATA AGCTTCTAAA GTCTTACCTA 3780 TATAAAAGAG TGTTCGCGGT CAGATGCAGT TCAAGCACTT CAAGCGATTG TACTATAGAA 3840 GCCGGAGTGC CTCAAGGAAG TGTTCTAGGC CCAATTCTAT ACACCCTATA CACGGCGGAC 3900 ATCCCAACAA ACTACCAGCT AACGATATCC ACATTCGCTG ATGACACTGC AATACTAAGT 3960 CGATCCAGAT GCCCAGTAAA AGCTACGATG CAACTAGCAC GCCATCTAAC TTGTGTAGAA 4020 CATTGGCTTG CAAATTGGCG CATACGAGTA AACGAAGGAA AGTGCAAACA GGTAACATTT 4080 ACCCTAAACA AACAAACCTG CCCGCCCTTG GTAATGAACC ACACGCGCAT CCCACAAGCC 4140 GATAACGTAA CATACTTAGG TATTCATCTG GACAAGCGGC TCACCTGGCG GAAACACATA 4200 GAGGCCAAGA CGACGCACCT CAGACTAAAA GCAAGGGATC TACACTGGCT TATAAACGTT 4260 CGCTCTCCCC TAAGCTTGGA GTACAAAGTC CTCTTATATA ACTCCGTTTT AAAACCCATA 4320 TGGACCTATG GCTCCGAGCT ATGGGGCAAC GCATCGAGAA GCAACATCGA CATAGTACAG 4380 CGAGCACAGT CTAGGATTCT GAGAATCATT ACTGGAGCTC CATGGTATCT TCGGAACGAG 4440 AATATACACA GAGACTTACA AATAAAACTT GTCATTGAGA CAATAGCAGA GAAGAAAGTA 4500 AAATACAATG AAAAACTAGC CTCACATCCA AATCCTCTTG CAAGAAAACT TATTCGAGTA 4560 TGCAGCCAAA GCCGACTGCA CCGGAACGAC CAACCAGCCC AGCGTTAAAT TTGTTAGGCC 4620 ACACAGCACG AATCATTTCA TAAAATGATA TTTAGTTAGT TTAGAATAAG ATTTGAAAAC 4680 TTATTGTTAG TCTCTTAAGT AAAAGGGAAG ATTCAATAAA TAAGCAAAAC ATGAAAAAAA 4740 // ID IVK standard; DNA; INV; 5402 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0043055; Ivk. XX SY synonym: You XX FT source nnnnnnnn:1..5402 FT SO_feature CDS ; SO:0000316:192..1511 FT SO_feature CDS ; SO:0000316:1607..5290 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 5402 BP; 1780 A; 1425 C; 827 G; 1370 T; 0 other; CAGTCACCGA CTATCTCTCT TCGAGTGCGG ACGTCTGTGT CTTTCCAATT GTGTGCGCTT 60 TGCTTATCGC GTAATTACCA TCGCCCAGCT TATCGCCGTT ACATTGATCT TTATCATCTT 120 ATCTTCTTGA TCTTCTTATC AACGCGTACG TGCAAATCCA AAAACATTCA ATATCCATAA 180 ATTTATTAAC CATGGCCCCC GGGCCACTCA ATTTGGGGGA TAATCGCTTT GCCAGTCTCA 240 GCGTTAGCCC CAAAGCTAAG AAAAAAAGAC CCTTTCAAAA ATTACAAGTT GATTTCCCAG 300 ACCTCCCTGA AACGCAGTGC AAAAATCCGA GATATTTAGT GGTCTCATCA AAAAACTCTC 360 CGAAAACCAT ATCCGACTAC AACTGCTTTG CTGTCCATAG AGCCCTTAAG TATATCAGCA 420 AAGACATTAC ATCCATCTCT AATCTTCGCG ATGGTAACCT TCTCTTGCTG GTTAACTCCA 480 GGGAAGTAAG TGACAAATTT ATCAAAGTGA CCTCTTTACC TGGCTTATGC GATATCGAGT 540 GCAAGCTCCA CAACACTCTT AACTTTGTCA AAGGCACCAT TTACGCTCCC TGTCTTATTA 600 ACATTCCAGA AAACGAAATT GTTGAGGAAC TAAAATCTCA AAATGTACAC AGCGTATTTA 660 AATTCACCAA AATGATCGAC GGCACCCCCA AACCATTTGG AAAAATTCTC GTAACGTTTG 720 ACCGTTTCAC TCTTCCCAGT AAACTAACGG TTTCCTGGCA CACAGTTAAG GTGTCAGAAT 780 ATATACCCAA CCCGATGAGA TGTAAGTCTT GTCAATTATT GGGCCACACA TCAAAGCACT 840 GCAAAAACCC ACCCGCCTGT GTATCATGCA ATCTCGCTCC CCACCTTCCC GTGCCATGCA 900 CTCGCATTTT CTGTGCAAAC TGCACCGGCG AACACCCAGC CTCCTCACCC GAATGCCCAC 960 AATACCAAAC ACAAAAACAA CTGCTCCACA TCAAAACAAG CAAAAAATGC AGTTTCCATG 1020 AAGCTCGCAC TATACTAAAA CAACAACAAA ACACAACAAA TAACCCCACG CTTACCTACT 1080 CGACCGTCGC AAACCAAAGC ACAGCATCCG TGATGCTTGT CAAAAAATCC CGAAACCCTA 1140 ACACGATTAC AAACACCGAC TTGCCCACCA GCTCGACAAA AGTACCAGCT CCGCTCTCAC 1200 CAACGTATCC CACTTCTCGG ACGAACTCTT TGAATCTCGA AGAGAATCTT ACTTCCTCCC 1260 CAAGCACCTC TGCCAAAACT GATTTCTCTT ACTTAAGCCC AACAAGTAAA CTGAAAACAT 1320 GTATGGAACG ATCTCGCTCC CTTAGGGCGG AAGCAAATGC TCTATTAGGA CAAACCTACA 1380 ACGACGACAA CAATACATTA ATGACCTCAA GATCCAACTC AAACGAATCA ATTGCTTCAC 1440 AAACCTCAAA CCTACAAACA AAAATCAACA CAACAGACTC AGATGACACT ATGGACGATC 1500 CCGACTCCTA GTCCTGATCC TCTCACCCTT ACCTTCATAT ACACACACCA CAAATTAATA 1560 ACAGTCCAAC AAAAAACAAT AAATAAAAAA AAAAAAATTT TTAATCATGA CAATTACTAT 1620 ATTACAATGG AACATCCACG GCATTTTTAA TAATTACAAC GAACTGACAC TACTCATAAA 1680 AGACCATGCA CCCGATATTG TATTTTTGCA GGAAACCAAC CTTCCATGTA ATTCTACAAA 1740 TTTTATTTGC CCCAAAGAAT ATAGTGGTTA CTTCCACAAT TTTTCTTATA ACACCTCAGC 1800 CAAACAAGGC ATTGGTGTTC TTATTAAAAG GAATGTCCCA CATACATACC GTAACATTAA 1860 CTCCAGCATA CTCTGCTCCG CACTGCAGCT TAAATTTGAA CAGGTCATTA ATATTGTTAA 1920 TGCGTACATC CCACCAAGTC AAATCTTTTC ATCCTCTGAC ATATCAGAAA TTCTACAAAA 1980 TCTTAACGGA TCCACAATAC TCCTGGGTGA TCTTAACTCA TGGAGTCCTT TATGGGGTTC 2040 GCCTCGCACA AATACAAGGG GTAAAAAAAT TGAAAGCGTA ATTCTTGAAA ACAGCCTAAT 2100 TGTTCTTAAC GACGGTTCCC CGACTCACCT TTCAACTCAC AACACGTTCA CTCACATCGA 2160 TATCTCACTG ATATCTCCAC AGATAGCCCA TATGTGCAGC TGGTCTATAT CAGATAATCT 2220 CCATGGAAGT GACCACTTTC CCATAACCAT CCACATAAAC ACACCTACCC GTCCCGATAA 2280 CACTTTACCC CTGCCTAAAT ACAAGACAGA CCAAGCGAAC TGGAAACGCT TTAACGAAAG 2340 TTGCGAAAAA TCAGCAGCAT ATTGGACGGT CGGTTGCCTA AATCAGCAAG TCGCACAAAT 2400 GACGAAAGTC ATACGAGCAG CTGCAAACTG TAGCATTCCC CAAACGAAAA GAGTGATCCA 2460 CAAAGCCAAA GTGCCATGGT GGAACGCTAA CCTTCAACAG CTTAGGGACC AAAAGCAACG 2520 CTTGTTCTCG TCCTACAAAG CCAATACGAA TGATACAAAC CTTATCCGAT ACAAAAAAGC 2580 AAACGCGCTT TTCAAAAAAG CTGTACTTTC GGCCAAGCGT AACTCCCTTG AAAAATTTAC 2640 TTCCAAAATC TCCCCAGTAT CCTCCACAAA AAAGGTCTGG TCAGATATAA AACGCCTAGC 2700 CGGCATCCCT CCCACTCCGT TCAAATATAT CAAATCCAAC TCGGGTACTC TAACTGGATC 2760 ATTTGATATC GCCGAAGAGT TTGCCTTATC TTGGTCTAAA TACTCTTCTG ACCAAAATTT 2820 TTCAGCAGAA TATATACGAG TAAAAAATCG GTATCTTTTA GAGCCCTATG CCATCGACTC 2880 ACTTTCCCCA TCGGCTACAT CCTTAGATTC CAATTTCACA TTGCTCGAAA TAGAAAATTC 2940 AGTAGCAAAA GCAAAAGGGA AAAGCCCTGG GGCTGACAGA GTATCATACC CTATGCTCAA 3000 AAACCTATCT CCCCACCTAA AAACAAAACT TTTAGACATT TTTAATCAAA TATTAACCAC 3060 TGGAAAATAC CCACATACAT GGAGATCGGC TATCATTATC CCTATATCAA AACCTAACAA 3120 ACCCCCTTCC GATATTAACA GCTACCGTCC AATCTCTCTG TTATCCTGTC TGGGAAAAAC 3180 ACTAGAAAAA ATAATAGCAC AAAGACTTAC CTGGTTCATT AAACGCCACA ACCTAATTTC 3240 CCATAATCAA GTTGCCTTTA AAAGAAATCA TAGCACGATG GACGCGTTAT TGCGTATCCA 3300 ACACTTCGCG TCGAACGCTC TTTCGACCAA AAATCACGTC TCTATCTTGG CGACGGATTT 3360 TGAAAGAGCC TTCGATCGCG TGGGGATTCA CGCCGTACTT TGCAGACTCG AGCGCTGGGG 3420 CATTGGTCCA AGGCTTTATA ACCTTATTAA AGCCTTCATG ACCAATCGGT CCTTCAGGGT 3480 TCGAATAAAC AATGTCACAT CGAACTCCCA CATTTTACAC AATGGAATCC CGCAGGGTTC 3540 GCCACTTTCG GTGGTTTTAT TTATGATAGC TATTGAAGAT ATAAATGACA TTGTAACTCG 3600 GCATAAAGAT ATTTACATCT CACTATACGC AGACGACGCA ATAATCTTTA CAAAAATAAA 3660 AAATATTAAC ACAGTCAGAG AAAAATTCTT AGAAATATTG CAAGAAATTA ACTCGTGGGG 3720 AGCAACCTCT GGGGCCTCTC TTGCCATTGA AAAATGCCAA ACTTTACATA TCTGTCGGAA 3780 ACAACGTTGC AACCTTTCCG ACATTGTCTT TAACAGCCGC ACAATAAAAG ATGTAAACTT 3840 TTTAAAAATC TTGGGGATAA CCTTCGACTC CAAACTACTT TTTAAACAGC ACTGTCAGAC 3900 TCTAAGAAAA CAACTGGAAA CTAGATTTAA CATTATTAAA TTTCTATCAT CTAAATATTC 3960 TTACATACAT ATTAAGACTC TCATAGATAT TACGCGCGCA TTAATGCTAT CTAAAATTGA 4020 CTACGGACTG CCAATCTTCG GTTGGTGTGC TAAATCACAC TTAAAAAAGC TACAAGCCCC 4080 ATATCACGGA GCGGTCCGTC GCGCCATTCA CGCATTTCCC ACATCTCCAG TAGCGTGCAC 4140 ATTGGCAGAA TCGGGTCTCC CGAGTATCCA ATCACGCGTA GAAGAAACTA CATTGATGCT 4200 TATCCCGAAG CTGTACACCA CGTCCAACTG CCTGCTAACC AAAGACTTCG GAGCCATATT 4260 TAAACAAAAG CGGAAATTCA AGTGCATATC CACCCTAAGA CGTTGCGCCA ACTACATCAA 4320 GCTACTTGAC CTTCCCCTGC CTAAGCCCAG GAGGCCCTTC AAATCGCCAG CACTTTGGGG 4380 TTCCAAACAG CCTAACATAA ACCTTCAAAT ATACAATGCT GCCAAAAAGG ATACAGGTCG 4440 CCTAGAATAC CAAAAACGTT TTATGAGCGC ACAAGAAGAT CTTGGTGTGA AAAACTGGAT 4500 CTACACCGAT GGTTCTAAAG TCACCGGCGC AACTACTTTT GCGGTTGTTG ACTCTAACCG 4560 TAAAATAATT GCAGGAGGTA GGCTTCCGTC CTACAACTCC ATATTTACAG CCGAAGCTTT 4620 CGCCATTCTC AAAGCATGCC AATTCGCTTC CAAAAACGCT GGAAAATCTG TTATCTGCAC 4680 AGACAGCCTC TCCTCTCTTT CCGCTATACG CAACTGGAAC CATAATGACC CCACAACACA 4740 AGAAGTTAGG CACATTCTAA GTTCTCACCC AAAAAAAATC ACCTTACTCT GGGTTCCCAG 4800 TCATCAAGGT ATTCATGGGA ATGAACTTGC TGACAAAGCC GCCCAAGAGA TGAGGCTTAC 4860 ACCATCAATC CTGTTCACTC CGTTTAACTC CAAGGACCTA AAAAGTCGAA TCAAGTTATA 4920 CCTCAAAGAA AAGAAACTCT CCGAATGGGC ACTCTTCATG CACAGGTACC AGTCTATCAA 4980 TCCGAATTGC ATCATGTTCA AGCCACCAAC GAACGTACAT AAACGGGAAT GCGCGACCTT 5040 TATCCGTCTA CGCATCGGGC ACACCCAATC CACACATCAA CACTTACTGA TGAGATCGGC 5100 GCGACCAACA TGCCAACTAT GCGGGGACGA GCTCACCGTT GACCATATTC TCAACGCCTG 5160 CAGTCAATTG CACTCAATTA GATCTCACTT ATTTGGTACA CATAGTCTCT CGAATTGTTT 5220 AAGTATCCCA TCCTGTGAAA ATATCTCAAA AATTTACAAA TTTGTCCAAA AAGCTAAGTT 5280 TATTATATAA AGCAATTAAC CCATAAGTCT CGAGTCGAAG GCCCCCGTAG CTAGTACTCA 5340 TTAAATTTAA TTAGGTTTAG TATTGTATAC TATATTTAAT TTTGTTAAAT AAAATAAATA 5400 AA 5402 // ID RT1C standard; DNA; INV; 5443 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063467; Rt1c. XX FT source nnnnnnnn:1..5443 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 5443 BP; 1389 A; 1524 C; 1572 G; 958 T; 0 other; GGGGACGCGC GCGAGTAGCC GTGTGAATCG AGTTGGCGCT CCTGGAAAAC CCCAAAATTT 60 TCCGTGTCGC GGCGGCTTGA TGGAGACAGG AGTGGCTGTC TCTAACACAC AGTGCCGATA 120 TGAGTACTCA CCAGCAGTGG GCAAGCTCCA AGTATCCATG GATCGCCCGC CGGCGCCGAT 180 CCTCAGGGCC ATGGAGCAGG TCAGGGTGAG AGAAGCAGCG AACTTTCTCT GCACTACTTT 240 CTCTTTGTCG CTGTGTTGTT TCTTTTGGCG TCTTGGCTGC CACCAGACTT GCCGGTCACG 300 CCAGCTGATC TCGCAGTCAG CTTGGAAAGC AGTGCTGGTA AACGGCTAAA GCACCGCGGT 360 CTGGCAGCGC TGGTCGGATG CACGAGTCCC CTAGCACAGG TGAGCCGTTT TCTCGGTGAA 420 AGCACTCACT GACAAAAGCT AGCAGCTACG GAAATCGCCT AACGATCGGT TCACCGGTCG 480 TAGCATACAT TTTTGGGCGG CTGACGGTTA TCCTTGGCAA CATCCCATGG AAACCCGGAG 540 ACAGATCCAT AATCGAGCAG GAGAGGCAGA ACAAGAGTGT AAAAAGTCAA GGAGCAGACC 600 AATAGCCAAC CCCCCCGCTT AGGCGGCACA CTTTAAATCC GGGGCTCTGG CGACCAAGCT 660 GGAGCGGTAT ATCTCCCACC GTCGCAACTT TCCCCTGCTC ACCCACCCCA TAAGTCCACG 720 GCGCGTCTGA CCCAAAGTGG GCGGGTACGT GGCCTGCGCC TGTTCTCTTT ACCAGCCGGG 780 CAGCAGCAAA ATAAAAACCC ATTCAGCGTC TGAGCCAAGT TGGCCGGCGG AATGGTCTGC 840 GTCTGCGCCG CATCGGCCGG CAGAATATCC AAAACCTGGC GAAAGGTATA AAGCCGCAAA 900 CCTGGCAGGA GGTATAAAAT GCAACACCTA ACTGGAATGT CTGTAAACAA CTTTATTACT 960 CCAGGTGGCA GCCACACATG TGGAAATCCA CCCTCGGCTA ATCCCGAGGC TAGGGCACGG 1020 ATTCTGGAGG CCCTACCAAA CATTCTCCCC AAACCCGCCC CGACGGCGGC ACACTTGGCC 1080 GTCGGACTAA GCAATGCAAC AAGTGCACCC ACAACGGACG AAGAAAGCTT GGTGTTTGGG 1140 AAAAGGTCAA AGGTCCTAAG GACCCCACCC CAGAACCCGA GCGATGGCAC CCCCAAGAGG 1200 CCATTAGAAG CGACATCCCC CCTACCAGAG CCCAACCAGC AAAGCGGGTA AAGACCCCCC 1260 AGCTAGAAAT CGAGGAGATG GGAGCAATCC TGGACGACCT TCTGACGAAG GTCAACCACA 1320 ACGGGGTGAG GAGCGTCAAT CAGGCAATGA AAAACTCATT CGCCAGATTG AAGGAGCTCC 1380 AACTAAAGCT GCGCACAAGG CTGCCGGAGG CGGAGAATTC GCATGTCGGA CGCACCGCAA 1440 GAGCCGATGC TAGCCAACAG ACCACCCCTA AGCGCCCCTC CAGCCACGAA GAACCGAGTA 1500 AGGGTCCGCG AAGAAAGCCC AGCGATCCGA CCACCAGAAG GCAGCGGGAC CCAAGCGCCT 1560 ACCAAGGGCT CCGCCCCAAG CCACTGCAGC GGGAAACAAG ACGCAACCAA GATTGCAGCA 1620 TCCGCCCGGA CCGCGCAGGA AGAGACCGCC AAGGGAGAGG CCAGATGCGC TGGTTATCAC 1680 CCCCTCAGCT GGCTTACCGT ACAGCGAAGT GCTGTCGCTG GTCACAAGAG GGCAGGACGC 1740 CAGGCTCAGG GCCATCGGGG AGAACGTATC AAGGGTTAAG AGGACGGCCA AAGGCGAACT 1800 GCTCCTCGAG CTACGTGCCT CTGCCCAAGA CTTGACGCAG AAGCTCAAGA TGGACATGGG 1860 AGCGGTGCTA GGAGACCGCG CCAGCCTTCG CGCGTTAACT CAATCCAAAG TATTTTTGAT 1920 TCGCGACCTC GACGAGCTTA CTACTGAAGA CGAGCTGAGG AGGGTCCTGG AGTCCCGGCA 1980 TAGATTCCAG CAGCAGTGGT GGCTATCAAG AGCCTCCGTC AAACGCAGTA TGGAGGGAAG 2040 TCTGCTATAA TAGCAGTTCC AGCCAATCTG GCGGACCCGC TGATCAAGCG TGGCAAGCTG 2100 AGGGTAGGAT GGTCCCAATG TCTGATCAAG GAACTGGAGC CACGCCAAAG ATGCTTCAAA 2160 TGTCTGGAGG AAGGCCACAT AGCGGCCCAT TGTAGAAGCG CCGTCGACAG AAGCCAGTGC 2220 TGCTTCAGAT GCGGGTCCGC GGGACACAAG GCCGCAGAGT GTCCCAACGA GGCTAAGTGC 2280 TTTTTGTGCG CAAGCAGAGG AAGCCAAGCG ACCAACCACC AAGCAGGCAC CCGGAAGTGC 2340 CCATTGGCGG GCAAAGGAGC ACCAAAGGCA CCACAATGAT GCGTTTGATT CAGCTAAACC 2400 TGAATCACTG CACGGCAGCC CAAGACCTGC TAGTGCAGAC GGTGCGCGAA CGCAGAGTGG 2460 AGCTTGCGTT ACTTAGCGAG CCCTACCGGA CGGCGGACAG CCCAGACTGG GCTTTCGACC 2520 GCGCCAAGAA AGCAGCAATC TGGAGGTGCA GCAGAGAAGC CCAACAATTA ACCGATGTTT 2580 TTTCGGACAT CGGGTTTGTT AGGGCAAAGG TGGGCAGATG GTGGGTGTAC AGCCGGATGC 2640 TAGAGGCCGC ACCCAGGTTC TCATAGCTGG CGACTTCAAC GCATGGTCAG AGAGCTGGGG 2700 CAGTTCAACC ACCAACGCGA GAGGCAGGAT GGTGCTCGAG GCATTCGCGA CGCTGGACCT 2760 GGCTCTATTA AACCAAGGGA ACCGGCACAC GTTCAGGCGT GCCGGACTGG GCTCTGTGGT 2820 GGACCTCACC TTCACTAGCG GCTCGTCGTT CAGGCTAACG AGGTGGAGAC TCAGCGAGGA 2880 ATACACTGGC AGTGACCACT TGGCCATCAT TTGTGATCTG GGATGCCCTT CCTCGACCCA 2940 AGCCCAGCTA GCAGCCCAAG CCAGGATAAA ATACAAAACG GACACCCTGG ACACGCAGTT 3000 ATTCCGAGAG CAGTTCCTAC CCTCGGTGAG TGGAGAAGGA GCTGAGCTGA CGGCAGTGGC 3060 GCTGATGAGG CAGCTGAAGA CCGCGTGCGA CGACAGCATG CAAACAAGCA GGACACATAG 3120 CCAACAAAGA GCCCCTGTCT ACTGGTGGAA CCAGGAGATA GAGACGGCTC GCCGAGAATG 3180 CCTCTCCGCC AGACGTCGCT ATCAACGCGC TAGAGGTGCG GAGTCCTTTG CCGAACGCCA 3240 ATCCGAGTAT AGAGCCCGCA GGAAAGCACT CAAGCTAGCC ATACGGGAGA GCAAGCGGAA 3300 ATGCTTCCTC GACCTATGCG ATTCTGCTGA CAGCGACCCA TGGGGAAGTG CCTACAAGGT 3360 GGTGGTCAAG CAGGCATATA CGAGGACTCC CAAGCTACTG GACCCAGCGA TGCTCCGCAG 3420 TGTAGCGGAA CATCTGTTTC CTTTGATGGA CAGGTTACGC CCCGCCGACC CAGCCACAGG 3480 GGACCACGTC GAAGCCGACG CCACGGTCAG CAGTGAGGAG ATCCTGGAGC TGGCGAAACT 3540 GCTGAAGGAC GGCAAGGCCC CCGGGCCCGA CGGCATTCCG ATCAGGGCGC TTCGGCTCTC 3600 TCTACCTCCA GCCAACTCGT TTGCGAAGGC ATTCACCAAG TGCCTGACGG AAGGAGTCTT 3660 CCCAAGTTGC TGGAAGGTAC AAAAGTGTTG CTCCTCCCAA AACCAGGGAA GCCACCCGAG 3720 GAGCCTATAT CGTTCCGGCC GATATGCCTC ATCGATGGAA CTGGCAAGCT CCTGGAGAAA 3780 CTGGTGTGCA TTCGGCTAGA GAGGGCTATC GCAGACGCGG GTGACCTCTC ACGGTCCCAG 3840 TTTGGCTTCA GGAAAGCGCG GTCCACCGTC GACGCCGTCA ACAGAGTGGT CGAAGTAGCG 3900 GCCCAAGCAA TCGAGGGCAC CAGATGGAAG GGGGGTAGCA AAGAGTACTG CCTCATGGTC 3960 ACACTAGACA TCAGGAACGC CTTCAACACA GCGAGGTGGG ACCGGATCCT AGAAGCATTA 4020 ATTGGCTTCG GCGTCCCAGC CTACCTAGTC AGGATCATTC GCAGCTACTT CTCGGATCGG 4080 GTCCTACTGT GCGAGTCGTC GGAAGGAGTC TTCAGGCACC AGGTCACCGG TGGAGTCCCG 4140 CAAGGCTCGG TTCTCGGCCC GCTTCTGTGG AACACCATGT ATGACGGGAT ACTTCGCCTA 4200 CCGCTGGTTG GGAGGAGCGA GATCGTAGGC TTCGCGGACG ACGTGGCTTT GCTCGTTGTT 4260 GACAAGCATC CCGGCAAGGC TGAGGAAAAG TGTAACCAAA ACATCAGGGC CATTGAACAA 4320 TGGCTCAGCT CAATGGGGCT AGAGCTGGCG CCGGAGAAGA CGGAGGCTGT CCTGATTAGC 4380 TCCAGGAAGG CAGTGGAGAC GGCCACTGTC GAAGTAGGCA GTACCTCGGT GTCCTCCTCT 4440 CGGGCAATTA AGTACCTGGG AGTCATGATA GACACCAGGC TTTCGTTCCG TGAACACCTA 4500 GCCTACGCTA GCTCGAAAGC AGCAGGAGTA AATCGGGCGT TATCAACCAT AATGTTGAAC 4560 ACCCGAGGCC CAAAACAGGC GAGTAGACGG CTGCTGACGA GCGTCACCCG CGCGACCATG 4620 CTATATGCCG CCCCGGTGTG GGCAAAAGCG CTTGAAACAG AAAGCTACGC CCAGGGCCTA 4680 AATGCTACCC ATAGACTGTC GGCACTACGT ATCTGCTGCG CCTTCAGAAC AGTGTCCAAC 4740 GAGGCGGCAT TGGTGATCGC AGGCATACCG CCTCTAGACC TATTAGCGCA GGAAAACAAA 4800 GTGGTCTTCA ACCAAACCCA CGGTAGAAGC CTCTCCCCCT GCGCCAAGAG GTCAATACGA 4860 GCAGAAGCAA GGCAGCACAC CCTGGAGACA TGGCAGCACA GGTGGAACAA CGAGCTTAAG 4920 GGTCGCTGGA CGCATCAGCT GATTCCTAAT CTCAAGCCGT GGATAGAACG GAAGCATGGC 4980 GAAATAACGT TCCATCTAAC GCAGCTGGTC AGTGGGCACG GTTGCTTTCG CAGCTATCTG 5040 AAGCGCTTTG GACATGAGGA GGCGGATGAT TGCCCATGGT GCGGAAGTGG ACGGAGCGAA 5100 ACCGCAGAGC ACGTTCTCTT CTCGTGCGAT AAATATGCAA GGGAACGGAG CTCTCTGGAA 5160 ACCGTACTTG GCAGCAGACT GAATGCGGAT AACCTAGTCC CATTCATGCT GCAGGGTGAG 5220 GCGGAATGGC AAGCGGTCAA CAGCTTCGCC GCAGCGATAA CTACCGAGCT CAGAAGAGCC 5280 GAGAGAGCCA GGAGGGTCCA CGAGTAAGCA AATCTTGATC GTCCTCGCGA AGCAATGCTT 5340 CACGGCCATA CCGCGGGGGG TTCTGGATTT GCCCTCCTTA CTGTTACATT GTTATATTAA 5400 TTAATCTTAA TCTTAAATGT CTAATAAACG AATTAAAAAA AAA 5443 // ID GYPSY4 standard; DNA; INV; 6852 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063433; gypsy4. XX FT source nnnnnnnn:1..6852 FT SO_feature five_prime_LTR ; SO:0000425:1..287 FT SO_feature three_prime_LTR ; SO:0000426:6565..6852 FT SO_feature CDS ; SO:0000316:731..1987 FT SO_feature CDS ; SO:0000316:2241..5073 FT SO_feature CDS ; SO:0000316:5174..6667 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 6852 BP; 2396 A; 1478 C; 1318 G; 1660 T; 0 other; AGTTAACACG CGCAGGGCCA AAATCGCTAA CTCCACGATA AGACCGGACG CGGCCGCAGT 60 AACGTCAGCG CGACTGCAAC TAAGTCGCGA AATATGACTC CTGCATTCCA ACATTCTACT 120 GCCCGGAGCG TGTGAAGCGC AATGTCAGCT TCTGCCGTGA GCGCTGCTTC AGAAGACGGG 180 CTACTTCATA TTAAGCTTAA GTTTTCTGTC TTTAGTTTTA AAACTCATCG AACGCGCATA 240 GTCGCAAATA AACACAATAA CTAAAATTGT TGTCTGTTAA TTGGCGCCCA CGGCACTCTT 300 ATAGAACTCA CAATATTTGT TCACCAGCCA AAGCTGTGAC AAATAAACGA AAAAGTGCGA 360 AGTGAACTGT GACAAACAAA TTACAAGTGC AAAGCTAATT ATAGAACTCA GAATATTTGT 420 TCACCAGCCA AAGCTGTGAC AAATAAACGA AAAAGTGCAA AGTGAAGTAT TTTTTCCGAC 480 CAAGAACACA GCAAACAAAG CGAAAACAAG AATAAAAATA AAAAAGCTAA ACAAACGGTT 540 AAAACAACAT ACAAATTAAA AAGAAAAAAA CGGTTAAAAC AACATACAAG TTAAAAAAAT 600 AGAACCCCAA CACGAAGTGT CGAGAGAGTA AAGAAAAAAT CAACAAATCA ACAAGTGCAC 660 GCATCGGCAG TTTAAGCAGT TACCCAGAAA TGAAATACAT GTAAGTGATT TGTTCATTGT 720 GCCATTATAA ATGCAGAAAA GATTTAGCGA CAGTAGTTTA GAAGACTATA TCCCCTCTTA 780 CCGAGAATAC AAAAACCGCA CAATGACGCA ACAAGAATCA GTACAGATTG ACCTTCACAT 840 ACGAAATTGT CTGGCAGAGC AAGCACAAAT ATTTGACCAA AAAATAGCAA CTTTAACAGA 900 ACAACTGAAC GCTTTAAAGG CTACTGCGCC AGAGGTGACA GCTTATAAGC CTATAGAAAT 960 CATACCAGAA GTTAGATGTG AAGAGCCCCT AGACATAGTT AAATCTATTC CTGAGTTCGA 1020 CGGCAAACAA GAACATTATA TCTCTTGGCG CCAAGCGGCC AACGCCGCAT ATAAAGTCTT 1080 CGAAGCTTAT GACGGAAGTT CTAGGCATTA TCAGGCTGTT GCCATAATAC GCAACAAGGT 1140 TAGGGGATCT GCAGATGCGA TCTTAGCATC ATTTAATACA GTTTTAAATT TTCACGCGAT 1200 AATTGCTAGA CTGGATTTCA CATATTCAGA CAAAACTCCG GTTCACGTAA TTCAACAAGA 1260 TTTAAGTACT TTAAGGCAGG GTGACCAGCC CCTTTTGAAA TATTACGATG AAATAGAAAG 1320 AAAGCTGACG CTTTTAACTA ACAAAACTCT CATGACCCAC GACGCAGCTT CGGCAGCTGT 1380 GTTAAATGAC AAATATAGAA ACGATGCACT TCACACATTT ATCTCAGGCC TGAAGAAATC 1440 TCTAAAATGG GCAGTTTTTC CAGCCCAACT TCGAGACTTA CAGACTGCCC TGGCATTAGC 1500 ACAAGAGGCA GAATCAAGTA ACGAGAGAAG CATTTTTGCC GCAAACTTTG CTCGGCACAT 1560 AGAGGAAAAA GCGCAAAAAT CTGGGAATCA GAGGTCCCAA GGTTCGCGTC AAAACCCCCA 1620 GCAAGAGAGC GACGCTCCCA CTATCTTTAG GAAAAATCCC CATTACCAGA AAGGTAACGG 1680 TCAAAAGCCG AACACTTCAG GTAGCGCTAG ACAGAGATAT TACCAGAAGC CACAAGAATC 1740 AACTTCCGAA CCCATGGAAG TAGACACGTC ATCACGCTTT AGACAGCCCA CTCAAGCAAC 1800 AAATTTTAAA GGTGGTCATT CAGCGAAAAG TCGCCAATCA TTGAACCACA TGCCCCAAGA 1860 CCAAAAAACC GAATACGAAG AACAGGCAGA ATCCGAGGCA AATGCGGCTG AAGACGATTT 1920 GTCTGAGGTC GAAAGCTGTA ATTTTTTAGG GGTCACTCCC TGCTTCCGTA CATCACACGT 1980 ACAGTAGCGG GGCAGACCAT AAAACTTCTT ATCGATACAG GGACTTCGAA AAATTACATT 2040 AAGCCACTTC TAGGGCTGCC ACACTTTGTG CCAGTCGACA AGCCATTCCA GGTAAAATCA 2100 CTACATGGTC ATACAAGAAT TGAACGGAAG TGCCAAATTC AACTTTTTAA AACTAAAACA 2160 TATTTTTTTA TTTTAGAAAA TCTAAGTGAT TTTCACGGCA TTATTGGCCT AGACTTACTA 2220 AAAAAGATTA ACGCCAATGT TGATTTCGAA AAAAATTGTA TCACTTATGA TCGCGGTTCA 2280 GAGCCAATCA AGTTTACAAA ATGCCAGAAC GTAAACTTCA TTAAAATAGA CGATGGCGAT 2340 GTTCCTGAAG TCATTAAAGA TGACTTTAAC AAAATGATTA ATAAAAGAAC AGGGTCATTT 2400 GCAGATGTTA ATGAGTCCCT GCCCTACAAT ATCAATACAG TTGCCACAAT ACGAACTGAC 2460 GGGGAGCCAG TATATTCTAA ACTGTACCCT TACCCAATGG GCGTATCCGA GTTTGTCAAC 2520 GCAGAGGTTA AACAACTTCT AGCGAATGGC ATAATAAGAC CGTCCAAATC ACCTTACAAC 2580 AACCCGATCT GGGTTGTCGA TAAAAAGGGG GTAGACCAAT CGGGCCACAA GCTAAAGCGT 2640 ATGGTCATCG ATTTTCGTAA ACTTAACCAA AAAACGACTG ATGATAAGTA CCCCATCCCA 2700 AGTATTTCAA CCATATTATC AAATATGGGC GAGGCACAGT ACTTTACTAC TCTTGACCTC 2760 AAATCGGGAT TTCATCAAAT CGAATTAGCG GAAAGGGACC GGGAAAAAAC CGCTTTCTCC 2820 GTAAACAACG GAAAATATGA ATTCTGTAGA CTCCCTTTTG GTTTGAAAAA TGCGCCAAGC 2880 ATATTTCAAA GAGCCATTGA TGATGTCTTA AGAGAGCAGA TAGGGAAAAC CTGTTATGTC 2940 TACGTCGACG ACGTAATTAT TTTCTCTAAC AACAAAGAAG ACCACGTTAA ACACATCGAT 3000 TGGGTGTTAA ACAGTCTTCA GACGGCTGGC ATGAGGGTGT CTCAAGAAAA ATCCAAGTTC 3060 TTCAAAAAGA GCGTGGAATA TCTGGGATTC ACGGTATCCC GCGGCGGGAT TCAAACTTCG 3120 CCAAGTAAGG TTCAGGCCAT AAAAGACTTC AAACCGCCAA AAACCTTGTT TAGTCTCAGA 3180 TCATTTTTGG GGCTAGCAAG CTATTATAGG TGCTTTAGAA AAGGCTTTGC CAACATCGCA 3240 CGACCCCTAA CGGACGTCTT AAAAGGCGAA AATGGTAAAA TAAGTGCCAA CAATTCAAGG 3300 AAAGTAAACC TTGAATTAAC ACAAGATCAA CTAAAGGCTT TTAATAGACT AAAAGACGTA 3360 TTAGCCTCTG AGGACGTTAT TTTAGCATAC CCAAACTTTA AAAAACCATT CGATTTAACT 3420 ACGGATGCCT CAGGGCACGG TCTTGGAGCT GTGTTGTCTC AGGACGGACG TCCAATTACC 3480 TTGATTTCTC GAACATTACG AGACAACGAA GTAAACTTTG CAACAAATGA GAGGGAACTC 3540 TTGGCCATTG TTTGGGCCCT AAAAAATCTT CGAAATTATT TATATGGCGT AAAAAATTTA 3600 AATATCTTCA CGGACCATCA ACCGTTGACT TTTGCCGTGT CGGATAGGAA CCCAAATGCG 3660 AAGATAAAAC GCTGGAAAGC TTTTATTGAC GAGCACAACG CAAATATTTT TTACAAGCCT 3720 GGCAAAGAGA ATTTCGTGGC AGACGCCCTA TCTCGCCAGA ATGTAAATGT ACTCGAAGAT 3780 TGCCCAAACT CTGATATTGC CACAATACAT AGCGAAGAAT CTCTTACCTA TACTATCGAA 3840 ACGACCGAAA AGCCTGTAAA TTGCTTTAGG AACCAGATCG TAATTGAAGA ATCTAACGCC 3900 CCATCAGTTA GATCTACTAT TTTGTTCAGG GAAAAAACCA GGCACGTTAT AAGATTTGTT 3960 GACCGTAACA CACTTTTGCA AACAGTACAA GACGTCGTAA ATGCCAAGGT AGTTAACGCA 4020 ATACATTGCG ACCTACCCAT TTTGGCATTC ATACAACACA GTCTAGTAAA AGCGTTTCCG 4080 TCAACCACAT TCCGACACTC CAAAAATATC GTGATAGACA TTGTCGATAA AACCGAACAG 4140 AGAGAAATAA TTATTGCAGA GCATAACCGA GCGCACCGAG CAGCGCAGGA AAATGTGAAG 4200 CAAATTCTCC GGGACTACTT TTTCCCAAAG ATGAATTCGC TGGCAACAGA GATTGTCGTA 4260 AACTGTAAGG TTTGTTCAAT GGGTAAATAT AATCGACATC CGGTCAAGCA AGCGATGGGT 4320 GAGACCCCAA TCCCATCATA TGTTGGGGAA ATATTGCATG TTGACATTTT TAGTACCGAC 4380 AAAACATTTT TTTTAACATG CGTCGATAAA CTCTCAAAGT TTGCGGTCGT CCAATATATT 4440 ACTTCACGCG CAATCGTGGA CGTAAAGGCA CCGATTTTAC AGCTGGTAAA CCTGTTTCCC 4500 AAAATAAGGA TCATTTATTG CGACAACGAA AAATCTTTAA ATTCCGAAAC AATAAAAAAT 4560 ATACTAAACA ATTTTGATAT TCAGATTTGT AATGCCCCAC CACTCCACAG CACCTCTAAT 4620 GGTCAAGTGG AAAGATTCCA CAGTACCCTT ACAGAAATAG CCCGTTGTCT AAAAATAGAT 4680 AAACACCTTA ACGAAACCAA GGACATAATC CTTTTGGCGA CCATTGAGTA TAATAGGTCA 4740 ATTCACTCAG TAACAAACAG AAAACCGACA GAATTAATTT GCTCCGCCCC TTCGGATTTA 4800 TTAACAGAAA CTAGGGATAA AATTGTACTT GCGCAAGAAA AACAACTTGG GTACATGAAC 4860 CGAGATAGAA TCCATAAAAA GTATAAAGTA GGCGAAAAAG TATGGCTAAA ATCAAACAAA 4920 CGATTGGGTA ATAAACTGTC ACCACTGTGC ACGGAAGAAG CCATTGATGC TGACTTAGGT 4980 ACGACGGTTC TAATGAAAGG AAGGGGGTGG TCCATAAGGA CAACCTTAAG TAAACCCAAG 5040 AAAAAAAAAT TAAAAACAAA TATAAATAAA TAAAAAATTT CATCGCTTAT AGTACTTTTA 5100 ATTTTAATAT TTTTATTACA GGTTAGGATA TTTGTGTGTG CTGGCTTCCG CAATCACCTT 5160 GACAATAACG ACTATGAAAC TTAATGACTA TTCACACGCA GACTACATAC CTATAATAGA 5220 CGGTGACTTA ACGATATGGG ACAGATACGG ATATCTCGGA CACACTTCTA ACATAACTTC 5280 TTATGAAACT TATGTTGAAG ATACCAGACG GCAACTGAAC TACTTTGGAA AAGACCATAT 5340 GCGAAATTTG ATAACTACTG ACTTGGGGCA CATAGAATCA CTAATAGCTA CGATTAGGGT 5400 ACACCATAGA CATGCTCGCA GTATAAATCT ACTAGGTACA GCACTCAAGG TAATAGCCGG 5460 AACCCCAGAC TTCGACGACT GGGAACAGAT TAAATTTAAA CAAGGGCAAT TAATAGATTC 5520 AGCAAATAGA CAGATCGAAA TAAATACGAA ACTTCAATCT CGACTAAACG AAATTACAAG 5580 GTCAATGAAT GCAATAAGTA AAACAGACAA CGCGGACTCA GAACACCTAT TTGAAAGTAT 5640 ACTTGCTAAA AATAGAATAA TTATTACAAA CCTAGAAGAC TTGATACTGT CGGTAACTCT 5700 AGCTAAAATA AATTTAATAA GCCCACTTAT TTTGGATAGT GTTGACATTC ATGAGCTGGC 5760 CAATGAACTT CTCACAAATA CGAGCTTAGC AGATATTTTA AGAGTATCTA GCGTTAAAGC 5820 CTTTCAGAAC AACGATTTAT TGTATTTCTT AATAAAATAT CCTAAACCAG AAACAGTTTG 5880 TAAAAAAATT AACGTTTTCC CAGTTCAACA TAATAAGATA ATATTAGACT TTGAGAATAA 5940 TAATGTAGTA GCCGAATGCG GTGCACGAAT CTACGCCGTC AAGGAATGTA ACGCAGCCAT 6000 GAGTACCACC TTCTGCAAGA GATTGCAGAA CCCGACTTGC GCTCAACAAT TGGTGTCTGG 6060 AACCGTCGCA CAGTGCACCA CTCTCCTTGG CCATCTGGAT CCGGTGACAC TTGTTGAAGA 6120 AGGTGTCCTG ATTATCAACG ACGCAACCTT CAGGGTCGAA GATAGCCTAG GCAGCAGTAA 6180 GATGATCTCG GGAACCTACT TAGCCACATA TGACGATCGT ATATCTCTAA ACGGCACCCA 6240 CTACGAAAAC CACCAGGGCG TCCTAAAGAA GAAACCCGCT GCGGCAGTGG CCTCTCAGAT 6300 CAATGTGACT AGCCATCGAG ATCAACTAAG CTTGTCGTTC TTGCACGAGC TGTCGCTGCA 6360 GAACCTCCAA CATATTGGTA GTCTGAAATC TGACATCATT TCGAGACCCA TTTTAAGCAG 6420 CGGCGTCACC ATCGCCGTGG TCCTAATTGT TTATGGGATC ATCCAAAGCA TCCGTTACTG 6480 TCGAGTCAAA AGACGACACC CCGACTCCAG CATCGAGATG ACGATACCTT CCCGAAAAGC 6540 CGAGGACGAC TTTAACCTAA CGCGGGGAGG AGTTAACACG CGCAGGGCCA AAATCGCTAA 6600 CTCCACGATA AGACCGGACG CGGCCGCAGT AACGTCAGCG CGACTGCAAC TAAGTCGCGA 6660 AATATGACTC CTGCATTCCA ACATTCTACT GCCCGGAGCG TGTGAAGCGC AATGTCAGCT 6720 TCTGCCGTGA GCGCTGCTTC AGAAGACGGG CTACTTCATA TTAAGCTTAA GTTTTCTGTC 6780 TTTAGTTTTA AAACTCATCG AACGCGCATA GTCGCAAATA AACACAATAA CTAAAATTGT 6840 TGTCTGTTAA TT 6852 // ID INVADER4 standard; DNA; INV; 3105 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063427; invader4. XX FT source nnnnnnnn:1..3105 FT SO_feature five_prime_LTR ; SO:0000425:1..346 FT SO_feature three_prime_LTR ; SO:0000426:2759..3105 FT SO_feature CDS ; SO:0000316:395..1267 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3105 BP; 962 A; 545 C; 787 G; 811 T; 0 other; TGTCGGAGCG CGACTCTTCG CATTCCAGAA ATTCCTTCTG TGATTTCATA TTCTGATGAA 60 ATTGTAATAT TGCGGTAAAA TTCTGTCGAG CGCTGCGGCA GAGGCACGAA CAGCCTCTGC 120 AGCTGAGTGA GGTCCGATCG AGTGGTCATA AATAGTGTGT TAGAGAGAGA TGACAATGTG 180 GCACACGCCA GTGTGTATAG CCATTAGAGA ATATGATGAA GAAGGGACAT GTAAGAAGAT 240 CCCTTCAGTG AAGTTTGACT GCTGACGTCG ATCGGAACTT GCTGCGCTGA CGCACAAAAT 300 CGCGAAGTGA ATAAATAATA TGGATGAGAC TCCTGTTTCG CCGACATCAG AAGTGGGATC 360 GCCTTCGCCA CAAGAGACCA GATCTGATCA GTATATGGCT GCGTTGGAGG CACAAAACCG 420 TAACCTCATG GAAATAATAA GAACCATGCA CGCACCGAGA GCATCGGCGT CAAATGAAAA 480 GTCCTTGCAC GTCACACTGC CAAAATTCTG CGCTGACAGC GCTGGAGCAG ATCCATCTGC 540 ATGGTGCACC ACCGTGGATT TAATCTTTGC AGATAATGCG CTTGTAGGCA GTGCACTCGT 600 AATAGCGTTA AGCAAAGCAT TAGAAGGCAG TGCATCGCAA TGGCTGTCGC AGATATGCTT 660 CGCTGGCATC ACGTGGCCGC AGTTTAAAGA ACTGTTCATA CAGCGATTCG TAGGAATGGA 720 GACGTCGGCT GCTATTTTGA TGAACGTTTT GAACGGACGT CCAACACCTG GAGAGAGCTT 780 TGCCCAGTAT GGAAGTCGCA TTGTCACCTT GTTGCTGTCT AAGTGGAAGG CCAAGGATTT 840 GGAGGAGATT GCGGTTTCGG TAGCGTTGGC TCACATGGCA CAAATCGATA ATAATTTGTT 900 GCGTTGGGTG TTTACGACTA ATGTGGCTAC GCGCAATGAG CTTCAGCAAC AACTGCAAGC 960 TTACGCCTTC AAGAAACGTA ACAACGAAGA TGATTCTGGC CCAGAAAAAA AGTTAAGAAT 1020 GCAGTTGCAG ACCATGTGCC ATTTTTGTGG GAAAGCTGGC CACAAATTCG CCGAATGCCG 1080 TGCTCGAAAG GAAGGTACAT CAAACACAAA AGGAAGAAAC TACAGCGAAA GCAACACGCC 1140 TGGGTTAAAA GATCGTTCGA ACATAAAATG TTTTAAATGC GACGAAATGG GACATGTGGC 1200 GTCTGTATGT CCCAAAGGCC ACAACAAATA CATCGAAAAG CGAGTTGACG TGTGCGAAAC 1260 TAAGTGAGCC AAGTGGACTG TGTTTTTCAA TTGGGTGAGC CCTTTCCATT TTTTTTTTTG 1320 ATTCGGACGC CGAATGTTCT TTAATAAAAT TAAAATAAAT TAAGTAACAA ATTAAGAGCT 1380 TGGTTTCGTT TTATTTATTA AATCTAATAA ATTTAGTATG TACAAATCCA AAGTTATCAA 1440 AGCCGTTCAA AATAATGCAC ATGAGAATAG TTTGGAAATA CTCTTTGACA TCAATATAGA 1500 AGTAAGCGAT AATGATAAAT GTAAATTAAG AAATATATTG GAAAAGTATG CCGATAGTTT 1560 CGTAACTGGA ATACCAAATA AGGCGCATTG AAAATTCGTT TAAGTAATAA AAATAAAAGT 1620 GTTCAAGGGC GCCCATAGCG ACTCAGTTTC CATTGCTACA GATTTTGGAT CAAATAGATC 1680 GTTTGCGGTG TGGTAAATAT TTTTTTCATT TGTTTTAGAC ATGGCCAGCG GATTTTATAA 1740 AATACCGATT CATCCAGATT CTATAGAAGA GTCAAGTATG TCTTTCCTTA GAGCTCCCAG 1800 TCGCATCATA GTGGATCAAG GCAGAAGCTT TGTCAGCAAC AAGTTCCGTG AGCTTTGTTC 1860 AACAAACAGG ATTGAGTTGT TTTCAATAGC TACAGATGCC AGCAGAGCAA ATGGGCAGGT 1920 TGAAAACAAA TGAGCGTCCA AGCAAGGAAA ACATGCGAAA GGATGTAGAG TAGTTCGGTA 1980 TGAGGTCATG TGTTAAGTGA GAAGTATCAT CGAGTGATTA ATGTACTAGA TGGTCATCGA 2040 CACAATTTGA AGTCGTTAGT TATCAAGCGA ACATATAAAT ATTCGCATGA ATGTAATGGT 2100 GACTCCCAAC AGAGAAAAGA GTTGAAGGAT GGCACTAGTG ATCGTGCATC TGTGTCCCAA 2160 GAGAGAAAAG AGTTTAAAGA TGGCACTAGT GATCGTGCAT CTGTGTCCCA AGAAAGGAAA 2220 CATTTGGCGA ACTGTTATGA ATAGAATGGC GTGTGACCGC ACTATAACAG CTAACTCGCA 2280 GGGACGAGAA GTGTTATGGC GGGGGTCAAG TGATGATGAC TGCACTATAA CAGCTAACTC 2340 GCAGGGACGA GAAGTGTTAT GGCGGGGTTC AAGTGATGAT GACTGCACTA TAACAGCTAA 2400 CTCGCAGGGA CGAGAAGTGT TATGGGGGGG GTCAAGTGAT GATGACTGCA CTATAACAGC 2460 TAACTCGCAG GGACGAGAAG TGTTATGGCC GGGGGTCAAG AGATTGGGTG TGAGAGAAGA 2520 CACGCCAATG TTATTATGAG AAATTAAAGT CATGGAAAAT GTAAATAGTT TGAAGTTTTG 2580 ATATGTAAAT TGGAGATGTC TTTGTTAAAG AAAAATCAGT CATGAGATGA ATTGTCAATT 2640 AAATAATTAC TGATTATTAC TTGTTGTCAT TAATTGTTCT TAAGTTGACG AAGTTGTGTG 2700 ACTTGGACTT GATTGGTGGA TTAGGCACAC GAGGGACGTG TGAAAGGTCA GGAAGGCCGT 2760 GTCGGAGCGC GACTCTTCGC ATTCCAGAAA TTCCTTCTGT GATTTCATAT TCTGATGAAA 2820 TTGTAATATT GCGGTAAAAT TCTGTCGAGC GCTGCGGCAG AGGCACGAAC AGCCTCTGCA 2880 GCTGAGTGAG GTCCGATCGA GTGGTCATAA ATAGTGTGTT AGAGAGAGAT GACAATGTGG 2940 CACACGCCAG TGTGTATAGC CATTAGAGAA TATGATGAAG AAGGGACATG TAAGAAGATC 3000 CCTTCAGTGA AGTTTGACTG CTGACGTCGA TCGGAACTTG CTGCGCTGAC GCACAAAATC 3060 GCGAAGTGAA TAAATAATAT GGATGAGACT CCTGTTTCGC CGACA 3105 // ID BAGGINS standard; DNA; INV; 5453 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063440; baggins. XX FT source nnnnnnnn:1..5453 FT SO_feature CDS ; SO:0000316:398..1771 FT SO_feature CDS ; SO:0000316:1771..5385 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 5453 BP; 1576 A; 1325 C; 1418 G; 1134 T; 0 other; GGCCGCCACG CCAGTAGGTA TCACAAGGGG ATCAACGCGC CACCAACGGT GGTACGCGCC 60 GTAATTGTGA CACACTACTG GAAGTGCAAC TACACTCTCC ACGCGAGGGC GGCTGGAAAC 120 AGGCTCTTAG TTAGGTCATG TCTCTGCTAA GAGTGCTGGC TAATCGTAGT TGGGCTGGAG 180 CTGGTAACTC TCAAAAATTT CCACATCTCC GGGCCGGACT AAGGTGTCAT TCGACCCGTG 240 CCGAGAGTCT GCCAGGCTGG GGGCCCGGGA AACAACTAGC CCAGTCGCTA ACGGAGAGTC 300 CGGGCAAGTG GCGTCCGCCC GGTCTGGTTA GCCTTACCCG GGTACTGCGG ATCTCTGCCC 360 GGGTGGACCG TTTATTTCTC TGCAACTCGT GGGAACCATG AACAATAGAC CCAAAAATAT 420 TAAAATACCG CAGGCCTTGA CTGCGGGAAA TGTCGCAAGA CAGTTTGAGG ACCGTCTCCT 480 TGACGGAGAG CGGCCTACTT CGGTGGTAAC ACCAAAAAAT GCTCCTAATC GGAGTGTGCC 540 AACGGCACAA GCACGCCCAG TTGCGGGAGG GAAGAATCGG GCAGCCACAC CGAGTCACTC 600 GACTGTGGTC CAAGCCGGCC AAGCGGGACG GGCTAAAAAC CCATTACCGC AACTTGGTCA 660 GCCTGGGGCC CGGGATGACC CTAACCGTAA GGGGCTGAGC GGGGCCAGAA CGAAGTGGTA 720 TCTTCGTTAC CTTCAGCAGG GCAAGTCACC CGCAGAAGCT TTGTCTCTGG CCAAGACACC 780 GCAGCCGCTA GAGCAAAGGC CCAACTCTGC TTCGAAGCGG GCAAACTCCA CCCTGACGCC 840 TCCGACGGAA ACGCCTAAGA GGCAGAAGGT GGAGAAGAGC AGGCCACTGA CTGCTCCTTT 900 CGATGGCCCA AGTACCTCAA GGGCTGCAAA GGCAACTGAG GTACGCAAGC CATCATATGC 960 GGCCGTAACT GGGGCCATTA AAGTTGCGGT CGTACCAGAG GGGTACCCCA AAGTCGTCCT 1020 CAGCAGTGAG AACCTCTCAC AGCTCGAGGA CGCACTACTG GAAGAGATAG TCCTCTCCGG 1080 TTGGGATTCT CCAATTAAGT TTGGAGGAAT CCACTTCAGG GTGGGTCACC TGATAGTGGA 1140 TTGCCGCAAC GCAACCACTG CTGAGTGGCT GCAATTAGCA GTCCCCAGCC TCAGTAAGTG 1200 GAGCGGTGTC TCCCTGGAGG TAAAGATGGG CGACGACTTG CCATCGTCAC ACAACATCAC 1260 CATCTTCTGC CCCAGGACAG GAGACAAGAC AACAAAGTGG ATCATGGAAT TGATCAAAAA 1320 ACAAAATGAC TTGGACACTG AGAACTGGCG TCTTATATCA AGGAAGAATG AGGGTGGTGG 1380 CTCGCTCCTC AGCCTTGGCA TTGATGACAA CTCATGTGCC AAGATTATTT CTAGCGACCA 1440 CAAGCTGAGC TTCAGGTTTG GAGACATCTC AGTTTGTGGT CTAAAGAAGG CCAAGGCGAT 1500 CAAAGCCGGC ACAAGACGGG TCGAGACTCC TCGAAATCCA CCAGTCTCCG CGGAGAAGGA 1560 GAAGCCCAAT AAGCTCTCCG AATCCAGCGA GGATCTGGCG GATGATGAGG ATCTCGATGC 1620 GACCGTCATC GAGATGGGGG ATCAGACCCC GATATCCTCT CAGGAGCTAG GCATGGAACT 1680 AGACCTGCTG AGTAAGGAAG AAGCGGTACC GGCGGATGGT GATCTGGGCA TCTCCAGATC 1740 AGCAACGCCG ACGATGGAGC CGATCATCTA ATGGTCGGGA GTACAGGGAT AGCTCAGGTG 1800 AACATCCATC GCGCCAAGGC GGCCTCGGCC GTGCTGGCAA GGATGTTCAC CAAGCAACAT 1860 CTTGGGCTAG CCCTGGTACA GGAACCCTGG TATAACCAGG GAATAAAAGG ACTGTACGTG 1920 AAGAACGCCA AGGTAATTTG GGATCAACGG GCTTCTAGCC CTAGAGCCTG TATCCTGGCA 1980 TGTAGAAGTA TTAATTATTA TATACTTACA GAATTTCTCA CGCGGGACTG TGTACCGATC 2040 GTGGTAGAAG ATGTGGGAGC AGCCAAGAAG ACTGTGGTGG CATCGGCATA CTTTGCCGGT 2100 GACGAGGCGT GCCCACCGCC CGAAATCACT GCACTGGCTG AACACTGTAA GAGGATGCAC 2160 TCACCCATCA TCATTGGATG TGATGCAAAT GCACATCACG TGATATGGGG CAGCAGCGAC 2220 GTGAACCAAA GAGGTGAGTC ACTCTTAGAA TTTATCTTGA ATAACAATCT AGAAATCCTA 2280 AATGTCGGCA ATGTTCCGAC ATTTGTTACT AGGGTAAGAT CTGAGGTTCT GGACATTACA 2340 CTTTCCAGCA GATCAGTGGC TTCGCATATA AGCGACTGGC ACGTATCTCC TGAGGAGTCA 2400 ATGTCAGATC ACAGAATTAT TCGTTTTAAC ATAGGGCTAA ATCTAGAATG TAGCGAGCGT 2460 AAGCGCAATC CCAGGAATAC GAACTGGGAA GGGTTCAATG CCTCCTTACT GCGAGATCCA 2520 GTTACTAGGG TAACCGGGAA ACTCCGTACC ACTCTAGAAC TGGAGGCCGC CGTAGAAGAT 2580 ATTAACTCGT GTCTTACTAA CGCGTTCAGA GAAAACTGCT CCCTTGGCCG AGCCAAGAGG 2640 GATAAAGATG CTCCATGGTG GAACGACCAA TTGGAGAAGT TACGGAAAGC GACCAGGCGT 2700 CTCTTTAACA AAGCAAAAAG AGAAGGGAAC TGGGATGCCT ACCGGAATAA ACTGACCATG 2760 TATAACCATG AGATCAGGAA AGCCAAACGA AGGAATTATA GAACCTTCTG TGAGGCAATA 2820 GTAAACACAT GTGAAGGCGC TAGACTTCAC AGAGCAATCT CCAAAGGAGT GCCAGAATCA 2880 AATCAGGCAC TACGAAAGGA GGATAACTCC TATACTATAG GAAGTAAGGA GAAACTTGAG 2940 CTACTACTGG CAACTCACTT TCCGGGAAGC ACGCTTCAAA CGGATGGGAA TACCCCGGTA 3000 ACGGCACAAA GTAGACCGCG TGCTACTGAT TGGGCCAAAG CAAAATCAAT AGTAACAACG 3060 GAGAGATTAA GATGGGCAAT TGGTACATTC CAGCCCTACA AATCTCCTGG TCCAGATGGC 3120 ATTCAGCCAA TCCTGCTTCA ACGAGCGCTA AGTCAGCTCG CAGTACCGAT CAGGAAAATA 3180 CTGATCAGCA GCCTAGCTTT AGGACACATA CCTACTGCCT GGAGTATAGC AAAAGTGGTC 3240 TTCATTCCGA AAGCTGGAAA GAAAGACATC ACTGATCCGA AGTCGTTCAG ACCCATCAGC 3300 TTGACATCGT TTTTGCTAAA AACGTTGGAA AAGCTAGTGG ATGTCAGCAT TAGAAGCACT 3360 CTGCTTGTGG AGCATCCGCT CCAACGGACG CAGCATGCCT ACAGGGCGGG CAGATCAACT 3420 GATACTGCCC TATATCACCT GAAAAGCCTT ATTGAGGATT CACTGACTCA TAAGGAAGTG 3480 GCGTTATGTG CCTTCTTAGA CATACAGGGT GCTTTCGACA ATACCTCCCA TGAGGCTGTC 3540 AACGCATCCC TGGCAAGAAG AGGACTAGAT GCGACAACCA GCAGATGGAT CAAATCACTA 3600 CTGGCATCTA GGCGAGCCAT GGCCACTATA GGAGGACAGT CGTCCACAGT ATCTACCACA 3660 AGGGGTTGCC CCCAAGGGGG TGTCCTCTCG CCTCTCCTGT GGAGCTTGCT GGTAGACGAG 3720 TTGCTGGACA GACTGACCCG CAGGGGTATA CTTTGCCAAG GCTATGCTGA CGATATTGTC 3780 ATTATAGCTA GAGGTAAATA TGAGGAAACA CTCTGCGATA TCATTCAACT GGGCCTCAAT 3840 ATGACCAGCG AATGGTGCAA GGAAGTAGGC CTGAGCCTAA ATCCTAGCAA AACTGTTATT 3900 GTTCCTTTCA CAAATAGGTA CAAGCTACAG AGAATGAAGG CAATAACCCT GTCAGAATGT 3960 CGCATAGAGG TCAGCAAAGA AGTTAAGTAT CTTGGGATAA CCCTTGACTC CAAACTTAGC 4020 TTCAAAACTC ATGTCGATAA CACAATTGAT AAGTGCACCA GAGCACTTTT CACGTGTAGA 4080 AACATTGCTG GTAAGTCGTG GGGAACCTCA CCACGCATAA TAAGATGGCT GTACCTCATG 4140 GTAGTCAGAC CCATACTAAC CTATGGGGCA ATAGCCTGGG GTGATAGAGC ACGCCTTAGC 4200 ACCACTAAGG TGAAACTCCA TAAGCTGCAG AGAATGGCCT GCGTATGCAT GACAGGAGCA 4260 ATGCGTACCT GTCCCACTGC AGCCCTAGAA GTACTAATGG AGGTGACGCC GCTTCATATC 4320 GTCATTGAAA TGAAACGGAA AGCCACCCTA ATAAGAATTG AGGGAGCAGG AAATGACTGC 4380 AACCTCACAA GTAAGGATGC TGAAAGCCTA AAAAGGGATA TCCCTTTGCT AATGCAACCA 4440 AGGGATGAGA TGCCAGCTGA GCATAGGTTC GCTCAGAACT TCAGCACTCA TCTCAGTAAT 4500 AAAAACAGTT GGACATCCCT GGGGAAAGTA CACCCTACGA AACCACAAAC AATAAAGTGG 4560 TATACAGACG GATCCCTCAC CGACGAGGGA AGTGGGCTGG GGGTTGTAGG CCCCAGGTTG 4620 AAATACCACG AATCAATGGG CAGATACACC AGCATTTTTC AAGCTGAAGT CTGTGCTATT 4680 GGACGCTGTG CGGAGTTTAA TCTGCAAAGG AACTATCGTG GCAAGGACAT TGCTATACTG 4740 TCTGATAGTC AAGCAGCCAT AAAGGCGCTC AGCAAAGCTA AGATAACATC TAAGCTAGTA 4800 AATGAAGTGA GGACAGCCCT AGACAAACTA GGAGCTGTCA ACAAACTCAC AATAAGGTGG 4860 GTCCCGGGAC ACAACAACAT CCCGGGAAAT GAGCTAGCGG ACAACCTAGC CAGGAAAGGG 4920 GCAGAGAACC CTCTAATTGG GCCCGAACCC TTCTGTGGTG TTGGTCACCA CAGAGTACTG 4980 GGCTTACTTA AGTCAATAGA GGAAGAAAAA CGTCTGTCCT TCTGGGAACA CCTACCAGGA 5040 CTTAGGCAGT CTAAGATTCT CCTCCGTGAA TATAACCATA AAAGGTTCAA GACCTTAATG 5100 ACACACGGGA AAAACACCGT GCGCATTTTG ACTGGCCTTC TTACGGGGCA TTGCCGACTT 5160 CACAGCCATT TGCACAAGAT TGGCATTGCA GACAGTGAAC TCTGTCGCTT CTGCTGCATG 5220 GAGGAGGAGA GCTCTGCACA TATTATATGT GACTGCATGG CGCTTTCCAT CAGGAGGAAC 5280 AGACTCCTGG GCATGTATGT AGTCCCACGG GAAACAATTG CAGCCCTGAA CCCCAATAAA 5340 ATTCTGGCAT TTATACAGTG CATTGGACTT CAGGGGGATC TTTGAGTCAT GAAGGGGTAG 5400 TACAATAGAT CCGACTGGGT CGCAGTACAC ATACAAACCC ACTTAATAAT AAT 5453 // ID G3 standard; DNA; INV; 4605 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063506; G3. XX FT source nnnnnnnn:1..4605 FT SO_feature CDS ; SO:0000316:787..1818 FT SO_feature CDS ; SO:0000316:1982..4504 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4605 BP; 1656 A; 977 C; 840 G; 1132 T; 0 other; CAAATTCGTG TCGTTTTGGC TGTGCAAAAA GAATTTACTG TGTGCATAAA CACACACACA 60 TCATTGCTAT TGTGTGTTCA GTTTGGTTGC TGGCTACCGC CAAATTCAGA AACCTGTCGC 120 CTCAAGATCG CTTACAGGAG GCAAAAAGTT GTCTTTATAC CAAAAATGCA TTGAAAGTGC 180 GCATCACCTA CGACATTGTA ATGCCGATCG CTGTCGTATT TGTGGAAGCA AGCACAACAA 240 TTTGCTAATT TTTGTCTCAA GGCCTTCGTC ATCTCGGTCC TAGCCTGCTC AGCTTCAGCT 300 ACCATCTGAT GCTCATTCCA AGTAATTGCG AGTAGTTGCA CAGGAATCAT AAATAAATAA 360 ATCTCATCAA TGGTCAATCG TCTTTCTTAG TGGCTCAAAA TCTCGGAGCA GACTTTGTGT 420 TGTTGGCCAA AATAAACCAG GTTCTTGGGT TCCTTATCGC ACCTTGCTTG TCTCCGGCTC 480 GCAACTACAC ATAATCACCT CTCGTCTGGT ACGCACTTAG CAATCGCCCA AGTACAATTC 540 TGCCGCTACA GTTTTTGGAA TTGGTGAAGC CAGATTTAGC GTTCATTTGT CTGTACAAAA 600 AACAAAAAAA AAAAAGAAAA CTGATAAGAG ACTCCAATTT AGCTGCTGTA AAGTACCAAT 660 TTAAGATGTC AGCAGAAGAC GAAAGCAATA TTTTTGCGTG GCAATAAGTG CCCACAAACC 720 CACGTTCCAT TAAGCGAAAA AATGTAAACT CCCCAACGGT GGCGGGGAAA GTTATGACTA 780 TCGACCATGG ACCTTCCACT TCTAGCCATG ATAACTACTT TGCTGTGCTT GCTGAAAAGG 840 ACATGTACAA ATCCAACAAA AGATGATGAA ATTAAAAGCG ATACACCATC ATCTGAAAAA 900 CCACCCCCAA TTTTTATATC TGATGTCAAT GATATCAACG GATGGCTTCG TTACCTTAAC 960 TTACACATGG ACAGCACTGA ATTCACGTAC AAGTCACAAC GGGACGGACA AACTCGCATC 1020 ATGGTAAAAA ATGTTGAAGA ATTCAGGGAA TTAGTCAAAA AATTAAACAC TGACCAAGTG 1080 AAGTACCACA CATTCCAGCT TAAGCAAGTT AGGGCATTTA GAGTTTTAGC CAAAAACATC 1140 CACCACTCCA CCGATATTGA AAGCATAAAG AAGTCAATAG AAATGCAAGG CCACGTAGTT 1200 AGAGGAATTC ACAACATTAA AAGCCGCATT ACCAAGGAGC CATTACGAAT GTTTTTTATA 1260 GACCTAGAAC CACATAACAA CAACTCCGAG ATCTACAACG TAAAACATAT CAATAATGCT 1320 ATAATAACAA TTGAACCACC AAAGAAAGTT AGTGACATGG TGAAATGTTA CCGATCCCAA 1380 GAATTTGGGC ACATCAAATC ATACTGCAAT AAAAAATTCA GATGTGTTAA ATGTGCTGAG 1440 AATCACTCAA GCATCTCCTG TCCAAAAGAA AAGACTGAAC CAGCCACTTG TGCCAACTGC 1500 CATGAAAACC ATGCCGCTAG CTACAAAGGC TGCAGAATCT ACCAAGAAAT TTCCAAAAAA 1560 AGGTCCAATG TACCCAACAG CAGGCCAGCA CTAACATCTC TTTATGACAT ACCGAGAGCA 1620 ACATCATTCA TTCCAACGAA ACCAAACCAT GAAAAAAGTT ATGCGAATCA AGCTAATTAT 1680 GCACAGGTAG TACGAGATGA GACTGAACAA GAAAAATCAA CATTGGATAG AATTGAAAAG 1740 CTATTAATGA AGCAGTCTGA GCTTACATCG AATTTACTAA ACATGATAAT GATGTTAGTA 1800 AACAAACTAT GCAAGTAGAT CTTCGTATCC ATCGTATATC CATATGGAAT GCTAACGGCT 1860 TTCCAACCAT CGAAATAAAA TAACTTATTT ATTAAAACCA GAAATATTGA TATTCTACTA 1920 ATTTCGGAAA CGCACTTTAC AAAGAACAAT TTTATAGCCA TTAAGGGATA TAACGTCATA 1980 AATGCTAATC ATCCATCTGG GTGGAAGTGC AGTTATTGTC AAAAATACTT TCCAGTACAA 2040 ACAAATAGAC AACATCTGCT CAAATATAAT GCAAGTGGCA TCGATTGAAA TAAAATGTAT 2100 GCATTCAAAT ATCTCTATAG CTGCAGTATA CATCCCACCA AGACACTCTT GGAAGCTCGA 2160 TGAAATCAAC TCTCTCTTTC GAGGGCTAGG AAGACACTTT ATTGCTGGAG GCGACTATAA 2220 CGCAAAGCAC TCATGGTGGG GGTCAAGATT AGTAAATCCA AAAGGCGCAG AACTATACAA 2280 ATGCATAACT AACAATAAAT ATTCAACCTT ATCCACCGGA AAACCGACGT ACTGGCCAAC 2340 AGATCTTAGT AAAATTCCAG ATCTGCTGGA TTTTATTGTA TTCTCTGGAC TACCAGCAAG 2400 CAATTTCCAA ATAGAGGAGA ATTTTGATTT GAGCTCTGAT CACACACCGA TTATTGTAAC 2460 CTATGGTACT ACAGTTTGTT TCAAACAGAA GCCATACAGA ATAATTAACG CAAAAACCAA 2520 TCTAGGTGCT TTCAAAAACT GGCTCGAAGA ACGAATTTGT CTAAATGTAT CGTTAAAATC 2580 AGGGGAAGAA ATTGAGTCTG CAATTGAAGC TTTGACAAAT TCGATTCATG CAGCTGGGTA 2640 CATGTTTACT CCACCGCCAA ATGAAAGTAC AAGCAGGCAA ACTTTTCTAC CACTAGAACT 2700 AAGAGCGCAA ATAGCTGAAA AAAAAAAGCT TCGAAAAATA TGGCAAACCA CGCGAAGCCC 2760 AATAGACAAA AGAAAATTCA ACAAAGCTGT CAATAACCTG AATAGCAGAC TGTGGGAAAT 2820 TAAAAATGAA GCGACAGCCG AATACCTTCG AAATCTTGAT CCAAACACAA GTGACCTTTA 2880 TTATAGTCTG TGGAAAGCAA CTAAATATCT AAAGAGGCCA ACCAGAAGGG ATACCACAAT 2940 TGTTAATAGC ACGAGACATC GCTGCACAAG CAACGCTGAG AAAACTGAAG CTTTTGCTGA 3000 ACACTTAAGC TCAGTCATTC AGCCAAATTT GGAAAATGAT CCTGAAACAG TAGCTGAAGT 3060 ACATAACTTT TTAGAGGCTC CTTGTCAGAT GAGCCTGCCT ATTAACCAAA CTACTATCTC 3120 AAAAGTAGAA GAAGAAATAA AAGAGCTAAA CAGGAAAAAA TATCCAGGAT ATGACAAAGT 3180 CTGTGCAATT ACAATCAGAA ATCTCCCTCC AAAATGCATC CGATTGCTGA CACACATATA 3240 CAATGCCATG CTCAGACTGG AATACTTCCC CAGTCAATGG AAATGCGCGG AAATTATAAT 3300 GATTGCTAAG CCAAACAAAC CGGAAAACTG CACGTCTTCA TACCGTCCCA TTAGTTTGTT 3360 GGGGACCTTC TCTAAAGTAT TTGAACGAAT ACTTTTAAGA AGAATGTTAC CTGCACTGGA 3420 CGAACTTAAC ATCATATCCG AACACCAGTT TGGGTTTAGA AGAGGGCACG GAACGATTGA 3480 GCAATGTCAC CGCATCACTC ATGCAATAGT GAACGCATTA GAAAACAAAA ATTACTGCAC 3540 GGGAGTATTC CTGGATGTCA AACAGGCATT TGACACAGTC TGGCATGAAG GTCTTCTGTA 3600 CAAAATAAAG AAGTGGCTAC CCGCACCATA CTTCCAAATT ATATCGTCCT ACCTTAAGAA 3660 TAGAATATTC TATGTTCGCG AACATGACGA ATTTTCAAGT ATCTACACTG CTGAGGCTGG 3720 AGTTCCACAA GGGAGTGCTT TAGGTCCAGT GCTGTACACA ACTTCCACAG CTGACTTACC 3780 GACTACAAAT GCTGTTGAGA TAGCAACATA TGCTGATGAT ACTGCTATTA TTGCCACGAG 3840 TAATGACCCG CGCGTCGCAT CCAACTTAAT TCAAGAAGAA CTAGGGTTAA TTGAAGCATG 3900 GCTTAAAAAA TGGCGAATAG CAGTAAACGC ACAAAAATCA GTGCAAGTCA CATTCTCGTT 3960 GAGACCCGGG AGTTGCCCGC CAGTTACACT AAACGGCTCC ACCATACCAG TTAGTGAAAC 4020 GGTGAAATAT CTTGGAGTAC ATCTGGACAG GCGTCTCACT TGGAAGCAGC ATATAAAAGC 4080 GAAGCAAACT CAGTTAAAGA AAAAACCAAA AAAAATGTAC TGGCTCTTAG GACCCCGCTC 4140 TCAATTAAGC CTAGAGAACA AATTACGGCT GTATAAAGCA GTCCTAAAGC CAGTATGGAC 4200 CTACGCGATG CAACTGTGGG GAACATCTAG GAACTCTAAT ATTGAAATAT TGCAGAGATA 4260 CCAGTCTATG ACGCTAAGAA CAATCACAGG TGCTCCATGG TATATGACTA ATCATGACAT 4320 CCACAAAGAC ATGGAACTTC CCCTTGTTAA AGACGAAATA AAGTCAAATG CAGAGCGATA 4380 CATAAGTAAA CTACACGACC ATGATAATGT CCTAGCACTA GGCCTACTAG ATAACACCAG 4440 CAACCGAAGG CGACTAAAAA GATTTCATGT ACTGGACCTG CCATATAGAT TTACAAATAG 4500 ATAAAAATAT GTAAATATGT ATATATATGG ATTGAGCTCA CTGGAGCTCA ATTCATCCGG 4560 TGATACAATT ATTTAACGCT ACAGATTGCA AATACAAAAA AAAAA 4605 // ID MARINER2 standard; DNA; INV; 912 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063401; mariner2. XX FT source nnnnnnnn:1..912 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..29 FT SO_feature terminal_inverted_repeat ; SO:0000481:883..912 FT SO_feature CDS ; SO:0000316:134..802 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 912 BP; 257 A; 213 C; 204 G; 238 T; 0 other; CAGGGTGCGG CAGCATAACT TCCTTTTTTA AAATGCGCGC CATTCAGTTA GTTGATGTCA 60 TAGCGGAGCG CTAGTGGTCC CGTTCAAGAG GGGCTACTGC AAAGTTTTGT CCCGACACGG 120 TTCAGTCGCC ATCATGCGTT GGAATAGTGA GGAGCGTGCC TTTGCCGTTG AGGCCTACTT 180 TTCAAGCGGA TGTTCGGTTA TTAAAACACA GCGTGCATTT CGGAATCGCT TTAATTTAGC 240 CCCGTTGGCT CCCGTCCCAG ACCGCAAATC AATTGTTACA TGGGTCACTA CATTCAGGCA 300 AACTGCAAGT GCGACAAAAC GAAGAACTGG AGTCCCTCGA CCCGTTAGAT CACCTGAGAA 360 CATTGAAGCA GTGAGAGCGT CAATATTGCG ATCTCCACGG CGTTCTGCGC GCAAACACGC 420 ATCTGCCCTT GGACTTGAGG ATTTGGGGGA CACTTGGTTC CAACAAGACG GTGCAACAGC 480 ACACACTTCA AGAGCATCGA TGGCTGTTTT GAGGGAACAC TTTCCAGAGC GCCTTATCTC 540 AATTAGAGGC GATTTGGAAT GGCCGGCACG CTCTCCCGAT CTGTCCCCTT GTGATTTTTT 600 TCTATGGGGT TTTTTGAAAT CCCGTGTTTA TGTGAACCGT CCAAGAACCC TACAAGATTT 660 GAAAACCAAC ATCCAAGAAG AAATTGCCAA CATAACACCT GCTATGCTAA CAAGAGTCAC 720 GACAAACGCC AGAAATCGGT TTACGCAATG TATGGAGAAT GGGGGACGTC ACCTATCAGA 780 TTTCATCTTC AAAAAAATAT AAATAAAAAC TTTAGACATG TACCTACATT ATAAAAATAA 840 ATAAATATTT TCCGATGCCT ACAATAGTTT TCATTGAGTT CTGAAAAAAG GAAATTATGC 900 TGCCGCACCC TG 912 // ID TRANSIB1 standard; DNA; INV; 2167 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063372; transib1. XX FT source nnnnnnnn:1..2167 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..43 FT SO_feature terminal_inverted_repeat ; SO:0000481:2126..2169 FT SO_feature CDS ; SO:0000316:544..1926 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2167 BP; 758 A; 325 C; 364 G; 720 T; 0 other; CACACTGGTT TTTTATGTGT TTTTTTTTGG CAGAAAGTCA AAAAAAAATA TTTTAACGGA 60 ATAAAAAAAT GTATGGTTTC TTTATATTTA GGATAGTCTT ATTAGATAAA AAGTCGATTT 120 TTACTTTTTT ACGCAAAACT GGCTTATCGT CGACCTCTCA AAGACAGCTG ATTGTCGATA 180 ACATGCGTAA GACAAAATTG CGGCAAATCA TTTTGCAACA TTTGTGGTTT ATATTTCCTC 240 AGAGAATTAA AAAAGTTTAA AGTTTTGTCA TGGAAAACAG TAGCACAGGT ATGTAGATTT 300 GTAGTTCACT AACATTTTTG TGTATGTTAT ATGTTTTTGT GTAGTTGTAT CGTGTTTTGT 360 TTATGTGCCT TTTTTACAGT CAACTTAACA AATGTAGCTG TGTTTCGTGG AATATAACTT 420 TTGTATGGGA AACCACTTCA AAAAAATTTT AACGGCAAGT TATTAACTAA CATGTGTATA 480 ATATTATACA ACTAGTTTCA TATGCTCCTT TTTGTTCCTT TTTGAAAAAA TTACGTTTTT 540 GTAGATGAAT ACTTGTTTGG GCATGCAGAG CTGCTAGACA TTTGGAGCAA CAATAGTCGC 600 TCTGAGGATG CAGTTAGGCC GCATATATTT AAAAAAATTG GTCAGAATAA TTTAAATGAA 660 TGTTGTATTT CATTGTTAAA TAAAAAAGTT AAGTCCTTCT TTGATTACAT AAGGAAACAT 720 TTGCCAGGTT GCAATAGATC GTTGGACAGG TTGAAGAAAA AACACTTCAA ATGGCTGGCA 780 TCTAGTATTA GTATAATCGC TGATTGCAAA ACTCCAAAAC TAGTTTATGA AAAGGCTAAT 840 GACCGACTTA AACGAAAATT AGCTTCGGAC CTTTTCGGAC CTGAAAATAA TACTTCTGTA 900 CCATTACTCT TGCATGCTGC TTCAGTTTCG TCTAAAAAAG GAAATGAAAA GGACATGGCA 960 GTTGTATTAA AAGCAGCTGG TAACAGCGAT AATATCAATA ATATAAAAAG AAAAATGTTC 1020 TCTTCTGAGC CCACACAAAT GTCCGTCGAC AATGCATTAG CCTTTTTATT TGAAAACGGA 1080 TTTACAAAAA GCCAATACAT CAATTTAAAA CGGAATACAA AAGTACACGG CTTTGATATT 1140 TACCCATCAT ATCCTGATGT TCTTAATGCT AAACTAAAGC TTAGACCAAA TGGCATCGAG 1200 TATTTTGAGA ATAAGGCGCA AGTTAAGTTA CAACAACTCT TAAACCACAC AACTTCGCGG 1260 ATTCTTGAAA TGCAGTATGA AACTTTCAAA GCAAATACCA ACTCGATAGA ATGCCAATTA 1320 ATATTTAGCT ATGGATATGA TGGATCAACT GGTCAGAGTA TTTATAAGCA ACGTTATGAG 1380 GAAATCGGTG CTGTATATGA TGGATCGCTG TTTGTTACTA CAATTGTGCC ATTAAAATTA 1440 GTAGACAATG AACAAAAAAT AATTTGGATC AACAGATCGC CACAATCTAT ACGGTTTTGT 1500 CGTCCTTTAA AAATTGAATT TATTAAGGAA ACTCGAGAGC TTATATTAGC AGAAAAAGAA 1560 AACTTGGATA CCCAGACTCG TAATCTGCAC ATATATTTGC ATAAAATAAA CAACATTAAT 1620 ATATTTGTAA AATATGTCGG TCAAATGACA CTAATAGACG GTAAAGTCTT AAACATATTA 1680 ACAGGATGTA ACTCTTGCCA ATGTTGCCCG ATTTGCGCAG CAAAGCCAAC GCATTTAATG 1740 AATGTGAATG ATTTTAATTC AGATATTTTT GATGCTAAAT CTCAAACTCT TCAGTTTGGA 1800 ATAAGCCCTT TACATGCGTG GATTCGCTTC TTTGAATTTG TTTTAAAACT TGCATACAGA 1860 GCTGATCTTA AAACTTGGCA TAATTTACAT CTTAATATGA TTTTATTTTA TTTTATTTTC 1920 TATATATAAT ATCAATAAAA TATTTAATTA AATACAGATA CAAAAATATT TTTTCTTTGG 1980 TAAGGTAAAT ATTTTTATGG CATATCGGCC AGATCGTTTA TATGGCAGCT ATATGAAAGT 2040 TGACCAAATC ACTTGAAATT TTGTGAATCA TCTTGGGGGA AGTAAAGAGT AACACAAACC 2100 AAATTTGGTG AAGATAGGTC ATCATTTCGA ATTTCTGCCA AAAAAAAACA CATAAAAAAC 2160 CACTGTG 2167 // ID TRANSIB3 standard; DNA; INV; 2883 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063370; transib3. XX FT source nnnnnnnn:1..2883 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..40 FT SO_feature terminal_inverted_repeat ; SO:0000481:2571..2610 FT SO_feature CDS ; SO:0000316:543..2610 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 2883 BP; 1014 A; 488 C; 511 G; 870 T; 0 other; CACACTGGGC CAGATTCCCT TTTTTTTGGC ATAAAGTCAA AAAATTTTGT AAATTGTATT 60 TTGAATTTAA CTATATATTA TGAAAGAGCA TACATTAGGG ACACCAAAAC GATATTTAAA 120 AACATTTAAG TTGCAACACT GCATTTTTTA TGCGATATCA AATTCATATG TTTGATCAGC 180 TGTTTGTTCA AGCAAAATTG GCGCGAAACT TTTACTCAAC TTTGTGATTC TGTTAAAAAT 240 ACACCACGTG TCCAAATATT TTCAAATAAA ATGGAATATA AAGGTCAAGG TATGCTTAAT 300 ATAGATATAA CTTACTGATT TAGAATTATG TGTTCCATGT TGTAATTGTT GTTGTCTTTG 360 TTTACGTCTT AGTTTAGGCA AAATCTAACC AAAGTGTTCG TCTATTGTGG AGTGAGAAAT 420 GATTTGTGCC AACTTGTGCC AAAAACTAAT AACTGATGTG TTTACTCAAA ACTGTAATTA 480 AATTGATGTT AGTAACGCCT TTTGCCTAAA TTTGCCTATG CTCTTTATAT TTTGCCTTTT 540 AAGACAAGTT TGAGATTAGC CGATCGGTTC TGCTGGATGA ATGGATCAAA GGAGATCGAA 600 ATGACAAAGC CTTAGTTGAC TGGGTTGTTG GCGAATTTAA ATTAAATGAT ATTACTATTG 660 ACGAGAAAGC TAAAATCGCT AAGCAAATCA GAAGCTTTCA CCTATATATA ATGAAAAATC 720 TGGCTAAATG TAACCGTACA ATTGAAGCAT TAAAAAAAAA ACATTCCAGT TGGCTTTCAT 780 CCACGATGAT GGTTGTCGTG GACCAAAACA ACAACCAAGT CGCTTCAGAC TGTCTAATGG 840 GACGACCTCG ACTCTCTTAC AGTGATGCAG GATCACGGCT CAAAAGAAAA ATAGCAACGG 900 ATCTGGCTAA TATTGAAGAA AATAATACAA GTCTTCTTGT CCATGCAGCA TCAATTTCAG 960 CCAAAAAGGA AAAAATGGGC GATACTGCTT TTGTATTAAA GCAAACCATT TCAACTTCAA 1020 CTGCATCCAC TGAAATTAGA AAGAAATTGA TATTCTCAGA GCCAGTACCG CTCACACCAA 1080 TTCAGGCTCT TGCATTTTTG ATTGACAGCT CTCTATCCAA AGCAAAATAC AACGACATGA 1140 GAGCTTTGAA CAAAACACAA TTTAGCAACA TTTATCCATC GTATAATAAA GTCAGAGAAG 1200 CTAAATTAGC GTGTAGACCT GCGGGAGTTC ATGTGACAGA GACCTTTGCC CAAGTATCTT 1260 TCCAAAATTT GCTGAATCAT ACGGCCAGTC GAATTATTCT TATGCAAGAG GAGGTTTTTA 1320 GGTGGCATGA AAACATAACA GGTGTCAAGC TGATGGCAAG CTATGGATTT GACGGTACTA 1380 CCAGGCAGAT TATGTACAAG CAGAGATTCC TAGAAGATGA ATCCCATTTC AAAGACTCTA 1440 TTTTCGTTAA TAGCCACATT CCAATAAAAT TAACAGATGA TTTTGAGACA CCTTTATGGA 1500 TAAATAGAAG CACCCAATCA GTAAGATTTT GCAGACCCCT AAAGATTGAA TTCGCTAAAG 1560 AAACCAAGGA ACTTATTTTG GAGGAAAAGT CGAATTTGGA GTCACAAATA AAAAATAACG 1620 TCACAGACTT TACTTACAAT TTTTCAATCG ACAGAAAAAT AGTTACATTT TCCCTTCATC 1680 TCACATTAAT TGACGGAGAG GGGTTTGAAT ATAATAGCTG GAAAGAAATC TTGCCAGGTG 1740 TGTCCTATCT GTGGAGACAG TCCAATGGAT TTTATAAAAA CCACCGAAAT ATCAAATATC 1800 AAGGCTAAAA TTGAAAATCG AAATTATTTC ATCAGTCCTC TGCACGCATG GATCAGATTT 1860 TTCGAGTTTG TACTCAAAAT CTCATACAAA CTTAATTTGG AAAAGGGGCA AAAGAGGGAT 1920 AAAGAAGAAA GAGACATGAT AACATCCCGT AAACAAGAGT TACAACAATT ATTTTTTACT 1980 CAGATGGGAT TACATGTCGA TGAGCCAAAA CGAAACCGAA GTGGAAATAC GAATGACGGA 2040 AACACCGCAA AAAGAGCATT TAAAAGGACA AGGCAACTTT CATCAATACA AGGATTAAAG 2100 TATGATATTT TGCACAGGTT TTATATTATT TTGGTTGCTA TGTCTTGTGA ATTTCGTATA 2160 TATATTTCTA AATTAAAACA ATGTATATGG ATCACTATAA ATGGTATCCG ATGTCAGCAA 2220 CTTTTCGCAA AGTTTTGATT CACGGTCCGC AGATAATAGC ATCGTCGCTT TTATCTATCG 2280 GCTGCTGGAG AGAACGCATC TGAAGCACGT AATAAATTCT ACAAGCGTGA TAGACGGTCA 2340 TATTCGAGAC AAATTTCTCG TGTTAACAAT TTAACGGACG TATTTCACCG ATCCATGGAT 2400 TCATCGGATC CTTTACTTTC AAGCCTAAAT ATAAACAAAA GATCGTCTCA AAATAAAAAA 2460 ATCGCTTTGC CGAAAGAAGT CATCAATCTT TTAGATGCAC CAAATGTAGA ACATATTAAT 2520 AATGAACATT TTTTGGGAAG CGATGATAGC GACTCGGATT CGGATCACGA TGATCCATTT 2580 TCTATGCATT TGGACGTAGA AAATTTTGAT TAATATAATA GTATAATTAT ATGTATATTA 2640 AATAGATACA TAAGAAAAAT TCAGCTTAAA TTTATTATAT TAATTAATAA AAAGAAATAA 2700 TCATGCCAAA TCTACTCCTT AACTTTATAT CCTAATATGG GCGGTATGGG TGGGCGTGGT 2760 CCGATTTACT TGAAATTTCA CGTACGTATT TTAAATGTCA AATGCACCAA TTCTACAAAA 2820 ATGCAGCTCT CTAGCTTTAT TATTTGACTT TATGCCAAAA AAAAGGGAAT CTGGCCCAGT 2880 GTG 2883 // ID TRANSIB2 standard; DNA; INV; 2844 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063371; transib2. XX FT source nnnnnnnn:1..2844 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..43 FT SO_feature terminal_inverted_repeat ; SO:0000481:2801..2844 FT SO_feature CDS ; SO:0000316:543..2612 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2844 BP; 1025 A; 487 C; 507 G; 825 T; 0 other; CACTATGGTC CAGAATTCGT TTTTTTTGGC ATAAAGTCTA AATTTTTATG TCAATATATC 60 GTCCTTTAGC AATTATTTTC TAATAGAAAA TTGATCATAT AAATAAAAAC CAAAAACAAA 120 ATTGACCGAA AGCTTCACTA AATCGTTTTA TGAGCTTTTA AAGGTGAAGT GTTAAACAGC 180 TGATTGAGAG AATGCAATTC CCGCCAAATT ATTTGCTTAC TTTTGCACGT GGTACGATCG 240 TTAATACGCA TCCATAAATT ATAGTAAAAT ATGGAAAATA AATGCCAAGG TATGTTGTTT 300 TTAACGTATG TAGTTAATAT ATGTGTAGTT GTACTGTTTT TGTGTTGTGT AACATCCAAT 360 TTATGTGTAT TTTAACAGAA AATTCTAACG TGTCCCCTGT GTTTCGTGGA GTATAACTAA 420 TTTTCTGACA ATTTCTGACA AAAGTTAAAA ACGTCAGTTT GTACCTTTAA CTATTGTCTT 480 TGACGGTAAA GTAATATCTT TTGCCCATTT TTGCTCGTTT TCTTAAAATT TGATTTTATA 540 TAGGCAAGTT CGAGTTCAGA CATAGCGACC TCCTGGACAT TTGGAGGCGT AACAATCGGC 600 AAATAAGTTA TGTTTCAGAT TACGTTTATT CCGCAATCAA CAACAATAAT CTGGAATCGA 660 AAAAGACAGA AGAGATTGAC AAAAAAATAG TACATTTTGA ATTATTCATT AAAAGAAAAC 720 TGCCCGAGTG TAACAGACAA CTAGACCGAT TTATTTCCAA ACATACCAAG TGGATGGCGT 780 CTAAGATAAC AATTCTTGTT AGTGATTTTG AAAACTCCAA GACACCACAG CAAATGGGAC 840 GCCGAAACTT ATCTTACCAA AATGCAGGAG AAAGATTAAA AAGGAAACTG GCATCCAATC 900 TGGCAAGTGA AAGTGATCAC GACACAAGTC TTTTGATTCA TGCAGCAACT GTTTCTGCTA 960 GAAAAGAATG CAAGAGGGAC GTTGCCTTTG TTCTTAAAGA AACTTTAAAA ACACCAGAAT 1020 TTCCAAGCGA ATCGAGGATG CAAATTCAAT GGCAAAAACC AACTCCTCTT TCACCAGATG 1080 AAGCTCTAGC TTATCTCCTG GAAAACACGC TGACAAAACA GCAATATATA AGCACCAGGC 1140 TTTTAAATAA AAGCCATAAC AGCGACATAT ATCCGCCCTA TAATGAAGTG ATCGAAGCAA 1200 AATTACAGTG CCGACCAGAG GGTATAGAAG TAATGGAAAA CACTGCTCAA GTGCTATTAC 1260 AAAATCTCTT GGATCATACA GCGCAAAGAT TAATTAAGTT GCAATCTGAT GTTTTCAAGC 1320 AATTTCCAGA TATCTTTAAA ATCAAATTAA TTTGCAGCTA CGGATTTGAT GGGACAACTG 1380 GTCATAGTGC TTACAAGCAG AAATTTGAAA CTGAAGCACT TGGCACACCA ATTTCTGATC 1440 AATCTTTATT TGTAACTTCT GTCATACCCA TACAAATTAT AGATTCGTTT AATCGGCATA 1500 TCTGGATAAA CAGAGCACCG CAGTCCATTC GATTTTGCAG GCCTCTAAAA ATTGAGTTGA 1560 TAAAGGAAAC TGCTGTCCAC ATAATGATGG AAAAAAATCA GTTAGATAAT CAAATTAATA 1620 ATCTTACGCC ATTCACTTAT AAATGTGATG AAACTCGTGA CATAGAAGTT ACCTATGAAA 1680 TGCACATGAC ACTTATAGAT GGAAAAGTCC TAAACGTATT AACAGACACT AAATCTACTC 1740 AATGTTGTCC GATCTGTGGA GTCAGCCCAA CACAAATGTT GAAAATTACA AATTTTAGTT 1800 CAGAAACCTT TGCACCAAAA GTAAAAGCTT TACAATTCGG GGTAAGTCCA CTACATGCTT 1860 GGATCCGGTT TTTTGAATTT GTTTTAAACG TATCATACAG AGCCGAGATA AAAAAGTGGC 1920 ATATAAAAGG AGAGGACAAG GTTAAAATAA GTATTCGAAA AAAATATATC CAGGAGGAAA 1980 TGTGGAGAGA AATGGGCCTT CGAGTAAGCA TGCCGAAGCA GAATGGCAGC GGCAATAGCA 2040 ATGATGGTAA TACGGCAAGA AGAGCTTTCG CAAGTACCAA ATTCTCGTCG ATAACCAGCT 2100 TCAACGAAAA TCTTTTGGAA AAGTTTCACA TTATTTTAAT AGCAATATCG TCCAACTATT 2160 TTATAAATTC CGACAAATTT CGTTCTTTTT GCGATATAAC TTTTCAATTA TATATCCGAA 2220 CATATCAATG GTATGGATAT CAAATCAATA ATTCGGCGTT GTTTCCACTT GGATGCCTAG 2280 GCGAAAATGC GAGTGAAGCC CGGAATAAAC TTTACAAAAG GGACCGACTA TCACATGCAA 2340 GGAAAAATAG TCGGATCAAT ACGATGAGTG ATATTTTCCA CAGAGCGATT GATTCCTCGG 2400 ACCCACTATT ATCAACAATA TGTTTAAAAG AAAGAGAGAG AAAAAATAGG AAAAAACCAC 2460 TTCCGAAGGC AGTAATAAGT CTTTTGGAAA TACAGTCTAG TTTTAAACCT GATCCATCAG 2520 ACAAAATATT GGATTCACAT TCTGATTCTG ATTCGAGTTC AGATTCGGAA ATTTATAATG 2580 TAGAACTAGA AGAAGAAGAG GGCGATTATT TTTAAAACAA CAATATACAC ACTTTTATTT 2640 AAAAAAAAAA AAAAACAATG AAATTTCGTT GACTCCTGAG TCATACCAGT AGTTAATATG 2700 GGAGGCAAGA GAGGGCGTGG TCGCACAGAC TTCAAATTTT GTGTGCAAAC TATTTGTGGT 2760 TTAACAACAA AATGTACAAA GTTTCAAGTC TGTAGCTCAA TTTTTAGACT TTATGCCAAA 2820 AAAATCGAAT TCTGGACCAC AGTG 2844 // ID GYPSY5 standard; DNA; INV; 7369 BP. XX AC AE003485; XX DR FLYBASE; FBgn0063432; gypsy5. XX SY synonym: nik XX FT source AE003485:109015..116382 FT SO_feature five_prime_LTR ; SO:0000425:1..429 FT SO_feature three_prime_LTR ; SO:0000426:6940..7368 FT SO_feature CDS ; SO:0000316:708..1865 FT SO_feature CDS ; SO:0000316:2156..5344 FT SO_feature CDS ; SO:0000316:5374..6612 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 7369 BP; 2822 A; 1461 C; 1232 G; 1854 T; 0 other; AGTTACCACA TCTACATCTA CTTCCCCACC CCCTAAACCT CCACGCCAGA ACCATTAGTT 60 CGTTGGCCCA CATCTGACCA CATCAGCAGT CCACGCCACT ACTTCCCCAC CCCCCTAAAC 120 CTCCACGCTG AGCCGCTAGT TCGTTGTCCC CTCATGTTGC AACAGTGTAG GCGGAAACAG 180 CGCAGCAAAA CGGAGGCATT CCCCAAATCC GCTGGGTCAG CAGATTCGCG ACAACACAGA 240 GCCATACATA ATACTTGATT CTCCTTTGTA ATAAGTCAGT CTTAAGCTAC ACTCGAGGGG 300 AGTGAGATCA AAATAAAATC ACAACCTTAA AGGGAAATAC CTTTTAATTA AAATCGGGAA 360 GAAACAATAG AGGGAAAAGA AACACAAAGG ACCACAGAAG GAAGACACTG GAAGGCTTCA 420 CTCGTTAATT GGCGCAGTCG GAAACTGGAC CTGATGCAGC TCCTTTTCAT CACAATTATT 480 CAACACGAAT CCATCGTTCA TCATCTGCAA ACAAAAAATA TTCAACACAC AATTAATTAA 540 GTATACATAA AAACACACAA TTAATTAAGT ATACATAAAA AACACACAAT TAATTAAGTT 600 CACGTACACA GATATACACT ATTAATTAAA TACATACACT TAATTTTTTT TTTTTTTTTA 660 CACAATTACA CAGAATTAAG AATTAAAGGA TTTCTCTAAA AGAGTAAATG TCTAAACTTG 720 CTAAAGTAAA GAAGCTTATT CCGGCGACTA GAAGTAGTAC TCGTTTGAGT GCGAGCTTAT 780 CAGACGTTAG GGAAGACCAG GTTGCCCTAC CACCCAGAAT TAGACCTTCT TCGAGTAGTG 840 CACCAAGCAC AATGACTAGT TCGTCAGAGA CAAGTAACTT TTCTTCCCCA CCCCCTCCAA 900 TTTCCCCTCA GGACATTTGC CTATCCCTAA AAATTCCGGA TGGCATTCGC GACCTTACTC 960 CCTATGATGG AGATCCAACT ACCCTAAACG AATTTATAAA TAATATCGAG GAAATTCTTC 1020 TTCTCATTAG AGGCACGGAT AAAACTCCGT ATGGTCAGTT CTTGTTAAGA GCAATAAAAA 1080 ACAAGATAGA AGGTAAAGCA AAGGCGTTAT TATTATCATT AGGCATAGGC TTAAACTGGG 1140 ATGACATCAA AGAAGCATTA ATAGGGAAAT TCGCTGACAA AAGAAGCGAA GAAACTCTAA 1200 ACTTTGAACT TCACGGACTT GTTCACAAGA GATTGCCACT GCAGGAATTT TACGAACGAA 1260 TTTTAGACAT CCAAGCAACT TACGGTAGAA CCATGGAAAA CATGGGAATA GAAACAGCAG 1320 CACTAAATCT TAAAAAGGAA TCTTTTGAAA AAACATGCTT GTATACGTTT ATACACGGCA 1380 TTAAAGGCCC ATTGGGGTCT ACTGTCAGAG CATTTAGACC TAGGAGTCTC CAAGAAGCCT 1440 ATGATGTGGC AATTAAGGAA AGGGATATTT TTATGAGAGA AAATTGGTCT AGACAAACAC 1500 AGAGTTCGAG ATCAAACAAT TACAATTATA ATGGGAGAGA TTATCGTAGA AATGACGAAC 1560 ATCGGGTTGA GAAATATAAT AAGAAATCGG AGTTCGACCG AAACATCAGA TCTAGGGATC 1620 AAAGAGGAAG TTATTCAAAG CCCAGAGACA ACGACAGAAA TTTTAACAAA ATGAGGGCTA 1680 TTATGCCAAA AGAAGAGGAC AGAAAACGCT ATGACAGAGA CAATAATTAC TCACGAAATC 1740 ACGGAAATCC GGGACAGCAA AGACAGACCC TTAACAATAT TGAGACAGCA GGTGTACAGG 1800 AGTTACATAA TATTGATGAT AAAAATTTTC AATTAAAAGC CTCGTCGGAC AGACGGGATA 1860 CTTAGAAAAA TTTAAAGTTG GAGGTTCCTT ATTGTACATT TTAATTAACT TCGGTCCAAC 1920 CAGTCCACTA AAATTTCTAG TGGACACAGG TGCCACTTCT TCCTTCATTA ATCCAGAACT 1980 TATTGAGGAA CATAACCAAC AAACCCTAAA TACACCAATC TTAATCACAA CAGCACTAGG 2040 CTCTCATAAA GTAAACAAGA TAGCCAAAAT AAAATTATTT ACTAAACACA ACTGTAACGA 2100 TACCTTTCCG TTACTTCTCT TCAAATTTCA TAATTATTTT GACGGTCTTA TAGGCATGAA 2160 CACGTTGAAT AAAATTTTAG GAGTAATAGA TATACCAAGT AAAATCTTAT CAACACCCAA 2220 ATTCAATTTT GAATTACTGC AGCAAGTAAA ACAAAAAGCT GAATTATTTA ACATAGAAGA 2280 AGCATCCAAG ATATTGGTAA TGCTACCCGT CACAATTAAG GAGGGTACAT TTTTATTTGA 2340 AGACCAAAAG GTTTCAGGAA ATTTGTTTAT TACCGGAGGC CTGTACACGG CCCACAACTA 2400 TTACAGTTGT ATGGAGGTGG TAAACTATTC CAAGAAGCAA GAAAAATTAT ACCTAGAAAC 2460 ACCAATAGAG GTAAATCACT TTAACACGGA CGAATTCTAT ATAGTAAATG ATATTAAAGA 2520 CTATGGTACG ATAAACTCCG AAAAGGAAAA AGAAAAGTTC AGTACCCTAA GATTAGAACT 2580 TTTAAATTCA GAAGAAACAA CGGCGTTCAA AAAATTATGC AAACAATTTC CTAATATATT 2640 CTACAAAGAA GGAGACAAAT TAACATTCAC TAATAGAGTC AAACATAGCA TTAAAACCAC 2700 TGACGAAATA CCAGTACACA AAAGACCTTT CAGATACAGT CCTGCAGAAA AAACAGAAAT 2760 AACCGACCAA ATTAACAAAC TCCTAGAGCA GGACATAATT AGGCACAGCC ATTCACCTTG 2820 GAGTGCTCCA GTTTTTCTGG TCCCAAAGAA ATTGGACGCT TCAAATAAGA AGAAATGGAG 2880 ACTGGTTGTA GATTTCAGAC AATTAAATGA CAAGACGATC AAAGATAGAT ACCCAATGCC 2940 CAACATCAAT GAAATTCTAG ACAAACTAGG GAGAGCTCAG TATTTTTCGG CCCTAGATCT 3000 AGCAAGTGGT TACCACCAAA TTGAAGTGGA GCCTAAAGAT AGATCGAAAA CAGCCTTTTC 3060 CGCAGTAGGT GGTCATTTCG AATTCATCCG AATGCCATTC GGCTTATCTA ATGCACCCGC 3120 AACCTTCCAA AGGGTGATGG ATAATGTCTT GGCAGAATTC AATGGCAAAT TTTGTCTGAT 3180 TTATCTCGAC GACATAATAG TATTCTCGAC GTCCTTACAA GAACATATCA ACCACCTAAG 3240 CTCGATCTTC AAAAAACTCA CTCTGGCAAA TTTAAAACTT CAGCCAGATA AATCGGAATT 3300 CCTCAAGAAG GAATTAGAAT ATCTTGGCCA CATAGTGACC GAAAAAGGCG TTAAACCGAA 3360 TCCAAAGAAA ATTGAAACTA TTAAAGCCTT CCCTATGCCT AAAACCAGAA AAGAAATTAA 3420 GTCATTCCTG GGGTTGCTGG GGTACTATAG AAGATTTATC AGAGATTTTG CCAAAATCAC 3480 TAAACCTCTA ACTCAGCAAC TTAAAGGAAA GTCAGACGTG ACCATCGACG ATAACTATAT 3540 TAAAACATTT GAATTTTGTA AAACACTCTT ATGTAATGAC CCCATACTAC AGTATCCCGA 3600 CTTCACTAAA TCGTTTATTT TAACAACAGA TGCTAGTAAT GTAGCAATAG GAGCAGTGTT 3660 GTCCCAAGGA ACTGTAGGAA ATGATAGACC TATAGCATAT GCAAGTAGGA CACTATCGGA 3720 GTCCGAAACG CATTATGCCA CCATAAAAAA AGAATTACTG GCAATAGTGT GGGCAGTGAA 3780 ATATTTTCGA CCATACTTAT TTGGAAACAA ATTTCTACTG GTCACTGATC ACAAGCCTTT 3840 AGTTTGGTTA AAAAACCTAA AAGAGCCTAA TTCCATGTTA GTGAGATGGA AACTCCAACT 3900 CTTAGAATAT GACTGTGAAG TCATTTATAA AAAAGGTTCT CAAAACGTGG TGGCAGATGC 3960 ATTAAGCAGA ATTGTTGTCG AACTCAACAC TAATGAAGCC AGACCAATAA CCTCAGATAC 4020 AGAAATTTTA ACCTCTAACA GACCAATAAA TGAATTTGCA ATTCAGATCA TATTAAAGAT 4080 TGCAGATTGT GCAAACGACT TATTGGAAAC ACCTTTTAAA AATAAACTCA GACGAATTAT 4140 TTCCAGACCT CTGTTCGACG ATCAGACAAT GATAGACGTA TTAAAAACGG CACTAAGATC 4200 AAATAAGACG CATGCAATTT ATACCACTGA TACAATTTTC GAATCAGTGC AAAAAATTTA 4260 TGCAGCATTT TTCGCTCCAA GTAAATCATA TAAAATTATA AGGTGTACAA CATATTTAGA 4320 TGAACTCCGA ACACCAGAAG AACAGGTCGA ATACATATCT GACTACCATA TCAAGAACAA 4380 CCATAGGGGC ATTGACGAAA CCGTTAGTCA TATTAAACGA CAGATTTATT TCCCCTGTTT 4440 AAAAGAGAGG GTTTCCCAAC TGATAAACAA ATGCGACATT TGTCAAACCC TCAAATATGA 4500 TAGACAACCT CAGAAACCAA TATTTCAGCT AACAGAAACA CCTAATAAAC CACTAGACAT 4560 TGTACACATA GATCTATACT CAATTAACAA TAAAACCATT TTAACAATAA TAGACAAATT 4620 TTCAAAATTT GCAGAAGGTT ACACTATTCC ATCTCGAGAT TCCATTAACA TCACTAAACA 4680 TATGATGTTC TTCTTCAAGA CCCATGGTAT TCCCAAAACA ATAGTTTGCG ATCAAGGCCC 4740 CGAATTTGCA GGAATTATTT TCAAAGAATT GTGTAATCAA TACAATATAA CTTTACATGT 4800 AACATCTTTT CAACAATCAA GTAGTAATGC CCCAGTTGAA AGATTACATT CATCGTTGAC 4860 GGAAATTTAT AGGATAATTT TCGAAAAGAA AAAGGCATTA AAACTCAACC TAGACCATGA 4920 TTGTATCCTA ACAGAAACAT TTATAACGTA CAACAACGCC ATACACTCGT CTACAAAATT 4980 GACTCCATAT GAAGTTTTTA CAGGACGAAC CCATATTTTC GAACAAAATT ATAAGGCAGT 5040 CAGTCAACAA GACTACATGA GGGAACTGGA AAGTTATAAG AACAACTTAT ATGTAGAAGT 5100 AAAGGAAGAA TTTGAAAGTC AAAAACTCAA CAAAATTCTA AAACTAAATG AGGACAGAGT 5160 ATTACCAATA GCGGTACAGG AAGGAGATAC TGTTTTTAGA AAGGAAAATA GGAGAAATAA 5220 GCTAACACCC AGATTTACCC AACACAAAGT AGCTGATGAT AACGGGGTAA CCTTTACAAC 5280 GAACAAAAGA CAGAAAATAC ATAAATCAAA AATTAAAAAA AGAATCAGAC AAAATAGTAG 5340 ATAAATATAT AATAATATCA TTTCAGGTTA TACATGACAG CAGCTCAACT GCTCCATGTT 5400 CAACATGTTA AGGAAAATAC CCCCCTAGCA GAGATCAAAC TGGGAGAAGC AAGGACCATT 5460 AAAACATATA GTTCAGTGGC CCATGTAATA GAACTAACGG AGTTAAAAAA TAATTTGAAT 5520 AAATTAGAAT CAAGTCTACA AAATTTACAA GATGAAGACT CTAATGATTC AATAACACCC 5580 ATACAAAGAA AGATAGACCA AGCAAGAGAA AAGATCTATT CCTTGACCCC AAACCGAAGA 5640 GAAAAGAGAG GCTTAATAGA CGGTCTAGGT AATGTTATAA AGAAAGTAAC TGGGAACATG 5700 GACGCAGAGG ACGCAGCTAA AATCGAGACA AAATTTAAGC AAGTATTTGA AAATGAAGAA 5760 AACATTAACA AGAACCTCGG GCAACAAATT TCTTATAACA ACGACATAAT CGTAAATTTT 5820 AATAATATGT CTGATCATTT TTATAAAGAA CAGGAGAAAA TTGAGAGGTA CATTAATAAC 5880 TTTAAAAATG GCATTTCTAA GGAATTAATA GTAGAAGATA AGAAAATTAG AATTTTACAA 5940 ATATTGAATA TAATAAATAA TAATATAGAT ATTCTCCAAA ACCACATACA TGATATAGCT 6000 GAAAGTATAC TATTGGCTAA ATTCAAAATA ATTCCCAATT TTATTTTACC TATACAAGAA 6060 TTGAAAGATA TTGCACAGAT TTTTAAACTT CAAAATTTTT CTATTAATTC AGAAGAGCAT 6120 ATTTACAGAC TACTCAACAT AGAGATAGAA TCCAAAAAAA CAAAAATAAT ATTTCATATA 6180 AAGATTCCAA TATTTGATAA GTACACTTAC ACACTTTCTC ACCTAATTAA ATTACCTATC 6240 AATGGCACGA ATTTTCTCAC TCTACCCAAA TATATATTGT ACGATGATAA ATTTAATTAT 6300 ATGTTAAGTT TTCAAGAAAA ATGTTCAAGA ATACATGGCA CTTACATCTG CGACCCCGGA 6360 ACTGCGGAAA GTAACATGGC CAACAGGGAG TGCGTGAAAC AAGTGCTACA AGGAATGAAC 6420 CCTGTTTGCG AGGCAAAGAT TATCAACACG GACGAGGCGG TAGTAGAACC GGAACAACAG 6480 TGGATTGTCA TTATCAGCAA GAATCAACTA AGGGCTGAAC CAAGTTGTGC TGAACCGTTC 6540 ACTATTGAGG GAACGACGGA TTATCAATCA CATAAACTGT TCCATATTTA TAGACGGTAC 6600 CAAATACGAT GACAGCGAAT TGTTCTACCA AGAAAGGATT AAATTATCAC TACCAAGTAA 6660 CGAAGAACAT ATCTATAAAT GCAACATCGG AGGATTTAAG CCTAAGCAAA ATAGTACTGC 6720 ATCTTCAAGA GAATAAAGCT AAAATATTCC AAGTCAAGGA ATCTACGGAT TAACCTTCGA 6780 TTCATAACCT ACGGCATCAA GTATCATCCT GGTCATATTT ATACTGTATG CTTTAGCAGT 6840 AAGGAGAAAC ACTACTTACC TGCCCAACCC TGTTACCTCA GTCAACTACG TTCCGTCTAT 6900 TCCTTCGTTA TGGCCGTCAC TTTATTCAAG GGGGGGAGGA GTTACCACAT CTACATCTAC 6960 TTCCCCACCC CCTAAACCTC CACGCCAGAA CCATTAGTTC GTTGGCCCAC ATCTGACCAC 7020 ATCAGCAGTC CACGCCACTA CTTCCCCACC CCCCTAAACC TCCACGCTGA GCCGCTAGTT 7080 CGTTGTCCCC TCATGTTGCA ACAGTGTAGG CGGAAACAGC GCAGCAAAAC GGAGGCATTC 7140 CCCAAATCCG CTGGGTCAGC AGATTCGCGA CAACACAGAG CCATACATAA TACTTGATTC 7200 TCCTTTGTAA TAAGTCAGTC TTAAGCTACA CTCGAGGGGA GTGAGATCAA AATAAAATCA 7260 CAACCTTAAA GGGAAATACC TTTTAATTAA AATCGGGAAG AAACAATAGA GGGAAAAGAA 7320 ACACAAAGGA CCACAGAAGG AAGACACTGG AAGGCTTCAC TCGTTAATT 7369 // ID GYPSY6 standard; DNA; INV; 7826 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063431; gypsy6. XX FT source nnnnnnnn:1..7826 FT SO_feature five_prime_LTR ; SO:0000425:1..407 FT SO_feature three_prime_LTR ; SO:0000426:7419..7826 FT SO_feature CDS ; SO:0000316:1026..2003 FT SO_feature CDS ; SO:0000316:2277..5315 FT SO_feature CDS ; SO:0000316:556..7012 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. XX SQ Sequence 7826 BP; 2274 A; 1929 C; 1744 G; 1879 T; 0 other; AGTTAACTAA GTTAACCGGG CGACGGCTAC GACCGATCGT CCGAAGACTC GCTCCGGCCA 60 AATTGCTGAC ACAGCGTCTG GCCGGAAGCC CGTGCATAGC CGGCAAACAC TACCCATTGG 120 AGTGGCCGCT ATGAGTTTCA TATTTTGTTA GCCTTAAGTT CAGTTTGATT TTATATTCAA 180 TAAAGAGCGC ATCGCGCCTT GAATCGACTC CAGCTTTTGC TGTCATTATT AAATTGGTTG 240 GTTGGTTGAC TAGCCTTAAG GGCACTCAAC AACGGAGAGA CGTTCTCCCA CCATATCTTC 300 TAATTCTGAG GAGAAGAGGT CTGCGGCAAC CGCCCTGCCT CCCGGATACC TGCAACCTAC 360 GCCGGAGACC GCGGCGAGGG ATCTACACCA ACATAATTAG TTTAATTGGC GCTCAACTAG 420 CGGGAAGTAC GCACCCGCCC ACATCATACA GAACATGTAC ATCTTCGACA CAAACCATAG 480 TTTCAGACCA GTGTTAAAAT ATTAAATAAA CCAGTCTGAA GAGTGTTTTT AAAATCTCAA 540 TATTGAATGT GACGGCTAAT AGGTGTTTGG TGTTTATTTT CACTGCGAAA AAAATTTAAT 600 TAACATCATA ATCCGTTGCA AACACGGCAA AAAAAAAAAA AAAAAAAAAA ATAGCCTAAT 660 TTAATTCGCT CACAGTTGTG TTACGGCGCA TTGCTATTTC GCCGAGCAGT TAAATTTTTT 720 TTTTTGCAAA TATTTTTGGT GAGGAAATTA ACATATTGCG GGGGCTAAAT ATCAGAGCCA 780 AGGCTCTAAC AGTTTAATAT ACCAGATTTG AATTTTTTAT TTGACATTTA ATTCAGTTTT 840 GTATACTTGG TATTAGCTTA ACAGTCCATT TTGTATCTTT TATTGTCCCC ACTCCACCGC 900 TGTGAACCCG ATACGGCCGC GCATGCATTT GTATTTCAAT TATTTATTCT TCGCTTCTTT 960 GTTTATAGTG GCGGAAGCTC AGCTCCCACC CACCCCTTAC CTTGCCGCTG CTCCGCGAGG 1020 TGACGGCTTG GTTCGAACAG CTTTATACAT CGTTGCAAAG GCGGCGGCTA CAGTCCCCGC 1080 GCACCCCTCA CCCTACCGGT GCTTCTTTGT TTATCGTGGC GGAGGCTCAG TCCCCGCGCA 1140 CCCCTCACCA TACCAGTGTT GGCACTTGGA AAGGCGGCAA CTGGGCTCAC ATACATAGTT 1200 GTTAAGGCGG CGGCTCCGTC TCCGCGTACT TCTTGCGTTG CCAGCTGCCT TCGTGGGCAA 1260 AATAGCTGAG CGGGCATTGC ATACCAGTGC AGTTTAACCT TAAGCGCATA TGTGCTCCCC 1320 GTTTACACAA TATAGGGAAC ATAAACCTAG CGACTCAGAT TCTGACAGTG AAGGGCCCCC 1380 GCGTATTTTC ATCCCAATAC GGGCACCCAC CAACCCCCCT TCAGGTAGAA CCATGGACTC 1440 AGATCAGCTT AAGGCTGTGA TTCAGACCGC CGTTAATCGT GCCTTGGCAG AAGCAGCCGG 1500 TGAAGCCAGG CGTAGAGAAG AAGAAATGCG CCAAGTTATA CAACAGTTGG CTACCCAAGT 1560 TGCAGCGGTG CAAATCGCAC CCACACAAGC GGCAGTCCCT ATAATCAAAG TATACCAACC 1620 CATTGATATC ACGGGCAACG TCGAGTGCAG CGAGCCTCTG GATGCCGTAA AATGCTTGCC 1680 CGAATTTACG GGAGCACAAG AGACATATGT CTCCTGGCGG CAGGCGGCGG TAGCCGCATA 1740 TTACATATTT AGGAATTATG TAAATAGCTC ACGCCACTAC CAGGCGGTCG TTATAATCAG 1800 AAGTAAAATA AGAGGCCCCG CCGATGCGGT GCTATCTTCG TTCGGCACTG TATTGAATTT 1860 CGATGCGATC ATAGATCGCC TCGATTTCAC ATATAGTGAC AAACGCGCGA TTCACGTCAT 1920 CGAGCAGGAA ATGGGCACCC TCAGACAGGG AAGCCTGACT CTATTACAAT ACTACGACGA 1980 GGTCGAGAAA AAGCTCACCT TGCTTACCAA TAAGGCCACC ATGTCGTATG AAACGGCGGC 2040 GGCAAAAATC TTATGCGACA AATTCCGGGA TGATGCGCTC CGTATATTTA TTTCGGGACT 2100 CAAGCGCAGT CTTTCCGATG TCCTCTTCTC GGCGAAGCCG AAGGACATGT CAACTGCGTT 2160 GGCATTAGCT CAGGAAGTGG AATCAAACCA CGAAAGGTAC ACAGTCGCAA CATCGTTTGC 2220 AAAAAGCCTG GAGGATAGGG ATAGGAAGCA ATACCCAATG GCGCAAGAAC GCCAACAAAC 2280 GCCCTACCAA GCGCACTCCC AAGTAAGCAG GGGAAAAAAC CCACACTTTA TTAAACAGGT 2340 TAAGGCTCAG GTACACTCGG CTCCGCATAA CGACAGGCGT CGTGAAAACA CATCAGAACC 2400 CATTGGAGGT TGACCCGTCT ATGTCCAGGT TGAGACAACC CACTCAGGCC TACCAAAACG 2460 GGAAGCCCGC CCACTCCGGT CGTTCACACC CTCACAAGAG ACAGAGGGTT AATCACATCG 2520 CTCAGACCAT AGGTCAGGCC GAAGGCACGT ACGCAACCAC AGCGTCCAGC GCGGCGGCCA 2580 AAGTGGATGA CGACACCATT TCTGAATATG ATTTAGAAGT GATTAATTTT TTAGGGGAAA 2640 ATCCCTGCTG CCCGTCATCA GACGAAGAGT AGCGGGGAAA GAGATGAAGT TCCTCATCGA 2700 CACGGGTGCG TCAAAAAATT TTATCCGGCC TCACAAAGGC TTAAAAGGCG TTCGCCCCGT 2760 CGATTCCCCA TTCACCATCC ATTCACTCCA TGGCGTTACC ACGATCACGA AAAAATGCTT 2820 CGTGTCACTT TTTGATTGGA AGGCCACGTT CTTCATTTTA CCTGATTTGT CCTCCTTTGA 2880 CGCGATAATT GGCCTCGACC TACTTAAACA GGCAGGGGCA TCGCTTTGTC TGGCCTCTGG 2940 CCACCTCAGA TGGGGCAATG GAGAAGAGAA AATTGAATTC CACCCATGCC CCGACGTCAA 3000 TTTTACCGAA GTGGACTGCT CAGATGCGCC ACCCTTGGTC AGAAACGCGT TCTTAGAAAT 3060 GCTGAAGACT AGGAAGAAGG CTTTTGCAAA CCCTAACGAG GCTTTGCCCT ATAACAGATC 3120 GGTGGTAGCT ACCATCCGAA CAGTTAGTGA GGAGCCCATA TACGCAAAGT TATACCCGTA 3180 CCCGATGGGG ACGGCGGACT TCGTCAATAA AGAGATTGAA GACCTGCTTA AAAACGGGAT 3240 AATTCAGAAG TCGGTATCCC CTTACAACAA CCCGATATGG GTTGTATATA AGAAAGGGAC 3300 CGATGACCAT GGCAACCGGA AAATGAGACT AGTTATCGAC TTCCGCAAGC TAAACGAAAG 3360 AACAGTGCCC GATAAATATC CCATGCCAAA TATTAACATG ATATTGAGCA ACCTAGGCAA 3420 TGCGAAGTAT TTCTCCACGC TGGACCTCAA GTCTGGCTAT CACCAAATCA TTCTTGCAGA 3480 ACGCGACAGG GAAAAAACCT CTTTTTCTGT GAATGGGGGA AAATACGAAT TTCGCAGATT 3540 ACCGTTTGGC CTCAAAAATG CAGGTAGCAT TTTCCAGAGG ACAATCGATG ACATCCTACG 3600 GGAACAAATC GGCAAGTTCT GCTATGTTTA CGTTGACGAC GTAATTATCT ACTCCGAAGA 3660 CGAAAACTCT CACATCAAGC ACGTAGATTG GGTTCTAAAG AGCCTGCACG ATGCGAACAT 3720 GAGAGTATCG GCAGAAAAGT CCAGCTTTTT TAAGAAAAGT GTGAGCTTCC TTGGGTTTAT 3780 AGTCACTTGT AACGGTGCTA CAACAGACCC AGAAAAGGTT AAGGCTATAA AAGAATTTCC 3840 GGAACCCAAA AGTGTTTTTG AGGTACGGTC ATTTCTAGGC CTAGCCAGCT ACTACAGATG 3900 TTTTATTAAG GACTTCGCGG CAATAGCAAG GCCTATATCA GACATCCTAA AAGGGGAAAA 3960 CGGAACAGTT AGTAGACACA GGTCACGAAA CATTCCGGTT CAGTTCTCGG AGACGCAGCA 4020 ACAAGCGTTC CAGAAACTAC GAAACATCTT GGCATCCGAC GACGTGATGC TAAGGTACCC 4080 CGACTACAAA AAGGCATTTG ATCTAACGAC AGATGCCTCG GCCCATGGCA TTGGCGCAGT 4140 ATTGTCCCAA GAGGGACGCC CAATAACAAT GATCTCAAGG ACATTGAAGG ACAGGGAGGT 4200 TAACTACGCC ACCAATGAGA GGGAACTCTT AGCCATAGTC TGGGCTTTGG CCAAGCTACG 4260 GCATTACCTG TATGCGGTTA AAGACATCAA CATCTTTACC GACCACCAGC CGCTAACCTT 4320 TGCAGTATCG GAATCCAATC CGAATGCAAA GATTAAAAGG TGGAAAGCGC GCATTGATGA 4380 ATCAGGAGCG CGTATTTTCT ACAAACCCGG CAAAGAAAAC TTGGTCGCCG ATGCATTGTC 4440 ACGGCAACAA ATTAACGTCA TGGAGGAGCA AGAAGCTCAA TCGTGCGTGG CCACTGTTCA 4500 CAGCGAACTC TCCTTGACGC ACACTATCGA AACGACGGAT AAGCCCCTAA ACTGCTTTCA 4560 GAACCAGATA ACTCTGGAGG AGGCACGCTT TCCGTTAAAG CGCAGCTTCG TCCTCTTTGG 4620 AAACAAGAGG CGGCATGCGA TTAACTTCCC CTGCAAAGAG TCATTGATTG ATGAACTCGC 4680 AGATGTAATC GTTCCGAAGG GCGTAAACGC CATTCATTGT GACCTGCACA CGCTGGCACT 4740 AATACAGGAC GAGTTGGTTC GGAGATTTCC AGCCACCAAA TTTTGGCACT GCAAAAACCG 4800 TGTAACGGAT ATTTTTGCAG TCCCTGAAAG ACGGGAAATT CTTACCGTAG AACACAATAG 4860 GGCCCACAGG TCGGCCCAAG AAAACGTTAA GCAGGTACTC TCCGAGTACT ACTTTCCGAA 4920 GATGACCAAG TTGGCTACCG AAATCGTAGC AAACTGCAAA ACATGCGCAA AAGCCAAATA 4980 CGATAGGCAC CCAAAACAAC ATGAGCTAGG CGAGACTCCA ATCCCCTCTC ATGTGGGAGA 5040 ACTACTGCAC ATCGACATTT TTTCTACGGA TAAAAAGTAT TTTCTCACTT GCATTGACAA 5100 GTTTTCGAAG TTCGCAATTG TGCAACATGT CCACTCTAGG ACAATCGAAG ACCTTAAACC 5160 GGCCATACTA CAGATTATGA ATTTTTTCCC TAGGGCTAGA GTAGTTTACT GCGATAATGA 5220 ACCTTCATTA AATTCGCATA CTATCTCGAC CATGCTTGAC AATCACTTTG GCGTTAGCAT 5280 TGCCAATGCA CCGCCACTCC ATAGCGTGTC AAATGGGCAA GTGGAGCGCT TTCATAGCAC 5340 CCTTTTGGAG CTTGCTCGTT GCCTAAAAAT CGACAAGGGC ATAACCGATA CTGTGGAAAT 5400 TATTTTGTTG GCAACCGCCA AGTATAATGA GTCAATTCAC TCCGTCGTTG ACAAACGACC 5460 AGTTGACATC GTGCAGGAGC ACCCAGATGA CCCACAGACG GAAGTCCGGA ACAGAATCAT 5520 TAAGGCACAA AACACGCTCA GGACCAGGGA AAACGCCTCC CGACAACACA GAGTATTCGA 5580 AGTCGGCGAG AAGGTATTGG TTAAATCCAA TAGAAGGCTG GGAAATAAAC TCACACCATT 5640 GTACGAGGAG AGAGCCGTAG AAGCGGACCT GGGGACCACG GTCCTCATCA AGGGGAGGGT 5700 GGTCCACAAG GACAATCTTC AATGATATGA CCAACAATAT CCTTTATGGT TTTTTTTATT 5760 ATTCATTTTT TATATTATAT TTTTTATCAC ACCTTCAAGC CATCTGGCGC ACTTTATTTT 5820 TTATATTCAT TAGTTATTAC ACCTTCAAGC CGCTTGGCGC ACTTTGTTTT TTACTATAGG 5880 TTCGTAGCCA CTGGCATAGT TTGTTTTAAG TTCATTTTAT CGGTGGTGGG AAAACCCTCT 5940 ATAAGCATAA AAATAGCTAA AGCTTAATTT CACAGGATCG GACCAACATT TTGCATACTC 6000 CTGCCCTTGG CGTCGGCCCA CGTTACCGAT TATTCACAGG CCAGGTACAT CCCCGTTATA 6060 GATGGCGAAA TCCTAGTATG GGAGGAGTTT GCTTACGTCA CGCACACAGC AAACCTCTCA 6120 GAATATGGGC GTGTAATAGA GGAGACAACC AACATGATAG ACATGTTTCC GCTATCCCAT 6180 ATGAAGAAGC TTCTGAGCGT GGATACCGCC CACCTCCAGG ACTTGTTAGA GTCGTTGGGC 6240 GTTCATCACA GAGTAGCTAG GAGCTTGGAT TTCTTAGGAC CTATGCTAAA GGTTGTAGCA 6300 GGGACACCAG ATTCCAGCGA CTTAGAAAAA ATCAGGTTTA CCGAAATGAG GTTAGTCGAG 6360 TCTAGCAATA GACAGATCCA AATTAACACC AAAACCCAAA ATCAAATTAA TCAACTTACC 6420 TCTACCGTCA ATTCAATTTT GAGATCAGCC AAAACCTCAC AAATAGACAC TGGACACTTA 6480 TATGAAACAC TGCTTGCTAG AAACCGCATG TTTATGGCGG AATTACAAAA TTTAATGCTT 6540 GCAATAACCC TAGCCAGGAT TAACATTGTT AGTCCGAACA TTTTAGATCA CGAAGACTTA 6600 GAAACAGTTT GGCTTGAGGA ACCCACCGAC ACACCTATAG GAGATCTTTT GTCCGTCTCG 6660 TCTGTAAAGG TTTTACAGTC CCGTAACATT TTACACTTTA TCATTAAATT CCCCAAAATT 6720 AAATTAGCCT GCAAGAAGAT CACTATTTTC CCAGTTGCCC ATGGTGGAAC GATGTTGCAG 6780 ATCATAGACA ATGTCATAGC CGAATGCAGC GGAGAAGTTT ACGCCATCAA AAACTGCTCC 6840 GAATCACCGA GAGCCACATT CTGCCGCCTA GCTTCAGAGA GTTCGTGCGC CAAAGACCTG 6900 CACGCCGGTG GGGTAGCACA CTGCCGAGTA CAAGAGAGTG ACCTGCATCC GATAACCTAC 6960 GTAGACGAAG GAATTATTAT CATCAACGAT AGGTCGGCCA AAGTGCGAGT GGACAACGGC 7020 ACAGAAATCT GGACTCATGG CACACACCTC ATAACCTTTG ACAAACAGGC CACCATAAAT 7080 GACACACTCT TCATCAACCA CAATAACACC CAGAAGAGAG CTCCAGGGAC AGCAAGTCTT 7140 CCCTTGTTGA ACATCACCGC CACCCAAGAT GTCCTCAGCC TCCCGTATCT TCACCGTCTG 7200 AGTGAACGAA ACTTGGAGTT CATTAAGGAG TTCAGAGAAG AGATTGAGAA CCAAAGAACA 7260 CGTCTCGTAG CAATTATTGC AGGAGCAATA TGCTGCGCAC TCATCTGCAT CGGGCTTATT 7320 TTTAGGCGTT TCACTGAGGC AAGAAAATCC GCAGGCCAAG TTAGGCAGAT TATCGCTGAA 7380 CTACAGACGG CCGAGGGCGG CCTTAATTCT GAGGGGGGAA GTTAACTAAG TTAACCGGGC 7440 GACGGCTACG ACCGATCGTC CGAAGACTCG CTCCGGCCAA ATTGCTGACA CAGCGTCTGG 7500 CCGGAAGCCC GTGCATAGCC GGCAAACACT ACCCATTGGA GTGGCCGCTA TGAGTTTCAT 7560 ATTTTGTTAG CCTTAAGTTC AGTTTGATTT TATATTCAAT AAAGAGCGCA TCGCGCCTTG 7620 AATCGACTCC AGCTTTTGCT GTCATTATTA AATTGGTTGG TTGGTTGACT AGCCTTAAGG 7680 GCACTCAACA ACGGAGAGAC GTTCTCCCAC CATATCTTCT AATTCTGAGG AGAAGAGGTC 7740 TGCGGCAACC GCCCTGCCTC CCGGATACCT GCAACCTACG CCGGAGACCG CGGCGAGGGA 7800 TCTACACCAA CATAATTAGT TTAATT 7826 // ID INVADER5 standard; DNA; INV; 4038 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063426; invader5. XX FT source nnnnnnnn:1..4038 FT SO_feature five_prime_LTR ; SO:0000425:1..352 FT SO_feature three_prime_LTR ; SO:0000426:3686..4038 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4038 BP; 1204 A; 692 C; 934 G; 1208 T; 0 other; TGTCGTATAT TCCTTTTTAA TTTCAAATTA TAACGTACAT TTTATATTTG CTAAAAGCTG 60 GGCATATTGT TTTGTTTGGG TATTTCTTCT GTATTCATAA AGCGAGGCAG AAATAGTTTC 120 TGTCAACGCA GTCTGGCATG GGCATGAATG GGACGGATAG AGTCCGGTAA AACTCACCAC 180 AGTGTGTCTG TGTAGTGGTG TATGAAAGAG AATGAGAAAG GGAAACGCTT GGTTTTGGTC 240 ATTCTTGGTG ATGAAATAAA GAGGGATAGT CATGCCACGT GCCAGAAAGC GTTTCGTTGA 300 GGAAACAGAG GAAGACTCTT GTAACGAGGA GTCTACAGAC CTTGGATCGA CATCAGAAGT 360 GGGATCTTCC AAGCCTCACA ACTGTGGAGA CAACCTGCGA ACACTCTTGG AGTCGCAAAA 420 TCACACCTTT GCGGAACTCT TACAGGCTAT GCAGCGAACG CAAAGAGTGC CAAATGACGC 480 AGTAGCTGTA GTGCTTCCAA AATTTAACCC CGATCGCACT GGTTCAGACG GAGCATCATG 540 GTGTTCAACA GTGGATTATA TTCTATCTGA AAATCAACTA CATGGCACCT TTCTCGTTAT 600 GGCTTTCAGC AAGGCGCTGG AGGGAAGCGT GTCTCATTGG TTATCCCAAA CATGCTTTCC 660 GGGAATCACA TGGATGCAGT TTCGGGAGTT GTTTTTGCAA AATTATGCAG GAACTGAGAC 720 ATTGGCCGCA ACAGTTATAA ATCTGCTGGA TGGACGCCCA AAAGAAGGAG AGTGCTTATC 780 GTTGTATGGG AGCAGGATGG TTACTTCTCT CGTGTCCAGG TGGAAGTCAC TGGATGTGGA 840 ACAAATTGCA GTTTCGGTTG TGCTGGCGCA TGCAGCTGGT TTTAATAAGC GATTACGGCG 900 TCTAGCTTTT ACGACAGATG TGAACACTCG GAACGATCTG CATCAGGAAT TAACAGCCTA 960 TTCTTTTGAT GCCCGACAAA GTCAATCTTC ATATGGTACC ACGTCAGCAC CATCAGCGAA 1020 AAGAGTCAAG ACATTCTCTT CCGGAAAACT TGGTCATAAA CAGGCGGAGT GTCGTTTAAA 1080 GAAGGAAGCA ACTCAAAGCC ATCATTCTGC TGGTCAACGT TCTGGCTATA CTGAAAGTAA 1140 GCGCCGGTCC ACCTTGACCT GTTTTAAATC TGGAAAAGTT GGACATGTGG CCAGCGTGTG 1200 ACCGGATCGT GAGGAGCGTC AATATATGCA CAATACTGGA GCCTGTGGAA ACGCTGAAGT 1260 CCAGTTCATC TCCGTTTGCG AGTCCAATAC TACATCCAAA TTCAATCGAG ATAACAGCAT 1320 TCGTGACTCC AGAGGGACAA TATGCATTTT TGGCAATGCC TTTTCGGTTT AAAAATGCCA 1380 CTTCTGTTTT CTATGAGCGA TAAATTGTGC TTTAGGTGAT CTTAACCATA AATACGTCAT 1440 TGTTTATATG GACGATGTAC TTATAGCTGC CAGCACTAAG GAAGAAGCTT TCGATAGATT 1500 ACAGGTAGTT TTGCAAACAC TTATTACTGT CGGATTTTCT TTCAATATGT TCAAATGCTC 1560 TTTTCTTAAA TCTCGTATTG AGTATTTGGG TTTTGGAGTT GAAGCAGGTG AGGTGCGTCC 1620 AAATCCTGGA AAAATACATG CTCTTGTTAA TCTGCCGCCT CGCAAACTGT GACACAATTG 1680 AGACAGTTTA TTCGGCTAGC TTCGTACTTT CGCGAGTTTG TGCCCAAGTT CTCGCAATTG 1740 CTAAAACCCC TTCATGCTTT TACATCAAAG GCCATATCAT TTGCATGGAA TAGAGAGCAC 1800 GAGCAAATAC GAAAGAAAGT AGTTACAAAA TTTGACTTAT GAACCTGTTC TTATAATATT 1860 TGATTCACAG TTTCCTATTG AGCTTCACAC AGATGCCAGT GCAGACGGCT ATGGAGCTAT 1920 TTTGTTACAC AAAATAGAAA CCAAGCCTCG TGTTATTGGG TATTTCAGTA AACGGATGTC 1980 TCCCGCAGAA TCTCCGTTAT CATTCATACG AGTTGGAGAC GTTGGCGACA TTGTAAAATT 2040 GTAATTGTAA TTGTAATTCA CCTAAAGCTT CCCATACTAA AGTCGAGTTG ACTCCTAGGG 2100 TCCATCGCTG GTAGTCTTAC AAGCAGACCT TCGATTTTGA TGTAGAATAT AGACCAGGTA 2160 CACGTATGGC TCACGTTTAG TTTCTTTTCC GAAATCGTAT ACCTGAATAC CTGATATAGT 2220 TGAAAAACAC ATTAATTTGA CAGAGATATT TGACAATTGG TTACTAGCCG TACAGCAAAA 2280 AGATGGACAG ATATTGTTTC TAAGTTGCGT AATAATGAAC TTAGTGAGGA ATTGGCTAAA 2340 AAATACGAGT TGCGGTCAGG AATATTATAT CGAAAGATTC AAAGGAACGG GAAAACCCCT 2400 TGCCTTCCTA TTATACCAAA GTCGTTTATG TTAATAATGT ACATACATGA AGCCATTATG 2460 CATCTTGGTT GGAAGAGAAC CCTTAAAAAA ATGTATGAGT ACTATTGGTT TGAGAAAATG 2520 TCGAAATATG TTTGTAAATT TGTAGACAAC TGCATCACTT GTAGGCTGTC GAAACTATCA 2580 TCTGGCAAAA TACTACCATT ATTTGGTGTC CTTCTCGTTT AGTTGCTGAT CAAGAGATTT 2640 TTGCTCTGCT CAAATTTTTG CTCTTGCAAA TGGTCAGGTG GAGCGTGTTA TGATTAACTG 2700 TAGCTTAAAC AGGCCAGGGG TGTTGGCAGG ACTCTTTGGT TGACGTTCAA TTGGCTCTTA 2760 ATTGCACGCC CAACATGGTG AATAAAGTAA GCCCATTAGA GTTACTCATT GGTAAAGAAG 2820 CGAGACCATT TGTACCTGTT GTGGAAAATT AACTACTTTT CATGTCCTCG TAGTAAGAGC 2880 AATGGAAAAA CTTAACATGG AAAAGATAAA ATTAGATTTG ACAGAAATGA AGTAAAATGA 2940 AGTCGGGGAC TACGTACTAC TTAAGTATGA AGAGCGTCAT CAAACTTAAT TAGATGCCAA 3000 ATTTAGAGGT CCATTCTTGG TAGTTGAAAC ATTACAGGGT GATCGGTATA AACTAAAAGC 3060 TTTGAATAAT AATCGCACAT ATAAATATTC CCATGAATTT TTACAAAAAA TGTCTGAGAC 3120 CGGGATTGCC AGAAAGTTTG AAGATGATCA AGGTGATCAT AAAGATGGTC CAGGTCGAGC 3180 CAGTGAGGGC TGCGACATTG AAATTGAATA ATAAGATAAT TTTCGGATAG TACGGCTCTC 3240 TGCACATATG GGTGCCCAAG TTATGTGGTT CGCCATTTGT CGTAGTTGAG TTGGAATCTC 3300 GAAAGTGATC GGAAGTATTA AGTTACTACT TGGCCGGTAT AATATTAACT TTGTGCTATT 3360 TAAGTTGGCC GGTTTAGTGA TAACTGTGTG CTATGTGTGT TGGCAAGTTA TGTTGTTGTG 3420 TGTGAGCTAG AAAAGCGAAA TATTGACTAG TAATGGTTTA AATGAAATAT AGTTATTCTA 3480 AGAAAGTGCA GTATATCGAA GTTGTTTTGA TGTTGGGTTG GTGGTAATCA CAGTCTTATG 3540 GCATAGATAT AAAGAGAAAT GTTAAGTAAT TAAGAGATGC TTGTTGATGT TCTAAAAAGG 3600 AAAGGAAAAC ATAGAGAAAA AAAATGTATT TTGAGTTATC TACAAATTGT AAACTCAAGT 3660 GGACGTGTGT AGGTCAGGAA GGCCGTTGTC GTATATTCCT TTTTAATTTC AAATTATAAC 3720 GTACATTTTA TATTTGCTAA AAGCTGGGCA TATTGTTTTG TTTGGGTATT TCTTCTGTAT 3780 TCATAAAGCG AGGCAGAAAT AGTTTCTGTC AACGCAGTCT GGCATGGGCA TGAATGGGAC 3840 GGATAGAGTC CGGTAAAACT CACCACAGTG TGTCTGTGTA GTGGTGTATG AAAGAGAATG 3900 AGAAAGGGAA ACGCTTGGTT TTGGTCATTC TTGGTGATGA AATAAAGAGG GATAGTCATG 3960 CCACGTGCCA GAAAGCGTTT CGTTGAGGAA ACAGAGGAAG ACTCTTGTAA CGAGGAGTCT 4020 ACAGACCTTG GATCGACA 4038 // ID DIVER2 standard; DNA; INV; 4917 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063439; diver2. XX FT source nnnnnnnn:1..4917 FT SO_feature five_prime_LTR ; SO:0000425:1..385 FT SO_feature three_prime_LTR ; SO:0000426:4532..4917 FT SO_feature CDS ; SO:0000316:436..4147 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4917 BP; 1152 A; 1227 C; 1203 G; 1335 T; 0 other; TGTTGGGTCA TGCAGCCGCC AAAACTGTGC AGTAGACTAA TCTCATACTC TTTAATATTG 60 TAAGCCGAAG TAGTCAGTGG TAGCAGTTGC GATGCCCATA GTGCAAACCG CTGTTCTCGC 120 ACGCATTTAT CTGTACGGCG CTCATTCCTT GTTCTTTTTG TTATCTACCT ACGTTAAGCT 180 TGGGCGCTGC ATTTCGGCTG TCTGTCCCGC CAATTGCCAC TTCTTCGTCT GTCGGAAGAG 240 ACTAAACTTG TGCATTCGAT ATAGCTCTTT GTCGGCCCTA GCTGCTGTAA ACAATTCGCA 300 AAATAAACAG TATCGTAAAA TAAACAAACT TTCTTCGGCT ATTTATTATT CGCAACAGCT 360 ATTGAAAAAG CTCAATAGCT AAACATTGGT GACCCCGACG TGATAGTGTT CATTTGCATG 420 CATTAATTAA TGCACTTATC CCGTTAATAA ATAAATATAA ATACATTAAA TCGCAATCGG 480 TTGGACAAGT GAGTGGTGAT TTCGGTGTTA GTGGCTGTGT TTGAGCGAGC TAGCATTACA 540 TATGTACATA CATACATTGC TACATATATC GTTACGTGCA TTGACTCCGC TGGCATTTTC 600 GGCATATATT TTGTGCTCAT CTTCCCGCTG AACCGAATTA AAGGCGATTT GTTGCCATGC 660 CCGCTGAAGG TAACAAAAAC AGGGTGACCT TTTTAAAGCG CAAAGCCAAT GCGCTGTTCA 720 GCAGGTTGCA GAGGCTACAA ACGTCGCTGG CTAATGAGAC GCTAAGCGGA TACGATGAAC 780 CCACTCTTAC GGTGAGGCTG GAGCACTTTG CAAAACTGCA AAGTTTGGCG GAGGCTACGC 840 AAACTGGGCG GATTTTTACT CCGTGTTCAC GAGCATAATC GACAGCCATC CGGACCTCTC 900 CAACATTGAG AAATTTTAAC ATCTGCGGTC ATGTTTGAGG GATTCGGCGC TGGAAACTAT 960 CCGATCATTG GATATTTCAA ACAGCAATTA CGAAGCGGCT TTAGAGTTGT TTCAAAAAAG 1020 GTTTGATGAA TCGGCGTCTC GTTTTTCAGG CGCACATCAC CGAGATTCTG GGTTCAAAGG 1080 TAGTACCGGA ATGGTTCAGT GGCATCGCTT CGGGAATTGT CGGACAAGTT TAACGCTCAC 1140 ATTCGTGCGT TGAAGGGTTT GGGCACCACT GAGCAAATCG CTGGCTGCAT CATAGTGCAA 1200 GTGCTGCTGC AAAAGCTGGA TGCGGCGAGC CAAGCTAAGT GGGAGGAGAG CTTGGAGGAT 1260 CCGATCTTTG CCAACCTTAT TCCGTCGTGG AAATCGATGG CTGCATTCCT GGAGCAGCGA 1320 TGTAGGACTA TGGAGGCCGC GGATTGCGCC ATGGCAACCT ATGCGCCAGG CGTTCAGGTG 1380 GGCAGAAATC GTTCGACGCT AGTTGCTACC ACCCAAAATT CTCTTGGTTG CATGCTTTGC 1440 CATAGTGCAG AGCATGCCAT ATATTATTGC CCGCAATTTA CAGACTTAGC GCCAGTAGAT 1500 CGTCTGCGCG AGGCAAACAG ACTATCACTT TGCCTAAATT GCCTAAAGGC AGGTCATCAG 1560 CTACGGCAAT GCAGCTCGAG CCGCCAACGC ACCTGTGGAA TCAGGCATCA TACGCTGCTC 1620 CATCTAGGTG GTCCGCCTTC CTCGCAGCCA CATGTTCCGG TGTCTTCAAG CTCCCACACT 1680 GAGCCTTCTG CCCCTCTTTC GACTTCTTCT ACTCTAATTG CCCAGGATCT CGGTAGTGAC 1740 CTTGTGCTGC TAGCCATTGC AACCGTTCTA GTGCAGAATC GGTCGGGACT GTTCGTTCCC 1800 TGCAGGGCCT TGTTAGATTC TGGCTCTCAA CTGCACTTGG TCACCTCTCG GTTTGCAAAT 1860 CAACTGCAAC TTAAGAGGTC AAGGTCGTCC GGCTCCGTCA CTGGAATCGG GGATTCCAAT 1920 TTCGCGACTG TTTCGGTCGG TCACTTCGGA TTTCTCAACG AGCATAACAG CAGTTATCGC 1980 TCCCAATATC ACGGGATCGC CAGACAAGTT TTAATGTGGA CATTGGGGAC TGGAAGATTC 2040 CAGAAAACCT GCAGCTCGCC GACCCGGAAT TTCATATAGC TCAGCGTGTT GACCTGTTAA 2100 TAGGAGCTAG CTTGTTTTAT GAACTGCTGT GCCGTAGGTC AGATAAGGTT GTTGCCCGGA 2160 CTGTCCCTGC TTCAAAAAAC TCGTCTGGGC TGGGTTGTGT CTGGAGGCTG CGCGCGCCCT 2220 TGCGGGTAGC GCCTTAATAG CTTCACGCGT TCCTTCTTCA GCCAGCAAGG AAAACAGCAT 2280 TTGATGTTGA ACTTGATTCA CTTCTGCAGC GCTCTTGGGA GGTAGAAAAG TGTCCCGGCC 2340 CAATAGTTCA AGCCACTAAG GAGGAGTTAG ATTGCGAGGC CCACTTTGGT TATAAATTAC 2400 ACCCGACTGC CAGCTGGCGA TTACTCGGTA CGGTTGCCGC TAAAACTCCA TTTGGAGTCT 2460 TTAGGAGATT CCTATCCTCA GGCTTTGCGG AGATTCTGGT CGCTGGAAAG GAAGCTTACA 2520 AAGCACCCTG GCTTGAGGGT TAAATATTCG GCGTTCATGA AGGAATATCG TGATCCGGAA 2580 CATATGTCGC CTGTGCCTGC CTCCGAGGTC AGCTCGTCCC GATATTTTTT TACCACATCA 2640 CTGCGTCATG AAGGAGGATA GCACTACCAC CAAGCTTCGC GTTGTCTTTG ACGGATCAGC 2700 TGCCACTTCC ACTGCTTACT CGCTTAACGA TGTGTTAATG GCTAGCCCGG TTATCCAGCC 2760 CAAGCTATTT CACATCCTAA TTCGATTTCC GCTCACACCC AGTTGCCATT CGGGAGCTGA 2820 CAGATGTCAC GGCTTGGCGT TATGTTCCAA CAGCGTTAAA TCCGGCTGAC ATCTTATCCA 2880 GAGGATCCCT GCCGTCTGAG CTTAGCGAAT CGTCACTCAC CCGCCTATCT TACGGAGCCT 2940 GAAAGAGATT GGCCAAAAGC TGTTTGTCCT GACAAACCGG TGCTTGAGCT TCGTCGAAGT 3000 GTGTTCGTCG TAAAGTCGCC GTACGTGGAT GTAATTGCTA GCTCCAAATT TGCAAATTCT 3060 TACCCCTCAC TGCAACGAGT GTTTGCATAC ATTTACAAAT TTTGTAATGG AATTCGTCAT 3120 CCTGGGCTCA CCGTCGCACA CATTCAAGAG GGTACTCATA TGATGCTGCG GTTGGTGCAG 3180 CGCGCGCAGT TATGGGAGGA CCCTGCAGTC CCTAAAAACG CTGGGAAGAG TTTCCTCGTC 3240 TAGTCCCATA TCCTCGCTCT CGCCGTTCCT GGATCAATTT GGACTTCTTA GAGTAGGCGG 3300 TCGCCTCTGG AATTCGTCAT TGGACTTTTA TGGCCGCCAC CCGAAAATCC TTCCAAGGTC 3360 CCATTCGGTG ACTCTGGCAA TTATTTCGCA TTACCATGAA ATCAATCTTC ATGCCGGACC 3420 TCGAGCTCTT CTGGGTGCAA TTCGATCACA ATATTGGCCT ATTGGGGGGA GGAAGACGGT 3480 TACCAAGGCC GTGAACAGGT GCATCAGATG TTTTCGGATT ATGCCGCGGC TGATAGAGCA 3540 CATAATGGCG GACCTTCCCA AGGAGCGCTT GGAAGGATCT CACGCTTTCG AGGTTGCTGG 3600 TATAGACTTC TGTGGACCCT TCTTTCACAA GTCGGATACT CGCAACAAGC CCGCGGTTAA 3660 ATGCTACGTC TGCGTATTTA TATGCTTTGC AACCAAGGCA GTGCACCTGG AGCTGATCAA 3720 GGATCTCTCG ACAGTTGCGT TTCTGTGCGG ACTCAAGAGG TTCATATGCA CCCGGCGGAA 3780 GCCGAAGCAA ATTTGGTCAG ACAACGCCAC CAACTTTGTC GGCGCCAAAA ATAAACTTCT 3840 GGAACTTCGA AGGTTATTTC TCAGCAGCGA GCATCAAGGC TCCGTTCAGA ATTTTTGCCT 3900 TTCGGAGACG ATTGACTGGC GGTTCATCCC TCCACGGTCG CCCCATTTCG GTGGACTTTG 3960 GGAAGCAGCG GTGAAAACGG CCAAACATCA TTTCTACCGC GCTGTGGGTA CGGCGGTTCT 4020 GACCTTCGAC GAGCTGAGGA CGCTGGTGTG CCACATCTCG GCAGTTATTA ACTCCAGACC 4080 TTTAGTTCCC ATTTCAGAGA ACCCTGCCGA TCTGGACGTC CTCACTCCGG CGCATTTTCT 4140 CAATGGTGGT CCGCCTTCGT CGTTTGACGA GCCAGATATA ACGGGCCTAA ACTATAATCG 4200 GCTTGACTCT TGGCAGCGCA TCTCCTTTCT TCAGCAAATA TTTTGGTCAC GATGGAAGGA 4260 AGAGTACTTG ACGTTGCTCC AGCAGCGCTC CAAGTGGCGC ACCCCAAAGC CTGGCGTAGC 4320 CGTGGACAAC GTCGTTCTTG TTAAGGACGA GAATCTACCC CCAATGAGAT GGCCTTTGGC 4380 GAGAGTCATG CAGTTGATTC CTGGCAGAGA CGGCGTCGCT CGAGTTGCAG AATTGAGGAC 4440 CGCGTCTGGA GTAATAAGGC GGGCAGTGAA CAAGCTGTGT CTGCTTCCCC TTGAGGACTC 4500 TGTTGGAAGC CAAGCTTCCA ACGGGGGGAG GATGTTGGGT CATGCAGCCG CCAAAACTGT 4560 GCAGTAGACT AATCTCATAC TCTTTAATAT TGTAAGCCGA AGTAGTCAGT GGTAGCAGTT 4620 GCGATGCCCA TAGTGCAAAC CGCTGTTCTC GCACGCATTT ATCTGTACGG CGCTCATTCC 4680 TTGTTCTTTT TGTTATCTAC CTACGTTAAG CTTGGGCGCT GCATTTCGGC TGTCTGTCCC 4740 GCCAATTGCC ACTTCTTCGT CTGTCGGAAG AGACTAAACT TGTGCATTCG ATATAGCTCT 4800 TTGTCGGCCC TAGCTGCTGT AAACAATTCG CAAAATAAAC AGTATCGTAA AATAAACAAA 4860 CTTTCTTCGG CTATTTATTA TTCGCAACAG CTATTGAAAA AGCTCAATAG CTAAACA 4917 // ID TRANSIB4 standard; DNA; INV; 2656 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063369; transib4. XX FT source nnnnnnnn:1..2656 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..40 FT SO_feature terminal_inverted_repeat ; SO:0000481:2618..2658 FT SO_feature CDS ; SO:0000316:545..2152 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2656 BP; 932 A; 455 C; 451 G; 818 T; 0 other; CACTATGGGT AAAAAGCCGG AAATTTTGGT ATTAATGTAA TTTTTTTTGT GATTGATATT 60 TTAATGCTGC TTTCATTTTT TGATTCCTAA GTAACAAGAT AGCATAAAAA TAATTTTAAA 120 ACATTTTTAC GAGTTATCGT TTATTTTTGG CGCGCTTTCA AAGTCAGCTG ATTCAGAGGT 180 GCTTTGAATT GTGAAAATTG GCGGAAAAAG TTGAACAAAT TGCAGCGCAA ACATTGTCTT 240 CATAGCAAAC TTTGAGTTTA TTTTTAATTA TGGAAAAAAC AAGTCAGAGT ATGTAAATTA 300 TTCGTGAATA CACGTTTTCC GATAATTGTG TGTGGTTGTT GTTGAATTGT CCAATGTATT 360 GCTTGTGAGT ATTTTCAATT GTTTACTAAC AAACGTTCTA GTGTTCTGTG GAATATTACT 420 TTTGTATGGC ACAACGAATC ACCAAAGTTA AAGTTGTATG TTTTTAACAA ATATATGTGG 480 AATCAAATGG TATTAATGTT TTTTGCCCCT TTTTGCCCTT TTTTGCGAAA CATTTTATGT 540 TGTAGGTGTG TATAAGTTTA GTCATGCAGT TCTGCTGGAT GTTTGGAGCA ACAACAAACG 600 CGATGCAGAC GCTGTTAACA GTTATATTTT GGAAAATATA GAAGAGAACT TTGAGACTAA 660 AGAGCAAAAG ATTACAATTT CAAAGAAAGT GAACCAATTT ATTACTTATG TAAACAAGCA 720 CTTAAGGAAA TGCAATCGAA TGTTGGACCG ATTCAAACGA AATCATTTAT TGTGGCTTGC 780 GTCTAAAATT GTTGTCGTTA TTGATAAGCC AGAAATGACA CTATGCAAAC CTAGCCGAAA 840 ACAGTTAACA TATGAAGATG CAGGGCCTCA ACCTAAAAAA AAAATTAGCA AATTAGCTGG 900 CTGCAGAACA AGGGCATTCA ACACCACTTC TCGTACATGC TGCTTCAATT TCCGCAAAAA 960 AAACGAAGCA GGCAGAAACG GCCTTTGTAT TAAAAAAAAA AGGATAAGCA GTGAAAACAT 1020 ACATGATATA AAAAAGAAAA TTGAGACAAA TCATCCTATT AAAATAAGTG CTGATAATGC 1080 TTTAGCTCTC TTAATTGAAA ATGGATTTAC AAAGCAGCAA TATAATAATA TAAAAGCGCT 1140 CAATAAGCAG CAAGGATGCG ATATTTTCCC TCCTTATTCA AAAGTAGCAG AAGCAAAATT 1200 GAAATGCAGG CCTTCCGAAC ATAAAATTTC TGAAAAAAGG CCAGGAGTAT CGTAGCAAGA 1260 TTTGCTAGAT CATACAGCTG GCCACATACT TTTAATACAA GAGTAAGTTT TTTGTGTTCA 1320 TCCAGACCCA ACGAATTGTA CATTAATTGT GAGCTATGGC TTCGATGGAT CTACGGACCA 1380 AAGCACGTAC AAACAACGCT TTCTATCAAA CGAATGTATC GCCCTAGATC AATTCCTTTT 1440 CGTCACTCCA ATTATACCGC TGAAACTAAT AGATGAAATC AACAATCGAG TTATTTGGAT 1500 ACATTCATCT CCACAATCGG TGAGATTTGG TCGGCCTTTG AAAATTGAGT TTATAAAGGA 1560 AAGTGCTGCC CTTATATTGA AGGAAAAACA AATTTTGGAT GCGGCTATAA AAAACCTTAG 1620 ACCATATTCC TACACTTCCA AAAACCAAAT AATAGCATAT ACTGCAGACC AGCGTTCGTT 1680 CTCGAACAGC AAGATATTTT CGTCCGTTTT GAATTTGGAC TACAAAATCA TTTACAATTT 1740 TTATATAATT TTGATTTCAA TTTCTTGTGA GCACATGATT GATGCGAAAA ATTTAAATCT 1800 CTATACCATT CTACTTTTAA TTTATACGTT AAATACGAAC CTTATATGGA GTACATACAT 1860 GGTAACATAG AGTAGTCACC AACCAATATT GGTACATGGA TATAAAATTG TAAGCCAATT 1920 TATTGTTCCT GTTAGTGTCC TTGGAGAAAA CGCGTCAGAA GCGCGCAACA AACTTTACAA 1980 GAGCGATAGA AAATCTCACG CCAGGAAATG TAGCCGACTA GCAAACATTA CAGATGTTTT 2040 TAACAGAGCA GTGGATTCAT CTGATCCATT GCTTTCAAGT CTTTGTCTAA GAGATCGAAG 2100 TAAGTTTGGA AAAAAGAAAC TCCTCCCAAA AGAAGTAATC GATATTCTTC AAAATCCGGA 2160 TGCAGAACTT TATAAATCAA GTTATCATTC TAATAATAAA GAAAAAGAAG TGGAAGTGGA 2220 AGACGAATTC AATTGCCTTT ATAATAAGAG GAGGCAGTTG AAGAATGACA ATCAACTTAA 2280 TAAAATATAA AAAAATTTAT TGACTTTTAT CTCATAATAA TGGTTTTTGC ATCTTGTGTC 2340 CTCACTTCGC TCCCACGCCA ATCCCCCAGG GAAGACACCT CCGGATCCAC CGGTTGAACC 2400 ACAGACATAT TAACACCCAC ACCTCCTTCG CCACCCATCC ATCTCCCTTT AAAACCGCTT 2460 TCATTTGTAT ATAAATACAT TTTTCACTTT ATTTTCAAGG ATTGTTTTTG TGGGCGGAGA 2520 TGAAGGCGTG GTCAGATCAA AAAATAAACA AAATCCAAAA TTCTATTTAA ATCCTAATTA 2580 TAAATATAAA AAAAATCAAG TCGCTATCTT TAAAATTGTA ATTAATACCA AAATTTCCGG 2640 CTTTTTACCC ATAGTG 2656 // ID S2 standard; DNA; INV; 1735 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063466; S2. XX FT source nnnnnnnn:1..1735 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..235 FT SO_feature terminal_inverted_repeat ; SO:0000481:1502..1737 FT SO_feature CDS ; SO:0000316:403..1437 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1735 BP; 597 A; 279 C; 317 G; 542 T; 0 other; CAGTTTGTCA AGAAATTGTT TGCACAATGA AAATAAATTG TATTTTCTGC CTTTCAATGC 60 AAAATTAAGG GCTTTTTGTT TACCTTATGG ATTTTTTTTA ATTTACATTT ATTTGTTTAT 120 ATGTTTTTTA TCAATATGTA AAAAAATAAT AACAAAACTA TTAACAATGA ACGAGAAAAA 180 CAACTAAATC CAAGTTTGCA CAGATTTGAC ACTGTCAAGA AATTGTTTGC ACACTTTTCT 240 TTTGTTTTCT ATTTTGCTCC TATTCCGAGG AATTTCGATA ATCAAACGGT ACATAATCGT 300 CATTTACGCT TTATGCGTTG TGTGGGTGGC CGATTTATAT GCATTTGGTT GTGAAATTTG 360 CCAAAGTGAT CATTGTTCCA GTTAGTTTTG TGAAATTTAG AAATGCCTGG CAAAAGATTG 420 AGTTTTGAAG TAGTTCAACT GATTTTTTAT AATCACCATT TGGGAAAATC TGTCAAGGAA 480 TTGTCAGAAA TGTTTTCTGC ATCGCGGAGA ACAATTTATA ACGTTATTAA TCGCGCAAAA 540 AACGAGGGCA GATTGGATTC AAAATGTGGA GCAGGACGTA AAAATAAAAT TTCCAACCGT 600 TCAGACCGAC TTATTATGCG AAAAATTTAT GAAAATCCGC AAATATCCTT AAGAACTGTT 660 GCTAAGGAGC TTAAGGAGGA GTGCGATCTG GATGTGTCAC ACGAAACAGT GCGCCAAGCC 720 ATACTCCGGC ATAAGTACCA CTCACGGGTA GCAAGAAAGA AACCGATGCT GTCGGCAATT 780 AACATCGAAA AACGGCTTAC TTTCGCTATC AAAATGCAAC ATCAGCCAGA AGATTACTGG 840 AATAACGTAA TTTTCTGCGA CGAAACCAAG ATGATGTTGT ATTACAACGA TGGACCCAAC 900 AGAGTGTGGC GTAAGGCGCT AACTGCGCTG GAGAACCGCA ACATCATACC GACAGTCAAA 960 TTTGGCAAAC TGTCGGTTAT GATTTGGGGA TGCATATCTA GTCGAGGAGT TGGAGATTTG 1020 GCATTCATAG AGAACACAAT GAATGCAGAA CAATTTTTAA ACATTTTGAA ATGCAATCTG 1080 ACATCCAGCG CCAAAAAATT CGGTTTTTAT AAGAACAATA GACCTGATTT TAAATTTTAC 1140 CAGGACAATG ACCCCAAGCA TAAGGCGCTT AATGTGCGGA ACTGGCTCGT TTACGACTGC 1200 GGCAAGGTCA TCGATACGCA TCCTCAGAAT CCAGATCTGA ACCCTATAGA AAATTTGTGG 1260 GTTCATTTGA AGAAAAAAGT GGCAAAAAGG TCACCAAACA ACCGAGCAGC ACTGAAAGCT 1320 GCTATACTAG AAGAGTGGCA TAAGATCCCA CAACAATATG ATCTTGAAAA GTTATTTTTT 1380 CAATGAAAAA AACGTCTACA ATATGTTATT AATGCTAAGG GTATGCATAC AAAGTATTAG 1440 AATTACCTCA AGTTATTATA ATTTGTTATA AACTATAAGT CTAAGATAGC TTAAGTACTT 1500 TGTGCAAACA ATTTCTTGAC AGTGTCAAAT CTGTGCAAAC TTGGATTTAG TTGTTTTTCT 1560 CGTTCATTGT TAATAGTTTT GTTATTATTT TTTTACATAT TGATAAAAAA ACATATGAAT 1620 AAATAGATAT TAATAAAAAA AATTCATGAA TTTTAATAAA AGGCCTTTAT ATTTTTGCAT 1680 TGAAAGTCAA AAATACAATT TATTTTCATT GTGCAAACAA TTTCTTGACA AACTG 1735 // ID DM88 standard; DNA; INV; 4558 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0062343; Dm88. XX SY synonym: copia2 XX FT source nnnnnnnn:1..4558 FT SO_feature five_prime_LTR ; SO:0000425:1..198 FT SO_feature three_prime_LTR ; SO:0000426:4360..4558 FT SO_feature CDS ; SO:0000316:317..4285 XX CC Full sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled and annotated by Josh Kaminker & Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4558 BP; 1480 A; 803 C; 1097 G; 1178 T; 0 other; TGTTGAAAAT ATGTATGTAA TATGTATGTA TATATAATAA TATGTAATAA AGTCAATGCA 60 CTGTGTCTCC CTCTTTTGGT CGCGGTAACC AAAAGCTTTT TTCTCTTATT GTGTTATCCT 120 CTTTAGCGTG TAATTTGGCT GCCTGCGTGC AGTAACATTG TACTCTAGAT CAGTCACAAT 180 AAATACTTGA AACGAATAGG TTATGGGCCC AGCCCACGCG GACATAAAAG TGAATAGTTC 240 TAAACGAATT AGAATATTCG GCATCTTTAA ATATTGTGAT TCAGGGATTT AATACGAAAA 300 ACTTATCGAA GTGAAAATGA GTTCATCAAT AAACCAAATA GAAAAACTTG ATGATGAAAA 360 CTACAGTGCG TGGGCTGTAC AAATGAAGAG TGTCTTAATT CATGCGGAAT TATGGGGGGT 420 GGTGTGTGGA CGTGTGGTTA AAAACGAAAG TGATAGTGCT GAGCTAAAGG CTTTGTTCGA 480 CGCGAAGGAC GAAAAAGCAT TAGCAAGTAT AATGCTATGC ATTAAGACAT CCCAAATTAA 540 CCATATAAAA AACTGTGAGA CTGCTGTAGA GGCATGGCAA AGACTTAGTG AAATCCACAC 600 TCCCTCAGGA CCGGCACGAC GTATATGTTC GCTAAAGCAG TTGTTGCACA TGAGAATGTC 660 TGAAACAGAA GTTGTTTCGA GTCATGTGAA CAATTTTTGT GCGGTTGTTG AAAAACTAAA 720 AGAAATTCAA TTGGTGATCC AAGAAGAAGT CCTGAGTATT CTCCTGTTGT CCAGTTTGCC 780 GGAATCGTTT GAAAGTTTCG TAGTTGCAAT AGAAACGCGC GATGAGTTGC CCACATTGAA 840 AATGTTAAAA ATAAAATTAC AAGAGGAAGG GCAAAGGCGT ATGGCAAATG AAGAACATTC 900 TGCAAAAAGC GAACAAAGTG CATTTGGAAT TCGGTCTGCG AAACAAAAAG CGAAAAGTGA 960 CATTAAGAAG AGTGTGCAAG TAAATAATCA GCAACTGATT AACGGGAAAA GAACAGTTAA 1020 TTGTTGGAAC TGTGGTCGCA GCGGCCACAA AGCTGTAAAC TGTATGTATA GGCAAAGGAA 1080 AGGAGAACAT AGAGAGAGCG AAAACTTAAA TACCGAGAAA TCATTCTCTG TGCTCTGTTC 1140 AACTGTACAG CTTGGCGAAT TGCCAAAAAA TATGTGGTGC CTTGATTCAG GAGCAACGGC 1200 TCATTTGTGC GGTGAGAGAT CGATGTTTAA AAGCTTCAGA GAGCATAAAG AGCGCATTCT 1260 TTTGGCTGGC AATAAGTATA TTATGGCTGA CGGTCGTGGT ACAGTGAAAA TAGCCTGGCG 1320 CAACTCATCA TTTGAATTGA TCGACGTACT CTTTGTAAAG AATTTACAAT GCAACTTTAT 1380 GTCGGTGTCG AAGGCAATTG AAAACGGATT TCGTGTGTCG TTCGAAAACA GGCGCGGAAT 1440 TTTAAAAAAT GATAAGAATA AAGTTGTTTT GATTGCTGAA CTGCAAAGTG ATTTGTTTAT 1500 GTTCCAAAAT GAAAGAAAAG TTGAAAGTGC ATGTTTCGTT GCACAGACAA ATTTAATGAG 1560 AAAGTGGCAC CAGAGATTCG GTCATTTGAA CTTTGCAAGC TTGAACAAAA TGATCAAACA 1620 AGAAATGGTA CTCGGACTAA AGAAAAAATA TTTAAGTGGC GCAAAAGAAA ATGACTGTTT 1680 GAGCTGCGCA AAAAGCAAGA TATGTGTGAA GAGTTTCCCA AAAGCCTCTG GAAATCGCAC 1740 AAATGAAGTA CTGGAGCTAG TGCATAGTGA CATCTGTGGT CCAATGCAAA CGACGTCTGT 1800 TGGAGGTGCA CGCTATTTTG TCACCTTCGT AGATGACAAA TCGAGATACA TGTTTGTGTA 1860 TTTCATAAAG ACGAGAGATG AAATACTGGC CAAATTCAAG GAGTTTAAAG CATTTGCCGA 1920 AAACCAAAGC GGCAAGAAGT TGAAAGCAAT AAGGAGCGAC AACGGGCGTG AGTATTTAAG 1980 TAAAGCGTTT CAATCTATCT TGACAGAAAA CGGCATAAAG AGACAATTGA CGGTACCGCA 2040 CACCCCACAA CAAAACGGTG TAGCGGAGCG CGCTAACCGA ACCCTGGTCG AGATGGCAAG 2100 GACTATGATG ATTCACGCAG GCGTCGGCGA CTCACTATGG GCTGAAGCTA TCAATACTGC 2160 TGCTTATCTA CGCAATCGCG CTGAGACGTC AGCATTATCT GATGTAACAC CGTTCGAGTC 2220 TTGGACTGGA AGAAAGCCAT ATGTGTCCCA CCTGAAGATA TTTGGTTCGA AGGCTATCGT 2280 GCTTAACAAA TCCACAAAAA GGAAGTTTTC TGCGAAGGGC GAAGAAAATA TTTTGGTTGG 2340 ATACTCTGAT GCTTCTAAGG GATATCGCCT GTTCAATCCT ATCAGGAAGA ATATCTGTGT 2400 TGCTCGTGAC GTAATTGTAT TTGAGAACGA CCAGGATGAT GGTATGACGG CAGGGCAAGC 2460 AGTACCTTCT CCCGATGTTC AACTTGTGGA GATGCAGCAC CTTCCAGTCT GTGTTGCTGA 2520 GGGTAAGAAA CAAGATGATG TTTTCCAGGT GTCAGATAAA CACGTTGACG ACGTTGCGTC 2580 TGTCGAGCAA CACCGATTCA GAGGGAGAAG TCGTCATCCA CTGAAGACAA TGAAGCCACC 2640 GAAGTCAGAG GACCTGGACG GCCAAAGAAG ATTTATACTG GTAAACCTGG AAGACTACGA 2700 AAACAATACA ACATGGTCAA CGGCATGAGT ACGGATGAGG TTCCAATTCC AACAAGGCGT 2760 CAAGGAGGCG TTAACCATAA GAAACTTTGA AGAATGGAGA ACCTCAATGC AGAAGGAGTT 2820 GGAGAGCCTA CGTGCAAACA AAACCTGGTC ATTGGTGGAT TTGCCGGCAG GTGAGAAGGC 2880 CATTGGGTCA AAATGGGTCT TTGCAGTGAA GCGTAACAAG ATGGGCAAAG TCGAGAGGTT 2940 TAAGTCTCGA CTTGTCGCAA AAGGGTGCGC TCAGCGATAC GGCGTTAACT TCACGGATAC 3000 GTTCTCACCA GTGGCACGCT ACTCATCAAT CAAATTAGTA ATTGCTTTGG CGGTGGAAAA 3060 CGGTCTCTAC ATGCACCAAA TGGACGTCTC GTCTGCATAC TTGAATGGTG ATCTTCACGA 3120 AACGGTATAT ATGAGGCAGC CAGAGGGATT CATCGATGAA CGATATCCAA AGAAGGTGCT 3180 GAAGTTACAC AAATCTATAT ATGGACTGAA GCAGAGCGGC CGAGAATGGA ATAAACTACT 3240 GAACGAAGTA CTGCAAAAGA TTGGGTTTTC TTCATGCCCT AGCGAGCCAT GTGTTTACAC 3300 CCGTAATTCT GGCAAAAGTA AGAACCTTGT TGTAGTGTAC GTAGACGACC TTATCATAGC 3360 CAGTTCAAGC AAGGAGGAAC TGTGCGATAT TAAAGCATCA ATTTCAAAAG AATTCGATGT 3420 CGTAGACGGG GGCGAGCTAA GACATTTCCT GGGTATCGAA ATTGAACGTG ACGGAGAAAC 3480 TGGTGGAATC AGGATTGGCC ACAAGCAGTA TATTGAGAGC CTTCTCAATG ATTATTCGAT 3540 GCAGGATTGT AAGCCGAATC TCATCCCACT GGAAGCAGGC TTTGAAGTAA AATGTGACAA 3600 GGCCGATTGT CAGAAGGTGA ATCAAGTCAG CTATCAATCG CTGATTGGCT CATTAATGTA 3660 TCTGGCAATC ACAACTCGTC CTGATATTAT GCATTCGGTC ATTAAACTAT CACAACGGAA 3720 CTCTGACCCA CATAAAGAGC ACGAAGCCGG TGCAAAGCGA GTTCTGCGAT ACTTGAGGGG 3780 CACAGCTGAT TTGCAGTTAC ACTACGAGCG TACTGGTGTG CCTATACATT GCTATGTTGA 3840 CGCGGATTGG GCTGGTGACA CCACGGACCG AAAATCCTTT ACTGGATGGG CATGTATTGC 3900 TGCAGGAGCT GCCTTTACGT GGGACTCAAA GAAACAATCA GTGGTTTCCC TAAGCAGCAC 3960 AGAGTCAGAG TATGTGGCAC TTTCCATGGC AGCGAAGGAA GTGGCGTATG TTCGGAAATT 4020 GGTCAACGAA ATGGGTTTTG GTGAAACACA GGCAACTAAG GTTTATAGCG ACAACCAGAG 4080 TTCCCAGTGC TTGGTGAAAA ATGACACTTT CCATGCACGT AGTAAGCATA TAGATATAAA 4140 ATATCATTAT ATAAGAGAAC TGTATAAGAA CAATATAATA GAAGTTAATT ATGTACCCAC 4200 AGAAAACATG ATGGCAGATG TTCTAACCAA GAATTTGAAT AGATTTAAAC ATGAAAAGTG 4260 TATCCAAGGG ATGGGTTTAA ACTAACAAAA ATGTATTTTG ATTTTGGAAT TAAGATAAAA 4320 ATATTATCCG TAAGGGTTGA CATATTGCAT TGAGAGGGAG TGTTGAAAAT ATGTATGTAA 4380 TATGTATGTA TATATAATAA TATGTAATAA AGTCAATGCA CTGTGTCTCC CTCTTTTGGT 4440 CGCGGTAACC AAAAGCTTTT TTCTCTTATT GTGTTATCCT CTTTAGCGTG TAATTTGGCT 4500 GCCTGCGTGC AGTAACATTG TACTCTAGAT CAGTCACAAT AAATACTTGA AACGAATA 4558 // ID JUAN standard; DNA; INV; 4236 BP. XX AC AY180919; XX DR FLYBASE; FBgn0046110; Juan. XX SY synonym: Strider SY synonym: DOC6 XX FT source AY180919:1..4236 FT SO_feature CDS ; SO:0000316:179..1330 FT /protein_id="" FT /db_xref="FLYBASE:; Juan\ORF1" FT /translation="MRTKRNKPLVERQNERGSKQFTNYFRPLQTLDDDNDDKPIETDTL FT DEIIIKPPPITLIKQNTKFVHELMAKIKTDDYFIKTISIGIKIFLPNMDCFNAACNQLK FT EHNCEFYTYDTRSNKPYKAILSGLDKLPIETVKSMLQNLGLQCTEVKIVEKKTKSDHEL FT LLYIVYFVNKSITVKELRKNFMYVNHTKVRWEYKVKLQNKITQCYNCQLFGHGSNNCSV FT KTSCAHCAGSHQTAVCMDVNNKKCANCKGDHPSTELTCPCRISYLNLLSKRSSQRIGRN FT NLEAHGMRNSKQHIAPSALFGNQPQISLPTQTKRVQQLNFGSYSNALTNIHTLPNPKAN FT ITNETNLFSCEEINSLLSELITRFNECSNKGEQFQVISQLAIKYVYTNK" FT SO_feature CDS ; SO:0000316:1329..3983 FT /protein_id="AAN87272.1" FT /db_xref="FLYBASE:FBgn0062541; Juan\pol" FT /translation="MFTQTNNFIDNLNIMAWNARAVRNKRIELIKFLENNHIHIALIN FT ETWLTHSDRFNIPEYTIYRNDRKESRGGGVAIAVSNTLIHEQIPCIKSHVIENVGIQI FT NTDSNSSLKIYSIYFAGNTSKCIPSSCDLSWDHNHLKSLYRSDLLKISQINGNFLICG FT DFNSRHRAWKCTRANGWGKILNELSDLGKFSILYPTQPTYIPHNHKAKASTLDLCLTN FT IPNQLANPAVMQELSSDHLPVVIKYSTNFMRNVKTYPILGRANWALFKRIINERLMVE FT EVVVNCSQISTLDIDNLIGSFTRVINYAFLKAVPHKPIHHSKLTLPVHITELFKTRNI FT IQRQWFRSRHPLLKSRCDHLNYIIRKELFIYNNAKWNNKLQKLDKCSKPFWNITKAIK FT KRKQIVPCLSKADEIYASGKEKANILADVFQLKHNLTHSYSNQNTIDQVKISLQSIAE FT TPNNIAEICRVTYPEVFSIVKALHTRKSPGIDGIPNISLKNLPTSAINHIVNIANHCL FT QAGYFPRDWKIAKIVPILKPGKPPDNPESYRPISLLSGLSKILEKLIKLRLVKFLDIH FT NTLPTVQYGFRNGLNTILPTLKLRNYIKESIQSKQSVGLVTLDIEAAFDTVWHDGLLH FT KLKMIGTPIYLIKIVQNFLTHRYFSVNLNSSQSARRILNAGVPQGSVLGPTLFNIFTH FT DIPQATNCVLSLFADDAAVYSAGFSYSEINQSMQSYLNELDIYYKKWKIKINPNKTNA FT IFFTKRRKPRYLPDRQLRILDSPIQWVDNIRYLGIIFDKKLTFKYHINNTIMKVNKII FT CTLYPLINRKSKLSISNKIIIFKTIFNPILMYGSPVWGRCAQTHIKKLQICQNKLLKL FT IMNLPYYTNTKYLHIKAGVQKVQQKIQY" XX CC Sequence from BDGP, January 2002. CC Any changes to original sequence record are annotated in an FT line. CC Repbase, http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC for ORF1. XX SQ Sequence 4236 BP; 1572 A; 791 C; 636 G; 1237 T; 0 other; ACAGTCTTCG ACTAGCTCTT GTCGAACGCG GATGTAAAAT CTTCTATTCT CTCTGAGAAC 60 ATTTTGGTGT ACATTAAATA TTTTCTGTTT TCTTCTTTAA TTGTGATATT CGCAAAAAAC 120 TACAATCAAA CGTTCAAGTG ATGCAATTGG CGATGCTGAA GCATCTGCCG TTGAAAAAAT 180 GCGTACTAAA AGGAATAAAC CACTAGTGGA GCGCCAGAAC GAACGGGGCT CCAAGCAATT 240 TACAAATTAT TTTCGTCCAC TACAAACTTT AGACGATGAT AATGATGATA AACCTATTGA 300 AACGGACACA CTTGATGAAA AAATAATCAA ACCACCTCCT ATAACATTAA TTAAACAAAA 360 TACAAAATTT GTACACGAGC TAATGGCTAA AATTAAAACT GATGATTACT TTATTAAAAC 420 CATCTCAATT GGAATTAAAA TATTTTTACC AAATATGGAC TGTTTCAATG CAGCTTGTAA 480 CCAACTCAAG GAACATAACT GTGAGTTCTA CACGTACGAT ACTAGATCAA ACAAGCCATA 540 CAAAGCCATT TTATCCGGGC TGGATAAACT ACCAATTGAA ACTGTCAAAT CCATGCTACA 600 AAACCTAGGA TTACAATGCA CCGAAGTCAA AATTGTGGAA AAGAAAACTA AGTCTGATCA 660 TAAATTATTA CTGTACATTG TTTATTTCGT AAACAAAAGT ATCACAGTAA AGGAACTGCG 720 CAAAAACTTT ATGTATGTCA ACCACACTAA AGTACGCTGG GAGTATAAAG TAAAATTGCA 780 GAATAAAATT ACTCAATGCT ACAATTGCCA ACTATTTGGA CATGGATCAA ATAATTGTTC 840 AGTTAAAACA TCATGTGCCC ATTGTGCTGG ATCACACCAG ACTGCCGTAT GTATGGACGT 900 AAACAACAAA AAATGCGCTA ACTGTAAAGG CGACCATCCA TCGACAGAAC TGACATGCCC 960 ATGCCGAATT TCTTATCTCA ATTTACTTTC CAAACGCTCA TCACAACGCA TTGGAAGAAA 1020 TAACTTGGAA GCTCATGGAA TGAGGAATTC TAAGCAACAC ATCGCTCCAT CTGCTCTCTT 1080 TGGAAATCAA CCCCAGATCT CATTGCCTAC TCAAACTAAG AGAGTGCAAC AACTCAACTT 1140 TGGTAGCTAT AGTAATGCAC TCACCAATAT TCATACTTTG CCTAATCCCA AAGCAAATAT 1200 CACAAATGAA ACTAATCTCT TTTCATGTGA AGAGATAAAT TCTCTTTTGT CTGAATTAAT 1260 AACAAGATTT AATGAATGCT CCAATAAGGG AGAGCAATTT CAAGTAATTT CTCAGTTAGC 1320 GATTAAGTAT GTTTACACAA ACAAATAATT TTATTGATAA TTTGAATATT ATGGCATGGA 1380 ATGCTCGTGC CGTTAGAAAC AAACGTATAG AACTGATCAA GTTCTTAGAA AATAATCACA 1440 TTCACATAGC ACTTATAAAT GAAACGTGGC TGACTCATAG TGACCGTTTT AATATACCCG 1500 AATACACTAT ATACAGAAAT GACAGGAAGG AAAGTAGGGG AGGCGGCGTT GCCATTGCAG 1560 TTAGTAACAC ATTGATACAT GAACAAATAC CCTGCATTAA AAGTCATGTT ATTGAAAATG 1620 TAGGTATACA AATCAACACT GACTCTAATT CCAGTCTAAA AATTTACTCT ATTTACTTTG 1680 CTGGAAATAC TTCAAAATGT ATTCCTAGTT CATGTGATTT ATCTTGGGAC CATAATCATT 1740 TAAAAAGTTT GTATAGATCT GATCTCCTTA AGATCTCACA AATAAATGGA AATTTTCTAA 1800 TTTGCGGGGA CTTCAACTCG CGCCACAGAG CATGGAAGTG TACCCGAGCC AATGGTTGGG 1860 GCAAAATTCT CAATGAACTT TCAGATCTGG GAAAATTCAG TATTTTGTAC CCTACGCAAC 1920 CCACATATAT TCCGCATAAC CATAAAGCAA AAGCATCAAC ATTGGACCTA TGCCTCACTA 1980 ATATCCCTAA CCAGCTGGCT AATCCAGCAG TAATGCAAGA GCTTTCCTCT GATCACCTAC 2040 CAGTGGTTAT TAAATACTCT ACAAATTTTA TGCGCAATGT AAAAACATAT CCGATTTTGG 2100 GAAGGGCAAA TTGGGCGCTT TTTAAGAGAA TTATAAATGA AAGGCTGATG GTTGAAGAGG 2160 TGGTTGTCAA CTGCTCACAG ATATCGACTT TGGATATTGA TAATTTAATC GGCTCTTTCA 2220 CTAGAGTAAT CAATTATGCC TTTCTTAAAG CAGTCCCTCA TAAACCTATA CATCACTCCA 2280 AGCTTACACT CCCTGTACAT ATAACTGAAT TATTTAAAAC AAGAAATATT ATCCAAAGAC 2340 AATGGTTTAG GTCACGTCAT CCATTATTAA AATCAAGATG TGATCATTTA AATTACATCA 2400 TACGAAAAGA GCTTTTCATT TATAATAATG CAAAATGGAA CAATAAACTT CAAAAGCTGG 2460 ATAAATGCAG CAAACCCTTT TGGAATATAA CCAAAGCAAT TAAAAAAAGA AAACAAATTG 2520 TTCCGTGTCT AAGTAAAGCT GATGAAATAT ACGCTTCTGG AAAAGAAAAA GCGAATATTC 2580 TTGCTGATGT CTTTCAACTG AAGCATAATC TGACACATTC ATATTCAAAC CAAAATACAA 2640 TTGACCAAGT CAAAATATCT TTACAATCTA TAGCTGAAAC CCCAAACAAC ATCGCAGAAA 2700 TTTGTAGAGT CACTTATCCA GAGGTTTTTT CAATTGTAAA AGCTCTCCAT ACTAGAAAAT 2760 CTCCTGGAAT TGATGGAATA CCTAATATAT CCTTGAAAAA CCTTCCTACA AGCGCCATTA 2820 ATCATATTGT TAACATTGCC AATCACTGTC TTCAGGCAGG CTATTTCCCA CGGGATTGGA 2880 AGATAGCAAA AATAGTTCCA ATTCTCAAAC CAGGAAAACC TCCTGATAAC CCTGAAAGCT 2940 ACCGACCAAT AAGCCTATTA AGCGGATTGT CCAAGATTCT GGAAAAATTA ATTAAATTGA 3000 GATTGGTGAA ATTTTTGGAC ATACACAACA CACTACCAAC GGTACAGTAT GGTTTTAGAA 3060 ATGGTCTAAA TACCATACTG CCGACCTTAA AACTAAGAAA TTATATTAAA GAAAGTATCC 3120 AATCCAAACA ATCTGTAGGG CTGGTTACAC TTGATATTGA GGCTGCATTT GATACTGTAT 3180 GGCATGACGG ACTGTTGCAC AAACTAAAGA TGATTGGAAC TCCCATATAC CTCATTAAAA 3240 TAGTTCAAAA TTTTTTGACA CATCGATACT TTAGCGTTAA TTTGAACAGT AGTCAATCTG 3300 CTAGACGAAT TTTAAATGCA GGTGTGCCAC AGGGTTCTGT CTTAGGACCT ACATTGTTTA 3360 ATATTTTTAC ACACGATATT CCACAAGCTA CAAACTGTGT ACTTTCACTG TTTGCTGACG 3420 ACGCCGCTGT CTACAGTGCT GGTTTCTCGT ACAGCGAGAT AAATCAATCC ATGCAGAGTT 3480 ACTTAAATGA ATTAGACATA TATTACAAAA AATGGAAAAT TAAAATTAAC CCTAATAAAA 3540 CGAATGCTAT ATTTTTTACA AAAAGAAGGA AACCTCGTTA TCTTCCTGAT AGACAGTTGC 3600 GAATACTAGA CTCTCCAATA CAATGGGTTG ACAACATTCG ATATTTGGGT ATAATTTTTG 3660 ATAAAAAACT AACATTTAAA TACCACATCA ACAACACAAT TATGAAAGTT AATAAAATTA 3720 TTTGCACACT GTACCCTCTA ATAAATCGTA AATCTAAATT ATCAATATCC AATAAGATCA 3780 TAATTTTTAA AACAATTTTT AATCCAATAT TAATGTATGG GTCACCTGTT TGGGGTAGAT 3840 GTGCTCAAAC TCATATCAAA AAACTTCAAA TATGCCAAAA CAAGCTATTA AAATTAATAA 3900 TGAACTTACC TTATTACACA AACACTAAGT ACCTACATAT AAAAGCAGGT GTACAAAAAG 3960 TTCAACAAAA AATACAATAT TAACAATCTT TTGTTTCAAG TCTTAGTTTT TAAGTTAGTT 4020 GTAAGTCAAT TAAGGTTTTT CTTCTCCCTT GTTTTCCTTA ATAAATAATA AATATTGTTA 4080 TACAAAAAAC TAAAAACAGG TCAACTAGAC ATAATTAATA AATATTTCAG ATGAAAGCCT 4140 TAGCTAAACA CTGTAAATAC CTAATTATAC CATTAGCTTT AATGAAATGT ACATATATAT 4200 ATAAGGGATA AAGCACAAAT AAATAAATAA ATAAAA 4236 // ID FROGGER standard; DNA; INV; 2 BP. XX AC AF492763; XX DR FLYBASE; FBgn0061513; frogger. XX FT source AF492763:1..2483 FT SO_feature five_prime_LTR ; SO:0000425:1..203 FT SO_feature three_prime_LTR ; SO:0000426:2281..2483 FT SO_feature CDS ; SO:0000316:997..2244 FT /protein_id="AAM11672.1" FT /translation="MLLAVAAEKDLHMHQIDISNAYLNSDLEEDVYLKQPKNYVDKEN FT PGKVLKLQKAIYGLKQSERLWNDALNEVLQNMGFKRSKNEACLYYKKQQNGFSYIAVY FT VDDLIIISPKESDIEDIKGSIATKFDMKDGGQLRYFLGMEISRKGQTGPIKLCQKRYI FT ENLLRRYGMQSCRLVGTPFDPGYESGCTNEKCAKVNLTHFQSLIGSLMYLAVVSRPDI FT LHSVSKLSQRNTDPHHEDEAAAKHVLRYLCGTINLSIIYMKTGELVKEFADADWANDK FT VDRKSYSGYAFLMAGSAFSWGSSKQSVIAQSSTEAEYIALSTAAKEAVFLRRLLQEMG FT WFDKGPLKLLCDNLSASSIAKNPINHKRTKHIDVRYHFIRDKVNKNEIIVEYVNTQNN FT VADILTKAKALWLFEITWICLNL" XX CC Sequence from BDGP, January 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 2483 BP; 621 A; 534 C; 492 G; 836 T; 0 other; TGGTTGCATA AGAGTGCCGC CGCGGTAAAA TGATTGAGAC AATGAGATGT GATTAGGGTG 60 AATGCTTTAT TAGAACTGAA CTGTGTACAT AACTTCGTAT ACATGCTTCA CTACTTTGAG 120 TGGCAAATGA GAGCAAGAGA AGCTTGCTCC CTGAGAGAGT AATAAAGTTG CCAGACAGAT 180 GATACGCAAT CTTATATTTC CAACACTTTT GCTTAAGATC AACATCATCT GTTTTGCAAT 240 TACAAATTTA AACAAATCCA AGTGATTTCA AAAAGCCACA ATGCTTTTGC TTTGGTTAGT 300 ATATCTGCGA CATTATTCTG AGTGTTAACA TATTCAACAA TTATTTCATT TTTGTTAACT 360 TTGTCCCTTA TAAAATGATA TCTGACATCA ATGTGTTTTG TACGCTTGTG ATTAATTGGA 420 TTCTTGGCAA TACTCGAGGC GCTCAGATTG TCGCATAGTA GTTTCAATGG CCCCTTGTCG 480 AACCATCCCA TCTCTTGTAA GAGTCTTCTT AAAAACACTG CTTCTTTTGC TGCTGTTGAT 540 AAGGCGATGT ATTCAGCCTC GGTGCTGCTT TGGGCGATGA CGCTCTGCTT TGATGAGCCC 600 CATGAAAATG CACTTCCAGC CATAAGGAAT GCATATCCTG AATACGACTT TCGGTCGACT 660 TTGTCGTTTG CCCAATCTGC GTCTGCGAAT TCCTTGACCA GCTCGCCGGT CTTCATGTAG 720 ATAATGCTCA GGTTGATTGT CCCACACAAA TATCGTAGCA CATGCTTTGC TGCTGCTTCG 780 TCTTCGTGGT GAGGATCTGT ATTTCTCTGT GATAACTTGC TGACAGAGTG CAATATGTCT 840 GGTCGACTTA CTACTGCTAA ATACATAAGC GAACCAATTA GTGATTGAAA ATGGGTTAAA 900 TTTACCTTTG CACATTTCTC ATTTGTGCAA CCAGATTCAT AACCAGGGTC AAATGGGGTT 960 CCCACTAGTC GACAGCTTTG CATGCCATAC CGTCTTAGTA GATTTTCAAT GTAGCGCTTT 1020 TGACATAGCT TTATAGGACC TGTTTGTCCC TTTCGGCTTA TTTCCATCCC AAGGAAATAT 1080 CTCAGTTGGC CGCCATCCTT CATATCAAAC TTTGTTGCTA TGCTCCCTTT GATGTCTTCT 1140 ATGTCGCTTT CTTTGGGAGA TATGATGATC AAATCGTCAA CGTAGACAGC TATATAGCTA 1200 AAACCGTTTT GTTGCTTCTT GTAGTATAAA CATGCCTCGT TCTTGCTTCT TTTAAAGCCC 1260 ATATTTTGAA GAACTTCATT TAAGGCATCG TTCCACAGTC GTTCAGACTG CTTAAGCCCA 1320 TAGATTGCTT TTTGCAACTT AAGGACTTTT CCAGGATTTT CCTTGTCAAC GTAATTTTTT 1380 GGTTGTTTCA AATAAACATC CTCCTCAAGG TCGCTGTTTA GGTATGCGTT TGATATATCA 1440 ATTTGATGCA TGTGAAGGTC TTTTTCTGCA GCAACTGCAA GCAGCATCCT GATGGTCTCG 1500 TACCTGATTA CCGGGGAATA TGTTTCCCAG TAGTTCACTC CAAATTGTTG TCCACAACCT 1560 TTGGCAACAA GTCGTGATTT GAATCTCTCT ATTTCGCCAT CCTTGTCTTT CTTGATGCGA 1620 AATACCCATT TTGATCCAAT TGCTTTTGGG CAGGTCTACC AGTGACCATG TATTGTTGGC 1680 TATTAACGCT TCATACTTCG CTCGCATTGA ATCCTGCCAA TTTACAATTG GCCCTTGAGT 1740 GCTTGCTCCA ACGACTGCGG TGTTTCTTCA TCTTCCTCGT AATTCAAATA TTGAAGAGTT 1800 TGATACACCT TTTTCGGCCT TCCTGGTTGG CCAGTTCGTA TGAACTTTGG TCGGCCTGGC 1860 CTTCGCTGTT GTGCCATTAG GGTGTCATCT GCAGATTGGT CATCAGAGTC GTCATTGGCA 1920 CTGACAAACG TCCTGCCCTC ATCGCTGTCG TAAACGGCTT CTTCATCGTC AGTACTGCTG 1980 GCTTCAGCTA GCAAGTGACG TTTCAGCCCA TAAAAATTCA TCCAAACCAG CATGTATTAA 2040 AAGACTTTTG GCCATTTCCA CAATTGTGCG GTTTGCACGT TCAGCAATTC CGTGTTGCTG 2100 AGGGGTATAC GAGACACTTA ACTGTCTCTT TATCCCGCAT TGGCTTAAAC AAAGATAGAC 2160 GCGAGTTCTT TTCTCTTAAC TTTGTTTTAT TTTTGGTTAC TTCACTTTGA TCACTTCACG 2220 AATACTTTGT AAACGAATCA ATACATTTCT TTACGCGTAG GGCCTGGGCC CATAACCAAA 2280 TGTTTGCATA AGAGTGCCGC CGCGGTAAAA TGATTGAGAC AATGAGATGT GATTAGGGTG 2340 AATGCTTTAA TAGAACTGAA CTGTGTACAT AACTTCGTAT ACATGCTTCA CTACTTTGAG 2400 TGGCAAATGA GAGCAAGAGA AGCTTGCTCT CTGAGAGAGT AATAAAGTTG CCAAACAGAT 2460 GATATGCAAT CTTATATTTC CAA 2483 // ID ROVER standard; DNA; INV; 7318 BP. XX AC AF492764; XX DR FLYBASE; FBgn0061485; rover. XX FT source AF492764:1..7318 FT SO_feature five_prime_LTR ; SO:0000425:1..367 FT SO_feature three_prime_LTR ; SO:0000426:6952..7318 FT SO_feature CDS ; SO:0000316:1282..2535 FT SO_feature start_codon ; SO:0000318:1..3 FT /protein_id="AAM11673.1" FT /translation="MATASPIILSDSNMSQVERQINAVEIFNGDPNTLHTFITRIDFI FT LALYQTTDERQKLIIYGHIERNISGDVIRTLGTNNFTSWIELRTRLILYYKPQAPSHQ FT LLEDFRNIQYKGNIRQFLEEAEKRRQVLMSKLELENNSAETALFTRLIQDSIENLILK FT LPNHIHLRIVNCQIPDLRSLINILQEKGLYEPQTLNKDNFKPAPNSNRTPNNLPNRQV FT NGQNKPITPFQPYHNNVFQPYYPPYPYAPIHTPRPNQTPFPQYQPRPTYPQPNIPRPN FT PVFNRQNIFDQNRFGPNQTYTPNNFPPTGNTQTNNPVKRQRPSDSGQTKMSIEELRYQ FT EMVSNQPEYPYHYYYPYYPDNFQPQPYPYQMSYIPDPTQIPFEQDHDTSDKTQQEQTA FT PNDDESDNTKEAENFRLVAPDQTNI" FT SO_feature CDS ; SO:0000316:2595..5144 FT SO_feature start_codon ; SO:0000318:1..3 FT /protein_id="AAM11674.1" FT /translation="MLTWNIFETPIQQTNHLIHTSNGSVTINETTTIPPNNYFPIAQE FT FLIHKFSDHYDMLIGRKLLSKAKAIIDYQKKTATLFNKVYKIKDTEKLTDQNHAQVDS FT SFPQDPTFLETSTLQENLFRLNHLNAEEKYKLTNLLTRFKDIQFHEGDKLSFTNQEKH FT TINTTHNIPVYSKMYSFQQSYEQEVERQIQEMLEQGIIRESSSPYCSPIWVVPKKLDA FT SGQQKLRIVIDYRKLNEITINDRYPMPNIDEILGKLGRSNYFTTIDLAKGFHQIEMDS FT ESIAKTAFSTKYGHYEYTRMPFGLKNAPATFQRCMNNLLRPLLNKNCLVYLDDIIVFS FT TSLEEHLQSLEAVFEKLSQANLKLQLDKCEFLRQETTFLGHVITKDGIKPNPEKIKAI FT QDYPLPSKPKEIKAFLGITGYYRKFIPNFSDIAKPLTKCLKKGVKIDTKHKEYIEAFQ FT KLKLLISEDPILKIPNFERKFVLTTDASNVALGAVLSQDGHPISYISRTLNEHEVNYS FT AIEKELLAIVWATKTFRHYLLGRHCEIASDHQPLCWLHKLKEPNSKLTRWKIRLSEYD FT FDIKYIKGKENHVADALSRIKIEQAFFGESTQHSAEEDNSDLIGLSEKAINYHKRQII FT FSKGPNSSIEHETYFKKQIIHISYKEMTLEAAKQYLLNHFCSKNSALYIESDADFEMI FT QNAYTTTMNPKCTKIFRSLVLLKNIPTYASFKELILNTHEKLLHPGIQKTVKLFSENY FT YFPNSQLLIQNIINECQVCNLAKTEHRNTEMPMKITPKPEHCRDKFVIDIYSSEGNHY FT LSCIDIYSKFATLEQIKTKDWVECKNALMRIFNQLGKPKLLKADRDGDSPA" FT SO_feature CDS ; SO:0000316:5823..6575 FT SO_feature start_codon ; SO:0000318:1..3 FT /protein_id="AAM11675.1" FT /translation="MLRELNAITLHQTNRQKRGLINIMGSAFKYLFGTLDNNDRIQIH FT NQLESATNNSINIHEMSDIIQLINSNMQKIKEFEDEHNNREQMLYELIQFTEYIEDIA FT MGMQLSRLGLFNPKLLNYDKLENVDSSNILQTKTSTWINTNQLLIIAHIPTKQKTIHT FT INIIPYPDNNGYQLDHIDKDTYFEEQDKIYNQNYQEINNECIKNIIKRTNPICNFVPI FT TVEQIIKYVEPNSIITWNLNNTIIEQKLPRNK" XX CC Sequence from BDGP, January 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7318 BP; 2989 A; 1603 C; 975 G; 1751 T; 0 other; AGTAACATAA TATGCTTCTC ATATTACGTT TACATACTTA CACTAATTGT ACATACAATC 60 TTGCACATGC ATAAACACAT GAAACCAGTT TACATTTTTA CTTACACTTA AGCGCATAAT 120 TTGTTGTGCA TCCATACCGT TATTTTTCCG TTCTTTTTTG TACACATATA CTGATTAGAC 180 ATTCCCGTTT CTCGCGACTC ACTTCGAGCC GATCAAAAAC TCTGTACAGT CAGTCTTAAG 240 CCGACAACGA AGAAATAAAG ATCTAAACTA AAAAATACCT CGTGTTGATT CTGAAACTTC 300 TTTAAAGGCG TTGATCTTAG TCAAACGACG GATCATTTGT TCGACTCGAA TAGTAAAATA 360 CGTAAGTGGC GCAGTCGGTA GGATAATACA TTGTTGATGC GATAGTAACC CTATTCAATT 420 CTTCAATTAT TATCCACAAA AGAACTAACG CAGTCGCGTC GTGCTAAGTG GAAAGTGTTA 480 ACCAATGGTA CTCAATCCCA TAAAGTGACC ACGAAGTGAA ATAACTTAAA ATTACAAAAA 540 TTTTAAGAAG ACGCCTTTGA ATTCGCTGAG TAGTAAATCC ACAAAAGGTT ACTCAAACCA 600 AAGCGAAACA GCTAAGAGCG CAACACAACA AACTTAAATT AAAAACTTAC AAATAAGTGA 660 ATAACAGAAA TATTACGAAC AAATAAGTGC GAAAATTCAA CAAAACCTAA GCTAAAGAAC 720 TATAACTAAG TGAATAACAA ACATTATAAG TGAATTAAAT TAATAAGTGA ACAACACAAC 780 AAACTTAAAT TAAAAACTTA CAAATAAGTG AATAACAGAA ATATTACGAA CAAATAAGTG 840 CGAAAATTCA ACAAAACCTA AGCTAAAGAA CTATAACTAA GTGAATAACA AACATTATAA 900 GTGAATTAAA TTAATAAGTG AACAACACAA CAAACTTAAA TTAAAAACTT ACAAATAAGT 960 GAATAACAGA AATATTACGA ACAAATAAGT GCGAAAATTC AACAAAACCT AAGCTAAAGA 1020 ACTATAACTA AGTGAATAAC AAACATTATA AGTGAATTAA ATTAATAAAT GAACAAACAC 1080 AAACACAAAG CTCAGACCTG GAATACAAAC TAATAGTGGA AATCAAGAAC CCATAGTGAA 1140 AACCAAAACC CTATCCAAAA CATAAACCTA CAGTAAAAAC TTGAACTTTA TCGAAAAACG 1200 AAACCTATCC GAAACTGCAA ACCTTACCCA GAACGTGAAC CTTACCTAAA ACATTAAATA 1260 ACAATAAATA CAACAACAAC AATGGCAACA GCAAGCCCTA TAATACTATC CGACTCAAAT 1320 ATGAGTCAGG TCGAGAGACA GATAAATGCT GTCGAAATCT TCAATGGAGA CCCGAATACT 1380 CTACATACCT TTATAACTCG CATCGACTTC ATTCTGGCCT TATACCAAAC TACTGACGAA 1440 AGGCAGAAGT TGATAATTTA TGGACACATC GAACGCAATA TCAGCGGCGA CGTCATCCGT 1500 ACTCTCGGGA CCAACAACTT TACCAGCTGG ATTGAACTAA GGACCAGATT GATCCTATAT 1560 TACAAACCCC AGGCACCGAG TCATCAACTT TTGGAGGATT TTCGGAACAT TCAATACAAA 1620 GGCAACATCA GGCAGTTTCT GGAGGAAGCC GAGAAAAGAC GACAAGTTTT AATGAGTAAG 1680 TTAGAACTAG AAAATAACTC AGCTGAAACC GCCTTATTTA CCAGACTTAT TCAAGATAGC 1740 ATAGAAAACT TGATCCTAAA ACTCCCTAAC CATATTCATC TAAGAATAGT TAACTGCCAA 1800 ATCCCCGATT TAAGATCCCT TATTAATATC TTACAAGAAA AGGGTCTATA TGAACCTCAA 1860 ACATTAAACA AAGATAACTT TAAGCCAGCT CCTAATTCTA ATAGGACGCC AAATAACTTA 1920 CCAAATAGAC AAGTTAATGG CCAAAATAAA CCTATAACAC CTTTTCAACC GTACCATAAC 1980 AACGTCTTTC AACCCTATTA TCCACCATAC CCATACGCAC CAATCCATAC ACCCAGGCCC 2040 AATCAAACCC CTTTTCCCCA ATATCAACCA AGACCAACCT ACCCACAACC TAACATACCC 2100 AGACCGAATC CTGTCTTTAA TCGACAGAAT ATCTTCGACC AAAATCGTTT CGGACCTAAT 2160 CAAACTTACA CACCGAATAA CTTCCCTCCA ACTGGGAACA CACAAACCAA CAACCCGGTA 2220 AAAAGACAAC GACCATCAGA CAGCGGACAG ACTAAAATGA GCATAGAAGA ACTAAGATAC 2280 CAAGAAATGG TATCAAACCA ACCCGAATAC CCTTACCATT ATTACTATCC ATACTACCCA 2340 GATAACTTCC AACCACAACC ATATCCTTAT CAAATGAGTT ATATCCCAGA CCCAACTCAA 2400 ATCCCTTTCG AACAAGATCA TGACACTTCG GACAAAACAC AACAAGAACA GACAGCACCA 2460 AACGATGACG AATCAGACAA TACCAAAGAA GCAGAAAATT TTCGGCTAGT TGCCCCGGAT 2520 CAAACCAATA TATAATTATT AAACATAATG ACATAGACTT AAAGTGCCTA ATTGACACTG 2580 GATCCACCAT AAACATGCTT ACTTGGAACA TTTTCGAAAC ACCTATACAA CAAACCAACC 2640 ACCTAATTCA TACAAGTAAT GGCTCGGTAA CTATTAACGA GACCACCACT ATACCACCCA 2700 ATAACTACTT TCCTATTGCT CAGGAGTTTC TAATTCACAA ATTCTCAGAT CATTATGATA 2760 TGCTGATTGG ACGCAAATTG TTGTCTAAAG CTAAAGCCAT AATAGATTAT CAAAAGAAAA 2820 CTGCCACCCT CTTCAACAAA GTCTATAAAA TAAAAGACAC TGAAAAACTT ACAGACCAAA 2880 ATCATGCTCA AGTAGACTCT TCGTTCCCAC AAGACCCTAC CTTTTTAGAA ACCTCCACAC 2940 TTCAGGAGAA TCTGTTCCGA CTAAACCATT TAAATGCAGA AGAAAAATAC AAATTAACAA 3000 ACTTACTGAC AAGATTTAAA GACATTCAGT TTCACGAGGG CGACAAACTT AGCTTCACAA 3060 ACCAAGAAAA ACATACAATT AATACCACAC ATAATATTCC AGTTTACTCT AAAATGTATA 3120 GCTTCCAACA ATCATACGAA CAAGAAGTAG AAAGACAAAT TCAAGAAATG TTAGAACAAG 3180 GCATCATTCG AGAAAGCAGC TCACCGTACT GCAGCCCCAT CTGGGTAGTA CCAAAAAAAT 3240 TAGATGCTTC CGGACAGCAG AAACTTCGAA TAGTCATAGA CTACAGGAAG CTGAATGAAA 3300 TAACAATAAA TGACCGATAC CCTATGCCAA ATATAGACGA AATACTAGGA AAATTAGGGA 3360 GGTCTAATTA CTTTACCACC ATAGACCTGG CAAAAGGCTT CCATCAAATA GAGATGGATT 3420 CAGAATCAAT AGCCAAAACG GCTTTCTCTA CTAAATACGG ACATTACGAA TATACTAGAA 3480 TGCCATTTGG GCTTAAAAAC GCCCCAGCTA CCTTCCAACG CTGCATGAAT AACCTACTTC 3540 GTCCGCTTTT AAATAAAAAT TGTTTAGTAT ATCTAGACGA TATCATTGTC TTTTCTACCT 3600 CTCTAGAGGA ACACCTTCAA TCCCTTGAAG CAGTCTTCGA GAAATTATCT CAAGCCAACC 3660 TTAAACTACA GCTAGATAAA TGTGAATTCC TAAGACAAGA AACTACCTTT CTAGGACACG 3720 TTATCACTAA AGACGGAATT AAACCAAATC CCGAAAAAAT CAAAGCTATA CAAGATTACC 3780 CACTTCCATC TAAACCTAAA GAAATTAAAG CATTCCTTGG AATCACAGGA TACTACAGGA 3840 AATTCATACC CAACTTCTCT GACATAGCGA AACCACTGAC TAAATGTCTC AAAAAGGGAG 3900 TAAAAATAGA TACAAAACAT AAAGAATATA TAGAAGCATT TCAAAAATTA AAATTACTTA 3960 TTTCCGAAGA CCCCATATTA AAAATACCAA ATTTTGAAAG AAAATTTGTA TTAACAACTG 4020 ACGCAAGCAA TGTAGCATTA GGTGCTGTTC TATCTCAAGA TGGACATCCC ATTAGCTACA 4080 TTAGCAGGAC CCTAAACGAA CATGAAGTTA ATTACAGTGC TATAGAAAAA GAACTACTAG 4140 CAATAGTTTG GGCCACAAAG ACCTTTAGAC ACTACCTGTT AGGACGACAT TGCGAAATAG 4200 CTTCTGACCA TCAACCATTG TGCTGGCTAC ACAAATTAAA AGAACCTAAC TCAAAGCTGA 4260 CTAGGTGGAA GATTAGATTA TCCGAGTACG ACTTCGATAT AAAATATATC AAAGGAAAGG 4320 AAAACCATGT CGCAGACGCA CTGTCTAGAA TCAAAATAGA ACAAGCATTT TTCGGAGAAT 4380 CTACGCAACA TAGCGCAGAA GAAGATAATA GCGACTTAAT AGGGTTGTCC GAAAAAGCCA 4440 TTAATTACCA TAAAAGACAA ATTATTTTCT CAAAAGGTCC CAATAGTAGT ATAGAACACG 4500 AAACTTATTT CAAAAAACAA ATCATACACA TTTCCTACAA AGAAATGACC CTAGAAGCAG 4560 CCAAACAATA TCTACTCAAT CACTTTTGCT CAAAGAATAG CGCTCTTTAC ATCGAAAGCG 4620 ATGCAGACTT CGAAATGATC CAAAATGCAT ACACAACAAC AATGAATCCG AAATGTACAA 4680 AAATTTTCCG AAGCCTCGTT TTATTAAAAA ACATCCCTAC TTATGCAAGT TTTAAGGAAC 4740 TAATTCTAAA CACCCACGAA AAGCTCTTAC ACCCCGGAAT TCAAAAAACT GTAAAACTCT 4800 TTAGTGAGAA TTATTACTTC CCCAATAGCC AACTCCTTAT CCAGAATATA ATCAACGAAT 4860 GTCAAGTCTG CAACCTAGCC AAAACAGAAC ACCGTAACAC GGAAATGCCA ATGAAAATTA 4920 CCCCTAAACC CGAACATTGT CGTGATAAGT TCGTAATAGA CATCTATTCT TCCGAAGGAA 4980 ACCATTATCT GAGCTGTATC GATATTTACT CTAAATTCGC CACCCTTGAA CAGATTAAGA 5040 CAAAAGACTG GGTAGAATGC AAAAATGCCT TAATGCGCAT ATTTAATCAA TTAGGGAAAC 5100 CTAAACTTCT CAAAGCAGAT CGAGATGGTG ATTCTCCAGC CTAGCCCTCA AACGTTGGCT 5160 CGAAACCGAA GAAGTAGAGT TACAACTTAA TACTACAAAA ACAGGTGCGC TGACATAGAG 5220 AGACTCCACA AAACTATAAA CGAAAAAATC CGCATCATAA AAAACTCTGA CGACAACGAG 5280 ACCAAATTAA GTAAAATAGA AACAATCCTT TACATTTACA ACCACAAGAC CAAACACGAT 5340 ACTACCGGAC AAACCCCTGC TCACATATTC ATTTACGCCG GACAACCTAA CTTAGATACA 5400 CAAAACAATA AAGAGAAAAA AAATTAACGA AATAAATAAG AATAGAACTG AATATAATGT 5460 CGACACCAGA TACAGAAAAG GACCATTACA AAAAGGAAAA TTAGAAGCCC CTTTCAAATT 5520 AACAAAGAAC GTCGAACAAA CCGACGAAGA CCATTACAAA ATTACTAACA GAAACAGAGA 5580 GACACATTAC TATAAGACCC AATTTAAAAA ACAGAAGAAA ACTAATCAAC TCCCCATTTT 5640 ACAGGCCCCT TGTTCACCAT AATACTTTTT CTTACTCTGA CAAACCTATT CAAATCCCTT 5700 CAGAATACGA ACACCACTGT CTCCGCATAA ACCTCTTGGA TATAGACAAC CTAACTCACT 5760 ATTTTAACAC TAAAGTAACC AACTATACAC ATATCCCCCA AATTAAACTT CTACACAATA 5820 AAATGTTAAG GGAACTTAAC GCTATAACGT TACACCAGAC AAATAGACAG AAACGCGGAC 5880 TAATCAACAT AATGGGATCT GCATTTAAAT ACCTTTTCGG GACACTAGAC AATAACGACC 5940 GGATACAAAT TCATAACCAA TTAGAGTCCG CGACAAACAA CTCAATTAAC ATACACGAAA 6000 TGAGCGACAT AATTCAACTT ATCAATAGTA ACATGCAAAA GATCAAAGAA TTTGAAGACG 6060 AACATAATAA CAGAGAACAA ATGCTATACG AACTAATACA ATTTACCGAA TACATTGAAG 6120 ACATCGCAAT GGGAATGCAA CTCTCACGAC TAGGACTGTT TAACCCAAAA TTACTTAACT 6180 ATGACAAATT AGAAAACGTT GACAGTAGCA ACATACTACA GACTAAAACC TCAACCTGGA 6240 TCAACACTAA CCAACTACTT ATCATAGCTC ATATCCCTAC TAAACAAAAA ACCATTCATA 6300 CTATTAACAT AATACCCTAT CCAGATAATA ATGGCTATCA ACTTGATCAT ATAGACAAAG 6360 ACACTTATTT CGAAGAACAA GATAAAATTT ATAACCAGAA CTATCAAGAA ATTAACAACG 6420 AATGTATCAA AAATATTATT AAGAGAACAA ATCCAATATG TAACTTTGTC CCTATAACAG 6480 TCGAACAAAT AATTAAGTAT GTAGAACCAA ATAGTATCAT CACTTGGAAT CTAAACAACA 6540 CTATCATAGA ACAAAAATTG CCAAGAAATA AATAAAAATG TAACGGTTAA CGGCAACAAA 6600 ATAATAACAA TAAAACAATG TAAAATCAAA ATAGGAAACA TAATACTTAA CGAAAACAAG 6660 CTAAAAACCC CGAGATAAAC CTTACCCCAT TGTACACACC CTTAAGCCTA ATAAAAATAA 6720 AACCAATAGA ACATAAGGAC ATTGTACAAT TAATATCAAA TAATAATATC ACAGTTTGTA 6780 TAGTAGTATC AATCATAAGC CTCACATCTT GTATTGCTTG TATTTACTTA AAACTAAAAA 6840 ACAAAATAAT CCTTGAATCA CAAGATGCAA ATCCAACTGC ATCACCAAGA TTGCGGGCAT 6900 CAACACCAAT AGAAGGAGTG GAAGCATAAG CTTCCCTTTT AAAGGGAGGG AAGTAACATA 6960 ATATGCTTCT CATATTACGT TTACATACTT ACACTAATTG TACATACAAT CTTGCACATG 7020 CATAAACACA TGAAACCAGT TTACATTTTT ACTTACACTT AAGCGCATAA TTTGTTGTGC 7080 ATCCATACCG TTATTTTTCC GTTCTTTTTT GTACACATAT ACTGATTAGA CATTCCCGTT 7140 TCTCGCGACT CACTTCGAGC CGATCAAAAA CTCTGTACAG TCAGTCTTAA GCCGACAACG 7200 AAGAAATAAA GATCTAAACT AAAAAATACC TCGTGTTGAT TCTGAAACTT CTTTAAAGGC 7260 GTTGATCTTA GTCAAACGAC GGATCATTTG TTCGACTCGA ATAGTAAAAT ACGTAAGT 7318 // ID DMTOM1_LTR standard; DNA; INV; 410 BP. XX AC nnnnnnnn; DR FLYBASE; FBgn0063450; Tom1. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 410 BP; 136 A; 98 C; 60 G; 115 T; 1 other; agtaacgtaa ttatgccttc ctcataattc acttatacaa aagaccgacc gcggtggtcc 60 gccgtatyga ttcctcgtta aataaacaaa catttctcga tattgcgcat ccactcaagc 120 cgtaatcaaa acccagccac taggcaggca catagccacg tagcacataa gaattctttt 180 tcatatgtat gcacataagc ataagcaaga agcgctctcg cgcagatcaa gtaaacaacc 240 cactaaccca gtacatctaa gattttctca caccaattgt ttcagtctta agctgatatt 300 caatcaataa aaacgcaaag cggattcatt cggagttttt atttatttcg gcgttgatct 360 tagttcaaat acggatcatt tattcgactc aaaacaaaaa ctattttact 410 // ID G5_DM standard; DNA; INV; 4856 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063504; G5. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4856 BP; 1320 A; 1338 C; 1088 G; 1110 T; 0 other; acttcgtttc gcagttgcga taagccggac gcgtttttgg tgcagtgaca cagtttggtt 60 taacaaattt acggacaagt ccacaactgt ggcacgaagt gtagtgatcg agaacataat 120 caaattatgt gttggcaagt gaaagtgtgg cgacttttgc tagccagcgt cgcttttctt 180 tacttggtga ggctggctca agactcactg aaaaacttca cgaggcgatc agctgtgagt 240 gctttaagct gtaagagctt taagccctgg ctgtttcgtg tccacggtac actactactt 300 tcaaaccgcg atcttcaaga tctatctggg tcgtgtctta gatactctgc cgctaaaact 360 ttaagcttct agctggttcg agtcgtcgct aagcattttt ctttttgaaa atggataacg 420 gtccacagca aaaatctatc gactacgagt cgatgatgct aattgctggc gcgggcccaa 480 aagagatacg ggagcaaatg aaaaagagct ggaaaagtga aactccagcc tctgctacta 540 ccggtatttc aaacacctac tgcagttcga ccttcacact gacctccacc agttcgacta 600 tctcgtctgc gtacatcttc acctcctcct tgagcggcaa catatgctct actgctgcca 660 gtacaattca caaaagtgca tctgtcagct caatcgcgaa gccacaactt gccaccggct 720 ggcacgacgt ctcctttaaa tcaagaaatg gtggaatcaa aaagagagca ggcgcagtct 780 tctcccctaa aatggcaaaa aaggttgcaa ctgatatgcc ggcaaaaatc tctgccaatt 840 gcttccaagt cctaagcgac gatgaagata tggtcgtggg agaggcatct tcgagtgacg 900 acgacgagcc atgctcctcg aataccgccc tcaaacgtgc cgcaaggaaa gcaggaccaa 960 aacaacagct gaaaggtgct caccaaactg taccagccgc cccgaaatca agccgtcact 1020 ctaaggtgcc tagaatgatg tttcccaatg tggttaactt cacagcattt cgcagcgagc 1080 tggacgccct tgtgggcgac tcctatacaa ttaaggtcct aaattcggga gactgcgctg 1140 ttcagtgcaa ctcccctgat agctacaggc tggtggcccg gcactttctc gacaaaggat 1200 ctctttttca tcaccatcag ctaccggagg accgacccta caaaatagtc atgcgcaaca 1260 ttcaccacgg agtcccatcg gaggatatca tcgcaaccct ccagaacgaa ggacataacg 1320 ttgtacggat ttatacccct cgcaacaagg caacctctct tcctcttaat atgaggttca 1380 ttgacctcaa aaaggcggag aacaacaacc aaataaaggg catatctgtt gtctgcaggc 1440 acagggtaat atgggagaaa ccccgtaagc aatcggagcc gattcaatgc cacaggtgcc 1500 aaggctatgg tcataccaag gcatattgtt ctcggcacta catatgcaga gaatgcggag 1560 aaaatcaccc cactgcagag tgcaagctgg aacaggacga ggccagattc tgctttcact 1620 gtggtggccc ccatgcagcc aactttaaag gttgcaaaaa gtacctgcta gaggcttcta 1680 accgcaaaaa ccaacggaag gtcaatgaac cctctggttc tggtcctgca agaggaccac 1740 accaaccctg tccaccagcc cacatgtctg gaaagccttc atttgcaaat tttgtcagag 1800 gaagtcagcc cgtcgccaag cctgccgtta ttgtccccca tgcaagcgct aaccttgagt 1860 ccaagctgga gcagctgttt attaggcttg acaggatgat gtcactagtt gaaacgctaa 1920 tgcaactgct cctgcaaact cgcaccttcc cttctgctgc tcaaaatggg tcctcttaag 1980 gtagcagctt ggaatgccaa tggggcctcc tcgaagacca atgagattct cgccttcatc 2040 gagcttcacg aaatcgacat cctcctgctg tcggaaaccc atttcgtttc ccggtccacc 2100 ttcagagttc ctggctttac cctccacact gccaaccacc cggatgacag caaacgcgga 2160 ggtgcggcga tccttatcag atcactcatc tcccacctac ccttctccac cctgtcggaa 2220 aatcacatcc agacagcagt tatccagctg acggcaagca ggggtacttt taacattgcc 2280 tcagtatact gccctcctaa tctcaggtgg acggaggctg acgtcgagct gatcatcgct 2340 caatttggca caaagttcct tgcggcaggt gactggaacg caaagcacag atggtgggga 2400 aactacagga tgtgcactag aggcagggtg ctgttctctg ctctggcggg cgaaggtatc 2460 gacatcgttg caactggaga agctacttgc tatccctttc gagcaagtgc cactccaagt 2520 gccatcgatt tcgggatctc caaaggattc agacagcaag aaatcaacgt gcaactactg 2580 acagaactgt catctgacca tctccccctg ctgtttgagc tggatgaaga cgcccagcta 2640 ttcaaaggtg tcacaaaaat gctgtcacct actgcaaata ctgtggcctt caaggagcac 2700 attgaggcca cagtagatct caacatcccc atagacacct gcaatagtct ggaagcgtat 2760 gtagattacc tcgcagccac aatcgcggaa gcagcacgga gggccacacc tcccccacat 2820 caagctcgtc acacgaccgc aaggagggct cccatcttga gcttggaagc cagggagctg 2880 ctctcccaca aaaggcgcct caggagacgg tacatcgcaa caggagatcc tagcattaag 2940 cagctatact ctagcaccac taacaaactg caccgtttgc tagccaggac gcgacgagaa 3000 aatctggata ccctgctgga gggtactggc ccagataaca acagccattt ttcgttatgg 3060 aggctcacaa gaggtatcaa gagacagccg ttgtttcaat ctcctgtcca gagtcacagt 3120 ggcctctggc ttaagacgga cgatgaaaaa gcgagggcat ttgcttcgca cctgacctcc 3180 accttcatgc ccttcaacct aacagacgac tcaaatcgcg tagccatcat taattttctg 3240 gacactccga ctgcaccggc tcgccccatt aggcatacca caccgcagga agtcataatg 3300 cagctgaagg cgctgcaaat caagaaaacc cctggttacg atggcattga caaccgtgct 3360 gcgaagtctc tgccacgcaa aggagtccta gccttagtaa aaatatttaa tgccatgcta 3420 aggctagggc actttccaag gcaatggaag cgtgcacgta tcatcatgat acctaaagcc 3480 ggaaagccgc caacgaagat cgatgcatat cgcccaatca gcttgctatc aactttcttc 3540 aagatttttg agagaatact cctagcccgc ctgatggaac tgccgcaagt agtgaaccac 3600 atacctcgac accaatttgg cttcaggaag tcccacggct gccctgaaca aatacatcga 3660 ctggtaaacc aggtgacgca tggcttcgag cacaaactct acacggtcgg cgtctttcta 3720 gacgtgaagc aagcgtttga cagggtgtgg catgagggcc ttctatacaa gatgaaagct 3780 ctccttcctg ctccctacta tgccatcctg agatcgttca tctctcatcg gactttcgac 3840 gttgcagtgc gtgatgctcg gtccagcctg gaagagattc acgcaggagt tccacaggga 3900 agtgttcttg ggcccttcct gtacaccctg tacactgccg acctgccatc tcccgccaac 3960 aacaccgaag tctccccgga tcagctgctc ctggccactt atgccgatga caccgccatg 4020 ctggcgtctc atcctgtact gcaaactgcc tccaacgcag tccaggaatg gctgcatgca 4080 gtcgaaaaat ggactgccaa atggaatgtg gccattaact cctcgaagtc agcctgtgta 4140 acttttaccc tgcggcccgg aacttgcaca gatctgactt ttgatggaaa ccccatcaac 4200 aatgtcacat cgcactgcta cctaggagtt catctagatc ggagactgac ctggagggcc 4260 cacatcacgt cggtcaaatt caagtcactg gcgaagctaa aaaagctcga ctggctcttc 4320 cactccagta aactacaaat gagctctaag gctcttttaa tcaaagccat acttgctcca 4380 acgtggagtt atgccatcca ggtgtgggga actgccgcca aatcccagct caataggctc 4440 cgtgtggtcc agtcgagagc tgcacgtcac gcatctgggc tcccctggta cgtgacgaac 4500 caggtaatcg aaagagatct gaaagttacc cctcttgggg atcagattaa ttttcacagc 4560 agccgttatg ccgacaggct tatggtccac ccgaaccgac tagcaaatat cctagctaat 4620 cccatctccc tccgaaggct gaagagggta caccccaccg atctccctac ccggaggata 4680 gtataaaata ttacacatta agtaattaat aactagaaaa atttttcctt ttctacttaa 4740 taattaagaa accacttggt ctcatataca tttatacaca caataggtta agttcagagg 4800 aattgactga acgattcctt ttggatccaa aataaataga atgattattt aaaaaa 4856 // ID G4_DM standard; DNA; INV; 3856 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063505; G4. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3856 BP; 1447 A; 760 C; 655 G; 993 T; 1 other; cgaagttcaa agaaaaaaga attccctgga caattcatca agcaccagcg caaataaatt 60 cgctctctta agcgacggat tgcccgacaa aacaggaaac aaatataaca aaaacgaaga 120 tctcgaaatg gtaaatgaag acagcgcaac tgattctgct aagcccccac ctatcattct 180 gtctgacgtc aatgatataa gtgaaatgct agcctatctt aattctaaaa ttaaaagaga 240 acttttttat tataagactc aacgttatgg acatgtaaga gtaatggtta aaagtattga 300 agaattcaga aaattagtca aaacacttaa taatgattgt gtgcagtacc acacatacca 360 gcttaaggat gacagagcat ttagagttgt tattaagaat ttgcattttt ccacaaattt 420 agatgaaatt aaaagtgatg aagagtcaaa aggtcatgtt gttagaaaca taagtaatct 480 taaaagccgt gcgaccaaaa cgccactgaa catgttctat gtagatattg agcctaacaa 540 caaaaaccga gacaatgtaa aacacatagg taacgctatt gtcaatattg agcctccccg 600 caaaaacaat gaaatcgttc aatgctatcg ctgccaggaa ttcggacata ctaaatcata 660 ttgtaccaaa acgtaccgct gtgtaaaatc ctcttctcga cacccaagca acatctgccc 720 gaaaaataca gaacaaccag caaaatgtgc caactgctac gaagaacacg ccgccagcta 780 caaagggtgt agaatttatc aggaactctt gtccaaaaaa atcagctacc aatcgaaaat 840 ccctgaatag caataaagac ctgagtaaaa atgatttagg aatccagcaa agtttgctcc 900 gcctaacaag ccaacatata cccaacaaag taacgattat caatcttatg cccaaattgc 960 tgcaggaaat agcaaaacaa acacatcact ggagagaata gagcaactat tagaaaaaca 1020 gtctgaactt acaaataatt tacttaacat gataatgtta cttgtaaaca aattatgcaa 1080 gtaaatctta accttggaat atggaatggc aatgggctkt ccagccatgt aaatgaaata 1140 tctttataaa aacgaatgaa attgatatta tgttaatttc cgagacacac tttaccagca 1200 aaccatatat catggttgtt ggatatgata taattagagc tgaccaccgt tccttttaat 1260 tagacctttt aattagacgg ctaaaattag acggcctaaa atttcaaatt atggacagca 1320 ttagagaaaa tgctatgcag gctgccacgg tcacgatcaa atgtatgcat gcggacgtat 1380 ctgtaacagc aatttatctt cccccaagat ttgctctcaa agaagcagat tttaagaact 1440 tttttcaaaa acttggacca cagtttatat tgagaggcga cttcaatgat aagcacccct 1500 ggtggggctc cagattgaca aatccgaagg gaagcgaact gtacaaatgc atagtaaaca 1560 atagcataac aacattttca accggcaagc ctacatactg gcctactaat agtagaaaga 1620 taccaaactt aaaagacttc gtagcatact ttggaatccc agaatctcac atgcgaataa 1680 tggaaagttt tgatctaagc tcagatcatt ctctaataat agtgacatac agtacagtag 1740 ctcatatatt gacaaaacca tacaaagtaa tttctgcaaa tacagatatc aatgcattta 1800 aaagttatct ggaaacagat aagatagacc atgctgtgga gctactcaca gaacaagata 1860 aagtaagcta tatatgtacg aagctaccag ccagaaactc acaatcaaat cagctctatc 1920 tctcagctga aatccgacaa caaatacaac acaagagaaa tttgcgtaaa agatggcaag 1980 aaactctcta ccctgccgac aaaagatcgt ataacaaggc tgcatctgat ctcaaaaaac 2040 tactgtcaac tttaagaaat gaatctctcg ctgaatatct tagaaatcta gatccacatt 2100 cttgtaacca cgaacataat ttatggagag caaccaaata tctcaagcga cctgcaaaaa 2160 gaaacacagt agtccgaaac tgtaatggcg aatggtgtag atctgatgat gaacaagcca 2220 aagcatttgc tcaacacctg cactctgtat ttcagccaaa tgatattgat aacccgcaaa 2280 cagaaagaga agtagataac tttctcgagt caccctgcca aatgagctta cccattcgta 2340 aaatcagtat taatgaagtt tcatcagaaa ttaaatggct gaatagtaaa aaggctccag 2400 gttcggacaa aatagatggc ataaccttga aaattctacc accaaagtgt gtacgatttc 2460 taacgtttat atttaatgcg atgttaagag ttgaccactt cccaagccaa tggaaatgtg 2520 cagaaattat aatgatcctt aaaccaaaca aggcagaaaa tgaagtgaca tcgtaccgtc 2580 ccattagttt gttgtcaata ttttctaaag tatttgaaaa aatactttta aagagaatgt 2640 tgccaatctt ggacgaattc gctatcatac ccgaacacca gtttggattc agaagaggcc 2700 acggaacccc tgagcaatgt cacaggatta taaatgaaat tttgtcagca tttgagagca 2760 aaaaatactg cactgcaaca tttcttgacg ttcaacaagc gtttgatcga gtctggcatg 2820 acggcttatt atataaaatc aaaaagtggt tacctgcacc atatttttta ttattaaagt 2880 catacttaac caatagacac ttttatgtgc aacaaaaaaa tgaatactcg cccttgcatt 2940 ttataaaagc tggagtccca caaggaagcg tcttaggacc tgtcttatac accctgtaca 3000 cggcagatat gccggtaaca aatacctgca ctgtggcaac atacgcggat gatacagcta 3060 tattagctac cagctcatct aaagaggaag cctcacaact cctgcaagca gagctacgcc 3120 ttattgaaag ctggtttctt ctttggaaaa ttaaagtcaa cgccctgaaa tctgcgcaaa 3180 taacttttgc attaagaaga ggtgactgcc cagaagtgtc atttaatgga tcagcaatcc 3240 cacaaagtaa ttgcataaag taccttggct tgcacctcga tcgcagacta acgtggaaaa 3300 accacataaa agcaaagcgc cagcaactaa atcaaaaaag tttgaagatg acctggttgc 3360 ttggccgaaa atctgcaacc actctggaaa ataaagtccg tttatacaaa gctatactaa 3420 agcccgtgtg gacttatggc atacagctct ggggtactgc cagcaactca aatattgaga 3480 ttctacaacg ctaccaatca aaaatattaa gacaaattgt taatgctcca ttttatattt 3540 caaatgcaag tatccataaa gacttaggaa tcccttatgt taaagaagaa atagcaaaac 3600 atagtaaaaa atatatagac agactaagaa cacatgaaaa taacttagcc ttaagtttgg 3660 taaataataa taacaacgtc agaagactta aaagatttca cgtgctagat ctccccgaca 3720 ggtattaagt attaaaacat gttatgcata gagtatgtag tagggattaa tggataacta 3780 ctgcctctgt taatttaaga ttaattataa aatttactta ttgtcatact tttttaagac 3840 agattgtaaa taaaaa 3856 // ID ROOA_LTR standard; DNA; INV; 7621 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063394; rooA. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC Assembled by Michael Ashburner. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 7621 BP; 2712 A; 1444 C; 1652 G; 1808 T; 5 other; TGTCTAAATA TGTGTTTAGA CATGATAAGT AGGCAAACTA TAAAAATGTT CTATTTATGG 60 GCTGCAATAA ACATGTCACy GGACAGCATA AGTGGCAACT ACAGATAAGT ACGATTGCAG 120 CGGCCTATTG CCGAAGTGTC AAGAGATATG ACCAyGCGGG AGGTGATTAG CGCGGTCATA 180 GTCCTCAAAC ATAGATTTAA GAATAAAACT CAGCTGCATT TACCAACGCA GACTGCGGCG 240 TCTTACAAGC GCTGCATTAT ATAATTAGAT GATAAGAACC TATGTAAGAA TGAATAAAAG 300 GCGAAGCCCT CGCAGTAGCG AGTCAGTTAG ATTCAAACAC CCGAATTGAA CTCATTAAGT 360 GTACGCACAA GTTTATAGTG TGAACATTTT CGTCCTTTCG AGAAATTCTG TTGTTTTCCT 420 CCACCAGTGG TAAGAAACAC AGAAGAAAAA ACCAGCGCTT CAAAGTAAAA AGAGCAAGGT 480 TATTCGAGTG ATTCTGTTGT TTCCCCCACC AGTGGTAAGA AACACAGAAT AAAAGACCAC 540 GCCTTAAAGT ACTAGGACTA TAAGGTGAAA CATTGTGTTC GTGCTTTTCC TTGGCTGATC 600 AATCAGCTGT GAGTCGAGGC ACAGCTAGGT CAACTGGGCG ACCAATCAAA AAATACTCCA 660 ACGGATCACG CCAAAGAATA CAAGCAGAAA GCACAAAAAA GTCACAGTGA TTGTTTTCCC 720 AAGATGTTGT CGGAAAGAAC TGTGAAATTG ATGCAAGAAC AAGGAGAGCT GCAGAACAAA 780 ATACTGCAAG CCATCAAGGA AAAGGCATCT ATACCAGCAA TAGAAGTATT GGTAGTCCGA 840 TTTACTACTA ACAACAATGC GCTGGTAAAA TTGGGAGTTG GCGATCATCA ATATTTTAAT 900 GATAAAATAT TTATTAAAAC TATGGATGCT ATCAAGGGCT ACAAGGAGTC ACAATTAAAG 960 GTAAAAACAC TTGAAGGGAT AAATATACCA AGCGGATCTA AATTATCTGA TACCGCATCA 1020 GCTCGGAGCT CCAGTCCAGT TGATAATCTG GTTCAACTGG TGGATAGAAG AGCGGACAGG 1080 GTGAGGCAAC AGATGAGCCT AATCTACGAA ACCCTAGTGA GATTGAACTA GTCAAGCAGC 1140 TGGCAATATG AAAAGTCATT GGTCCAACGT GACGTACACA TGTAAACAAG TATGACACAG 1200 AACTGCCTTT GATCAAGATG AGGCAGACAT TCTGCAATCT AAAGTTGCCG CACTGGTACT 1260 TCAAATGAAG TTGGAACAGT TCAAAAGTTC CAACTACGAC GTGCCAGAAC TCCCAAAAGT 1320 GGACCTTCCA ACATTCAATG GAAATGCGAA GGAATGGCCA TCATTTTACG AGCTAATAGA 1380 CAGCAGGAAG GATCTCAGCA ACACAAGGAA GCTGGGATAT TTAAGAGCCT GCTTAAAAGG 1440 AGAAGCTCAA ATGGTGGTTA GCCATTTGAT AACGGGATCA GCGGCTAGCT ATACAGCAGC 1500 GTGGGAGCTT ATCTGCAAAC GCTATGAGAA TAGCAGAAAA ATATTCTCCC AACACTTCAA 1560 CAAATTAATG GAACTGGAGT GCTTGCTGCC CCATGATGAG AAAAATTTAA GGAAGTTTTT 1620 GGATACTGCG ACCGAGAGCA TATTCATCAT AAAGGAAAAA GGAAAAATAG AAAGCTCTGC 1680 TGACGTAATT TTAGCAGAAA ATTTTCGCCA GATGCCATTC AGCTGTATGA ACAGCATGTA 1740 AAAAAGGCAA GAGCGTCCTT ACAAGACGTA CTGGAGTTTA TTGAGCAACA ATACAACTCA 1800 GTAAATGCCA TTACCAAAAA TACCGCTCAG CTTGCTACAA GAAAAGTGCA AGTTAGATCG 1860 TGTGCCTTTT GCTCTAAGGA TGGCCACGAT ATGATAAAGT GTCTCAAATT CAAAGCACAA 1920 TCAATCGAAA AAAGAAAAGA ATTCGTTCAA AAGAACAGTA TGTGCTTAAG ATGCTTTGGA 1980 AAGCATAATG CTATCGACTG CAGAAAGGAA ATTACATGCA ATCGATGCTC CAAAGGGCAC 2040 AACAGCCTTC TTCATGAAGA CACAAAACGC AGTATCAACA GCAATAGCCT CAAGCAAGGC 2100 CAAGACACAC TATTGGCTAC AGCTGTTGTT TTAGTGAAAA ACAAAGCTGG AGGTTACAAC 2160 GAGTTGAGGG CGCTTATTGA CGGTGGATCC CAGAAGACGC TGATTTCAGA GGAGGCAGCA 2220 CAAATATTAA GGATTCCCAG AGTAAGGAAT ACTATAGAGG TCGAAGGTAT CTCCCAGACT 2280 ACTCAATTAT CAAAAAATTG CGACCACCTG ACGATCAAAA AAAATATTCC AAGCAGCTTC 2340 AAAACATCAA CGGAAGCATT GGTTTTACCG ACACTCCATA GAGCCCTTCC CAGCAAAATG 2400 TTTGATATTG ATATCAACAA AGAGTGGAAG GGCTACAGGC TAGCAGATCC AAGATTCAAC 2460 GAGCCAAGCA GAATTGACAT GGTGATAGGT GTGGATCTAT TTCCCCTGAT TATGATGGAG 2520 AAAATAAAAA CCGTGAATGG AATCTTGGGA CAAAAAACCA GATTTGGATG GATTGTGTCC 2580 GGAAACATAA CTCGAGCAGC AAAGCAAAAA ATTATAAGTG CCACTACAAC AATAAATCTA 2640 AAGGACCTGG AACGCTTTTG GGAATTGGAA GATGAAGCCG ATGAGACGAT TACAGACAAT 2700 GCAGAATGCG AAAGAAAATT CCAAGAAACA ACTGTCACCA ACGAGGAAGG CAGATTTGTG 2760 GTTTCAATTC CATTCCACCA ATTACACCAC AAAGAGGCAA AGCTGGGAGA CTCTCGCAAA 2820 CAGGCAATGG CAAGGCTTAT GCAAATGGAA AAGAAGAATC GGGCTGCATA CAACGAATTT 2880 TTCAAAGAAA CCCTTAAGAT GGGACATTTG GAATCTGTAA AGACAACGGG TCAAGGTAAA 2940 TACTATTTAC CCCATCAAGC AATCATCAGG CCTGGAAGCT TAACTACGAA GCTACGAGTA 3000 GTTTTTGACG CATCCGCAAA GACGACAAAT GGACTAAGCC TAAATGACGT TATTATAGCT 3060 GGTCCTAAGA TTCAAAAGGA TATATTCGAT ATTCTAATTA ATTGGCGCAA GTGGCAATAT 3120 GTTATGGTAG CTGACATTGA AAAATCGTAT CGCCAAATAA AGGTTGCTGA GAAAGACCAA 3180 GAATACCAAT ATATCCTATG GAGAGATGAT CCAAAATTGC CGATCAGTGA GTTTAAGTTA 3240 ACAACCGTAA CTTATGGCAC ATCGGCAGCA CCTTTCTTAG CAGTCCGATG TCTACGAGAG 3300 TTGGCAGATC GCTTTTGCCA AGAGGATAGC GTCTTAGCAG AAACAATTAG AGACGACTTT 3360 TATATGGATG ACATCATAAC TGGTGGAGAC ACAGTCAATG AGTGCTACGA ACTTCAAAGG 3420 AAATCGAGAC AAGTGATGGA GAAGGTCGGC ATGCATCTGC GAAAATGGGT TGCAAATGAC 3480 GAACGTATTT TAGCTGACAT TCAGGACGAC GGTGCTATGG AGAAAATCTG CATTGAGGAG 3540 AATGAATCGA TCAAAACCTT AGGACTTCAA TGGGATCCGA AGAAGGATAC GTTTACGTTT 3600 TTGGCAGAAA ACCCAATGCT AACACGCATA ACAAAGCGGT TAGTGTTATC ACAGTTGTCC 3660 AGAATTTTCG ACCCACTAGG ATGGTTGGCA CCTGTAACGA TTCAAGGCAA ATGTTTCATT 3720 CAGGAACTGT GGAAGTTACC GATGACTTGG GACGTTGAAT TGGAATCCAA CTTAGCTAAC 3780 TGGTGGATGG AATATGCTAA AGGTCTATCA TATTTAGAAG AAATTAGCAT TTCACGCTGG 3840 ACTGGATGCT CCAAAGGTAT TATGGAGCTA CATGGATTCT GCGATGCATC AGAGAAAGCA 3900 TATGCAGCGG CTGTGTATAC AAAAGTAGGC GGCAGAGTTA CTTTGCTAGC AGCAAAAAGC 3960 AAAGTAAATC CTATAAAAAA CAGGAAAACA ATTCCAAAGT TGGAATTATG TGCTGCGCAT 4020 TTATTAGCAA AGTTATTAGC GAAAGTGCAG GCTATATGGA GCAACAAGAT CACAACGCAT 4080 GCATGGAGTG ATTCGCAAAT TACTATTGCT TGGATACCGA ACAAGCGCAG CAAAGATAAA 4140 TTCGTCAGAA CTAGAGAGGA AGAAATTAAT AAACTAATTC CCAATGTCAA ATGGAATTAC 4200 GTTAAATCGA AAGACAATCC AGCAGACGTG GCTTCAAGAG GGATATCACC GCAAGCTCTT 4260 AAAATCTGTG AAATTTGGTG GAGAGGGCCG AATTGGCTAG CTATAGATGC ACAACACTGG 4320 CCCACTCAAA AGGAATCGGA AATTGTTGTG GTATCCACAT TGATAAAATC CGAATATCTG 4380 CAAAATCATC TTTTATCGAA GTATTCATCG ATCGACAAAC TTCTTAGAGT AATGGCGTAT 4440 GTATTACGCT TCATAACAAA GCTGAGAGGA AAATCGCAAC AGCCGTCACA TCTTACGGCA 4500 GAGGAATTAA AGCTAGCAAA GATTGCCGTG GTAAAGATAC AACAACAGCT GGATTTTGGA 4560 CACGAAGTCA GACTACTCAA AAACAAAAGA CCATTAGACC CAAAGAGTAA GTTACAGGCG 4620 CTAAATCCGT TTTTGGATAG TGATGGCGTA CTTCGAGTTG GTGGACGATT ACAAAACGCA 4680 ATGATACCCT ATAATGTAAA ACATCCAATT ATACTGGACA AGTCACATTT GACTTGGTTA 4740 ATTGCAAAGG ATGCTCATAA AGAAACTCTG CATGGCGGAA TTAACATTAT GAGAACTTAT 4800 ATTCAGAGGG AGTTCTGGAT ATTTGGCATA CAAAATTCCT TAAAGAAATA TTTAAGGGAA 4860 TGTATTGTAT GCATACGATA CAAGCAAGAG ATGTCCAGTC AACTGATGGG AAATTTACCA 4920 GTTTACCGAG TAACGACTGA TTACTCGTTT CAAAATACTG GAATCGACTA CGCCAGACCG 4980 TTCCAGATTC GCTGCTCAAA GGCAAGAGGT CAAAAAACGT ATAAAGGATA CATTTGTGTA 5040 TTTGTTTGTA TGGCAACAAA AGCAATACAT CTGGAAGCTG TTAGCGACCT TTCGTCAGAC 5100 AAATTCCTGG AGGCTCTTCG ACGGTTCTTT GCAAGACGAG GCAAGAGTGA GAACCTATAC 5160 TCAGATAATG GAACAAACTT CGTGGGAGCT TCAAGAGTAT TGGACAAAGA ATTTGTAGCT 5220 GCCATTAAAA ACAATAATGA GTTAGCACCT ACTCTAGAAA AAGAAGGCAT CAAGTGGCAC 5280 TTTATTCCCA CGGGAAGCCC CCACATGGGA TGTTTATCGG AATCCGGTGT AAAATCAGTG 5340 AAGCATCACC TTAAACGAGT TATTTGTGAA AACAGCTTTA CATATGAAGA ATTTGCATCG 5400 TTGCTATGTC AAATCGAAGC AGTGCTAAAC TCGCGTCCAT TAGTCACTGT AAGGAGCGAA 5460 AACGATGGTC AGTACATATT ACCGCCGGGT CATTTTCTGG TGGGAAGACC TCTAATTGGA 5520 GCTCCTGAAC ATTTTGGAGA AAGTAAGACA ATCAGCTCTT TGGATAGATG GAAGCTTATT 5580 CAACGCATCA GAGGTGATTT TTGGAAGAAA TGGAAAGAGG AGTATCTGGT GTCATTGCAA 5640 CAGCGAACCA AATGGCGCCA AGAAAAGCCA AATCTGAAGG AGGGACAGCT GGTTCTTATA 5700 AAACATGAGA ACACTCACCC TGCAAGATGG CCTGCATAAA ACAATCAGAG GACTTCCTGG 5760 GAGACTTCAA GGACTACTGC GATTTCTTCG GCACCACAAT ACGGACAAAA ATTGACAACA 5820 TCAAAGAAAA AGACAAAATA CTACGGCACC GTACCAACCG AAAGAAAAGG TTTATACTGT 5880 TCTTTGATAT CGGAAATGCA AATAGAATAC AGGAAAATAT GCAAGCGATC ATAAAAAACG 5940 AAAAACATCT AATGGAATCA TAGCTACTGT GCTATGCTGT ATGTTGACAA TCAGACCTCG 6000 GTCATCAATG CCACGGAAAA ATTAGTAAGA CCAACTACGA TGGAAGCAAA CCGGAATTTG 6060 GGAAAACTGA CCCAACAAGT CAACATTATT GCAGAAACCA TGAAGGAGCA CTTTATGGTA 6120 TATAAGGAGT CAATTAAATT CCTTATGTTA TCAAATCAAG TGCGAAACTG GATTGAACAG 6180 GCAGAAAGCC TACAAGCAAC AGCGATCTCA ATGATAACGG ACATTAGTGA AGGAAGAATT 6240 CATCCTACAC TAATTGCGCC TAACAAAATG CTGGAGGAGT TCGAAAAAGT TAAGCAAAAA 6300 TTAGGACGAA ATCAAATGCT ACCGAGTGGA AATTCAGTTA TACAATTACC ACTGATCTAT 6360 AAACTGATGA AGGCCCAAGC TATGTTGTGG GAAAATTTAC TATTCATCGA AGCAAAATTG 6420 CCGATATACA ACAATCAGGA AACGGATCTC TTTGAAGTAA TCCCAATACC ACTGTGGACA 6480 AACGGAACAA AGCTTATTCC GAAATTGAAT TCTACATTTT TTGCGTATTT TTTTTTCAAT 6540 ACAGACATAA ACGCATATCA CCTAATGTCT GAAATGGAAA TTAACCAATG CAGACAAGAG 6600 GATTCGACAA GGCCATGGCT TTGCCAGAAA ATAATTGGGC ATGGAAAAAC GCGGATGATC 6660 ACTCTTGCGA AwTATCACCA TTGAAACCAA GCAAGGCACA CTCATGCGAA ATGATGGAAT 6720 TCCAAGGCAA TTCGTTCATC AAAGAGATAA GTGGATCCAA CCGTTGCCTA TTTAGACTGT 6780 TTCGGAATAC AACAGCTAAT ATAAGATGCA ACGAACAGCA TCAAGACAGA ACAGCTTATA 6840 AGACTGCCCA ATCAAGGCAT TATACAACTA CTTGCAGGAT GCACAGCAAT ATTAGGGGAT 6900 ACAACAATAA TTACTCCTCA AAAAGTAATT TCGACAGCGT CTGAAATGTC TATTATCTTT 6960 CCCAGTTTAC GAATTATAGA CGACAAGGAG AAGGAGGCAT GGAACGTGGT CCCGCTGAAG 7020 CACTTGATTG TCAACAACAC TAATGAACTG CAAAATCTTC AAATGCGCAT CAAGACTCTG 7080 AAAAATAACA AGGTACACAT TGATGACTTG ATTTTCCACA CGGCAAGCGG ACACTCGGCT 7140 CTAGGGTTGA CAACGATTAT CATAATTATA TTGGTCATTT ATATCCGGAG GCAACGCATA 7200 AATGAGAGAC GACTACTGGC CGTACACTCA AGGGATGTCT AAATATGTGT TTAGACATGA 7260 TAAGTAGGCA AACTATAAAA ATGTTCTATT TATGGGCTGC AATAAACATG TCACyGGACA 7320 GCATAAGTGG CAACTACAGA TAAGTACGAT TGCAGCGGCC TATTGCCGAA GTGTCAAGAG 7380 ATATGACCAy GCGGGAGGTG ATTAGCGCGG TCATAGTCCT CAAACATAGA TTTAAGAATA 7440 AAACTCAGCT GCATTTACCA ACGCAGACTG CGGCGTCTTA CAAGCGCTGC ATTATATAAT 7500 TAGATGATAA GAACCTATGT AAGAATGAAT AAAAGGCGAA GCCCTCGCAG TAGCGAGTCA 7560 GTTAGATTCA AACACCCGAA TTGAACTCAT TAAGTGTACG CACAAGTTTA TAGTGTGAAC 7620 A 7621 // ID JOCKEY2 standard; DNA; INV; 3428 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063425; jockey2. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3428 BP; 1080 A; 784 C; 703 G; 861 T; 0 other; ccttcgttgc attcggacgt tcgcgattgt ctagtctttt tgtagtgaaa catcaacaaa 60 tcttccaatc ttactgtgca cgcgcacaat ttctatttct atccactgca cagcagtggt 120 gacagacaat tacggtactt atcatagata agctcaacca agccgacgct ttgtacgccc 180 ctcgtctacg ccttgggcaa ggctgttaca gtttgttctc cgcttgttat ctctcgagct 240 tgcgaagttg acaactagta gtaacaccaa tgttaccaac accataacca cgactcaagc 300 tagctgttcg caggcaagaa gtgctgcaat gtctgatgcc atatttttgg gcactccaaa 360 aactattgtc tccaggatcc aatctgtggc aagtgtgctg ggccccacgc aactggctct 420 acgctatgca cgagtgacaa ttatttgtgc atcagctgcg gtggtgatca tgcttcaacg 480 gacaaaaact gccctgtcag aattgaaaaa ggaaagaagc tctagccaaa cccaaggctc 540 cctacttttg ttgctcatgg caacactgtc agctctaaaa aaggtgaaag ctatactcac 600 ggattcattc cggctgaggc catcagaagc aatatttcaa ccgccgatat tgtacaggcc 660 aaaggccctc aatttggttc acttgcgact caccaacacc acagtgatcc aaactcacca 720 caagacttag gctttggaaa aaagtttcag gcgctggaaa gcgccatcca ggatataaac 780 tcgagaatgg acaaaatctt caaactaatt ggggagactg tggaatcaaa aaaggttttt 840 agagaactgg ttcagatctt catatcacgg acgtcaaaat gacccaacca ataataaaaa 900 tcggattgtg gaatacacga ggactagcta gaaactctga ggagcttcag cttttcctaa 960 ggatgcaaaa tattgatatc atgcttgcca ccgaaacaaa cgtgcgggaa ggacagcgta 1020 tctctctacg tggttactcc acctacgacg ccagtggaaa cagcagaggt ggagccactg 1080 tcatagtgca ctcttccttg aatcattgtc cccaaccccc aatctcaacc aacgaccggc 1140 aggtagcgag cgtgcagctg cagacctcag agggaacagt tacttaccca cccagcgagc 1200 ggttgataaa agcttatatt gagtaacttt ttgtaacact tggtggcata tttatgcagg 1260 tggagactac aatgcaaaac atcattggtg gggaatctca agagcctgcg ccagaggttt 1320 acagtgcgtt atttaaaaat aaataatatc aattgtaata atggatatta aaactgtgat 1380 tcgctcccta tccagagaaa aagctaaagg aatcatccaa ggaatttgca agaacaggag 1440 atttccgaac ccagcagatc tacagaaggt tatcaaatcg gctcaataaa gtgctgaccc 1500 aaagaaaaca gctacatata tacaaccttt tggattacat gggcaccgat gccccttcac 1560 aattctccct gtggagaatc acgaaatggt ttaagtttca ggcttgtcag aaatccgctg 1620 caaggagccc gtctggtggt tggtccgcac atcacaggac aaagcagaag tgtttgcaag 1680 gagcttggag caaagatttt agccgctcgc atatgccgat gcgaatcatt gccgtttggt 1740 tgcagagtct cttttaacgc catttcaaat ggcactgccc gcggatcctg tcactctcga 1800 agaactgaaa caacttgttt ccttgttgaa ttcaaaaaaa aaccgcggtt catgaccttc 1860 tcgacaacag aacgataaaa accctatcag accatgctct gcaatttctg gcacaaaatt 1920 ttaacagcgt tttacttgag ggttatttcc caaaagtctg gaagactgca aacataatcc 1980 tgatactgaa accagggaaa aggccaacag aggtagactc gtatagacca atcagcctcc 2040 tcctttccct tggtaaaatt atggaaagga taattctcaa cagaatgcgt gacgttgagc 2100 ctgtagtatt ggcgatacct gatttccaat tcggattccg gactcaacat ggaacatccg 2160 agcaactcca aagagttgtc aattttgcgc tagaagccct agaaaggaaa gagtatgcgg 2220 tagcagattt tctggacatt caacaagctt ttgacagact ttggcatcct ggacttctac 2280 ataaagcaaa gaaaatacta acaccccagc tattccagct tgtgacgagc tttttgttag 2340 gacgaacttt ctgtgtgacg accgatggat gcacttcgtc cgttaaagcc atcgaagctg 2400 gagtacctca aggaagtgtg ttaggcccta ctctatactc catcttctcc gcggacatgc 2460 ccacccagac agcagtcact ggattagatc gtaaagatgt gctgattgca acatatgctg 2520 acaacacagc ggtactgaca aaaagcaaca gcattatcga ggcctcagac gcactgcaag 2580 agtacttgga cgcattccag aattgggcag agttgtggaa catctgcatc aatgctgaca 2640 aatgtgcaaa cgtcaccttc acaaagcgca taggtagttg ccattctgtc tcacttaaag 2700 aaagagtgct tgaccacaag tcatcataca aatatcttgg agtgatttta gatagaagct 2760 taaccttcgg aaagcatgtc acggcgatac agcagtcctt taaaaataag gtttctaaaa 2820 tgtcctggtt aattgctgcc cgcaacaagc tttccctcgc caacaaggtt aaaatttaca 2880 agagcatatt agcttctggt ctattctatg ccatacaggt gtatggaatc gctgccaaaa 2940 cgcacctgaa taaaattcgg gttcttcagg caaaaactct gaggaagatc acaggcgctc 3000 cttggtacat gaggaccaga gatatcgaac gtgatctgaa tgtgacaaaa attggagaca 3060 ggattcagga aatagcgaaa aaatacaatg acagactaga atctcatcct aatagtctag 3120 ctagacgact aagtatcgca gcccaaagcc atcggacaag tacgagaaga agattaaaac 3180 gccaccaccc tcaagacctg actgaccggg acttgacata aatcctactt aattaaaatc 3240 ttctaaatat tactgcctaa acattccatt tctttaatac tcataatcat ataatctttg 3300 taaataactt agatattaag aatcgtttaa attgataccg attagattaa gttgtccaag 3360 aagggtaata gtacgctagt caattaaata aaataatttc gtttaataaa ttaattaaaa 3420 aaaaaaaa 3428 // ID G6_DM standard; DNA; INV; 2042 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063503; G6. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2042 BP; 609 A; 617 C; 462 G; 354 T; 0 other; cagtcgcgat cgaacactcg acgagtgcag acgtgcctgc ggatcgacag caaattgttc 60 acagttttaa gtcccgttac ttgtgcccag ccacttcgcg tcgcgtgatc ttgtcgcgcg 120 ttttgattgg cccagccaac gtacctaacg gtagttcgca ctaacaccat cgcacacccg 180 agtgtgcgtg ttatcagcgc aaaaaaccca gtgctcgaag cagcggtata tttgcaaagc 240 agcagccacg tgctgcctgg ctcaccggct tacggtgccc agcttccccc ccctcctcac 300 tccttatcaa ctttggagaa gatggactgg caggcccccc cgcgcaccca caagcttgga 360 acaacaccac gcaaaaaggc tctgagaaca cgcaagagca gctccagcag cgagggaagc 420 acctcgcata cagagccgga cgagataaag cgaaaaccgg caaagaaagc acagggagag 480 gagctggaag agaagccaag cactagcgca gctctgcgca agaagctcgc caacaacgcc 540 ttcgctttac tctcgagcga agaagacgag gacgaccaag agagctctga tgacgaaccc 600 ggacctaaag acgattccaa gcccaagacc cccgagaaac caaagcccac cccgaagacc 660 atcaagccac ctccgatttt tatccccgat gtgaccaaca tctcggcact cgtcaagatg 720 atcacgactc ttgtaggccc gaagaacaat tttacctaca agaccgtgaa tggcaacaac 780 gtacgtgtca tgatgccgga caaagagtcc tatacagctc tgcgtctcca acttgtggcc 840 caaaacaaga ggcatcggac tttccagccg aaagatgaac gtgcatacaa ggttgtcatc 900 aaaggactcc accactccac cgatcgtgag gaaatcattg aagaccttcg cagacaaggg 960 cacgctgtta gagatctgca caatcccatt ggcagaagaa ctaaagaacc gctgggaata 1020 ttcttcgcca acctggagcc ttccagcaac aacaaagacg tctaccaagt caagcggatc 1080 tgcaggtcgg tagtaaccat tgaaccgccg cagaagttca acgacgtgcc tcaatgcttc 1140 aggtgccaag gattcggtca tacacagcgt tactgcttcc tggaataccg atgtgtaaag 1200 tgtggaggcc ctcacgaatc gagggcatgt gagaagaggg aggacgacaa agcgtgctgc 1260 ttccactgcc aggcggacca tcctgcgtct ttcaagggat gccctgcata caaaagggcc 1320 aaagcactcg ctgctccgaa aacaaggccc gtcgctaatg ctaacaaggc gccgcccgtg 1380 gcatcaccaa acgtcacctc tggcaggagc taccgagacg ccctcaacgg agtgcacgca 1440 gcaccgcaga atcccacaac cccagtccaa acccaaacag aaaccccaca ctccggtcag 1500 atagaagcga tgttcgctcg catggaagga atgatggaaa ggatgatgga gcgcatgttc 1560 acccagatga cacagctggt ggccaccatt ctcaacagca agtcatgcaa ttaaagctcc 1620 acctagtcgt ctggaatgcg aacggcctgc agaacagcaa ggccatagtc gagcaccatc 1680 tgaagaccca ccagatcgat atcctactcg tagccgaaac ccacttctcc cccagatccc 1740 actcataggg aacttggaat cccacaagtc gctgatgaaa tctccaggct cagcgagaga 1800 tacctgaaaa ggctcgaaaa ccaccctaac cacctcgcca ccaacctgtt agacaatagc 1860 caaacaagca gacgtctcat gaggagacac cctctcgatc ttccacaaca atagacaaca 1920 catataaaac ccgccacaaa tacatgtaca atagtatccc ttaagctaat gttcccccgc 1980 aaaaccattt aattattgtc cactaggaca gattttaaat aaacaaacgc acgctacaaa 2040 aa 2042 // ID LOOPER1_DM repbase; DNA; INV; 1881 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063402; looper1. XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1881 BP; 636 A; 292 C; 365 G; 588 T; 0 other; ttaacccttt cgcgcccaac gttgcttgtg agcaatttct atgttttcag agctaactgt 60 aagcgtgcct tttgttttta tctgctcatt ctttatgata acggtgcttt tatgtgtcta 120 ctctaaatat gaaaagcaaa aaacgcattt ggagattatt tgcagaaaat atttgcaata 180 aacagcacgc gcatcacaat aattattgtt tcgatattcg gtggaacttt tggaattagt 240 aatttacttt tataatactt taaaaagaac ctaaagtata aacaaataat tatttagcaa 300 agagttaaca agacagtgat atagtcaaac ttagttaata gtgaacaata agtaagtaaa 360 taattttcta agttgcttgt gagcaacctt gggcattcgt ggtaataatt ttcaccgaaa 420 gtgaaaggtt tgtctaacga agaaattgaa cgggctttaa acttcgattg ggatatttct 480 actgacgaag aatgtggtga tgagatggaa gacatttcta cggtactgga gaagaattta 540 gaaggaattc ttgaaagggg agaaagtatc gagattgacc ttctggaaaa tacagtgatt 600 gaaggatgtg aacctgactc ttgcgttgtt gttgacagtt gcttggagaa aattgatcca 660 aagacgctaa agtggagaac aaggccgttt gtagctccag agagtatttg ggaagatgac 720 aaaacttttg atgtcgggga gataaagacg ccagtggagt tcttctacac actttttgat 780 actcagctaa ttcatttaat ggcaaggcaa actgatatat atagtttgca ggagcacggt 840 attgaactta aatgtactga tgaggaaatc aaacgctaca ttggcatttt attgtacttt 900 ggtgttttaa aactaccgca attcagaatg gcatggtcaa aggatttaaa gattaccgca 960 ataactgatt caatgccgcg tgggagattt aaaaaaataa aacaatgctt acatttcaac 1020 gacaacgcca aacaattaaa aaaaggggat tgcaactatg ataaactcta caagatccgc 1080 cctttgctca gaattctcaa agaaaatttt ggaaaaacta acgcaggaag agcatcaaag 1140 tgtcgatgag caaataattg cattcaaagg tacgttttta attttctttt aaatttgctt 1200 tatttttatt aattgctttt gttgcaggtc gatccacgct tatacaatcc aaacctcata 1260 aatggggtct taaaattgtt tacgcgggct ggaatatctg gattagttta tgattttacg 1320 ctatatgttg gagaaggcac ttctccttct tatggcttgg gaatatcatc ttatgttgtc 1380 ttatatttgg cagaaagtct tcccaaagac aaaaatttta aactgtattt tgataattgg 1440 tttacgtctg taatccttct gatttcgttg aaggaaatag gaatctttgc aacaggtact 1500 gtacgtatga taaagttgaa cattgggtag tttttggaga aagaggacgt tgcagactgt 1560 gcaaaactgc aacaccgatg accaaatgcc ttacatgcaa agtccatctg tgctgcaata 1620 acaataaaaa ctgttttttt gtcataccac acttaaattg tcattataaa gaaaaatatt 1680 tcatattctg tgatttataa aaaaaaaaca atgcttacac atcactactg cccgacgttg 1740 ctcacaagaa aactttcgct accgcccaaa ctaatgggcg tggcatacta aaattttgct 1800 aaatttttct aaaaataaat gtaaaaacat taatgataaa acaaaatttc acgggtaaaa 1860 agttgggcgc gaaagggtta a 1881 // ID AF418572 standard; RNA; INV; 804 BP. XX AC AF418572; XX DR FLYBASE; FBgn0046701; Penelope. XX FT source AF418572:1..804 FT SO_feature CDS ; SO:0000316:172..>804 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0046700; Penelope\ORF1" FT /db_xref="SPTREMBL:Q95VB4" FT /protein_id="AAL14648.1" FT /translation="MERSPEPSININGRHAVCTATNMSYAKIKTKYKDSKRTINKFQLT FT LVKLTKLKSSLKFLLKCRKSNLIPNFIKNLTQHLTILTTDNKTHPDITRTLTRHTHFYH FT TKILNLLIKHKHNLLQEQTKHMEKAKTNIEQLMTTDDAKAFFESERNIENKITTTLKKR FT QETKHDKLRDQRNLALADNNTQREWFVNKTKIEFPPNVVALLAKGPKF" XX CC Derived from AF418572 (AF418572) (Rel. 70, Last updated, Version 3). CC Michael Ashburner, 15-August-2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 804 BP; 355 A; 163 C; 120 G; 166 T; 0 other; ctcagtaaca atctccacgt caaaaagcgc agacgtgtaa aataattgcc ctgaagacgg 60 tttgccgatg tgcaaccgaa atatatcgga agagaattga ataaaattgt ttttcattgt 120 ttgttttaac aaactcggac ctcgagccag ccaacaaata aatattgaaa tatggaaagg 180 tcgccagagc catcaataaa tatcaacgga aggcacgccg tatgcacagc aaccaacatg 240 agctacgcaa aaataaaaac taaatacaag gattcgaaaa gaacaattaa taaattccaa 300 ctaacactgg taaaattaac taaacttaaa tctagtttaa aatttttgtt aaaatgtaga 360 aaatcaaatt taatacctaa cttcatcaaa aacttgacac agcatttgac catactgacc 420 actgacaata aaacccaccc tgacataaca agaacattga ctagacacac acatttttac 480 cataccaaaa tattaaactt acttataaaa cacaaacaca acctattaca agaacaaaca 540 aaacatatgg aaaaagcaaa aacaaacata gaacaactga tgaccacaga tgacgcaaaa 600 gcgttttttg agagcgagag aaatatagaa aacaaaataa caacaacact caagaaaaga 660 caagaaacga aacacgataa gttacgagat caacggaacc tagccttagc ggataacaac 720 acgcaaagag agtggtttgt aaacaaaaca aaaatagaat tcccgccaaa cgtcgtagcg 780 ttactcgcaa aagggccgaa gttc 804 // ID QBERT standard; DNA; INV; 7650 BP. XX AC AF541947; XX DR FLYBASE; FBgn0063782; accord2. XX FT source AF541947:1..7650 FT SO_feature five_prime_LTR ; SO:0000425:1..223 FT SO_feature three_prime_LTR ; SO:0000426:7432..7650 FT SO_feature CDS ; SO:0000316:2298..3512 FT /db_xref="FLYBASE:FBgn0063781; accord2\gag" FT /protein_id="AAN34649.1" FT /translation="MLGKKANISNTVRGRLGLRSESGLSEIREATEPKEKDQSVQSEDN FT NPTNTMDSGNDTAAISPIVNSSRNTNLSLQQLLTLVHQLPTYEGPPDNLDRFIDRVDQL FT LLLASSVDQTTGGKYLLGTIRDKIKGRADEALNVCDVLLTWDDIKINLKRLYSSKKTEE FT MLVRELHNLPDGLSMGKLYYSAAKIRSDLMSLAREADPSAHSLAVKRDQYDRFCLNTFL FT TGLKDPLSSAIRNQRPETIEKAYEYGQIELNFHRSLNKHQDNRRRDNPFHRNHPYAKGQ FT PTNSDYNNRYIPRQQNYNNGNNARHVPRYDNNSNNNDSRFIQRQQNFNNNNNRRFTRQH FT NSHNDSNGRQDRETLGRNPFHNNEQPDQTLSHQNRNKHNPSAPLCNINDDANFLLEASG FT SQSAT" FT SO_feature CDS ; SO:0000316:3811..5034 FT /db_xref="FLYBASE:FBgn0063780; accord2\pol" FT /protein_id="AAN34648.1" FT /translation="MDILSSLNAKINLAESILETPETAIPILTRANPVDTVYNLPQNTK FT MLLPLPVNLMEGDFIYETTNLDNQVSITGGLNTANAGSAYFEVCNNSDQIQTLYLEEPL FT QAEEFSSQTHQYLNCMTAATDTERQDDQQLNILTEHLNSEEKQYILKLCKSFKRLFHNE FT NNQLTFSNAIKHSIPTTDNIPIHTKSYRYPFVHKDEVRKQISKMLEQNIIKNSHSPWSA FT PVWVVPKKSDNIGEKKWRLVIDFRKLNEKTVSDRYPIPNIADILDSIGKTMYFTTIDLA FT SGFHQIQMNPRDASKTAFTVENGHYEFTRMPFGLKNAPATFQRVMDNVLGDLVGNVCLV FT YLDDIIVFSPSLQKHIADIKSVFTKLQNANLKIQPSKCSFLRKEIDFLGHIVTQEGVKP FT NPLKIQTI" XX CC Sequence from BDGP, August 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7650 BP; 1796 A; 1266 C; 1692 G; 2896 T; 0 other; GCGCTAGTTA CAATCCCGTG GCCGAGGGAG GAAAGCTTCC AAAAAAACGA CTGAGTAGTT 60 TGAGGTGTGA AATGAGAATG ATAATGAATT CTTTATTACA AATACGTATA ATGTATGTAT 120 CTCCCAAATA TCTTCGTCAG AGTTCGTATT GGCGATCTTC GTTGAACTTT CGCTCGGCTC 180 TCAGTTGCAG TACTGGGTAG ACCGAGAGAG GGGGTATGTT AACTCCTCCC CCCCCCCCCC 240 CCCCCAGTCC GAAGAGAAGG CCGTAATGGG AAATGGCCGG ATCGGAGCAA GGTGATTAGG 300 TGGCGTGTTG TTGGTATATG CCACGAACTC CGACTGTATT TTGAATTTAA CAGTTTTTTT 360 TTTTTTAATG GGTATCTTAA TGTTAAACAT GAGTGTTGTG TATATAAACG TTACATCTAA 420 TGCTAAAGTT ACATTTTGTT TACTTAATTT CTATCTTATA TTATTATTTG TTTCTTATGA 480 TATAACAGTT TAGGGATAAT GTTTAGTTTA CTGGCTAATG GCTATCGACT ACTTTATTTA 540 CGTAATTAAA GTGTCTAAAT TGAAGCTATT CCTACTTTAT AATAAATAAG TGTTCTTTCT 600 TGTTAATGTG GGTTTTTATG TTACTTAGTT TGATAATAAT ATATTTATTA AATAGTGCTT 660 GTCTTTCTTA TGTTCCTGCC ACTGTTTTCT TAACTCTTTT TCTAATTTTT TGTTTGTGTA 720 ACTTTTGACC TTTAGTTGTT AGGAAAGTGA CGCCTTTGTC CCTATCTACT TTGTGTAGTG 780 AAAATTTCGG TGTTATCTTG TTTCTGCGGT TTTCTTTTCT AAATATTGTA TCATTAGGCT 840 CGACTTGTAC TGGTTTCTCT CTGTCCTTGT TTAATTTAGC CAATCTTCTT TCTGCGGTCG 900 CGCTTAAATG TTCCTGTCCT TTTGGGTATA TGTCTTTTCT AAATTCGTTT AATTTTTTAA 960 TTTAATCGTG GTGATTGTCG ATAATTATTG TTTTGTTGAA TATGTGAGTT CTTCCGTGGA 1020 AGAGCTCGAA AGGGGTATAG GAAGTGCTGG AGTGTATTGC ATTGTTATAT GTTGCTACTG 1080 CTTCTGCGAG TATTTGGACG TGCTCGCGTT CTAGGCGTGC TTCCTTTCTT TTGTTCAAGA 1140 TTATCCTGTA TAGTTCGGTT AAGGTGGAGT GCAGTCCCTC GACAGAAGCA TTGCCCGTTG 1200 ATTGTTGAAA TGAAGTTGTA TGAGCTGTTA TTTGGAATTG AGTTAAGAGG TCGTTGAATA 1260 GACTGCCCGA AAATTCCGCG CCCTGGTCGA AAATTATTTT CTTTGGTGCG CCATAGTGAC 1320 TTATGAATTG TAATAGGGCA TCGGCGACCT TTATTGAATT TCTATTGCTT AGGGGGTAGG 1380 CTTGCGCTAG CTTGGTGAAT TTATCGATAA TTGTGAGGTT ACAAGTATAT TAATGAAATA 1440 TATGTCTATA TGTACGACTG CCAATGGACC CTCTGGGGCG GTTGGGTCTT GCATCAATGG 1500 TCTATGGGGA TGTCTATCGT ATTTTAGGGT TAGGCAGGTA TCACAATTTT TTATAATTTT 1560 TGTGATAGTG TCTTTTAAAT GCGGGAAGTA AATGTCCCTT TTCAAATGGT TATACGTTTC 1620 GTCTATCCCC CTATGATTGG TATTTAAGTG GTACGCACGA ATTTCGTTTT CCTGATCGTT 1680 AGGATTCGTA ATTTCGGTTA GCCATATATT GCAGCGGACA ATTTTGTATA AATCATTTTC 1740 AAAAAAATAG TTATTATAGG ACTCTTCTAT TATTTTGAAA ATATCGTCCG GTGCAAATAT 1800 AGCGACTGTT TTATTCGGTT TCATGATTGC TTGAAGAGTG TTTGTAACGG AGTCGAAAGT 1860 ATGCACGGGT TCTGTGATTA TGTGTCTTAC TTTATGGATA AATGGTGTTT CTACTTCTCT 1920 TAGCGTTGTT GATCCGGTAC AGAATATTAT TTGGCAATTG AAGTGATTTA GTGGTTGCGA 1980 AACGATTGGT ATAATTCGTG GGTCTTCGTT TATGTTTAAG TTTGGCTCAA TTCTGCTGAG 2040 TGCGTCAGCA ACTACGTTTT GCGACCCTTT CTGGTATACG ACTTCGTAGT CGTAAGCGGC 2100 CAGGATAGTT TTCCATCGAA GTAGTTTTGG ATTCTGACCT TTCAGATTTT CTAGCCAGAT 2160 TAATGGCTTA TGGTCTGTGA CTATTTTGAA TCTACGCCCA AAGAGGTGTA GTCTGAAGTG 2220 ACGTACAGAC CAAATGATAG CTAGCATCTC TTTCTCTATC GTAGAATAGC GGATTTCCGT 2280 GTTCGATAGT GTCCTGCTAG AGAAAGATAT GGGTCTGTCA TTAGTATCTG GACCTTGTGA 2340 CAGCACAGAG CCTATCGCAT AATTACTTGC GTCTGTGGTC AGCGTGAACG GTTTATTGAA 2400 ATCAGGGTAT ATGAGTATAG GATCATTACA AAGGAGATCT TTCGACGTTT CGAATGCTTG 2460 TCTAAATTCA TCGTCTATTA TTATAAATTT CTTCCCTTTT AACTGTCTCG TTATGGGTTT 2520 AGTTATTTTC GCAAAGTCCT TTATGAATTT CCTGTAATAC CCTAATAGTC CTAAAAATGA 2580 TTTAATTTCC CTGGTTGTCT TTGGACATGG GAAGTCCTAG ATGGTTTGAA TCTTAAGTGG 2640 GTTTGGTTTG ACTCCTTCCT GGGTTACGAT ATGTCCCAGG AAGTCGATTT CTTTCCTTAA 2700 GAAGCTACAC TTGCTTGGTT GGATTTTTAG GTTTGCGTTT TGGAGTTTTG TGAAAACAGA 2760 TTTAATATCT GCTATATGTT TTTGGAGTGA GGGGGAGAAT ACTATTATGT CATCGAGGTA 2820 GACGAGACAG ACGTTACCGA CTAGGTCACC CAAAACATTG TCCATAACGC GTTGGAACGT 2880 GGCAGGGGCG TTCTTGAGAC CGAAGGGCAT TCTCGTGAAC TCGTAATGCC CATTTTCGAC 2940 GGTGAATGCC GTCTTGCTCG CGTCGCGTGG GTTCATTTGA ATTTGGTGAA AGCCACTTGC 3000 TAAGTCGATT GTTGTGAAAT ACATGGTCTT TCCTATGCTG TCTAGTATAT CTGCAATGTT 3060 TGGGATAGGG TATCTATCAG AGACAGTCTT CTCGTTTAAT TTCCGAAAAT CGATTACCAA 3120 GCGCCACTTT TTTTCGCCTA TGTTGTCACT TTTCTTAGGT ACGACCCAAA CGGGTGCACT 3180 CCAGGGGGAA TGGCTGTTTT TTATGATGTT TTGTTCGAGC ATTTTTGAAA TTTGCTTTCG 3240 GACTTCGTCT TTATGTACGA ATGGGTAACG GTATGATTTT GTGTGAATCG GTATATTATC 3300 TGTTGTTGGT ATAGAGTGTT TGATGGCGTT AGAGAACGTT AGTTGGTTGT TTTCGTTGTG 3360 GAAAAGTCTT TTGAATGACT TGCACAGTTT AAGTATGTAT TGTTTTTCTT CGCTGTTGAG 3420 GTGTTCCGTC AGAATGTTTA ATTGTTGGTC GTCTTGTCGT TCTGTATCTG TTGCCGCGGT 3480 CATGCAGTTT AAGTATTGGT GGGTTTGCGA AGAGAACTCT TCTGCCTGTA ACGGTTCTTC 3540 GAGGTAAAGT GTTTGGATCT GGTCTGAGTT ATTGCAGACC TCGAAATAAG CTGATCCTGC 3600 GTTTGCGGTG TTTAAACCGC CTGTTATGGA TACTTGATTA TCCAAATTGG TGGTTTCATA 3660 TATAAAATCT CCCTCCATTA AGTTTACTGG GAGCGGAAGT AGCATTTTGG TATTTTGTGG 3720 CAGGTTATAT ACTGTGTCTA CAGGGTTTGC TCTTGTGAGG ATTGGGATGG CGGTTTCTGG 3780 TGTTTCTAGT ATGGATTCGG CAAGGTTTAT TTTTGCATTT AGGCTAGATA GGATATCCAT 3840 GCCTAGGAGG CCATTGAAGT GGGAGTGGAA ATCAAAATAA TAAGAACTTA AAATTGGGTA 3900 TTTTTTCGAA TTGTTTGAAA TTTATTTTAT CGGTATATTC CCGAATTTTG AATTTATTAA 3960 GAACAGTGTG GATATTGATA TCCGAGGTTA ATTTTTTTGT CGTGCTCTTC GTTAATAAGT 4020 GCTGGATTTA TGAAAGAGTG AGTAGATCCA GTGTCTATTA GGAATTTGAG AGGGTTTGAC 4080 ACCATGGAGT TTGGTGATAA GAAAACAAAT GGCAAGGAAG ATCCGTGAGT CTTCTTGCTT 4140 ATGTGGCCGA CTGGCTTCCC GAGGCTTCTA GTAGAAAATT CGCATCATCG TTAATGTTAC 4200 ATAGGGGAGC GCTTGGGTTA TGCTTGTTCC TGTTCTGGTG ACTAAGGGTC TGATCTGGTT 4260 GTTCATTATT GTGGAAGGGG TTCCTGCCCA GGGTTTCTCT GTCTTGTCGA CCATTACTAT 4320 CGTTATGGGA ATTATGTTGA CGTGTGAAGC GTCTATTATT ATTGTTGTTA AAATTTTGCT 4380 GTCTCTGTAT AAAGCGGCTA TCATTGTTGT TGCTGTTGTT ATCATATCGT GGTACATGAC 4440 GAGCGTTGTT GCCATTATTA TAGTTCTGTT GTCGTGGTAT GTAGCGATTG TTGTAGTCGC 4500 TATTGGTTGG TTGTCCTTTT GCGTATGGGT GATTCCTGTG AAATGGGTTG TCGCGTCGTC 4560 GGTTATCTTG GTGTTTGTTC AGGCTTCTAT GGAAATTTAA TTCAATCTGA CCGTACTCGT 4620 ATGCTTTCTC GATTGTCTCT GGCCTTTGAT TCCTAATGGC TGAACTAAGT GGGTCCTTTA 4680 ATCCAGTTAG GAATGTATTT AGGCAGAATC TATCGTATTG GTCGCGTTTG ACCGCGAGAG 4740 AGTGTGCACT AGGGTCGGCT TCTCTAGCGA GTGACATTAA ATCGCTTCTT ATTTTAGCGG 4800 CTGAGTAATA CAATTTTCCC ATACTGAGTC CGTCCGGAAG GTTGTGTAGC TCTCTAACAA 4860 GCATTTCTTC TGTTTTCTTG CTAGAGTATA GTCGCTTGAG GTTTATTTTG ATGTCATCCC 4920 AAGTAAGGAG TACATCGCAG ACATTCAGTG CTTCATCTGC CCTGCCTTTA ATTTTGTCAC 4980 GGATAGTTCC AAGTAAGTAC TTTCCGCCTG TTGTTTGGTC CACTGATGAG GCTAACAAAA 5040 GGAGCTGATC TACTCTGTCT ATGAATCTAT CAAGATTGTC AGGTGGGCCT TCGTAGGTTG 5100 GTAGTTGATG TACAAGGGTT AGTAATTGTT GTAGGCTCAG GTTTGTGTTT CTTGATGAAT 5160 TAACGATAGG ACTTATTGCC GCTGTGTCGT TACCGGAATC CATAGTGTTG GTGGGATTAT 5220 TGTCTTCGGA TTGAACCGAC TGGTCTTTTT CTTTAGGTTC TGTCGCTTCC CTTATCTCAG 5280 ATAGACCAGA TTCGCTTCTT AGCCCTAGTC TGCCCCTAAC TGTATTACTA ATGTTGGCTT 5340 TTTTTCCTAA CATTTACACG TATTGGGAAG AAATCTGTAT TTTTTTTTTG CAAATAATAG 5400 TGAATAATTA TATATTGGGG TATAGTTATG ATATGTGTAA TAATATGTAA AAATAGGTAT 5460 ATTAATAATA TCTTTTTGTT TAGGGAAAAT TTTGGGTGTT AGGCTTGTTT TGTTTTCGTT 5520 TGTATATTAA TGTAGTAATA TATTGTATTA TATTCTGTGA TGCTATGTAT ATGTATCTGT 5580 TTGAATGTGT ATTATATGTG TGTATTCTTT TCAAAATAAA AGAAAACCTG TATGTGTGTG 5640 TCCGTTACGC AGTTTCTTCT TATCCTTGTT TCCCGGTGCT TAGTATATGT CGTTCTTGTA 5700 GTGCTGCTGT GCTTCTATCC ACCTTTGTAG TTTTTTTTTT CGTGTTTTCG ACTTTAGCTG 5760 TAATACATGA TTTAAATAGC TATTGAATAG AGTGTATTTT TTATAGGTCT GTGAACATGT 5820 GACAACAGTA GGCGTTAAGT TGGTTTTTTT TTTTCTTTTT TTTTCTATAC ATATATGCAT 5880 TTTTTTTTTT GAGTGGGTTT TTTTTTAATT GAGTGTTTTT TTTTGTAGTG TATATACTGT 5940 ATATACTGGC GAAGCGTAGG AGTCATAATG TCTGCAAATC TAATTGAGGT TCATACGGCC 6000 GGACAGGGGT ACAAAGGGAC ATACGTTACA GGGTCTGAAA TGCTTTCTGT TATATACTTT 6060 TAACTTTCGC ACTCTGGAGC GACATCAGCT TTTGGCGGCA GCCAATGTGC AGTGATGCGT 6120 CGACGTCGCT AGGGTTATAC AAGCGGCGCA ACACAAACAC ATACATGCTT ACTTGATTTT 6180 GCCGAAGGAA ACTGTATCAC TGTGCAAATT TTTTTTTTTA TTCTTCCTTT TTTTCTTTTT 6240 TTTGTCTATT TTTTTTTATT TATTAGTATT ATTATGATTG TTATTATTAT TATTTTTTTT 6300 TCATGTAAAT GGATAAGTGA AAACGATACC GGTGCCTTAA CAGCGATTGT TGTACACATA 6360 TTCGGTCGTT AATCACACAA CTCACATTTA TTTTTATAAC CCACTGTGAA TTAGATCTCG 6420 TTGCCTTGGA GGCGACTGAT GTTCAAATTC ACCTCCTGCA CAGCGAGTTC TGTATTTGTT 6480 AAAAAAAATT TGTGTTCATT TAAATTTTTA AGTGAAAACG ATCCCGGTGC CTTATCAGCG 6540 ATTGATGTGC ACATATTCGG TCGTTAATCA CACAACTTAC ATTTATTTAT TTATTATTAT 6600 ATTTATTTCT TTTTTTTTTT TAAAAACACT TGTGTAGGAG GTCCCGTTGT CTCGGAGGCG 6660 ATCGGTGTCC AAAATTTATT CTTTGCGGAG CGAGTTCTGG TCACAACGTA GTCTAATATT 6720 CGACTGATCT CGTTCTTTAC CATTCTCTTT TCTGTTGTGC ACAACGACCA ACACTCAGCG 6780 GCAGCGACGT CGCTCTGTCG CCGTGCCAAA CGGTTGTTCT CAGAACTATA GAGACACAAC 6840 AACCGTTCGC AGTGCGAATA TAGGCTGATG TAATATTACC ATGGTGATGA AATTACCATT 6900 CTTATGAAGT AACCAAAGAG CAACCAATAC TCTCGCACAC ACCAAGCGCT CTCCGACACA 6960 CAAATGGTAA CCATACACAG TAGTTTAACA TGTCAGAAAA AATTTTTCCT CTTTTATTTT 7020 TTTTTTTTTT ATTTATTTAT TTTTTTTTTT TGTTGATTAA TTTACCATTT TTATGAGTTA 7080 GGGTGTATGG ATGATCCCGG TGCCTTATCA GCGATTGATG TACACATATA CGGTCATCAC 7140 ACACAGTAAC TTGTTGGTAT AATTGTTGTA AATTGTCCCG CACAAGCGAT CCCGGTGCCT 7200 TATCAGCGTT TGATGTACAC ATATACGGTC GCCTGTGAAG CAATTTGAAC TTGATTTCGT 7260 TTGTTTCCTT TGTTTTACTT GCACTTGCAC GTTATTATGT TTCAATACGC TTTATCCAAT 7320 TCCTTTCTAT TTCACCACTC ACTAATAAAA TCCCCACAAC CACTGCCGTC ACATGTTCAA 7380 TACCTCTTTA AGCGTAGTGG GCATTGTTAA TCCCTTGTCC GTTGACCGGC TGCGCCAGTT 7440 ACAATCCCGT GGCCGAGGGA GGAAAGCTTC CAAAAAAACG ACTGAGTAGT TTGAGGTGTG 7500 AAATGAGAAT GATAATGAAT TCTTTATTAC AAATATGTAT AATGTATCTC CCAAATATCT 7560 TCGTCAGAGT TCGTATTGGC GATCTTCGTT GAACTTTCGC TCGGCTCTCA GTTGCAGTAC 7620 TGGGTATACC GAGAGAGGGG GTATGTTAAC 7650 // ID McCLINTOCK standard; DNA; INV; 6450 BP. XX AC AF541948; XX DR FLYBASE; FBgn0063917; McClintock. XX FT source AF541948:1..6450 FT SO_feature five_prime_LTR ; SO:0000425:1..498 FT SO_feature three_prime_LTR ; SO:0000426:5953..6450 FT SO_feature CDS ; SO:0000316:1279..2562 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0063916; McClintock\gag" FT /protein_id="AAN34650.1" FT /translation="MAVAAPSPIILSDSNMIQVERQINGVEQFNGDPQTLYTFISRIDF FT ILALYQTTDERQKLTIFGHIERNIAGEVIRTLGVTNLTTWTELRTHLILNYKPQRPNHL FT LLEDFRNTQFRGNVREFLEEAERRRQILTNKLDLESDTAETTLYNQLIRTSIETLILRL FT PIHIQLRIVKCEIPNLRSLINILQEKGIYEIATTYKNNTKPVSNPIKSPNNATHRQTTN FT YYNHTTPFQPSYNAMYQPIRQPISYIPPQLPRTNPNPFSQYQYRQLHPQPNVSVIAQPR FT PLNRQHTFDQNRPGLSYSNALNTRENITTGGPALKRQRPSDSGQSRMSFDEAHYQEELD FT QTHQQFNPYMHYGNHPQYPFDLYMPYGPPPNNIPIYYMPYDPPQQHPVGIQEMAEAREE FT PTEPPMELITEAQTAENFRPQASEQANS" FT SO_feature CDS ; SO:0000316:2622..5642 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0063915; McClintock\pol" FT /protein_id="AAN34651.1" FT /translation="MATQNFFKLPIQEIAKKINSSNGQFIAQKKVTLPKNNLFPKPYDF FT FIYPFSTKYDLILGRQLLDEGTTSVNYGPRTVTIYGHVHEMIDAFLPSEEIHIQDTQNN FT SFRLDHLNSEEKAKLITLLKEFQDLQYKKGDQLTFTNNVKHTIRTSHNDPVYRKPYGYA FT PGLDTEVENQIKEMLDQGIIRESNSPYCSPIVVVPKKPDISGQKKYRIAIDYRYLNEIT FT IADKYPIPNMDEILSKLGGCNYFTTIDLIKGFHQIEMDPESIPKTAFTTKTGHYEYTRM FT PFGLKNAPATFQRCMNDVLRPLLNKICMVYLDDIIVFSASLEEHLQSLRAVFQALSNAN FT LKLQLDKCEFLKHDTYFLGHMVSPEGIRPNPEKLRAIENYPLPTKPKEIKSFLGLTGFY FT RKFIPHFAQIAKPLTTCLKKDKKVDIKNPEYIEAFKRLKLLISNDPILRSPDFKKKFVL FT TTDASNVALGAVLSQDGHPISYISRTLNDHETNYSTIEKELLAIVWATKTFRHYLLGRH FT FEIASDHQPLCWLHKMKEPNAKLTRWKFRLAEYDFDIKYVKGKENHVADALSRITIEEA FT FFTEATQHSAQEDNQNLISLTEKAVNNYNRQVIFTKGPEKVKQENYYKKKIIHISYETL FT THKKAKQYLIDYFVNNHSALYIDSDADFETIQAAHKEIINPSTTKVIRSLTLLKNIKSY FT AEFKELILQSHEKLLHPGIQKTKKLFSENYFFPNSQLQIQNIINECQVCNLAKSEHRNT FT KVPFKLTPSIEFCRDKFVIDIYSVEGKHYLSCIDTYSKFATLEQIKTKDWIECKNALMR FT IFNQLGKPKLLKADRDGAFSSLALKEWLETEGVELQLNTTKTGVADIERFHKTINEKIR FT IINTMKNTETDLSKIETILYTYNHKTKHDTTGQTPAHIFLYAGQPTLNTQELKRTKIDK FT LNKGREDHDIDTRYRKGPLQKGKLDNPYKPTKNVEQTDADHYKITNRNRAMHYYKTQFK FT KRKKINQTHTPTQMATS" XX CC Sequence from BDGP, August 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6450 BP; 2555 A; 1350 C; 952 G; 1593 T; 0 other; AGTGACGTAT TCACGCCAAC TCATCACGCT CAGTTAATTG AATATTCGCG CCACCCAAAC 60 TGGGCCGCTC TTGTAAACGG ATTCCCAAGC TCTCCTCAAC GTAAACATCC AAGAACATCA 120 CATGATTTTA CCGTTTAAAG CTATTTTCTG CGTTATCTGC GTGTCGATCG TTATTACTTC 180 ACTTCACAAC CTTGGGAGAT ATTCCGTTTT TTTCCTATTC TTTATTTCTG TAACCCATAG 240 AGGCCCCTCC TCAATTCTCA CTCACATTTG TAATCAGTCT TAAGTTGAAA GTTAGACCGT 300 AAAGTTCAAT AAGTCTTAAA CCGATATTTA ATAAAAAACA AAACCGAATT TATTCAGTGT 360 TTAATTTATT TCGGAGTTGA TCCTAGTTTA AATACGGATC ATTTGTTCGA CTTAAAAAAA 420 TAAAAACTAT TTAACTTTCA GCGTTGATCT TAGTTTAAAT ACGGATCATT TGTTCGGCTC 480 AATTAAAAAC TATTTAACTG GCGCAGTCGG TAGGATAACT CAAAGATTGC TTGATGCGAA 540 AATATAAAAT TTTTAATTCC TGTAATCTGT GATATCCGCC AAAGAACTTA CGTAGTGTTA 600 CTGTGAAAGT GAAGTGTTGA CCATTAATAC GCCTACACTT AAAAGAACAG TGAGACACAT 660 CTGTTTAAAC AACAGTGAAC TTCCAACATT TACCGGAGCT ATAAATCCAG AATTTTTTTT 720 TGGTTACTCC GAACAAGTGA ATTGCTGGAA AATAAAAAAT AAAAAAAAAA GAACCTAAAA 780 CTAAATATAT TAAGGAATTA CCAAATAAGT GAAAAAATTA AAATCTAAGT GAAAGAAAAC 840 ATAAGGAAGT AAACCCAAAT CAATCAGTTA AAAAAAAAAA TCTCTAATGC GAACTATTAA 900 ATAAGTGCAT ATAAAACAAA ACTATATACA CTTAAAGAAA ATATATATCT AAATAAACTC 960 TATCTAAGCT CAACGGAAAA TAACAAAAGA AATAAATCTT AACGTGTTAT CAAGAGAGGA 1020 GAGTTATAGA CTTCAAACGT AACATAAGTG AAAAATCAAT AGGAGTGAAT AAAATTTAAA 1080 TTAAGACAAA TAAAAAACTG AAGATATAAA GTTTGAATAA ATTAAAAATA AACTAAAACT 1140 ATACAATACA AAAATAAAAA TAACAATAAT AAAATTATGA AATAACCGCT ACTCACGAAA 1200 CATCAGCAAC TATAAACATT TAAAAACAAA AAAAAAAGAA ACCAACAATT TTGTACTCTA 1260 ATAAAAAAAA AAAAAAAAAT GGCAGTAGCA GCACCAAGCC CAATAATACT GTCCGACTCG 1320 AACATGATTC AAGTTGAACG ACAAATAAAT GGAGTCGAAC AATTCAACGG GGACCCACAG 1380 ACCCTGTATA CGTTCATCAG CCGCATCGAT TTCATCCTGG CACTATATCA AACGACCGAC 1440 GAACGGCAGA AGCTCACCAT CTTCGGACAC ATTGAGCGCA ACATCGCGGG AGAGGTCATC 1500 CGCACACTAG GAGTTACGAA CCTCACCACC TGGACGGAAC TCAGGACTCA TCTGATCCTA 1560 AACTACAAAC CCCAGAGACC AAATCATCTA CTATTGGAGG ACTTCCGGAA TACCCAATTC 1620 CGAGGTAACG TTCGTGAATT TTTAGAAGAA GCCGAACGTA GACGGCAGAT ATTAACAAAT 1680 AAGTTAGACT TAGAAAGCGA CACTGCAGAA ACTACCCTGT ATAACCAACT AATACGAACC 1740 AGTATAGAAA CATTAATTCT AAGATTACCA ATTCACATAC AGTTAAGAAT AGTTAAATGC 1800 GAAATTCCGA ATTTAAGATC ATTAATAAAT ATATTGCAAG AAAAGGGAAT TTATGAAATA 1860 GCAACTACAT ACAAGAACAA TACAAAGCCA GTCTCAAATC CCATTAAATC ACCTAACAAT 1920 GCAACTCACA GACAAACAAC TAACTACTAT AATCATACAA CACCATTCCA ACCATCATAC 1980 AACGCCATGT ATCAACCAAT TCGCCAACCA ATTTCATATA TACCACCTCA ATTACCCAGA 2040 ACTAACCCTA ATCCGTTTTC CCAATACCAA TACCGTCAAC TTCACCCTCA ACCTAACGTA 2100 TCTGTTATAG CTCAACCTAG ACCATTGAAT CGTCAACATA CATTTGACCA GAACCGACCA 2160 GGACTTAGTT ATTCAAACGC ACTAAATACA AGAGAAAATA TAACGACCGG TGGACCAGCA 2220 CTAAAGAGAC AACGACCGTC TGACAGTGGA CAATCACGTA TGAGTTTTGA TGAGGCTCAT 2280 TACCAAGAAG AATTAGACCA AACTCATCAA CAGTTCAATC CTTACATGCA TTACGGGAAT 2340 CATCCCCAAT ATCCTTTCGA TCTTTACATG CCTTACGGGC CGCCCCCCAA CAACATCCCG 2400 ATTTATTACA TGCCTTACGA CCCCCCACAA CAACATCCGG TTGGCATACA AGAGATGGCA 2460 GAGGCACGAG AAGAGCCAAC TGAACCTCCT ATGGAATTAA TAACAGAAGC TCAAACGGCT 2520 GAGAATTTTC GGCCCCAAGC CTCGGAACAA GCCAATTCAT AATTATTAAA CACAAAGGAC 2580 TCAATTTAAA ATGCTTAATA GACACAGGCT CAACAGTAAA TATGGCCACT CAAAATTTTT 2640 TCAAATTACC AATCCAAGAG ATAGCAAAGA AAATAAACTC AAGTAATGGC CAGTTCATTG 2700 CACAAAAGAA AGTAACGTTA CCCAAAAATA ATCTTTTCCC TAAACCATAC GATTTTTTCA 2760 TATATCCTTT TTCAACTAAA TACGACTTGA TTTTAGGACG ACAATTACTT GACGAGGGAA 2820 CTACATCAGT AAATTACGGA CCCCGTACAG TTACAATATA CGGACACGTG CACGAAATGA 2880 TCGATGCCTT CCTTCCTTCA GAAGAAATAC ATATTCAAGA TACACAGAAT AATTCCTTTA 2940 GGCTAGACCA CTTAAATTCC GAAGAAAAAG CAAAATTGAT AACCCTCCTA AAAGAATTCC 3000 AGGACCTTCA ATATAAGAAG GGAGACCAGC TCACATTCAC TAATAACGTA AAACACACTA 3060 TTAGAACATC CCATAATGAC CCAGTATACA GAAAGCCTTA CGGATACGCA CCCGGACTAG 3120 ATACCGAAGT AGAAAACCAA ATAAAAGAAA TGTTAGATCA GGGAATAATC CGAGAAAGCA 3180 ATTCCCCTTA TTGTAGCCCC ATTGTAGTAG TACCCAAGAA ACCAGACATC TCCGGACAGA 3240 AGAAATACAG AATAGCCATA GACTACCGTT ACCTCAATGA AATAACAATA GCAGACAAAT 3300 ACCCAATACC AAATATGGAC GAAATCTTAA GCAAGTTAGG AGGCTGCAAC TATTTTACTA 3360 CAATTGACTT AATCAAAGGG TTTCACCAAA TAGAAATGGA CCCCGAGTCT ATCCCCAAAA 3420 CAGCCTTCAC AACTAAGACA GGGCATTACG AGTATACGCG TATGCCATTT GGACTGAAAA 3480 ACGCCCCAGC TACCTTCCAA CGATGTATGA ACGATGTACT TCGTCCACTA TTAAATAAAA 3540 TCTGTATGGT ATACTTGGAC GACATTATTG TATTTTCAGC TTCCTTAGAG GAACATCTTC 3600 AATCCCTTAG AGCAGTCTTT CAAGCATTAT CTAATGCTAA TCTAAAACTC CAATTAGATA 3660 AATGCGAGTT TTTAAAACAC GACACATATT TCTTAGGACA TATGGTTTCT CCAGAAGGTA 3720 TAAGACCTAA CCCGGAAAAA CTACGAGCAA TAGAAAATTA CCCCCTTCCT ACTAAGCCGA 3780 AGGAGATAAA ATCATTTTTA GGACTCACAG GTTTTTATAG AAAGTTCATA CCCCATTTCG 3840 CACAGATTGC AAAACCCCTA ACAACATGTC TTAAAAAGGA CAAAAAAGTA GATATTAAAA 3900 ACCCGGAATA TATTGAGGCA TTCAAAAGGT TAAAACTCCT TATTTCAAAC GATCCCATAC 3960 TTCGATCACC AGACTTCAAG AAAAAATTTG TACTCACAAC AGACGCTAGT AATGTAGCTC 4020 TAGGAGCAGT ACTTTCTCAA GATGGTCACC CCATAAGTTA TATTAGCAGA ACACTTAACG 4080 ACCATGAAAC AAACTACAGT ACGATTGAAA AAGAACTACT AGCTATTGTT TGGGCAACGA 4140 AAACGTTTAG ACACTATTTG CTAGGTCGAC ATTTTGAAAT AGCAAGTGAT CACCAACCGT 4200 TGTGCTGGTT GCACAAGATG AAGGAGCCCA ACGCTAAATT AACAAGGTGG AAATTCAGAC 4260 TTGCAGAATA CGACTTCGAT ATTAAATACG TCAAAGGCAA AGAAAATCAT GTAGCAGATG 4320 CCCTATCCAG GATTACTATA GAGGAAGCGT TCTTTACTGA AGCTACACAG CACAGCGCCC 4380 AAGAAGACAA CCAGAATTTA ATTTCCCTAA CAGAAAAAGC GGTAAATAAT TACAATAGAC 4440 AAGTCATTTT CACTAAGGGA CCAGAAAAAG TAAAACAGGA GAATTATTAT AAGAAAAAGA 4500 TCATTCATAT TTCGTACGAA ACACTCACTC ATAAAAAAGC CAAACAGTAT TTGATAGATT 4560 ACTTCGTAAA CAACCACAGC GCTTTATACA TAGACAGCGA CGCTGATTTT GAAACAATAC 4620 AGGCAGCCCA TAAAGAAATT ATCAACCCAA GCACCACAAA GGTAATTAGA AGCCTAACTT 4680 TACTAAAAAA CATTAAATCA TATGCAGAAT TTAAGGAATT AATCCTTCAA TCCCATGAAA 4740 AGTTATTGCA CCCAGGAATC CAGAAAACTA AGAAATTATT TAGTGAAAAC TACTTTTTTC 4800 CAAATAGCCA ACTACAAATC CAAAACATTA TAAACGAATG TCAAGTTTGT AATCTAGCAA 4860 AGTCAGAGCA TAGAAATACC AAAGTCCCAT TTAAACTCAC GCCAAGTATA GAATTCTGTA 4920 GAGACAAATT CGTTATAGAT ATTTATTCAG TCGAAGGCAA ACACTATTTG AGCTGCATAG 4980 ACACTTATTC AAAATTCGCT ACACTAGAGC AGATAAAAAC CAAGGACTGG ATAGAATGCA 5040 AGAACGCCCT GATGCGCATA TTTAATCAAT TGGGAAAACC GAAGCTATTA AAAGCAGACA 5100 GAGATGGTGC ATTTTCAAGT CTAGCATTAA AAGAATGGTT AGAAACAGAA GGAGTAGAAC 5160 TACAGCTAAA TACCACAAAA ACAGGAGTAG CAGATATAGA ACGTTTTCAT AAGACCATTA 5220 ACGAAAAAAT AAGAATAATC AATACTATGA AAAATACTGA AACAGACCTG AGTAAAATAG 5280 AAACTATACT TTATACTTAC AATCACAAAA CAAAACATGA CACCACGGGA CAAACGCCCG 5340 CTCACATATT CTTATACGCA GGACAACCTA CCTTGAACAC GCAGGAACTA AAGAGAACTA 5400 AAATAGACAA ATTAAACAAA GGTAGAGAAG ACCACGATAT CGACACAAGA TACAGAAAAG 5460 GACCATTACA AAAAGGCAAA TTGGACAACC CATATAAACC AACCAAAAAC GTAGAACAAA 5520 CAGACGCTGA CCATTACAAA ATTACTAATA GAAACAGAGC TATGCATTAC TACAAAACAC 5580 AATTCAAAAA ACGAAAGAAA ATTAATCAAA CCCATACCCC GACTCAAATG GCTACCAGTT 5640 AGACTACACA GATTCACAAT TATATTTTGA GAAAAGAAAA CAAAGTGTAT AGTAATGAAA 5700 ATAAAGAAAT AAACAATGAA TGTGTCACCA ATATAATTAA GCACCTAAAT CCAATCTGTA 5760 ATTTTAAGCC AATACCCACA AATGAATTAA TGAAATATAT AGAATAATGA TGTATACAAA 5820 ACTAAAATTA AGAAAAAACA AAAAAAAAAA AATCAGGACA TACCACCACA AACTGGAATG 5880 GAAGAAGTTC CACTACCCTT ACTATATCCA TCAGTCCCAG CCCAAGTATA GGCTTATCTT 5940 TAAGGGAAGG GAAGTGACGT ATTCACGCCA ACTCATCACG CTCAGTTAAT TGAATATTCG 6000 CGCCACCCAA ACTGGGCCGC TCTTGTAAAC GGATTCCCAA GCTCTCCTCA ACGTAAACAT 6060 CCAAGAACAT CACATGATTT TACCGTTTAA AGCTATTTTC TGCGTTATCT GCGTGTCGAT 6120 CGTTATTACT TCACTTCACA ACCTTGGGAG ATATTCCGTT TTTTTCCTAT TCTTTATTTC 6180 TGTAACCCAT AGAGGCCCCT CCTCAATTCT CACTCACATT TGTAATCAGT CTTAAGTTGA 6240 AAGTTAGACC GTAAAGTTCA ATAAGTCTTA AACCGATATT TAATAAAAAA CAAAACCGAA 6300 TTTATTCAGT GTTTAATTTA TTTCGGAGTT GATCCTAGTT TAAATACGGA TCATTTGTTC 6360 GACTTAAAAA AATAAAAACT ATTTAACTTT CAGCGTTGAT CTTAGTTTAA ATACGGATCA 6420 TTTGTTCGGC TCAATTAAAA ACTATTTAAC 6450 // ID STALKER4 standard; DNA; INV; 7359 BP. XX AC AF541949; XX DR FLYBASE; FBgn0063897; Stalker4. XX FT source Release3na_arms:3L_21292630.. FT SO_feature five_prime_LTR ; SO:0000425:1..403 FT SO_feature three_prime_LTR ; SO:0000426:6957..7359 XX CC Sequence from BDGP, August 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7359 BP; 2888 A; 1340 C; 1226 G; 1905 T; 0 other; TGTAGCATAT TGGACTAATC TACCCTAAGA ATACAATAGA TGATTGGGTA TAACATAGCG 60 TCAATACATT GTGACACTTT GTCATAATAA ATATAAATAT ACAAATATAC AAAAAGACCA 120 CCAAAAACTA CGTAAGCACT CCAGCGCCCC AGTAATACGA TCTAACGCTT ATACATAAGC 180 CGATCGCGGA GCGTGGGAAT GCTGAGCATG CACTTTGCAG CTCAAGTGGT CAATGCCTTC 240 TGCATGCATA TGTATATGTA TAAATGTAAG TAAGAATACA TAGATATAAG CAATGTATGT 300 GCGGGTTAGC TGAACCCAAC TTCAGCACAC TTTGATCATT CGAATAAACA GATTCAAACA 360 GAGCAGAGGT TCTGAGCTCG GAAACCAAAT CTTTTACATC TATACCTGTT ATATTTTTAA 420 AACAATTACA TGGCGACCGT GACAGGACAC GCGTGCATGA AGTTAACGAT TATAATAAAA 480 GCATGAAGTG TCCGTGGCCA AAAGAAATAA AATGAATAAA TTCAAACAAA ATTCAAATTC 540 GTGGCCAAAG AAATAAAAAC AATTACATGG CGACCGTGAC AGGACATTAC ATGGATAAAT 600 GCATTACATA CATAGAAATG CAGAAAGGAC AAATCTTCTT AAAAAACTTT TGAACATAAA 660 TGGCAGAGGA GCCACAATTC TCGAAGGCAG TACCCACTAA GCAAAGGGAC TTGCCAACTC 720 TGGAAGAGGC CCTACAGGAG AATCCAGCTA ATGCACCACG CCCACTCACG ATAAAAGAGT 780 ACAGAGCCAG GCAGCAGAAA AAACCGCAAA ATAAACATAA AAGAAGCGGT CAAAGAATTA 840 AGCTACTTCA GCAACGGCGA CTGGTCAAGG ACATGACCAA GTCGGCAAAA GACGAGGAAT 900 CCCGACAACG CTACATAGTG CGTCTTCAAG AAATAGAGAC TAAACTTCGC AAAGGTGCGA 960 AACAACGCAA ACGGGCTGCA TAAATGCCAA TGCCCCAATT TGCCAAAACT TCAATCAGAT 1020 GCATCCCGAC AACGCTACAA AGAGCGTCTT CAAACACAGG AAATAAGTTT TGCAAAGGTG 1080 CGAAACAACG CAAACGGGCT GCATAAATGC CAATGCCCCA ATTTGCTTTA GATTTAATTT 1140 CTACCCCAAG CCGATGCGGG AAACCGCTGC TTGGAAAATA CTAAAATCTG TTAGGCTTTT 1200 TCACTAACAA TGTGGTTGGA GAATTTTTTT TTTATTATTG TAAACGACTA TGTGAGCCAA 1260 CCACATATAT TAACCATTAA TATTTCCACG TCTCTGGCAC TGAGTATACA TATATACTCA 1320 GCTGCAAAAT GTTATTTGTG TAAAATGACA ACAATGAAAA AAGTTCTTAT TTTGACTAAT 1380 ATAAAAGAAA ATATTAATTT CATTTTTCCG ATTTTCAAGA AGAAAATATT TTTCCTTCCT 1440 TTAAGCTCCA AATAGAATAT CTTATTTTTT TTTCCTTTTT TAAAGATCCG TTTCATTGAA 1500 TATAGATAAA ATGGGATGGT TTAGCGATTC TAGTGAGGCA AAGGACAATA CTGCCAACGT 1560 AGTTAATAAC GTAAAAATTA TAGATCACAC AGACGATATA AATGCGTTGT GGATCTTATT 1620 GCTGATCATT ACAATAGTAC TACTTCTACA ATTTCTGCTT ACAATTTATG TTAAGCATAA 1680 CAAGATCATC AAAAGACGTT ATATAAATAG GGCAAATCGT TTAGACCAGA TTTAAAAAAA 1740 AAATAATATG GATTAGAAGA AAGCTTAATA AAAACTTTTT TTTTACGACA AGAATGGAAT 1800 GGAGCGAAAT AGCGATACGA ATAGACGAAT TTCGCTTCAG GTTCGATAAG TCTTATAAAT 1860 GTATCAATAG AGACGCAGTA ATAAAATCCG AAACTTTGAA AAATCATATA GAGACATTAG 1920 TAGGAGAATA TAATAATATA GTTACATTAG TAAATAAATA TGCAAATAGG CTCACATCTG 1980 AACATAATAA CAAATGTTTG AGGGTTATAA AATCCCTAAA CACAAGATTA AATAACATCA 2040 GAAAAAGAAG GCATATTCTG ATAGATGTAC CAGAAAGTCT AAGTCAATTG GTTGAATTCA 2100 ACACAGACCA GTTCAAAGAA CTAGACGAAT CTGTTCAATC AAGCGGCGCT GAGTCCGATA 2160 GTGACATTGA AACGCTAGAA GGAAGCGACC GAATTGAATT TAAATCTGAA CCAATAAAAA 2220 TTTCTGAGAT GGCACAGACA TTGATAGAAT TTATCAGGCT AGCCACATCT CTGATACCAG 2280 AGTTTGATGG TAAACCAGAA AATCTACAAA GTTTTTTGGA TGCTCTAGGT CTACTAGATA 2340 GCTTAAAGAG CACACATGAA ACGACAGCAG TAAGCCTAAT AAAAACTAAA CTTAAAGGCC 2400 ATGTAAGAAA CCTTATAAGT AATGAGCAGA CGATTGCTGC AATCATTACC CAACTGTCAA 2460 GTGCAGTAAA AGGAGAATCG GTAGAAGTGA TATCTGCCAA GCTTCTGAAT CTACAACAGA 2520 GAAATAAAAC GGCTAACCAA TACACCCAAG AGGTGGAGAA ACTGACAAAG GCCCTTGAAG 2580 GTGCCTATAT CAGTGAAGGT CTCAGCCAGT CCTTAGCAAA TAAATACAGC ACTACAACAG 2640 CTGTAAAAGC AATGACACAG AATTGCTCCA TTGATAAGGT AAAACTTATC ATGCAAGCAG 2700 GCACATTCAC AAACATGAAT GATGCCATCT CCAAATTTGT AAACAGTTGC ACAGAGATAA 2760 CAGGTCAAAG TAACACTGTA CTCTATTATC GACGAGGTGC AAATAATTAT AATAGAGGCG 2820 CCCGGGGTTA TAATCGTGGT AGAAATATCA ACCACAACAA TTACAACCGA GGTAGCAATA 2880 ACAACAATAA TAATAACTAT AATAACCGTG GAGGTAGGCG AGGCCAAAAC CAAGGGAGAG 2940 GCCGCGGAAA CTACAACCAT GGTAATAATA ATAATAGCAG TGTGAGAATC GCGCAAAATA 3000 CGTCGGAAAA CTAACAGAAC CCTTTAGGAA ACAACCAATA AATGTAAAAG TTCATTCCAT 3060 CAATTATAGT CTTAATATAT TCGTAACCTT CTATAATCAT TCCACTGAAA ATAAACTAAC 3120 ATTTCTCATA GATACTGGTG CAGATATCTC ACTTTTGAAA GTAAATTCTG ATAACTTCGT 3180 AATTCAAAAT GAAAAAATAA TAAACATCGA AGGCATAGGC CAAGGTGTGA TAAAGTCTCA 3240 AGGAACAACC TTAATAGAAC TCCAATCAAC AAAATATATT ATCCCACATG AATTTCATTT 3300 GGTAAACCCA AATTTTGCAA TACCATGTGA TGGAATAATA GGCATTGATT TTATAAAGAA 3360 ATTCAATTGT CAACTAGATT TCAAACCAAG TGAAGACTGG TTTATAATTA GACCCCAAAA 3420 TTTAAATTAT CCAATATATG TCCCGATAAC ATATAGCGCT GGCAACAATA CAGTTCTTCT 3480 GCCAGCCAGA TCACAAGTTA TTCGGAAAAT AGACATTAAT GTTGTAAATG ATTTCATATT 3540 TGTTCCTAAT CAAGAAATAC ACAATGGGAT TTATGTTGCA AATACAATAG CAGCATCCAA 3600 ACATGTATAC GTTCGACTTC TAAATACAAC TAATTTCGAC CAAGTGGTCA AAGTAAATAA 3660 AATACAATAT GAAAATCTAA AAGATTATGA CATTCATAAT ACCGACACTG GAAATAGAAG 3720 CGAACAAATA CTTTCAAAAC TAAAGAAAAA TTTTCCAGAC CAATTTAAAA ATCAATTAAC 3780 AGAATTATGC ACACAGTATA GTGATGTGTT CGGACTGGAA ACCGAACCTA TATCAACAAA 3840 TAATTTTTAT AAACAAACAT TAAGACTTAA AGATGATGAA CCCATTTATA TAAAAAACTA 3900 TAGAAGCCCG CATAGCCATA TTGAGGAAAT TCAAAAACAA GTAGGGAAAT TAATAAGCGA 3960 CAAAATCGTC GAACCGTCTG TATCTGAGTA TAACAGCCCA CTCTTGCTAG TTCCAAAAAA 4020 ATCATTACCA AATTCACAAG AGAAAAAATG GCGATTAGTA ATTGACTATC GTCAAATAAA 4080 CAAAAAACTT CTTTCTGACA AATTTCCACT CCCTAGAATT GATGACATTT TAGATCAACT 4140 AGGTCGAGCT AAATACTTTT CATGCCTTGA CTTGATGTCA GGTTTCCATC AAATAGAACT 4200 TGAGGAAAAC TCTAGGAATA TAACATCTTT TTCAACGAGC AATGGCTCAT ATCGCTTCAC 4260 GCGATTACCA TTTGGTCTTA AAATAGCACC AAATTCATTT CAGAGGATGA TGACTATATC 4320 ATTCTCGGGA TTAGAACCCT CTCAGGCATT CCTTTACATG GATGACTTAA TGGTGATAGG 4380 ATGTTCCGAA AAACACATGA TTAAAAACTT AACTGACGTT TTTAATGTAT GTAGGAAATA 4440 TAACCTAAAG TTGCATCCAG AAAAATGTTC ATTTTTCATG CACGAAGTGA CATTCCTAGG 4500 TCACAAATGC ACAGACAAAG GAGTATTGCC AGATGACAAG AAATATGACG TCATCAAAAA 4560 TTATCCTGTC CCTCATGATG CGGACAGTGC AAGACGATTT GTAGCATTCT GCAACTATTA 4620 TCGTCGATTT ATAAGGAACT TCGCCGACTA TTCACGGCAC ATAACTAGAT TATGTAAAAA 4680 GAATGTCCCT TTTGAATGGT CAAGCGAATG CCAGAACGCA TTCGAATACC TAAAAGAAAA 4740 TCTTATGCAC CCCACACTAT TACAATATCC TGATTTTCGC AAAGAATTTT GCATTATAAC 4800 GGATGCTAGT AAACAAGCTT GCGGAGCGGT TCTAACTCAG AACCGAAACG GGATTCAGCT 4860 CCCAATAGCT TATGCATCAC GTTCATTTAC AAAAGGAGAA AGCAATAAGA GTACAACGGA 4920 ACAAGAACTA GCGGCAATCC ATTGGGCAAT TACCCATTTT AGACCATACA TTTATGGCAA 4980 GCATTTCACC ATTAAAACGG ACCACAGACC ATTAACGTAC CTATTTTCTA TGACTAATCC 5040 CAGTTCTAAA TTAACTCGCA TGCGGCTAGA ACTAGAAGAA TACGACTTCA CAGTAGAATA 5100 CCTAAGGGGG AAAGATAATT TTGTAGCAGA CGCACTCTCA CGTATAAATA TAAAGGAACT 5160 CAAAGACATG CAACATAAAG TCCTGAAAGT CACTACCAGG CAACAAAGTA GACAAGAAAA 5220 CTGTACAGTA ACAAACAAGG AACTATTGCC TAGGCAAAGT ATCCAAAATG TATCTAAGCC 5280 CAACGTACAC GAAGTCATAA CAAATGATGA AGTACGAAAA GTAGTGACCT TGCGAATAAC 5340 TGAATCTATT TGTTTACTAA AACGAGGAAA TAAAGTTATT GCAAGAATTG ATGTTGACGA 5400 TTTATATACC AATGGAATTT TTGATTTAGG TCAGTTCTTC CAAAGGCTTG AAATGCAAGC 5460 CGGTATACTA AAAATCAGCC AACTCAAATT GGCACCGAGT GAAAAAATCT TTGAAACCAT 5520 TTCAATAGAT AATTTCAAAA ATATGGGCAA TATAAAATTG AAAACATTAA GAGTAGCGCT 5580 ACTCCAGCCG GTGACCATTA TAAAAACTGA AAAAGAGATA CAATCGATAC TGTCTACATA 5640 TCACGACGAT CTAATTCAAG GAGGTCATAC AGGCATTACA AGAACGCTAG CGAAAATAAA 5700 AAGACACTAT TATTGGAAAA ATATGACTCG TCATATAAAA GAGTACATAC GTAGATGTCA 5760 TAAATGCCAA ATGTCAAAAA CAACGACACA TACAAAGACC CCATTGACTT ACACAGAAAC 5820 CCCAACAAAT GCTTTTGATA TAGTGATAGT GGACACAGTT GGTCCACTAC CGAAATCAGA 5880 ATATGGCAAC GAATACATCG TCACACTAAT ATGTGATTTG ACGAAGTATC TAGTAACCAT 5940 ACCTGTTGCG AATAAGAGCG CAAATACTGT CGCAAAAGCT ATATTCGAAA ATTTTATACT 6000 AAAGTACGGT CCAATGAAGA CGTTCATTTC GGACATGGGT ACCGAGTATA AAAACAATGT 6060 AATTCAAGAT ATGTGTAAAT ATATGAAAAT TGAAAATCTT ACATCCACTG CATATCACCA 6120 CCAGACTTTA GGGACAATCG AACGAAGTCA TAGAACATTC AATGAATACA TTCGTTCATA 6180 CATCTCTGCA GATAAAACTG ATTGGGACGT TTGGATACAA TACTTTACAT ATTGTTTCAA 6240 CACAACACCA TCAGTCATGC ATAATTACTG TCCATATGAA CTAGTCTTTG GAAGATTACC 6300 AAGGCAGTTC GCAAATTTTA ATAAAACAGA TAGAATAGAA CCACTGTATA ATATAGAAGA 6360 TTACTCAAAG GAAATAAAAT TTAGATTAGA AATAGCATAT AAAAGAGCTA GACTTTTGTT 6420 AGAAAAAGCT AAGTCTTATA GAAAACAACT TTATGATAAG AAAACTTCAG ATTTTCAATT 6480 AAAAATAGGA GATAAAGTTA TACTAAGGAA CGAATCGGGT CATAAGTTAG ATCCAGTATA 6540 TATAGGCCCT TATACTGTAG AAACCATAGA AGACAGAGAT AACATAGTAA TTAGAGATAC 6600 AAAACAAAAG AAGCAAAAAG TACATAAGGA TAGACTAAAA ATATATAATC AATGAAACGT 6660 TTCATTTCAC TTAAGAAAAG GTCTGATCAA CCTCAAAACA AAAAAAAAAA AAACACAAAA 6720 AAAATTTAAT TACTATTTTT CCTTCTAAGA AAGTTAAACA TAAATCCAAA AACATCGTAA 6780 TTCAACATAC ATTTTTTGTA TTATTCTGTC ATTATACAAA AATGCTTTGA GACAAAACAT 6840 TGCTAATAAT TAATAAGAAA AATCAATTTC AAAAAAATTT TTTCCTTTCT AAACACAATA 6900 TTAATATTGA GAACTCAATG ACTACATATA TTACGTCATT TCTTTAAAAA GGGAGGTGTA 6960 GCATATTGGA CTAATCTACC CTAAGAATAC AATAGATGAT TGGGTATAAC ATAGCGTCAA 7020 TACATTGTGA CACTTTGTCA TAATAAATAT AAATATACAA ATATACAAAA AGACCACCAA 7080 AAACTACGTA AGCACTCCAG CGCCCCAGTA ATACGATCTA ACGCTTATAC ATAAGCCGAT 7140 CGCGGAGCGT GGGAATGCTG AGCATGCACT TTGCAGCTCA AGTGGTCAAT GCCTTCTGCA 7200 TGCATATGTA TATGTATAAA TGTAAGTAAG AATACATAGA TATAAGCAAT GTATGTGCGG 7260 GTTAGCTGAA CCCAACTTCA GCACACTTTG ATCATTCGAA TAAACAGATT CAAACAGAGC 7320 AGAGGTTCTG AGCTCGGAAA CCAAATCTTT TACATCTAT 7359 // ID HOPPER2 standard; DNA; INV; 1593 BP. XX AC AF541950; XX DR FLYBASE; FBgn0067381; hopper2. XX FT source AF541950:1..1593 XX CC Sequence from BDGP, August 2002. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1593 BP; 507 A; 301 C; 250 G; 535 T; 0 other; ATAAAAGTTA AAAGTTTCTA AAGTTAATTT TCAATATTAA TATTGTCTAA AATTTCATAG 60 TCGTCTTCCT CTTCACAATC AGCAGAGTCT GAAGAATCGT TATCAGGTTC GAAAGCTAAC 120 ATTTGAATGA CTTCTGGGGA AAGAGGCAGT CGCTTATGTT TTTGAACGCG CCTCCTTAAA 180 TTAATTGATT ATATTATGGG ATCCGAAGTA TCCTTTGCTC TGTGAAAGAG ATCTGCGAAG 240 CTACAAATAC GATTATGCTT TCTTGAATGG TGAAGTCTGT CCGACTTGTA AATTTTATTC 300 CTAGATTCTG CTGCCTCTTC TCCAAAATAG TCGAACTGTT TGCTCCAAGA TATTTTTGAA 360 GTGAACCAAT ATTTTGTGGA CTGTCGCTGT CATGGGAAGC CATGGATATT TGTCAACAAT 420 AATTTGTGCA GTTGTATGAC AAAGCTGCTC ATATTTTTCA AGGTCTATAG GTAATAGACA 480 TGACAGACAT ATTAATATTG TCCTCATATT AAAAATGAGC TGGATGTCAA CGCCTGTTAT 540 TTCGGAAAAT GCCTTATAGT TATTAAATGC ACGACGCGCC GTATTGTCAA CATTAGAACT 600 ACCGAATCCG CCTTGTTTTG GTTGATCAAC CTTAAGCGAT AGTTTTTCCC AAAACATTCG 660 CTGGGTGCAT TTTTTTCGCT CTAACTCCAT ATTTTTTTAT CAATTCCACC CACGATTCTC 720 CACTTTTTTA CTACAGTTTT GTACCCCATA TTTGGTACAA ACGCCGAAAA TCTACAATCT 780 TCCAACAAGG AAGTGGACTT AATCCATATT TTAAATTTCC TTGAATTTCC TTCTTTTACC 840 TTAAGGAGTA GGTGTTCTTC TAACCACTTT GAATAAACTT TCGAAAATTC GTTTAATTTC 900 GATAGCATTT TCTCAAATTT CGGAAAAGAT TTACTGAAAA CATTCTCGCA TTTTCCTTTA 960 GGTATCGTTC ATTAAAGTCT AGCTTGCTAT CAGAAAAATG CCCACTGATA AAAGTGTAAA 1020 AAGTATTTTC CTTTTGACGA AAACCCTTTT GCTTGCGCCA CACTTCCAGC AGGTCAGCAC 1080 TGGCAATCGA GATATTGCTT CCTAAAACAT AATATTTCTC AAAAAACCGC AAACGCACAT 1140 AGAGACTACA TGATATGAGC TAAGAATTGA ACACACTACA ACATGGATAT AAACACTTAC 1200 TGAACAAATT TGAACAAATT GTTGTAGCTC TCTTCAAAGT TGCAATTTTT TTCAAACAGC 1260 TACATGTGGA CATCACTTGC TAAATGTACA AATAGTTAGT AGTAGACGCA CACAATAAAC 1320 AATATATTAA CAGGAACACA TAATACAAAC CTGAAGATTG ATTATCCATT TCAAATTATA 1380 CTCTTTTGCG ATCTTCTTTT TAATTTCTAA CACTTTGAAA GTTAAGCTAA ATGCAGCCAC 1440 GTGGTATGTG CTCGCAACAG CTGAAATTAA CAGCTGTTAT TATAATGGTG CGCTGTTAAA 1500 TTAACTTTTG CGGGCTGAAA CATAACAGTT TAGAGTATTT CCAATATATT AATACTAAAA 1560 TACTGCAAAT TTGCATACTT GTGAAAAAAC ACA 1593 // ID STALKER2 repbase; DNA; INV; 7672 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063455; Stalker2. XX SY synonym: new-Stalker XX FT SO_feature five_prime_LTR ; SO:0000425:1..424 FT SO_feature three_prime_LTR ; SO:0000426:7248..7672 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 7672 BP; 3078 A; 1430 C; 1194 G; 1970 T; 0 other; TGTAGTGTAT CTACCCTCAC TATAACTCTA CTCTACATAT ATATAAGTAA CGTACATACA 60 TTGTGACACT TTGTTGCAAA CACAAATAAA CATAATTCAC ATCAAAGACC ACATGCACTT 120 ACATAAACAC TCCAGCCAAT GAAATACGAT CTAACGCTTA TACATAAGCC GATCGCGGAG 180 CGTGAGAATG CTGAGCATGC ACTTAGCAGC TCAAGTGGTC AAGCCATACA TAACATATGT 240 ATGCCTTCTG CATACACATG TATATGTATA TACAATATGT ACAATATGTA AGAACACCAT 300 GTACGGGTAG CTGTACCCAA AGACAGCAAC ATAGGATTCA TTCAAATAAA ACGATTCAAA 360 CGGAACAGAC GCTCTGAGCT ATTCAATATC TATTACACTG AGCTATTACT TATTACTTAT 420 TACATGGCGA CCGTGACTTG GTCTCGAGTC TGCCTCTGTG TTGTGCCTCT GTGTATTGTG 480 ATTATGTATC GTGTATTGTG TTTGTGTTAA AAAAAAAAAA AAAAACAACA TTTTGTGCCT 540 ATTATATTCA ACGTGTGAAC ACACAAACAT AACTAAAATG TACTTGTAGT ACTTGGACGC 600 ATTAAATGCA AATATATGTT CAATTGAAAC AAACAAAGCA CAAAAACACA CAAAAACACA 660 CACAAAAACG CACAAAAACA CGCAAACACA GCACACACAG AAGAACAAGC GGCGATTGAT 720 GGCAGACGAG CCACAATTCG CCAATGCACA ACCACAAGTG CAACGGGACC AGCCAACGCT 780 AGAGGAAGCG TTGCGGTTAA ACAACGCCGA TGGACCACGC CCACTCACAG TAGCTGAGTA 840 CCGGGCACGG CAGGAGAAGA AACAACTACG AAAGCACAAA CGCTCAGGAC GGAGGATTAA 900 ACTACTACAA CAACGCCGAC TGGTCAAGGA AATGACCCAG TTGGCAAAAG AAGAATCAGC 960 ACGACAACGC TACCAAGAGC GTTTGGAAGC CATTGAACAA GAACTTCGCC AAAGTGCGAA 1020 GACACGCAAA CGGGCTGCTT AAATGCAAAT GCCCCAATTT GCCCTAGAAT TAAAATAAAC 1080 CCAAGCCCGA GGCGGGAAAC CGCTGCTTGA AAATACTAAC AAAACTTTAT GCTAGGCTTT 1140 ATACTAAAAA ATGTGGTTGG AGATTTTTTT TTTAAAATCC TATTGAATTG TAAATTGTGA 1200 CTTGTGAGCC AACCACATTA GCACTATCAA ATTTCCATGT CTCTGCCACT GAGTATATAT 1260 ACTCAGTTGC AAATAATTTG TGTAACTTTA ACAACTTTAA ATTTTCTATT TCAAAAATAA 1320 AATTTTACAT TTTAAATTTA AATGAATAAC ATTTTAAAAA TGGATTAAAA AAAAAAAAAA 1380 AATAACTTTC TTAAAGAGAA TTTTCTTTCT AAAAATACAA GAAAAAATAG ATTTTTATTT 1440 TTCATATTAT ATATATAATA ACATTATATA TAATACTTCA AAAAAAAAAA CGTTTCATTG 1500 AATATATAAA AATATTCAAC ACAAATTAAA ATAAATTTTC TCCTCAATTT AAAATTACAC 1560 AAATCAAACA AAAATTAGAA ATAAAATTAG GAAACCTAAT AAGTAAAACA ATTATATTAC 1620 ATGTCATTAA AATTGAATGA GCATTAAAGA TAATTCCATA CCGAAGATAG ACCCCCTAAA 1680 GACAATCATT TCGGAACTGT CTACTATGTT AGACAAAAAT TCCAATATAG CGGACTCCAC 1740 CGCAAACGTA ATCAATACAG TTCAGGTATC ACAGCCCGAA CTTAAAATAA TTACGGTACT 1800 CCTCATAATA ATAGTAGTAC TGCTATGTGC GAGTATGATA ACGAAATTGT ACAAACTACA 1860 TAACAGATGC CTCAAGAAGA AATACTTGAG TAAGGCATTG GACCTAGATA AGGTCTAAAG 1920 TAGTACCCTG CGTACAAATT TCGAATATTC AATTAAATAG AAATTATCTT TACATATATA 1980 TATATAAAAA AAAAAACAAT ATCAAACCCA CTATAACTAG GGTCAACAGA AATAATAACA 2040 GTTGGTAAGA ATTAGAGCAT GGAATGGCAT GAAATATGCA CAACAATTAA AAATATTAAA 2100 ACTAAATTCG ATAAGACATA TAAATGCTTA TCTGCCCAAT AGGCCTATAC AGGCTGAAAC 2160 AATTAAAAAA CATGCATCTA CATTAGTCGA CTGCTTTAAT GAAGCACGAA CACTAATATA 2220 CGAACATAGA GAAACACTTA ATTCTGAACA CTGGTCTAAG TTATCAAGAC TATTAATCAA 2280 ACTTCGATTA AACTTGATAG CAGTTAAAAG GAAATTTAGT TTGGAAATTT CAATTCCAAC 2340 AATACTAAAT ACCCCTTTAA CGATTGATAC AGACGAACAA TCTGAATCAA TAGAACCAGA 2400 AGAAATCGAT TTAGAGGTCA CCGAGTCGGA AGAAAAATAT ACCGACATCG AGGATAAAGA 2460 CCTACTAAAC CTAACAATTC CAGCTATATT AACATTAGCT GAAGACACTA ATCAGGAAAA 2520 ATTTAAAACA GAAAACATAA TGGGCACAGT CCAATATTGA TTTTCTGAAT ACAGCATCAA 2580 AGCTTATACC CGACTTTGAC GGTAAGGCTG AAAACTTAAC AAGTTTTATA GATGCTCTAA 2640 ACATTGTAGA TACAATCAAA GGTGAGCATG AGTCTCTAGC TGTTTCGGTT ATAAAAACCA 2700 AACTTAAAGG CCATGCAAGA AACCTCATAA GTAATGAGCA GACAATTGCT GCAATCATTA 2760 CCCAACTGTC AAGTGCAGTT AAAGGAGAAT CGGTAGAAGT TATATCAGCT AAGCTTCTCA 2820 GTCTCCAACA AAAGAGTAAA ACGGCCAACC AATACACCCA AGAGGTGGAG AAACTGACAA 2880 AGGCTCTTGA AGGCGCCTAT ATCAGTGAAG GTTTAGGCCA AACTCTTGCC AATAAATACA 2940 GCACTACCAC AGCTGTAAAG GCCATGACGA AAAATTGCTC CATTGATAAG GTAAAACTTA 3000 TCATGCAAGC AGGCACATTC ACAAACATGA ATGATGCCAT ATCTAAATTT GTAAACAGCT 3060 GCACGGAGAT AACAGGTCAG AGTAACACTG TACTCTATTA TCGACGAGGT GCAAATAACA 3120 ATAATAGAGG GGGCCGAGGT TATAATCGCG GCAGAAATGG CAACAATTAC AACCGAGGCA 3180 ACAATAATAG TAATAACTAT AACAACCGTG GAGGAAGACG AGGCCGAAAC CAAGGTAGAG 3240 GCCGCGGCAA CTCCAACCAA GGTTACAATA CCAACAATGT GAGAGTGACG CAAAACACGT 3300 CGGAAAACTC ACAGACCCCT TTAGGAAACA ATCAATAAAT GCTAGAGTTC ATTCCATCAA 3360 TTATAGTCTT AATATATTCG GTAACTTTTT ACAATAATTC AACCTGATAA CAAGTTAACA 3420 TTTCTCATTG ATACTGGTGC AGATATCTCA CTTTTAAAAG TTAATTCAGA TAACTTCAAA 3480 ATTCAGGATG ACAAAATAAT AAACATCCAA GGCATAGGCC AAGGTGTAAT AAAGTCTCAA 3540 GGAACAACCC TAATAGATCT TCAATCAACA AAATATATTA TTCCACATGA ATTTCATTTG 3600 GTAAACTCAA ATTTTTCAAT ACCATGTGAT GGAATAATAG GCATTGACTT TATAAAAAAA 3660 TTCAATTGCC AACTTGACTT CAAACCAAGT GAAGACTGGT TTATAATTAG ACCCCAAAAT 3720 TTAAATTATC CAATATATGT CCCAATAACA TATAGCGCTG GCAACAATAC AGTTCTTCTG 3780 CCAGCCAGAT CACAAGTTAT TCGGAAAATA GACATCAATA GCGTAAATGA TCAAATATTC 3840 GTTCCTAATC AGGAAATACA TAATGGAATT TATGTTGCGA ATACAATAGC AGCATCAAAA 3900 AATGTATATA TTCGACTTCT AAATACCACT AATTTCGACC AAATGGTCAA AGTAAACAAA 3960 ATCAAATATG AAAATCTTAA AGATTATGAC ATTCATAATA CCAATCTAGA AGATAGAAGC 4020 GAAAAAGTAC TATCATTACT GAAGAAGAAT TTTCCAGAAC AATTTAAAAG TCAATTAACT 4080 GAATTATGCA CAAAGTATAG TGATGTGTTC GGACTGGAAA CCGAACCCAA TATCAACAAA 4140 TAATTTTTAT AAACAAACAT TAAGACTTAA AGATGATGAA CCCATTTATA TTAAGAACTA 4200 CAGAAGCCCG CATAGCCATA TCGAAGAAAT TCAAAAACAA GTAGGGAAAT TAATTAACGA 4260 CAAAATCGTA GACCCATCTG TATCTGAATA TAACAGCCCA CTCTTGCTAG TTCCGAAAAA 4320 ATCATTACCG AATTCAGAAC AAAAGAAATG GCGATTAGTA ATTGACTATC GTCAAATTAA 4380 TAAGAAACTA CTTTCTGATA AATTCCCACT CCCTAGAATT GATGACATTT TAGATCAACT 4440 AGGTCGAGCT AAATACTTTT CATGCCTTGA CTTGATGTCA GGTTTCCATC AAATAGAACT 4500 TGAAGAAAAC TCAAGAAATA TAACATCTTT TTCAACAAGC AATGGCTCAT ATCGCTTCAC 4560 GCGATTACCC ATTTGGTCTC AAAATAGCAC CAAATTCATT TCAGAGAATG ATGACTATAT 4620 CATTCTCTGG TTTAGAACCT TCTCAGGCAT TCCTTTATAT GGATGACTTA ATGGTGATAG 4680 GATGTTCCGA AAAACACATG ATTAAAAACT TAACTGATGT TTTTAATATA TGTAGGAAAT 4740 ATAACCTAAA GTTGCATCCG GAAAAATGTT CATTTTTCAT GCATGAAGTG ACATTCCTAG 4800 GTCACAAATG CACAGACAAA GGGAGTTTTG CCAGATGACA AAAAATATGA CGTCATCAAA 4860 AATTATCCTG TCCCTCACGA TGCGGACAGC GCAAGACGAT TTGTAGCATT CTGCAACTAT 4920 TATCGTCGAT TTATAAAGAA CTTCGCCTGA CTATTCACGG CACATAACTA GATTATGTAA 4980 AAAGAATGTT CCTTTTGAAT GGTCAAGCGA ATGCCAAAAC GCATTCGAAT ACCTTAAAGA 5040 AAAGCTTATG CACCCCACAT TATTACAATA TCCTGATTTT CGCAAAGAAT TTTGCATCAT 5100 AACGGATGCT AGTAAACAAG CTTGTGGAGC GGTTTTAACC CAGAACCGAG ACGGAATTCA 5160 GCTCCCAATA TCTTATGCAT CACGTTCGTT TACAAAAGGG GAAAGCAATA AGAGTACAAC 5220 GGAACAAGAG TTAGCGGCAA TTCATTGGGC AATTACCCAT TTTAGACCAT ACATTTATGG 5280 CAAACATTTC ACAATCAAAA CAGATCACAG ACCTTTAACA TATCTATTTT CTATGACCAA 5340 TCCCAATTCA AAATTAACTC GCATGCGACT AGAGCTTGAA GAATACGACT TCACTGTAGA 5400 ATACCTAAAG GGAAAAGATA ATTTTGTGGC AGATGCACTG TCACGTATCA CTATAAACGA 5460 GCTGAAAGAC ATATCAGCAA ATGTACTAAA AGTCACTACA AGACAGCAAA GTAAACAGAA 5520 AAATATCTGC GCAGATACTA ATATAAATAA ACAAGAAGAA ACTCTTGTTA ACGCTTCTAA 5580 GCCCAACGTA TATGAAGTCA TTAATAATGA CGAAATACGA AAAGTAGTGA CTCTGCGAAT 5640 AACTGAATCA AAATGTTTAT TCAAACATGG AAATAAAGTT ATAGCAAGAA TTGATGTTAG 5700 CGATCTGTAC ACCAATGGAA TTCTAGACTT AGGTCAGTTC TTCCAAAGGC TTGAAACACA 5760 AGCCGGTATA CATGAGATCA GCCAACTCAA AGTGGCACCG AGCGAAAAGA TCTTTGAAAC 5820 AATTTCAATA GACTCTTTTA AAAAAATGGG CAATAAATTA TTGAAAATAT TGAGAGTAGC 5880 CGCTACTCAA GCCGGTGACC CAAATATATG ATCAAAAAGA TCAAGAAGCG ATACTGTTTA 5940 CATACCATGA CGATCCAATT CAAGGAGGTC ATACAGGCAT TACAAGAACG CTAGCAAAAA 6000 TTAAAAGACA TTATTATTGG AAAAATATGA CACGGTCATA TCAAAGAGTA CGTTAATAAA 6060 TGTCAAAAAT GCCTTACGTC TAAAACAACG ACGCATACAA AAACACCCCT TACAATAACG 6120 GAAACACCAA CTTGCGCTTT CGACAGAGTG ATAGTAGACA CTATAGGTCC ACTACCCAAA 6180 TCGGAAAATG GTAATGAATA TGCAGTCACT CTAATCTGTG ACTTAACAAA ATATTTAGTT 6240 GCCATACCAA TTCCCAACAA AAATGCAAAT ACAGTCGCAA AAGCTATATT TGAATCATTC 6300 ATTCTGAAGT ACGGTCCAAT GAAGACGTTC ATTACGGACA TGGGAACAGA ATACAAGAAT 6360 AGCATTATAG CCGACTTATG CAAATATCTG AGAATAGACA ATATAACGTC TACAGCACAC 6420 CACCACCAGA CATTAGGTAC CGTTGAAAGA AGTCATCGAA CCTTCAATGA GTACATTCGT 6480 TCATATATAT CAGTAGATAA AACTGACTGG GACATATGGT TACAATATTT TGTGTACTGT 6540 TTTAACACTA CACCATCCGT AACGCATAAT TACTGTCCAT ATGAACTAGT ATTTGGTAAG 6600 ACTAGTAATT TAGCAAAACA ATTTAGTAAC ATAGATAATA TAGACCCTAT ATATAACATA 6660 GATGATTACG CTAAAGAAGT TAAATTTAGA CTAGAAAACG CGTATAAAAG AGCACGCATA 6720 TTATTAGAAA GAAATAAAGT AAAACAGAAA ACCAGTTATG ATAACAAAGT TAGCGATTTT 6780 AAACTAAAAG TAGGAGATAA CGTTTTAATA AGAAACGAAA CAGGTCACAA ACTAGAACCA 6840 ACGTATCTAG GGCCATTCGA AGTAATTAGA ATCGAAGAAA CCAATAACAT AGTAATTAGG 6900 AATAAAAAGA CCAAAGACCA GAAAGTTCAT AAGGATAGGT TAAAAATATA TAATTAATGA 6960 AACGTTTTAT ACAAAATAAA TAACTAAAAA AAAAAAAAAA GGGTCTGATC AACCGAAAAA 7020 AAAAATTTAA TAAGTTTTTG TCTAAGAAAG TTAAAAATAG AAGCATAACG TAATTATAAC 7080 CCCCCCAAAA AAAAAAAAAA TATAAAAAAA AAAAAATTCT TTTACACAAA TTATTCTGAG 7140 ACCAAACTTT TTCTAATAGA AAGGATAAAT AAGAAATAAT ATAAGAAAAA AAATGTTTTA 7200 ATTAATTAAA TGACTACATA TATTACGTCA TTTCTCTAAA AAGGGAGGTG TAGTGTATCT 7260 ACCCTCACTA TAACTCTACT CTACATATAT ATAAGTAACG TACATACATT GTGACACTTT 7320 GTTGCAAACA CAAATAAACA TAATTCACAT CAAAGACCAC ATGCACTTAC ATAAACACTC 7380 CAGCCAATGA AATACGATCT AACGCTTATA CATAAGCCGA TCGCGGAGCG TGAGAATGCT 7440 GAGCATGCAC TTAGCAGCTC AAGTGGTCAA GCCATACATA ACATATGTAT GCCTTCTGCA 7500 TACACATGTA TATGTATATA CAATATGTAC AATATGTAAG AACACCATGT ACGGGTAGCT 7560 GTACCCAAAG ACAGCAACAT AGGATTCATT CAAATAAAAC GATTCAAACG GAACAGACGC 7620 TCTGAGCTAT TCAATATCTA TTACACTGAG CTATTACTTA TTACTTATTA CA 7672 // ID STALKER3 repbase; DNA; INV; 372 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0063454; Stalker3T. XX FT SO_feature long_terminal_repeat ; SO:0000286:1..372 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase6.6.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 372 BP; 138 A; 73 C; 73 G; 87 T; 1 other; TGTAGTGTAT CTACCCTCAA TATGTArAGT AGAGTTAATA TGTAAGTAAG TAATATGTAA 60 AGTAGAGTTA ATATGTAAGT AAGCAAAAGA CCACCAACAC TTACATGAAC ACTCCAGCTC 120 TTGAAATACG ATCGAGCGCT TAAACATAAG CCGATCGCGG AGCGTGAGAG TGCCGAGCAT 180 ACACCTAGCA GCTCAAGTGA TTAAGATAAG ATAAGATAAG ATAACAAACA CGTAGTCTTA 240 AGCGCGTCAT GTGCGGGTGG CTGTACCCAA GAACAGCAAA GTGAATTCAT TCGAATAAAC 300 CGCTTCAAGC AGAGCAGAGC CAAGTCTATT ATATCAACTT CAAAAATACC GTATAACCTT 360 GAACCTATTA CA 372 // ID AF541951 standard; DNA; INV; 1064 BP. XX AC AF541951; XX DR FLYBASE; FBgn0064134; Bari2. XX FT source AF541951:1..1064 FT SO_feature five_prime_LTR ; SO:0000425:1..119 FT SO_feature three_prime_LTR ; SO:0000426:946..1064 FT SO_feature CDS ; SO:0000316:565..873 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /protein_id="AAN34654.1" FT /translation="MKEVDINSYEKRLSFALEYKEKPIEFWFDVLWTDEIGFQFQRSFS FT KKFMHFPKKLKVECRPANQSIWWWHGDVLGLFKLLWIWRHGTDRGYNKSNRIPPYLK" XX CC Derived from AF541951 (Rel. 73, Last updated, Version 1). CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1064 BP; 373 A; 172 C; 201 G; 318 T; 0 other; aacgaaagaa gaagtggaaa aaataactgt ttattggcta aaaatagaag cattgaactg 60 atccttgtct gttgtcaaaa gtatttacac acaattttta cgcgtcaaaa ttatttacac 120 agttcaaatt gaagattagt ctttgatttg cattgagtaa gtgaagttct agttgatgtt 180 gatcgcttaa ttggaactta aactaactgt taacaggtaa gttaatagat aaaactaagc 240 atataatttg taaaatggca aaaacaaagg agctatcagt tgggcatcga gtcgagattg 300 tcactaaact taagactggt tcttctgcat ccaaattagc tgagttgtac aaaatatcac 360 gtaaaactgt gtacaattta gttaaaaaaa aggacactgg gaaatttgga aaattaaaag 420 aaacctggcc gaaaagttgc attgaaccca agactgcaga cacattattg gagttgtact 480 cagtaatccc accattagtc ccgttttaat tgccgcagct tcgaaaaatt taattggaaa 540 acatatagta cactgcgtcg cagaatgaag gaagttgaca tcaactccta tgaaaaacgt 600 ctctcatttg ccttggagta taaagagaag cctatcgagt tctggtttga tgttttatgg 660 actgatgaaa ttgggtttca gtttcagaga tcctttagca agaaatttat gcatttccca 720 aaaaaactaa aggttgaatg ccgtccagct aatcaatcga tttggtggtg gcacggtgat 780 gttctggggc tgtttaagct actttggatt tggagacatg gtaccgatag agggtacaat 840 aaatcaaaca ggatacctcc atatcttaaa tgagcacgca ttcacctcag gaaatagact 900 gttctccacg aaaaaaaaac gcaaaaaaca attaaaaaag ctgtatgtaa ataattttga 960 cgcgtaaaaa ttgtgtgtaa atacttttga caacagacaa ggatcagttc aatgcttcta 1020 tttttaaacc ctaaacagtt attttttcca cttcttcttt cgtt 1064 // ID DME487856 standard; DNA; INV; 8556 BP. XX AC AJ487856; XX DR FLYBASE; FBgn0063919; Max-element. XX FT source AJ487856:73..8628 FT SO_feature five_prime_LTR ; SO:0000425:1..321 FT SO_feature three_prime_LTR ; SO:0000426:8236..8556 FT SO_feature polyA_signal_sequence ; SO:0000551:282..287 FT SO_feature polyA_signal_sequence ; SO:0000551:8445..8450 FT SO_feature primer_binding_site ; SO:0005850:394..411 FT SO_feature RR_tract ; SO:0000435:8296..8305 FT SO_feature CDS ; SO:0000316:1710..7202 FT /db_xref="FLYBASE:FBgn0063918; Max-element\gag-pol" FT /protein_id="CAD32253.1" FT /translation="MPLEGDKKKTPAPGKEQKFNPQSPISTRGRPAPKSVSPRVKTATP FT TPKPKVLPSTSSKTTPRLVISRPVTRASSESSLSRKSESPSAVGTDFLRVTRSTTKRMA FT SSEQPTPANAALHKFIAVSDRVSLFEAKINTPDQASPSLHTLQVRLQQVRALWDKVERE FT YETCSDLMAQEGSLDTVPILQAKYDYCYSVYESCAAQIGETIDRATPQVAQAPSQPLIS FT SGCRLPPCDTEVFDGDYLRWPTFRDLFTAIYVNNPRLTPVEKLFHLLTKTSGEAKAIVA FT KSPLTNDGFASAWEALRDRFQNKRLLVNSQLKLLFNLSSISQESGHALKELQSTIQGCL FT TALEHSQVSTENWDCILVFLCASKLPKQTLSLWEQSLTAKSEIPAWEEMNAFLSERYRT FT LEAIEDMKPTQAVPKRLQSFETKVSTKQKGCDICSKENHPVRLCPRFLQMSVDSRSGYI FT KKKQLCLNCFARGHQLRDCTSMHSCFTCKGRHHTLLHRSPPNSENASSSTSPPTQQLPR FT PSSRNASASTSAVQNFFASGTSAVLLSTAMIDVCHLGTNYRARALIDSGSEATFISERL FT FNLIKLPFRNTRTQVSGLNHSVSAKSSKLCHFGIRSPTKPGLQLDTEAYVLPELSGKLP FT SYPIPRNSLKDLPALRWADPTFFESSQIDVLIGADILPSIMMDGTRQNICGSLLGQETI FT FGWVLTGPISKGKPKRVASFTTQVHQTGDPDSLDTLLSKFWEVEDLPVKMVKESDSYCE FT RNFLQTTTKDASGRYVVTLPFRDPENTGSDLGYSRSIALAQFLRNENRLKRDFPLKEQY FT DSVIQEYLDLGHMKEVPPTHNSPSYHLPHHAVVKPESTTTKLRVVFNASSPSANGISLN FT DILHAGPVLQSDLTIQVLKWRYFQYVFSADITKMYRQIWVDPKHTPFQRILFRNKEGDI FT RDFELKTVTFGVNCAPFLAIRVLQQLAEDIQVPFPNASRIIQQHMYVDDVLAGANSVNE FT AQSSIRELQAALSASGFPLRKWTSNNKSVLKDVPAEHLLHSEFLDIDAESTAKTLGIRW FT RAKSDEFYFVPPDIVVEASYTKREVLSQIARLFDPAGWLAPFIIRSKIFMQEIWLQNLG FT WDDKLPTEMSQRWQSFLEEYSDLNQIRVPRWIWYQPEVVIEHHGFCDASQRAYGAAIYI FT RVEMGQKILTRLLTAKTRVAPVKTVSLPRLELCGAVLLTEMVTAILPHMPSASSDIRCW FT TDSTIVLAWLRKPACNWTTFVANRVAKITQATPVDCWAHVRSEQNSADLASRGVSLQEL FT AENHLWWHGPEWLQGPRELWPAQSDTLPVTELEQRAVKVHFVKGPSIDFLERFSKLDKA FT LRVLVYVQRFFKRCRKGSFLPSSRPTSEEIREAERTLTSIAQRRAYGQELQHLTEKRPL FT PVSSPLVTLFPFIDQHGLLRACGRLTASKTLQYDERHPILLPYDCRLSRLIVQFTHQIT FT LHGGSQLIVRLIRTKYWIPKIKNLVKAVVNPCKICTIYKKRLQTQLMGDFPTDRVSFSR FT AFTYTGIDYAGPFEIKNYTGRACLITKGYVCVFVCFSTKAIHLEPTSDLTTEKFLAAFA FT RFVARRGCPQRVHSDNGKTFVGAAALISRDFLQAIKESVTDAYSHQGLVWRFIPPGAPH FT MGGLWEAGVKSFKTLFLKSTSVRKYTFEELATLLAKIEACLNSRPLSPMSEDPSDLLAL FT TPGHFLIGGPLLSTAEPEIKGEAKSIINRWQHLKAQHQQFSARWKEEYLKELHKRSKWQ FT FPTRNLQADDMVVVKEDNLPPNEWRLGRIVSAFPGADERIRVVEIRTSRGTIKRPVHKV FT ILLPMEDKESSVPRD" XX CC Derived from AJ487856 (Rel. 71, Last updated, Version 1). CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 8556 BP; 2217 A; 2323 C; 1897 G; 2119 T; 0 other; TGTTCACGCC AGAAGGGCGC AAACAGTTGA GCGGCAGTGT AAGCGGCCAA TGCTTGTACT 60 CGAAGCTCCT CCACGGCTCG CAAGCACCGC TGCTCGCGCT CTCCTTCTCT TCTCTCGCTC 120 TCCTCCCTTC CCGCAGGGTA TTCACCACTC CGCGGTCCCG CGGTATTTCT ACTGTTAGTG 180 TTAAGGTGGT TGTAATTTAC CAGTGCGTGA ATAAAGAACG AAAGGTTGAA GTCCACCCCG 240 CGCGTTTTTA ATTTCGGGGA AAATTTCAGC AGCCGCCGCA GCAACCACAA CCAAAGGAAT 300 CTTTCCGCCC ACTCCATTTC ATGGTCCTTC GAGCCGGATG AGCTCTGATC CTGCCAGCGC 360 TCCTTGCGGA AAATTCAAGA TAAGTACGGC TTTAAATCCT ATTCCGTCCG TGGTCGCCAT 420 TTTCGTTTTT TCCAACGGCG TTTATTCTTT CCGTCCATCC GTATCAAGTG AAAAGTGAAA 480 ATATATAAGT ACATGCGCCC AGCCACCGCG ATTACTTTTT TTTTTAAGTG CATAAACAAG 540 CACTTTGCCA ATACCGGCCC CCTAATTCGT CTAACGGTGC TCCGCTGATA TCCAAGCGCA 600 TGTACATATA TACATATACA CACTGTCCAA AATTCACTGC TATCGTCCGC TGCTAAACTG 660 CTGTTTTTTT TTTTGTGCCT CAATAAAATA TACAATACAA TACAATATAC AAATTTAATT 720 AAATTCAATA AAATATACAA TACAATACAA TGTACAAATT TAATTAAATT CAATTAAATA 780 TACACATACA TATAATCACC AAAAATTTCA CATACACACA AATATACATA GCCATATACA 840 TACATTACGA CATATCCATT GTCATCCAAC GATATTCACA TACACACAGT CGCGACGTAC 900 ACAGACTTGC CAAGTGCAAG TTCTTTCGTC TCCTCTTTGT TCGTTATTGC CCTGACAAAA 960 AAACCAGCGC ATAGCGACGA GAGAATACCC TATTGTCGAG ACATAAAACA AAATAAATAA 1020 GTCCGACGAT AAAATTTAAA AATAAATTTT AAAATATTTT ATTTATTTAT TTTTATTATT 1080 TAATAAATAA TTTAATAAAA AAAAGGGAGA GAAACGATTG TTTTAATTAT TATTATTCTT 1140 GATTATTAGT TAATAATTAT TTAACCCAAA TAATAATTTT TTTTTTTTTT ACAAGCTATT 1200 TCTGGCTCCC ATTTCTGGTT CCTAAATTAC AATAATAAAA TTGTGGTCCT AAATCCGGTA 1260 GCTATTTTTA TAAATTTAAA CTTGAATTTT CTAGCTATTT TCCATATATT AGCTATTGTT 1320 TTTCGGGATT TCCAGTCGAT AGCGATTTTT TTTATAAATC CTTGGCTATT TTCCAGACTC 1380 CAGCCGGTAG CTATTTTTTT TGCTTTTCGG GATTTTTCAG TCGGTAGCGA TTTTTTATAA 1440 ATCCTTGGCT ATTATCCAGA CTCCAGCTGG TAGCTATTTT TTTTTTTTTT TTTTAGCTAT 1500 TTTTTCAGAA ATTGCAATAA ACTGAATTTT AGCTATATTG TGCTTCAATC TCCAGATCCC 1560 AATTGCAACC TTCTCAATTT AAGTTTTTTT TCCAGTTGAT TTAAGATCTA TTGGGGCCTT 1620 ATTTCACAAA AATTGAGATT TTTTTTTTTT TTTTTTCCTT CGTGCTCCAA TCTCCTTCTT 1680 GACTTCCTTC TCCTAGGGCA TAGCTGAGCA TGCCCCTAGA GGGAGACAAG AAAAAGACAC 1740 CCGCACCTGG AAAAGAACAA AAGTTCAACC CGCAATCTCC GATATCCACT CGCGGTAGGC 1800 CAGCACCCAA GTCCGTTAGT CCCAGGGTCA AAACCGCGAC TCCCACTCCG AAACCAAAGG 1860 TTTTGCCATC GACCAGTTCG AAGACAACTC CTAGGTTGGT CATTTCCCGA CCAGTGACGC 1920 GAGCGTCTAG TGAGTCTTCT CTATCGAGAA AATCCGAGAG TCCCTCCGCT GTCGGGACCG 1980 ATTTCCTTCG CGTCACACGC TCTACCACAA AGAGAATGGC ATCCTCTGAA CAGCCGACGC 2040 CCGCAAACGC AGCGTTGCAT AAATTCATCG CCGTCAGCGA TCGCGTAAGC CTTTTCGAAG 2100 CGAAGATCAA CACTCCAGAT CAAGCCTCTC CGTCCCTACA CACGTTACAA GTCCGTCTGC 2160 AACAGGTGCG AGCCTTATGG GACAAAGTGG AAAGAGAGTA CGAAACATGC TCTGACCTAA 2220 TGGCCCAAGA AGGATCCCTA GACACAGTGC CTATTCTCCA GGCCAAATAT GACTACTGCT 2280 ACTCAGTATA CGAGTCATGT GCAGCACAAA TTGGCGAAAC AATCGACAGA GCAACGCCTC 2340 AAGTCGCGCA AGCACCCTCT CAGCCGCTAA TTTCGTCCGG CTGCCGCTTA CCTCCATGCG 2400 ACACGGAAGT CTTCGATGGC GACTACCTCC GATGGCCCAC TTTCCGGGAC CTATTCACGG 2460 CAATTTACGT AAACAACCCC AGGCTGACTC CGGTCGAGAA GCTATTCCAC CTCCTTACCA 2520 AAACAAGTGG CGAAGCAAAA GCCATCGTGG CGAAATCTCC TCTCACGAAT GATGGTTTTG 2580 CTTCGGCTTG GGAGGCGCTT CGAGATCGGT TCCAAAATAA GCGACTTTTA GTCAACAGCC 2640 AGCTCAAGCT TCTTTTTAAC TTGAGCTCAA TTTCGCAAGA ATCTGGTCAT GCTCTAAAAG 2700 AGCTACAATC CACGATTCAG GGGTGTTTAA CGGCACTAGA GCATTCCCAG GTATCCACAG 2760 AAAACTGGGA TTGTATCTTG GTTTTCCTTT GCGCGAGCAA ACTGCCAAAG CAGACCCTGT 2820 CCTTATGGGA ACAGTCGCTA ACTGCCAAAT CCGAGATCCC AGCTTGGGAA GAAATGAATG 2880 CCTTCCTGAG CGAAAGGTAT CGGACATTGG AAGCCATCGA AGATATGAAA CCGACTCAGG 2940 CGGTTCCCAA AAGGCTTCAA TCCTTCGAGA CAAAGGTCAG CACCAAACAG AAAGGGTGTG 3000 ACATATGTTC TAAGGAGAAC CATCCGGTAC GATTGTGCCC GCGTTTTCTT CAAATGTCTG 3060 TTGACTCGCG CTCCGGATAT ATAAAAAAGA AGCAGCTATG TCTGAATTGT TTTGCCAGAG 3120 GTCATCAGCT ACGTGACTGC ACAAGCATGC ACAGCTGCTT TACATGTAAA GGCAGGCACC 3180 ATACGCTGCT GCATCGCAGC CCCCCAAATT CCGAAAATGC GAGTTCCTCA ACCTCGCCCC 3240 CTACACAGCA ACTTCCGAGA CCCTCTAGCA GAAATGCATC GGCAAGCACC TCAGCGGTGC 3300 AAAATTTCTT CGCGTCCGGC ACTTCAGCCG TCCTACTGAG CACAGCGATG ATAGACGTGT 3360 GCCATTTGGG GACGAACTAC CGAGCCCGAG CCTTAATCGA CTCGGGATCC GAGGCGACGT 3420 TCATTTCAGA ACGCCTGTTC AACCTTATCA AGTTGCCATT CCGCAACACC CGGACCCAAG 3480 TCTCCGGGTT AAATCATTCG GTCTCCGCGA AATCCTCGAA GCTGTGTCAC TTCGGGATTC 3540 GCTCTCCGAC TAAGCCAGGT CTACAGTTAG ACACTGAAGC GTACGTCCTT CCGGAGCTCT 3600 CAGGCAAACT GCCCTCCTAT CCGATCCCTC GGAATTCTTT GAAGGACCTG CCCGCACTCC 3660 GCTGGGCAGA TCCTACCTTT TTTGAGAGCT CTCAAATTGA TGTACTGATC GGGGCTGACA 3720 TTCTACCATC CATAATGATG GATGGCACCC GACAAAATAT TTGCGGCTCG CTCCTGGGCC 3780 AAGAAACTAT TTTCGGGTGG GTGCTAACGG GGCCGATTTC CAAGGGTAAG CCGAAACGGG 3840 TCGCATCCTT CACGACCCAG GTGCACCAAA CAGGCGACCC TGATTCGCTC GACACGCTTC 3900 TCAGCAAGTT CTGGGAGGTG GAGGATCTAC CAGTAAAGAT GGTAAAAGAG TCGGACTCCT 3960 ATTGTGAAAG GAACTTCCTC CAAACGACCA CAAAAGACGC AAGCGGGAGG TACGTGGTGA 4020 CGCTCCCGTT CCGGGATCCC GAGAATACCG GATCCGATTT AGGATATTCA AGGTCCATTG 4080 CGCTAGCTCA GTTTCTTAGA AACGAAAATC GTTTAAAAAG AGACTTTCCA TTAAAAGAGC 4140 AATATGACAG CGTGATCCAG GAGTACCTGG ATCTGGGGCA TATGAAAGAA GTTCCGCCGA 4200 CTCACAATTC TCCCTCGTAT CATCTTCCTC ATCACGCGGT AGTTAAGCCC GAAAGCACCA 4260 CTACAAAGCT TCGCGTTGTA TTCAACGCTT CAAGTCCGTC GGCAAATGGG ATCAGCTTAA 4320 ATGATATTCT TCATGCTGGT CCAGTCCTAC AATCCGATCT AACGATCCAG GTCCTGAAAT 4380 GGCGTTATTT TCAATACGTC TTCAGTGCAG ACATTACGAA AATGTATCGT CAGATCTGGG 4440 TCGATCCAAA ACACACCCCG TTTCAAAGAA TTCTGTTCCG AAACAAGGAA GGAGATATCC 4500 GAGACTTCGA ATTAAAAACA GTTACCTTCG GGGTCAACTG CGCCCCCTTC CTCGCCATTC 4560 GAGTCTTGCA ACAGCTGGCA GAGGATATCC AAGTGCCATT TCCAAATGCC AGCCGCATCA 4620 TCCAGCAGCA CATGTACGTC GACGACGTTC TGGCAGGGGC GAATTCCGTA AACGAAGCCC 4680 AAAGTTCAAT TCGAGAGTTG CAAGCAGCCC TAAGTGCCTC CGGGTTTCCG CTAAGAAAGT 4740 GGACCTCAAA CAACAAAAGC GTCCTTAAAG ACGTCCCGGC CGAACACCTC CTTCATAGCG 4800 AGTTCCTCGA CATCGATGCC GAAAGCACGG CCAAGACGCT CGGTATACGG TGGAGGGCAA 4860 AGTCCGACGA ATTCTATTTC GTTCCTCCAG ACATAGTTGT GGAGGCCTCC TATACAAAGC 4920 GAGAAGTTTT GTCCCAAATC GCTAGGTTGT TCGATCCTGC CGGGTGGCTC GCGCCGTTCA 4980 TAATCCGGTC AAAAATATTC ATGCAGGAGA TTTGGTTGCA AAACTTAGGC TGGGACGATA 5040 AGCTTCCCAC TGAAATGAGT CAGCGGTGGC AATCGTTTTT AGAGGAGTAT TCCGACCTCA 5100 ACCAGATCCG GGTTCCGAGA TGGATCTGGT ACCAGCCTGA GGTAGTCATA GAGCACCACG 5160 GTTTCTGTGA CGCGTCTCAG AGAGCCTATG GAGCGGCTAT ATACATCCGC GTCGAGATGG 5220 GGCAAAAGAT CCTGACTCGC CTGCTTACAG CCAAAACACG AGTAGCCCCA GTCAAAACCG 5280 TCTCCCTTCC TCGGCTAGAG CTCTGTGGTG CCGTGCTGTT AACCGAAATG GTGACAGCAA 5340 TCCTTCCGCA CATGCCCTCC GCCAGTTCAG ATATCCGCTG CTGGACAGAT TCCACAATCG 5400 TCTTAGCCTG GCTACGAAAG CCTGCGTGCA ACTGGACCAC ATTTGTGGCC AACAGAGTTG 5460 CCAAGATCAC GCAGGCGACG CCAGTCGACT GTTGGGCGCA CGTTCGCTCA GAACAAAACT 5520 CCGCCGATCT AGCCAGTCGA GGCGTATCCC TTCAGGAATT GGCAGAAAAC CATCTCTGGT 5580 GGCATGGACC AGAGTGGCTG CAAGGGCCAC GAGAGCTATG GCCAGCACAA AGCGACACTC 5640 TTCCAGTGAC AGAGCTAGAG CAGCGCGCGG TCAAAGTGCA TTTTGTCAAG GGCCCGTCCA 5700 TCGACTTTCT CGAACGTTTC TCCAAATTGG ACAAGGCCCT GCGAGTTCTG GTCTATGTTC 5760 AACGCTTCTT TAAACGCTGC CGCAAAGGTT CGTTCTTGCC CAGTTCACGA CCTACGAGTG 5820 AAGAGATCCG AGAGGCAGAA CGAACTCTGA CCTCCATTGC GCAGCGCAGA GCATATGGCC 5880 AAGAGCTTCA GCATTTGACC GAGAAAAGGC CTCTTCCAGT GTCAAGTCCT TTGGTGACTT 5940 TGTTCCCGTT CATTGACCAA CACGGCCTTT TAAGAGCGTG CGGCCGTCTC ACTGCATCCA 6000 AAACCCTGCA ATATGATGAA CGTCATCCGA TTCTTCTCCC CTATGACTGT AGACTGTCGC 6060 GCCTCATCGT CCAATTTACT CACCAGATTA CTCTTCATGG CGGTAGTCAA TTGATCGTAC 6120 GCCTGATCCG AACCAAGTAT TGGATTCCTA AAATCAAGAA TCTGGTGAAG GCTGTGGTCA 6180 ATCCGTGTAA GATTTGCACG ATTTACAAGA AGAGACTCCA AACACAGTTG ATGGGCGACT 6240 TCCCGACAGA TAGAGTCTCC TTCTCGAGGG CGTTCACCTA CACGGGTATC GACTATGCCG 6300 GCCCTTTCGA AATAAAAAAT TATACAGGGA GAGCGTGTCT CATAACGAAG GGATATGTCT 6360 GTGTGTTTGT GTGCTTTTCC ACGAAGGCTA TCCATTTGGA ACCCACTTCC GATCTAACAA 6420 CCGAGAAATT TCTAGCGGCC TTTGCTCGTT TCGTAGCGAG GCGCGGTTGT CCTCAGCGGG 6480 TCCATTCGGA TAATGGTAAA ACCTTTGTGG GGGCGGCAGC TTTGATTTCC AGGGACTTTC 6540 TCCAAGCCAT CAAAGAGTCC GTGACTGATG CATATAGCCA CCAGGGACTC GTATGGCGGT 6600 TCATCCCACC AGGGGCTCCA CATATGGGGG GTTTGTGGGA AGCAGGGGTG AAAAGCTTCA 6660 AAACCCTGTT CCTTAAATCG ACGTCAGTCC GCAAATATAC ATTTGAGGAG CTGGCGACGC 6720 TTCTCGCTAA GATTGAGGCT TGCCTCAACT CTAGACCCCT CTCCCCTATG TCCGAAGATC 6780 CCTCAGATTT GCTGGCCCTC ACACCAGGCC ATTTCCTTAT TGGAGGGCCG TTGCTTTCCA 6840 CGGCGGAACC CGAGATAAAG GGCGAAGCCA AGTCGATAAT AAATCGATGG CAACACCTCA 6900 AGGCACAACA TCAGCAGTTT AGTGCACGGT GGAAAGAGGA GTATCTTAAG GAACTCCATA 6960 AACGAAGCAA ATGGCAATTT CCGACCAGGA ATCTCCAAGC CGACGATATG GTAGTTGTCA 7020 AAGAGGACAA TCTACCACCG AACGAATGGC GGCTCGGCAG AATCGTTTCT GCCTTCCCAG 7080 GAGCCGACGA GCGCATTCGA GTCGTCGAGA TCCGTACGTC TCGCGGCACC ATAAAACGCC 7140 CTGTCCACAA AGTTATTCTG CTACCGATGG AGGACAAAGA GTCCTCCGTT CCTAGGGATT 7200 AGGGCACTTC CCCCATTCCG GGGCCAAAAT AGTAGTCGGC TCATACTAAG CTTCTTCATT 7260 TCTTTTATTG GCAGACTTAC TCACTCCTCC AATAATGGCT CCTCGTCCTC GTGCCACCCA 7320 GTCGTTGGAG AGCAGACGTA CTCGAGGTAT TAAATCCTAC CGCTGCCGAG TCTGCTCTGG 7380 AATCCATCCT CTTCGAAAGT GCACGAGGTT CCACAAACTG AGCGTTGAAA AGCGCCTTCG 7440 GGCAGTACTT ATTAATAAGT ACTGCTCGAA CTGCCTCGCC CATCAGCACT CAGGAGGAGA 7500 CTGTCGCAGC CAAGAGGGAT GCAAGAAGTG TGGAGGAGAC CACCACACCC TACTCCACAT 7560 GCACGAGGTC CTCCCCGCTC CGAACCCGGC AGCGCTACCA GCTCCGACGC GCAGAGAGCG 7620 CCATCCCGCT GCACCTCGTC CGGTCCGGAT TTCCCGCACT CCGCCACCAG CCCCAGTCGC 7680 CAACAATCGC CCGCGTCAGC AGCAGCAGAA GGCTGTCCCC ATACTGCCTA CAGCCATCGT 7740 CGTGCTGGAC ACGGGCTCGA AGACCTTCGA GACCGGGGCC ATGATCGACC CATGCATGCC 7800 GGTGAGCAGC ATCGACCGGT CGTTAGCGGC TGCGTTCCGG CTGCCCATCA CTCGGCTGGG 7860 AGGCAACGAG ATCTGCTCGG TGACACTCCG GTCCCGAACC AGCACCTTCC GACTCAACGT 7920 CGTCCTGAAG ATCGATCCCA TCCTAAGGAT CCGGACACCC ATCCGAGCTC TGAGCGACGC 7980 CGCCAGGGCC AAGTTTGACG GTGTTCGTCT CGCGGACGAG CGCTTCCACC GGCCGGCCTC 8040 CATCTCCCTC GTGTTGGGAT CAGATGTATA TGCCAATTTG ATCCAACCGG GGTTCCTAAA 8100 AATTGAGGAT GGGTTGCCCG TCGCGCAGAA CACGGTGTTC GGATGGACCG TTTCTGGAAC 8160 GTGCGCGAAA TGAAGAGGAC GATCCTGATA TCTAGCCGGC CAGGGATACC TTTGCCTACG 8220 CAAGGGGGGG GAGAATGTTC ACGCCAGAAG GGCGCAAACA GTTGAGCGGC AGTGTAAGCG 8280 GCCAATGCTT GTACTCGAAG CTCCTCCACG GCTCGCAAGC ACCGCTGCTC GCGCTCTCCT 8340 TCTCTTCTCT CGCTCTCCTC CCTTCCCGCA AGGTATTCAC CACTCCGCGG TCCCGCGGTA 8400 TTTCTACTGT TAGTGTTAAG GTAGTTGTAA TTTACCAGTG CGTGAATAAA GAACGAAAGG 8460 TTGAAGTCCA CCCCGCGCGT TTTTAATTTC GGGGAAAATT TCAGCAGCCG CCGCAGCAAC 8520 CACAACCAAA GGAATCTTTC CGCCCACTCC ATTTCA 8556 // ID BS3 standard; DNA; INV; 1790 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067624; BS3. XX FT source nnnnnnnn:1..1790 FT SO_feature CDS ; SO:0000316:<2..1618 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; BS3\RT" FT /protein_id="" FT /translation="IKRRCARKAPLADNSGAWCRTFIEQAEVFAVHLAERFQPLNLASP FT QDVDATHDQLSQALQMDLPMQPITPSEIADVIAKQNPKKAPGHDTTCNSTLRTLPRCAI FT LYITLLFNAMVRLQYFPPQWKLGIISMIHKPGKPEKNPGSYRPISLLPSISKVFERLIA FT ARMVRIMEAKGILPEHQFGFRAGHCTVEQLHRVVEQILSAFENKEYCNALFLDVREAFD FT RVWHSGILLKIKNTLPAPYFGLLRSYLEKRRFAVRFHSALSNEHNVAAGVPQGSVLGPL FT LYCLYSYDMPRPDVSLPGTSMLATFADDVCVTYRSCCEHDAADDIQDFASTFAEWAKRW FT NIGINGDKSANVCYTLKRKTPPAVLIDGTPVPQSNSAKYLGVILDRRLNFSKQVSAMRV FT RIRAAASKHFWLINSRSKLSLSNKVTIYKQIIAPIWRYGCQIWGLACDSQIRRIQAAQN FT KIARMITGCEWYVRNTTLHKDLKLATVFEAINMHSSRYHDRLERHRNRLAKALSRARPP FT RRLHRRQPKDLIIRSPLTRARR" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1790 BP; 483 A; 460 C; 435 G; 411 T; 1 other; tatcaaaagg cgatgtgcgc ggaaagcgcc attggccgat aacagtggkg catggtgccg 60 gacttttata gagcaagccg aagtgtttgc tgttcacctc gcggagcgat tccagccttt 120 gaacctcgcc agtccccagg atgttgacgc gactcacgac cagctgtccc aagcgctcca 180 aatggacttg ccgatgcaac cgatcactcc cagcgagatt gctgacgtca tcgccaaaca 240 aaaccccaaa aaagctccag ggcatgatac tacctgcaac tccaccctaa ggacactacc 300 gagatgtgcg atcctctaca ttacgttgtt attcaacgct atggtgaggc tgcaatactt 360 ccctccacag tggaagctcg gtattatctc catgattcac aaacctggaa agcctgaaaa 420 gaaccctggg tcctaccggc caatcagtct cctcccttcg atctcgaagg tgtttgagag 480 actgattgct gcccggatgg tcaggattat ggaagcgaag ggtatcctgc ccgagcatca 540 gtttggtttt cgtgctggac actgtacggt agaacaacta caccgagtgg tcgagcaaat 600 cctatcggct ttcgaaaaca aggagtactg caatgcactt ttcctggatg tacgtgaggc 660 gttcgatcgg gtgtggcact ccggtatcct gctcaaaatt aagaatacgc tgcctgcacc 720 atacttcggc ctcctgaggt cgtatctcga aaagagaaga tttgcggtac gattccactc 780 ggctctgtca aatgagcata atgtggcagc cggagtacca cagggaagtg tacttggtcc 840 gctgctttac tgcctgtaca gctacgacat gccacggcca gacgttagcc tacccgggac 900 gtcaatgttg gccacatttg ctgatgacgt gtgtgtcacc tacaggtcct gctgcgaaca 960 cgatgctgct gacgacatcc aggacttcgc atcaacattt gcggaatggg caaaacgttg 1020 gaacattggc atcaatggtg acaaatcagc gaatgtgtgc tacacgctga aaaggaaaac 1080 accaccggct gtgctcatcg atggcacccc tgtcccccag tccaattcag ccaaatatct 1140 tggtgtaatc ttggatcgga gactaaactt ctcgaagcaa gtgagtgcga tgagagtgcg 1200 tatacgtgca gcagcgtcaa agcacttctg gctaattaat tcgcgaagta aattgtcact 1260 ctccaacaag gtgacaattt acaagcaaat tatagcgcca atatggaggt atggttgtca 1320 gatttggggc ttggcttgcg acagccaaat tcgtcgcatt caggctgccc aaaacaaaat 1380 cgccaggatg attaccggct gcgaatggta cgtaaggaac acaaccctgc acaaagacct 1440 caagctagcc actgtctttg aggcaataaa catgcactcc agccggtacc acgacaggct 1500 agagcgccac agaaatcgcc tagccaaggc actgtccaga gctcgcccac caagaaggct 1560 ccacaggaga cagccgaagg acctcatcat acggtccccc ttgacgagag ccaggagatg 1620 atgatgatct cataattgtt atattgttaa ttgttatatt tgtattattg ttttgttaca 1680 gctaactttt gttagccggc gcgcctgaaa gggctgaatt aatagcgacc aaggtgacaa 1740 aggggttttc agctaccccg tatgccttaa taaagtataa aaaaaaaaaa 1790 // ID BS4 standard; DNA; INV; 754 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067624; BS4. XX FT source nnnnnnnn:1..754 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 754 BP; 240 A; 165 C; 154 G; 195 T; 0 other; tcatatctgg atggacggaa gcttatggtc aggtacacgg atacctactc cgctccctgc 60 caaatgttag ctggagttcc gcagggcagc gtccttggcc cgctgctcta ttctttatat 120 actgctgacc tacctagacc tacctacgag aatgcgcaat acccctctaa ggctatcatt 180 gcgacttacg ccgatgatat tgcggttctt tacagatcca aatgccgtat tgaagctgca 240 aatgggttgc aaggatacct tcaaacttta tcggcgtgga gtagaaggtg gaacatgaag 300 gtcaatcctt taaaaacatt caatccatgt ttcacactta aaaggcttgc tacaccagca 360 atacaatttg aaggtgtgac attggaacag ccatctcaag cgaaatacct cggcataacc 420 ttagataagc gccttacttt tgggccacac ataaaaacaa taaccaaacg atgtggccaa 480 agaatgcagc acctaagatg gctgataaat aaaaggagca caatgtcgct aagagccaaa 540 agagcagtat acgtggattg tatagcccca actgacttaa aagactggga ttcagcaaga 600 cgattcgtta atcgtccagc cctcccaaaa cctgctagta atcctcgtga gaggtttggt 660 tcattaagat ttgtttttat atgtgttgtt aagatgctaa tcagaatgca tggtttcaac 720 gcttaataaa ataatatatt aaaaaaaaaa aaaa 754 // ID DOC4 standard; DNA; INV; 2791 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0069587; Doc4-element. XX FT source nnnnnnnn:1..2791 XX FT SO_feature CDS ; SO:0000316:<1..1743 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; Doc4-element\ORF2" FT /protein_id="" FT /translation="IQNEIREQRQREQDERRLSSIKNNAYFSFVSTETLKPSQDQRAHS FT CSPSLPSTVPKSWSEECASPMPQPTTNYTQTPLTIVAKPPLLSLAAVTTSTVASLTTAI FT ASTTTLTATVSSTQTIQSTTSRAFALQAKQLTATITDLRESKSNPSGRPNSVMQTGMDR FT YVFVKRKRSPQRTTGNRAKINCGSETKTTTTNNKNMFALLAENASEEANKTTDSVSQKP FT KPPPIYIQEITTNALANKIVQLIANNNFHVIPLRRGKIQETKLQVKTEDQFHAVSKFLN FT ENGKKYYTYQLKSSKGLQVVLKGIEPDVTPNEVANALHEKGFNVKSVINILNKDKKPQP FT LFKVELDPTSQTLKRNEVHPIYNLQFLLHRKITVEEPHKRNGPVQCSNCQEYGHTKTYC FT TLPSVCVACGDLHESGTCPANKLDPNSKKCGNCGENHTANYRGCPVYKDLKSRMNKRIA FT SARSTNNTQKIWYPKNVWYPPNRFLTFSSPTPNCHPLPPVSTIPGVSFASALKSGTEGP FT ASTTEFSKAPQPRAVPDDLQQPTSAIEKIMLSFQATMTEFMSFMRTTMQDLTCALVQVL FT ASQHSK" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2791 BP; 909 A; 697 C; 517 G; 661 T; 7 other; caaaacgaaa ttcgtgagca acgtcaacgt gaacaagacg agcgtcggct ctcttcaatc 60 aaaaacaacg cgtatttctc gttcgtttct acggaaacgc tcaagccctc ccaagaccag 120 agagcgcact cttgctctcc gtcgttgcca agcacagttc caaaatcttg gagcgaagag 180 tgcgcttccc caatgccgca accaacaaca aattacacac aaactccgtt aaccattgta 240 gccaagcctc ctctcttaag cctagctgca gtaactacca gcacggttgc ctctctcaca 300 actgcaatcg catcgacaac aacattaacc gcaacagtct cttcaactca aacgattcag 360 tcaacgacct cacgggcttt tgcattgcaa gcaaaacaat taacagctac aattaccgat 420 ttacgtgagt caaagagcaa tccaagtggc cgcccgaatt cggtcatgca gactggtatg 480 gatcgctacg tttttgtaaa acgtaaacgg agtccgcaac gcacaacagg taatagagcc 540 aaaataaatt gtggtagcga aacaaaaact actactacca acaataaaaa catgtttgca 600 ctgctagcag aaaacgcaag cgaggaagct aataaaacga cagatagtgt atctcaaaaa 660 ccaaagcctc caccaatata catccaagag attactacga acgcactagc aaataaaata 720 gttcagctta tcgcaaacaa caattttcac gttatacctc taagaagagg aaaaattcaa 780 gaaactaagc ttcaggttaa aaccgaagac cagtttcatg cggtttcgaa gtttttaaac 840 gaaaacggca aaaaatatta cacgtaccaa ctcaaaagca gtaaagggct gcaagtggta 900 ctaaaaggca ttgagcctga tgtcacaccc aatgaagttg cgaatgcact tcacgaaaag 960 ggctttaacg ttaaatcagt tattaatatt cttaataaag ataagaagcc ccaaccgctc 1020 ttcaaggtcg agctagaccc aactagccaa acactaaaga gaaatgaagt gcacccaatt 1080 tacaatctcc aattcttatt acaccgtaaa attactgtag aagaaccaca taagcgtaat 1140 ggtccagtac aatgctccaa ctgccaagaa tatggccaca ccaagacata ttgtaccctt 1200 ccttccgttt gcgtcgcctg cggggatctg cacgaatctg gcacttgccc ggctaataaa 1260 cttgatccaa attcaaaaaa atgtggaaat tgtggtgaaa accacacagc taattaccgg 1320 ggatgccctg tctacaaaga tcttaaaagt cgcatgaaca aacgaattgc ttcggctcgt 1380 agtactaata atacccaaaa aatatggtac cccaaaaatg tatggtaccc tcccaatcgt 1440 ttcctgacat tttcttctcc aacgccaaat tgtcatcctc ttcctccagt aagcaccata 1500 ccgggtgttt catttgctag tgccctaaaa tcgggaacgg aaggtcctgc gtccacaaca 1560 gagttctcaa aagcgccgca acctcgagcc gtgccagacg acctgcaaca acccacaagt 1620 gccatcgaaa aaattatgtt atcattccaa gcaacgatga cggaatttat gtcgttcatg 1680 agaacaacaa tgcaggatct tacgtgtgcc ttagtacaag tacttgcatc tcaacattca 1740 aaataaaaaa atgtcctccc ttcgtatatg tttgtggaat gcgaatggcg tctcccgcca 1800 caaacttgat ctcgctaggt tcttgaaaga taaagaagtt gatattatgt tgctctcaga 1860 gacccatcta accagcagtc gcattaaaca taacttcctt gatcgttttg agaaaaacta 1920 cctacaagct acatctgtgg tggtcyattc tagtggcggt aatataactc tggccgccgt 1980 ctactgtccg ccccgtttct caattactga gaatcaattt atggaattct tttgctttcc 2040 tttagagctt cttaacgmtg tacaahdtga adacttctca ctgtaaggtg caatgcgaat 2100 acctccgagg acatgcgaag gggagtccct cagggtagcg ttyttggtcc gacactgtat 2160 ctcctataca cagctgacat ccctacaagt gatcgaacaa cactatcaac ttttgctgat 2220 gacaccgcaa ttctcagccg atcgaaatgc cctctgcaag catcagcgca cctggctggt 2280 cacctaactg ttgtggagaa ttggctggct aactggcgta ttgcaatcaa cgagcagaaa 2340 tgcaaacatg tcacatttac gcttaatagg cgtagctgtc ccccattaac tctaaataac 2400 gtccaaattc cacgaagtga ttcagccatt tacctcggag ttcatctaga cagaagatta 2460 acttggcgta agcacattga agccaagaaa acgcatttga agctgaaagc cagtaggttt 2520 cactggctta tcaactyacg ctctccacta agtttggact acaaggtgct gttatacaac 2580 acagttctaa aaccaatatg gacatatgga tgtcaactat ggggcaatgc atgcaacagc 2640 aacattgaaa taatacaacg agcacaatca aaaatcctgc gaacaataac tggtgcgcct 2700 tggtacatcc ggaacgcgaa catccaccag gacctacgga ttccagttgt aaaaactgaa 2760 attgtccagc agaaggctaa atacctaaag c 2791 // ID DOC5 standard; DNA; INV; 4682 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0015786; Porto1. XX SY synonym: Doc5-element XX FT source nnnnnnnn:1..4682 FT SO_feature CDS ; SO:0000316:211..1947 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; Porto1\ORF" FT /protein_id="" FT /translation="MSASDAQNQKRELQAKQNKNNSYLSFTHISNYTNNYQSLHPQSDG FT RARSCSPALERPNNYSQHRSAECAPNSSARTPTTTSAIPINTHSTLPLTAVTLSALPLW FT TLPLSATSETNTTTVATIATTTTVFYVSMPICSSSANIKTPSSIPASTKINKTQSSLPN FT QQNLKLYRDNQSPNSSKDLLNNNKKQRNKAQTATQKTLKKYWLAEPHSSNRFELLAQDD FT EDENQTDNGTNATINTQMQIEHKSAKPPPIYVQNVENIYALTTALNSLEETRYELKALS FT SNEIKIQPLETTHYHNILRLLKDKSTKYYTFRPKDQRGFKVILRNVHHATDKDDIINEL FT AQQGHEVINLHNIQRYDSKQPLPLFSIELKMKDNNKDIYKIEHLLHCKVIFEPPRPKRT FT LPQCTNCQKYGHTKNYCTKDPTCVKCAGKHSSSSCTINTRSDRTSIKCALCSENHTANY FT KGCMVYKALQIQKYPTLRKKEIPVTETTHQTTQHSTTNNVVSLKTPALSYAEVLNKNQV FT TSHQTNSTPQPTIVPDIQPASTQLSQTAQNDFDELKQMMKQLIAQMTNMMNIFTLLLFK FT LDK" FT SO_feature CDS ; SO:0000316:?..? FT SO_feature start_codon ; SO:0000318:?..? FT /db_xref="FLYBASE:; Porto1\ORF" FT /protein_id="" FT /translation="NRLQYYLSKLETTLSTNYSLWKATKKFTNSTIHKPPIKRNNNTWA FT RSNKEKADTIYNDNAELPRQITNLNCAIKTITKAEIKSHIKQIKNKKAPGYDLIGGKIL FT KELPDAIITYIRNLFNGILRLQYFPITWKVGQIKAIPKPNKNANTVSSYRPISLLPVLS FT KVFEKVLFEKLLFNRIEPILKQKNIIPPHQFGFRKQHSTVQQVHRVINKITDDFDKKKF FT CYAVYLDIAKAFDKVWHKGLLHKLRNILPYNLYTILQSYITDRYFYIKYDNECSDIMPV FT SAGVPQGSVLGPILYLIYTSDIPAPTTPGSMIATFADDTVLMSSSACEKTAASTLQQIM FT TQTVSWFERWGIEINKDKTQQVIYTKRQPKNFAIKINDNQILLHPYAKYLGLTIDSKLT FT WKQHILNKRNEIKNKFRQLNWLIGKHSKLELHNKIAIYKAIIKPIWTYGIELWGTASKS FT HLQLIQRTQSKMLRIITKAEWYIRNEDIHNDLNIETVNETIKKKQQKTHKSTT" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4682 BP; 1899 A; 1057 C; 676 G; 1050 T; 0 other; catctgaact tcaaccccga cggacgtgtt tttaacagag gtccagttaa tatcgaagtt 60 taagtgcaaa attcaagccc aagtgcaacc atcatataaa agcgaaaaga cgctaacaca 120 tatttatatg tgttaaaaac cttatgtctc tctgtgaaca agataaacaa aaaaaaaaaa 180 acaagtgaaa acaataaaca aaacaaaaat atgagcgcat ccgacgctca aaatcagaag 240 cgagagcttc aagctaaaca aaacaaaaac aactcctacc tctcattcac ccacatatca 300 aactacacaa acaattatca aagcttacac cctcaaagtg atggaagagc gcgatcttgc 360 tctccagcat tggagagacc caacaattac tctcagcaca ggagcgcaga atgcgcgcca 420 aattcatctg cacgtacacc aacaacaact tccgctatac caataaacac acactcaact 480 ctcccgctca cagcagttac gctctcagct ctcccactct ggactcttcc gctctcagca 540 acttcagaaa ccaacaccac aactgtagca actatagcaa caacaactac agttttctat 600 gtttcaatgc cgatttgctc ttcttcagcg aatattaaaa cgcccagctc aatacccgcc 660 tcaacaaaaa taaacaaaac tcaatcctct ctcccaaacc aacaaaactt aaaactttac 720 cgagacaatc agtctccaaa cagttccaaa gatctcttaa ataacaacaa gaaacagagg 780 aataaagctc aaaccgctac ccaaaaaaca ctcaaaaagt actggctagc cgagccacac 840 tcaagcaacc gttttgagct attagcacaa gatgatgaag acgaaaacca aactgacaat 900 gggacaaatg caactataaa tacgcaaatg caaattgaac acaaaagtgc aaaaccgccc 960 cctatatatg ttcaaaatgt tgaaaacata tacgcactaa ctactgcttt aaattcattg 1020 gaagaaacac gatatgaact aaaagcacta tcaagtaatg aaataaaaat acagccacta 1080 gaaactactc actatcacaa catattaaga ctattaaaag ataaatcgac caaatactac 1140 accttcagac caaaagacca gagagggttt aaggtaattc ttcgtaacgt ccatcatgct 1200 acagacaaag acgatattat caatgaacta gctcagcaag ggcatgaagt tataaattta 1260 cacaacatcc agcgctacga ctccaaacaa ccattgcctc tattctctat tgaactcaaa 1320 atgaaagaca ataataaaga catttataaa attgagcacc tacttcattg caaagtaatc 1380 tttgagccac cgcgaccaaa gcgaacactt cctcaatgta caaactgcca aaaatatgga 1440 catacaaaaa attactgcac caaagaccct acttgcgtaa agtgcgctgg aaaacacagc 1500 tcatcatcct gtactattaa cacaagatcg gacagaacaa gcattaaatg tgcactatgc 1560 agtgaaaacc acactgcaaa ctataaaggt tgtatggtgt acaaagcact gcaaattcaa 1620 aaatacccaa ctcttcggaa aaaagaaata cctgtaacag aaacaacaca tcaaacgaca 1680 caacacagta caacaaataa cgtagtaagc ttaaagacgc cagctttatc ctatgccgaa 1740 gtcttaaata aaaaccaagt aactagccat caaaccaact caacacctca acctactatt 1800 gtacctgata tacagcctgc ttcaactcaa ttaagccaaa ctgcacaaaa tgattttgac 1860 gaacttaaac aaatgatgaa acaattaata gcccaaatga caaatatgat gaacattttc 1920 acgcttttat tattcaaact tgacaaataa acacctaaca atcgccatct ggaatgctaa 1980 cggactttca cgccatttac atgaactaaa aacattctta aatgaaaagc aaattgaggt 2040 catgctcatt tctaaaacac acttaactga gaaaacctat tatataaaca tacctaacta 2100 taatatttct tctacttatc acccggacgg taaggcccat ggtggtactg cagtaataat 2160 taaaaaaagc atcaagtgca tagagctcga tggattcaaa aaggactaca tacaggctac 2220 tactaaatcg acctcagaca caacgggtcc aataaatata tctgcagttt attgcccacc 2280 taaatttaat aatacaaaag accaatacct cgatttccta aaatcactgg gaaatcggta 2340 cttggcagga ggtgactata atgccaaaca cacaacttgg ggggtccaga cttaccactg 2400 caaaggggac gtcaactgta tgaagcaata agaaacaata atagccgtgc tctaagcaca 2460 ggagagccga cttactggcc aactgacaca aataaactac tggatttaat tgacttctgc 2520 attacaagaa atataacatg cgagaaccta acaataaaat catgcctaga cctctttccc 2580 gaccattccc caatattact tacattacat ggtgggctag aagaaaacag caacgactct 2640 cctttattta aagacaaaac aaattggaac atgtttcgcg acattttgga acagaatagg 2700 aatataaata tctctctaaa gacaaacaac gcactagact ccggagtagc atatttaaat 2760 gagaacatca tagatgcagc aacgcaatcg acaccatcta taaaaatgaa atgagaaaaa 2820 tcaggcaaaa gcgtacactt aggaggatac ggcaaaggac taggcatcca gaagataaaa 2880 acaaactaaa tagagcaaca gacgagctca agagaactct cagggaagac aaagataacc 2940 gacttcaata ctaccttagc aaacttgaga ctaccttatc tacaaattat tccctgtgga 3000 aagcgaccaa aaaatttacg aactcaacta tacacaagcc acctataaaa agaaacaaca 3060 acacatgggc aagatcaaac aaagaaaaag cagacaccat ttacaacgac aatgctgagc 3120 ttccaagaca aatcaccaac ttaaattgtg caattaaaac tataacaaag gcagaaatta 3180 aatctcacat aaaacaaatt aaaaacaaga aagcccccgg atatgattta atcggaggaa 3240 aaattcttaa agaactaccg gacgctataa tcacctatat tagaaacttg tttaacggaa 3300 tccttcgatt gcagtacttt cccataactt ggaaggtagg ccaaattaaa gcaattccaa 3360 aaccaaacaa aaacgccaat actgtaagct cgtacagacc aatcagctta ttacctgtcc 3420 tctcaaaagt atttgaaaaa gtcctatttg aaaaactcct atttaataga atcgagccaa 3480 tcctaaaaca aaaaaatata atacctcctc accaattcgg ctttcggaag cagcactcga 3540 ctgtccaaca ggtccatcga gtaattaaca aaataacaga cgattttgac aaaaagaaat 3600 tttgctatgc tgtgtacttg gacattgcta aagcatttga caaagtatgg cacaaaggac 3660 tgctgcataa gctaagaaat atactaccat ataacctata taccattctt caaagctata 3720 taacggacag atatttctat attaaatacg acaatgaatg ctcagatatt atgccagtgt 3780 cagcgggtgt tccccaaggt agtgtgcttg ggcccatatt atacctgatt tatacctccg 3840 atattccagc gccaactacc cctggatcaa tgattgcgac gttcgcagac gatactgtgt 3900 taatgtcgtc aagcgcgtgc gaaaaaacag ctgcaagtac actacaacaa atcatgacac 3960 aaacagtatc ctggtttgag agatggggta tcgagataaa taaagacaaa actcaacaag 4020 taatatacac aaaacgacaa cccaaaaact tcgcaattaa aatcaatgac aatcaaatat 4080 tattacaccc atatgccaag taccttggac taactattga ctcaaaacta acatggaaac 4140 agcacattct aaataaacga aatgaaataa aaaacaaatt tcgacaactt aactggctaa 4200 taggaaaaca ctcaaaacta gaattacata ataaaatagc aatatacaag gcgataataa 4260 aaccaatatg gacgtatgga atagaactct ggggaactgc cagtaaatcc cacttacagc 4320 taattcagcg aacgcaatca aaaatgctca gaataatcac aaaagccgaa tggtacatta 4380 gaaacgaaga tatacataat gatttaaaca tagaaacagt caacgaaact attaaaaaaa 4440 agcagcaaaa aacacacaaa tcaactactt agccaccaaa actcaaactt gagacaaata 4500 ccgatgacag agttggaacc aagaagatta aaacgcagaa ctccaacaga attaaactat 4560 taaattatta atgtattagc gtatccttgc actgggaagg gccgcttctt ttctttttca 4620 catctagtca accaattact tattgttatt ttttaaaaaa cagattgtaa tttaaagaaa 4680 aa 4682 // ID FW2 standard; DNA; INV; 3961 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067421; Fw2. XX FT source nnnnnnnn:1..3961 FT SO_feature CDS ; SO:0000316:2..1249 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; Fw2\ORF1" FT /protein_id="" FT /translation="XSTTQTAIDRYIQIKRKLSPQNYSMGNKSKINCDITRQKTNMPNN FT DNRFAILADLDDLPNSDKDIKKTPKPPPIYIREKSSSALVNKKMDLIEKDSFHIIPLTK FT GNIHETKVQVKTEVNFRALTKYLNDAKKNFYTYQPKSSKGLQSSIKGLRPEITPEEISG FT AQIEQGFKPKSVINIFNKDKKPQPLFKVELEPDSWALKKNESHPIYKLQYLLHRRISVE FT EPHKRKGPVQCANCQEYGHTKTYCTLRTVCVACGELHSSVNCPSNKADPGMKKCSNCGG FT NHTANYRGCPVYKDLKNRLHQKVTMMRGQSTPSTIIPSKHTPDVHLNSLTNRNVTFASA FT LKSGLASTNPTTPFPLAEQNKVNADQPTGQPQGNIETMIFNLQQSMTEFMTFMRTTMEN FT LMRNQDLLIQMLVSQQSK" FT SO_feature CDS ; SO:0000316:1256..1249 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; Fw2\ORF2" FT /protein_id="" FT /translation="MAPLRITLWNANGVSRHKLEMAQFLQDKHIDVMLLSETHLTNKYK FT FQINGYIFYGTNHPDGKAHGGTGVLIRNRIKHTFRNDFGTNYLQATSLNIQLGNGQITL FT AAVYWPPRFTISENQFLDFFNTLGDRFIAAGDYNAKHTHWGSRLVTAKGKLLYKTIIKV FT SNKLSCASPGTPTYWPTDPRKIPDLIDFAVTRNISHNHINAESLPDLSSLAGLINYLQY FT ADLKNSPCRVTSNRTNWLKYKKFVSSHIQCLNPRKKQTVQYMQRKPNVQIEQLVLKKRR FT LRREWQSYRSPFAKQGLNNATHTIRKEVINEQECAQRPYIAQLSPFSTKYPLWGLHPSV FT SPPVESVVPIRNSAGEWVRSDKHRASTFAEHLQNVFQPKPATNNFVLQEQTVVSQTTQN FT TIEFRPTEIAKIIKELKPNKSPGSDLINPKMIIELPFCAVQTICQLFNAINRIGHFPSL FT WKKSIIIMIPKPGKDHTVPSSYRPVSLLPCMSKLFEKCLLTFIIPYLRTFNKIPEHQFG FT FQEKHGTIExVNRITTEIRTAFEKREYCTAIFLDVAQAFDRVWLDGLMHKIMSTLPECT FT HKLLKSYLHNRVFSVRCGTVTSEDHIIEAGVPQGSVLGPTLYLLYTADIPTSRQLTIST FT FADDTAILSRSKCPRQATAQLALYLAHIEKWLSDWRIKVNEQKCKHVTFTLNRQDCPSL FT ILNNTVIPKSNEVTYLGIHLDRRLTWRRHIEAKRTHLKLKANSLHWLLSIRSPLRLEYK FT VLLYNSVLKPTWTYGSQLWGNASNSSVEIIQRAQSKILRTITGAPWYVRSDNLHRDLFI FT LPVRDEIAKQMEKYRNKLRAHPNRLARDLTRLSSRTRLRRNDMPTQR" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3961 BP; 1386 A; 941 C; 701 G; 932 T; 1 other; atcgaccaca cagactgcta tagatcgata catacaaata aagcgaaaac taagcccgca 60 aaattattca atgggtaata aatcaaaaat aaattgtgac ataacaagac agaaaactaa 120 catgcccaac aatgataacc gatttgccat tttggccgat ttagacgatc tgccaaattc 180 tgacaaagac attaagaaga cccctaagcc gcctccaatc tacatcagag aaaaaagttc 240 cagcgccctc gtaaacaaaa aaatggacct tattgagaaa gacagtttcc acataattcc 300 cctgaccaag ggcaacattc atgaaacaaa agtgcaagtc aaaacggaag taaacttcag 360 agcgcttacc aaatacctca acgatgccaa gaaaaacttt tatacgtacc agccaaaaag 420 cagcaaggga ctacaaagta gtattaaagg ccttaggcct gaaataaccc ctgaggaaat 480 atcaggagcc caaatagaac aaggttttaa gccgaaatca gttatcaaca tatttaacaa 540 agataaaaag ccacaacctc ttttcaaggt cgaacttgaa ccagactctt gggcattgaa 600 gaaaaacgag agtcacccca tttacaagct gcaatatctc ttacaccgta gaatctctgt 660 tgaagagccg cataagcgca aaggacctgt acagtgtgca aattgccaag aatatggtca 720 tacaaaaact tattgtaccc ttcgcactgt atgtgtagct tgtggtgagc ttcacagctc 780 tgttaactgc ccatcaaaca aagctgaccc cggtatgaaa aagtgtagta actgcggagg 840 taaccacaca gcaaactaca gaggttgccc tgtatataag gatctaaaaa atcggctaca 900 ccaaaaagtg actatgatgc gtgggcaaag cacgccgagc acaattatac cctcaaaaca 960 tacacctgat gtccacttaa acagtctgac caacaggaac gtaacatttg caagtgcgct 1020 taaatcaggt cttgcatcta caaatcctac aactccattc ccactcgctg aacaaaacaa 1080 ggtaaatgct gatcagccaa caggacagcc acaaggcaat attgaaacaa tgatctttaa 1140 cttgcaacaa agcatgaccg aattcatgac atttatgaga acaaccatgg aaaatcttat 1200 gcgtaatcag gatctattga tacagatgct ggtatcgcag cagtcaaaat aatcaatggc 1260 tcccttacgt ataactctat ggaatgccaa cggcgtttcg cgacacaagc ttgaaatggc 1320 tcaatttctc caagacaaac atatcgatgt aatgctcctt tctgaaacac atcttacaaa 1380 caagtacaag tttcaaataa atggttatat attctacggc accaaccacc ctgatggtaa 1440 agcacatgga ggcactggag tcctgattag aaaccgcata aaacatactt tccgcaacga 1500 ctttgggaca aactacttac aggcaacatc tttaaatata cagctgggta acggacaaat 1560 aacacttgct gctgtatact ggccacctcg ctttacgata tccgaaaacc aatttttgga 1620 ttttttcaac acactagggg atcgcttcat agctgctggt gactacaatg ctaagcatac 1680 gcactgggga tctcgtcttg taaccgctaa gggaaaacta ttgtacaaaa caattattaa 1740 agtaagcaac aagcttagtt gcgcctcccc gggcacacct acatattggc caacggaccc 1800 cagaaagata ccagacttga ttgactttgc tgttacaaga aacatctctc ataatcatat 1860 aaacgccgaa tctctcccag acctatcttc tctggccggg ctcattaatt atcttcaata 1920 tgcggatctg aaaaattcac cctgtcgagt gacatccaat agaactaact ggctaaaata 1980 caagaaattc gtaagttcgc acattcaatg cctgaaccca agaaaaaaac aaactgtaca 2040 gtatatgcaa cgtaagccaa atgttcaaat tgagcagctt gtcttgaaaa agcgccgcct 2100 acgacgagaa tggcaatctt atagatcgcc atttgcaaaa caagggctca ataacgcaac 2160 tcatacaata agaaaggagg taatcaacga gcaggaatgc gcacaacgcc cctacatagc 2220 gcaattatca ccctttagca caaaataccc attatgggga ctacacccct ctgtaagccc 2280 accggtagaa tctgttgtgc caataagaaa ttctgcaggt gaatgggtcc gcagcgataa 2340 acatagagct tctactttcg ctgagcacct tcaaaacgta tttcaaccaa agccggcaac 2400 gaataatttc gtattacaag agcaaacagt tgtaagccaa acgacacaga atacaataga 2460 atttcgacca actgaaattg ctaaaatcat taaggaatta aagcctaata agagccccgg 2520 cagcgatcta ataaacccta agatgatcat cgaacttcca ttttgtgccg tccaaactat 2580 ttgccagctc ttcaatgcaa taaatagaat tggccacttc ccttcgttat ggaaaaagtc 2640 gatcatcatt atgataccga aaccgggaaa ggatcataca gttccatcat catacagacc 2700 tgtaagttta ctaccatgta tgtcaaaact cttcgaaaaa tgtctgctga cctttattat 2760 accctatctg cgaactttta acaaaatacc ggagcatcaa ttcggttttc aggagaaaca 2820 tggaacaatc gaasaagtca atcgaatcac aactgaaatt cgaactgcat tcgaaaagag 2880 agagtactgc actgccattt ttctcgacgt ggcacaagca tttgacagag tctggttaga 2940 tggcctaatg cataaaatta tgtcaacact ccctgaatgc acccataagc ttttaaagtc 3000 gtacctgcac aatagagtgt tctcagttag atgcggcact gttacgtctg aggaccacat 3060 catagaagct ggggttcctc aaggcagcgt acttggccca acactatatc tactatatac 3120 agccgatata ccaacatcga ggcagctaac aatatctaca tttgccgatg acactgcaat 3180 cctgagccgc tctaaatgtc cacggcaagc tactgcacaa cttgctctct acctggcaca 3240 tattgagaaa tggctttctg actggcgaat aaaagtcaat gaacagaaat gtaaacacgt 3300 aacttttacc ctcaacagac aagattgccc ttcgctcata ctaaataaca ctgttattcc 3360 aaaatcaaat gaagttacgt atctaggtat tcatcttgac agacgactca cttggcgtag 3420 acacatcgag gcaaaaagaa cacacctcaa actaaaagcc aatagtctac attggcttct 3480 tagtatacgc tctcccctaa gactagaata taaggtctta ctctacaact ccgtcctaaa 3540 accgacctgg acctatggct cccagctatg ggggaacgcc agtaacagca gcgtagaaat 3600 catccaacga gcccaatcga aaatcctgag aaccatcacc ggcgcaccgt ggtacgttcg 3660 aagtgataac ttacataggg atttattcat acttccggtc agagatgaaa tagccaaaca 3720 aatggaaaaa tatcgcaaca aactacgtgc ccatccgaac aggcttgcga gagacttaac 3780 acggctatct agccgaaccc gacttcgtcg taatgatatg ccaacccaac gataattatt 3840 agggccacac aaactcatat cagttgcaat agtaactgtt agttaagtac tttttaagat 3900 ttgtaaactt attgttagtc tcataatgag aagcttcaat aaacaaacca acgcataaaa 3960 a 3961 // ID FW3 standard; DNA; INV; 3132 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067420; Fw3. XX FT source nnnnnnnn:1..3132 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3132 BP; 1051 A; 803 C; 557 G; 720 T; 1 other; agagaaaatc tcgaatactt ttgtaaacga aataattgaa cttgttcgaa aatacaactt 60 ccgcataata ccccaaaaaa gggacagcat ccacgaaact aaggtccaag taaagaccga 120 agagagcttt aggacgctta caaaatacac tacaaaaatc tactacacct atcaactgaa 180 aagtagcaat ggccttcaag tcgttctaaa gaatatagac ctgaagtaac accttctgag 240 ataacgcagg catttaatga taaggatttt aaagtaaaaa ccgtcttcaa cattctgaac 300 aaagatagaa aaccacaacc actcttcaaa gttgagctgg aacctgacac taaagccatc 360 cggaaaaaag atgtgcaccc aatctacaag ctaaagtttc ttttgcatcg tagaattacc 420 gtagaagaac acgtagaaca cactagcgca aagggtcagt ccaatgcagg aactgtcaag 480 aatatgtcca aacaagagga tactgcacat tgcgtccagt ttgtgtcatc tgtggagagc 540 ttcatgactc tgctcattgc acgacaaata aaaagtgcgg aagctgtggc ggtaaccacr 600 cagcaaacta tagaggctgt ccaatataca aagacctcaa aagccggctt cacccccgag 660 taacgcctgt acgcctccat cacgcacaca atgcacttaa accaaagaac tctcagaaat 720 aaacccagac gtatttttct caaccgcaaa gagatcctca tttggcccgg ggatcacctc 780 acaaaatgtg actttcgcaa acgtcctgaa atggggactt actaaaccta gcactacgac 840 aaccactcta catactgccc aaattgatcc gacaccgaat cctcgattac cgacgcaaca 900 acaaagcaat atcgaaacca tgatgcatac tttgcagcag catatgattg aattcatgtc 960 cttcatgcgg actaccatgc aagacctcat gcgtaaccaa aatctcctca tggagatgct 1020 aatatcacaa cgttccaaat aatcaatggt tgccctacga atatcattgt agaacgccaa 1080 cggcgtttca cggctttaac tttaaatagc tcaatttcat catgacaatc aaattggcgt 1140 catgctactg tcggaaacac atctcactgc taaatataac ttccaaataa gcggcttttt 1200 gttttacgga acaaatcatc cagacgaaaa agcacatggc ggaaccggta tcctgattag 1260 aaatcgcatc aaacaccacc attatagttt gcagctattt acatgcaggc cacatctata 1320 aacgtactac aaggaagcgg caatattacc ctcgcagcgg tatactttcc cccacgttat 1380 attgtatccg caacaatttc tggacttctt caactcgcta ggggatcatg tcattgctgc 1440 aggtgactac aacgccaaac acaccaactg gggatcttgc tcgtcacccc taaaggaaac 1500 agttgtataa cgccatcata aatgtacata ctggcctacc gacccgaaca aaattcctga 1560 tttgatcgat tttgcaatca ccaaaaacat ccctcgaaac ctgaccactc gccagtcctc 1620 ttaatactga acgccctcca atgtcctgag actacaaagc agactagcag attaacatca 1680 aaaagaacaa actggatcaa gtacagaaac tatataagct cacatatcgc gctgaatcca 1740 aggcttaact ctgacgccga tattgaatcc gcaacaaata ctctcgagag tgtcttagct 1800 gcagctgccc gcatctctac accaaaaatt gggagtgcac ctcgtagccg ctcgatggca 1860 aggcttgtca tcgagcggct cgttcttcaa aaacgacgga tgcgacggga gtggcaagtc 1920 catagatcgc aattcgcaaa gcaaagacta ataaatgctt cacgtaaact ctccaatgca 1980 ctctggcaag aagagaaaaa cgcccaacgt cgctatatag aacaactatc aacatccagc 2040 agtaaaatct cactatggaa agcccatccc agtttaacta ccccggcaga aactatttca 2100 cctataagaa ccatcgcagg tggatgggcc cgaagcgata aagagagagc taccacattc 2160 gcaacgcacc ttcaacatgt gttccagccg aaccctgcta caagttcgtt tgtattaccg 2220 acgctaacag acgacaacca aacaccacat gagccaatcg aatttcgagc caacgaaata 2280 tcaaaaatta taaaggatca actatatccg aaaaagtccc caggatgtga tctaacaact 2340 ccaaagatta ttattgagct tccatactgt gctatatgca ctatctccca gctctttaac 2400 gctatcacaa acctcgggta ctttccagag agatggaaaa agtcgatcat aggaatgata 2460 ccgagggcag gaaaggacct ctcagtttcc tcatcataca ggcctataag cctattgtct 2520 cgtctttcga aactcttcga aaaatgtctg atgacccgga tcacccccta tctgatgaca 2580 cgtaacctta ttccagcgca tcagttcggc tttcgaaaga aacatggaac cattgaacaa 2640 gtcaatcgaa taacatcaga aatacgaact gctttcgaaa aacgcgaatg cacaggaata 2700 tttctggatg tctcccaggc atttgacaag gtctggctag acggtttaat gtttaacatc 2760 aaagcaatgc ttccccaaaa cacccacaaa ctcttgaagt cgtatctcta caacagagcc 2820 tttgctgtga ggtgtaactc tgatatgtcc aatgaccacg ctataaacgc tggagtactt 2880 ggacctacgc tatacgtcct ttatagatca gatataccca caagcagact gttaacgact 2940 tctacatttg ccgacgacac agcgatttta agtcgatcca aatgtcctct acaagcaaca 3000 actcaactct ctcgtcactt attggctgtg gagaggtggc tatctgattg gcgaatcaaa 3060 ataaacgaac aaaaatgtaa gcaagtaaca ttcacactca accggcaaga ctgcccttcg 3120 atcacatctg aa 3132 // ID HELITRON1_DM repbase; DNA; DRO; 564 BP. XX AC AE002840; XX DR FLYBASE; FBgn0067418; Helitron. XX FT source AE002840:9827..9264 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 564 BP; 177 A; 115 C; 107 G; 165 T; 0 other; ctgcgcgcga aagaatacat taatttgcga gacgctatca acaacaacgc cgacgtcgcc 60 gaaatcggta accatatcat tttaccgtaa tcgtaaatag gcagtccacg tcatatgcaa 120 gaatatttac aggatgctct gactttcgtg cgcgaatatg gacgaccatg tttatttatc 180 acgttcacat gtaatccaaa atggccagag attacatctt tactactgcc tggccaaatg 240 caatgcatcg ccatgacatt acagcacgtg tgttcagaca aaagttgaag tctttaataa 300 gtttcattac taaatcacat gtattttgtc ctacactttg ctggatgtgt tcggttgagt 360 ggcaaaagcg gtttggtttg gttcaacgac agaatctgtc ctgaagtaat cgatagtatt 420 atttctgcgg aaatattgta gatccatcca cttcattgct ttgcatggct aatagaaatg 480 tactaaaaat gtcccaaaaa attttaccaa taatacggtc agaaatgtcg acgaatacct 540 aatatatcgt cgaagaaatc ctga 564 // ID R1-2 standard; DNA; INV; 3216 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067405; R1-2. XX FT source nnnnnnnn:1..3216 XX FT SO_feature CDS ; SO:0000316:<1..716 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; R1-2\ORF1" FT /protein_id="" FT /translation="XXCSAVVMCDDPSIPGKQSAERVRKEVAPALGVRVHEVRELKCGG FT AVIRTPSVSEMKKVVANKKLAEVGLKVQPKKSQRPKVQVFDVDSGIQPDGGIVRQQFKE FT EFSPAAFMKEMHLNTKPWSVTDGERANLTLEVDEKALNVLEQTGRVYIKWFSYRCRSLV FT HTYACHRCLGFDHKVSQCRVKETICPQCGQAGHTAPRCTNPVDCRNCRFKGHPSRHSML FT SFTCPIYGAVLARVNARH" FT SO_feature CDS ; SO:0000316:719..3216 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; R1-2\ORF2" FT /protein_id="" FT /translation="MFSFLQANCGRGRAPTIELxVRLRDSGHLFALVQEPYVDLAGRIT FT VFSDRRGKAAVYVDSTDSICMPIEPLVTEFGVCVSVTGSFGSIFLCSVYCQFNTGLEQ FT GVPSGMRYLGYLDAVLLLASRTPVILGLDANAVSPMWFSKLPERAQGSANYLRGELLS FT DWIQGCRAGVLNMPCDAFTFETPYARSDIDVTLTNDAASTCATYDWRVDEWDLSDHNI FT INVVVTRDPPNTVESFAPVPSWNFSAARWRPFEDEVTRLASELPDEFADTPLDDQVSA FT VRSLVHSVCDQVLGRRRPKTARKIVWWTAELSSKRQEVRRLRRRLQTARARASHDAEQ FT LVSQLREISDQYKELILKYKEDNWRRLVGENKDDPWGQVFRICRGRKRTTELGCLRXG FT GRQYVTWHDCAGVLLRTFFPFSSFRHPLSFLKRFHHRLSSEVDACVARLKSRRSPGMD FT GITAVIFKAVWRAIPEHITAFYSRCIRSGYLPSKWKRTIAVALLKGSDKDRSDPASYR FT GICPLLFVFGKVLEGIMVNRLKDVLPDGSRWQFGFREGRCVEDAWRHVVSTVAANQAQ FT YMLGIFVDFKGAFDHVEWDVVMRRLIDSGCREASLWRSFFSGRSASLVSRYGEVTVPV FT TRGCPQGSISGPFIWNLMMDSLLQRLEPLRGFSALADDLLLFVEGTRIVLESKGEQLM FT SVVGAWEVGVAVLTSKTALLLGHFAQSRHTTVRFAGASLPYVDKYRYLGVTVVERLNV FT LPHIKSLRDRLTGVVQALARVLCVDWGLSPRARRTIYAGLMVPCALFGASVWYDGPSS FT AMRHLVSCQRRILLGCLPVCRTVSTVAMQVLXG" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 3216 BP; 692 A; 827 C; 943 G; 751 T; 3 other; cttgctcggc ggtagtgatg tgcgacgacc cgagcatacc tgggaagcag agtgctgagc 60 gggtccgcaa ggaggtcgcc cctgccctcg gtgtccgtgt acacgaggtg cgtgagttga 120 agtgcggtgg agccgtcatc cgcacaccat cggtgagcga gatgaagaag gtcgtcgcga 180 acaagaagct tgccgaggtt ggtctcaaag ttcagccgaa gaagagccaa cgacccaaag 240 tccaggtgtt cgacgtcgac agtggaatac agcctgatgg aggaattgta cgtcaacaat 300 tcaaggaaga gttctcccca gccgctttca tgaaagaaat gcacctcaac acgaaaccgt 360 ggtcagtcac cgatggcgaa agagctaatc tgacgcttga ggttgacgaa aaggcgctga 420 acgtccttga gcagaccgga cgggtctata tcaagtggtt cagttaccgt tgccggtcgc 480 tcgtccacac ctatgcttgc cataggtgcc ttggatttga ccacaaagtt tcccagtgca 540 gggtcaagga aaccatttgt ccacaatgtg gacaagctgg ccacacagca cccagatgca 600 cgaatccggt ggactgccgg aattgtcgtt ttaagggcca cccttcgagg cattcaatgc 660 tgtccttcac gtgccccatc tacggtgcgg tgcttgcgag ggtgaacgct agacattaat 720 gtttagcttc cttcaggcaa attgtggccg tggccgagct cctaccatcg agctcgsagt 780 ccgcttgcgc gactctggcc acttgttcgc attggtgcag gagccttatg tggacctcgc 840 tggacgaatc acgggagtcc catccgggat gcgtgtgttt tcggacaggc gtggaaaggc 900 tgccgtctac gtggacagca cggattccat ctgcatgccg atagagccgc ttgtcaccga 960 gtttggagta tgcgtgagcg ttactggaag ttttggctca atcttcctat gctccgtgta 1020 ctgccaattc aacaccggac tcgaacagta cctcgggtac ttggatgcgg tgctgctgct 1080 agccagccgc acgcctgtca tccttggtct cgacgcgaac gcagtatccc ccatgtggtt 1140 tagtaaactc cctgagcgtg ctcagggatc agctaactac ttacggggtg agctgctgtc 1200 tgactggatt caaggatgtc gagccggagt gctgaatatg ccgtgtgacg cgttcacatt 1260 cgagactccc tacgcacgta gtgatatcga tgtgacactc accaacgatg cagcgtctac 1320 gtgcgctacg tacgactgga gagtggatga atgggatctt agtgatcaca acattatcaa 1380 cgttgtggtt acgcgagacc caccaaacac agttgagagc tttgctcctg tgccatcctg 1440 gaacttctcc gctgcacgct ggcgcccatt tgaggatgag gtgacaagac tggcttcgga 1500 actcccggac gaattcgccg acacgccgtt ggatgaccag gtctctgcag tgcgctcgct 1560 cgtgcactct gtgtgcgatc aagtgctggg acgcagacga cccaagaccg caaggaagat 1620 agtttggtgg actgccgaac tttcttctaa acgccaagag gtcaggagac tgaggcggcg 1680 gcttcagacc gctcgtgcgc gcgcaagcca cgatgccgag caacttgtct ctcagttgag 1740 ggaaatctca gatcagtaca aggagctcat tctgaagtac aaagaggata actggaggcg 1800 cttagtggga gagaacaagg acgatccatg ggggcaagtc tttaggattt gccgtggccg 1860 caaaaggaca accgaactcg gttgtcttcg ctnaggtggc aggcagtacg taacctggca 1920 cgactgcgcg ggtgttcttc tccggacctt ttttccattc tccagttttc ggcacccact 1980 gtcattcctg aagagattcc accaccgctt gtcctctgag gtagacgctt gtgtcgcaag 2040 gttgaagagc agacgctctc ccggcatgga cggcatcacc gcggtgatat tcaaggcggt 2100 gtggcgtgcc attcccgagc acatcacagc gttttactcc cgctgtatca gaagtggata 2160 ccttccctct aagtggaagc gcacaattgc agtggcacta ctcaaagggt cggataagga 2220 caggagtgat cctgcttctt accgcggtat ctgtccgttg ttgtttgttt tcggcaaagt 2280 gctagagggg atcatggtga accgattgaa ggacgttcta cccgatggaa gcagatggca 2340 atttggattc cgagaaggac gctgcgttga ggatgcttgg agacacgtcg tgagcactgt 2400 tgctgccaac caggcacaat atatgctcgg aatcttcgtg gacttcaaag gagccttcga 2460 ccacgtggag tgggatgtcg tgatgagacg actcatcgac tccggctgcc gagaagccag 2520 cttgtggaga agcttcttct ctggcaggag tgcaagttta gtcagcaggt atggtgaagt 2580 gactgttccg gtaacacgag gttgccctca gggatccatc agcggtccat ttatttggaa 2640 ccttatgatg gactcacttc tccagcgtct ggagccactg cgtggtttca gcgcgcttgc 2700 agacgacttg ttgcttttcg ttgagggtac ccgaatcgta ctggagtcga aaggcgaaca 2760 gctcatgtcc gttgttggag catgggaagt cggcgttgcc gtcctcacca gcaagacggc 2820 gctgctgctg gggcattttg cccagagtag gcacactaca gtacggtttg caggagcaag 2880 cctgccgtat gttgataaat accggtacct tggcgttaca gttgtcgagc ggttaaatgt 2940 tcttccgcat atcaagtcgc tacgtgatcg gctgactgga gttgtgcagg cattggcacg 3000 cgttctttgt gtggactggg gcctcagtcc acgcgccagg cggacaatat atgccggact 3060 catggtgcca tgtgccttgt ttggtgcatc ggtctggtat gatggaccca gctctgccat 3120 gaggcatctg gtctcctgcc agaggcgaat cctgcttgga tgcctaccgg tatgccgtac 3180 agtgtccact gtggcaatgc aggtgctcgs tggcgc 3216 // ID TC1-2 standard; DNA; INV; 1644 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0069340; Tc1-2. XX FT source nnnnnnnn:1..1644 XX FT SO_feature terminal_inverted_repeat ; SO:0000481:1..26 FT SO_feature terminal_inverted_repeat ; SO:0000481:1619..1644 FT SO_feature CDS ; SO:0000316:356..1375 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; Tc1-2\ORF" FT /protein_id="" FT /translation="MGKTKELSEFIKNEIIIKYNSGISVQNIIDLYKIPRATVYYQINK FT YKKTHTTKNVARSGRPRKTTQKDDGYILRKFKQNVLQTPRSVAKELKEGAEIDISERTV FT RRRLKEADFGTYVSRVIPLITPRNKLKRLDFAKKYVGQPASFWNNVLWSDESSFEFHCS FT KKIFFVRLPKQYRKKVAPVCQRINHSGGSVMFWGCVAFTGLGDLVPVDGTMNQRKYLDV FT LNNHAFPSGDKLIGESFILQQDNAPCHKAKLITQFLKDVCVNTLDWPPQSPDLNIIENL FT WSYLKRKRSANLSRSREETILEIQTLWKDISIDYIHSLVQSVPKRLQKVIDAKGGYIFY FT " XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1644 BP; 570 A; 270 C; 291 G; 513 T; 0 other; cactagtggg cataagtatt tagacaccat tagcttgcaa ctgccgggct atgcttttca 60 tagtacagtc aaataaataa gtaaactatt gatgataaaa caaagagtgc aattacagtt 120 acacgcgaga ggtatgtttt gccgtttacc ttttcaatta tccttctgct gataagaaag 180 ctagacagac aacgatttgg gcggtggcaa aagtatttag acatcacatt taattagttc 240 gcattgatct gttgttcgaa gaatatcaaa tttttgactt ttttagatca tttgaacatt 300 tgttcttgat aacgtgttac tacctacttt tatttcattc ataaatttgt taaacatggg 360 gaaaacaaag gaactgagcg aatttataaa aaatgaaata ataattaagt ataattctgg 420 tatttcggta caaaatatca ttgatttata caaaattcca cgggcaacgg tgtattacca 480 aataaataaa tataaaaaaa cacacacaac caaaaacgtt gcgcgtagtg gccgaccacg 540 taaaacaact caaaaggacg atggttatat tttacggaag tttaagcaga atgttctaca 600 aactccccga tcggttgcca aagaactaaa agaaggagca gaaatcgata tcagtgaaag 660 aacggttcgc agacgtctta aggaagccga ttttggaaca tatgttagca gagttatacc 720 actaataact ccacgaaata agttaaagcg cctcgacttt gccaaaaaat atgttggtca 780 gcctgcatca ttctggaaca atgttttgtg gagcgatgaa agctcttttg agtttcactg 840 ctcaaaaaaa attttttttg ttagattacc gaaacaatat cggaaaaaag tggcaccagt 900 atgtcagaga ataaatcatt caggagggtc tgttatgttt tggggatgcg tagccttcac 960 tggcttggga gatttggttc ctgttgatgg aaccatgaat caaagaaaat atttagatgt 1020 ccttaataat catgcattcc cctctggtga taaattgatt ggagagtcct tcatactcca 1080 gcaggataat gctccctgtc ataaggccaa actgattacg cagtttctga aagatgtttg 1140 cgtaaacaca ttggattggc cacctcaaag tccagacctc aacataatag aaaacctctg 1200 gtcatattta aaaagaaaaa ggagtgcaaa cttatctaga agtcgcgaag aaacgatttt 1260 ggaaattcaa accttatgga aggatatttc aatagattat atccacagct tggtacagtc 1320 tgtaccaaaa cgtctgcaaa aggtgataga cgctaaggga gggtatattt tttattaatt 1380 tgtcataatt ttttagttga cttttttttt tcaatttata acatttaaat aatttgtcca 1440 aatacttatg ccactgctaa tttgtaaaaa agtaattttt gttaaatcaa caatgaagtt 1500 atccatatgt ttatgttacc taatttaact acgatagtct acctatttac taaaattatt 1560 aaacaatttt gctttaagta aacaatgagt tacaaacaaa tatattgaat atatttcttg 1620 tccaaatact tatgcccact agtg 1644 // ID G5A standard; DNA; INV; 2841 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0069433; G5A. XX FT source nnnnnnnn:1..2841 FT SO_feature CDS ; SO:0000316:16..2727 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:; G5A\ORF" FT /protein_id="" FT /translation="MGPLRIPTWNSIGASLKTNEFLALIELHDIDILLLSEVHFSSRSN FT FRVNGFTLHTANHPDDSQHGGAAILVGSNISHQPFATLSQNHIQAVVIQLTASRGNFNV FT ASVYCPPTLRWTDAIVGQFITQFGTKFLAAGDWNAKHRWWGNYRMRTRGRVLYSALAEE FT GIDIVATGEATCYPYRATASPSAIDFGISKGFRQQQIKVQLLSSDHLPLLFELDEDAQS FT FRSVTKMLSPQANIRIFKEHIEATVELNIPIDTCSRLEAYVDYFTPAIIEAARQATPTP FT HQARLTAIRRPPILSIEARDLLSHKRRLRRRYIATGDPSIYQQYSSTTNRLRRLLANNR FT KANLDTLLEGAGPDSNSGFSIWKLTRGIKRQPLFQSPIQDRGGLWLKTDDEKASAFASH FT LSSTFMPFNLTDDTNREAIANFLDTPTAPTRPIRHTSPQEVMMQLKALQPKKNPGYDGI FT DNRTAKFLPRKGVLALVKIFNVMLRLGHFPRQWKRARIVMIPKPGKSPTQIDSYRPISL FT LPTFSKVFERILLTRLMELPQVTEHIPQHQFGFRKSHGCPEQIHRLVKHITHGFEHKLY FT TVGVFLDVKQAFDKVWHEGLLYKMKDLLPAAHYAILSSFIADRTFDVAVRDSRSSMEHI FT HAGVPQGSVLGPFLYTLYTADMPTPVNNTDEASPAQLILATYADDTAMLASHSSLQFAS FT NAVQEWLHAIERWTAKWNIAINCTKSACVTFTLRPQTCLGLLFDGNTIDYVSSHCYLGV FT HLDRTLSWKAHITAVRAKSSWKLKKLDWLFHSSKLHMATKALLIKAILGPTLTYAIQVW FT GTASKTQLNRLRVIQSRAARHASGLPWYVSNRVIERDLKVVPLGDQINFHSSRYADRLS FT AHPNTLANDLVDPISLRRLKRIHPHELLTRKIV" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 2841 BP; 811 A; 761 C; 588 G; 680 T; 1 other; cctctgcatt taaaaatggg cccccttagg ataccaactt ggaattcaat tggggcctcc 60 ttgaagacca atgagttcct cgctcttatt gagcttcacg acatagacat cctcctgttg 120 tcagaagtcc acttctcctc ccggtccaac ttcagggtca atggctttac cctacatact 180 gccaaccatc cggatgacag ccaacacgga ggcgcggcga tccttgttgg atcaaatatc 240 tctcatcaac cattcgccac tctatcgcaa aatcatatcc aagcagtagt tattcagctg 300 actgcaagca gaggaaactt taatgttgcc tcagtatact gccctccgac tcttagatgg 360 acagatgcta ttgttgggca gttcatcaca caatttggca caaagttcct tgcggcaggt 420 gactggaacg caaaacacag atggtgggga aattacagga tgcgtaccag aggtagggtg 480 ctgtattctg cacttgcgga agaaggaatc gacatcgttg caactggaga agccacttgc 540 tacccctatc gtgcaactgc ctctccaagt gccatcgatt tcgggatctc caaaggcttt 600 agacagcaac aaattaaggt gcaactactg tcatctgayc atctcccctt gctgtttgag 660 ctagatgaag acgcccaatc atttagaagt gttacaaaaa tgctgtctcc acaagcgaac 720 attcggatat tcaaagaaca catcgaggct actgtagaac tcaacattcc cattgacacc 780 tgcagtaggc tggaagcgta cgtcgactac ttcacacctg caatcattga agctgcaagg 840 caggctacac caaccccaca tcaagctcgt ctcacggcca taaggaggcc tcccatttta 900 agcatagagg ctagggactt gctcagccac aaaagacgcc tcagaagacg atacatcgcg 960 acaggagatc ctagcattta ccagcaatat tcaagtacta ctaatagact gcgccgtctg 1020 ctagccaata accgtaaagc aaacctggat accttattgg agggtgctgg tccagatagt 1080 aatagtggat tctcaatatg gaaactcacg agaggcatca agagacagcc tttgtttcaa 1140 tcaccaattc aggatcgagg tggcctctgg cttaagacgg atgatgaaaa agcaagtgct 1200 tttgcttcgc acctgtcctc caccttcatg cccttcaatt taacagacga caccaatcgt 1260 gaagccatcg caaatttttt ggatactccg actgctccga cacgccccat aagacatacc 1320 tcaccgcagg aagtaatgat gcaactaaag gctttgcaac ccaaaaaaaa ccctggttac 1380 gatggcatag acaaccgaac tgcgaaattt ctaccacgca aaggagtgct tgctcttgtg 1440 aaaatattta atgtcatgct aagattaggt catttcccca ggcaatggaa gcgtgcgcgc 1500 attgtaatga tccccaaacc tggaaagtca ccaacacaga tcgactcata ccgcccgata 1560 agcctgctac caactttctc caaagttttt gagagaattt tgcttacccg cttgatggaa 1620 ctgccgcagg taacagaaca catcccacaa caccaattcg gtttcaggaa gtctcatggt 1680 tgccccgaac aaatccatcg ccttgtaaag cacatcacgc atggctttga gcacaagctt 1740 tacacagtcg gcgtattttt ggacgtgaaa caagcgtttg ataaggtgtg gcatgaaggt 1800 ctgctatata aaatgaaaga tctcctccca gcagcccatt atgccatcct cagctccttc 1860 atcgctgatc ggaccttcga tgttgcagta cgagactctc ggtcaagcat ggaacacatt 1920 catgcagggg ttccacaggg aagtgtgctt ggacccttcc tgtataccct gtacactgct 1980 gacatgccaa cacccgtcaa caacaccgat gaagcgtccc ctgctcagct gatcctggcc 2040 acctacgctg atgacaccgc catgcttgcg tcgcactcat ccttgcaatt cgcttccaat 2100 gcagttcagg aatggctgca tgcaatcgag cgatggactg ccaaatggaa catagccatt 2160 aactgcacta agtcggcctg tgtcacattt accctgcgac ctcaaacctg cctaggtctt 2220 ctcttcgacg gaaataccat tgactacgtc tcatcccact gctacctcgg agttcacctt 2280 gatcggacgc tgagttggaa agcccatatc acggcagtta gagccaaatc ctcttggaaa 2340 ctgaaaaagc tggactggct tttccactca agcaaactcc atatggcaac taaggctctt 2400 ttaataaaag ccatattagg tccaacactg acctacgcca tccaagtgtg gggaacagcc 2460 tctaagactc agctcaatag gcttagagta atacaatcgc gagcggcacg acatgcatct 2520 gggctaccct ggtatgtgag caaccgagtt attgaaaggg acttaaaagt tgtcccattg 2580 ggagaccaaa taaacttcca cagcagccga tatgccgaca gacttagcgc ccacccgaac 2640 acgttggcga atgatcttgt cgaccctatt tccctccgac gtctgaagag aatacatccc 2700 catgaactcc ttacacgtaa gatagtataa tctctacatt tgtatacact ataattaagg 2760 aacagaatcc ataatatctt ttaagctaca cattaggtta agttcagaag gaactgaacg 2820 attcctactg actaataata a 2841 // ID G7 standard; DNA; INV; 1192 BP. XX AC AC003788; XX DR FLYBASE; FBgn0067419; G7. XX FT source AC003788:1880..3071 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC REPBASE states this to be a consensus sequence. XX SQ Sequence 1192 BP; 364 A; 294 C; 257 G; 277 T; 0 other; tggtagctgg tgcagaaccg agaagcagaa agtcaacgcg ttcgccgacc atctgaaaaa 60 cgtctttacc ccttattgcc gctgctctcc tgctgatgca gtcatttata ctcctggagg 120 aacccctcga gcacgttgag cccataccaa cggtcacaga agaagaaact gctaaactaa 180 ttgctgctgt caagtgcctt gaatttagat acggcacctt ggaaacgtgc cgaggtagta 240 atgataccta agcctggaaa acctgagacc aatctcgctt tttatcgtcc cataagcttg 300 ctgccgaaag agtattttta agtagagcat tgccagttat ggacgaagct ggcctgattc 360 ccgatcacca gtttggcttc aggcggcgtc atcagaagaa atctgcccta gccttgaagt 420 acggtttctg aagaccttcc tcaagggtcg caagtttgcg gtgaaatttg gtgaagcgcg 480 ctctgatcat aaaggaattg gcccctgcta tacaatgctt acatggcaga actgactgtg 540 ctgcagagcc gtgggagacg ctgcaaccga gctgatgcaa agctggacct cctaaacgcc 600 tgacagaaaa agataactat agcggtcaac agcgaccagt taacggcttc cacattttct 660 ctccgacccc acaatccctg aaaatgccat ataacggtat caccctcgat aggcgactta 720 cctggaagcc gcatatcctg aaaaagataa accaagccaa ccagcccctc aagaaatatt 780 attggcttat aggaagaaga tcaaagttac caacttcgtc caaggttccc atatataaag 840 cgatcaggcc aatctggacg tatggcatcc aactatagca cagccaccgg cgctcacctg 900 ctgaatgaca acgtcacctt ggattcccta gggtcggaga agagataaag aagtgcagcg 960 atcggtacat aaagaggctt cataggcatc ccaacataac ggccatcttc ctactggaca 1020 atagtgaaca acgccgaaga ctgcgcagga ctcatcccct agatttggct cgagagccgc 1080 tgaataaata ttcaattcac ctgctaagtg tatgttaaaa aaattactta tccctggtgg 1140 atatttttaa ataaataaaa taaaatattt gaactaaaaa aatatataat aa 1192 // ID GYPSY7 standard; DNA; INV; 5486 BP. XX AC AE003788; XX DR FLYBASE; FBgn0067384; gypsy7. XX FT source AE003788:34068..28834 FT SO_feature five_prime_LTR ; SO:0000425:1..251 FT SO_feature three_prime_LTR ; SO:0000426:5236..5486 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 10%. XX SQ Sequence 5486 BP; 2005 A; 1066 C; 1059 G; 1356 T; 0 other; AGTTAACACA TTCAACGAGC AGCAAAGAGA GGCCAATGCA AGTACTGAAG CTTTCACGAT 60 AAGAGGCGAG CTTTTGGCTT GGGATCAAAG GCGAGCTTAA GAAATAAGGG AAGAGCGAAT 120 TGGATCATCG AACGCGTCGG TCGTCGCCAT GCAGACACAT ATTATTCAAT AAACTTACAA 180 GCTCAAATAT AATCCAAATA TAATTCGTAG GCCATAGACA CGACATCCAT AAAAACACAA 240 ATCTATTAAC TGGCGCCCAA CATTCACCTA CATTTCCCGA GACAACAAAA CTACGTTCAA 300 CCGACTTCAA CTGCGGATTT TCTTTCAACA TCAACAATCA ACATCAATAC AATAACAAAT 360 ATTAATTAAT AAAAAAAGGA TTAACTTAAC TTCTGTAAGT ACAACGCATA TTGGTTACTT 420 AATAAGATCT CTAACTTTCA TAACGACCTT TAAATTAACG AAGGTTATTG ATGGTTATTA 480 ATGACAAATA GAAGAAGACG AAGTTTTAGG CGACTTAGAG TAAAGTTTGA ATAGGTGGCG 540 AGAGCATTTA CAAAACCAAA GTGTGGGAGG TGGGAGAGAA GAAAATTGAT GGTTAAAGAA 600 GAATAAAGCT TGCGAAATGA ACTTATCGGC AGATCATTTG AAAGCAATAG TGGATAGTGC 660 CGTTGCTGGT GCGATTGCTA GTCAAAAATA AATATTTGAT AAACAGCTGC AATAAATAAG 720 TGCGCGTATA GAAAAAATTA CAGTGAACAC CCCAGAAGTG GAAACTTATA AAGATGCTGA 780 AATTGTACCA GGTGTAAGAT GTACTGAACC ATTAGACATA GTAAAGTCTT TGCCAGATTT 840 TGACGGAAAA AGTGAAACCT ATGTGTCATG GAGAAAAGCT GCCCATGTCG CTTTTAAAGA 900 TTTCGAGAAT TACGAGGGCA GTTCAACATA CCATCAAGCA CTTGGCATAA TGCGAAAATA 960 AAAAACTCGG CAAACACTGT GCTGGCGTAG TTTAACACTC CGCTGAATTT CAAGGCAATG 1020 ATAAATCGTC TCGATTTTAC ATACGCCGAC AAAAGGCTAG TATATTTAAT AGAGCAGGAG 1080 TTATCGACGT TAAGAGAGGG CGACATGGCA ATGACTGAGT TTTACGACGA AGTTGAAAAA 1140 AAACTTACTC TCCTTACAAA CAAGACAATA ATGACATGCG ATACCACTTT AGCGATTAAC 1200 GAAAAATACA GGTCAGACGC GTGTATTCAT AAGGGGAACA TAAAAGTCTT TGAGAGAGAC 1260 TTACCAACAG CTCTCGCTCT AGCACAGGAA GTCGAATCAA ACCACGAACG CTACCAATTT 1320 GCACTGAACT ATTCTAGAAG CTTAGGAGAA GGCCAGAAGG CTGAGAAAAA GCAAACTGAT 1380 AGGGATTGGC ATACGAACGT TCATCAATAG GGCATAAACC CTCATTTTAG TAAACGGAAG 1440 CCAGTGCCCA ACCCTGGCAA TCAGGCAAGT CAGACACGAC ACACATGATC AGTTAATGGA 1500 CACTGACGTA TCAAAGAGAA CTATTAGAAC GGGTCAATAC CAACCCGGGC AGCCACCTTC 1560 CTTGCATCAG GAAAGCGTAC CCTTACATCA AAACACTTGG CCGGCTCAGC AACAAAACAC 1620 ACGGCAATAC AGGCAAACTA GCTATGATGC AACAAAAAGA CCCTATAGTG GAACTGGCTA 1680 ACAGCACCAA AGCAAAAAAG AATTAACCAT CTAGCTTACG AAGAAGATTT AGCTGATACA 1740 GAAAATTACG AACAGGAAGC AGAAGTGGCT GTGGAGGAAT GGGAGGACGA ACTTGCATTT 1800 CATGATAGTG TTCATTTTTT AGACTTAAGT CCCTGTTACC ATTCATTGAA AGGAAAATTG 1860 CAGGGAGAAC CATAAAACTC TTGATTGGCA CCGGGTCGTC GAAAAACTAC ATACAGCCTC 1920 TTGCAGAACT AAAATATATA ATGCCGGTAC AAAAAGAATT TAAAGTAAAA TCGCTTCACG 1980 GTTACAACAC AATAAAACTT AAATGCCTAG TTAAGATATT CGATAAAAAC GTTCCATTTT 2040 TCATTCTTCC AGACCTCTCA AGCTTCGACG CCATAATAGG TCTTACCACA TTGACACAGA 2100 CAAATGCAAT ATTGGACCTA AAAAATAAAA CTTTAAAAAC TGGTAATACA GTTTAACCAA 2160 TTAAATTCAT AAGGTGTAAC AGCGTTAATT TCTCCGATAT TAAGGACATC ATCGTACCTA 2220 AGCCAATTGC TGATAAATTT CACACTATGC TGGCAGACAG AGTGGGTGTC TTCGCAGAGC 2280 CGGAAAAAGC ACTGCCTTAC AATACACATA TTGTTGCCAC CATACGTACC CAAGATAATC 2340 AACCAATATA CACAAGACTT TACCCTTACC CCATGAGTGT GGCAGATTTT GTGAATAAGG 2400 AGATGAAAGC ATTGTTAAAA GACGGCATAA TCCGACCCGC ACGATCACCA TATAATAGTC 2460 CGGTTTGGGT AGTCGATAAA AAAGGCACAG ATGATCAGGG GTGTAGACAT AAAAGAATGG 2520 TTATAGATTT TCGAAAGCTT AACTTTAAGA CCGTGGACGA CAAATATCCG ATACCAAATA 2580 TAACATGCAT ACTGTCAAAT CTGGGGAAAG CCAGGTTTTT CAGCACGTTA GACCCTAAAT 2640 CAGGATTCCA TCAAATCCTA CTCGCCGAAA AGGATAGAGA GAAAACAGCC TTTTAAATTG 2700 CAAACGGCAA GTACGAGTTT TGCTGACTTC CCTTCGGCCT AAAAAATCCA CCTAGTATAC 2760 TTCAACGTGC CATAGACGAT GTACTTAGGG ATGAAATAAG AAAGTCAATT ATTTTTTTAG 2820 AACGACTAGA GGATCATGTG GATCATATTC GGTGGGAACT GGATAGATTA TTCGAAGCTA 2880 ACAGAGAGTT TCTAGGAAAA AATCTCAATT TTTAAAGAAA GCGTCGAATA TTTAGGATTC 2940 ATAGTATCTA GCGGGGGTAT TGAAACCAAT CCTTATAAGG CAGAGGCTAT TAAAACCTAT 3000 TAAGAACCCA CAAATTTTTT CAACGTGAGA TCCTGTTTGG GACTAGCGAG TTATCGCAGT 3060 TTTATAAAGG ACTTTGCATC TATAGCCAGA CCATTGAGTG ACATTCTTAA AGGCGAAAAT 3120 GCCCAAGTTT CAGCCAGTCG CTCTAAGAAG ATTCAAGTTA GCCACGACAC TAAACAACGT 3180 TCTGCCTTTG AAAAACTTAA GAATATCCTT ACATCCGAAA ACGTAATGCT GCTTTATCCA 3240 GACTATAAAA AGCCCTTCGA CTTGACAACG GACGCCTCGG CACTGGGCCT CGGAGCGGTT 3300 TTATCCCAAG GCGGCAAGCC TATTACAATG ATATCTAGGA CTTTAAAGGA TAGAGAGCTC 3360 AGTTTCGCAA CAAACGAACA CGAACTCCTA GCTATAGAAA GAGAAAGAGT TCCTTAAAAA 3420 GTTTAAGGAA CTATCTGTAT GACGTCAAAA ATCTGAACAT TCACACTGAT CACCAGCCGC 3480 TAATTTTTGC CGTTTCAGAC AAAAACCCAA ACGCAGAAAT TAAAAGGTGG AAGGCGTTTA 3540 TAGACGAACA TTTCTATAAA TGTCTCTAAC ATTTTCTATA AGGCAGGGAA GGAAAACTTC 3600 GTTGCTGATG CACTATCTAG ACAAGCCATT CATGCTGTTG AAAGCGACGC TCGGTCAGAT 3660 ATATCGACAA TTCATAGCGA AATTTCACTT TCATATACCA TTGAAACAGT CGACAAGCCA 3720 GTCAATTGTT TCGGAAATCA GATAGTCTTA GAAGAAGGCA CAACATATTG TACTCGTACG 3780 TTTGTTATAT TTAGAAATAA CTCGAGACAT TTGATACAAT TCGCAAACAG GGAAACTTTG 3840 GTTGGCAGAA TCCGTGACGT GGTTAAACAG GATGTAACGA ATGCAATATA CTGCGAACTG 3900 CCCGTATTGG CATTCATACA AAACAGACTT GTGGAAGAGT TTCCTAGAAC GACATTTCGA 3960 CATACTAAAA AAATTGTCAA CGACATTTAT AACAAAGACG AACAAAGGGA AATAGTAACC 4020 ATCGAGCATA GCAGAGCACA TAGGGCAGCA CAGGAGAATG TAAAACAGAT TTTACAAAAT 4080 TATTTTTTTT CCCAAAATGT CACAGTTAGT AGCCAGTATT GTAAGCGACT GCTTAGTCTG 4140 TACAAAAGCC AAATACGACC GTCATCCTCG AAAACAAGTC CTTGGGAAAA CACCTATTCC 4200 CTCACTAATT GGCGAGACAT TACATATAAA TATTTTCTCC ACTGACAGAA AGTTCTCCAA 4260 ATTCGCGATA GTGCAACCGA TAGGCTCTCG AACAATAGCT GATATAGAAC CTGCAATAAT 4320 GCAGCTAATG AATTTCTATC CATGGGTAAA AACAATTTAT TGTGACAACG AACCATCTAT 4380 GAACTCTCAA TCTATCAGGT CACTCTTACT AAACCGATTC AATGTAACAG TGGAGAATGC 4440 ACCGCCACTC CATAGTATTT CCAATGGACA GGTGGAAAGG TTTCACAGCA AATTGGTAGA 4500 GATAGCACGA TGTTTAAAAC TAGAAAGAGG CCTAGATGAT ACGGTCAACC TCTTACTCCA 4560 GGCAGCCATT GAACATAATA AACGAACGAA CTGTCACTAA CAAAAGACCG ATCGACATTA 4620 TTCATGCAAT GCCTCCCGAA CTTGCAGAAG AAATAACAAA TAGAATTGAC AAAGCACAGG 4680 AGATACAGTT AAAAAGGATG AACGAATCAA GGTCTTTTTT TCAGGTTGGG AAAACGGTCT 4740 TGGTAAAACA AAATAAACAC TTAGGAAATA AAATCACCCC ACGGTACAAG GAGGAAAAGA 4800 TTGATGCAGA TATGGGAACT AATGTCCTTA TAAATGGAAG GGTAGTCCAT AAAGATAATC 4860 TACGCTAGGT ATCTGTTGTT TAAATAAATA AGGATTTGGC AATTATTTAA TTAAATAAAT 4920 CTTTGGCAAG TCTACTTCCC CAGAAGATAG AAAAGAATTC GAATCCAAAA ATTGGAAAAA 4980 AACCTTTACC AGTATTTTTC AATAACAAAA AAAAAGAAAA AAAAAAACAA CAAACTTAAC 5040 CAATTGGCAC TGTGCATAGT TTCTTAATAT ATTTATATTA TATCCAATTA TACTAGTCTT 5100 TAATGTTTGT AATTTGTAAT GAATAAGGCA CGGTCATCGC TTTAGCCATT ATAAGAGGTA 5160 GCCGGCAATG TTATCGTCAC CACACATAAA CCTTCTCAAG AAGTCTGTGG ACAGTCTTCG 5220 TCTTAGAAGG GGAGGAGTTA ACACATTCAA CGAGCAGCAA AGAGAGGCCA ATGCAAGTAC 5280 TGAAGCTTTC ACGATAAGAG GCGAGCTTTT GGCTTGGGAT CAAAGGCGAG CTTAAGAAAT 5340 AAGGGAAGAG CGAATTGGAT CATCGAACGC GTCGGTCGTC GCCATGCAGA CACATATTAT 5400 TCAATAAACT TACAAGCTCA AATATAATCC AAATATAATT CGTAGGCCAT AGACACGACA 5460 TCCATAAAAA CACAAATCTA TTAACT 5486 // ID GYPSY8 standard; DNA; INV; 4955 BP. XX AC AE003788; XX DR FLYBASE; FBgn0067383; gypsy8. XX FT source AE003788:10579..15229 FT SO_feature five_prime_LTR ; SO:0000425:1..304 FT SO_feature three_prime_LTR ; SO:0000426:4652..4955 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 7%. XX SQ Sequence 4955 BP; 1430 A; 1132 C; 1623 G; 770 T; 0 other; TGAGGAGTCT TAAAACTCCC CATAAATCTC GTGTAGGGGA GTGGCATAGC AACAGCCGCG 60 GTGCAGGGTA AAGCGGAAAG ACCGTGAATT ACAGTACAGT ACTGGACCTT GAACTCCAGC 120 GGGTCAAGGG CGAAGAGGTC GAACGGTAAC GGTGCGTGCA CAAAGAAGAC GGACGGTGAA 180 CTAACGGTAT CGCTCTGAAA CTTGGAGCGG GCCAAGGGCA AATAGGTCGA ACGGTAACGG 240 TGCGTGCGCA AAGAGGACGC ACCGTGAATT AAAGGCACAA TATGTGGACC TTGTACCCGG 300 GCGGGCCGTG CGCAAAAAGG GAAATAACGG GATCCCTGCG GCCTATAAAA GGTAGGCGCT 360 AGCACGGATG CGGGGCAGTC AGTGAGACAG CAAGATCGCG CGTGGAAGGA GATAGTGCGT 420 CGAAGCGTTA CCAGTAGGAG TTCGGGAGTG AGAGCGCGCG GTCAAGTAAC AAACAGTGGA 480 TGCGTCGGCG GCAAAAAGCC CGAAGGACAC GGAAGGAAGA TCACTACAAG CGAGTACAGA 540 AGGTGCCAGG ATGACACCGC AGGCCCGTCG AGGAACGACA ACAACAGATG AGAACTGGAC 600 CCGGTGCGGC GCGGTTTTGT GCCAAACCCA GCACAAGCCT GCCACAGATG CGGTGGCCAG 660 GGCCACTGGT CTCAGGAGTG CAGGAACCGG CCGATTAATT TTTGCTGGAC CTGCGGGAGA 720 ATCGGTCCAA GGACTTCGGA ATGCTGCCAC AGATCGGGAA ACGCCATGCG ACCCCAGCCT 780 CAGAGGGGCA ACCAGGGTTC GCAAGAATTA ATGGCCAAAC TGAAGGCGGA AGAAAGGCAA 840 CTGTCCGCCA CAGTGCTCAT AGACGGAGTA GAGATAAAAG CCACTGGCGG ACAGGTTGCA 900 GGCGGCAGGA GAGGTCTTAC CTACGAGGAG AGAAGTGAGA ATGGCAGATG GACGGTACGA 960 AGAAGTCACA TCGGTGATCG AAGTCAACAT AGGATTTGGC GAGAGGACCG TTCGGATGCA 1020 ACTGCTGATC CTACACAACA TAATCGACGC ACTGGTATTG GGTTGGGATT TCCTAACGAG 1080 AGTGGGAGCA CGTATGGAGT GCGCCGGACT GAGCGTAACA ATACCAGTTT GCTCAACGGG 1140 ACAAAGCAGG CCAAGGGAAA AGCTTTCAGT GGCAGTCGTG GAGAGGGCTG ACTTCTCAGA 1200 GAAGGACGTG GACGAATTCC TGAGGTCGGA GTTGACCAGT TTAGAAAACA TACAAGGGAC 1260 GTCAACCGTG ACAGTGCACC GAATAACGAT GAAGGACGAT CAACCAGTCA AGCAGCGGTA 1320 CTATCCGAAA AACCCCAAGA CTATGGGCTG GGCGCAGCCC TCACACAGCA TTCCGAGCGA 1380 GGCGAGCGAG TAATCTCCTA CTCCAGTAGA ACGCTGAACG CAGCGGAAAG AAACTACTCT 1440 GCAACGGAAA AGGAATGTTT AGCAATATTA TGGGTCGTCA GAAAGTTGAG ACCGTACCTG 1500 GAGGGCCGAT GTCATCACGG ACCATATGGC TCTGAAGTGG CTGAATAGCA TTTAGAGTCC 1560 GTCCGGCAGG ATAGCAAGGT GGGAGTTAGA GCTGCAACAG TACGATTTCG AAATCTCGTA 1620 CAGAAAGGGC CAGTTGAACA TCGTCGCGGA CGCGCTTTCA AGACAACCTC TGCAGGAGAC 1680 GAGCCGAAGA GTGAGCGTGG AGGACGACGA ACAATGGAGA GAGCAGCAGG GATGGAGGAG 1740 ATGCAAAGAA AAGTGAAGCA GCAGCCGCAG AAATTTCCGG ACTATTTGGA GGAGGACGGC 1800 AAGCTTCCGC ATCGGGCGGG CAACGAGGAT GTGGCATCGT GTAAGATGTG TGTACCAATC 1860 GGTCAGAGGC AGTGCGTCAT GACCGAAAAC TACGACATCC AACTACGGGA CACTTGGGAA 1920 GTAGCAAGAC GATTGCCAGG ATTGCAGCTC GTTTTTATTG GCCGGGAATG CATCGAGACA 1980 TAAGAAAGTA CGTGCGAAAC TGCGAGAGAT GCATGAAGTA CAAACCCAGC CAGCTACAGG 2040 CGGCCGAAGA AAAGGTTAAC ACAGGAGCCG GTAGAAGCAT GGGCAACCGT GTGCGCAGAT 2100 TTCGTGGCCC CTTGCTGCGG TTGAAACACG GAAATGTTCG TGCAGTTCAC AAGTAGGGCG 2160 TTCAGGAGGT TTCTGGATGA GCTGGGGGTG CGACACCAGC TCACGGCTCC ATATACTCCG 2220 CAAGAGAACC CTACGGAAAG GGCCAACAGG ACCGTCAAAA CAATGATCGC TTAATTCACA 2280 AGTGCCGATC AGAGGACATG GGACGAGCAG TGGCCGGAGC TGCAGCTGGC GGTTAACACA 2340 AGTGTGGCGG ATCACAGGAT ACTCGCCGGC GTTCATAACA CAGGGAAGGG AGCCGAGTCT 2400 GCCCAACGCG TTGTTTACGA AAAGACGACC GGGACCGGCA AGTGCACGCA GACACCAGCG 2460 GAAAACGCGG AGAAATTGAA GGATATCTTC GAGATGGTGC GGAGAAACAT GGAGAGGGCG 2520 GCACAGGATC AGGCACGCCA CTATAATCTC CGAAGACGAA GGTGGGGGAC ACAGCATGGG 2580 CCAAAGAGCA CCACCTGTCA AAGGCAGCCG AAGGTTTCGA GGCAAAATTG GCCCCGAGAT 2640 TTGACGGGCC GTACACAATA AAGAAGTTCA CATCACCAGT AATATGCGTC CTAGAACACA 2700 ACACAACCAA AAAAGAAAAG ACGGCACACA TCAGCGATCT GAAGCCGGGA AGTGCGGGCG 2760 CCGGCGTGCC AGGAGACTTC GAGGAATAAA CGGAAAAAAA AACACAAAAC GTTGGTGAGG 2820 AAGGGGGGAA GACGATACAA GGTATCGATA CGGATCAGAA GTCAACATCA CACACACACA 2880 CACACAGGCC CGTCAGAGCG CGAGAGCGGG ACAGGGAGGG TACGCAGCCA AGAACCGTTA 2940 ACGGGTAACA CACACACTCA GCTGCGAATT CCGTGGGAGG CGAGTGTACG GACGAGAAAA 3000 GCTCCGTTAC CGGTACGGGC CCACGGAATA ACGGCACAAG GAGCAGAAAG AAAACCGAGG 3060 TTTCGTTGCG GAGAGTGCGG CTCGAGTCTG TCTTTTCAAA TTCGAGATCG TCATGGCAGC 3120 ACACAGTTTT CACGGCCGGG GAGTACGAGA GACCGGCGCC GGGGAACAGC CCGGGGCGAT 3180 ATGGGATTCC GATTCCTATC TCCAGCTCCT GGCGTCACCC CTGGTATCGC GATCTCCATC 3240 CCCGGGGAGT GGGGAGGACC CGGAGGAAAT TCCGGACGTG GAACTTGAGG AAGACCAACC 3300 GATGGCGGAT ACGGAGGGCG CGGCTGAGGT CATCACCCTC TCTTCGGAGA GTGAGGAACA 3360 TGACGACGGA GGCGAAACGG TCACCCCGGA GTTGTCATCG GACGAGGATG AGACCGCGCG 3420 GATGGCTTAC CGAAGCGTCC GGCGGGCTAC GCAGGGCCAG TCGTGCCGCC GACCGAGGAT 3480 CCGGTGGACA TGAAGAGATT CATGGAGGAG CGAGTCGCCA CCCTGCGGCG ATTCCAGCAG 3540 ATGGTGGATG AGCGAAAGGT CGCCGACGCC AAGGCCGAAT GGCAGGAGCG GTTGGCTTGG 3600 CTGCAAGAGG AAGAAGCGTT GTGGGCACCA ACAGGGGAGA AGAGCCCGGG AGAAGCGAAC 3660 CCGGAGAAGA GTGGAGCTGG CCCGCGCCAG ATGAGGCGCC GCCACCACCA AAGCCGCGCG 3720 GCAAGGGGCG CGGGGGCGAA AGGTGCACTG GGCGAAGTTG TTAGAGCTTG GCGGGCTTCA 3780 ATGTGCGGGG GGACGGAGAG GTAAAAGTCA CGATATCAGA ATAAAAGATA CCGGACGGCA 3840 AAAAACAAAC CGTATGTCCG AGTTTGTCGT AAACGGGCAC TGGCACAATA ACTTAAACAA 3900 AAAATGGTTG CGGGTGTCAT CTGCAGCAGC CGAATTAACG CGCGAGGGGG TGACGCGCGG 3960 CCAGGCACGC AGGCCGGCAA ACGACGCGTG GCTGCTGCTG GCGCGGGACG AAGGAGGGGC 4020 CGTTGCAGAT GCAAGCGATC CTCCGGGGCG CTCGGGGCCA GGGGCAGCCA CCCCAGAGCA 4080 TGAAGGGGCA TACATGTGTT GGTGGAGCTG GAAAGAAGAG AGGATTCGTT AATATGGCGA 4140 TACGCAAGAA AGCGTTAGAC TCACCTTTCA ATGAGCTGCT ACACCGGATC ACGTGTGAAA 4200 AAGAATTGGG ATGGTGGCGT CGCGATAATG GCGACCGATC GATAGTGTGA TCGATAGAAA 4260 TACTGGATGA GCACGGATGC GCAAATGCGG CCACACTGTG GGCAGCGATC CGCACGCGAT 4320 GGCAGCGCCG GAATATCGGT AGTGACAACG CCGCTTACCG ATATCGATAA CACGAGTTCG 4380 GCAATGTTGC CAGGCGGGAA AATTCAAATA TCGAACTGGT GGAGCAGAGC AGACAGCAGC 4440 AGCACATGGA AGGTGAAAGA AGACCGCTAG GACAGAGGAG AACGGAGAAG GATATGTGAA 4500 GGGTCAAGAT GCCCCGTCGA GATGGCTAAC GCCAAAGCTG AGACCAGAAG GATGTAGGAC 4560 AACGAGCGGA GAAGTTGGAA GGAAGAGACA CCGGAGCGGA GGACCGTCGG GATCATTAAG 4620 TTTTTAAAAT CTTCTGAAGA AAGGGGGAGA TTGAGGAGTC TTAAAACTCC CCATAAATCT 4680 CGTGTAGGGG AGTGGCATAG CAACAGCCGC GGTGCAGGGT AAAGCGGAAA GACCGTGAAT 4740 TACAGTACAG TACTGGACCT TGAACTCCAG CGGGTCAAGG GCGAAGAGGT CGAACGGTAA 4800 CGGTGCGTGC ACAAAGAAGA CGGACGGTGA ACTAACGGTA TCGCTCTGAA ACTTGGAGCG 4860 GGCCAAGGGC AAATAGGTCG AACGGTAACG GTGCGTGCGC AAAGAGGACG CACCGTGAAT 4920 TAAAGGCACA ATATGTGGAC CTTGTACCCG GGCGG 4955 // ID GYPSY9 standard; DNA; INV; 5349 BP. XX AC AE002591; XX DR FLYBASE; FBgn0067382; gypsy9. XX FT source AE002591:<3717..>8393 FT SO_feature five_prime_LTR ; SO:0000425:1..336 FT SO_feature three_prime_LTR ; SO:0000426:5014..5349 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 6%. XX SQ Sequence 5349 BP; 1992 A; 1135 C; 968 G; 1254 T; 0 other; AGTTATCATA AGTAAGATCA ATAATACTGG ACTTAATACT AAGTCAGTAT CCGGTCCGAA 60 TCTACTACGA TAAATCAGCA GAATTTGCGG TTACTGTGAA TGGACACCCA CAGTGGGAAC 120 CACAGTGGTG CAGCGTTAGC AGTCTGGACG TTTTGGCGTG AGGAGTCACG AGGCGCAACG 180 TCGACAGAAA TAACGGCAGG TGTGCCGCTA TGAATATCTG AATGTGAAAT TGTACTTTAG 240 TTTTAAACAG AACTCAAAAG TGATCAATCG TATTCTGATG CACTTCATAA AGATAAAAAT 300 AATTATTATT AATAATAACT TAATAATTGT GTAATTGGCG CCCGAACGTA GTAAAAGTGA 360 TCTGAAACCG GATCAAAAGA ACTCGCAACG GAAAATATTT AATTAAAGAA CAACAAACAA 420 AATAACAGTG AAATCGTTTT ATATTAACAT ATAAAACAAA ACAACAACAA CGAAAAACTA 480 AAAATTAAAA GCCCTAGTGC AACTACACTA AGCGAGTGAA AAACAAAAAC CACGGCCGGT 540 TCATATGTAA CATATAAAAC GAAAACGTAC AACAACAACA AAAAATAAAA CCCCAGTGCA 600 AATGCACTGA GTAAACGAAA TATATAAAAC GAAAAATACA TAACCGACAA AAGAAGAACA 660 CAGGAGCGGA CCAGCACACA ACGGCACTAG AATAATAACG AAAAATGTAT GTGAGAAGTT 720 CATTGCCTTT TCTTTCTAAA ACAAATTAAA TAAAAATTAT ATTTAAAAGT GATTATCTGT 780 CGATTTATTG GCCTAGTGAG AGTAGTAGTA AACTAATATC AAAATTTTGT TCAGTACAAT 840 AAAAGCCATT GTTTCTTGCA AACAAACACG AATGTATAGT AACTTGAATC GAAGTAATTC 900 TAGTAGTGAC GAGGACACAA TACCTAGTTA GATGGAAAAA ATTAAAAACG AACAAGTTAC 960 GATAATGAAT ATAGAACAGT GCACAACAGG AAACCTCGCA GCAACACAAG TTGCGTAGGT 1020 TATCTGAATA TGGTAGAAAA TATGAATATG GTAGAAAAGC GGTCAAAGCC GCGCTTGAGC 1080 ACCAATCGAC GGTTTTCGAA CAGAAATTAT CTGACACGAC ACATTATATG CTACGACAAA 1140 TAGATACTAT TTCTATAAAA TCAGAAACTC CAAAGGTAGT AGTATACGAG GCAGCTAAAG 1200 TAATACCAGG GATTACGTGT GACGAACCTT TAGATATCGT AAAGTCCATA CCAGAATTCG 1260 ATGGCAAACA GGAAAATTAT GTTTCTTGGC GGCAAGCAGC AACGGTTGCT TACGAACTCT 1320 TTAGACCATA TAATGGGAGT TCGAAGCATT ATCAGGCTGT TTCAATAATA CGCAACAAAG 1380 TTAGAGGAAC TGCCAATTCA GCACTATCCT CATTTAGTAC GGTGTTGAAT TTCGACGCGA 1440 TTATAGCGCG CCTAGACTTC ACATACGCAG ATAAAACCCC ACTGCGTGTT ATACAACAGG 1500 AACTCGCAAC ACTGAGGCAG GGCTATATGC CCCTGCTCAA GTATTATGAC GAGATAGAAA 1560 AGAAGCTAGC ACTTCTAACC AACAACGGAC CAAATAATCA ATCACAGGGT CAGAAGGCTG 1620 GTAGTTTCAA ACAACGCCAC GCCCCCCCAA ATAATCAAGG CGTTGAGCCT ATGGAGGTTG 1680 ATAGCTGAAC ACGCTATAGC CAACAGACCC AGTATAGAAG AAATAATGAC CAATCCATGC 1740 GTAGCAGACA AATGCTGCAT CACACGTCCC TGGAATCACA GGGGATCAGA ATACGATTTG 1800 TTAGCACAAT CAGAGGTATT AGCAACCGAT GACGATGACG CATCCGACGG CGAACAGTGC 1860 AATTTTTTAG GGGAAACTCC CTGCTACCGT ACATTACAAG AACAGTAGCA GGGCATATTA 1920 TAAACCTCCG GATAGATACG GGAGCATCAA AAAATTATAT AAAACCACTT CCCTTTCTGA 1980 AAACCCTTAC ACCAGTTGAC AACCTGTTTC AAGTCAATTC TATACACGGG CATACCAAGA 2040 TAGAGCAGAA ATACCTTATC TATTTATTTG GGACGAATAC TTGTTCGTCC TAAACAGTTA 2100 AAAAAATTTC GACGGCATTA TAGGCCTCGA TTTATTAAGA AATGTTGATG CAACCATTAA 2160 TCTTACGAGC AATTTAATTG CACACAAATT CGGCTCAGAG CCCCTACAAT TTATAAAGTA 2220 TCAAAACGTG AACTTCATCA AAATTGACGA CAGTAACATT CCACTGGCGG TCAAAGAAAA 2280 TTTTAACAAA ATGATCATGA CGAAAATGTC ACTAGAAAAA ACGTCTGGTT ATTGATTATA 2340 GGAAACTAAA TCAAAAAATA ATAGACGACA AATACCCCAT ACATGCCATT TTCACCATAC 2400 ATGGGTAGGG CTCAATATTT CACAACATTG GACCTTACTT CGGGTTTCCA CCCGATTGAA 2460 CTAGCTGAAA GGGATCGAAA GAAAACAGAT TCAAAGAGAG GAGAACCTGG CATTCATTGT 2520 GTCAAGTAAG GGATTAAAAA CTTCGCCTGA AAAAATTCAT ATAAAAATTT TTATGTCCAA 2580 CCCCCAAATA CTCTATTCGG CCTAAGGTCC TTCTTAGGCT TATCCAGTTA CTACAGGTGT 2640 TTCACTAAAG GCTTTGCAGC CATGGCCAGA CCTCCTACAG ATATGTTAAA AGGTGATAAC 2700 GGAAAAGTAA GTACCAACCA ATCCAGGAGA GTTAGAATAA ACCTGAACCA AAGCCAACAA 2760 CAAGCATTCG AAAAGTTGAA AAACGTCCTA GCATCAAAAG ATGTGCTCCT TATTTACCCA 2820 AACTTCAACA GACCGTTTGA CCTCACAACT GATGCTTCGG CACATGGAGA CAAATGAGAC 2880 AAACTTCGCG ACTAACGAAC GAGAACTTCT TGCGATAGTC TGGGCATTAA AAAACCTAAG 2940 AAACTATCTG TACGGAGTCA AGAACCTTAA CATCTGCACC GACCACCAAC CGTTGACATT 3000 CACGGTATCG GACAGAAACC CAAACGCAAA AATAAAACGC TGAAAGCTTT TATCGACGAA 3060 CACAACGACA ATATCATTTA TACACCAGGC AAGGAAAATC GTGTAGCTGA TGCCTTGTCC 3120 CGTCAAAATG TTAACGCGCT AGACAATCAC CCTGACTCGA ACGAATCCGA CAATGACTCC 3180 AACATTGCTA CAATTCACAG CGAACAATCT CTTACATACT CAGTTGAGAC ATCCGACATC 3240 CATCTGTTAA AACCTTTATA CTATTCAAGG ACAAGCACCG ACAATATAAT CCAATTTACA 3300 GACCGAGAAA GTCTATGGCA CACTGTACTA GACTCAGTAA ATGCAAACGT TGTGAATGCC 3360 ATACACTGCG AACTCCCAAT ACTAGCCTTC CTACAACACA GACTAATAGA AACATTCCCC 3420 TCAACGACAT TCAGGTATTG TAAATACGTG GTAACGGACA TTACTGACAA GACAGAACAA 3480 ACCGAAATAA TAACCACAGA ACATAACAGA GCACATCGCG TAGCCCAAGA AAACGTCAAG 3540 CAGATATTGC GCGACTACTT CTTTTCGAAA ATGAGCCGCA CATTCTGTCA TCATCTTAAT 3600 CTTAATTATT GCGATGGCGA TCGCTACAAC AAAAATAACT GACTACTCAC ATACGGACTA 3660 TATACCAATT ATGGACGGAA ACGTAACGAT TTGGGACGAG TACTTAGGAC ACATGACAAA 3720 CGTGACATCC TACGAGATAT ACGCAGACGA GACCAAAAGG ACTATTGACT TATTACAAAA 3780 GGATCACATG AAAAGGATTC TTACTACAGA CCTCGGACAC ATTGACACGT TGATCGCTAC 3840 AATAAGAGTA AAACATAGAA ATAAGAGTAG TATTAACCCG TTAGGGACCG CATTAAAAGT 3900 AGTAGCCGGC ACACCAGATT TTGACGACTG GGAACAGATA AGGTTCCGGC AGGAACAATT 3960 GTTAGAGTCA GAAAAAAGAC AGGTAGAAAT AAACTACAAA TTACAAAATC GATTGAACCT 4020 TTTGACAAAA ACCTTAAACG ATATCAACAA AGCTGACGAA ATATACACCG AACATTTATT 4080 TGAAACAATG CTAGCAAAAA ATAGAATACA AATAGAGTAA ACTAGTAATG TTTAGAAAAT 4140 GGTTTGAAAA AATGGTAACT TCACTCACCT TGGCAAAAAT CCATTTGATC AACCCTGTCA 4200 TTTTGGACGA TATTGATATA AAAGAAATTA ACAATGAACA ACTCACGAAT GCCAGCGTAG 4260 CTGACATTCT AGAAGTAGCC AAAGCTAAAG TTTTCCAAAA CAATAATATG ATATATTTTT 4320 TAATTACATT TCCAAACCCT GAGTTAGTCT GTAAAAAGAT CAAAATTTTC CCCGTTCAAC 4380 ATAAAAATAC AATATTAAAC TTAGAAGACA GCAACATATT CGCAGATTGC GGCTCAAAAA 4440 CCATTGCAAT AAACCAATGC GAAGCCACAG TGAGCACGAC ATTCTGCAAA ACAACAAGTA 4500 CACCCACCTG TGCCGAACAA CTAACCGTCG CACAATGCGG CACCCGATCC AGCCACCTAA 4560 ATCCAATAAC AGAGGTCGAC GAAGGTATCA TCATCATTAA TGACGCTGCC ATGAGAGTCA 4620 CTGATCAAGA AGGCCTCAAC CGAACAATCA CGAGGTCTTA CCTCATAACC TACGTTGATC 4680 GGGTCTCCCT AAATGGGACA TTGTTCATCA ACCAACCAAG TCTCTGAAAC AGAAGCCAGC 4740 AGCATCCGTC GGGACTCAAA TCAATGTCAC GAGCCACAAA AATCACCACA GCCTGCCCTA 4800 CCTCCATGAG CTAAGCTTGA AGAATCTCAT ATGTAGCTTA AAAGCTGAAG TATTGTCGAG 4860 ACCGATCGTA AGCAGTGTCA TCACTTTCAC CATCTCACTC GCCTGTTTCG GCATTGTTTA 4920 CATCGCCTAT CACTGGTTCA AACGATAACG CCTAATCGAA CCAGAGACAG CTCTGCAGAG 4980 ACCCGAGGAC GGCCCTCACT TAACTAAGGG AGGAGTTATC ATAAGTAAGA TCAATAATAC 5040 TGGACTTAAT ACTAAGTCAG TATCCGGTCC GAATCTACTA CGATAAATCA GCAGAATTTG 5100 CGGTTACTGT GAATGGACAC CCACAGTGGG AACCACAGTG GTGCAGCGTT AGCAGTCTGG 5160 ACGTTTTGGC GTGAGGAGTC ACGAGGCGCA ACGTCGACAG AAATAACGGC AGGTGTGCCG 5220 CTATGAATAT CTGAATGTGA AATTGTACTT TAGTTTTAAA CAGAACTCAA AAGTGATCAA 5280 TCGTATTCTG ATGCACTTCA TAAAGATAAA AATAATTATT ATTAATAATA ACTTAATAAT 5340 TGTGTAATT 5349 // ID GYPSY10 standard; DNA; INV; 6006 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067387; gypsy10. XX FT source nnnnnnnn:1..6006 FT SO_feature five_prime_LTR ; SO:0000425:1..364 FT SO_feature three_prime_LTR ; SO:0000426:5643..6006 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 6%. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 6006 BP; 2113 A; 1283 C; 1203 G; 1405 T; 2 other; AGTTAACACG AGCAAGTCAG AGGGGTGCAA CGACCCGCCG AAATGCCATC CAAGACCGGA 60 TGTGCCAACA ACAACATCGA CGACGCTGAA TCAGCCCAGT AACGCATTTC GCTTTAGCGA 120 AAGCGATGTG CArGGGTCAG CGTGATGCGA GGCGGGCATG TGCGCGCTGG GTGACCGTCG 180 AAGGCATGTA ACTCCAGACG GCACGAGGCG CCGACTAAGC GCTCTGCTAA GTCCGCTGCC 240 TCCAACCGAA GACTTGGCGG CCTTAATAAA TTCGATTTGT AGTTAGTTCG TAACAGACCA 300 GAATTCAGAG CAGTTGGCTC TAATAAAAGA ACCATTTAAT AATACAAACG TATTTGGTTT 360 AATTGACGCC CAACGCGCTT AAGTGTCGTA ATAACGTAAA CTCCGTAATA AAAATATCTA 420 AAAATCAAGG TCTCGTAACC GGGTTTTCTT TGAAAACGCT TCGGACTCGC ACGCTGCACA 480 ACAACAGTGA ACATCAACAC ACAAACAAAA ATATCTAATG CAAAAGTAAA ACAACGCCGC 540 ACAAGCAACA GTGAAAACCG CTGCATATCA ACAGTAAACA GCTAAAACGA CAGCCAGAGG 600 ATAAACGGTC AACCAGCAAC CGTCTTCGCT GCATTTTAAT ATACCAATAT AACATGTAAG 660 TGAGATCTTT ATTTTCAATT TACCTCTTTT AACGTCAACC AGATATGTGT GCCATTTTTG 720 AAAAAAATAC AAAACTCATA ATAAATAACG AATAAAACAA ATAAAAAGAG TGAATACAAT 780 TAAAACTTTA AGTATAGTGA TTTTAATTTA CAAATTGATT AGAAAAAAAA AAATGAAGAA 840 TGAAAATTTA AGTCTTTCCT AAGAAGTAAT TCTTTCTCAT ACAAAGATAT CATTGTCCGA 900 TGTATAGAAA TAGTATCGAA AACAACGACA GTGACGAGGA CTTAATACAG AGCTATAAAG 960 ACATAAGAAA CAATAACCAA CATGGGGGTT ACATTGACGG TACTGAGACC GACGAAGATA 1020 TAGAATTAAA AAACTTAATA GACGGCTTTA CAGACAGAGT AAAAATGGTT AACGATCAAG 1080 AATTTCAAAA TCAACCTCAG GACCTTCAAC AAGAGGCAAC ACCAACACAG GAACAGGCGA 1140 AAACCTCAGG TACCATTAGT AATAAAGATG TAATTAGACT CATGGAAGCG GCCGTAAATG 1200 GTGCACTAGT GCACCAGCAT CAACAAAAAA TATTCTCAGA CAAGCTTAAA GAGGTTGAAA 1260 ATAGGCAGAG AATAAACGAT GGTGGACCCT TAGTGGTATA CGAGCAAATA CCACATAATG 1320 CTGCTATTCC TTGTAATGAA CCACTAGACT TGGTAAAGTC CATACCCAGT TTTGACGGAA 1380 AACAAGATGA ATATGTGGTA TGAAGAACCG CAGCAGTTAA CGCCTACGAA ATCTACCATA 1440 ATAATATAAT AATATATAAT AGCTAGAATG GGCTGCGCAT GTGCAGAACA AACTCCAATC 1500 GAGGTGACGC AGCAGCAAAT GGTTACTATG CGTCAGGGAG ATCTTCCTCT AATGACTTTT 1560 ATAATGAGAT TGAGAGAAAG CTAACTCTTA TTATTAGTAG GACTTTTTTG TCTTATGAGA 1620 CAAATACTGC CGCTATCTTA AAAAATAGAT CACGGCAGGA CGCTTTGAAC GCTTTTGTGA 1680 CCGGGCTAAA AAAATCAGTC CGAAATGTAG TTCTCTCAGC TGCGCCCAAA GATTTGCCAT 1740 CAGCATTGGC CGTGGCTCAA CGGACAGAGT CCTGCAATGA ACGAGCCTGG TTCGTTGCAA 1800 GCTTTAATAA AAATGTAGAA GAAAAATCTT ACAATTCAGA AAACCGTCGC CAAGGTAACC 1860 GTTTTCATAA CACCCCACAA AGTAATAACC ACAACAACTC CCAGGGTGCT TTCCAGAGAA 1920 ACTAAAACAA TAATCCACCG TGCACTAGTA ATCAAGCTAA TGGCTCTCAA TTTTCTAAAA 1980 ACCAAAGATA AAAAGGCCAA GCGCATCAGA ACAATGAATC AAAAAATCTT CGATAAGGTT 2040 ATACGCAAAA CCTAGGTCCT GAACCCATGG ACGTCGATCC CACATCACGT TCTAAATTCA 2100 GGAGCGAACG GAGAGCTCGC AGTAGTCAAC GGTTGAACCA GAACAAGAGC AGTCAAATGA 2160 CCAGGAGTAT AGGGTAAAGT CATCCTACGA AGCAGCGGAA ATCGAGGCTG ATAACACGTC 2220 CGATTCCGAA TCATGTAATT TTTTAGGGGA CGTCCCTGCT CCCCCAAATA ATTCGTTCGA 2280 TAGCGGGGCG AGAAATTAGG TTACTGTTGG ATACAGAAGC CTCCAAAAAT TACATAAAAC 2340 CTCTAACAGA ATTAAAACAC TTCAAACCGG TGGAAACACC ATTTGAAGTC ACATCAATCC 2400 ATGGTCATAC AAAAATAGAA CAAAAGTGTC TGATCCATCT ATTCAATGTT AAGTCATACT 2460 TCTTCTTGTT AAACAACCTG AACGAATATG AAGGAATTGT TAGACTGGAT TTGCCAAAAA 2520 AGGTCAATGC AAAAATTGAT CTAACAAAAA ACATCATCGA GCATGATCAT GGTACGGAGC 2580 AAATTTTTTA CTCAAAATGC AGGAATGGTA ACTTTATTAA CATCGATGAC GTGGACGTGC 2640 CGAAAGCCAT AAACGAAAAT TTCAAAAAGA TGATCAAAAA CAGATCAAAA GCCTTTGCGG 2700 ACCCAAACGA TTCCCTCCCC TTCAAAATGA ATACGGTCGC CACGATCCGC ACTGACGGGG 2760 AACCCGTATA TTCAAAACTT TACCCATATC CGATGGGTGT AGCCGATTTC GTCAATACGG 2820 AGGTTAAGCA ACATCTAGCA GACGGAATAA TAAGGCCATC CCGGTCGCCT TACAATAACC 2880 CAATTTGGGT TGTTGATAAG AAGGGTTTTG GCGGAGAAGG TCATAGGAAG AAACGTCTCG 2940 TTATTAACTT CAGGAAACTG AATCAAAAAA CAATTGATGA CAAGTATCCT ATACCATTCA 3000 TATCGACCAT ACTGTCGAAC TTTGGAAAAG CTCAGTACTT CACGACTCTT GATCTGAAGT 3060 CGGGCTTCCA TCAAATTGAG CCCTCGGAGC GCTTTCGAGA AAAGACAGCT TTTTAGTAGT 3120 ATGAATTCTG CAGACTTCCC TTTGCTTTAA AAAATGCGCC TAGTATTTTC CAATAGACGA 3180 TGTTCTGAGA GAACACATCG GCAAAACTCG CTATGTCTAC GTCGATGACG TAATATTTTT 3240 CTCCCAAACA ATGGAGAGTC ATGCCAACGA TATAAACACG GTTCTGAAAA CTTTGTGCGA 3300 TGCAGGTATG AGAGTGTCTG TAGAAAAATC TATGTTCTTT AAAGAGAACG TAGAATATTT 3360 GGGATTCATA GTGTCCCGAG GGGGAATTAA AACTTCACCC GAAAGGGTTA AGGCTATAAA 3420 ACAATTTAAA CCTCCATCGA CATTGTTAAG TCTCAGGTCA TTTCTGGGAT TGGCCAGTTA 3480 TTATATAATA GATGTTTCAT AAAGGGGCTT TTTAGCATCG CAAGACCTCT GACGAATATT 3540 CTAAAAGGTG ACAACGGAAA AATTGGTGCT ACCACTCAAA GAAAGTCAAA CTGGAACGAG 3600 CAGCGAAAAT CATTCGAAAA ACTAAGAAAC ACCCTGGAGT CTGAGGATGT CATTTTGGCA 3660 TACCCAGATT CCACTCAGCC ATTTGACTTG AAAACTGACG CCTCTGGAAG CGGCCTAGGG 3720 GCTGTTCTTT TACAGATTTA ATCGGCAGCG GCAGAAATCA CCGCAAATTG CAGAACGTGT 3780 TCCGAAGCAA AACACCAATC GTCACCCAGT GCAACAAACC ATAGCGGAAA CATCAATTCC 3840 TGGTTACACT GGGGAAAGTA TCCACATAGA TATATTTTGG ACTGATCAAA AGCATTTTCT 3900 AACCTGTATC GACAAGTTTT CAAAATCCGC TATAGTCCAA CCAATCGATT CAAGAGCAAT 3960 CGTAGATATC AAAACTCCGA TACTACAACT AATAAATCTG TTCACCAAAA TAAAAACAGT 4020 TTACTGCGAC AATGAAAGAT CTATCAATTC ACAAACCATA CGAACCATCC TAGAAAATAG 4080 GTATGGTATA CGGGTCTCAA ATGCGCACCC GTTGCACAGC ACATCTAATG GCCAAGTTGA 4140 GAGATTTCAT AGCACCCTAG GGGAAATCGC ACGGTGCATC AAGATAGATC AAAACATAAC 4200 CGAGACGAGC GACCTTATTC TATTCGGAAC AATAGAATAG GACAGAACTG TCCACTCGGT 4260 TACAATTAAA AAGGGTCATG AAATAGTTCA CGCTATTCCA CCAGATTTTA CGAGCACCAT 4320 AAGAGACAAA ATCAAAGAGG CCCAAGAGAA AACACTTAGG TACTCAAATG CACACAAATG 4380 CAATAAACAG TACCAAATAG GCGAAAAAAT CTGGTTAAAA AACCAACAGA CGCCTGGGTA 4440 CCAAATTAAC GCCACTCTGC TCAGAAGAGG TCATCGAGGC TGATCTCGGC ACGACAGTGC 4500 TTATTATGGA GCAACGAACA TTACATCGTA CGAGACTTAC GCGGACGAGA CGAAACACGC 4560 GATGGATTTC TTCGAGAAGG AGCACATGAG ACGGGTACTT GAAACGGACT AGGAACGAAT 4620 AGAGACTCTT CTGGACACAC TAAAGGTACG TCACAGACAT GCCCGTAGTC TTAATTTCTC 4680 TTAATTTTCT CTTAATTTCT CTTAATTTGC TTTGAAAGCA ATAGAGTGGA CACCTGACTC 4740 TGACGGCTGA GACCAGGTGA GGTTTTGACA GGAACAGCTA ACGGACTCGG TAAACGGACA 4800 GATAGATTTA AACAACAAAA TACAATTGCA ATTAAACACA ATGACCTCGT CCATGAATTC 4860 TATTTTAAAA TCGGACGACT TAGACACAGA ACATTTGTAC GAGACGATTT TGGCAAAAAA 4920 CCGTATTGTA ATTCAAGAAC TTGAAAATTT AATACTTGCA ATCACCCTTT CCAAATTAAA 4980 CGTAATAAGT CCAATAATCT TGAATGACGT TGACGTAAGG GAGATTGAAA AAACTTTTCA 5040 AAATCGAGAC CTATTGTACT TTTTAATAAA ATTTCCGAAG CCTTTGTTAA CTTGTAGAAA 5100 AATAAGAATA TTCCCGGTAC AGCATGAAAA TAGAATCTTA GATTTCGAGG ACGGTAGCAC 5160 GGTCGCGGAT TGCGGGACGG AAACCTTCGC CGTCAAGGAC TGCAATGTAT CACCACCTTC 5220 TGCAGGAGAT CGAAAGCGCC AACCTGCGCA CAACAACTCA TCTCTGGCAT GGTCGCCCAC 5280 TGCAACACCC AGCCTGGACA CTTGGACCCA CTCACCATGA TCGACTAGGG AATGCTCATC 5340 ACGAACGATG TAACGATAAA TATCACCGAC GAAAAGGGAA TAAGCCGGAT AATATCAGGA 5400 ACTTACCCGG TATGATATAC CGAAAAAATT AAAATAAACG GCACCCTTTT ACGTTAACAA 5460 TATCGGAACA TCAAAGAAGA AAGCCGCAGT TTCAGCTATG GCCCAAGTAA ACGTTCTGAG 5520 ACATATAGAG CGCCTTACTC TGTCCTGGAA AGAAAGATGA TGATCTTTCT TATCTACCGT 5580 TTGAAAGCCA AACCAACTAA GACCATTGAA TCCGAGGACG AATTCATCTT AAGACAAGGA 5640 GGAGTTAACA CGAGCAAGTC AGAGGGGTGC AACGACCCGC CGAAATGCCA TCCAAGACCG 5700 GATGTGCCAA CAACAACATC GACGACGCTG AATCAGCCCA GTAACGCATT TCGCTTTAGC 5760 GAAAGCGATG TGCArGGGTC AGCGTGATGC GAGGCGGGCA TGTGCGCGCT GGGTGACCGT 5820 CGAAGGCATG TAACTCCAGA CGGCACGAGG CGCCGACTAA GCGCTCTGCT AAGTCCGCTG 5880 CCTCCAACCG AAGACTTGGC GGCCTTAATA AATTCGATTT GTAGTTAGTT CGTAACAGAC 5940 CAGAATTCAG AGCAGTTGGC TCTAATAAAA GAACCATTTA ATAATACAAA CGTATTTGGT 6000 TTAATT 6006 // ID GYPSY11 standard; DNA; INV; 4428 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0067386; gypsy11. XX FT source nnnnnnnn:1..4428 FT SO_feature five_prime_LTR ; SO:0000425:1..452 FT SO_feature three_prime_LTR ; SO:0000426:3977..4428 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 6%. CC REPBASE states this to be a consensus sequence. XX SQ Sequence 4428 BP; 1783 A; 786 C; 737 G; 1120 T; 2 other; TGTGGTATGC GTACATATAT ATTTAGAGTA CATATATACA TAGAGTACAA TGTACTCAGA 60 ATACTTACAT AAATCACTAA CATAATTACA TACGCATTGG CAGTGCTAGC ATAAACAATA 120 ATGTTCAAGT GGAAGTGTTA TATGATCGCC CACTTATGAA GCACACTGCA TAAGTGCATG 180 GTCAGCATTC CCACGCTAAC CAAATGCCGC TATGTACATA ATGTGCGCAC ATAGCATTTT 240 CTCTCAACTC TCTTAACATA ArTATATAAT ATAATATATA ACAGCCGCTT CTCTGCCGAC 300 TTCACCAGGC AGCGAAGGCG CACTTCATAG CCTAAGAGGA ATATTGTATT AGCTTTAAGA 360 GTCAGTCGAG TTTTACAACT GATACGAGCT TCACGATAAA AATAAACGGA GCTATGCTCC 420 AGAAAAACAT CTAAAGGATT TTCAATATAA CATGGCGACC GTGACAGGAC AGGGACCTAA 480 GGCCTAAGGA CATAGGGCCT AAGGACTACA ATCACAATCA ACACAAAATT TGTATTGGAC 540 TGGACTGGAA AGGCAGATGG AAGACAAAAT TCTCAGTGAG TAGAGTAAGG CAATGAGAAA 600 AAAGCCCGGC GGTGGGCGGC AAAAGTTAAA ACCGAGAATA GCTCACCTAA GGTGGCTATT 660 ATATTCTCAA GCAGAGCTTG ACTTCCTACA ACAAGAATTA ACAAAAGCGC CAATAAAAAC 720 CAAAAGCCAA ACTCTTTCAT TCATCAAAAC CACAAATAAT ACAAAAATTA AAACTGTAAC 780 ACCACCACCC AAAGCCAGCA TACCAGTTTA CAAGGAACAA TGCCCGTTGG ACTTCGAGGG 840 ACAATGGAGA GCCTATTGCA AATACACTTG CAAACATTGT AAGGACATCA ACCAGCAGAA 900 GGAGACCTGT GCGACAAACA GCACCGGCAC CGCTTAACGC GTCAAGTTAA GCCAGTAAGG 960 GCACCTCTAA CCACCAACAG CAGCAGCAGT ATCAGCAGCA ACAACAGCAG CTTGGGCAGC 1020 CAGCACCAGC ACCGTATGGA AAGGAAAAGG AAACCGACAG CGCTTCCGAG AGCGCAATTT 1080 TTTTTTTCTC AATAGCACTG CGTTGCAAAA ATTTTTTTTA ATGCACTACG ATGCATATTT 1140 TATTTATATA GCCTATAAAG CTATACACAT CCAACAGGCA GGAACATATG TAAAAATTAT 1200 AAGCAAACAT TTTTTACATA TACATTTTAG CAAAGAAATA ATAAACCTAA AATTCAAAAA 1260 TAAAGATACA AATATTAGAA AAATGTTTAA CGTAAATGTC AATAGTGACA AACAAAATAT 1320 ATAAAGAATA AATTTATGAA AATACTGAAG AGATCTTAAA ACCCAATATG GGCTTAGTTG 1380 ATACAAAAAC AGAACCCACA TTTTTTTCAA ATGCAAAATG CCACAAACAA AGTGGAGATA 1440 GAGTCAAACA ATTTAAGCTA TATAACAATA ATCTTGTTAG CAATTTTTAC TAATGATCAT 1500 ACATGCACTC ATAAAAGTGT ATAAACTCCA CAATAAATGC CTGAAGAAAC GATATAATAG 1560 CATGGCCGAT AGCCTAGAAA CAATTTAATT TTTAAATAAT GAGCTCATTA ATGAAAGAAA 1620 AATAATAATA AAATCTAAAA AAAATTTTTT TTAAATACTA TTATGTATTA TGTATAATCC 1680 GAATTATGGA TTGGCAAAAC TTAACAGCCA TCTTAAATAA AATAAAAGAA AAATTTGATA 1740 GGTCACATAA AAGTTTGTCT CAGAATAGAA CAATCCAAAA ACAAACTGCT AATGACCATA 1800 CAACCTTTCT GGTAGAATCC TTCAATCAGG TAAGATCCCT TATACATGAC CAAAGAGGGA 1860 AACTAGACAA AAAACAGTGG ACAACAGTGT CCAAATTTTT AATTAGACAA GAAATACTAT 1920 CCCTATTGTA ATAAATAGCA AAGAAAGGGT TCGATCTAAA CATATCGATA CCAACAATTC 1980 TAAATTCCCC TTTAACTCTA AGTGAAGAAG AAACAGAATA ACTCAATGAT TCTGACTCTG 2040 AACTAGAAAT CAAAGAAGAA GACTTGAACG ATCTAACAAT ACCAGCAATG TTGCAATTGC 2100 CAGAAGATTA AAAGAATATA GAAGATCAGA AAATAAAATT AAGCAAAATG ACAGACAATT 2160 CAGCCGCCAT TAGGGTCTAC ATAAGGGAAG TGTGGAGCGC AGTACCGGAG TTTGATGGAC 2220 AAAAGATCCA TCTTCAGAGG TTTATTGTGG TCATCAAATT AGCAGACCTT GCAAAAGGAT 2280 AATTTAAGGA CATTGCAGTC CAAGTAATAA AATCAAAATT GATTGGCACT ACTCTAAACC 2340 TGGTACTGAG CGAATCTACA ATTGAAGCAA TAATTAACAA ACTGCACACT TCAATTGTTG 2400 GGGAGACATC CCAAAACATC AAGGCAAAAT TATCCACTGT ACAACAAACA AAAAGGCAAA 2460 ACAGCTACAC AGTTCACAAC CGAAGTTGAT AATTTACGTG AGCTCTTAGA AGCCTCGTAT 2520 ATCGATGAAG GTTTACAAAG TGAGCAAGCC ATCGATGTTA TGATCAAAAA GACCAAACAC 2580 GAAAGTGTGA AAACTGTGCT AGAAGCAGGT ACGTGCACAA CTATGGATGA TGCCATAGGT 2640 GCATAAAAAC TAGCACCAGA GTAACCGGAA ATGTGAATCA TTTAATGTAA GAGAGGTTAC 2700 TCTAACTCTA GAGGCCAAGG TAATGGTCGC GGCAGAGGCA ATGGCCATTA CAATAGTCGA 2760 TACAATAACA ATAATAACAA TTACGGCGAT AACGGTATCC AAAAGAATGG TAACAATTAT 2820 AGATGCCGTG GCAACTCAAA GAGAGGAAGC CGAAACAATC AGAATTGAAA TAATAACAAT 2880 AACAATGATA ATGCTAGTGT GAGAGTAACC CAAAATAATT CGGGAAACTC GAAAAAAGCC 2940 TCAGATGCAC GGCAATAAAG CTCATGTACA TTTAATCAAT CTAAGTATTA ATATATTGCA 3000 TTTGATCATG TGCATTTGCA TTTGACCCAA TTCGACAGAG TGATAGTGGA CACTATAGGT 3060 CCTCTACCAA AGTCGGAAAA TGGCAACGAA TATGCAGTCA CTCTCATTTG TGACTTAACT 3120 AAATATTTAG TTGCGATTCC CATACCAAAC AAATGCGCAA CAGCAGTCGC TAAAGCAATT 3180 TTCGAATCTT TTATTCTTAC TTATGGTCCA ATGAAGACGT TCATTACGGA CATGGGGTAC 3240 ATAATATAAG AATTCAATCA TTACAGATCT TTTCAAATAT CTAAAAATTA AAAATATAAC 3300 GTCTACCGCT CACCATCAAA CAGTAGGTGT AGTCGAAGGA AGTCATAGAA CTCTAAATGA 3360 ATACATACGA TCTTACATAT CGGTTGATAA AACTGATTGG GATGTATGGA TTCATTATTT 3420 CGTCTATTGT TTTAATAGAA CCCCTTCTAT GGTACAAAAT TATTGTCCAT ATGAACTTGT 3480 TTTTGGTAGA ACAAATAATT TACCCAAAAA TTTTACTAAT ATAACTAGCA TAGAACCAAT 3540 ATACAATATA GATGATTATG CTAAGGGAAA GTAAATATAG ATTAGAAGTA GCATATAAAA 3600 GAGCTAGAAC TATGCTTGAA AAGCATTGAA AAAATAAGGA AAATTATGAT TTACAAACAC 3660 AAAATATAGA AAAAACAGTA GGATATAAAG TTCTATTAAG AAATGAAGTA GGTCGTAAAT 3720 TAGATTTTAA ATATACGGGA CCCTATACGG TAGAAAATAT AGAAGAAAGA GACAACATAA 3780 CAATATCAAA CAATAAAAAT TAAAAACAAA TATTACATAA AGATCGATTA AAAGTTTTTT 3840 ATTCATAAAT GACGTCATGA TTTAATACAA AAAGCATTAT AATAATGGAA AAATACAATT 3900 TAAAATGTCA AAAACAAAAA ATATATATAT ATAAAAAAAA AAACAACAAA ATAATTATTT 3960 TTTAAAAGGA GGGAGATGTG GTATGCGTAC ATATATATTT AGAGTACATA TATACATAGA 4020 GTACAATGTA CTCAGAATAC TTACATAAAT CACTAACATA ATTACATACG CATTGGCAGT 4080 GCTAGCATAA ACAATAATGT TCAAGTGGAA GTGTTATATG ATCGCCCACT TATGAAGCAC 4140 ACTGCATAAG TGCATGGTCA GCATTCCCAC GCTAACCAAA TGCCGCTATG TACATAATGT 4200 GCGCACATAG CATTTTCTCT CAACTCTCTT AACATAArTA TATAATATAA TATATAACAG 4260 CCGCTTCTCT GCCGACTTCA CCAGGCAGCG AAGGCGCACT TCATAGCCTA AGAGGAATAT 4320 TGTATTAGCT TTAAGAGTCA GTCGAGTTTT ACAACTGATA CGAGCTTCAC GATAAAAATA 4380 AACGGAGCTA TGCTCCAGAA AAACATCTAA AGGATTTTCA ATATAACA 4428 // ID GYPSY12 standard; DNA; INV; 10218 BP. XX AC AE003789; XX DR FLYBASE; FBgn0067385; gypsy12. XX FT source AE003789:>46854..38973 FT SO_feature five_prime_LTR ; SO:0000425:1..2336 FT SO_feature three_prime_LTR ; SO:0000426:7883..10218 FT SO_feature CDS ; SO:0000316:3141..4553 FT /db_xref="FLYBASE:; gypsy12\gag" FT /db_xref="" FT /protein_id="" FT /translation="MGLDRSPTRKSPSVSNPVCKLCAAEISTQDLYVTTCHHEFYRECI FT GNHFKKSEICSRCKLTCRPPAEATERVGRETRSKTKNRRNSRRGSFDISQRCGEKLAVK FT LKIAATVDGGPSTSASGANANEASSSAVSANAALLAMERRLLATLSEKMADLVQNAITS FT SMQRIMPTPSPAVVVTASEMSADHPNAYERQYLASPNPVPSPRSASSDLFDRPDKVVHI FT LNGWKIKYSGVGVSVDNFIYRVEAVTRQTLNGNFNLLCRNISVLFEGKANDFFWRYHKF FT DRVATMGTERFCTALRLQFRQSRDDGDIEELIRNTKQKPNETFDSFYDTVSELVDQLEQ FT PWTANKLVRVLRNNLRPEIRHEILNLDVRTVSELREICKRREAFLADVRRCSSYAKDTP FT FKREISEVCHESEDEVRSTYEAENDIESFSLVCWNCRIEGHRYQECIAERRVFCYGCGA FT ANTYKPSCRKCSKNFKVGMSKLPVKPKTSNAARNQSTMTDQ" FT SO_feature CDS ; SO:0000316:4740..7710 FT /db_xref="FLYBASE:; gypsy12\pol" FT /db_xref="" FT /protein_id="" FT /translation="KKCKASLDYISSIPTGPRDPRPFLPMRLLNCLVYGLLDSGASISC FT IGGGVVQAAMENEKFKSLIGEAATADGNSQRIVGLLKIEVEYGDIKKLLKLYVVPSLKQ FT DLYLGIDFWKLYDLLPANLKIAEILSPEPNQQTVVDQHELCEGDKAKLANVINCFPSFS FT QEGLGKTNLVSHSIDVGTARPVKQRHFPVSPAVEKAMYAEIDRMLRLGVIGESESAWSS FT PIVMVTKPGKVRICLECRKVNSFTEMDAYPLPQINGILSRLPRAEYISSLDLKDAYWQV FT PLDPKSRDKTAFTVPGRPLYQFKVMPFGLCNATSTMSRLMDKVVPAHLRNEVFIYLDDL FT LIVSSCFESHLNVLRELALQIKRAGLTLNVAKSHFCMRRVRYLGHIIGDGGIRTDPEKV FT SAITDFPLPKSLKSLRSFMGLCGWYRKFVANFATLSAPLTDLMTTKRKFLLTKEAIEAF FT SKLKECLSKAPVLCSPDFAKPFAIHCDASKSGVGAVLVQVSEEGDERPIAFVSKKLNKA FT QRNYTVTEQECLAAIVALKNFRAYVEGLPFKIITDHASLKWLMSNHDLNSRLARWALAL FT QRFKFEIEHRKGSLNVVPDTLSRVNEEIVAAMDLQEDLIVDFDSEFFQSGDYVKLVETV FT KENTSNFSDLKVESGFLYRKAEHLTGERMHDEYAWKLWVPKELVSKILARAHDSPLAAH FT GGIHKTLERIRRYYFWPGLVSDVRAYISACEVCKSTKSQNFTLRPPLGKAPESQRFFQR FT LFIDFLGPYPRSRSGNIGIFIVLDHFSKYVFLKPVKKIDSSVVIKYLEDELFMTFGVPE FT VILSDNGSQFRARTFQRLIRYGVKHTLTAVHSPQANASERVNRSVIAAIRAYLRLDQKD FT WDEFLSRICCALRSAVHSSIGTSPYYMVFGQHMITSGSTYSLIRRLNLLDDRSLKFDRH FT ESFEIMRKQAVDQMRNKHNENEKRCNIRSRVVSFVEGQEVYREISSQAVSKPVTTPSLD FT RRS" XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref CC Repbase says the LTRs are divergent by 1%. CC Repbase says that the translations are 'conceptual', based on alignments CC with similar proteins. XX SQ Sequence 10218 BP; 3033 A; 1995 C; 2287 G; 2903 T; 0 other; TGTAGAGGCG ACTGAAGCAG TGGGTGAGAA AAATCTCGAT GAATAATCGA TAGGGCGGTC 60 AGCTGTGACA CACTAAAGTG AGGTCTTACT AGATGTAGCA AGAAGTAGCT AGAATATACC 120 AGACTGGATG CAACCCTTAT TAGCTGCACC CGATAAGTTT GGTATAATTC GGTTATCGGA 180 GAAAATGTGT TGTTCAGCTG ATCGTGGTTT CCGCCGATGG CCTACGCCTC GACTAATGTT 240 TCCGGTTTGC CTTCTTTTCG ATCTGAACAG TCAACGCGTC AAGTTGCGAA AGTGCGTCGA 300 AAAATTTGAA CTTGCGGTCA TACGGAATTT TTTTGTCGCG CGAAATATCG AAAATACGGC 360 GTTCCTTCGC CTATAGTGAT GCGCGAAATA ATTCCGGCTT GTGTTTCCCG GCGATATTCA 420 GCCGAAGAGA TCGGCGTTCG TCACGGCCAT CTCACTGCCG CCTTGCCGCC AAATCCTAGC 480 TTCGACTGGG TCAAAGTGCA GGTTAGTGAG TTTCGAAATA TTGGTGAACC GTAGTTTGAT 540 ACGGGAATAT AATCGCAGCA GTTTTCGCTG GACATCTTAT ACTTTCGCAT CACTGCCCGG 600 GTCCTACGAT TTTTCGGAGC CGCCACCGGC GGGCAGAAGG AGAGACGGCA ACTGCGCCAG 660 GCGAGTAGTG GAGCGCCTTT CCAGCAAGCC GGAGCGAAGG ACCAGCCGCA GCAGCAGGAC 720 AGCGACCGCC CTAAGTCGGG GCCACCAGGA GGGATCTTTA AACAAATAAA AATGCAGCCT 780 AAGTAGAGCT TAGAGTGAGG TTAGAGTGAA TGGAAAATTT TCTTTTTGCT TGATTGGCAG 840 TCCAGAATGC CCGAAATTAA ATTAAATCAA AGTTAAATCG AGCAAAATTT CTAATTATAC 900 AAAAAAATGC TGAAAGCCAG ACCTAACCTT ACTAGAGAGT ATTAATTCAA CCTGATCGCC 960 ATCTTGGACA TAGAGTGGCA GAAATTAAAA TTTTTCCTAT GTAGTTTAAG TTTAACGTTA 1020 AGCTATAAGA ATATGTACTC GAATAGGAAA CTCGGTCATA CTGTTTAATA ATTTTTCTTG 1080 GATTTAAAAA AAATGAATTC TTATTAGTGA TTAAAATTTT TATATAGTAA GTGATTTAAC 1140 CTAAGCTAAT TAATTGTTTT AGAAGAGTTC TCTTTCGAAA AGTGCTCATT TAAATTTTCG 1200 AGTAGGGATT TATAAAAGTT TGATCGCAGT CGTGTAATCC AAAATTATGC CTCCTTATAG 1260 TGTTTGTGGA AAAAAAAGCC ACCTTTTCGT TTCAGCTCAT CTGCGCCGGG AGATCGATCT 1320 TTCGCCGAGG AGTGAAAAGT TGAGTGAGTC TCATTGATAA AACTTTATTT TATTGGAAAT 1380 TCGGTAGTAA ACCGATTACA TTTTGATAGA GGTCATCTTA TATTGTATAC ACACTGCAAA 1440 TAAAAAAAGT AATAATAATG GAGTGGCGAG AGCAGCCATT TGAAATATAA TGAGAAAAAG 1500 ACAAAAAAAT GAAAAAAAGA TCTAACCGAA TTTACTCGCG GCTCTCTCTC AAACAGAATT 1560 AAATTTTTTT TCGTTAAAGA TTCGTTATAA AGAAAACTCA AAGAAAATAA ATAAAATTAA 1620 TAATTCACAG AACGAGATTA TAGAGAGAAC GAAAATTATT AATGATTTAA ATGAAAGCTT 1680 TGTTCCTTCC CAAAAAAAAA AAAATATATA TATTTATACA CTAGTATTTT TAACTTTTAA 1740 TTGTAATGAT CTCGAACATC GAATTGAATC TACATTTTCT GATCTCGCAA CTGAGAGAAA 1800 AACTTTGCTT TGAATTCCTT TGAGTTACAC TAGTCTGTAT TTTTTATATT TACTTGGTTC 1860 CTTTAGTATG TACTTATTTT TAGATATAAA TATTGTTATT AATTATTAAG CATTGATTGA 1920 TTTTAAAAAT ATGAATTTCT AAAAACCTAA ATGCTGACTC GGCTGATGTT TCTCCTGACG 1980 TAGGGCACTA AAGGGGCCGC TAGAGCTATG CTCTTAAGGA AAGTGGTCTA AGGTAGGGTA 2040 GCGCAATGCG CGAACACGAT TCCACCTGTG TTTCGCAGGC GAAAAATTCT CAAACTTTCC 2100 TCCCATTCTA GAACTGTCCG CTTCCGAAAG TATTAATTGG CTTCCTCCGA TTTTTGTATT 2160 CTAGCTCCTT CATTTTGTTG TTGTTGCAAG CACGCTCAAT TTCTTGATCT TATTAATTGT 2220 TGTCTGTAAC GAAAAGAGAT CTGTACAAGC AAGACCACCA CACCCGGCCT CTAAAGCGTG 2280 CACCCGCAAG TACAGGTACT CCAGTCGTCC ATCCGTTTAA TTCGTTACAC GTTACAGTTT 2340 AGTTAAGTAT AATAATAAAT AAGATAAAGT GGCGCCCAAC GTGGGGCCTT AGTGAGTTAT 2400 TTTAAAATTC ATTAAGACTC GCGAATTAAT ATCTTCAATC CTTTCTTTAC ATTTTCTGTC 2460 TCACTTCCAA AAATTTTTGT GTTTAGAATT GGTTGCGGAG TACACGTCAA GTCCATACTA 2520 ATATTGCTAA CTGGCCATAA CTAAGACTGA TCACAAAAAT TTTCCTAAGT GTAGCAAAAG 2580 GAAATGTGGA TTTTGCTGAG GAAAACAGAG TAACAGTAAC TTCGGTCAGG TTTCTGTCCC 2640 GGTTTTATCT ACTTCCTTGA GCTAACTAAA TGCTATTCTA AAAAAAAGAT CATTAACGCA 2700 ACATGACGGA AAATCTTTGC GTTACTTAAG AGAAATGCAA GATCCTCATA TAGTTGACGA 2760 TTTTCCTCTT TTTACTATTC GTTTTAAATA AGATAGAGTT TCCTATAAGT TGGAATATTC 2820 GTGCCAAGGA AAAGGCACGA TCCAGGACTT CACTGTTACA CCTACTGGTG GTGTTGAAAT 2880 TTAGTCTAAT ATGGAATATC TTATTTTAAC GAAACTCCTA TGCCTTCGTG AGAATTTTGA 2940 GTCTAATATA GAATATCTTA TTTTAACGAA ACTCCTATGC CTTCGTGAGA ATTTTGAGCC 3000 AAAATAATAG ACAGCCCTAC GAAAGGAGAA CATCAAGTTC AGTTTTATTA TCTTTTCGCG 3060 CCTTATCGTT GATATTTTTA CGTATAAGTT TATTCTATTT TTGCTCGCCA TTTTTGAACC 3120 GGTAATTTTT CGCAAGTATG ATGGGGTTAG ACCGATCACC AACTAGGAAA TCTCCTAGTG 3180 TCTCAAACCC TGTTTGTAAA TTATGTGCAG CCGAGATCAG CACACAAGAT TTGTATGTGA 3240 CAACCTGTCA CCATGAATTT TACAGGGAGT GCATTGGCAA TCATTTTAAA AAAAGTGAGA 3300 TCTGTTCGAG GTGCAAACTA ACCTGTAGGC CTCCAGCCGA AGCGACCGAA AGGGTAGGGA 3360 GAGAAACTCG CAGTAAAACT AAAAATCGCC GCAACAGTAG ACGGGGGTCC TTCGACATCA 3420 GCCAGCGGTG CTAACGCAAA TGAAGCTAGT TCAAGCGCTG TAAGTGCTAA TGCAGCTTTG 3480 TTAGCCATGG AGCGTAGGCT TCTAGCAACG TTGTCAGAGA AGATGGCTGA TTTAGTACAA 3540 AACGCTATCA CCTCAAGCAT GCAGCGAATA ATGCCCACTC CGAGTCCAGC TGTAGTCGTT 3600 ACAGCCAGTG AAATGTCCGC TGATCACCCA AATGCATATG AGAGGCAGTA TTTAGCATCG 3660 CCAAATCCAG TTCCTTCTCC ACGAAGCGCT TCTTCCGATT TGTTTGATCG GCCAGATAAA 3720 GTCGTCCACA TATTAAATGG TTGGAAAATA AAATATTCTG GTGTAGGAGT GTCAGTGGAC 3780 AACTTTATAT ACAGAGTCGA AGCGGTTACA AGACAGACGT TGAATGGAAA TTTCAACTTG 3840 TTATGTAGAA ACATAAGTGT ATTGTTTGAA GGCAAGGCGA ACGACTTCTT TTGGCGCTAT 3900 CATAAAGCGG TGAAATTCTG TGAGAGAGGT TTTGCACAGC CCTACGACTG CAATTTCGAC 3960 AGAGTCGCGA CGATGGGGAC ATAGAGGAAC TGATCAGGAA TACCAAACAA AAACCAAATG 4020 AAACTTTTGA TAGTTTCTAC GACACGGTAT CCGAGTTAGT CGACCAGCTA GAACAGCCTT 4080 GGACAGCCAA CAAACTTGTC CGCGTGCTAA GAAATAATCT TCGTCCTGAG ATCCGCCATG 4140 AAATTCTAAA CTTAGATGTC CGAACGGTGT CAGAGTTGAG AGAGATTTGT AAGCGAAGGG 4200 AGGCATTCCT GGCTGACGTC AGAAGATGCA GTAGCTACGC GAAAGATACT CCGTTTAAAC 4260 GCGAGATCTC CGAGGTCTGT CACGAGAGTG AAGATGAGGT GAGGTCAACA TACGAAGCTG 4320 AGAACGACAT CGAATCTTTT TCCCTAGTGT GTTGGAATTG CCGAATCGAA GGACACCGAT 4380 ACCAAGAATG TATAGCCGAA AGACGAGTGT TTTGTTACGG GTGTGGTGCA GCAAACACCT 4440 ACAAACCAAG TTGTAGAAAA TGTTCAAAAA ACTTCAAGGT CGGCATGTCG AAGTTGCCAG 4500 TCAAACCGAA GACTTCAAAT GCCGCAAGGA ATCAGTCGAC AATGACCGAT CAGTAGAGAA 4560 CACCGATAAA AGTGTGTCTC TACCCCTCCC TGACGTACCA ATTAAACCGG CTATTAGACT 4620 GCTTCACAGC AAAATTTCGG AATTACGAAA CGTCTGCATT TCGGATTCCA GAAAAACTGC 4680 AATTTTTAAA CGGAAATCTC GTAGTGCTCG ACGCTTGAAA TTGTTCTGGA AGAACGTAAA 4740 AAAAATGTAA AGCCAGTTTG GATTACATCA GTTCTATTCC AACAGGACCT CGAGACCCTC 4800 GACCGTTCTT ACCGATGCGC TTATTGAATT GCCTGGTCTA CGGATTGTTG GATTCAGGCG 4860 CATCGATTAG TTGTATTGGA GGAGGGGTAG TGCAAGCTGC GATGGAAAAC GAGAAGTTTA 4920 AGTCGTTAAT AGGAGAAGCT GCGACAGCAG ACGGGAATTC TCAACGTATA GTAGGACTGC 4980 TAAAAATAGA AGTGGAATAC GGTGACATCA AAAAGCTTTT GAAGTTGTAT GTCGTTCCAT 5040 CGTTAAAACA GGATCTCTAT TTGGGAATTG ACTTTTGGAA ACTATATGAC CTCCTTCCTG 5100 CTAATTTGAA AATAGCTGAG ATATTGTCGC CTGAGCCCAA CCAGCAAACG GTGGTGGATC 5160 AGCACGAATT ATGTGAAGGT GACAAAGCCA AGTTGGCCAA TGTAATCAAC TGTTTTCCTT 5220 CCTTTAGCCA GGAAGGGTTA GGTAAGACGA ACTTGGTCTC TCATTCGATC GATGTGGGCA 5280 CCGCTAGGCC TGTGAAGCAA CGGCATTTCC CTGTCTCCCC CGCCGTCGAA AAAGCAATGT 5340 ATGCGGAAAT CGACCGGATG TTACGCTTAG GGGTGATAGG GGAGTCTGAG AGTGCTTGGT 5400 CTTCGCCGAT CGTGATGGTG ACTAAACCAG GCAAAGTCAG AATTTGTCTT GAATGTCGCA 5460 AGGTTAATAG TTTTACGGAG ATGGATGCAT ATCCGTTGCC CCAAATAAAC GGGATACTGA 5520 GTCGTTTGCC GAGAGCTGAG TACATATCGA GCCTAGATCT GAAGGATGCA TATTGGCAGG 5580 TCCCTTTGGA TCCTAAGTCT CGGGACAAAA CAGCTTTTAC CGTCCCGGGT AGACCGTTAT 5640 ATCAGTTTAA AGTCATGCCT TTCGGGTTGT GTAACGCCAC GAGCACTATG TCACGGTTAA 5700 TGGACAAAGT AGTGCCGGCT CATTTGAGAA ACGAGGTTTT TATCTATTTA GACGACCTAC 5760 TAATAGTATC TTCTTGTTTT GAGAGCCATT TGAATGTACT GAGGGAGTTA GCCCTGCAGA 5820 TAAAGCGCGC GGGCTTAACG CTAAACGTCG CCAAAAGTCA CTTTTGTATG CGACGAGTAC 5880 GCTATTTGGG CCACATTATC GGAGACGGTG GAATTCGCAC AGACCCTGAA AAGGTGTCCG 5940 CGATTACCGA TTTCCCATTG CCTAAAAGTT TGAAAAGCTT GCGCAGTTTC ATGGGATTGT 6000 GCGGATGGTA CAGGAAATTT GTCGCAAACT TCGCGACACT TTCTGCACCA TTGACTGACT 6060 TGATGACCAC GAAGCGGAAG TTTCTACTAA CGAAGGAGGC AATTGAAGCG TTCAGCAAGC 6120 TCAAAGAGTG TCTTAGCAAA GCTCCAGTCC TGTGTAGTCC GGATTTTGCG AAGCCGTTTG 6180 CCATACATTG CGACGCTAGC AAGTCAGGCG TGGGTGCCGT GCTGGTACAA GTGTCTGAAG 6240 AAGGTGACGA GCGTCCTATC GCTTTCGTTT CGAAGAAGCT GAACAAAGCT CAACGAAATT 6300 ATACCGTCAC AGAGCAGGAG TGTTTGGCCG CAATAGTAGC TCTCAAAAAC TTCAGAGCAT 6360 ACGTGGAAGG ACTCCCTTTT AAAATAATAA CCGACCATGC TTCGCTCAAG TGGCTAATGT 6420 CCAATCACGA TCTAAATTCA CGACTGGCGC GATGGGCATT AGCTTTGCAG AGATTCAAGT 6480 TCGAGATTGA ACACCGTAAG GGTTCTTTAA ATGTCGTCCC GGATACATTG TCTCGTGTTA 6540 ACGAGGAGAT TGTAGCCGCG ATGGACTTGC AAGAAGACTT AATTGTTGAT TTCGATTCTG 6600 AATTTTTCCA GTCCGGTGAC TACGTAAAGT TGGTAGAGAC CGTAAAGGAA AATACCTCGA 6660 ATTTTTCTGA TCTCAAGGTG GAGAGCGGGT TTTTATATAG AAAAGCCGAG CACCTGACTG 6720 GAGAACGAAT GCATGACGAA TACGCCTGGA AACTTTGGGT CCCCAAAGAA TTGGTATCGA 6780 AGATTCTGGC TCGCGCGCAC GATAGTCCGT TAGCTGCGCA TGGTGGCATA CACAAAACCT 6840 TGGAAAGAAT ACGGCGATAT TACTTCTGGC CCGGTCTCGT TTCGGACGTG AGGGCTTATA 6900 TTAGTGCGTG TGAGGTTTGT AAGAGTACAA AATCTCAAAA TTTCACTCTC AGACCACCAT 6960 TGGGAAAAGC GCCTGAGTCT CAGCGATTCT TCCAGCGTTT GTTCATTGAT TTTCTCGGAC 7020 CGTATCCTAG GTCAAGGAGC GGCAACATAG GAATCTTTAT TGTTCTGGAT CATTTTTCGA 7080 AATACGTGTT CTTGAAACCC GTAAAGAAGA TTGATTCCAG CGTCGTCATA AAGTATCTGG 7140 AAGACGAGTT GTTCATGACG TTCGGAGTGC CCGAAGTGAT ACTGTCTGAC AATGGTTCCC 7200 AATTTCGAGC TAGGACGTTC CAGAGACTTA TACGGCGTTA AGCACACGCT GACCGCTGTC 7260 CACTCGCCTC AGGCAAATGC TTCTGAGCGC GTGAACAGAT CAGTAATTGC TGCAATCAGG 7320 GCCTATCTGC GTCTCGACCA GAAAGACTGG GACGAGTTTC TCAGCCGAAT ATGTTGTGCG 7380 TTGAGGTCGG CGGTGCATTC TAGCATTGGT ACCAGTCCCT ATTATATGGT ATTTGGGCAG 7440 CACATGATCA CGTCAGGGTC GACGTATTCG TTGATCAGAC GCCTAAATCT CCTGGACGAT 7500 CGTTCTCTAA AGTTCGACCG GCACGAATCT TTCGAGATAA TGCGGAAGCA AGCTGTTGAT 7560 CAGATGAGAA ACAAGCACAA CGAGAACGAG AAGCGGTGTA ATATCCGTTC TCGTGTGGTA 7620 TCGTTTGTCG AAGGGCAAGA AGTTTATCGC GAAATTTCAA GCCAAGCTGT TTCCAAACCG 7680 GTTACAACGC CAAGTTTGGA CCGACGTTCG TGAAGTCTCG AGTTCGGAAG AAGATCGGCA 7740 ACGCGTATTA CGAGCTGGAG GATCTCCAAG GACGAGTTGT GGGTACCTAT CATGCCAAGG 7800 ACATTCGGCA GTAAGGTGCT TCCAGTCGGT CTCAGCCTAG TGTGTCTATG AGTACCCTAG 7860 TCTGAGTGTA GCGGGGGGGT TTTGTAGAGG CGACTGAAGC AGTGGGTGAG AAAAATCTCG 7920 ATGAATAATC GATAGGGCGG TCAGCTGTGA CACACTAAAG TGAGGTCTTA CTAGATGTAG 7980 CAAGAAGTAG CTAGAATATA CCAGACTGGA TGCAACCCTT ATTAGCTGCA CCCGATAAGT 8040 TTGGTATAAT TCGGTTATCG GAGAAAATGT GTTGTTCAGC TGATCGTGGT TTCCGCCGAT 8100 GGCCTACGCC TCGACTAATG TTTCCGGTTT GCCTTCTTTT CGATCTGAAC AGTCAACGCG 8160 TCAAGTTGCG AAAGTGCGTC GAAAAATTTG AACTTGCGGT CATACGGAAT TTTTTTGTCG 8220 CGCGAAATAT CGAAAATACG GCGTTCCTTC GCCTATAGTG ATGCGCGAAA TAATTCCGGC 8280 TTGTGTTTCC CGGCGATATT CAGCCGAAGA GATCGGCGTT CGTCACGGCC ATCTCACTGC 8340 CGCCTTGCCG CCAAATCCTA GCTTCGACTG GGTCAAAGTG CAGGTTAGTG AGTTTCGAAA 8400 TATTGGTGAA CCGTAGTTTG ATACGGGAAT ATAATCGCAG CAGTTTTCGC TGGACATCTT 8460 ATACTTTCGC ATCACTGCCC GGGTCCTACG ATTTTTCGGA GCCGCCACCG GCGGGCAGAA 8520 GGAGAGACGG CAACTGCGCC AGGCGAGTAG TGGAGCGCCT TTCCAGCAAG CCGGAGCGAA 8580 GGACCAGCCG CAGCAGCAGG ACAGCGACCG CCCTAAGTCG GGGCCACCAG GAGGGATCTT 8640 TAAACAAATA AAAATGCAGC CTAAGTAGAG CTTAGAGTGA GGTTAGAGTG AATGGAAAAT 8700 TTTCTTTTTG CTTGATTGGC AGTCCAGAAT GCCCGAAATT AAATTAAATC AAAGTTAAAT 8760 CGAGCAAAAT TTCTAATTAT ACAAAAAAAT GCTGAAAGCC AGACCTAACC TTACTAGAGA 8820 GTATTAATTC AACCTGATCG CCATCTTGGA CATAGAGTGG CAGAAATTAA AATTTTTCCT 8880 ATGTAGTTTA AGTTTAACGT TAAGCTATAA GAATATGTAC TCGAATAGGA AACTCGGTCA 8940 TACTGTTTAA TAATTTTTCT TGGATTTAAA AAAAATGAAT TCTTATTAGT GATTAAAATT 9000 TTTATATAGT AAGTGATTTA ACCTAAGCTA ATTAATTGTT TTAGAAGAGT TCTCTTTCGA 9060 AAAGTGCTCA TTTAAATTTT CGAGTAGGGA TTTATAAAAG TTTGATCGCA GTCGTGTAAT 9120 CCAAAATTAT GCCTCCTTAT AGTGTTTGTG GAAAAAAAAG CCACCTTTTC GTTTCAGCTC 9180 ATCTGCGCCG GGAGATCGAT CTTTCGCCGA GGAGTGAAAA GTTGAGTGAG TCTCATTGAT 9240 AAAACTTTAT TTTATTGGAA ATTCGGTAGT AAACCGATTA CATTTTGATA GAGGTCATCT 9300 TATATTGTAT ACACACTGCA AATAAAAAAA GTAATAATAA TGGAGTGGCG AGAGCAGCCA 9360 TTTGAAATAT AATGAGAAAA AGACAAAAAA ATGAAAAAAA GATCTAACCG AATTTACTCG 9420 CGGCTCTCTC TCAAACAGAA TTAAATTTTT TTTCGTTAAA GATTCGTTAT AAAGAAAACT 9480 CAAAGAAAAT AAATAAAATT AATAATTCAC AGAACGAGAT TATAGAGAGA ACGAAAATTA 9540 TTAATGATTT AAATGAAAGC TTTGTTCCTT CCCAAAAAAA AAAAAATATA TATATTTATA 9600 CACTAGTATT TTTAACTTTT AATTGTAATG ATCTCGAACA TCGAATTGAA TCTACATTTT 9660 CTGATCTCGC AACTGAGAGA AAAACTTTGC TTTGAATTCC TTTGAGTTAC ACTAGTCTGT 9720 ATTTTTTATA TTTACTTGGT TCCTTTAGTA TGTACTTATT TTTAGATATA AATATTGTTA 9780 TTAATTATTA AGCATTGATT GATTTTAAAA ATATGAATTT CTAAAAACCT AAATGCTGAC 9840 TCGGCTGATG TTTCTCCTGA CGTAGGGCAC TAAAGGGGCC GCTAGAGCTA TGCTCTTAAG 9900 GAAAGTGGTC TAAGGTAGGG TAGCGCAATG CGCGAACACG ATTCCACCTG TGTTTCGCAG 9960 GCGAAAAATT CTCAAACTTT CCTCCCATTC TAGAACTGTC CGCTTCCGAA AGTATTAATT 10020 GGCTTCCTCC GATTTTTGTA TTCTAGCTCC TTCATTTTGT TGTTGTTGCA AGCACGCTCA 10080 ATTTCTTGAT CTTATTAATT GTTGTCTGTA ACGAAAAGAG ATCTGTACAA GCAAGACCAC 10140 CACACCCGGC CTCTAAAGCG TGCACCCGCA AGTACAGGTA CTCCAGTCGT CCATCCGTTT 10200 AATTCGTTAC ACGTTACA 10218 // ID INVADER6 standard; DNA; INV; 4885 BP. XX AC NT_033778; XX DR FLYBASE; FBgn0067385; invader6. XX FT source NT_033778:1483681..>1488172 FT SO_feature five_prime_LTR ; SO:0000425:1..393 FT SO_feature three_prime_LTR ; SO:0000426:4494..4885 XX CC Sequence identified by REPBASE: CC http://www.girinst.org/server/RepBase/RepBase9.02.embl/drorep.ref XX SQ Sequence 4885 BP; 1486 A; 952 C; 1071 G; 1376 T; 0 other; TGTCGCATCA TTATTAGTCT TATTTTTATT TTCTATGTTC CATCTCTAAT AAACATGTCA 60 TCTCTATTAA ATAAAATTCG TATCGAGCTG TTCTTGTCTT CGTTTCTCTT TGATCGCTGT 120 TCGCTGTGTT CCGTTATGCG AGTTTAACGG GTTTTGCTCT GTTCTACATA GTCTCGGTTC 180 GACGATGCGT TAGAGTGAGA CAAATGCTTG TCCTGTGGTG AGTTCGGACC AGCATGTATC 240 AAGCGAGATA GAGCGATGTT GAAATGTACA CGGGGCACTT ATGTTTGAAA ACTCTGAGAA 300 AGCGGACGCG TGAATATGTC GCAACCGAGG AAGTGTACGA CTCGCGGGCG GAGCGCGGCA 360 ACAGAGGACC CCGAATCAGT TAACTTCCCG ACATCAGAAG TGGGATCGCC TCAGCCCCAA 420 GCCATCACTG ATGATCGCCT GTCCACAATT TTAGAAATGC AGCATCGTAA TCTATTGGAA 480 ATTGTAAATG CCGTGAGGGG CTCGCAGACT ACACAAGTAG TTGTACTACC CAAGTTCAAC 540 CCTGAGTCCG CCGGTTCAAG TGCAGCCACT TGGTGCTCTA CAGTGGACCT TATTGTAGGA 600 GAAAATCCTC TGGATGGCAG TGCTTTGCTC ATGGCCTTAA CCAAATCATT GGAGGGCAGT 660 GCTTCTAATT GGCTGTCACA AATTTGTTTC GCCGGAATGA CTTGGAGCCA ATTCCAAGAA 720 ATGTTCCTTC AGCAGTATGA AGGCAATGAG ACGCCAGCAG CAACAGTCTT TAATGTATTA 780 AACGGGCGGC CAAACGACGG CGAATGTCTT GCGCTGTATG GTAGCTGATT AATAACAACA 840 CTTATGGCAA AATGGAAGTC TATGACCGCA GAAGAAATTG CAGTGTCTGT TGCCCTGGCA 900 CATGCTGCAA ATATAGATGG TAGGCTGCAA CGCACTGTGT TCACAACTAC TGTCAAAACA 960 CGCAACGAGT TGCAGAACGA GCTAAGAGCG TTTTCGTATG GCAAGAGGAA GGACCATCCC 1020 GTTCCAGAAA ATTCTACCAG CAAACGAGCT CGCCTACACC CAAATGTTAA GTGCCACTTT 1080 TGTGGAAAAA TTGGCCACAA GATAGCTGAC TGCCGCTCCA TGAAAAACAA CTTAAAGAAT 1140 CAACAAGGAT CTAGTTCGAG TATTGGGCGC TTATCTGACT CTAAACCTGG GTCAATTACT 1200 TGCTATAGAT GTGGAAACCA GGGGCATATA GCGTCAGCTT GCCCTGCAAG ACAATCGTTG 1260 TCAAACCAAA CTAAAGCCGA CGAGAAGCGT GTCAACGTGT GTCACGTAGT CGAGCCAATT 1320 GGGACATTGA TATCATCTGG TGAGTCGTAT CCATTTTATT TCGACTCTGG AGCCGAATGC 1380 TCACTTGTAA GAGAATCTGT GTCCACCCAA CTCTCGGGCA CACGAATTAA CAACAATGTA 1440 GTTTTAAAGG GTATCGGAAA TAATACTGTT ACCAGTACAT TACAAATTTT GTCAAACGTA 1500 ACAATAAGTG GTTACTGTCT CGAAGTGCTT TTTCACGTAA TTCTTAATGA TTGCATTAAT 1560 TATAATATTA TAATTGGACG CGAAATTTTA AGTCAGGGAT TTAGTGCTAC TATAACAATA 1620 GATAAAATAG AGTTATGTAA AACAAGGTCT GTGCAAACCC TATCTGCTTA GAGTAGTAGT 1680 TTTAGTCTTG AAAATGTTAA TACCGAATTG TGTGGCGAGG ATAGGAAAAT CTTGGTAAAT 1740 CTTTTGAATA AATTCTGTGA CTCATTTATA GACGGTTTTC CCAAAAATCG TGTTACAACT 1800 GGCGAACTAG AAGTACGCTT AATTGATCCA ATAAAAACTG TACACAGACG ACCGTACCGA 1860 CTTAGTATAG AGGAAAAACA AATTGTCCGA AACAAGGTTA ATGAGCTGCT GTTAGATAAC 1920 ATCATCCGTC CTAGCAGCTC ACCGTTCGCC AGTCCAGTTT TACTCGTTAA AAAGAAAAAT 1980 GGTTCTGATC GCCTTTGCGT GGATTACCGC GAACTAAATA CAAACACAGT TGCAGAGAAA 2040 TATCCCTTAC CACTAATTAG TGACCAAATA TCTAGGTTGC GTGGAGCAAG TTTCTTTAGT 2100 TGCTTGGATA TGGCCAGCGG GTTTCATCAG ATACCTATTC ACGCAAATTC AATTGAGCGC 2160 ACGGCTTTTG TGACACCTGA CGGCCAATTC GAATTTCTAA CTATGCCCTT CGGGTTAAAG 2220 AATGCCCCAT CCGTGTTCCA GCGTGCAGTT ATGAAAGCTT TGGGTGAGCT TGCCCACTCT 2280 TACGTTATCG TTTATATGGA CGATATAATG ATTATCGCAG AAACAAAAGA AGAAGCTTTT 2340 GTAAGGTTAA GGACAGTTTT GAAAATATTA TCGCAGGCTG GGTTTTCTTT TAATATCGGA 2400 AAATGTTCAT TCCTGAAATC TTGCATTGAA TATCTGGGGT TTGTGGTAAA AGAGGGCGAA 2460 ATAAGACCAA ATCCATCTAA GATAAAAGCA TTAGTCGCTT TACCGCCTCC GCAGTCTGTT 2520 ACCCAAGTAA GACAAATTAT TGGCCTAGCC TCTTATTTTA GGCAGTTTGT GCCAAAGTTT 2580 TCAGAAATCA TGAAACCCTT ATATAGACTG ACCTGCAAAA ACAAAATATT TGAATGGAAA 2640 CTTGAACACG AACAAATTCG TCAAAAAGTC ACTAAATTGC TTACAGATGA GCCCGTCCTT 2700 GTTATCTTCG ATCCTCGGCA TCCCATTGAA CTGCATACAG ATGCCAGTAT GGATGGCTAC 2760 GGAGCAATTC TACTCCACAA AATAGATAAT AAACGTCGTG TAGTTGAGTA TTACAGCAAA 2820 CAAACATCCT TGACGGAATC TCGATATCAT TCGTACGAGC TTGAAACTTT AGCTGTGTAT 2880 AACTCCATGA GACACTTTCG TCACTATTTA CATGGGCGAA TTTGTTGTTT TTACAGACTG 2940 TAATTCCCTA AAAGCTACTC GCAACAAGAC TGAACTAACG CCGAGAGTAC ACCGTTGGTG 3000 GGCATATATG CAGTCCTTCG ACTTTGACTA GAATGACTTA GACTTAGAAT ATAGACCTGG 3060 TGCCATAATG GCACATGTTG ATTTCTTGTC ACGCAATCCA CTGCCATCTG CTCGGGTTAT 3120 TACTGGTGAG GAAGAAAAAC ATGTTCTATT GGCCAAAATA ACGGACAACT GGTTACTTGC 3180 AGAACAGCAA AAGGATTCAG AGATTTCCAC GATTGTTGTT AAAATACAGA ACAATGAATT 3240 GGGTGAGAGC TCGGCAAAAA GTTATGAATT ACGCTCGAAA ATGCTTTTTC GCAAAATTCA 3300 AAGGAACGGT AAAACTCGTT GCCTGCCAGT TGCCCCCAGA TCATTCAGAT GGTCAGTAGT 3360 GAACCAGGTC CATGAAGCAG TTGTACATTT GGGTGGGAAA AGACTTTAGA CAAAATGTAC 3420 GAATTTTACT GGTTTGAGAA CATGGCCAAA TATGTTCGTA AGTTCGTTGA TAATTGCATT 3480 ACGTGTAAGT TAACTAAGCC TCCGTCAGGA AAATTGCCAA TCGAACTCCA CCCCATACCA 3540 AAAGTAGAAA TTCCATGGCT ATAAGTTGTA CGACAAATCG CATAACGAAA GCCAGTCCTC 3600 TTGAATTACT AATCGGAAAA GAATGTAGAC CATTTAATAT GTTACCAATA TGTGAACAAG 3660 TTAATAAAGT CGATGTAAAT ATTATAAGAA ATATCGCGAG AGAAAATATT AAGAAGAACG 3720 CCTTGTATGA AAAAACTAGA TTCGATAAGC ACAAAGCCAA ATTTGATAAC TTTGGTGTTG 3780 GCGATTATGT TTTACTTAAG AACGAAGAAA GGCACCAAAC AAAATTAGAC CAAAAATATA 3840 AAGGACCTTT CCTCGTGACA GAGGTACTTA AGGGAGATCG TTATATTTTA AAATCTTTAA 3900 CTAATAAGCG GACTTATAAG TACCCACATG AAGCTTTGCG CAGTATGCCA ACAGAGGAGA 3960 TCCCCAAAGA GTTAGATCTA TGTGACGATC AAGAAAACGT TGAAAGAGAC GTTAGAAATC 4020 CCTTGGTGGA TTCCAATGTG GATGAAAACG TCGAAAGAGA CGTTAGAAAT CCCTTGGTGG 4080 ATTCCAATGG GGATGAAAAC GTTGAAAGAG ACGTTAGAAA TCCGTTGGTG GATGCCAATG 4140 TGAGCGAAAA GTTACTGAGT TGTTTGAAGA CTCAAGTGAA TGAGAGGCAT TGATGGATTT 4200 CAATGCGAGA TTGGGGACAC ATGCAACGTC GCCAAGTTGC CAGTGCTAGT AGGTACAAGT 4260 GTTACTGTGT TGACTTATTT GATGTCTGGT GACTGGCGGC GTGGCGGGTT GAATTGTCCT 4320 AGTGTGTTGC TAATAATAAC AAACGATCTT CTTGGTACTT CTGTCACTCG AGTTGGTCGA 4380 TAACAAGAAA AATAATAATA ATAATTACGT TTAATGTTAT CTTTCTAGAT TAAGCTTGTT 4440 TAATTTCAAA ACTTATATTA CACACGAGGA CGTGTGCTGG TCAGGAAGGC CGTGTCGCAT 4500 CATTATTAGT CTTATTTTTA TTTTCTATGT TCCATCTCTA ATAAACATGT CATCTCTATT 4560 AAATAAAATT CGTATCGAGC TGTTCTTGTC TTCGTTTCTC TTTGATCGCT GTTCGCTGTG 4620 TTCCGTTATG CGAGTTTAAC GGGTTTTGCT CTGTTCTACA TAGTCTCGGT TCGACGATGC 4680 GTTAGAGTGA GACAAATGCT TGTCCTGTGG TGAGTTCGGA CCAGCATGTA TCAAGCGAGA 4740 TAGAGCGATG TTGAAATGTA CACGGGGCAC TTATGTTTGA AAACTCTGAG AAAGCGGACG 4800 CGTGAATATG TCGCAACCGA GGAAGTGTAC GACTCGCGGG CGGAGCGCGG CAACAGAGGA 4860 CCCCGAATCA GTTAACTTCC CGACA 4885 // ID DNTOMRETA standard; DNA; INV; 7060 BP. XX AC Z24451; XX DR FLYBASE; FBgn0004357; Dana\Tom. XX FT source Z24451:1..7060 FT SO_feature five_prime_LTR ; SO:0000425:1..474 FT SO_feature three_prime_LTR ; SO:0000426:6587..7060 FT SO_feature intron ; SO:0000188:1277..5158 FT SO_feature CDS ; SO:0000316:859..2055 FT /db_xref="FLYBASE:FBgn0028827; Dana\Tom\gag" FT /protein_id="CAA80823.1" FT /translation="MAQPAQPENTLNESNLAEARGQLKDVPPFRGEPETLFTFISRVDY FT ILSLYHTNDVRQQRILLGAIERNIEGHVTRTLGLPTIEDWPTLRSRMINEYKPQAPNYK FT LLENFRETPYKGNLRAFCEEAERRRQILISKLHLEGNQSNLIIYLQAVRDSMKTLVRKL FT PIQLFTILAHHDIPDLRSLINIAQNEGIYEEHINFETNKNIEIKNKTPNFYQNPKAFKN FT YPINQSQYQPRYPQYPHPLQPNFNPYMHAPRPIYTQQLSNNQPMGPGQTYPGPNRYMNP FT QPIFNRIPFPKSNFNLTPQTQQPRMPSNPNFPLQQTTKRPRPSDSEQTKMSIDELRLQE FT AQEYEQNYQQPYEQEYYDYTQYQDQTYEEQCQAPINQDQAEINFDENFQSPAPEDTNT" FT SO_feature CDS ; SO:0000316:2115..5237 FT /db_xref="FLYBASE:FBgn0028826; Dana\Tom\pol" FT /protein_id="CAA80824.1" FT /translation="MMRKNFFSLPIHNTECEVFTSNGPMTLKDSITLPSNNIFRTPEQF FT YLHDFSDDYDVLIGRKLLNKAQGIINYKTHTITLFDKTYPLIDTDSNKGQFFYTQDSYE FT KPIPKSDKKIDFSPFRLDHLNPEETYKLKHLLNKFKDLQYFEGERLTFTNTIKHVLNTT FT HNSPIYSKQYPLAQTHENEVENQVQEMLEQGLIRESNSPYNSPTWVVPKKPDASGKAKY FT RVVIDYRKLNEITIPDRFPIPNMDEILGKLGKCQYFTTIDLARGFHQIEMDSESIQKTA FT FSTKRGHYEYVRMPFGLRNAPATFQRCMNNILRPLINKHCLVYLDDMIIFSTSLDEHLN FT SLQLVFEKLSESNLKLQLDKCEFLKKEATFLGHIVTPDGIKPNPLKVEAIASYPIPTKV FT KEIRAFLGMTGYYRKFIPSYADIAKPMTRYLKKGAKIDINNHEYVEAFEKLKTLITSEP FT ILQLPNFEKKFVLTTDASNLALGAVLSQDNHPISFISRTLNDHELNYSTIEKELLAIVW FT ATKTFRHYLLGRHFQIASDHQPLRWLHNLKEPNAKLQRWRIRLAEFDFHIEYIKGKQNS FT IADALSRIKVEENHFSEATQHSAVEDNNDLIQLTEKPINVFKKQIIFIKSDQNSVRQTT FT VFGNSITTIHYNNMTVENAKQFLLDHFISKSIAMYIVSDADFEIILTAYREIINPSYTK FT VTRSLILLNNVSSYAEFKEIILQAHEKLLHPGIQKMTKLFKENHYFPNSQLLIQNIINE FT CRVCNLAKTEHRNTKMPFKVTPSPGHCRDKFVIDIYSSEGKHYLSCIDIYSKFATLEQI FT KTKDWIECKNALMRIFNQLGKPTLLKADRDGAFSSLALKQWLESEGVELQLNTAKTGVA FT DVERLHKTINEKIRIINSSKNDEIKLGKMENILYIYNHKTRHDTTGQTPAHIFLYAGQP FT TLDAQKIKEQKINKLNDDRQEYDIDTKFRKGPLQKGKLENPFKENKNVEQTDPDHYKIT FT NRNRTTNYYKTQFKKRKKLMRSPFHRYLAHSDDTVPVAYHELCPTNSDRRH" FT SO_feature CDS ; SO:0000316:5176..6564 FT /db_xref="FLYBASE:FBgn0028828; Dana\Tom\env" FT /protein_id="CAA80825.1" FT /translation="MILSLLLTMSYAQQIQIGGIDTNHGYLLFSSKPIQRPSAFEHHCL FT TVNLTEINTITTYFGNKIQNSTDTPRIKFLYNKLIKELNGITLHKERRQKRGLFNFVGS FT AFKFLFGTLDDNDRIQFEEKLNSEAENSIKIHEFNEVMQFVNDGLQRIKKYENNRNSID FT TLVYELMQFIEYIEDLEMGMQLSRLGLFNPKLLNYDKLQNVNSENILLTKTSTWINYKN FT NEILIISHIPINHVLINTIKIIPYPDRNGYQLEYSGSDSYFENDNKIYNQDNKEVNSEC FT IANIIKRRNPTCNFVPALTKEIIKYIEPNVIITWNLTQTTLTQNCQNSNSNIQIKGNKI FT IRITQCKVKIENIILSENYLHPEIDLTPLYPPLNITKIKILKHNDIIKMISQNNITLYT FT IIIPAILALVAMILILKYINFNPFIFLYIKLRKQTERNQPQLQENELGENPLPTLYPSM FT PAQV" FT SO_feature intron ; SO:0000188:1277..5158 XX CC Derived from Z24451 (Rel. 44, Last updated, Version 6). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7060 BP; 2781 A; 1492 C; 1029 G; 1758 T; 0 other; agtgacatat tcactctcac acccatctac taatatccaa ttactgcata tccaattact 60 gcagccaagt aaacaaataa caagagctgc gcatcatttt tgctgtgcag aaaatttaat 120 ataaacaaat gaccgatatt cagcggcgat cgcgccttaa gcatcttcgc ttatacaaac 180 caaataatcc ataagctctc tcaatgaatc aagcgaccaa ctgcactctc attctaatgt 240 aaacaataga caatagccaa aaccaagatt ttcatctctt tgtaaatcag tcttaagctg 300 aaatccaatt gacacacacg aaacctttct ttcgtcccgt acaattttaa taaataataa 360 ataacaaaac atttttttta aaattggttt ttattttatt aaaaacggat cttagcgttg 420 tccttagtct ttcgacggga catttatttg actcaaaata taaactattt tactggcgca 480 gtcggtagga tacttcaagt atccgaaaaa aaagaaccgc gagtggaaaa taaattaaat 540 tttataatcc gcaattcgca aaaatacgct cttcaactgg gaaaataaat ccaattcggt 600 tatcccaaat aagtggaagc aaaacaaaat tcaaaaactc aaaaacctgt aagtccaaag 660 gcaaggtaaa aacatcgaac taagtgacaa tcaaataaaa acatcgaact aagtgacaat 720 caaacaaaaa catcaaacaa agtgacaata aaaaataaaa aataaaaaat aaaaaaataa 780 aataaaaata aaaataaaat ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaag 840 aacaacaaca aaaacaaaat ggcacaacca gcacaaccag agaatacatt aaacgagagc 900 aaccttgctg aggcccgtgg acaacttaag gatgtccctc catttagagg agagccagaa 960 acacttttta ccttcatcag cagagtggac tacatactgt ccctctacca taccaatgat 1020 gtgcgacaac aacgcatcct gcttggagcc attgagcgga acatagaggg tcatgtgact 1080 agaacattgg gtctgccaac catagaagac tggccgacct tgaggtccag aatgatcaac 1140 gagtataaac cacaggcgcc gaactacaag ctgctggaga acttccgcga gacgccctac 1200 aagggaaatc tgcgtgcatt ctgcgaggag gccgagcgac gacgccaaat actcatctcc 1260 aaattacatc tggaaggtaa tcaatcaaac cttattattt atttgcaagc ggtccgagat 1320 tccatgaaaa cactagttcg aaaattacca attcaattgt ttacaatttt ggcccatcac 1380 gacattccag acttgagaag tttaattaac attgcacaaa atgagggcat ttatgaagaa 1440 catattaatt ttgaaacaaa taaaaatatt gaaattaaaa ataaaactcc aaatttttac 1500 caaaatccga aagcatttaa aaattaccca atcaatcaat cccaatacca accgagatac 1560 ccccaatacc ctcatccatt acaacccaac tttaacccat acatgcatgc acccagaccc 1620 atttatacac aacaattatc taacaaccaa cccatgggac ccggacaaac ttaccccgga 1680 cccaacaggt acatgaatcc tcaaccaatt tttaacagaa ttccatttcc caaatccaat 1740 tttaacctaa ctcctcaaac acaacaacca cgtatgccat ctaacccaaa cttcccacta 1800 caacaaacaa ccaaacgacc aagaccatcg gacagcgaac aaaccaaaat gtccattgac 1860 gaactcagat tacaagaagc acaagaatac gaacaaaatt atcaacaacc atacgaacaa 1920 gaatattacg actacactca gtaccaggac cagacttatg aagaacagtg tcaggcacca 1980 atcaatcaag atcaagccga aatcaatttt gacgaaaatt ttcagtcacc agccccggaa 2040 gataccaata cttaataatt acccacaagg gttacgctca caaatgtttg attgacacag 2100 gatccacaat cagcatgatg cgaaagaatt ttttttctct accaatacac aatactgaat 2160 gtgaagtttt tacatcaaat ggcccgatga cattaaaaga ttcaataact ctgccaagca 2220 acaacatatt tagaacacct gaacaatttt acttacacga cttttctgac gactacgatg 2280 tgctaattgg cagaaaactg ctcaataaag cacaaggtat aataaattat aaaactcata 2340 ctattacact tttcgataaa acctacccat tgatagacac agattcaaac aaaggtcaat 2400 ttttctacac acaagattca tatgagaaac caatccccaa atcagataaa aaaatagact 2460 tttcaccatt ccgtctggat caccttaatc cggaggaaac ctataaatta aaacatttat 2520 taaacaaatt taaagatttg caatattttg aaggagagcg tttgacattc acaaacacaa 2580 tcaaacatgt attaaacaca acacacaatt cgcccattta ttcaaaacaa tatccacttg 2640 ctcaaacaca tgaaaatgag gtagagaacc aggtgcagga gatgcttgag cagggattga 2700 tcagagaaag taattcaccc tataatagtc ccacttgggt tgtaccaaag aaacccgacg 2760 cttcggggaa agcaaaatat agagtagtca ttgattacag gaagctaaat gaaataacca 2820 ttcccgatag atttcctatt cccaacatgg acgaaattct tggaaaactg ggaaaatgcc 2880 agtactttac aacaattgat ttggctaggg gttttcatca gatagaaatg gattcagaat 2940 ccatacagaa aactgcattt tcaactaaac gcggtcatta tgaatacgtt cgcatgccat 3000 ttggcttaag aaatgcaccc gccacattcc agaggtgcat gaataacata ctccgaccat 3060 taattaacaa acactgttta gtttatttgg atgacatgat tattttttcc acatctctag 3120 acgaacattt aaactcattg caattggttt ttgaaaaact gtccgaatca aatctaaagt 3180 tgcaactaga taaatgcgaa ttcctgaaaa aggaagcaac ctttctagga cacatcgtaa 3240 cacccgatgg aataaaacca aaccctctta aagtagaagc catagcatca tacccaatcc 3300 caacaaaagt aaaagaaatc agagcattcc tcggaatgac tggttattac cgaaagttta 3360 tcccaagcta cgctgacata gcaaaaccca tgacccgcta cttaaagaaa ggagcaaaaa 3420 tagacataaa caatcacgaa tatgtggaag cattcgagaa acttaaaacc ctaataacaa 3480 gcgaaccaat tctacaattg cccaattttg aaaagaaatt tgtattgact acagacgcta 3540 gtaacctggc tcttggagct gtcctttctc aagacaatca ccccatatcc ttcataagca 3600 gaacattgaa tgaccatgaa ttaaactaca gtaccattga aaaagaatta cttgccatag 3660 tttgggccac aaaaacattc cgtcactact tgttgggaag acatttccaa atagctagtg 3720 accatcagcc tctcagatgg ttacataatt taaaggagcc aaatgccaaa ctgcagagat 3780 ggagaattag attggctgag tttgactttc atattgagta cataaaaggg aaacagaatt 3840 caattgctga cgcactgtcc agaatcaagg ttgaggagaa tcatttcagt gaagccaccc 3900 aacatagtgc agtagaagac aataatgatc ttatccagtt aacagaaaaa ccaataaatg 3960 tattcaaaaa acaaataata ttcattaaat cagatcagaa tagtgtaagg cagacaaccg 4020 ttttcggaaa ttcaataacc acaattcact ataacaacat gactgttgag aacgccaaac 4080 aattcttact tgaccacttc atttccaaaa gcattgccat gtacatcgtg agcgatgccg 4140 atttcgagat catcctgaca gcctataggg aaattattaa cccctcttat actaaagtga 4200 ctcgtagcct cattttattg aacaacgtga gctcatatgc tgagtttaaa gagatcatac 4260 ttcaagccca tgaaaaactg ttgcatccag gtatccaaaa aatgactaaa ttattcaaag 4320 aaaatcatta tttcccaaat agtcaactac taattcaaaa catcataaat gagtgccgtg 4380 tgtgtaacct agccaagaca gaacacagaa acacaaaaat gccttttaaa gtcacaccta 4440 gccctgggca ttgccgcgat aaatttgtaa tagacatcta ttcatccgag ggtaagcatt 4500 accttagttg cattgacatt tactcgaagt tcgccactct agaacaaata aaaacaaaag 4560 attggataga atgcaaaaac gccctgatgc gtatctttaa ccaactcgga aaacctacat 4620 tgttaaaggc ggatagggac ggtgcatttt ccagcctagc tcttaagcaa tggctcgaga 4680 gtgagggtgt tgaattacaa ttaaacacag ctaaaacggg agtggcggat gttgagagat 4740 tacataaaac aataaatgaa aaaattcgca taatcaattc ctccaaaaac gatgaaatca 4800 aactgggcaa aatggaaaat attctttata tttacaatca taaaaccagg catgacacga 4860 ccggacaaac acccgctcac atatttcttt acgccggaca accaaccctg gacgcacaaa 4920 aaattaaaga acaaaaaata aacaaattaa atgatgatcg tcaggaatac gacatagaca 4980 ccaaatttag aaagggcccc cttcagaagg gaaaattaga aaacccattt aaagaaaata 5040 aaaatgtcga acaaaccgac ccggaccatt acaaaattac taatcgaaac agaactacta 5100 actattacaa aacgcagttt aaaaaacgaa agaaattaat gaggtcccca ttccacaggt 5160 atctggcgca cagtgatgat actgtccctg ttgcttacca tgagctatgc ccaacaaatt 5220 cagataggcg gcattgatac aaaccatgga tatcttcttt tttcaagtaa accaattcaa 5280 agaccatcag cattcgaaca tcattgccta acagtcaacc tcacagaaat aaacaccata 5340 accacatatt ttggaaacaa aatacaaaac agtacagaca caccccggat caaatttttg 5400 tataacaaac tgattaaaga actaaacggg atcactctac acaaagaacg tagacaaaaa 5460 cgcggtcttt tcaatttcgt aggctcagct tttaaattcc ttttcggcac actcgacgac 5520 aacgacagaa tacaattcga agaaaaatta aattcggagg cggaaaattc aataaaaatc 5580 cacgaattta acgaagtaat gcagtttgtt aatgacggat tgcagagaat caaaaagtat 5640 gaaaacaata gaaatagcat tgacaccttg gtttatgaac taatgcagtt cattgagtac 5700 attgaggacc ttgaaatggg aatgcagctc tcacgactgg gactgttcaa cccaaaacta 5760 cttaattacg acaagctcca aaacgttaac agcgaaaaca tattattaac caaaacatcc 5820 acttggatta actataagaa taatgaaata ctaataatct cacatattcc tattaatcat 5880 gttctaatta acactattaa aataatccct tatcccgata gaaatggtta ccaattggag 5940 tactcaggca gcgattctta ctttgaaaat gataataaaa tttataatca agacaacaaa 6000 gaagtaaata gtgaatgtat tgcaaatata ataaaacgaa gaaacccaac atgtaatttt 6060 gtaccggccc ttacaaaaga aataattaag tatattgaac caaatgtaat aattacatgg 6120 aatctgaccc aaacaactct tacacaaaat tgtcaaaatt caaacagcaa catacaaata 6180 aagggaaata aaattataag aataacacaa tgtaaagtaa aaatcgaaaa tattatttta 6240 agcgaaaatt atttacaccc agaaattgat ctaacacctt tgtatccacc gctcaacata 6300 actaaaataa aaatcttaaa acacaatgac attataaaaa tgatttccca aaacaatatc 6360 acactttata caattattat accggctatt ctggctttgg tcgcaatgat tcttattctt 6420 aagtacataa actttaatcc atttatattt ttgtatataa aattaagaaa acaaactgaa 6480 agaaatcagc cacaacttca agaaaatgaa cttggagaaa acccattacc cacattgtat 6540 ccatcaatgc cagcccaagt ataggctgcc tttttaaggg gggaggagtg acatattcac 6600 tctcacaccc atctactaat atccaattac tgcatatcca attactgcag ccaagtaaac 6660 aaataacaag agctgcgcat catttttgct gtgcagaaaa tttaatataa acaaatgacc 6720 gatattcagc ggcgatcgcg ccttaagcat cttcgcttat acaaaccaaa taatccataa 6780 gctctctcaa tgaatcaagc gaccaactgc actctcattc taatgtaaac aatagacaat 6840 agccaaaacc aagattttca tctctttgta aatcagtctt aagctgaaat ccaattgaca 6900 cacacgaaac ctttctttcg tcccgtacaa ttttaataaa taataaataa caaaacattt 6960 tttttaaaat tggtttttat tttattaaaa acggatctta gcgttgtcct tagtctttcg 7020 acgggacatt tatttgactc aaaatataaa ctattttact 7060 // ID DH14600 standard; DNA; INV; 227 BP. XX AC U14600; XX DR FLYBASE; FBgn0012361; Dhyd\Bungy. XX FT source U14600:181..407 XX CC Derived from U14600 (Rel. 63, Last updated, Version 2). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 227 BP; 65 A; 38 C; 39 G; 85 T; 0 other; ATATCTATTT ATTTTTTTTT TTTAAATTGC TGTTTTTATT TGCAATCTGT CTGACACAAG 60 ACAATAAACA ATTTTCTTTA CAATAATTTA CATAACCGGA TGAGATCATC CATTTATTTA 120 CCTACTTTAC ATATTTAACT ATTTCTGTGG GGCAGGTCTA GCACATGAAT CTTGAGTCGC 180 CTGAGGTGCA TGAGTCACAG TAGCTGAGCT AGAGCATGCG AGTAGAG 227 // ID DVRPPDV standard; DNA; INV; 845 BP. XX AC X03936; XX DR FLYBASE; FBgn0000513; Dvir\Dv. XX FT source X03936:660..1532 FT SO_feature direct_repeat ; SO:0000314:1..58 FT SO_feature direct_repeat ; SO:0000314:790..845 XX CC Derived from X03936 (Rel. 20, Last updated, Version 1). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 845 BP; 200 A; 166 C; 206 G; 273 T; 0 other; AATCATCGAA AAAGTTTCGA TTTTCGCACC GATTTTGATT TTGAAAAAAA AAACGTGACC 60 CCTTGCCATT TTGTCGACTT GGGTCGGGGT TTAGGTTTCA TTTTTTTTAT GCCAACCGAT 120 AGAACTAACC CTTACGGTTC AAAATGGTGT GAATACAAAA CTGGCATATT GGGAATATTC 180 AAAACAACAA AAGTTAATTT CGAACTATGG TTTTGTAGTC TTTAACATTC GTTTTGGAAC 240 GTCATATCTC CGCGCGGAGT TATGTCGTTT TGGAACGTCA TATCTCCGCG CGGTGTAATG 300 TCGTTTTGGA ACGTCATATC TCCGCGCGGA GCTTTGTCGT TTTGGAACGT CATATCTCCG 360 CGCGGAGTTA TGTCGTTTTT GAACGTCATA TCTCCGCGGG GAGCTATGTC GTTTTGGAAC 420 GTCATATCTC CGCGGGGAGT TATGTCGTTT TGGAACGTCA TATCTCCGCG TGGAGTTATG 480 TCGTTTGGGA ACGTCATATC TCCGCGGGGA GTTATGTCGT TTGGGAACGT CATATCTCCA 540 CGCGGAGTTA TGTCGTTTTG GAACGTCATA TCTCCGCGCG GAGTTATGTC GTTTTGGAAC 600 GTCATATCTC CGCGTGGAGT TATGTCGTTT GGGAACGTCA TATCTCCGCG GGGAGTTATG 660 TCGTCTTGGA ACGTCATATC TCCGCGGGGA GTTATGTCGT TTTGGAACGT CATATCTCCA 720 CGCGGAGTTA TGTCGTTTTG GAACGTCATA TCTTGGGCGG AGCTATTACC CGAGATATTG 780 AAAAAAAAAA ATCATCGAAA AAGGTTCAAT TTTCACACCG ACCTCGATTT TGAAAAAAAA 840 AACGT 845 // ID DBU133521 standard; DNA; INV; 9045 BP. XX AC AJ133521; XX DR FLYBASE; FBgn0013796; Dbuz\Osvaldo. XX FT source AJ133521:21..9065 FT SO_feature five_prime_LTR ; SO:0000425:1..1195 FT SO_feature three_prime_LTR ; SO:0000426:7850..9045 FT SO_feature intron ; SO:0000188:5624..6439 FT SO_feature CDS ; SO:0000316:<1267..2509 FT SO_feature start_codon ; SO:0000318:<1..2 FT /db_xref="FLYBASE:FBgn0027838; Dbuz\Osvaldo\gag" FT /db_xref="SPTREMBL:Q9XZR6" FT /protein_id="CAB39732.1" FT /translation="TMVRAWVYNLKKDDVLRYGDEFGVTLSGTLDVMRRQFGEWVETNE FT GRIPYSLTVAELARLHGRRPSNSEDVPTVIVGNDQYEEDPNAARQSLPSTSAAARLQQT FT EQGAWRATSTPQPEIRTRGPSSSEQEYPKVAKHVREWNFRFDGTSKPLEFLEQVEWSAD FT TYGLDLDLIPRAMPELLKGMALKWYVANNRHWRTWGTFVRSFQEFFFAEDYLEDLKDEV FT KRRKQMVDEPFKIYMVEMQTLMRPLRYGPDHEMKLIYNNSIPDLRAYARPYQFQSLMEL FT MKLADEFEELERDRERLRRLQRPARTRLMAMEEDDGHEEEMLRRGALEESPRPAPRTGA FT TGQRTHIPNPSRACRVCGQEGHRAVRCRNRALDFCWQCGRIGVRTVACCQSGNDQRYPQ FT SRGEREQCQTAPRH" FT SO_feature CDS ; SO:0000316:<2214..5658 FT SO_feature start_codon ; SO:0000318:<1..2 FT /db_xref="FLYBASE:FBgn0027837; Dbuz\Osvaldo\pol" FT /db_xref="SPTREMBL:Q9XZR7" FT /protein_id="CAB39733.1" FT /translation="WPRRGDAAERSPGGESSSCTKDWCHGTKDTHPKSIQSVPGVWPRR FT TPSCQVPQQGTGLLLAMWTHRRAHRGLLPIGKRPAVPAVQGGAGAMPNSPKTLIGSLHD FT EGHQLTAIVAIGAEQQKATIDTGASSSFISERLAKRLHGGGVVRATRRRIRLANGSCSE FT VNSQLDLKIRLGSRQMEVPLLVLPGVIDDLVLGCDFLAGMGTPWNVAAWRSTIEPRNPQ FT RSGRREAKLSVAIASGEIGHETPLDATQVDQKLIHADPEVDAFLRHELEKFQHVKGTTK FT ITEHRITMQDTRPIKQRYFPKNPKMQAEINKQVDELLVKGCIEPSKSPHTRTYSNGQGR FT KNGKWRLCVDFRQLNSRSIKDAYPLPRVHHILDQLREARYITSLDLKDGYWQIPMEKSS FT RPLTAFTVPGKGLFQWKVMPFGLHSAPATFQRALDQVIGPDMMPHAFAYLDDIIVIGRT FT RQEHMDNLREVFRRLRAANLRINIDKCDFFKKELKYLGHKVTENGIRTDPEKVAAIAQL FT KPPTNVKELRQYVGVASWYRRYVPDFASTVHPLNALLKKGVKWEWTEEHQRAFETVKAK FT LTESPVLACPDFSKPFCLQTDASNYGLGAILTQTSEEGERVISYASRTLNSAERNYSAT FT EKECLAIIWGIRKLRPYLEGYHFIVITDHMALKWLNSIESPSGRIARWALELQQYDFEV FT RYRKGKQNVVADALSRQPLEEDACCLAKSRETPDGTACRWLQRLRQDMRKAPQKFADYR FT EEAGNIYRHIPHQAGHEDVAAWKLCVSTDRRKQVLKENHDAVTAGHLGSRKTIARVAAR FT YYWPGMYRDVRNYVQRCEVCQRYKPSQLQAAGQMLTQVPEEPWATVCADFVGPLPRSKH FT GNTMLLVFIDRFSKWTEMVPLRSANTAALQKAFRERILARFGAPKVLITDNGTQFTSRA FT FKNFLDELGVRHQLTAPYTPQENPTERANRTVKTMIAQFAGSDQRCWDEALPELTLAVN FT SSVSASTGYTAAFITQGREPRLPKTMFDAQTLGTGQEAQSPIERAAKMREVLEIVRRNL FT ERAAQDQARITICGGGSGSRLLGTKCGRRNATCPMPRTDLQRSWHRDTEGHIRWSSLYR FT RSSAGISSDAEKRTRTAHVADLKPWRGETGESQSGAEQAA" FT SO_feature CDS ; SO:0000316:join(<5163..5623,6440..7746) FT SO_feature start_codon ; SO:0000318:<1..2 FT /db_xref="FLYBASE:FBgn0027839; Dbuz\Osvaldo\env" FT /db_xref="SPTREMBL:Q9XZR8" FT /protein_id="CAB39734.1" FT /translation="LTLAVNSSVSASTGYTAAFITQGREPRLPKTMFDAQTLGTGQEAQ FT SPIERAAKMREVLEIVRRNLERAAQDQARITICGGGSGSRLLGTKCGRRNATCPMPRTD FT LQRSWHRDTEGHIRWSSLYRRSSAGISSDAEKRTRTAHVADLKPWRGETGGQTRMRDKG FT SIAAVLDDTVASRPAWKSATAEGVLEQADVHIPFDVDAPTTPAAKRPGLKTRARFETAY FT YSELCEQRRSSNSTTSWKAHPADPRLLAAIRLERRPSTPCTNQGAGTTSRVRLPPVRLP FT FHPRRRIRPNLRRHQPYLELEPSSSEESRSPASSSGTERRTSLEGTRDEWPEEVAQLAT FT QLEAKGSSRTARIIWVRGVNTASAGHAWDVASSPSATMLVVVGRRKVWAPATRPVAATG FT VTGGERGSGKSSCIGQRFRWSYAKTKRVLSLPFLNVTTPLTGLRTTPILTKVQQQPRKV FT MAWSHPGVLKGIQSPSAVKNITVPAPRDEVCQLKDLAILPAHLRRSHPGAAITRGVFAA FT APGVWAEAAAGPSGPGASIRHQLGLEEWKLTLKIRKRKLKKNIIFTYQQRTELPAANGK FT QLVSSAALLTER" XX CC Derived from AJ133521 (Rel. 60, Last updated, Version 2). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 9045 BP; 2492 A; 2233 C; 2712 G; 1608 T; 0 other; TGTGAGGAGC AATATAGGTA TGCACGTATG CAGTGTATAT GAACACATGT ATTTGCTGTG 60 TGTTTATATG CACATACGTG CATGCATATA TTGCTCCACA CAGCGACGCG AACACTATAG 120 GAGCAACCCT CGCGACTGGA AGGGTCAGAG AGCGGACTAT AGTCTGTCTC TCTCGCTCCT 180 AAAGTGTCAG CAAATGGGCT CGCGTCGCTC ACCCGCGGGT CATAGGGCAC CGTTAGCCGA 240 GGTACCGCTG CAAGTATTGG ATGCTCTCTC TTTGCATAGC AACAGCGGTA CTTCGGCCAA 300 CGAAAAGCGG CCGCAATATT TGGCGGGCTG CTGGCGTCAC CCAGATCACG CCATTGTGCC 360 GTTATGTGGG CACGAGCGGC AGGCAGGCCC CGCTCTAGAG CCGAACAGGG CCTGCGGCGG 420 CCCTATAAAA GGGCACCGGC GGTGTCATAG AGGGCAATCA AGAATCGGAA TCAAGAATCG 480 AGAGTGAAAC TAATGTGGAT CAAGAATCAA GAGTGAAGTG AGATTCAAGT GAAAGTGCAA 540 GTACGAGGCA ACGCGAGTGA GGACACAAGA CATCGAAGGA CTACGATATC GACGCAGGAC 600 ACCGATATCC AGGACTCGAA TCAGGCAGGA CATCGTGGGA GCATCCAAAG TATCCTGGGA 660 GGATACTTCA GACGCCTAAT AAGGACCACC GACACGCAGA AGCAGCAGTG GCCATCAGCT 720 AGAGAGTTGA TACAGGGCAG CTGCAACTCG TAGTCTGGTC AGAAGTCTCG TCACAGAATT 780 GTGTAAGTTC ACGCAGCCGT GCGAATAGGT AGCAATTGTG CGCATAAGAA CCTAGGTGTG 840 CCAAGAGCGA GTCCAGTAGT AGGAGTAGCC TGTACAGCCG AGCGAACCGC CGGAGTAGAC 900 AGTGCAAATC CTGGCGTGAG CTGGAGCGCT TGGGCGAATC CTGGTGCTAC TGAGCCATAA 960 TTGTCTGCCC AGCCGGAGTG CGTGAACAAA CATAACGCAC GACGACTATA CAAGTGTGAT 1020 TATATCGCTA AATAATAATA AGTTTTATAA CGAGAATATA CCAAGTACCA AGCAAATTCA 1080 CAAAGTGTTA TTATTCAAAA TCGCAAGGGC TTCAACAAAT AAATTTGTTG AAACGAATCC 1140 ACAGTCGGCT GAAACTGGCC CGGCTTAGAC TCGGGTATAA CCGAAGGTTC GTTACAACTG 1200 GCGCCCAACG TGGGGCCCTG CGATAAGAGA ATAAGCAATA ATTGTGAACA ATGGTGCGAG 1260 CTTGGGTGTA TAATCTCAAA AAGGACGATG TGCTACGATA TGGTGACGAA TTCGGAGTCA 1320 CGCTGTCGGG CACATTGGAC GTGATGCGGA GACAGTTCGG TGAGTGGGTG GAGACGAATG 1380 AGGGAAGAAT TCCATATTCT CTCACCGTCG CCGAGCTGGC ACGACTGCAT GGACGCCGCC 1440 CTTCGAACAG CGAGGATGTG CCAACGGTCA TCGTGGGAAA CGACCAGTAC GAGGAAGATC 1500 CAAATGCGGC GCGGCAATCT CTGCCATCCA CCAGCGCTGC TGCGCGGCTG CAGCAGACAG 1560 AGCAAGGAGC ATGGAGAGCC ACGTCTACAC CGCAGCCTGA AATAAGGACG CGAGGGCCAA 1620 GCAGCTCAGA GCAGGAGTAT CCAAAGGTGG CCAAACACGT GAGGGAGTGG AACTTCCGGT 1680 TCGATGGCAC CTCAAAGCCT TTGGAGTTCC TGGAGCAGGT GGAGTGGTCA GCAGACACCT 1740 ATGGCCTGGA TCTGGATCTG ATCCCAAGAG CGATGCCAGA GCTACTGAAA GGCATGGCTC 1800 TAAAGTGGTA CGTGGCCAAT AATAGGCACT GGAGGACGTG GGGAACCTTC GTGAGAAGCT 1860 TCCAGGAGTT CTTCTTCGCC GAAGACTACT TGGAGGATCT CAAAGACGAG GTCAAGCGTC 1920 GAAAGCAGAT GGTGGACGAG CCATTCAAGA TCTACATGGT GGAGATGCAG ACTCTCATGC 1980 GGCCGCTGCG TTATGGGCCA GATCACGAGA TGAAGTTGAT TTACAACAAC AGCATCCCTG 2040 ATCTGCGCGC CTACGCTCGG CCGTATCAGT TCCAGAGTCT GATGGAGCTG ATGAAGCTGG 2100 CCGACGAGTT CGAGGAACTG GAACGAGACC GAGAGAGGTT GCGTCGGCTG CAGCGGCCAG 2160 CACGAACGCG TCTAATGGCC ATGGAGGAGG ATGATGGCCA CGAAGAGGAG ATGCTGCGGA 2220 GAGGAGCCCT GGAGGAGAGT CCTCGTCCTG CACCAAGGAC TGGTGCCACG GGACAAAGGA 2280 CACACATCCC AAATCCATCC AGAGCGTGCC GGGTGTGTGG CCAAGAAGGA CACCGAGCTG 2340 TCAGGTGCCG CAACAGGGCA CTGGACTTCT GCTGGCAATG TGGACGCATA GGCGTGCGCA 2400 CCGTGGCCTG TTGCCAATCG GGAAACGACC AGCGGTACCC GCAGTCCAGG GGGGAGCGGG 2460 AGCAATGCCA AACAGCCCCA AGACATTAAT TGGCAGCCTG CACGACGAAG GCCACCAACT 2520 GACGGCGATT GTGGCCATTG GGGCCGAGCA GCAGAAAGCC ACGATCGATA CTGGAGCCTC 2580 GAGCAGCTTT ATTAGCGAAA GGCTCGCCAA AAGGCTCCAT GGAGGGGGCG TGGTGCGCGC 2640 CACAAGGCGG CGCATACGCC TGGCGAACGG CAGCTGCAGC GAGGTGAACT CGCAACTGGA 2700 CCTAAAGATT AGACTGGGCA GTCGGCAAAT GGAAGTTCCA TTGCTGGTGC TGCCAGGCGT 2760 AATCGACGAT CTGGTGCTGG GCTGCGATTT CTTGGCCGGC ATGGGAACAC CTTGGAATGT 2820 GGCGGCTTGG CGCTCGACTA TCGAGCCAAG AAATCCACAG AGGAGTGGAC GACGGGAGGC 2880 AAAGTTGTCA GTGGCAATTG CCAGTGGAGA AATAGGCCAC GAAACGCCAC TGGACGCGAC 2940 TCAGGTGGAC CAGAAGCTGA TCCATGCGGA TCCGGAAGTG GATGCCTTTC TGAGGCACGA 3000 ACTGGAGAAA TTCCAACACG TGAAAGGCAC CACCAAAATC ACGGAGCATC GGATCACCAT 3060 GCAGGACACG CGGCCAATTA AACAGCGCTA CTTCCCCAAA AATCCGAAGA TGCAGGCGGA 3120 GATCAATAAA CAGGTGGATG AGCTGCTCGT AAAAGGCTGC ATAGAGCCTT CCAAGAGCCC 3180 ACACACGCGC ACCTATAGTA ATGGTCAAGG AAGAAAGAAT GGCAAATGGC GGCTGTGCGT 3240 GGACTTTAGG CAGCTGAACA GCAGATCCAT CAAGGATGCA TACCCTCTGC CTCGTGTGCA 3300 TCACATTTTG GATCAGCTGC GAGAGGCGCG CTATATCACG AGTCTGGATC TAAAGGATGG 3360 ATATTGGCAA ATACCCATGG AGAAATCCAG TCGACCATTG ACAGCGTTCA CAGTCCCAGG 3420 GAAAGGCCTG TTCCAGTGGA AGGTGATGCC GTTTGGCCTC CACTCGGCAC CTGCAACATT 3480 CCAGCGAGCT CTAGACCAGG TCATCGGCCC TGACATGATG CCACACGCTT TTGCGTACCT 3540 GGACGACATC ATCGTCATAG GCAGGACACG TCAGGAGCAC ATGGATAACC TGAGAGAGGT 3600 GTTCCGGCGA CTGCGGGCAG CCAATTTGAG GATCAACATC GACAAGTGCG ATTTCTTCAA 3660 AAAGGAGCTG AAGTACCTGG GTCATAAGGT CACAGAGAAT GGCATCCGCA CTGACCCAGA 3720 GAAAGTAGCC GCGATTGCCC AGTTGAAGCC ACCCACTAAT GTCAAAGAGC TGAGACAATA 3780 TGTGGGAGTG GCTTCATGGT ATCGTCGCTA TGTCCCAGAC TTCGCTTCCA CAGTGCATCC 3840 GCTCAATGCA CTACTCAAAA AGGGCGTCAA ATGGGAGTGG ACAGAGGAGC ATCAAAGGGC 3900 GTTCGAGACT GTGAAGGCCA AGCTGACGGA ATCACCTGTC TTGGCATGCC CTGATTTCTC 3960 GAAGCCATTT TGCTTGCAAA CGGATGCAAG TAACTATGGA CTGGGAGCAA TCCTGACGCA 4020 AACATCGGAG GAAGGGGAGC GCGTCATTTC GTATGCCAGC CGGACACTTA ACAGTGCTGA 4080 GAGAAACTAC TCAGCCACTG AAAAGGAGTG TCTGGCCATA ATCTGGGGCA TACGCAAACT 4140 CAGGCCATAC CTGGAAGGCT ATCACTTCAT AGTGATCACG GACCATATGG CGCTGAAGTG 4200 GCTCAACTCC ATCGAGAGCC CTTCAGGAAG GATTGCCAGA TGGGCGCTGG AGCTTCAGCA 4260 ATACGACTTC GAGGTTCGCT ACCGGAAAGG CAAACAAAAT GTCGTAGCTG ATGCGCTGTC 4320 ACGGCAGCCG CTGGAGGAGG ACGCATGCTG CTTGGCCAAG AGCAGAGAGA CGCCAGATGG 4380 GACAGCATGT CGCTGGCTAC AAAGGCTGCG CCAGGATATG CGGAAAGCCC CGCAGAAGTT 4440 CGCAGACTAT AGAGAGGAGG CTGGAAACAT ATACCGGCAC ATTCCGCACC AAGCCGGCCA 4500 CGAGGATGTG GCTGCATGGA AGCTGTGCGT GTCAACAGAT AGGCGCAAAC AGGTGTTGAA 4560 GGAGAACCAC GACGCTGTCA CTGCTGGCCA CCTTGGGAGC AGAAAAACCA TTGCTCGGGT 4620 GGCAGCGAGG TATTACTGGC CAGGCATGTA CCGGGATGTG CGCAACTACG TGCAGCGGTG 4680 CGAGGTGTGC CAGCGCTACA AGCCCAGTCA GCTGCAGGCA GCTGGTCAAA TGCTGACACA 4740 AGTGCCCGAG GAGCCATGGG CAACCGTATG CGCTGATTTC GTGGGTCCGT TACCGAGGTC 4800 GAAGCATGGG AATACGATGC TACTGGTATT CATCGATCGA TTCTCGAAGT GGACGGAGAT 4860 GGTGCCTTTG AGGAGCGCGA ACACGGCGGC GCTGCAAAAG GCGTTCCGCG AGAGAATCTT 4920 GGCTAGATTT GGCGCACCAA AGGTACTCAT AACGGACAAT GGCACCCAAT TCACCAGTCG 4980 AGCCTTCAAG AACTTTTTGG ACGAGCTGGG AGTGCGGCAC CAGTTGACGG CGCCATACAC 5040 ACCGCAAGAA AATCCGACCG AGAGGGCCAA CAGGACAGTG AAGACGATGA TCGCGCAATT 5100 TGCTGGCAGC GATCAAAGGT GCTGGGATGA GGCTCTGCCA GAGTTGACGC TGGCAGTCAA 5160 CAGCAGCGTG TCGGCATCTA CCGGGTACAC GGCAGCATTC ATCACGCAAG GGCGAGAGCC 5220 GAGGCTGCCC AAAACCATGT TTGACGCACA AACTCTTGGG ACGGGCCAGG AGGCACAGAG 5280 TCCCATAGAG AGGGCAGCCA AGATGCGAGA AGTCTTGGAG ATTGTGCGTC GAAACCTGGA 5340 GAGAGCTGCC CAGGACCAGG CGCGCATTAC AATCTGCGGC GGAGGCAGTG GAAGCCGGCT 5400 ATTGGGGACA AAGTGTGGGC GAAGGAACGC CACTTGTCCA ATGCCGCGGA CGGATTTGCA 5460 GCGAAGCTGG CACCGCGATA CGGAGGGCCA TATACGGTGG TCAAGTTTGT ATCGCCGGTC 5520 ATCTGCCGGC ATCAGCTCGG ATGCCGAAAA GCGAACCCGG ACCGCGCATG TGGCTGACTT 5580 GAAGCCATGG AGAGGCGAGA CGGGTGAGTC GCAAAGCGGG GCAGAGCAAG CCGCCTAGTA 5640 AAATTTTCAA AAAAAAAAAA AAAAAAAAGA GTGAATGCAC ATCGCGATGT GCATTGCTTT 5700 TAAGACACCT AAAAACGAAG AAAAATTTGG AATTTTACGA GAAGCAAACA GTTGGAACTG 5760 TTAACTGGGT GACGCGCAAC AACAAATTTT TAAATAACGC GCGCACTGCA GTGGAAGCGA 5820 ACAGTTAAAA CTGTTAGCTA AGTGACGCGC AACAACAAAA TTTTAAATTG CCGCGCGCAC 5880 TGTTACGTAC GTAACAACAA CACACACAAC AAACAAGCGT ACGAAACAAC AACACGCTCA 5940 GCTGACAAGA GTTGGACGCG CACGCATAAA CATAAACAAC AACAAGTATG CAAGCAGCGC 6000 ACAACAAATA ACACACACAG CTGAGTACGA ACGAGCGTGT GCAAGAGCCG GAGAAGGTCA 6060 AGGGCCTCGC GAGAGCGAGA GAGCGCAGCG AGCATCGGTT GGCTCGGCGA GAGCGCGAGC 6120 GAGTGCAGCG AGCATCGGTT GGCTCGCGAG AGGCGAGCGA GTGCAGCGAG CATCCGTTAG 6180 CTTGAGAGAG GCGAGCGATC GCACAGCGCA CCGCCGTTAG CTCGAGAGAG GCAAGCAACA 6240 ACAAAATGCA TGACATGCAA AACAGCCAGC AACAAAAGGC ACGCAGCATA CAAAACGGGC 6300 AACAACAACA TGCGTAGGCG CGGTAACCCA GTGAACAATT TGCATAGTGC ATTTAACACA 6360 GGCAACAACT ATGCAAGGCA AGAGAGAGCG TGCAAAGGCA TAGACTGCGT TTCCCACAGG 6420 TGGGCAGACG AGGATGCGGG ACAAGGGGAG CATTGCAGCG GTACTGGACG ATACCGTAGC 6480 GAGTCGGCCT GCCTGGAAGT CAGCAACTGC GGAGGGTGTG CTAGAGCAGG CCGACGTGCA 6540 TATCCCGTTC GACGTTGACG CGCCTACGAC GCCGGCAGCG AAGCGGCCGG GGTTAAAAAC 6600 GCGAGCACGC TTCGAGACGG CCTACTATTC GGAATTATGC GAACAAAGGA GAAGCTCAAA 6660 TTCAACAACG TCCTGGAAGG CACACCCGGC CGATCCGCGA CTGCTGGCCG CGATTCGTCT 6720 GGAGCGGCGT CCGAGCACGC CGTGTACCAA CCAGGGCGCG GGCACGACGA GCCGAGTGCG 6780 CCTGCCACCG GTCAGGCTTC CGTTCCACCC GCGACGACGC ATTCGCCCGA ACCTGAGGCG 6840 GCATCAGCCG TATCTGGAGC TGGAGCCATC TTCCAGCGAG GAATCGAGGT CGCCGGCCAG 6900 CTCCAGCGGC ACAGAGCGGA GGACGTCTTT GGAGGGGACG CGAGATGAAT GGCCAGAAGA 6960 GGTGGCCCAA TTGGCGACGC AGCTAGAGGC AAAGGGCAGC AGTCGGACCG CGCGCATCAT 7020 CTGGGTGAGA GGCGTGAACA CCGCATCCGC CGGACACGCT TGGGATGTCG CGTCTTCACC 7080 CAGCGCTACG ATGTTGGTTG TTGTTGGTCG ACGGAAAGTG TGGGCGCCTG CCACGCGGCC 7140 GGTCGCGGCA ACTGGCGTGA CGGGTGGAGA AAGGGGCTCT GGCAAATCCT CGTGCATTGG 7200 CCAGCGATTC CGCTGGAGCT ATGCGAAGAC GAAGAGAGTG TTGTCATTGC CATTTCTCAA 7260 CGTCACGACG CCGTTGACCG GTTTGCGAAC GACGCCGATA CTAACCAAAG TCCAACAGCA 7320 ACCCAGGAAA GTGATGGCGT GGTCCCATCC AGGAGTCCTG AAGGGCATCC AGAGCCCATC 7380 AGCAGTCAAG AACATAACCG TTCCAGCGCC GAGGGACGAG GTGTGCCAAT TGAAAGATTT 7440 GGCCATCTTA CCTGCGCACC TGCGCCGCAG CCACCCGGGG GCTGCAATCA CGAGAGGAGT 7500 GTTTGCAGCA GCGCCCGGCG TGTGGGCAGA GGCGGCGGCA GGTCCATCGG GTCCAGGGGC 7560 ATCAATAAGG CATCAACTCG GGCTGGAGGA GTGGAAGTTA ACGTTAAAAA TACGCAAGAG 7620 AAAATTAAAA AAAAACATCA TATTCACTTA CCAGCAACGC ACGGAGTTGC CAGCAGCGAA 7680 CGGGAAACAG CTGGTGAGCA GCGCGGCGCT GTTAACAGAG CGTTAACAGC AATGATCGAC 7740 GCGCGTATAG AGCACGCGTG TCATCGATAA CGCAAGGTGG AGCAGCAAGG AGAAAAGAGG 7800 CGCGGAATTC AAAAACACGA AGGTCGTGTT TTCCGAAGGA AGAGGGGAGT GTGAGGAGCA 7860 ATATAGGTAT GCACGTATGC AGTGTATATG AACACATGTA TTTGCTGTGT GTTTATATGC 7920 ACATACGTGC ATGCATATAT TGCTCCACAC AGCGACGCGA ACACTATAGG AGCAACCCTC 7980 GCGACTGGAA GGGTCAGAGA GCGGACTATA GTCTGTCTCT CTCGCTCCTA AAGTGTCAGC 8040 AAATGGGCTC GCGTCGCTCA CCCGCGGGTC ATAGGGCACC GTTAGCCGAG GTACCGCTGC 8100 AAGTATTGGA TGCTCTCTCT TTGCATAGCA ACAGCGGTAC TTCGGCCAAC GAAAAGCGGC 8160 CGCAATATTT GGCGGGCTGC TGGCGTCACC CAGATCACGC CATTGTGCCG TTATGTGGGC 8220 ACGAGCGGCA GGCAGGCCCC GCTCTAGAGC CGAACAGGGC CTGCGGCGGC CCTATAAAAG 8280 GGCACCGGCG GTGTCATAGA GGGCAATCAA GAATCGGAAT CAAGAATCGA GAGTGAAACT 8340 AATGTGGATC AAGAATCAAG AGTGAAGTGA GATTCAAGTG AAAGTGCAAG TACGAGGCAA 8400 CGCGAGTGAG GACACAAGAC ATCGAAGGAC TACGATATCG ACGCAGGACA CCGATATCCA 8460 GGACTCGAAT CAGGCAGGAC ATCGTGGGAG CATCCAAAGT ATCCTGGGAG GATACTTCAG 8520 ACGCCTAATA AGGACCACCG ACACGCAGAA GCAGCAGTGG CCATCAGCTA GAGAGTTGAT 8580 ACAGGGCAGC TGCAACTCGT AGTCTGGTCA GAAGTCTCGT CACAGAATTG TGTAAGTTCA 8640 CGCAGCCGTG CGAATAGGTA GCAATTGTGC GCATAAGAAC CTAGGTGTGC CAAGAGCGAG 8700 TCCAGTAGTA GGAGTAGCCT GTACAGCCGA GCGAACCGCC GGAGTAGACA GTGCAAATCC 8760 TGGCGTGAGC TGGAGCGCTT GGGCGAATCC TGGTGCTACT GAGCCATAAT TGTCTGCCCA 8820 GCCGGAGTGC GTGAACAAAC ATAACGCACG ACGACTATAC AAGTGTGATT ATATCGCTAA 8880 ATAATAATAA GTTTTATAAC GAGAATATAC CAAGTACCAA GCAAATTCAC AAAGTGTTAT 8940 TACTCAAAAT CGCAAGGGCT TCAACAAATA AATTTGTTGA AACGAATCCA CAGTCGGCTG 9000 AAACTGGCCC GGCTTAGACT CGGGTATAAC CGAAGGTTCG TTACA 9045 // ID DK29466 standard; DNA; INV; 979 BP. XX AC U29466; XX DR FLYBASE; FBgn0014755; Dkoe\Gandalf. XX FT source U29466:51..1029 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..12 FT SO_feature terminal_inverted_repeat ; SO:0000481:967..979 XX CC Derived from U29466 (Rel. 63, Last updated, Version 5). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 979 BP; 305 A; 150 C; 173 G; 351 T; 0 other; CAGTGCTGCC AGTTTGGCAA TTTAGTGGCT AGATCTGGCC ACTTTTAAAA AAATTTGCAA 60 CTTTTATTTT GTAAATGCTA TTAGCCACAA ATCTAGCAAT TTTAAATTAT TTTTTAGCAA 120 TTTCAGCAAC TTTATTATAA AACTAGCAAT TCGTATATTT TTCTTCAGTG ATTTTGCATT 180 TCTTATGCCT TTTTACGATT ACGATATAGG AACTTGTATT TAACCCAGTT GTTCGATTAT 240 ATGAACTAGT TTATGGTGTT TGAAAACTCT TTAAATGTGA TTGGTGAACA AGAACAAGAT 300 GCCGAAAGCT AATAAGCAAA GTTTTCGTGA TGCCTGGCTG CAAGATGACG AGTTCAAGCA 360 ATGGATTCGT AAGGATTGCA CTGATCAAAC ACGAGCTTAT TGCGCGTATC GCCAATCAAC 420 TATTAACGTA AAGCTTTTTG ACATCCGCCA CCACAGTGCG TCAAAAAAAA AAAAAAAAAA 480 AATGAGACTG TGATAGGCGT ATGTACCCAA AAGAATAAGT TGCCTTTTGT TAGAAAATCA 540 ACCAAAACCG AGGAGCAGGA AGCAACATTA TCCTTGCATA TTGCTCAGCA CACGGCGATT 600 GCCGGCGATT ACAGGTATGG ACTAGAGCGT CATGAAAAGT ATTGCCATAA CTATGATCTG 660 ACATATGAGT ACTTGATTCA AATTACTGGT AGTGCGAGGT ACGCTACTGA ATGTGACGCT 720 GAGGAACTGG AAAATAATTT AAGTATTACT TAGTTATTAT TTAGTTTCAC TTTTTAGTTT 780 TTTAGTTTCA ATTTTTATTA AGTTGTTTCT AATTTGTATT TGTTTTTTGT TTGAAAATAT 840 ATATGTATAT TTGTTAAATA TCAAAATTTT AATGGTTTAG CAATTTTTTT TGGCAATTTT 900 CACAACATTT TTAGCTATTT TTAGCCATTT TTTTTTTCCA AATCTAGCAA TTTTTGCTTA 960 GGAGATTCTG GCAGCACTG 979 // ID DMMAR standard; DNA; INV; 1286 BP. XX AC M14653; XX DR FLYBASE; FBgn0002651; Dmau\mariner. XX FT source M14653:1..1286 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..28 FT SO_feature terminal_inverted_repeat ; SO:0000481:1259..1286 FT SO_feature CDS ; SO:0000316:172..1209 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0013835; Dmau\mariner\T" FT /protein_id="AAA28678.1" FT /translation="MSSFVPNKEQTRTVLIFCFHLKKTAAESHRMLVEAFGEQVPTVKK FT CERWFQRFKSGDFDVDDKEHGKPPKRYEDAELQALLDEDDAQTQKQLAEQLEVSQQAVS FT NRLREMGKIQKVGRWVPHELNERQMERRKNTCEILLSRYKRKSFLHRIVTGDEKWIFFV FT SPKRKKSYVDPGQPATSTARPNRFGKKTMLCVWWDQSGVIYYELLKRGETVNTARYQQQ FT LINLNRALQRKRPEYQKRQHRVIFLHDNAPSHTARAVRDTLETLNWEVLPHAAYSPDLA FT PSDYHLFASMGHALAEQRFDSYESVKKWLDEWFAAKDDEFYWRGIHKLPERWEKCVASD FT GKYLE" XX CC Derived from M14653 (Rel. 63, Last updated, Version 2). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. CC [Not convinced best element, see transposons/mau_mariner.clustal] XX SQ Sequence 1286 BP; 399 A; 270 C; 304 G; 313 T; 0 other; ccaggtgtac aagtagggaa tgtcggttcg aacatataga tgtctcgcaa acgtaaatat 60 ttaccgattg tcataaaact ttgaccttgt gaagtgtcaa ccttgactgt cgaaccacca 120 tagtttggcg caaattgagc gtcataattg ttttctctca gtgcagtcaa catgtcgagt 180 ttcgtgccga ataaagagca aacgcggaca gtattaattt tctgttttca tttgaagaaa 240 acagctgcgg aatcgcaccg aatgcttgtt gaagcctttg gcgaacaagt accaactgtg 300 aaaaagtgtg aacggtggtt tcaacgcttc aaaagtggtg attttgacgt cgacgacaaa 360 gagcacggaa aaccgccaaa aaggtacgaa gacgccgaac tgcaagcatt attggatgaa 420 gacgatgctc aaacgcaaaa acaactcgca gagcagttgg aagtaagtca acaagcagtt 480 tccaatcgct tgcgagagat gggaaagatt cagaaggtcg gtagatgggt gccacatgag 540 ttgaacgaga ggcagatgga gaggcgcaaa aacacatgcg aaattttgct ttcacgatac 600 aaaaggaagt cgtttttgca tcgtatcgtt actggcgatg aaaaatggat cttttttgtt 660 agtcctaaac gtaaaaagtc atacgttgat cctggacaac cggccacatc gactgctcga 720 ccgaatcgct ttggcaagaa gacgatgctc tgtgtttggt gggatcagag cggtgtcatt 780 tactatgagc tcttgaaacg cggcgaaacg gtgaatacgg cacgctacca acaacaattg 840 atcaatttga accgtgcgct tcagagaaaa cgaccggaat atcaaaaaag acaacacagg 900 gtcatttttc tccatgacaa cgctccatca catacggcaa gagcggttcg cgacacgttg 960 gaaacactca attgggaagt gcttccgcat gcggcttact caccagacct ggccccatcc 1020 gattaccacc tattcgcttc gatgggacac gcactcgctg agcagcgctt cgattcttac 1080 gaaagtgtga aaaaatggct cgatgaatgg ttcgccgcaa aagacgatga gttctactgg 1140 cgtggaatcc acaaattgcc cgagagatgg gaaaaatgtg tagctagcga cggcaaatac 1200 ttagaataaa tgattttttc tttttccaca aaatttaacg tgtttttgat taaaaaaaaa 1260 acgacatttc atacttgtac acctga 1286 // ID DHMINOS standard; DNA; INV; 1773 BP. XX AC Z29098; XX DR FLYBASE; FBgn0010242; Dhyd\Minos. XX FT source Z29098:15..1787 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..253 FT SO_feature terminal_inverted_repeat ; SO:0000481:1519..1773 FT SO_feature CDS ; SO:0000316:join(332..745,806..1477) FT /db_xref="FLYBASE:FBgn0013814; Dhyd\Minos\T" FT /protein_id="CAA82359.1" FT /translation="MSQYSMQKNFRLLQISRSLATMVRGKPISKEIRVLIRDYFKSGK FT TLTEISKQLNLPKSSVHGVIQIFKKNGNIENNIANRGRTSAITPRDKRQLAKIVKADR FT RQSLRNLASKWSQQLAKLSSESGRDKLKSIGYGFYKAKEKPLLTLRQKKKRLQWARER FT MSWTQRQWDTIIFSDEAKFDVSVGDTRKRVIRKRSETYHKDCLKRTTKFPASTMVWGC FT MSAKGLGKLHFIEGTVNAEKYINILQDSLLPSIPKLLDCGEFTFQQDGASSHTAKRTK FT NWLQYNQMEVLDWPSNSPDLSPIENIWWLMKNQLRNEPQRNISDLKIKLQEMWDSISQ FT EHCKNLLSSMPKRVKCVMQAKGDVTQF" XX CC Derived from Z29098. CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1773 BP; 626 A; 299 C; 342 G; 506 T; 0 other; CGAGCCCCAA CCACTATTAA TTCGAACAGC ATGTTTTTTT TGCAGTGCGC AATGTTTAAC 60 ACACTATATT ATCAATACTA CTAAAGATAA CACATACCAA TGCATTTCGT CTCAAAGAGA 120 ATTTTATTCT CTTCACGACG AAAAAAAAAG TTTTGCTCTA TTTCCAACAA CAACAAAAAT 180 ATGAGTAATT TATTCAAACG GTTTGCTTAA GAGATAAGAA AAAAGTGACC ACTATTAATT 240 CGAACGCGGC GTAAGCTTAC CTTAATCTCA AGAAGAGCAA AACAAAAGCA ACTAATGTAA 300 CGGAATCATT ATCTAGTTAT GATCTGCAAA TAATGTCACA ATACAGCATG CAAAAAAATT 360 TTAGATTGCT GCAGATCAGT AGAAGTTTAG CAACGATGGT TCGTGGTAAA CCTATTTCTA 420 AAGAAATCAG AGTATTGATT AGGGATTATT TTAAATCTGG AAAGACACTT ACGGAGATAA 480 GCAAGCAATT AAATTTGCCT AAGTCGTCTG TGCATGGGGT GATACAAATT TTCAAAAAAA 540 ATGGGAATAT TGAAAATAAC ATTGCGAATA GAGGCCGAAC ATCAGCAATA ACACCCCGCG 600 ACAAAAGACA ACTGGCCAAA ATTGTTAAGG CTGATCGTCG CCAATCTTTG AGAAATTTGG 660 CTTCTAAGTG GTCGCAGCAA TTGGCAAAAC TGTCAAGCGA GAGTGGACGC GACAAATTAA 720 AAAGTATTGG ATATGGTTTT TATAAAGTAT GTTTTGTTAT TACCTGTGCA TCGTACCCAA 780 TAACTTACTC GTAATCTTAC TCGTAGGCCA AGGAAAAACC CTTGCTTACG CTTCGTCAAA 840 AAAAGAAGCG TTTGCAATGG GCTCGGGAAA GGATGTCTTG GACTCAAAGG CAATGGGATA 900 CCATCATATT CAGCGATGAA GCTAAATTTG ATGTTAGTGT CGGCGATACG AGAAAACGCG 960 TCATCCGTAA GAGGTCAGAA ACATACCATA AAGACTGCCT TAAAAGAACA ACAAAGTTTC 1020 CTGCGAGCAC TATGGTATGG GGATGTATGT CTGCCAAAGG ATTAGGAAAA CTTCATTTCA 1080 TTGAAGGGAC AGTTAATGCT GAAAAATATA TTAATATTTT ACAAGATAGT TTGTTGCCAT 1140 CAATACCAAA ACTATTAGAT TGCGGTGAAT TCACTTTTCA GCAGGACGGA GCATCATCGC 1200 ACACAGCCAA GCGAACCAAA AATTGGCTGC AATATAATCA AATGGAGGTT TTAGATTGGC 1260 CATCAAATAG TCCAGATCTA AGCCCAATTG AAAATATTTG GTGGCTAATG AAAAACCAGC 1320 TTCGAAATGA GCCACAAAGG AATATTTCTG ACTTGAAAAT CAAGTTGCAA GAGATGTGGG 1380 ACTCAATTTC TCAAGAGCAT TGCAAAAATT TGTTAAGCTC AATGCCAAAA CGAGTTAAAT 1440 GCGTAATGCA GGCCAAGGGC GACGTTACAC AATTCTAATA TTAATTAAAT TATTGTTTTA 1500 AGTATGATAG TAAATCACAT TACGCCGCGT TCGAATTAAT AGTGGTCACT TTTTTCTTAT 1560 CTCTTAAGCA AACCGTTTGA ATAAATTACT CATATTTTTG TTGTTGTTGG AAATAGAGCA 1620 AAACTTTTTT TTTCGTCGTG AAGAGAATAA AATTCTCTTT GAGACGAAAT GCATTGGTAT 1680 GTGTTATCTT TAGTAGTATT GATAATATAG TGTGTTAAAC ATTGCGCACT GCAAAAAAAA 1740 CATGCTGTTC GAATTAATAG TGGTTGGGGC TCG 1773 // ID DFU309320 standard; DNA; INV; 928 BP. XX AC AJ309320; XX DR FLYBASE; FBgn0044997; Dfun\Isfun-1. XX FT source AJ309320:1..928 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..14 FT SO_feature terminal_inverted_repeat ; SO:0000481:915..928 XX CC Derived from AJ309320 (Rel. 68, Last updated, Version 1). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 928 BP; 276 A; 181 C; 175 G; 296 T; 0 other; ttataccctg tacagggata tattaacttt gtgaaaaaaa ttgttgtaac aggcagaagg 60 aggcatctat gaccctataa agtatatata ttcttgatca gtatcaacag ccgagtcgat 120 tgagccatgt ccgtctgtct gtctgtccgt ccgtccgtct gtccgactgt atgtacgcgt 180 cgatctcagc aactattaga gctagagaca ccaaatttgg catgaaggtt cctctatacc 240 atacgcagat caagtttatt ttaaattttg gatacccatc ccgcgcaaaa aattagagtt 300 tcaaaagggt attctttaaa caccttaaac accttcgaaa aggctaaagt cgaccgcatt 360 cgacagtata ataccctgta caggcttcta tattagaatt atgttgataa taggtacttt 420 gattaaaata aaagtaagaa cataacaggt gtttaaagaa tacctctttg aaactctcat 480 tttttgcata cgaattcttt gcatgcttat agcagctaac tttcacatat ttatacatat 540 tcaattaaaa gctgagtgtt acgacactta atttacttat gcttgtgtcg cagtagttcg 600 ctatctcgct cgcacaaata tgcacacatg gtaagcagac gcatgtgtca tagtcgttcg 660 ctgtctcgct cgcacacatg tgctgcctgc cgctctacga ttccgtggaa aaataacttt 720 tattgctgat tatctgatga aactttcagc gttccttcta tatatcattc ttaacgcacc 780 tgttaagtaa gaagtgtata cgttgaaaaa tgtggctatt attcgttttt tttcaatttg 840 cgggggaagg gggggaggta tccaaaatgt aaaataaact tgatctgcgt atggtataca 900 ggaaccttta tgcctgtaca gggtataa 928 // ID U73803 standard; DNA; INV; 5540 BP. XX AC U73803; XX DR FLYBASE; FBgn0023239; Dsub\bilbo. XX FT source U73803:536..6065 FT SO_feature CDS ; SO:0000316:<464..1865 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /db_xref="SPTREMBL:O46183" FT /protein_id="AAB92392.1" FT /translation="RRALPGWTVPLLCSSWEPYGEIIKYCKQNQKRERHSPNACREKSQ FT AGEGSGPKAGPIRSDAGKTCTVKDAGMNPGGSHPKPGMGGKSTPKTDTMVGQLSNIAAA FT QPSGEAAGEDLPSTSVQAKKYSYAEKRSAGHILRRQNASQEVSPTADWLKKVEWASTVL FT PNFSVEPQKVAAQQKRQRSQETPGPAAKRSRILPNVSFAQIAKERTLIGVLDKGSAEGK FT IPRSQWKWVEAALADRCFELLEKDPGPPPVCKDMGWFQGNIKVVACEDERSVKLYKAAV FT AQIGEVYAGAKLVAVDWSEVPSRPRARIWVPATFKEPERILTMLQRCNPTLPTSDWKVA FT KVEASKGPTNQAVVILNKESLAPIEAARGELNFGFSSVTMKVYKSDAAAEARSANKPVE FT QDVASEIEAPEVDPEPEPALESYSSETELLLDFEAMCRDDILDDSDADITVVENVSNEV FT SEASADKSPPL" FT SO_feature CDS ; SO:0000316:<1801..3465 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /db_xref="SPTREMBL:O46184" FT /protein_id="AAB92393.1" FT /translation="QWWRMFLMKSLRLLQINLHHCKAASAALILRLVASGGDIVLIQEP FT WVVGGRVCGLATKDYNLIVAQTEGKIRTCILARKHLNIFLLHNFSNGDNTAASLELQGT FT RLNLVSSYMAHEESDPPSDLVRKIVSDSERSDTSLLIGCDANAHHTQWGSSDTNVRGES FT LFSFILNSNLFIGNRGNDPTFIIKNRQEVIDLTLLSHKLLDTIKSWRVLEDHSFSDHRY FT IETTLSLESTIPTSYVNPRKTNWDTYSAKMEELLPPLAPKCPDTQDDFNRLVDNFTEAC FT NKAFMAACPSTKPRGKKKPPWWSKHIDTLRKDCRTLFNRAKSSSEVAHWENYKSKLTLY FT KKETRRAKRASWHKFCSEIEDTSEAARLRKVLSKTSPSVGYLKRTDGTWTNSSEDSLHI FT LLETHFPGCTTIEPADTPSEGPETSMGHILTKRNISWAVNSFKSYKSPGPDQVIPAQLQ FT KAGDVAINWLQSIFRKILTVGKIPRAWLKAKIVFIPKAGKPSHTTPKDFRPISLSSFLL FT KTFERLVGLQLRKTIKPQAVRRSTCLPKREIHGNGTP" FT SO_feature CDS ; SO:0000316:<3386..5413 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:;" FT /db_xref="SPTREMBL:O46185" FT /protein_id="AAB92394.1" FT /translation="GKPLNLRLSGAQHAYRKGKSTETALHEVISAVEKSLHVKEYSLIA FT FLDIEGAFNNVIPGAITEALTDLGVDRHLVMLIDQLLTCRTVTSSMGSSTQSRYVNRGT FT PQGGVLSPLLWNIAVNKILCDLEGEGCKVVAYADDVAIIFSGKFPQTLCELMTAKLARL FT SEWTKSRGLGINPSKTELVLFTNKYKIPPLNPPILNGCRLSFSDSASYLGLVIDKKLSW FT NLSIKDRVKKATIALYTCKKAIGLKWGMNPRIVQWIYLAIVRPILLYGVTVWWTALSKG FT TITKQLSKVQRTAALSISGALSTTPTDALNAILCLQSPELAGKEQAEMAAIRLRDSDQW FT VSQRTGHASILNGNNIVPAKTDYCVPREYTDTPFETIIPHRNDWLEGPPGPKEAIQIFT FT DGSKLDNKVGGGIYSELLNISYSFRLPDHCSVFQAEVIAIKEALSCLQELTPEATYINI FT YSDSQAAIKSLNAITTSSATVANCRKSLHEMAYQFVISLIWVPGHQDIEGNCIADELAR FT AGTTIPLLNDKEDIRMPMATCKLRIKEHFKKLTNDRWQTVPLCRITRQTWPNINRKRTD FT ELCKLSRSRCSSVIRSLTGHWLIGTHANRLGAPYNDFCRSCRDEDEEETVEHLFCSCPA FT LSRRRLQYLGSPFLNDISDMSTISPRRIAGFIRASGWDNG" XX CC Derived from U73803 (Rel. 54, Last updated, Version 1). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 5540 BP; 1604 A; 1460 C; 1407 G; 1069 T; 0 other; TCTATTTACG TCCGATGATC CTATTGGGCT TTGAGGTGGC TCAGAGGGGG CCTAGCTGAG 60 GCCACCGGAC GGGTCGCAGG TTGCGGATAG CGAACGGCCT ACCCCAGTGG GTAGGTCAGC 120 TGTGAATAGG CGGGCATCCT AAAGAGCATC AACTAAGTTG GCTCTGAGGG AGAGTACCGA 180 AATAAGCAGT CCCGGACAGC CAGCGGGTGT TCGCGACGCA GCAACACTGA CGATGCGCTT 240 GTGGTGCCGC CTCTCATTAA GTAGCTCACA CAATCCCATC CCTACACAAC ACCTACCCAC 300 ATCCTAGCCC CAACACTCAC CTCACACCGG GCCGGACTAA GGTGTCACTC GACCCGTGCC 360 AAGAGTCAGC CTGGCTGGGG GCCCGGTATA AACACTAGCC CAGTCGTGAA CGGAGAGCTC 420 AGGGATTTGG CAGCCCCCTG TTCTAATTAT GCCTTACCCG GGTAACGCAG AGCTCTGCCG 480 GGGTGGACCG TTCCCCTTCT CTGCAGCTCG TGGGAACCAT ATGGAGAAAT CATTAAATAT 540 TGTAAACAAA ACCAAAAACG AGAGAGGCAC TCCCCAAATG CCTGCAGGGA GAAATCCCAA 600 GCGGGAGAAG GCAGCGGCCC AAAGGCTGGC CCGATCCGCT CAGACGCAGG AAAGACCTGC 660 ACGGTGAAGG ACGCGGGTAT GAATCCGGGT GGATCCCATC CCAAACCTGG GATGGGTGGA 720 AAGTCAACAC CCAAAACTGA TACCATGGTA GGGCAGCTAA GCAATATTGC GGCTGCCCAA 780 CCTTCCGGTG AAGCGGCGGG TGAGGACTTG CCATCTACCA GCGTCCAAGC CAAAAAGTAT 840 AGCTATGCCG AAAAGAGAAG TGCTGGGCAC ATACTCCGTC GGCAGAACGC CAGCCAAGAA 900 GTCAGTCCGA CAGCAGACTG GCTGAAAAAA GTAGAATGGG CCTCGACGGT GTTGCCCAAC 960 TTCAGTGTGG AGCCACAGAA GGTGGCCGCA CAACAGAAGA GGCAACGCTC TCAAGAGACG 1020 CCCGGACCTG CCGCGAAGCG ATCCAGGATT CTACCAAACG TATCCTTCGC GCAGATTGCC 1080 AAGGAGAGGA CGCTGATAGG CGTTCTCGAT AAAGGCAGCG CGGAGGGGAA AATACCCAGA 1140 AGCCAATGGA AGTGGGTGGA AGCGGCACTG GCCGACCGCT GCTTCGAGCT TCTAGAGAAG 1200 GATCCCGGGC CACCCCCAGT CTGCAAGGAC ATGGGATGGT TCCAGGGAAA TATAAAAGTG 1260 GTAGCCTGCG AGGATGAGCG CTCCGTGAAG CTATACAAAG CTGCGGTGGC GCAAATCGGT 1320 GAGGTCTACG CAGGGGCGAA GCTCGTCGCT GTAGACTGGA GCGAGGTGCC AAGTAGGCCA 1380 AGAGCCCGTA TATGGGTACC GGCCACTTTC AAGGAGCCCG AGCGGATCCT AACGATGTTG 1440 CAGAGATGCA ACCCCACACT GCCAACCTCA GACTGGAAGG TGGCTAAAGT GGAGGCATCA 1500 AAGGGCCCCA CAAACCAGGC GGTAGTGATC CTGAACAAGG AATCGCTAGC CCCGATCGAG 1560 GCAGCCAGGG GAGAGCTCAA CTTCGGGTTC AGCTCGGTGA CCATGAAGGT CTACAAGTCG 1620 GATGCAGCGG CCGAGGCGCG TTCTGCCAAC AAACCAGTTG AGCAGGACGT TGCCTCGGAG 1680 ATCGAGGCAC CGGAAGTAGA CCCGGAGCCG GAGCCGGCCC TAGAGAGCTA CTCCTCGGAG 1740 ACGGAGCTGC TGCTCGATTT TGAGGCGATG TGTCGGGACG ACATCCTCGA CGACTCAGAC 1800 GCCGATATAA CAGTGGTGGA GAATGTTTCT AATGAAGTCT CTGAGGCTTC TGCAGATAAA 1860 TCTCCACCAC TGTAAAGCAG CATCCGCTGC TCTTATACTC CGCTTAGTCG CGAGCGGAGG 1920 AGACATAGTC CTAATCCAAG AGCCCTGGGT GGTAGGAGGC AGGGTCTGTG GATTAGCGAC 1980 AAAGGACTAC AACCTAATTG TAGCCCAAAC GGAAGGTAAA ATAAGAACCT GCATATTAGC 2040 AAGAAAGCAC TTAAATATCT TTCTGCTCCA CAACTTTAGC AACGGCGACA ACACGGCGGC 2100 CAGCCTAGAG CTACAAGGGA CACGCCTGAA CCTGGTGTCG TCTTACATGG CTCATGAGGA 2160 AAGCGATCCT CCCAGTGACC TCGTTCGCAA GATTGTCAGC GATAGCGAGA GGTCGGACAC 2220 AAGCCTACTA ATAGGCTGCG ATGCCAACGC TCACCACACC CAATGGGGGA GCTCGGATAC 2280 AAATGTAAGG GGTGAGTCAC TTTTTAGCTT CATCCTTAAC TCCAACCTAT TTATTGGAAA 2340 TCGGGGTAAT GACCCCACTT TCATTATAAA AAACCGCCAA GAGGTTATTG ACCTCACTCT 2400 GTTATCTCAC AAACTGTTAG ACACTATAAA AAGTTGGAGA GTCCTAGAGG ACCACTCCTT 2460 CTCCGACCAC AGGTACATCG AGACAACCCT ATCGCTCGAA AGCACCATAC CCACTAGCTA 2520 TGTAAACCCA AGAAAAACCA ACTGGGATAC GTATAGCGCA AAAATGGAGG AATTACTCCC 2580 CCCCCTAGCC CCAAAATGCC CGGATACACA AGACGATTTT AATCGTCTTG TGGACAATTT 2640 CACGGAAGCT TGCAATAAGG CCTTCATGGC AGCTTGCCCC TCGACCAAAC CAAGAGGGAA 2700 AAAGAAACCC CCCTGGTGGT CTAAACATAT AGATACCCTC CGAAAAGACT GCAGGACTCT 2760 CTTTAACAGA GCCAAAAGCA GCAGCGAGGT AGCGCACTGG GAAAATTACA AAAGTAAGCT 2820 AACCCTATAC AAAAAAGAAA CAAGGAGAGC AAAAAGGGCC TCTTGGCATA AATTTTGCTC 2880 CGAAATTGAG GACACGTCAG AAGCCGCAAG GCTACGCAAG GTCCTGTCAA AAACATCCCC 2940 CTCGGTGGGA TATCTCAAGA GAACTGATGG TACGTGGACA AACTCTAGCG AGGACTCCCT 3000 ACACATTCTT CTCGAAACTC ACTTTCCTGG GTGTACGACC ATCGAGCCGG CAGACACCCC 3060 AAGTGAAGGC CCGGAAACCT CGATGGGGCA CATCCTCACA AAACGGAATA TAAGCTGGGC 3120 AGTAAACAGC TTTAAATCCT ACAAATCGCC TGGCCCGGAC CAAGTCATTC CGGCTCAGCT 3180 ACAAAAAGCC GGGGACGTGG CCATCAACTG GCTGCAAAGT ATCTTCAGGA AGATACTGAC 3240 AGTGGGAAAA ATCCCCCGCG CTTGGCTAAA GGCTAAAATA GTCTTTATAC CCAAGGCAGG 3300 AAAACCCTCT CACACAACCC CAAAAGACTT CAGACCAATA AGTCTATCGT CCTTCCTTCT 3360 TAAAACCTTT GAAAGACTAG TCGGGCTGCA GCTGAGGAAA ACCATTAAAC CTCAGGCTGT 3420 CAGGCGCTCA ACATGCCTAC CGAAAAGGGA AATCCACGGA AACGGCACTC CATGAAGTAA 3480 TCTCGGCGGT AGAGAAATCT CTTCACGTCA AAGAATACTC CCTAATCGCT TTCCTAGACA 3540 TTGAGGGAGC TTTTAACAAC GTCATACCGG GAGCCATCAC AGAGGCACTG ACTGACCTGG 3600 GGGTGGATCG TCACTTGGTG ATGCTCATAG ATCAATTGCT CACATGCAGG ACAGTGACAT 3660 CATCAATGGG ATCGTCCACT CAGTCAAGGT ATGTCAACAG AGGCACCCCG CAAGGCGGTG 3720 TCCTGTCTCC CCTTCTATGG AACATAGCCG TCAATAAGAT CTTGTGTGAC TTGGAAGGGG 3780 AGGGCTGCAA GGTGGTAGCT TACGCGGACG ACGTTGCCAT TATCTTCTCG GGAAAATTCC 3840 CTCAAACACT CTGCGAACTA ATGACCGCAA AGCTCGCACG ATTGTCAGAA TGGACAAAGT 3900 CGCGTGGACT GGGTATTAAT CCCTCTAAAA CGGAACTTGT GTTGTTTACA AACAAATACA 3960 AAATCCCGCC CCTCAACCCC CCAATACTAA ACGGATGCAG GCTCTCCTTC AGCGACAGTG 4020 CCAGTTACTT AGGGTTGGTA ATTGATAAAA AGCTCAGCTG GAACCTAAGC ATCAAAGACA 4080 GAGTGAAGAA GGCTACGATA GCCCTCTACA CTTGCAAGAA GGCCATTGGG CTAAAATGGG 4140 GCATGAACCC AAGAATAGTC CAATGGATCT ACCTAGCAAT AGTTAGACCA ATACTGCTCT 4200 ACGGAGTCAC AGTGTGGTGG ACTGCCCTAT CGAAGGGGAC CATCACAAAA CAACTAAGCA 4260 AGGTGCAGCG AACTGCAGCC TTAAGCATCA GTGGAGCTCT GAGCACAACG CCAACGGATG 4320 CGCTAAATGC TATACTTTGC CTGCAGAGCC CTGAACTTGC AGGTAAGGAG CAGGCAGAAA 4380 TGGCAGCAAT CCGTCTCAGA GACTCCGACC AATGGGTGTC ACAACGCACC GGCCATGCAT 4440 CCATCCTAAA TGGAAACAAC ATCGTCCCTG CAAAAACGGA CTACTGCGTT CCGAGGGAGT 4500 ATACAGATAC CCCCTTTGAG ACAATCATCC CTCACAGAAA CGACTGGCTT GAGGGACCGC 4560 CTGGTCCAAA GGAAGCCATT CAAATTTTTA CTGATGGCTC AAAGCTTGAC AACAAGGTCG 4620 GAGGAGGAAT ATACTCCGAA CTGCTGAATA TAAGCTACTC CTTCAGGCTC CCGGATCACT 4680 GCAGTGTCTT CCAAGCGGAG GTCATAGCGA TCAAGGAAGC TCTGAGCTGT CTTCAGGAAC 4740 TAACCCCTGA AGCGACTTAC ATAAACATCT ACAGTGATAG CCAAGCTGCA ATCAAATCAT 4800 TGAATGCGAT AACGACAAGC TCGGCCACAG TTGCGAACTG TCGCAAATCT CTTCACGAGA 4860 TGGCTTATCA GTTCGTCATC AGCCTAATAT GGGTCCCGGG CCACCAGGAC ATTGAAGGCA 4920 ACTGTATAGC AGATGAGCTG GCCAGAGCTG GAACAACAAT CCCCCTTCTC AATGATAAGG 4980 AGGATATTCG TATGCCAATG GCCACCTGCA AGCTCAGAAT AAAAGAACAT TTTAAGAAAC 5040 TTACAAATGA CAGATGGCAA ACTGTGCCAC TATGTCGCAT AACTCGGCAA ACATGGCCTA 5100 ACATAAATAG GAAGCGCACC GATGAGCTCT GCAAACTCAG TAGGAGTAGG TGCAGCTCAG 5160 TCATACGCTC CCTTACGGGA CACTGGCTAA TAGGCACACA TGCAAACAGG CTGGGAGCCC 5220 CTTATAACGA TTTCTGTCGA AGCTGTAGAG ATGAGGACGA GGAGGAGACT GTGGAACACC 5280 TTTTCTGCTC CTGCCCGGCT CTCAGTAGAA GGAGACTCCA ATACTTGGGC TCTCCTTTTC 5340 TAAATGACAT TTCGGACATG TCTACAATAA GTCCCAGACG GATCGCTGGC TTCATAAGGG 5400 CATCCGGATG GGATAATGGA TAAAATAGGG TCTCACGGGA GAGAAGGGAG CGAAACATGC 5460 GGTATCACAA TGGGCCGAAA CCGGCCTAAG TGTGTCGGGT CTCTGACAAT CCCCGACAGC 5520 CGCCTCAACC TAACCTAACC 5540 // ID DSV28T24 standard; DNA; INV; 7779 BP. XX AC X60177; S39346; XX DR FLYBASE; FBgn0005661; Dsil\Loa. XX FT source X60177:1..7779 FT SO_feature CDS ; SO:0000316:689..1816 FT SO_feature CDS ; SO:0000316:1786..7716 XX CC This is a consensus sequence according to Felger & Hunt (1992). CC CDS annotation from Felger & Hunt (1992). CC Derived from X60177 (Rel. 37, Last updated, Version 8). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 7779 BP; 2297 A; 1808 C; 1900 G; 1774 T; 0 other; ttaaccaccg ttagcaaaaa acattaatga ttttgatttc tttattggct tcatgtcgat 60 ctaaactagt tgatcgattt ccaaatcaat accacaatta ttgttaaaag agaatgagat 120 tgttcttggt agattatttc gccactttcc tcttttccga aaacaattct tcaaattaaa 180 gtagaatcag taattggtta agggaaatca cgattgatat atttataatt tcaagatgaa 240 accattttat taacggttaa gtctcaacgt gattcataaa ttcataaatg gcagtgcagc 300 gaagtaacga agatatagca gctagcataa atgctgatat ggtattgtgt aacatttgta 360 gagccccagt tctgattaat actgaaattg cggagacccc ttgcaaacat aaattccatc 420 ggtaacctga ttacgtacta atgaaacatg tccgtcgtgt agaacgccat gtacactttc 480 ccagttaatt gatgtcaaaa gccagtcatc gaaagggcct tttcaagttc ctcgaacagt 540 tcgtggtggt catcgagttg gtgctacaca taggaatata cctaatatca aatccaataa 600 ctctggagca caggcttaca ggcctaatac taatttcaat gtcgggaata gacgggattc 660 attcaatgct tcgcttgacc catccatccc ttctgaacag cgaattcagc aacttatttc 720 aaactcgtta gatacatttc gagccagtat ggttgtagca gtgtctgacg agatcatttt 780 agctataaga aaccttaata tctcaaataa ccagcaggaa gctcgaattg aatcaggagt 840 cgagcaaaat attcgaacag ctgtagataa tcgaagcact gattccattg aacgacccga 900 caaagtctca agtattattt cgagttggca tgtacaattc tgtggatctt ttaatgaaat 960 ttccgtggaa gattttattt accgcatcaa tcgcttgacc gacgaatgct tgaacggaaa 1020 ttggaatttg ctttatcaat ttgcaattat tttgttctgt ggacctgcat tacagttcta 1080 ctggcggttt caaagaacaa atagtcggtc taattggttt caactatccg atgccttgcg 1140 tgagcgatat agagaacaac gcccggatga gaaaataaaa gattctttaa gatctcgtaa 1200 acagcgaagt ggagaacgtt ttgttcaatt tttggatgct atccagtgta ttgctgatac 1260 actgcgagaa ccaatgacag atcgggaatt agtggctaat attaaaagaa atgttaaggt 1320 tgaaatgagg ttagaactac ttcatgtagc ttcacccaat atagcaaccc ttcgaactga 1380 atgtcacaag cacgaacaat tctgtttaag tatgtcttca aaacctgtgc ctcggccaac 1440 taacaccagt cattttctta atgaagtcat acatgaggaa gaatcgatag atacttccca 1500 tatatattct acagagtcaa ccgagataaa tgcaatacgc tcagttgata ggataaaatg 1560 ttggaattgt aatgaggtag gtcatagata tcaagattgt ataaaaacca gacgcagacc 1620 aggcgtattt tgctatggtt gtggccgttt tgataccacg catcggaaaa cttgcaacag 1680 gatgtccacc gagcataatc agttgatatc cacctacagt ctcggatcaa caatttgtac 1740 cccgccaatt cctaacaata acacgagcct atcgaagaat ctaagatgca aacagttgca 1800 tggcaacttg aattaatatt attcaaaaaa aaaattattg ccagcgtaga gcggaaatat 1860 tctatttgta aatcaattcg cttcccaaaa gttagaatag atgaattttg gagaaacaag 1920 cgatcgccaa agaacattac atttatctct accatccgca accgtaatga ttccgcgtcc 1980 atatttacaa atcttatgtt gtttggtcaa agctatttag cattattgga tagcggtgct 2040 aataaaagcg tgatcggagg acaattggcc atacgattgc ttgccgcaga cctaaagtta 2100 aaaaaattga aaggcaattt tcgcactgca gatggccaac acgaaaacgt attaagtgct 2160 cttcttattc cgttggagta tgacttatta cagaaagaat ttgaatttat gattctacct 2220 tcaatcacac aagacatcat ctgtggaatg gatttttgga agtcatttgg tattaatatt 2280 tctaccacga ctgtaataag cgaattagac tacagtgatc aaacgtgtaa taataggtgt 2340 ctagtgttac agagaggacc tggtctaaag aacgtgtggg atggttcgga aagggctgag 2400 ctgaggcacc aggcgggtcg cgggttgcgg atagggaacg gccttcctgc agggaggtta 2460 gctgcgaata atacgaggtt cgtcgtaaat aagcagtccc ggacaaacag cgggcgcata 2520 atgtcgatcg taccgacaag gcgcttgtat tgccgccttc caaagtgtag cttaccacaa 2580 ccgcccctat ctccccacat atttacctac cctatcctat cccctcatct cactccgggc 2640 tggactaagg cgtcattcga accgtgccga gagtcagctt ggctggcggc acggattaaa 2700 actagccaag tcgttaatcc atccatcctc cacaccgtgc tgctccagca actattagga 2760 gatgccgcag gcacaagaaa ctttggctgt gaggcatggg ggcgatggct ggtacaccca 2820 gaccaccgcg cccagcaccg aacgccgcga ggcaacccgc tcagcgccca aagaggaagg 2880 gagagttcct gccacctagg ggtcaactcg gtggactgga tgacccgaac cggaggaatc 2940 tctgcggttc aagggtcaaa tggtacctcc aatcaggaac tcacccctga tgaagccctg 3000 aaaaaggcca aagagcccag ggaggagatt gcagcacatg cgccacgccc gaagaggcgt 3060 acgagcaatc tcacaccacc ggtggccacc cccaagagga tttgcccaga ggttgtgggc 3120 gctacccgcg gctcggcaac ggcaaaggtg ctgaacaaaa atggggcagg gaatctagac 3180 ctgcgcccaa gcacgtcgcg tgcagcggcc acgaagccca gctcgccgac acccaccgag 3240 cctcgcctgt catatgcaga catggcaaaa aacgttaggg tagccgtgtt gcccgtggat 3300 ttcccacggg tcatgctcag tcacggagat ttgtctgtgt tggaagaggc catcatagac 3360 gaagttattg cgtctggtgg agatatcgcg gcctcattca cgggcattca cttccgggtt 3420 ggattcctgc taattgaatg ctccggtgag gcctcagccg cctggctgag gaccgcaaca 3480 tcgaggctga aatcgtggaa gggcgtgccc cttaagtgca aggtgggaga tgacataccg 3540 tcgccccact gcatcacgct gttttgcccc aggagtgtgg gtcggtccac cgaatccctg 3600 ttggttctgc tgaggaacca gaacaggatc gagaccgaca cctggaaggt gatctccagg 3660 aggaacgagg gtggaggagc cctcctggtg atcgcgatcg acgagctatc caaatgtata 3720 ttgtggagaa ggggcaccat gtcttctccg ctacggacca tccccgtaag tggactaaag 3780 aagaagactg gagcaaagcc ccaacctgct ccaacgggca aggaggatgt ctcagtgggt 3840 aacatcacca catcagacac tcctgcctcg gagcagagca gtccaaccgc gcagcccgcc 3900 gacccgtcgg aggacctcgc agaagacgag gaactggcag acgtgacgct ctgccaggag 3960 gggattctgg gggaggaaga catccccatg tcctcgcaag agctggcgga ggagctgcag 4020 gatgcagctg tagctgacgt cgttatgacg aacagcggtg acgggtgcag ctccaggccc 4080 cattcaccgt gtcagctgga gccttcccag aagtacatga cgaagggcat caacacaggg 4140 ctggcccagg taaacatcca ccgggctaag gcagcctcgg cggtcttagc aaggatgttc 4200 accaacaaac accttgggct ggccctggta caggagccgt gggtgaacaa tggcattaag 4260 ggcctgttca cagccgactc aaaggtaatc tgggatcgga gagatccagc acccagagcc 4320 tgtatcatgg taaggaaaag tattaacttt aatatccttt cagaattctt gactagagac 4380 tgcgttccca tattggtgta taccaaaggc agcgcggtct ctatggtcat cgtgtctgca 4440 tatttcgcag gggatgcgcc ctgcccacca ccagaggtcg aaaggctggt ggagtattgc 4500 aggaaagaga agatgccggt gctcatcgga tgcgatgcca gtgcgcacca tacgatatgg 4560 ggcagcagtg acatacattt aaggggtgag tgcctaactg attttatttt taaatataat 4620 ctagaactag aaaatgtcgg atctgctccg tcatttgtta ctaggatcag ggaagaggtg 4680 ctggacatca ccctaatcag tcggtcccta aagcctcacc ttagggaatg gcatgtttcc 4740 caagaggaat ccatgtctga ccataggact atcctattta atttaaaatt aaatacggat 4800 agtagcacac caggtcgcaa ccccaggaga accaactggg aaggctataa gtcgacctta 4860 gggctcaacc tagccaacgg actatctggt acacccagga atccgataga gctggacaga 4920 gccacggatg acctcaataa gtgcataatt agtgcatttg aggagaattg cccggtaggg 4980 aaaagattca tggaaaaaga tgccccatgg tggaacgaca gtctggaaag gctgcgcgtc 5040 accacacgtc gcctcttcaa taaagccaag agagacggaa tatgggaaca ataccgcgaa 5100 tgccttacct cctataataa ggagataaga aaggctaagc ggaagaatta cagagacttc 5160 tgtgaaagca tagttagcac aagcgaaggc gccaggctcc acagagcact ggcgaaacga 5220 acgcctgatg ctaaccttgc actgaagcgt ggggataatt ccttcacgat tagcaataag 5280 caaagattag aattactctt cgagacgcac ttcccaggct gcacacccct gcaggaagag 5340 gctatcgtag gaataagcag atatagaccg tccactgacg actgggcatg cgccaagtcg 5400 acagtcacaa aggagaaact aaattgggca attggcacat tccagcccta taaatctccc 5460 ggaatggacg gcatatcgcc agccttcctt caaacgggcc aggatatact cctctcccgt 5520 attaggaaag ctctagtaag tagcctagcc ctcggacaca taccgagcgc atgcaggaga 5580 gcaagggtag tcttcatccc gaaagcaggg aagaaagata ttaccgaccc gaagtccttc 5640 agacccatca gcttgacatc gtttttattg aaaacgttgg aaaagatggt ggactacaag 5700 attagaagca ctttgctcaa gcaaaggccg ctgcacccag cgcagcatgc atatagagta 5760 ggcaggtcta cggacacagc actttatcag ctgcaacgca ccttgagtgc ggcaattgat 5820 tataaggaag tagctttgtg cgccttccta gacatagagg gtgcctttga caatacatca 5880 cacgatgcga tcaaggacac cctctcgaga aggggcctgg atcctaccac cagcagatgg 5940 attctcgcac tgctgcgatc caggcaggtc acagcatcag tgcatgatag caccgtaacg 6000 gtcctaacca ccaagggctg tccccaaggg ggggttctgt ctccgctact ctggagtctg 6060 ttggtagacg aactactaaa cagactcact aacagtggta tacaatgtca aggttatgcc 6120 gatgacattg ttatcatggc gcgaggaaaa tttgaagaat cactctgtga catggtccag 6180 tctgggctaa ggataacgta tgactggtgt aaggaggtcg gactcaacct taaccctacg 6240 aaaacagtca tcgtcccctt taccagacga cataaactac agaggatgag gcaaatatgg 6300 ctctcaggta ccccactaga aagaagtagg gaggttaaat acctgggcgt catatttgac 6360 agtaaactta actttggcac ccatgtgcag aatgccatgt taaagtgctc cagagcgctt 6420 tacacatgtc gcagcatagc cggcaaatca tggggcacat caccaaagat agtaagatgg 6480 ctatacctaa tggtagtaag acccatgcta acctatgggg taatagcatg gggtgacaga 6540 gcacggttga tcaccgtgaa aaagcaactg caaaaattgc aaagaatggc ctgtgtctgt 6600 atgacaggag taatgtgcac ctgcccaaca atggcccttg aagccttaat ggagctcacg 6660 ccactccacc acatcataag gctcaagcag aaagcgacgc ttttaaggat gtcagcagaa 6720 ggagttggat gcccaactct ctcaaatgaa ctgcccctgc tattgcaacc cagggacgaa 6780 atgaaagtcg aatacatctt cgaacgtaat ttcacagttt acatgagcag taaaaggaac 6840 tggacaactc tggaagaggt ccaccctatg aagccgcaca ccataaggtg gtacacagcg 6900 gatcactcac caaccagggc acaggtctcg gtgtggtggg ccctcgggtg tcataccacg 6960 aatccctacg gaacgcacac aagcatattc caggctgagg tatgtgcgtt aggaaaatgt 7020 gcgattttaa ccttaaaacg taactatcgg aatacagaca catccatact atccgatagt 7080 caagcagcat taaatgcaat aacggggact aaaataacat caaagatagt ccaggaggct 7140 cgttcaaagc taaacctact tgggactcac aacaggcttg cctgcgatgg gtcccgggcc 7200 acagggatat accgggtaac gaggcggctg ataagcaggc gagaatgggg gcagaaaggc 7260 cccctgatag gaccagaacc gtattgtggc ataggcagac acactatacg gctggtactt 7320 agaaatgaag agaaacaggt gcggcagcag agctggtcgg aggcagtagg tctcaggcaa 7380 gccagatgtc tcctcggtgg ttataatctc aagcgattta agcaagttat aaccatggga 7440 aagaacaacc ttaggatcct caccggtcta atgacggggc actgtcgact aagaagtcac 7500 ctaactagac taggtatata tagtagcgat ctctgcaggt tctgtgaaat agaggaggaa 7560 tcctcggtac acatcctagc agaatgtgtc gcactagcta gaaggagatg cagcatcctg 7620 gggatgcatg tcttgaattt tagagacata gaagacctca acccaacaaa gatcctcaca 7680 ttcgttcggg aagtggggct gatggaagag ctataggctc agaagggggc acaatagatc 7740 taaaaggtcg cggtgcaact tccccaataa taataataa 7779 // ID DHUHUH3 standard; DNA; INV; 1658 BP. XX AC X63028; S51651; XX DR FLYBASE; FBgn0003948; Dhet\Uhu. XX FT source X63028:1..1658 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..50 FT SO_feature terminal_inverted_repeat ; SO:0000481:1609..1658 FT SO_feature TATA_box ; SO:0000174:293..296 FT SO_feature CDS ; SO:0000316:366..1121 FT /db_xref="FLYBASE:FBgn0044280; Dhet\Uhu" FT /db_xref="SPTREMBL:Q02881" FT /protein_id="CAA44763.1" FT /translation="MGKRTTIEQRNLILEHFKIGYSYRQIAKMVNLSTTTVFNIIRRFV FT DENRIEDKGRKAPNKIFTEQEKRRIIRKIRENPKLSAPKLTQQVQDEMGKKCSVQTVRR FT VLHNHDFNAQVPRKKPFISTKNKGTRMTFAKTHLDKDLEFWNTVIFEDESKFNIFDSDG FT RNYVWRQSNTELNPKHLKATVKHGGGSVMVWACISAAGVGNLVCIETTTDRNVDLRILK FT ENLLQSAEKLGIRRTFRFYQDNDQDNNQA" XX CC Derived from X63028 (Rel. 36, Last updated, Version 7). CC Michael Ashburner, 17-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1658 BP; 547 A; 315 C; 343 G; 453 T; 0 other; tatatacagt gtctcacagc tcaactggaa cagtgcctag caaaaaattt aattgcctgc 60 agtaaactaa ttatccaata ttttttaaaa attccaaaga ccgatggcag gtacatatat 120 taactaccat aatgaatata tgatcccaat aaactggggt tttccaccgg ctaggccggg 180 ttatgtaaca aagtacctta atttatggtt tcatattatt tggaacaatg gcgttatgga 240 cacctgggtg ccataaaacc cggatttttt acgtcaggtt gattattttc ggtataaata 300 gaccaatcct tcgtagtcag tttagttata tcctgcatct cgggtgcaac cagccaacaa 360 ggcatatggg caagcggact accattgaac aacggaatct gatcctggaa catttcaaga 420 ttggatattc atatcgccaa atagctaaaa tggtaaatct aagtaccaca actgtattca 480 acatcattcg gcgcttcgtc gacgaaaatc ggatagagga caagggcaga aaggcaccaa 540 acaagatttt caccgaacag gagaagcgga ggatcatcag gaaaataagg gaaaatccca 600 agctatcggc tccaaaactg actcaacagg tgcaggatga aatggggaaa aagtgcagtg 660 tgcaaactgt gcgccgggtt ctgcacaacc atgactttaa tgcccaagta ccacggaaga 720 agccatttat aagcacaaaa aataaaggga ctaggatgac gttcgccaaa acccacttgg 780 acaaggattt ggagttctgg aacacagtca tatttgaaga tgagtccaaa ttcaacattt 840 ttgactcgga cggacggaat tatgtgtggc gacagtccaa tactgagctg aatccgaaac 900 acctaaaggc aacagtgaag cacggcggag gaagtgtcat ggtatgggca tgtatctcgg 960 cagccggcgt cggaaatttg gtgtgtattg aaacaacaac ggacaggaat gtggacctca 1020 gaatattaaa ggaaaattta ctccaaagtg ccgagaagct aggaatccga cgtactttcc 1080 ggttctacca ggacaacgac caggacaaca accaagcata agtccggatt agtacagtcc 1140 tggcttatct ggaactgccc ccacatgata attccaccgg cccagtctcc agatgtaaat 1200 gttatttaaa atttgtggga tctgctggaa aataacatcc ggaatcacag atccaatctc 1260 aaaaatgttt tgctggatga gtggagcaaa atcagtccag aaactacccg gaagctggta 1320 tcttccatga ataataggtt aagggcagtt attaaggcta aaggatatca tactaagtgt 1380 taacatcctt atttaagttt ttatacgcca aatatgttac ttttttaaga ctgttcgaat 1440 taagctttga catgtatttt ggatatgttt tcagtttttg actaatttta attaattaat 1500 taatatttta gtaaaaacta aagattattt ttcaaacatg atatagcatg aaacaatttg 1560 gcatttaaac attttgcatt tgtttctttg tttaaacttt atagcacttt aaaatatttg 1620 ctaggcgctg ttccagttga gctgtaagac actgtata 1658 // ID DSRN standard; DNA; INV; 6644 BP. XX AC D83207; XX DR FLYBASE; FBgn0015168; Dsim\ninja. XX FT source D83207:1..6644 FT SO_feature five_prime_LTR ; SO:0000425:1..316 FT SO_feature three_prime_LTR ; SO:0000426:6329..6644 FT SO_feature transcription_start_site ; SO:0000315:183..188 FT SO_feature primer_binding_site ; SO:0005850:323..338 FT SO_feature CDS ; SO:0000316:<411..4493 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044186; Dsim\ninja\pol" FT /db_xref="SPTREMBL:O02006" FT /protein_id="BAA19771.1" FT /translation="RFRRPARICVSAASATYRQRSRSQRQWRAVSDIEGRSLRRTPVRT FT SRSQRQGDARSVICGDRSQRSEGVRDAILEAQRRHHLELGKMQHSPRKSARLNGGEATP FT ITTASQQPASSGAGTRTRVNITAASIPCPATTVTTVASQPRSTAVTAASSVPEVNQPLV FT LELMERIAALERELEKARSLESVSTANCAPIAVGPSAVGANSGASGRPPFWSGQPIPTS FT NGEALHNGVGSVPYNGDGASGAACTLPPSSSGPPLLTTSNYFVEPLCATGTAQPAHGLV FT LPGVSIHNAATASPLVGSYAATTPSGIQGAYGPRKLPDLPIFGGQPEEWPIFSCAFVET FT TRAYNCTDLENNQRLLKALKDEAREAVKALLIHPGNVSAVMEQLRFRFGRPEQLIRSQL FT NNVREVQPISEHNLAKIIPFATRVSNLAAFLQSAKAEQHLGNPTLMEELVAKLPTSKRV FT DWARHAATIAPFPTVVHFSAWLQEYANVVCTVLDVEGKEPRRRLLHASVDHNECDQQDD FT RHGGCSICGGQHGILNCRKFIAASPQERWSNVKRHRLCFNCLRSGHTARSCYTQGECQI FT NGCRREHHRLLHGADEERRPLQRGGFRRHEGNQQPTVSRRSPARRPSLRDGHKDQERNR FT QPAVPSNSLERGAPREAGAPMQRNLSCVDAEGGRLLFRILPVTLYGAGRKVDTYALLDE FT GSSVTMIDDELRRDLGVQGERRQLNIKWFGGKATREPTNVVSLKISGVGKPTRHVLRNV FT YAVSSLSLPMQTLSRRDVQGVHRDARLPMKPYSNVVPKLLIGLDHGHLGLPLRTRRFAR FT EGPYAAVTELGWVVFGPVSGQPTTPSPRSSLLAVSVDDAMEKMVEDYFDMENFGVKTAP FT PVAASDDVRAQRILEDTTVKVGRRYQTGLLWKDDHVVLPPSYEMAYRRLVNVEKKMKRN FT KPLAQEYDRIIKDYVSKGYARRLQPEEVAVRSDRLWYLPHFSVENPNKPGKVRLVFDAA FT AKVGGTSLNSELDKGPQHYKPLPAVLFHFREGAVGVCGDIKEMFHQVLIRPEDRCSQRF FT LWRDGDDERDPDVYEMNVMTFGAACSPSAAHYVKTMNALKYRDSDPRAVKAITDYHYVD FT DYVDSFATESEAISVSTRVKEIHKDAGFELCQFSSSSPTVETALGPGRVKSVGWGEAEE FT KILGMRWQVATDDFRFNVEYHRVPSSVLSGDRVPTKREYLSLVMSTFDPLGFLCCLMVT FT AKLLLREIWRQKIQWDEPLPEELSKAFAIWRKEMDAVGQFRCPRHYFGRGAVRAVELHV FT FVDASQAAFAAVAYWRVTYEDDDVQVSFVSAKTKCAPMRTMTIPRLELQAAVLGTRLMN FT TVKQEHSVVITDLLLWT" FT SO_feature CDS ; SO:0000316:<4471..6237 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044186; Dsim\ninja\pol" FT /db_xref="SPTREMBL:O02007" FT /protein_id="BAA19772.1" FT /translation="RTCCYGLDSKTVLRWIGSTHRRYKQFVGNRVAEILESSKVSQWRW FT VPTADNAADDATRSQKGVDLSQESRWLRGPAFLRQPAASWPGPEEGTERVPDAPDEEEM FT PSEFALVAADDFVIPFQRFSSFSRLVRTTAWVLRFARWCRKQRNELEEYGLTAAECKAA FT ENLLVRQAQLESFPDEMRSAETGQDVGGSSDIRGLVPYLDEDGILRAYGRIDAALCMPY FT SARRPVLLSHRHSLTELIVRDFHDRMKHQNVDATIAEIRTKFWVTKMRRVMRKSHLIVQ FT RVQVAATATDAADNGTPSGRQTGCGWMAIQIHRTGLLWATAGDCVPSQGEALGRLVYVF FT DDKGDSPGVAHDLSTDSCIIAIRNFVCRRGPVYRLRNDNGKNFVGADREARRFGDVFEM FT EKLQSELSSRSIEWVFNCPANPSEGGVWERMVQCVKRVLRHTLKEVAPRDHVLESFLIE FT AENIVNSRPLTHLPVDADQEAPLTPNDLLKGVANLPDTPGLDAECPRRVLRESSGGLLA FT CSETVSGGGGSWSTCLRLCAARSGAAERSPSTRVIWSSSAILPWPDESGARASWRRSTA FT ELMESSDALRCA" FT SO_feature primer_binding_site ; SO:0005850:6318..6327 FT SO_feature polyA_signal_sequence ; SO:0000551:6544..6549 XX CC Derived from D83207 (Rel. 53, Last updated, Version 4). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6644 BP; 1603 A; 1589 C; 2098 G; 1354 T; 0 other; tgtcgcggat cgaatattgt tatcgatagg ctagtatttt gagaagtccg aatgtggacg 60 gatttgtaag cccatatgtg tctgggcaca ttgttttcgc cattgtaaat tgccgggaaa 120 atttagcttt tcattgtcgt gcaagagttg gaggacacac tgcggtgagc taataagtta 180 agttagttac aattgtaaaa cattaattct tccagaataa aacgctttct actaccacga 240 attagtctgc cctttctttc gggaaccaat gcgtggagta gccgtttaag gcaactcctt 300 gtgacgcacg acgacaactt tttattcgca gtcctaggcg actgcagggg caacttgcgc 360 tggaatgacg gtttagacgg ccagctagag agttgccgga gctggagtga cggtttagac 420 ggccagccag gatttgtgtg agcgcagcca gcgctacgta ccggcagagg agtcgcagcc 480 agcgacagtg gcgcgcagtc agcgacatag agggacgcag cctgcgtcga acgccggtac 540 gaacgagtcg cagccagcga caaggagacg caagaagcgt catttgtgga gaccgcagcc 600 agcggtcaga aggcgtcaga gacgcaattt tggaagcgca gaggcgccac catttggagc 660 tggggaagat gcagcattcc ccaaggaaga gtgcccggct gaacggaggg gaagccaccc 720 ctataacaac agcgagtcag cagccagcca gtagtggagc aggaactcgg acgcgggtga 780 acatcacggc ggcgtcgatt ccttgcccgg ccactacggt gactacagta gcttcccaac 840 ccagaagtac tgctgtcaca gctgcgagtt cagtaccgga ggtgaaccag cccctcgtgt 900 tggaactcat ggagaggatc gcagcgttgg agagggagct ggagaaggct agatccctgg 960 aaagtgtgag caccgccaat tgcgcgccaa tcgcagttgg cccaagcgca gttggcgcca 1020 acagtggagc gtcggggcgg ccgccatttt ggagcggcca gccaataccc acatctaacg 1080 gagaggcctt acataacggg gtcgggtcgg tgccatacaa cggtgacggt gcgagcggtg 1140 cggcctgcac gctgccgcca tcttcgagtg ggccgccatt gctaacgact agcaactatt 1200 ttgtggagcc actgtgtgca acaggcactg cgcagccagc gcatggactc gtgctaccgg 1260 gcgtgagcat ccacaatgcg gcaacagcat cgccacttgt cggatcctac gccgcgacga 1320 cgccgagtgg aatccaggga gcatatgggc caaggaagct tccggacttg cctatatttg 1380 gagggcagcc cgaggagtgg ccgatcttca gctgtgcgtt cgtggagacg acccgagcgt 1440 acaactgcac ggacctggag aacaaccaga ggttgttgaa ggcgctgaag gatgaagcgc 1500 gcgaggcagt gaaggcgcta ttgattcatc cagggaatgt cagcgccgtg atggagcagc 1560 tgcgctttag gttcggccga ccggagcagc ttatccgcag ccagctcaac aacgtgcgag 1620 aggtgcagcc aatttcggag cacaatttgg cgaagatcat tcccttcgca actcgagtga 1680 gtaacctcgc ggccttcttg cagtcagcga aagcggagca acacctgggg aacccaaccc 1740 tcatggagga gcttgtggcc aagctgccaa cgagcaagcg agtggactgg gccaggcatg 1800 ctgcaacgat tgcgcccttt cccactgtag tccacttcag cgcgtggcta caggagtacg 1860 caaacgtggt gtgcacggtt ttggacgtcg agggaaagga gccgaggcgc cgacttctac 1920 atgcgagcgt cgaccataat gaatgcgatc aacaggatga tcggcatgga ggttgttcca 1980 tctgtggagg acagcatgga atattgaatt gcagaaaatt tattgcagct tcgccacagg 2040 aaaggtggag caatgtgaag aggcatcggc tctgcttcaa ttgcctgcga agcgggcaca 2100 cggctagatc ctgctatacg caaggtgagt gccagattaa tggatgccga agggagcatc 2160 accgtctgct acatggtgcg gacgaggagc gaaggccgct gcagcgaggt ggcttcagac 2220 gccacgaagg gaaccagcag ccaacagttt ccagacgcag cccggccagg aggccttcgc 2280 tacgagatgg tcacaaggac caggagagga accggcaacc agccgttccc agcaacagcc 2340 tggagagagg agccccgcgt gaagcgggag cgcccatgca gaggaatttg agctgcgttg 2400 acgccgaagg aggccgtcta ctgttccgta tactgccggt tacgctgtac ggagcggggc 2460 gaaaggtgga cacgtatgcg ctcctagatg agggatcctc cgtcacgatg atcgatgacg 2520 aactacgaag ggatcttgga gtgcaaggag agcgtcggca gctaaatatc aaatggtttg 2580 gaggtaaggc aaccagagag cctaccaacg tggtgagtct gaagataagt ggagttggaa 2640 agcccactcg ccatgtatta agaaacgttt atgccgtttc gagtttgagt ttgccgatgc 2700 agacattgag ccgacgagat gtccagggcg tgcacaggga tgcgcgtctg ccgatgaagc 2760 cttacagcaa cgtggtgccg aagctgctca tcggcctgga tcacggacat ctgggattgc 2820 cacttaggac gaggcggttc gctcgagagg gaccgtatgc ggccgtaacc gagctgggct 2880 gggttgtgtt tgggcctgta agtgggcaac cgaccacgcc gtcaccgagg tccagcctac 2940 ttgccgtgtc agtggatgac gcgatggaga agatggtgga ggactacttc gacatggaga 3000 actttggagt gaagaccgcg ccgccggtcg cagccagcga cgatgttcgg gcccaaagga 3060 tactcgaaga caccacggtg aaagtggggc gccgctacca gacgggatta ctctggaagg 3120 acgaccacgt tgtgctgcca ccgagctatg agatggcgta caggaggctg gtcaacgtcg 3180 agaagaagat gaagcgcaac aagccgttgg cgcaggaata cgatcggatc ataaaggatt 3240 acgtgtctaa aggatacgcg aggaggttgc agccggagga ggtcgcggta aggagcgatc 3300 gtctatggta tttgccacat tttagtgtcg aaaacccaaa caagcccggc aaggtacggc 3360 ttgtgtttga tgctgcagcc aaagttggag gaacctcgct aaactcggag ctggacaaag 3420 ggcctcagca ctataagcct ttgccagctg tgctctttca tttcagagag ggagccgtcg 3480 gagtctgcgg tgacatcaag gagatgttcc accaagtgct gatccgaccc gaggatagat 3540 gttcccaacg attcctctgg agagatggcg acgacgagag agatccggat gtctatgaga 3600 tgaacgtaat gacgtttgga gcagcctgct cgccgagcgc tgcgcattac gtgaagacta 3660 tgaatgccct gaagtatcgg gattcggatc cgagagcggt caaggccatc accgactacc 3720 attatgtcga tgactatgtg gacagtttcg ctacagagag cgaggctatc agcgtatcta 3780 cccgagtgaa ggagatacac aaggatgctg gattcgaatt atgccagttt tcatccagct 3840 cacccaccgt ggagacggct ttaggacctg gtcgagtcaa gagcgtcgga tggggtgagg 3900 ctgaagagaa gatcctcgga atgcgttggc aagtagcaac agatgacttc agattcaacg 3960 tggagtatca tcgagtgcca agcagcgtcc tgagtggaga tcgagtccct acgaagaggg 4020 aatatttgag cctggtgatg tcaacgtttg atcccctggg attcctgtgc tgcctcatgg 4080 ttacagcgaa gctcttgctg cgagagattt ggaggcagaa gatccagtgg gacgaaccac 4140 taccggagga gttaagcaaa gcctttgcga tttggcgcaa agagatggac gccgtgggac 4200 agttccgatg tccgcgccat tattttgggc gtggagcagt ccgggccgta gagttgcacg 4260 tcttcgtgga tgccagtcag gcagcattcg cggcggtggc ctattggagg gtcacatatg 4320 aggacgacga cgtgcaggtg agcttcgtga gtgcgaagac gaagtgtgcc ccaatgagaa 4380 cgatgacgat cccacggctg gagctacagg cagcagttct tggaaccagg ctgatgaaca 4440 ctgtcaagca ggagcacagt gtggtcataa cggacctgtt gttatggact tgactctaag 4500 acggtgctga gatggatcgg cagcacccac cgccggtata agcagtttgt tggcaaccga 4560 gtggcggaga ttttggagtc gtcgaaggtt tcccaatgga gatgggtgcc tacagccgac 4620 aatgcggctg atgatgcgac gcggtcccag aaaggagtcg accttagcca ggaatcaagg 4680 tggctaagag gacctgcatt tttgaggcag ccagcagcca gctggccggg gcctgaggaa 4740 ggaactgagc gtgttccaga tgcccctgat gaagaagaga tgcccagtga gtttgcatta 4800 gttgcggcag acgattttgt tattccgttt cagagattct cgagcttcag tcgcctggtg 4860 aggaccacag cctgggtcct acggttcgcg cgctggtgcc gcaaacagcg aaacgagctc 4920 gaggaatacg gccttactgc agcagaatgt aaggccgcgg agaacctgtt ggtcaggcag 4980 gcacaattgg agtcgttccc cgacgagatg aggtcggcgg aaactggaca ggacgtcggt 5040 ggatcgagcg acattcgagg attggtgccc tacctagacg aggacgggat tctgcgagct 5100 tacggcagaa ttgatgccgc actgtgcatg ccgtacagtg cgaggagacc cgtattactg 5160 tcacacaggc acagtctgac agagctgatt gtgagagact tccacgacag gatgaagcat 5220 caaaatgtgg atgctacgat tgcggagatc cggacaaagt tctgggtcac aaagatgaga 5280 cgtgtgatgc ggaagagtca tctcatcgtg caacgagtgc aagttgcagc gaccgcgacc 5340 gatgccgccg ataatgggac cccatccgga agacagactg gatgcgggtg gatggccatt 5400 caaatacaca ggactggact actttgggcc actgctggtg actgtgtccc gtcacaagga 5460 gaagcgttgg gtcgccttgt ttacgtgttt gacgacaagg gcgattcacc tggagtggcg 5520 catgacctgt cgacggattc ctgcataatt gcgatcagga acttcgtctg ccgtagaggg 5580 ccagtatata gactgcgcaa cgataacggc aagaacttcg tgggagctga cagggaagcc 5640 aggcgctttg gtgacgtatt cgagatggag aagcttcaga gtgagttgtc aagcagaagc 5700 attgaatggg tttttaattg cccagcgaac ccgtctgagg gcggagtttg ggagcgcatg 5760 gtgcagtgcg tcaagagagt actgcgtcat accctgaagg aagttgcacc gagggaccat 5820 gtattggaga gtttcctgat tgaggcggag aatattgtaa actcgcgtcc gctcacccac 5880 ttgcctgtgg atgcggacca ggaggcgccg ttgacgccaa acgatctact caagggagta 5940 gccaatctgc cggatacgcc tggattggat gcggagtgcc caaggagggt tctacgagaa 6000 agcagtggag gattgctcgc ctgctccgag accgtttctg gaggaggtgg gtcctggagt 6060 acctgcctac gcttgtgcgc cgcgagaagt ggtgccgccg aacggagccc atccaccagg 6120 gtgatatggt cttcgtctgc gatcctgcct tggcccgacg agagtggcgc aagggcatcg 6180 tggaggagat ctacagcgga gctgatggag tcgtcagacg cgctaaggtg cgcgtgaacg 6240 acaacggcct acctaggaca atgatgcgac ccgtctctaa acttgcagtt ttggatttga 6300 gtgaagcggt tcttcacggg gtcggggatg tcgcggatcg aatattgtta tcgataggct 6360 agtattttga gaagtccgaa tgtggacgga tttgtaagcc catatgtgtc tgggcacatt 6420 gttttcgcca ttgtaaattg ccgggaaaat ttagcttttc attgtcgtgc aagagttgga 6480 ggacacactg cggtgagcta ataagttaag ttagttacaa ttgtaaaaca ttaattcttc 6540 cagaataaaa cgctttctac taccacgaat tagtctgccc tttctttcgg gaaccaatgc 6600 gtggagtagc cgtttaaggc aactccttgt gacgcacgac gaca 6644 // ID DV26847 standard; DNA; INV; 691 BP. XX AC U26847; XX DR FLYBASE; FBgn0011601; Dvir\Helena. XX FT source U26847:11..701 FT SO_feature CDS ; SO:0000316:<3..540 FT /db_xref="FLYBASE:FBgn0026901; Dvir\Helena\RTase" FT /db_xref="REMTREMBL:CAB35346" XX CC Derived from U26847 (Rel. 63, Last updated, Version 2). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 691 BP; 235 A; 167 C; 145 G; 144 T; 0 other; TGCGGATGCA TATCCCTCCA ATTCAGCTTG AAGGAGTTAC CCTGGAGCAG CCGCCACAAG 60 CTAAGTATCT CGGCATCACC TTAGACAAGC GCCTTACCTT TGGGCCACAC CTCAAAGCTA 120 CGGTAAAAAA ATGTCGGCGC AGACTGCAAC AACTGCGGTG GCTCAACAAC AAAAGGAGCA 180 CCCTGCCGCT GAGATGCAAA AGAGCTGTAT ATGTGCACTG TATTTTGCCG ATATGGCTCT 240 ATGGAGTGCA GATTTGGGGG ATTGCAGCCA AATCAAATTA TAAACGCATA CAGGTACTGC 300 AGAATCGGGT ATTACGACAG ATAACCAACT GTCCCTGGTA CGTACGCGGT TCTACACTCC 360 ATAAAGACCT CAAAGTGCAC ACAGTCGAAG AACAGATTGG AAGGCACACA AGCAGATACA 420 GCGACAGACT GCTGAGACAC CGCAGCCTGC TCGCCAGAGG ACTACTCCCT GCCCAACCTC 480 TAAGGCGACT CAAACGGCAA GGTTTTGCCA AGACGATTGG GCGCCAGTAA GACCACCCGC 540 ATTAAAATCC TCATACTACG TTATGGGGTT CGGCTCTAAA TAACATTTCT ATCCATGTGT 600 TGTTAAGGTA CTAACCATTA TGATTGTTAC AGGTTCTACA CTTAATAATA AAAAAAAAAA 660 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 691 // ID DV49102 standard; DNA; INV; 4158 BP. XX AC U49102; XX DR FLYBASE; FBgn0015679; Dvir\Penelope. XX FT source U49102:1..4158 FT SO_feature CDS ; SO:0000316:617..3190 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044148; Dvir\Penelope\ORF1" FT /db_xref="SPTREMBL:Q24736" FT /protein_id="AAA92124.2" FT /translation="MERSPEPSININGRHAVCTATNMSYAKIKTKYKDSKRTINKFQLT FT LVKLTKLKSSLKFLLKCRKSNLIPNFIKNLTQHLTILTTDNKTHPDITRTLTRHTHFYH FT TKILNLLIKHKHNLLQEQTKHMEKAKTNIEQLMTTDDAKAFFESERNIENKITTTLKKR FT QETKHDKLRDQRNLALADNNTQREWFVNKTKIEFPPNVVALLAKGPKFALPISKRDFPL FT LKYIADGEELVQTIKEKETQESARTKFSLLVKEHKTKNNQNSRDRAILDTVEQTRKLLK FT ENINIKILSSDKGNKTVAMDEDEYKNKMTNILDDLCAYRTLRLDPTSRLQTKNNTFVAQ FT LFKMGLISKDERNKMTTTTAVPPRIYGLPKIHKEGTPLRPICSSIGSPSYGLCKYIIQI FT LKNLTMDSRYNIKNAVDFKDRVNNSQIREEETLVSFDVVSLFPSIPIELALDTIRQKWT FT KLEEHTNIPKQLFMDIVRFCIEENRYFKYEDKIYTQLKGMPMGSPASPVIADILMEELL FT DKITDKLKIKPRLLTKYVDDLFAITNKIDVENILKELNSFHKQIKFTMELEKDGKLPFL FT DSIVSRMDNTLKIKWYRKPIASGRILNFNSNHPKSMIINTALGCMNRMMKISDTIYHKE FT IEHEIKELLTKNDFPPNIIKTLLKRRQIERKKPTEPAKIYKSLIYVPRLSERLTNSDCY FT NKQDIKVAHKPTNTLQKFFNKIKSKIPMIEKSNVVYQIPCGGDNNNKCNSVYIGTTKSK FT LKTRISQHKSDFKLRHQNNIQKTALMTHCIRSNHTPNFDETTILQQEQHYNKRHTLEML FT HIINTPTYKRLNYKTDTENCAHLYRHLLNSQTTSVTISTSKSADV" XX CC Derived from U49102 (Rel. 69, Last updated, Version 4). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4158 BP; 1694 A; 817 C; 666 G; 972 T; 9 other; gatcgaaaaa agcaacgtcg tttaccaaat accatgtggc ggggataaca acaacaagtg 60 caatagtgtc tacataggta caacaaaatc gaagctaaaa acaaggatta gtcaacataa 120 atcggacttc aaactaagac atcaaaataa tatacagaaa acagcactta tgacccattg 180 tataagaagc aacccacaca ccaaattttg atgaaacaac aatcttacaa caagaacaac 240 actataacaa gcgacacaca ttggaaatgc tacacataat taacacacca acctacaaac 300 gactaaacta caagacagac acagaaaatt gcgctcactt gtacagacac ctcttaaaca 360 gtcaaacaac ctcagtaaca atctccacgt caaaaagcgc agacgtgtaa aataatgtat 420 gtaaaatgtt cgaaataatg tttaatttat tgtattataa ttgttaattg ttttttgtat 480 cttggtgtta gtgccctgaa gacggtttgc cgatgtgcaa ccgaaatata tcggaagaga 540 attgaataaa attgtttttc attgtttgtt ttaacaaact cggacctcga gccagccaac 600 aaataaatat tgaaatatgg aaaggtcgcc agagccatca ataaatatca acggaaggca 660 cgccgtatgc acagcaacca acatgagcta cgcaaaaata aaaactaaat acaaggattc 720 gaaaagaaca attaataaat tccaactaac actggtaaaa ttaactaaac ttaaatctag 780 tttaaaattt ttgttaaaat gtagaaaatc aaatttaata cctaacttca tcaaaaactt 840 gacacagcat ttgaccatac tgaccactga caataaaacc caccctgaca taacaagaac 900 attgactaga cacacacatt tttaccatac caaaatatta aacttactta taaaacacaa 960 acacaaccta ttacaagaac aaacaaaaca tatggaaaaa gcaaaaacaa acatagaaca 1020 actgatgacc acagatgacg caaaagcgtt ttttgagagc gagagaaata tagaaaacaa 1080 aataacaaca acactcaaga aaagacaaga aacgaaacac gataagttac gagatcaacg 1140 gaacctagcc ttagcggata acaacacgca aagagagtgg tttgtaaaca aaacaaaaat 1200 agaattcccg ccaaacgtcg tagcgttact cgcaaaaggg ccgaagttcg ctctcccaat 1260 cagcaagaga gattttcctc tcttgaaata catcgcagac ggtgaggagc tagtgcaaac 1320 aataaaagaa aaggaaacac aagagtcggc gcgcacaaaa ttctctttgt tagtcaaaga 1380 gcataaaacc aagaacaacc aaaacagtag ggatcgagca atactggaca cagtggaaca 1440 gacacgaaaa ttactgaaag aaaatataaa tattaaaatt ctatcgtcgg ataagggcaa 1500 caaaaccgta gcaatggatg aggatgaata taaaaataaa atgacaaata ttttagacga 1560 cttatgcgcg tatagaacat tgagactgga tccgacatca agactacaga caaagaataa 1620 caccttcgta gcacaattat tcaagatggg tcttatttca aaggacgaaa gaaataagat 1680 gactacaaca acagcggtac ctccgaggat atatggacta ccaaaaatac acaaggaagg 1740 aactccactg agaccaatat gttcttccat aggatctcca tcttacgggc tgtgcaaata 1800 tataatacaa atattaaaaa atctgacaat ggactctagg tacaacatca agaacgcggt 1860 agattttaaa gacagagtca acaactccca gattagagaa gaggaaacat tagtatcttt 1920 tgacgtagta tccttatttc ccagcatacc aatagaatta gcacttgaca caataagaca 1980 aaaatggacc aaattagaag agcacacgaa tataccgaaa caactattta tggacatagt 2040 tagattttgc atagaggaaa acagatattt caaatacgaa gacaaaatat acacacaact 2100 taagggaatg ccaatgggat caccggcttc cccagtaatc gcagatatat taatggagga 2160 actgttggac aagattacag ataaattaaa aattaaacca agactcttga ccaaatatgt 2220 agatgacctt tttgccataa cgaacaaaat agacgtggaa aatattctaa aagaattgaa 2280 ttccttccac aaacagataa aatttacaat ggaattagaa aaggacggga aattaccatt 2340 tttagactct attgtaagca gaatggacaa cacactcaaa ataaagtggt ataggaaacc 2400 catagcctcc ggacgaatac tcaacttcaa ttcaaaccac ccaaagagta tgataatcaa 2460 tacagcacta ggctgtatga atagaatgat gaaaatatcg gacacaatat accacaaaga 2520 aattgaacat gaaatcaaag aacttttgac caaaaatgac ttccccccaa atataatcaa 2580 aacattatta aaaagacgac aaatcgaaag aaaaaagcca acagaacctg ctaaaatata 2640 caaatcacta atatatgtac cacgactatc agaacgcctc acaaactcag actgttataa 2700 caaacaagat ataaaagtag cacacaaacc gacgaataca ttacaaaaat tcttcaacaa 2760 gataaagtcg aaaatcccga tgatcgaaaa aagcaacgtc gtttaccaaa taccatgtgg 2820 cggggataac aacaacaagt gcaatagtgt ctacataggt acaacaaaat cgaagctaaa 2880 aacaaggatt agtcaacata aatcggactt caaactaaga catcaaaata atatacagaa 2940 aacagcactt atgacccatt gtataagaag caaccacaca ccaaattttg atgaaacaac 3000 aatcttacaa caagaacaac actataacaa gcgacacaca ttggaaatgc tacacataat 3060 taacacacca acctacaaac gactaaacta caagacagac acagaaaatt gcgctcactt 3120 gtacagacac ctcttaaaca gtcaaacaac ctcagtaaca atctccacgt caaaaagcgc 3180 agacgtgtaa aataatgtat gtaaaatgtt cgaaataatg tttaatttat tgtattataa 3240 ttgttaattg ttttttgtat cttggttgtt agtgccctga agacggtttg ccgatgtgca 3300 accgaaatat atcggaagag aattgaataa aattgttttt cattgtttgt tttaacaaac 3360 tcggacctcg agccagccaa caaataaata ttgatattaa atgataaagc tatatataat 3420 taactgaatc cacaaataaa caaaccaggt caagtagatc tggttctcgg cagctgacct 3480 aacacactca tttttttaaa attgaaataa tnnanatagc gggcttctcg gccagaacac 3540 agaattcggn cganattcaa aggagaggnn anatagtcgc gtcagccatt gaggttaaag 3600 atttggaaag attttgagag ttagaagaat aaaacaaaga acatgcagaa taccctgaac 3660 cctcaacttc caacatcaaa tacgatatat attcttctct agaagctata tgtcatgttt 3720 ggatcctagc tcttattatt taccaaaatt cccaaaaaac acgatatcga tatcgatttt 3780 tatcgattgc ttggaaacgg agtagtttat cgattatcgg aaacaatatc gatctgcgct 3840 ggcactagga gcacctacat ctaaaatttc aagtctctac cgcttatagg ttctgagatc 3900 cttgcgttca tatatacgga cggacggacn gtatacggac ggacagacat acgcatagct 3960 agatcgactc ggctattgat gctgatcaat atatacactt tatgggctcg gagatgctac 4020 cttctgcctg ttacatacat ttggattttc acaatacccc ttacaatata cccctatacc 4080 catatttaat gggttcaggg tgtaagttca aagttccctt gccaggattc gaactggcaa 4140 ctgaccgtat tacttgct 4158 // ID DVULYSS standard; DNA; INV; 10653 BP. XX AC X56645; S37633; XX DR FLYBASE; FBgn0004146; Dvir\Ulysses. XX FT source X56645:1..10653 FT SO_feature five_prime_LTR ; SO:0000425:1..2136 FT SO_feature three_prime_LTR ; SO:0000426:8518..10653 FT SO_feature polyA_signal_sequence ; SO:0000551:536..541 FT SO_feature polyA_signal_sequence ; SO:0000551:570..575 FT SO_feature TATA_box ; SO:0000174:580..584 FT SO_feature TATA_box ; SO:0000174:612..616 FT SO_feature polyA_signal_sequence ; SO:0000551:615..620 FT SO_feature polyA_signal_sequence ; SO:0000551:1138..1143 FT SO_feature primer_binding_site ; SO:0005850:2140..2150 FT /bound_moiety="tRNA-lys" FT SO_feature CDS ; SO:0000316:3422..4870 FT /db_xref="FLYBASE:FBgn0044141; Dvir\Ulysses\ORF2" FT /protein_id="CAA39966.1" FT /translation="MGRRSATAEPSWRAGANANEFPGSLMRETDPCKGGSPGRLGGKNW FT RNSLDLHGTWSSQKPRKTQCSNPAANAAIAQAHQRISACSAIWHQRVHPRGDASWLHGN FT DEGHPLMCLGRQGAKPAETNTAPKPPRTCRAGTANRRHPERTEVFQREHSTTVPPKPRR FT QAHRAEYSSSSPVYVADAQSRNWRNPNNRHFRVEDRRADDFQPEEAYANVRETAYMKLE FT RWNVKFDGEDAMNSVEDFVFRLEFLQRQYQCPWKEVLRGFHLLLTGRAREWYWMHVRHS FT RVDSWMQLRHALLDRFRGYQTEHEVMQELLQREQQASEGVDDYIHHMRQLAARFQKPLR FT DRELVRIIKRGLKESLAKYIYAMDVLTVDELRQECLEVERHMGRRSRTGYLQPSRCPQG FT TRPVVHEVEVPPHLTETPPGELEEAFVRTRNSSELYAGTRDSSTTSLETACPRSGKYSA FT IGVESRTRSARSVKTVRETPEGAR" FT SO_feature CDS ; SO:0000316:4836..8027 FT /db_xref="FLYBASE:FBgn0044142; Dvir\Ulysses\ORF1" FT /protein_id="CAA39967.1" FT /translation="KLSGKPQRERGDGGTDAFRDGDRGKAGEGHLPEPIIIGKETEIWN FT KDQDKANYNNSISKGGTLRGLPYEERVRAYMSARNRIFGERQLEGMTLATRRMVKARAR FT FRRRRITRRQVVEAVRREESIDPRVFAEVEVAGAKMKGLLDTGASVSLLGQGCRELVEK FT LGWEARPYESMVRTACMGANRPILGRVVLPVKYGIERLDIVFYMCPDLRQELYLGIDFW FT RAFEIAPELLGPARKSETPPEASEVTVANPEVAYYRDDDDCVTDPEMWDLDNDQRSQLE FT SVKRRFLQFEKDGLGKTHLLQHRIQLIEGAEPVKDRLNPLSPAKQEIVWAEVDKMLKLG FT IIEESDSPWSNRTTVVMRPGKNRFCLDARKLNSVTVKDAYPLPCIEGILSRSTRLILSL FT ASTLSSRSGNRDGGEEQGVYGVYCTRRPLYQFRHMPFGLCNAAQHFEAHDKVIPANLRS FT NVFVYLDDLLIISADFPTHLKYLELVAECLRNANLTIGMAKSKFLFRNLNYLGFIQLRR FT RTWRMDPGRVEAIRNIPNPRTVKELRSFLGTAGWYRRFIKNFAEISVPLTDALKKRTGR FT FVLSDEAIEAIESLKLALTTAPVLVHADFRRPFFIQCDASHYGVGAVLFQLDDEQQERP FT IAFFSAKLNKHQINYSVTEKECLAAKLAIHRFRPYVEMMPFTVITDHASLQWLMSLKDL FT SGRLARWSLELQAFPFSMQYRKGADNVCRHIVRSVEEVELTPEDLLGFQTPEFESPNIE FT ELIREVMSQQGKFPDLSSGRTDFSSRGTVHESLEDEVEGTSWKLWVPESLTAGLIQQAT FT RRTRRSAHGGMRKTLHALARQYYWPNMAIQVRDYVRKCDTCKETKAQNYRMQVGIGEEV FT RTDRPFQKLYIDFLGKYPRSKRGHAWIFVVVDHFSKFTFLKAMREATAADVVNFLVHEV FT FFKFGVPEVIHSDNGRQFVSKSFDAMVQAFGITHLRTPVYSPQSNAAERVNRTVLSAIR FT TYLGQDHREWDAYLPEVEVAIRNAVHSGYGSHSVLRGLWTADVPEWFQLQTGQEAVDHW FT PTTVFLTLTQRTDWL" FT SO_feature polyA_signal_sequence ; SO:0000551:9053..9058 FT SO_feature polyA_signal_sequence ; SO:0000551:9087..9092 FT SO_feature TATA_box ; SO:0000174:9095..9100 FT SO_feature polyA_signal_sequence ; SO:0000551:9101..9106 FT SO_feature TATA_box ; SO:0000174:9129..9134 XX CC Derived from X56645 (Rel. 38, Last updated, Version 6). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 10653 BP; 2889 A; 2390 C; 2887 G; 2487 T; 0 other; tgttgcattt gacgttatcg gcagcaccga gggttactca gttggttgcg tgcgatatta 60 tgggccactg ccgatatctt tttggcatat ggtcggagta tgccgggtga agctgggccc 120 acggcggacg caaaattttg aagggtggca acgccagaca tgcgatccac ggcgccaaaa 180 gcgacgattg ctattggccg cccgtttcgc tcttaaaggg aatacgagga gagtggatcg 240 catgcccttg gatatgccct tggccttcat ttgggccgtt gacccaaaaa aatgcagctg 300 tgcagatgag cagcaacggt gcccgcctcg caaatgtatg cgtgtgccga aattcggcac 360 cgatggtgtg gctgaaaaaa aatgtcgagg agttcgattt gttcagcggt taacggcggg 420 cgtgccagct cgaagtgagc aaaaaagttg gacattgcat ttctgcttga cagagtaaaa 480 ttttttccct tctgaaaaat taagccggca cataatacca atggcttaac agacgaataa 540 aagttttgtt tttgtgttct ttgccatgta ataaatttat ataaatcaaa ccgctcgaat 600 cacaaagaaa atataataaa aaaattgaaa gagacttaaa aatttaagtt gcgacgagag 660 acgaaccaaa taagccaaaa attaacggct ccggcaccaa ggagaccggc tcgcgcacca 720 gagaggccca cgaaatattc aaggccagga ttcagtgcag cggcgcaaac cagccatatc 780 cgcgacgtcc gccagcaagc gtcggcgcaa acgtgaaggc gacgagacga gccaccaacg 840 accgggctcc aacaacgaag aagtcggtcc acaggcgtag accccggtcc cacatccacc 900 gtagtgagtc cgttccattg ttctggccca tccctcggtt cgagcccctg gcatgagttc 960 agaagaattg actcctaatt gcactccctg aaaaggagta gatttttttt tatcgttaaa 1020 tttggttaat ttgtatttat aaaatatatt catacataca taggcactta cgccgagctt 1080 cactttagag tgcgagcggt gaggaaaact ccaacggtga tgcatctgcg cctgtaaaat 1140 aaaaaaaaaa aaaaaatcta aaattaactc ttgcgcgaac tatgctcaca tacttaccta 1200 atttgaattt acgcatttat atcgcccctt accacccccc agagagctct ttaggtaggg 1260 taaggtagaa ttaagcgatt taatacatac aatacacaat acgtgacata cccagtttta 1320 agtaattgtg cgggccattg ttcctaaatc ggggcaacga ttgcctctaa agtgcctcat 1380 tgcgcgtttc ttttgcggcg actagccaat atccttgtag tgtttcgtcg cctgttccgt 1440 tcagtgttgt aaaaaaaaaa aaaaaaaatt aattctataa gataagagaa aaatttagga 1500 attaatgcgc tgtgtcaatt agtataagta tatgcagcag taatacgcac ctgtgaagtt 1560 aatgaattat ccgcttgcat gagatcctga acagctgcaa gataaatgcc ataagcaatg 1620 attatgattg ttatacattt taagaaaacg caaccaatta tacgcaccta ttgaaagaaa 1680 tcatgtccgg ctaaaataat attgctgggg agagtgggca ttatggccga cacgggctaa 1740 tataggccat ttggtggcca aaatgaggga gacgagtgga ggtgatcgtc tcatggtagg 1800 gcgggtacgc tgggtccgga accggggccc gatctggaag agtaaggtgg ataaatagtg 1860 tttttttttg ttaatttata ttgagtgtct tacctttgtt gtttacattt tcatagttgt 1920 tacagctgtg gattttgttc atgagttagt agttccgaat atttgttatc gattatgcgg 1980 ttcattttga gtagtgttgt gtgagtctat tttgctacag acgtgtagct ttgttgttat 2040 cgattgtgcg gactgcgggc cagggatggg accacaccca gagaatccga tgagccagcc 2100 ggcgcggccc actgagaaac aagcagttcc ctaacaatat ggcgcccaaa ttgaagaagc 2160 actcggcttc tttgaagtcg gtgtgtaatc aaagaaaaat aaatacgata aaaatgaagg 2220 tacagaggaa aggactccct tggcttcagg accccacacc gctgaagtct tatgtcggca 2280 gagtaggtag ctcataatgc gcacggtgta gtggaatccg agaagcacga cggaagtaga 2340 ggttatctga atatccatcc ataagcatga ggggtttttt ttgtagtggt tgatggccgt 2400 tcgcatgttg gcgaaacatt tgcgactatc catcgttcgc ctggtggtag ccgcaacgac 2460 aaaggcatct catggttaaa aaaatgtgat attagggtag tttagtcaga tgttatttat 2520 tcaattagtt gtttttgtct cattttgtgt taatgttgat gtatgctggc cgtagttata 2580 cgagcttcgg agaccggttg acccagacga tatagagccg agcctgcgcc gcatctgtca 2640 cacagcatat cccaagaatg gccagaagag cgacggagaa cggatcttgc catggtgtcc 2700 attgacatgg ggtgttttct tgtcgcggtt gatgactggt aagtcacgtt tgctagttgg 2760 tgaacgttcg cgaccgtcaa tcgtagcctg gtggtagccg ccatgacaaa ataacctgtg 2820 aacgaaatgt gatattaggg taggtctgac tagagtttgt ttaatgtttt caatttgttt 2880 tgtcccactc tgtgttaacg tttggtgtgt gctggccgta gttaaatgag cttcggaggc 2940 aggtgaccca ggacgatata gagccgatcc tggcgccgca cctgtcacgc agcatactct 3000 aagggtgcac agaagtgcga cagaaaccaa tagccttcaa ggtagcgttc cttgcatgcc 3060 gtccgtcagt caaactgacg aactgagcat gggacgaatg ccttaagtgt tgtccactgt 3120 ccagattatg cgtagattat ttagcctgtg ttaggatttg tcgtctctcc ccaagtattt 3180 gaaattaatt agaaagaaaa caaaattttt gtgaagtggt ttttgtggat caatgtgtat 3240 taatttatta ttctttatta gtaagatttg ttagtgggat gtttatttat ttattaattt 3300 attagaaata ttagtaaagt ttttataagg gttgtacgcc ttgtagaaga tagggggtac 3360 tgttctcaga aggaataagt atggctgaga aacaggtgtc tgattgggct tctgacgact 3420 gatggggcgt cgctctgcca ccgccgagcc cagctggagg gctggtgcga acgccaatga 3480 atttccgggg agtttaatga gagagacaga tccctgtaaa ggagggtcac ctggtagact 3540 tggaggaaag aattggcgca acagcctaga cctccacgga acctggagtt cccaaaagcc 3600 ccgaaagact cagtgctcaa atccggcggc aaacgcggcg atcgctcagg cccaccaaag 3660 gatatctgca tgctctgcca tctggcatca gcgagtccat ccgcgaggag atgcgagctg 3720 gcttcatgga aatgatgaag ggcatccatt gatgtgctta ggccgccagg gagccaaacc 3780 cgccgaaact aacactgcgc cgaagccgcc aaggacatgt cgagccggga ccgcgaacag 3840 gcgccatccc gaaagaacgg aagtgttcca gcgcgagcat tccaccaccg tgccaccgaa 3900 gccacgtagg caagcgcaca gggcggagta ttcgtccagc agcccagttt acgtggcaga 3960 tgctcagtcg aggaactggc gtaacccgaa taaccgccat ttccgagtag aggaccggag 4020 agcggatgat tttcaaccgg aggaggcgta cgccaacgtg cgagaaactg cgtacatgaa 4080 gctcgagcgg tggaacgtca agtttgacgg cgaggacgcg atgaattcgg tcgaggactt 4140 cgtgttccgc ctagagttcc tgcagaggca gtatcagtgc ccgtggaagg aggtcctgcg 4200 cggtttccac ctgctcctga ccggccgcgc ccgcgagtgg tactggatgc acgtgcggca 4260 ttctagggtc gacagctgga tgcaactgcg gcatgccctc ttggacaggt tccggggcta 4320 ccagacggaa cacgaggtaa tgcaggagct tctacaacga gagcaacagg ccagcgaagg 4380 ggtggatgat tatatccacc acatgcgtca actcgccgcg cgattccaga agccgctgag 4440 agaccgagag ctggtgagga ttataaagcg tggtttaaaa gaaagtctgg cgaaatatat 4500 ttatgccatg gacgtactca ccgtggatga gctgcgccaa gagtgcctag aagtggagag 4560 gcacatgggt cgcagaagcc ggactgggta ccttcagccg tcgcgttgtc cacaaggaac 4620 gaggcccgta gtccacgagg ttgaagttcc accgcatctt acagaaacgc caccgggaga 4680 actggaggag gcattcgtgc gaacgaggaa ctcatccgaa ttgtatgctg gaactcgaga 4740 cagttcgacc acgtctttag agactgcctg tccaaggagc ggaaaatatt ctgctatagg 4800 tgtggaaagc cggacacgtt ctgctcgcag tgtgaaaact gtccgggaaa ccccagaggg 4860 agcgcggtga tggcgggaca gacgcgttcc gggacggcga ccgcgggaaa gcaggagagg 4920 ggcacctacc agagccaata atcattggaa aggagacgga gatttggaat aaggatcagg 4980 ataaggctaa ttataataat agtataagca aaggaggaac acttaggggc ttaccatacg 5040 aagaacgtgt tcgggcatac atgtcggccc gaaatagaat ttttggtgag cgacagctgg 5100 aggggatgac gctagccacc cggagaatgg tgaaggcacg tgcgcgcttc cgcagacgca 5160 ggattaccag gcgccaggtc gtcgaagcgg tgagacggga ggagagtata gacccacgag 5220 tattcgccga agtggaggtc gctggagcca aaatgaaagg gttgctggac acgggggcct 5280 cagtcagtct gttgggacaa ggatgccggg agttagtgga gaaactggga tgggaagcgc 5340 ggccatacga atcgatggtg aggacagcat gcatgggtgc caatcgccca attttaggtc 5400 gcgttgttct gcctgtgaaa tacggaatag aacggttaga tatcgtattt tacatgtgcc 5460 cggacctgcg acaggaacta tacctgggaa tcgacttctg gcgagccttt gagatcgcac 5520 cagagctgct cggtccggcc agaaagtccg aaacaccacc agaggcatct gaggtaacgg 5580 tagcgaaccc agaggtggct tattaccggg acgacgacga ctgcgtaacg gatccggaaa 5640 tgtgggacct ggacaacgat cagaggagtc agttggaaag cgtgaagcgc agatttctcc 5700 aatttgagaa ggatggccta gggaaaactc acctgctgca gcaccgaatt cagctgatcg 5760 agggagcaga acccgtgaaa gacagactta atccgctgtc cccggccaaa caggagatcg 5820 tgtgggccga ggtggataag atgctaaagt tgggcatcat cgaggagagt gacagcccct 5880 ggagcaaccg gacgacggtg gtgatgaggc ctgggaagaa caggttttgc ttagacgcca 5940 ggaaattaaa cagtgtaacg gtaaaagacg cgtacccgct tccatgcatc gagggcatcc 6000 tatcgcgatc gacgagactc attttatctc tagcgtcgac cttaagttcg cgttctggca 6060 atagagatgg aggagaagag cagggcgtat acggcgttta ctgtaccagg aggccgctgt 6120 accagttccg ccacatgcca ttcgggctct gcaacgccgc tcaacacttc gaggctcatg 6180 acaaggtaat cccggcgaat ctgaggtcca acgtattcgt atacctggac gacttgctga 6240 taatatcggc agattttcca acgcacttaa aatacttgga attggtggcc gagtgtctga 6300 ggaacgcgaa cctcaccata ggcatggcga agtcgaaatt cctgttccgc aacctaaact 6360 acctgggttt cattcaatta aggcggcgga cgtggcgcat ggatccggga agggtagaag 6420 cgatccggaa catcccgaat ccgaggacgg tcaaggaact acgaagcttc ttaggtacgg 6480 cggggtggta ccgccgattc ataaagaatt tcgctgagat atcggtaccg ttgactgatg 6540 cccttaagaa gagaacgggt agatttgtgt tgagcgacga ggccatagaa gccatagaga 6600 gcttaaagtt agccctcacc acagccccgg tgttagttca cgcagatttt cgaagaccat 6660 tcttcatcca atgcgacgca tctcactacg gagtaggagc tgtgttgttc cagctagacg 6720 acgaacaaca ggaaaggccg atcgcattct tctcggccaa actcaacaag caccagatca 6780 actattcggt gaccgagaag gaatgcctag ccgccaaact ggccatacat cgattccggc 6840 cgtacgtgga gatgatgccg ttcacggtaa tcaccgatca tgcgagtctg cagtggctta 6900 tgagtctcaa agacttgagc gggagattag ccaggtggtc cctcgaactg caagcgttcc 6960 ctttctccat gcagtaccgc aagggagccg acaacgtgtg cagacacatt gtccgaagcg 7020 tggaagaggt cgaactgacc ccggaagatc tgttgggatt tcagaccccg gagttcgaga 7080 gtccgaatat cgaggagctg ataagagagg tgatgagcca acaagggaag tttccggacc 7140 ttagcagtgg caggacggac ttctcttcaa gagggaccgt gcacgagagc ttggaggatg 7200 aggtcgaagg cacaagttgg aagctctggg tgccggagtc gctgacggca ggactcatcc 7260 aacaagcgac acggcggacg agacgctcag ctcatggagg catgaggaaa acgctgcacg 7320 cgctggccag gcaatattat tggcccaaca tggccataca agtcagggac tatgttcgga 7380 aatgcgatac atgcaaggag accaaggcac agaactaccg aatgcaggtc ggaattgggg 7440 aagaagtgcg caccgaccgc cccttccaga agctatatat cgacttcttg gggaagtatc 7500 cacggtcgaa acgagggcat gcgtggatat tcgtcgtcgt agaccacttc tcgaagttta 7560 ccttcctgaa ggccatgagg gaagccaccg cggcggatgt cgtgaacttc ctagtgcatg 7620 aggtgttctt caaatttggc gtgccggaag tgatccattc ggataatgga cgacaattcg 7680 tgtcaaagtc gttcgatgcg atggtccagg cgtttggcat tactcacctg cgcacaccag 7740 tgtattcgcc tcagagcaac gccgccgaac gcgtgaaccg cacagtgctg tcggcaatcc 7800 gaacctacct gggtcaggat catcgggaat gggacgcgta cctaccagag gtggaagtgg 7860 cgatccggaa tgcggtccac agcggctacg ggagtcactc cgttcttcgc ggtctttgga 7920 cagcagatgt acctgaatgg ttccagttac aaactggcca ggaagctgta gatcattggc 7980 cgaccacagt atttctgacc ttgacgcaaa ggacagactg gctgtaatcc gaagccaagt 8040 caaggaccac ctgcacaccg catacgagcg aagtcgccaa cggtacgacc accgcgcgcc 8100 gacagctcca tctggaaccg ggacaggaag tgtggaggcg caacttcgca ctaagcagct 8160 tcggcaaagc cttcaatgcc aagttcgccc ggaagttctt gaagagccgc gtcgtccgag 8220 ccgtgggaac caacgcctac gagttggaag atctccaggg acgcgtgcta ggcgtcttcc 8280 acgcaaaaga tatacgaaca taatcgtaat ttgctcacct ccttgccagg caacggcggc 8340 ggcagcgcca gtccccccct ttttttttgg gtcagcaaaa aaaaaaaaat atagattact 8400 tgtcgtgggt gtggcctacg ttgggcatcg ttattagccg gctgggtcaa tccatgaaat 8460 gcgcacgcag attccttgta actgtccttc atgtcgttac aataatctgc gtggtgatgt 8520 tgcatttgac gttatcggca gcaccgaggg ttactcagtt ggttgcgtgc gatattatgg 8580 gccactgccg atatcttttt ggcatatggt cggagtatgc cgggtgaagc tgggcccacg 8640 gcggacgcaa aattttgaag ggtggcaacg ccagacatgc gatccacggc gccaaaagcg 8700 acgattgcta ttggccgccc gtttcgctct taaagggaat acgaggagag tggatcgcat 8760 gcccttggat atgcccttgg ccttcatttg ggccgttgac ccaaaaaaat gcagctgtgc 8820 agatgagcag caacggtgcc cgcctcgcaa atgtatgcgt gtgccgaaat tcggcaccga 8880 tggtgtggct gaaaaaaaat gtcgaggagt tcgatttgtt cagcggttaa cggcgggcgt 8940 gccagctcga agtgagcaaa aaagttggac attgcatttc tgcttgacag agtaaaattt 9000 tttcccttct gaaaaattaa gccggcacat aataccaatg gcttaacaga cgaataaaag 9060 ttttgttttt gtgttctttg ccatgtaata aatttatata aatcaaaccg ctcgaatcac 9120 aaagaaaata taataaaaaa attgaaagag acttaaaaat ttaagttgcg acgagagacg 9180 aaccaaataa gccaaaaatt aacggctccg gcaccaagga gaccggctcg cgcaccagag 9240 aggcccacga aatattcaag gccaggattc agtgcagcgg cgcaaaccag ccatatccgc 9300 gacgtccgcc agcaagcgtc ggcgcaaacg tgaaggcgac gagacgagcc accaacgacc 9360 gggctccaac aacgaagaag tcggtccaca ggcgtagacc ccggtcccac atccaccgta 9420 gtgagtccgt tccattgttc tggcccatcc ctcggttcga gcccctggca tgagttcaga 9480 agaattgact cctaattgca ctccctgaaa aggagtagat ttttttttat cgttaaattt 9540 ggttaatttg tatttataaa atatattcat acatacatag gcacttacgc cgagcttcac 9600 tttagagtgc gagcggtgag gaaaactcca acggtgatgc atctgcgcct gtaaaatata 9660 aaaaaaaaaa aaaatctaaa attaactctt gcgcgaacta tgctcacata cttacctaat 9720 ttgaatttac gcatttatat cgccccttac caccccccag agagctcttt aggtagggta 9780 aggtagaatt aagcgattta atacatacaa tacacaatac gtgacatacc cagttttaag 9840 taattgtgcg ggccattgtt cctaaatcgg ggcaacgatt gcctctaaag tgcctcattg 9900 cgcgtttctt ttgcggcgac tagccaatat ccttgtagtg tttcgtcgcc tgttccgttc 9960 agtgttgtaa aaaaaaaaaa aaaaaattaa ttctataaga taagagaaaa atttaggaat 10020 taatgcgctg tgtcaattag tataagtata tgcagcagta atacgcacct gtgaagttaa 10080 tgaattatcc gcttgcatga gatcctaaac agctgcaaga taaatgccat aagcaatgat 10140 tatgattgtt atacatttta agaaaacgca accaattata cgcacctatt gaaagaaatc 10200 atgtccggct aaaataatat tgctggggag agtgggcatt atggccgaca cgggctaata 10260 taggccattt ggtggccaaa atgagggaga cgagtggagg tgatcgtctc atggtagggc 10320 gggtacgctg ggtccggaac cggggcccga tctggaagag taaggtggat aaatagtgtt 10380 tttttttgtt aatttatatt gagtgtctta cctttgttgt ttacattttc atagttgtta 10440 cagctgtgga ttttgttcat gagttagtag ttccgaatat ttgttatcga ttatgcggtt 10500 cattttgagt agtgttgtgt gagtctattt tgctacagac gtgtagcttt gttgttatcg 10560 attgtgcgga ctgcgggcca gggatgggac cacacccaga gaatccgatg agccagccgg 10620 cgcggccact gagaaacaag cagttcccta aca 10653 // ID AF056940 standard; DNA; INV; 6868 BP. XX AC AF056940; XX DR FLYBASE; FBgn0013099; Dvir\Tv1. XX FT source AF056940:1..6868 FT SO_feature five_prime_LTR ; SO:0000425:1..452 FT SO_feature three_prime_LTR ; SO:0000426:6416..6868 FT SO_feature CDS ; SO:0000316:857..1642 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0028756; Dvir\Tv1\gag" FT /db_xref="SPTREMBL:O76325" FT /protein_id="AAC33317.1" FT /translation="MTMELSEQHLNQALSQLRQVPSFDGSTDQLNAFIKRIDYILHLYP FT TRDVRQHSILYGAIELQVTGDAQRISQRTAANTWQELRNALIEEYKVQTPFEELLRRLY FT NTNYQGNVRKFIEELENKSFVILNKLALENIPSNTTLYTNAMNNTIKDVITKKLPDRLF FT MMLARHDITSTQKLKQVAQREGLYENSVTEKPKNNNVQGNPNNNRRNMGNYQQNANPTT FT ISSGYSNTNQSYHAQNKQHNKSEDNQKAHHEFPTEIKSR" FT SO_feature CDS ; SO:0000316:<1416..4982 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0028755; Dvir\Tv1\pol" FT /db_xref="SPTREMBL:O76326" FT /protein_id="AAC33318.1" FT /translation="MKIVLLKNLKIIMFREIQITIVGTWEIISKMLTQPPFQVVIPTRI FT RVITHKINSIISPKTIRKLTTSFQQKLSQGRYQNPLNYQISPIHAECTKPSRLTSRDSN FT SGQIRIWDNIGKFLFSKPREPNRNKSIHKNWIMELTKFLKCVIDTGSTINLMKTNRLNF FT PVYNETLKVHTINGVIELKQSIRLGASKICPSKQKFYIHDFSEHYDVLIGREYLEACQA FT KIDYAQGSVTLGEFNFCFRYNDEEVEEDMTAQECLDPPSTEDRPFNFAINNELIENNEF FT RLEHLNSEEKEKIKKVLHEFVISSYHEGDNLTFTSYNLNTKFLTKHEDPIYKRSHTNIL FT QLSNEEVIPFSDLIKPIPNGLPVIIVPKRNDAFGKPKFRLVIDYRHFNELTINDKYPIP FT IMDEILDKLGKCQYFTTIDLAKGFHQIQMDPGSIPKTAFSTKHGHYEYTRMPFGLKNAP FT ATFQRCMNNLLEDLIFKDCLVHLDDIIIFSTSLEEHILSLQKVFKKLREANLKLQLDKC FT EFMRKETEFLGHIITTEGIKPNPNKIQAIVKFPIPKTPKEIKSFLGLCGFYRKFIPNFA FT NIVKPLTLKLKGSKINIKDRDYELAFEKLKVLITSDPILIYPNFEKPFSLTTDASNMAI FT GAVLSQEHKPICYASRTLNEHELNYSTIEKELLAIVWATKYFRSYLFGRQFQILSDHRP FT LVWLNNMKEPNMKLQRWKIKLNEFDFQIKYVPGKENYVADALSRIQLNENFLGEDTIST FT RATIHSAQEDNSNHLQITERPLNYYNRQIEFEKGTENETKVTNYFHKTNIKITYKDMTN FT THAKELIKEYLCTKKSVLYFHNEADFPIFQEAYLEIISPNNSTKAMKTSTKLIDLQTYA FT EFKELILKKHKELLHPGIEKTINWFKETHYFPDYQNLINECETCNIAKTEHRDTKLTFE FT ITPEIANIREKYVMDFYIVGDKQFLSCIDIYSKFASLIEIKSRDWLETKRAILQVFNQM FT GKPIEIKADKDSAFMCTALQLWLKSEAVNINITTSKNGISDVERFHKTVNEKLRIINSD FT SDVENKLTKFETILYTYNHKTKHKTTNRTPADIFIYAGTPEYDTQANKEKLINNLNKKR FT TNYEIDTRYKHSPLVKSKTTTPFKKTGELRQIDDKHFEETNRGRKITHYKTKFKKKKKT FT NQSKYNNYRSTTDSDQNIQAPA" FT SO_feature CDS ; SO:0000316:complement(5005..5292) FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044143; Dvir\Tv1\epigene" FT /db_xref="SPTREMBL:O76327" FT /protein_id="AAC33320.1" FT /translation="MLTKPLLLRLAIFKPFISLFNFSTRYSICIRLLNSLLCSIKLLKV FT SSVFVILTVKQWEFVICRNVYGSCFKYEISVMGFDRINFNTLCIYIVYSY" FT SO_feature CDS ; SO:0000316:5445..6110 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0028757; Dvir\Tv1\env" FT /db_xref="SPTREMBL:O76328" FT /protein_id="AAC33319.1" FT /translation="MNELDQEKQKGKKLDILIFNLQHFTEYIEDIEMGMQLTRLGIFNP FT KLLKHDYLLHVNSEKLLNTKTSTWFKSDTNEILIISHIPREIIKSPVFEIIPYPDENNN FT ILTEIAHEKYFTQDKKVYSRETKKLINNKCLTGILNQITSECSYTKILQNFQINYIEPN FT IILTWNLPKTILNHNCINNEITIEGNNIIKIFNCSLQINEISISNNMLDYTSKHLRRQ" XX CC Derived from AF056940 (Rel. 62, Last updated, Version 4). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 6868 BP; 2840 A; 1292 C; 1004 G; 1732 T; 0 other; agtgacatat ccatagtcgc acccaccctt taacataact tagcacataa gcataagcac 60 aattataaac atttctaata ttgtaaacaa agagcctggt gataacatcg acattggtca 120 agatcaaact gtcccccttt ggtagacata aacaattcac cctaaatatc acgccactgt 180 ggtagacata aacaattcac cctaaatatc acgccactgt caatgttccc atcagagtac 240 tatcccacgc ataattgctg acgcagctca aacttcactc aattttgact caaactctct 300 caaagcgaag tttgctaagc gaccgcaaca gttagataaa tcagttcata ttcgggaagc 360 aaatctgaat gtactcaaca aaagtagccg ttaagtgaaa ataataaatt gaaatatatt 420 tttttatagt aacctaacgt gtaaaactta attggcgcag ccggtaggat acgtcgggaa 480 aaattaaaga tcgctaaagg ataattaatt cccagatcta acgcacaatc gcgaccgaaa 540 gtgttgtcgg agttgtaata aaaattttcc tcgaatacgt atccgcacaa gaaccttccg 600 gtccctagtt agcttaataa tattgaccct ttgggcaata aagctactac agttaataac 660 gactggcttt gcatatcgtt tgatccctgc aatacgttat tactgagtct ttttaagaca 720 cacacacaaa aaaaaaaaac atttaaccaa taatccatca caaactttaa cctaaaataa 780 aagtgcaaaa tcaaagaaca ataaagttca aacctacaaa ctcaaaacta ttcaaaaaaa 840 agtgtattac accaagatga caatggaatt atcagaacaa cacctcaacc aggccctttc 900 acaacttaga caggtgccaa gctttgacgg atcaacggat caactaaacg ccttcattaa 960 aaggatagat tacatcctac acctgtatcc aacccgagat gttagacaac atagcatctt 1020 atacggagcc atcgaattgc aagtcacagg agatgctcag agaatatcac aaaggacagc 1080 agccaacact tggcaggagc tgagaaacgc attgattgaa gaatacaaag tgcagacacc 1140 atttgaagaa ctccttcgac gcttatacaa cacgaattac caaggaaacg tccgtaagtt 1200 catcgaggag ctcgagaata aatcttttgt tatattaaat aagttagcac tagaaaatat 1260 acctagcaat acaacccttt atacaaatgc tatgaataac acaatcaaag atgttattac 1320 aaaaaaatta ccagataggc ttttcatgat gttagccaga cacgacatta cgtcgacaca 1380 aaaactaaag caagtagctc aaagagaggg attatatgaa aatagtgtta ctgaaaaacc 1440 taaaaataat aatgttcagg gaaatccaaa taacaatcgt aggaacatgg gaaattatca 1500 gcaaaatgct aacccaacca ccatttcaag tggttattcc aacacgaatc agagttatca 1560 cgcacaaaat aaacagcata ataagtccga agacaatcag aaagctcacc acgagtttcc 1620 aacagaaatt aagtcaaggt agataccaaa atcccttaaa ctaccaaatc agtcccattc 1680 acgctgaatg caccaaaccc tcccgactaa cgtcaagaga tagtaacagt gggcaaatta 1740 gaatctggga taacatcggg aaatttttat tcagcaagcc tcgggaacca aacaggaaca 1800 aatccattca taaaaattgg attatggaat tgacgaagtt cttaaaatgt gtcatcgaca 1860 caggttcaac cataaatcta atgaaaacaa accgcttaaa ttttccagtt tacaacgaaa 1920 cattaaaggt tcacaccata aatggtgtta ttgaattaaa acaaagtata cgcttaggag 1980 caagcaaaat ttgtccgagc aaacaaaaat tctatattca cgacttctca gagcattatg 2040 atgttctgat tgggagagaa taccttgagg catgtcaagc aaaaatagac tatgcacaag 2100 gatccgtaac cttaggagaa tttaactttt gttttagata taacgacgaa gaggtcgaag 2160 aggacatgac cgcacaagag tgccttgatc caccctcaac agaggacaga ccatttaact 2220 tcgctattaa taatgaatta atagaaaaca atgagtttag gctagagcac ctaaactcag 2280 aagaaaaaga aaaaataaaa aaagttttac atgaatttgt gatatccagt taccatgaag 2340 gtgacaattt gactttcact agttacaatt taaacaccaa attcctaacc aaacatgagg 2400 accccattta caagcgttcc cataccaata tcctccaatt atcgaatgaa gaagttattc 2460 catttagcga tttgattaaa ccaattccta atggtttacc agtaataatt gttccaaaga 2520 gaaacgatgc atttgggaag ccaaagttcc ggttagtcat agattaccgc catttcaatg 2580 aactaactat taatgataaa taccctatcc cgatcatgga tgaaatatta gacaaacttg 2640 ggaaatgtca atatttcaca accattgatc tagccaaagg cttccaccaa atccaaatgg 2700 atcctggttc aatacccaag accgcatttt cgactaaaca tggccattac gagtacactc 2760 gcatgccatt tggtcttaag aatgcacctg caacgttcca acgttgtatg aacaatctct 2820 tagaagattt aatttttaag gattgtctgg tccacttaga cgacattatt attttttcca 2880 cttcattgga ggaacacatt ttgtcgctac aaaaggtatt taagaagctt agagaagcta 2940 acttaaaact acagttggat aaatgcgagt ttatgagaaa agaaactgaa tttctcggtc 3000 acataatcac aactgagggt ataaaaccaa atcccaacaa gatacaagca attgtcaaat 3060 tcccaattcc aaaaacccca aaagaaatta aatcatttct aggattatgt ggtttctata 3120 ggaaatttat accaaatttc gctaatatag taaaaccatt aacacttaaa ttaaaaggga 3180 gcaaaatcaa tataaaagac agagactacg aactagcgtt tgaaaagcta aaagtactca 3240 taacatccga cccaatttta atatacccta acttcgaaaa accattttca ttaactacag 3300 acgcgagtaa tatggcaata ggggccgtcc tttcgcaaga gcataagcct atatgctatg 3360 caagtagaac tctcaacgag catgagttaa attattcaac gattgaaaaa gaattattag 3420 caatcgtttg ggccactaaa tactttaggt cctatctctt tggtcgacaa tttcaaatcc 3480 ttagtgatca cagaccactc gtctggttaa ataatatgaa agaacctaac atgaaattac 3540 aaaggtggaa gatcaagcta aatgagtttg atttccaaat taaatacgtc ccaggaaaag 3600 aaaattatgt ggcagatgct ctgtccagaa ttcaattaaa cgaaaacttc ttaggcgaag 3660 atacaattag cacaagagca acaatacata gtgctcaaga agacaacagt aaccatttac 3720 aaatcacaga aagaccactt aattattata atcgacaaat agaatttgaa aaaggaaccg 3780 agaacgaaac aaaagttacg aattactttc acaaaaccaa cataaaaatt acgtacaaag 3840 acatgactaa tacgcatgcc aaagaattaa taaaagaata cttatgcaca aagaaaagtg 3900 tgctatactt ccacaacgaa gcagattttc caatttttca agaagcctac ttagaaataa 3960 taagtccgaa taattcaaca aaggcaatga aaactagtac aaaattaata gaccttcaaa 4020 catatgcaga attcaaagaa ttaatattaa aaaagcacaa agaattatta catccaggaa 4080 ttgaaaaaac tatcaattgg ttcaaagaaa cccactattt cccagattat caaaatctaa 4140 taaatgaatg tgagacttgc aacatcgcaa aaacagaaca cagagataca aaattaactt 4200 tcgaaataac tccggaaatc gctaatatcc gagaaaaata tgtaatggat ttttacatag 4260 taggagataa acaattttta tcttgcattg atatctattc aaaatttgca tcattaatag 4320 aaataaaaag tagagactgg ctagaaacca aacgagctat attacaagtg ttcaaccaaa 4380 tgggtaaacc catcgaaata aaggcagaca aagactcagc ttttatgtgc acagctttgc 4440 aattatggct aaaatcagaa gcagtaaata ttaatataac cactagtaaa aatggtatat 4500 ccgacgtaga acgatttcat aagacagtta acgaaaaact aagaatcatt aacagcgact 4560 cagacgtcga aaataaactt acaaaatttg aaactatact ctacacgtac aatcataaaa 4620 caaagcacaa aacgactaac agaacaccag cggacatatt catatatgca ggtacaccag 4680 agtatgacac acaagctaat aaagaaaaac taataaataa tctcaacaaa aaacgaacaa 4740 attatgaaat tgacactaga tacaaacact caccgttagt taaatcaaag acgacaaccc 4800 cgttcaagaa aacaggagaa ctaagacaaa tcgacgacaa acattttgag gaaactaaca 4860 gaggcaggaa aataacacat tacaaaacca aattcaaaaa gaaaaagaag actaatcaaa 4920 gcaaatataa caattacagg tcaacaacag acagcgatca aaacatacaa gcaccagctt 4980 aaattaatta ttataatatt actatcaata actgtaaact atataaatgc acagagtatt 5040 gaaattaatc ctatcaaagc ccataacgga tatctcatat ttaaaacagg aaccatagac 5100 attccgacag attacgaatt cccactgttt aacggttaat ataacaaaaa ccgaagaaac 5160 ttttaacaat ttaattgaac aaagtaaaga attcaataac cttatacaaa tcgaatacct 5220 agtagaaaaa ttaaacagag aaataaacgg cttaaaaata gccaaacgca acaaaagagg 5280 tttagttaac atagtaggaa cagcatataa atacttattt ggaacattag atcaagaaga 5340 caaagcagaa attgaacaaa aaataagcaa cttagcagaa aacagcgttc aggttaacga 5400 attaaattac ataatagaag ctataaaaca aggggatagg aatcatgaac gaactagatc 5460 aagagaaaca aaaagggaag aaattagata tccttatatt taacttacaa cactttacag 5520 aatatataga agacatcgag atgggaatgc agctcacaag attaggaatt tttaacccaa 5580 aattactaaa acatgactat ttgttgcatg ttaattctga gaaattgttg aatacaaaaa 5640 cttccacttg gtttaaatca gatacaaacg aaatactgat aatatcccat attcctcgag 5700 aaataataaa aagcccggta ttcgaaataa ttccataccc ggatgaaaat aataatattt 5760 tgacagaaat agctcatgaa aaatatttta cccaagataa aaaagtttat agcagagaaa 5820 caaaaaaatt aattaataat aaatgtttaa caggaatttt aaaccaaata acatcagaat 5880 gtagttatac taaaattttg caaaattttc aaatcaacta catagaacca aatataattt 5940 taacttggaa tttaccaaaa actatattaa accataattg tataaataac gaaataacaa 6000 tagaaggaaa caatataata aaaatcttta attgttcatt gcaaattaat gaaatttcaa 6060 tttcaaacaa tatgttagat tatacctcaa agcatttacg taggcaatga tgttataaaa 6120 ttagaaccac tttcgtttgt acaaacgaaa caaattatta tatgcacata caaaatttac 6180 caatgtattc cagataatta caataaccat atttgtaatc atattaataa gtttaacact 6240 gtattgacat ataagttcaa aagcatacct aagaaattaa ttataaaata taaaaaaccc 6300 aaagtagaag aaacacctac aataatagga gatataccaa ttctaaaaga agaaaatgtt 6360 acactatatc caaacttaaa cacctgagga caggcacttt tctaatggtt gggggagtga 6420 catatccata gtcgcaccca ccctttaaca taacttagca cataagcata agcacaatta 6480 taaacatttc taatattgta aacaaagagc ctggtgataa catcgacatt ggtcaagatc 6540 aaactgtccc cctttggtag acataaacaa ttcaccctaa atatcacgcc actgtggtag 6600 acataaacaa ttcaccctaa atatcacgcc actgtcaatg ttcccatcag agtactatcc 6660 cacgcataat tgctgacgca gctcaaactt cactcaattt tgactcaaac tctctcaaag 6720 cgaagtttgc taagcgaccg caacagttag ataaatcagt tcatattcgg gaagcaaatc 6780 tgaatgtact caacaaaagt agccgttaag tgaaaataat aaattgaaat atattttttt 6840 atagtaacct aacgtgtaaa acttaatt 6868 // ID AF009439 standard; DNA; INV; 2485 BP. XX AC AF009439; XX DR FLYBASE; FBgn0020675; Dvir\Tel. XX FT source AF009439:573..3057 FT SO_feature five_prime_LTR ; SO:0000425:1..431 FT SO_feature CDS ; SO:0000316:720..>2485 FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044144; Dvir\Tel\Tel1" FT /db_xref="SPTREMBL:O16111" FT /protein_id="AAB66824.1" FT /translation="MFDEGASANGNDEATSASQVAVGTQAAVFKIKIKNLTDRLNRLSS FT ELDPARLRDVDDYELQDYISMASDLQAKFEIVCDGLLEVDHASVDEDLQTSFESTIRQL FT RLSLQRERGNRSKVQQIPHCSTFNSAAADDSRSTFVVPNHSRLPQLKLPEFSGGYTEWA FT DFSNLFTTVIDKDPYLTNIEKLQHLRSCLKGTALDTIRSLEISNANYAAALELLDKRFN FT NKRLIFQAHISEILGLRKVDKGATAQLREFSDKLNSHLRALKSMGSVEQIAGCVIVHTL FT LQKLDSVTQASWEDDAPLDVIPSCERFTTFIERRCQRLENADHATAMYTPSSQVGQNNS FT SRRTFVVTRNGTSACVFCEVAGHSIYKCLQFANLSPLLRLHEAKRLALCLNCLQRGHQL FT RVCGSSACRVCGSKHHSLLHLGNTSSHIAASSPNNAQDTETYSSSQNTLAALLSSPLTT FT AQHLKHDVVLLATAVINVKNRAGSLVPCRALLDSGSQLHIITSRLAHQLQLRKFKSTAI FT VSGIGDAAFASDGFSVNINVKSRVSEYSTCIPALIAPSITDNQPGFTLDPASWNIPSNI FT QLADPEFEYQAY" XX CC Derived from AF009439 (Rel. 66, Last updated, Version 3). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 2485 BP; 632 A; 599 C; 556 G; 698 T; 0 other; TGTTTGGTCA AGTAACAGCA GAGATAACAC CGACGGCTGC GGACATCATC CAACAGCAGA 60 AAATAACAAT CATAATTACA GCCTAAGAGC AAATAGTTTT TTTTGCGCGC TCTTTGTAAG 120 CTGCTGTCCA CTCTCCTTCT GAATTGCTTT GCTTGCATTG CTGCCCTTCG CTCTAACCGG 180 CGAGCGTCTT ATTGGAGTTT CGTTTCGATT CGCTTTACAT ATTGTTATAA CCAGTCAGTC 240 TTTCGTTTTG ATCGCAACCT AGGCACAACT AATTCGAGTT GGCTGCCAAG CAACAATTTA 300 AAACTTATAA ATTCTAATAA GTGCCCTGCG CGGCAAATAA ATCCATTTGA AAACTAATCT 360 AAGTGTTTGA ATTTCGCTGC ACAAGCAGTT AGAATTAATT GGTTGCAAAA AAGCTTGCAA 420 CTACACATTT TGGTGACCCC GACGTGATTT TTTCTTTCTT TCATTTTTTC TATTTTTTTT 480 CTTGTACTTC GTTCTTGTCG TGTCAACGGA CAGTTATCAG TTCGGGCAAA GCTGTGTGGT 540 GAGCCGTTTG CTTGCGCATC TGCATACAGT TGGTTGTTAT ACATAAACGC CGTAGCAAAT 600 CCCCGCTAAT CGTTGCGCGC TATAACCCGC TATACGCTAT CTCGCTCGAA CGAACACTTG 660 CGCGATCAGA CTTGTATTGT AGTTCGGCTG TGTGAACCTT GTTGTGCGCT ACATTCATAA 720 TGTTTGATGA AGGTGCAAGT GCAAATGGAA ATGATGAGGC GACATCAGCT AGTCAAGTGG 780 CGGTTGGAAC CCAGGCTGCT GTTTTTAAAA TAAAGATCAA GAATTTAACT GATCGACTCA 840 ATAGATTGTC CTCGGAACTT GATCCCGCTC GACTTCGCGA TGTTGATGAC TACGAGCTGC 900 AAGATTACAT AAGCATGGCA TCTGACTTGC AGGCGAAATT TGAGATAGTC TGTGATGGTT 960 TGTTGGAGGT GGATCATGCC AGCGTTGATG AGGATCTTCA GACAAGTTTT GAGTCAACTA 1020 TTAGGCAGCT ACGGCTGTCC CTTCAACGCG AGCGCGGAAA TCGAAGCAAG GTTCAGCAGA 1080 TTCCGCATTG TTCCACCTTC AATTCAGCCG CAGCCGATGA CTCGCGTTCT ACCTTTGTTG 1140 TTCCAAACCA CTCTCGATTG CCTCAACTTA AATTGCCGGA GTTTAGTGGA GGCTACACAG 1200 AATGGGCCGA TTTCTCGAAC CTGTTCACCA CGGTCATTGA CAAGGATCCG TATTTGACCA 1260 ACATTGAAAA ACTCCAGCAT CTACGGTCAT GCCTTAAAGG AACAGCGCTG GATACAATTC 1320 GCTCATTGGA AATTTCAAAC GCAAATTATG CTGCCGCTTT AGAACTGCTT GATAAGCGTT 1380 TTAATAACAA GCGTCTTATT TTTCAGGCAC ACATCTCTGA AATTTTGGGT TTGAGAAAGG 1440 TGGACAAGGG CGCGACTGCA CAGCTGCGCG AATTTTCAGA TAAGCTCAAC TCTCATCTAC 1500 GTGCTTTAAA ATCGATGGGC AGTGTGGAAC AGATCGCCGG TTGCGTCATA GTACATACGT 1560 TGCTGCAAAA ACTAGATAGC GTTACGCAGG CTAGCTGGGA GGATGATGCG CCGTTGGACG 1620 TCATACCATC ATGCGAGCGG TTTACAACCT TCATAGAGAG GCGTTGCCAA AGGCTGGAAA 1680 ATGCGGATCA CGCTACGGCA ATGTACACGC CTAGCTCCCA GGTGGGCCAG AACAACAGTA 1740 GTAGAAGAAC GTTTGTAGTG ACTAGGAATG GAACGAGTGC TTGTGTGTTT TGTGAAGTCG 1800 CAGGCCACTC TATTTATAAA TGTTTGCAAT TCGCAAATTT ATCGCCCTTG CTGCGCCTTC 1860 ACGAAGCCAA GCGGCTTGCG CTGTGCCTAA ACTGCCTGCA AAGGGGACAT CAGCTGAGAG 1920 TCTGCGGCTC CAGCGCTTGC AGAGTTTGTG GAAGCAAACA TCATAGCTTG TTGCATCTTG 1980 GCAACACAAG CAGTCACATC GCTGCTTCTA GCCCAAACAA TGCTCAAGAT ACCGAAACTT 2040 ATTCGTCATC CCAAAACACC TTGGCGGCAC TTTTATCTTC GCCTCTCACT ACCGCCCAGC 2100 ATCTCAAGCA CGATGTGGTC CTGCTTGCCA CTGCCGTCAT CAACGTGAAA AATCGCGCTG 2160 GCTCCTTGGT GCCTTGCCGT GCGTTGCTCG ACTCTGGGTC GCAGTTGCAC ATCATCACCT 2220 CTCGTCTTGC TCATCAGCTC CAGCTGCGCA AATTCAAGTC AACAGCAATC GTCTCTGGCA 2280 TTGGTGATGC AGCATTTGCG TCCGATGGGT TTTCGGTCAA CATCAATGTC AAATCTCGAG 2340 TGTCGGAGTA CTCCACATGC ATCCCGGCCT TGATTGCACC ATCCATCACC GATAATCAGC 2400 CTGGCTTCAC TCTTGACCCT GCATCATGGA ACATTCCATC AAATATACAA CTAGCTGATC 2460 CTGAATTCGA ATATCAAGCT TATCA 2485 // ID DMTRAM standard; DNA; INV; 3452 BP. XX AC Y08905; XX DR FLYBASE; FBgn0005772; Dmir\TRAM. XX FT source Y08905:1..3452 FT SO_feature five_prime_LTR ; SO:0000425:1..372 FT SO_feature three_prime_LTR ; SO:0000426:3080..3452 XX CC Derived from Y08905 (Rel. 51, Last updated, Version 3). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 3452 BP; 883 A; 867 C; 793 G; 909 T; 0 other; tgttcggagc agaacccagc cgattagctg cttgccaaat agcacctaat tcttggccct 60 cagccgctta ttttgtttgt tacttatgtc tatgtcactc atttgtttgt taaagctttg 120 cgcttgcttg ccctgctaaa cgctctctgc cagctcgctc ttcgctatct ccgctttgcg 180 tctgcctacc gacgtcggcc gagcgaagct gcgcttagcg atcggagcgg caatgtaaag 240 ggcaggcaag ccacacttgc aatttggatg tcacgcatta aagaacatat cgtaatttta 300 tttctgcgcc gagttttgtt taattcgaaa taattagtcg gccgattggg gataaaaaac 360 attatctcca cataaaattt ggtgaccccg acgtgatctc tgagttgcag tgtcaattaa 420 taatcattcg cgatcgacat tcgttagccc tagcaaattt tcgtcggtta agcacaattc 480 ggtgctaaaa cacacttaca tacacctaca tacacgcatt gctggctatt aatttctctg 540 tcgcacaatt cggtcagtgc agtgcggtag gcagtgcagc gaactcacta atacacacaa 600 gcggagtaca aagcggaatc ggacagctcg cacagccgca cagctaaagg cattagctgt 660 aatccctttg ctttcggttc ccacgtttat acgtatacag ggtgtctttt tcggtcgtgc 720 tgagagactc ttttaaaacc tcaacatggc agcacctgag cctaccaatg tcgcgaacgc 780 agcaatgccg agtgatgtag atttctacaa gcacaaggcc gagtccatcg cgcgccaact 840 aaaggccatg gatcgctttc ttaccaagga agagcttgcc cgagttagat gaggcagaac 900 ttcaagctcg cttagagcaa atcgagcgaa tgaatgcgga tttcgatgcc gctcaaacga 960 gccttgaaag gctggatttc ctgcagttag cccatgatgc ccggctggac ttttcgaatg 1020 tttatgtcaa ggttaggtcc aggctgtcgc gggagttgat ggctgctcgc acggtaaatg 1080 ttgccaattg aacggctcgg catactctcg aggggaattc gtcgttgttc gcctataata 1140 gtataggccg ttctcgaatg cccgagttgc agcttccgcg attcggtggg aactacatgg 1200 attggccaga attccactcg atgttctcga caatggtgca caaagaccat cgtataccaa 1260 tcatcgaaaa attccaatat cttcgtggat gtctagatgg tgctgcgctg gatacgattc 1320 gttccttgga actttctgag gagaattacg acaaggcgtt gaatttacta atgttgcgat 1380 tcgataataa actgttacat tttcaggcac acgtcaaggc tattttcggg ctgcaagggg 1440 tggagaaggg ctcggctatc ggcttgcgcg cgctcaggca caaaatcaat tcgcacttgc 1500 gggcacttca gaccttggcg accccgcagg agatatccga tgggttgctg atcttcatca 1560 taggcacgaa actggaccac aaaacaaagg agaaatggga tgagaacttg ccgacgtcag 1620 gattgcctcg gtggtcaagc atggcctcat ttctggaagc gagatgtcgg atgctggaga 1680 atttgggatc agccatggca acaagtccta gtcaacaggt gggagaagac aaacctgtca 1740 cccttatcac ctccagtaac gaccatccta accccatatg taaccattgc aattcctccg 1800 agcattacat atctagatgt caggcattcc tgaatctctc tgcgtttgaa cgacacaaag 1860 aagcaaagaa gagccgcttg tgtttgaact gcctcaacaa aggccatgaa ttgcagaggt 1920 gcaggtcagg actttgcagg cattgccagg ccaaacatca cacgctactc cacattccat 1980 cgggaactgg tgcttcatct tcctcttcac cggccgagga atcgatccag caagaggccg 2040 cgactgtgct tctagcaagc gggtgttcca gccctccccc ctcgatacag aaatctcagc 2100 ctagccagaa cgtgttgcta cctactgccc tcgtccatgt aacagatcgt tatggagcac 2160 ttatctcatg tcgtgccatt ttggattctg catcacaggc aaactttgta acatctagac 2220 ttgctgatca gttgcagttg gatcatcgct cgtcttatgt tcacatctct ggaatcggag 2280 attccattct accttcgagc aagtctgtac atatagttgt acaatcccag gacgcaagct 2340 atcgagcttc cttcgctgca attgtcacca actcaattac ggaaatgcag cctaacttcg 2400 gcctagacgc aaaggattgg ccaatgccga ataatctaaa actagctgat cctaatttct 2460 ccaagcccca acgtatcgat ctgttgatag gttctggttt gttcttcgat ttaatgtgcg 2520 tcggacagat tcgactatca gcccaattgc caacattgca ggagacaaaa cttggttgga 2580 tagtatcagg aagcattgat agctcggaga ataagcgtgc agctttagcc gcttttgaaa 2640 attcctcgtg catctctatt gacgattttc gacccacaac gctggagtac caaaacttag 2700 agcagcaatg caggaagcag ctgctcgagt gccaggtgca agtggaaaaa ctgcgatcgg 2760 agaatcagga actgcagcgc gaacttttcc atatattaaa aacctacata tccacgctaa 2820 atgaaattca actttcaaca atttctacat tgccaaattt cctgccattc tatacaaata 2880 cagaggttcc cgatgatcaa gacgtaatca cgacaagccg ccttcgtaaa gaagcgccga 2940 tttctcgctt cgacgatcat tccagcgtag ccgccagctg cgcccaagca agcatcttca 3000 agagggccgt tggaaaatta gcggttctgc cccttcagga tggatctgtt gaaagccttt 3060 gccttccaac ggggggtgaa tgttcggagc agaacctagc cgattagctg cttgccaaat 3120 agcacctaat tcttggccct cagccgctta ttttgtttgt tacttatgtc tatgtcactc 3180 atttgtttgt taaagctttg cgctttcttg ccctgctaaa cgctctctgc cagctcgctc 3240 ttcgctatct ccgctttgcg tctgcctacc gacgtcggcc gagcgaagct gcgcttagcg 3300 atcggagcgg caatgtaaag ggcaggcaag ccacacttgc aatttggatg tcacgcatta 3360 aagaacatat cgtaatttta tttctgcgcc gagttttgtt taattcgaaa taattagtcg 3420 gccgattggg gataaaaaac attatctcca ca 3452 // ID DMTRIM standard; DNA; INV; 3111 BP. XX AC X59239; XX DR FLYBASE; FBgn0004642; Dmir\TRIM. XX FT source X59239:1..3111 FT SO_feature five_prime_LTR ; SO:0000425:1..792 FT SO_feature three_prime_LTR ; SO:0000426:3062..3111 FT SO_feature polyA_signal_sequence ; SO:0000551:308..313 FT SO_feature TATA_box ; SO:0000174:740..745 FT SO_feature primer_binding_site ; SO:0005850:795..811 FT SO_feature CDS ; SO:0000316:768..1286 FT /db_xref="FLYBASE:FBgn0063547; Dmir\TRIM\ORF0" FT /protein_id="CAA41923.1" FT /translation="FTRESSPDTWLQTKVVFIPKAGKPSHTAPKDFRPISLSSFLLKAM FT ELLLGLHLTACIPSSLISDSQHAYRKGRSTETALHSITSIIEASLNFKEYTLVAFLDIK FT GAFNNILPTAITGALTDLGVDSRTVSLIDQMLQCRTVEASLGTLTCTRFVSRGTPQGES FT SRPSYGTWQ" FT SO_feature CDS ; SO:0000316:1193..1444 FT /db_xref="FLYBASE:FBgn0063546; Dmir\TRIM\ORF1" FT /protein_id="CAA41924.1" FT /translation="GITGDANVYQICQQRHTAGRVLSPLLWNVAVNTLLREIEGGGCRV FT VAYADDVAIAFSGKFPQTLCECITSTLTKMSNGQTNAG" FT SO_feature CDS ; SO:0000316:1390..2955 FT /db_xref="FLYBASE:FBgn0063545; Dmir\TRIM\ORF2" FT /protein_id="CAA41925.1" FT /translation="VHYKYSHENVEWADKCGLGVNPSKTELVLFTRKYKVPVLIPPRLC FT GETLVFSNNAKYLGLILDRKLDWKLSIEDRVKKATVALYTCRKAIGLKWGMTPYIVRWL FT YTAIIRPIMLYGVVVWWPALDRRTCLNKLSRVQRMAELCITGGLRTTPGEALDTVLDLL FT PVDLMGKKVATLAALRMREARLWKASAVGHSGILMRLPQLPERTDYCIPSDHLSTPFQV FT SIPSREDWEMGEPGPANAVHFYTDGSKLDGRVGGGVYCSELEISHCFRLPDHCSVFQAE FT IEAIKEAISIVSKLRLDTHLVCVFSDSQAAIKALGSISSNSATVKDCRRSLHEIAEQLD FT LFLIWVPGHRDIEGNDAADELARQGTTIPLLSEREQVAMPLATCRLLTHELFEQNANRR FT WQQTVSCKVSRLICSYRSKKRSAELYRLSRAQCFAVTRAITGHWQIGTHASRLSIPHND FT FCRSCRDEEEEESVLHFFCHCPALGNRRLRILGAAFLADISGLSEIKPGTLSKYIQATG FT WDCP" FT SO_feature CDS ; SO:0000316:2624..3103 FT /db_xref="FLYBASE:FBgn0063544; Dmir\TRIM\ORF3" FT /protein_id="CAA41926.1" FT /translation="YAHIDRRSAQQSCTDSAEHSVLQLLEPLRDTGRSALMPPDCQSLI FT TTSAGVVATRRRRNRSSTSSAIAQPLEIVVFVFSGPLSLRTFRACPKSNQGPCPNTSKP FT PDGTALNPAVVCSHGSGNEKGHMPMRQHNGPSIGPSELGGGRLLSPPSAYRLNLT" FT SO_feature RR_tract ; SO:0000435:3050..3059 XX CC Derived from X59239 (Rel. 35, Last updated, Version 3). CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 3126 BP; 799 A; 873 C; 789 G; 665 T; 0 other; attctcggac cacaggtacg tggaaactgt cttctccttc tcagtaccga agccagtccg 60 attccggaac atcaggcgga caaactggac tcggtactct gattacctct gtcgtgttct 120 ccctgagccc ccatctgagg aggagttctc cacggaggcc acgactcgtc ttcttaagat 180 ctttaccgac gcctgcaata aggcgctcga caaagcatgc ctctctggca agaacagagg 240 ccggaggaaa cctgaatggt ggaatccgaa actgggtgaa ctccggaaag cctcccggag 300 actcttcaat aaagctaagg ctgaaaacgt tgagcaaaac tgggccgaat acaaggccag 360 tctgtcaacc tacaacaaag aacttagaaa agctaaacgc gcctcctggc gtaagttctg 420 cagcgaaatc gagagtaact cagaagcctc acgcttgcgc agagttctct cgaagacaac 480 acccaccctg ggctacttga agaacaccga ccagtcgtgg actacgtcca gcgaggagtc 540 gctaaatctt ctcctaaata cccacttccc tggctgcgat gaaaacagac ccaactacct 600 cgcgcctcct tctgtcgcct caaatgccat cctgagactg ctaagccagg agaacatttc 660 ctgggcgatc agaagcttta aaccctacaa gtccgcgggg ccagacggca tttttcctgc 720 ccaactgatt cacgcgggat ataaagccat taactggctc aaaataattt acgagggaat 780 cttctcctga cacctggctt cagaccaaag tcgtattcat acccaaggca ggcaagccct 840 cgcacactgc cccaaaagat ttcagaccca taagtctatc gtcttttctt ctaaaggcga 900 tggagctgct cctggggctg catctaacgg cgtgcattcc gtccagtctg atctcagact 960 cccagcatgc ctatcggaag ggaagatcca ccgaaacggc cctacactca atcacatcga 1020 tcatcgaagc gtcccttaat tttaaggagt acaccctagt agccttcctc gacattaaag 1080 gcgcctttaa caacatcctt ccgaccgcca tcacgggcgc actgacggat ctgggcgttg 1140 actccaggac ggtgagcctg atcgatcaga tgctacaatg caggacggtt gaggcatcac 1200 tggggacgct aacgtgtacc agatttgtca gcagaggcac accgcagggc gagtcctctc 1260 gcccctccta tggaacgtgg cagtgaacac actgctgcgg gagatagagg ggggtggctg 1320 ccgtgtggtg gcgtacgcgg acgatgttgc catagcattc tccggaaaat ttccgcagac 1380 gttgtgtgag tgcattacaa gtactctcac gaaaatgtcg aatgggcaga caaatgcggg 1440 ttaggcgtca acccgtctaa aacggaactg gtgcttttca ctaggaagta caaagttccg 1500 gtactgattc cgccaagact atgtggggaa acgctagtct tcagcaacaa cgccaagtat 1560 cttggcctaa tcctcgatag aaagctcgat tggaaattga gcatagagga tagagtaaag 1620 aaggccacag tggccctcta tacttgcagg aaagccatcg gactaaaatg gggaatgacc 1680 ccctatatag ttcggtggct ctacaccgcc atcatacgac caatcatgct ctatggagtg 1740 gtggtatggt ggccagcctt ggacagaagg acatgcctca ataaactcag cagagttcaa 1800 cgcatggcag agctatgcat aactggcggg ctacgcacta ctccagggga agccctggat 1860 actgtgctgg acctcctgcc tgtggatctc atgggaaaga aggtggcaac acttgccgcc 1920 ctcagaatga gagaagccag actgtggaaa gcatccgcgg ttgggcactc gggaatcctg 1980 atgagactcc cgcaattacc agagaggaca gattactgta tccccagtga tcacctctcg 2040 acgcccttcc aggtatcaat cccatctagg gaggactggg agatgggcga accaggacct 2100 gcaaatgcgg tccacttcta cactgatggc tcaaagctag acggccgcgt gggaggcgga 2160 gtctactgca gcgagctgga aatcagtcat tgcttcaggc tcccggacca ctgtagtgtg 2220 ttccaagcgg agattgaagc catcaaggag gccatttcta tagtctccaa actacgtcta 2280 gacacgcact tagtgtgcgt tttctcggac agccaagcgg ctattaaagc tctaggctca 2340 atatcgtcga actcagcgac tgttaaagac tgccgcagat ctctgcacga gatcgcagag 2400 cagttggatc tcttccttat atgggtcccc ggccacaggg acatcgaggg gaacgacgcc 2460 gccgacgagc tagccaggca gggtactacg atccctctcc tatcggagag ggagcaggta 2520 gcgatgccct tagctacgtg caggctccta acgcacgaat tgttcgagca aaatgccaat 2580 agaagatggc agcaaaccgt ctcctgtaaa gtctcaagat tgatatgctc atatcgatcg 2640 aagaagcgct cagcagagct gtacagactc agcagagcac agtgttttgc agttactcga 2700 gccattacgg gacactggca gatcggcact catgcctcca gactgtcaat ccctcataac 2760 gacttctgca ggagttgtcg cgacgaggag gaggaggaat cggtcctcca cttcttctgc 2820 cattgcccag cccttggaaa tcgtcgtctt cgtattctcg gggccgcttt ccttgcggac 2880 atttcgggcc tgtccgaaat caaaccaggg accttgtcca aatacatcca agccaccgga 2940 tgggactgcc cttaatcctg cagtagtctg ctctcacggt tcaggcaacg agaaggggca 3000 catgcccatg cggcaacaca acggaccttc tataggtcca agtgagcttg ggggcgggag 3060 gctcttatcc cccccctcgg cctaccgcct taacctaacc taacctagtc c 3111 // ID TV1 standard; DNA; INV; 1728 BP. XX AC Z49253; XX DR FLYBASE; FBgn0015678; Dvir\Paris. XX FT source Z49253:1..1730 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..242 FT SO_feature terminal_inverted_repeat ; SO:0000481:1489..1730 FT SO_feature CDS ; SO:0000316:394..1440 FT SO_feature start_codon ; SO:0000318:1..3 FT /protein_id="CAA89219.1" FT /db_xref="FLYBASE:FBgn0044149; Dvir\Paris\T" FT /db_xref="TrEMBL:Q27281" FT /translation="MPGQRINENVIQLVYFHYHKGKCAKELAEMFSIKLRTIYNIINR FT AEKENRLELKHSGGRPAKLSRRDHSKILKQINENPQTSLRQLALDLKNDCNKTVSHET FT VRKVLKMHKYSSQIARKKPLLSAVNIQRRLNFSITNVNKPAEYWDDVIFCDETKIMLY FT YHDGPSKVWRKPNTALEQKNIIPTVKFGKLSVMVWGCISSKGVGELRIFNDVMTKEFY FT LDILKNELSRSAIKFGFVDPQNPSKQRYKLYQDNDPKHKSFLCRTWLLYNCSKVIDTP FT AQSPDLNPIENLWAFLKKRVGKRSPTNKNALIKAIQEEWIKIPEIYDLHNLIQSMSRR FT LRAVMDANGQYTKY" XX CC Derived from Z49253. CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1730 BP; 607 A; 312 C; 313 G; 498 T; 0 other; CAGTTTGTCA AGTAATTGTT TGCAAAGTGA AAAACAGCAC ACAAATTCAT TTTTAGGCTT 60 ATATTTTACC GAATTAAACT ATTTCTTCTT CATTTTTTTT GCATATAATT ATTATTCTAA 120 GTATTCTTAA TAGATATAAG GCACAAACAT AACAAAATTC AAAGCGCAAA TGTAAAAAAA 180 ACAGTCAAAA CTAAAATTAG GCAAATATTG CGCTGTCAAT TAATTCTTTG CAAAATGCAA 240 ACACAATTCG AAATAGAGCC CAAGCTACCT AACGGTACTT CAAATTCGAA GAAATTTGAA 300 TGCATGAGTC ATTGGTATGC AGTGTGCGAA TGCCTTCTAA ACGATTTTGC GATTTTAACA 360 GTTCACATCT TGCCTAATTG AGAGCTTCTT ACAATGCCTG GCCAACGAAT TAATGAAAAC 420 GTTATTCAAT TAGTTTATTT CCACTACCAC AAAGGAAAAT GTGCTAAGGA ATTGGCGGAA 480 ATGTTCTCAA TTAAGCTAAG AACTATTTAT AATATTATTA ACCGCGCCGA AAAAGAAAAC 540 AGATTGGAAT TAAAGCACTC CGGAGGCAGG CCAGCAAAGC TCAGTAGACG CGATCATTCA 600 AAAATTTTAA AACAGATTAA TGAAAATCCT CAAACTAGCC TCAGACAGTT AGCATTGGAC 660 TTGAAAAATG ACTGCAATAA GACAGTGAGC CACGAGACAG TTCGTAAAGT TTTAAAAATG 720 CATAAATACT CCTCGCAAAT AGCTAGAAAG AAGCCACTAC TATCTGCAGT AAATATTCAG 780 AGGCGGCTTA ATTTCTCTAT TACGAATGTC AACAAGCCCG CGGAGTACTG GGATGACGTT 840 ATTTTTTGCG ACGAGACTAA AATAATGTTA TACTATCATG ACGGACCCAG CAAAGTTTGG 900 AGAAAGCCCA ACACAGCTCT GGAACAAAAG AATATAATAC CAACAGTAAA GTTTGGCAAG 960 CTGTCTGTAA TGGTGTGGGG CTGCATATCG TCGAAAGGTG TTGGTGAGCT GCGCATTTTC 1020 AACGATGTGA TGACGAAGGA ATTTTACTTG GACATTCTGA AGAATGAATT GTCAAGGAGC 1080 GCAATAAAGT TTGGCTTCGT AGACCCCCAA AACCCCAGTA AACAGAGGTA CAAGCTTTAT 1140 CAGGACAACG ACCCCAAGCA CAAATCGTTT TTGTGCAGGA CTTGGCTGCT GTATAATTGC 1200 AGCAAAGTCA TTGACACCCC TGCCCAGAGT CCTGACTTAA ACCCGATTGA AAATTTGTGG 1260 GCTTTCCTAA AGAAGCGCGT CGGGAAACGG AGCCCAACTA ACAAAAACGC TCTCATCAAG 1320 GCCATTCAAG AAGAATGGAT AAAAATACCC GAAATATATG ACCTGCATAA CCTTATTCAG 1380 TCAATGTCTC GTCGTCTACG GGCTGTAATG GATGCTAATG GCCAATATAC CAAATATTAG 1440 TACCACAATA GCTATAATTT CCTAGATTTA GTTATGTATA CAGTACTAGT TTGCATTTTG 1500 CAAAGAATTA ATTGACAGCG CAATATTTGC CTAATTTTAG TTTTGACTGT TTTTTTTACA 1560 TTTGCGCTTT GAATTTTGTT ATGTTTGTGC CTTATATCTA TTAAGAATAC TTAGAATAAT 1620 AATTATATGC AAAAAAAATG AAGAAGAAAT AGTTTAATTC GGTAAAATAT AAGCCTAAAA 1680 ATGAATTTGT GTGCTGTTTT TCACTTTGCA AACAATTACT TGACAAACTG 1730 // ID SPOCK standard; DNA; INV; 4952 BP. XX AC AY144571; XX DR FLYBASE; FBgn0015678; Dmir\spock. XX FT source AY144571:1..4952 XX CC Derived from AY144571. CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4952 BP; 1554 A; 1113 C; 1113 G; 1172 T; 0 other; CAAAAAACGT TGTTCAACAC TGGTATGGCG CGTACCTAAA ATTGTTTGCG GAAGGAAAAA 60 ATTAACATAT TTCAAGACGT TCGCTGATAG TTTTAATCAA AAAGACTTTT TCCGAAGCTT 120 CTGGGGGTGC CTCACACGCC TACGAAGACG GCGCGAAGAA ATTCCCAAAG CTTAATCAAT 180 TAAGTCGAAA AACCTGGAAT TTACTGCATG GCGTCCCGGA ATAACTCTTA AAACGGCTCT 240 TGGAACTCCG TCTGGACCCT GAGAAGCATG GCAAGCATAG GAAAGACAAG GATGGCCACG 300 GAGAAGGCGA CGGGAGTTAA AATTTGCATG GAAACGCTGA AAAAAAGGGA AGGAGAAGCT 360 AAATCCGCCA TGCGGGTAGG AGTAAAGATG GCGGCAAAAA CGGCTCCGCA GCTTTTGGCT 420 TCTAAACTGC TGTGCAAGAA ACCAAACTTG TCGACGAACG TGTCGACAAA CATGTCCACG 480 AACACGACAA CAAGCATGAC ATCAAACATT TCTACGACGA AAGAGAGAGA GAAGACTGAC 540 GCGGTTTGGT GGCTTCCGTG AGTACGAAAA ACGGAAGTAA CCGCAAAATC ATCGTTAACA 600 AGTATTATGA GGGGCATAAA GGCCCATACA TTGTTTGTAT CGACAAAATG AGTGCGAATA 660 ATGCCAGAAT TGCTATAAAC CAGTTTGATC TATCCGATCT GCTGCTGAAA CAGGGCATTA 720 GCGATATCGA GGAGGTTGCT AGAATGGGCT ATGGCAGGTG CAAGATTGTG TGCGGGTCGG 780 CGGGAATATG CAAACGGGCT AGTGGAAAAG AGTGTTTTCG CTGCTCAAGG GTACTCAACC 840 CGAATCTTTA GGCACTTCGT CCAAAAAGCA GGGTTAATTT TTGGGTTCCC CGTGCACTAC 900 TCGGAGGAAC AGCTAGCGGA GCTTGTAACC TCGGATATCC CAATTGAGGA GATAGTTCGG 960 ATTACGCGAA GGGATAGAGC ATCGGGGGAA CGTGTACCTA CTGGCAGGGT GAAACTCTGG 1020 TTCAGAGGAG AGCAGTTACC AACCAAAATC AATTTTCTAT ATTCTAAAGT AGAAGTGAAA 1080 CCTTTTGTTC AGCTTATTCA ATGTTTCAGG TGTTTTAGGT TTGGCCATCT GGCGCAACAT 1140 TGCAAGAGTG GAGCGAAATG TTTCAAATGC GGACAAGATC GCAGCAAAGA TATTATCTGC 1200 TCAGGGCTGG TCTGTGCTAA CTGCAAGGGG GAACATGCTG CCACCCACCC GCAGTGTGAG 1260 GCCAGGAAAA AAGCGTACGC CATTCAAAAG TGTATGACAC TGGAAAACCT AACTAGAGCT 1320 GAGGTCAAGG CTAGATATCC GATCCTTTTT GATAGAACTT CCCCCAACCT AAGGGATCAA 1380 GGGGAATTTC CTAGACCCAG TTGGCAATCT AACCGCTCTT AGAGTCCGCC AGGGAAATTC 1440 ATTCTGTCAC CTCTTTCGCG GAAGTGACCA AATCCAATAG GGTTGCCGCC CGGAACGCAG 1500 AGGCGGCCAA GGCAGCAGCC AACATGGCAG AGTGGAAGAA CTCAATAAAC TCAGACTGCG 1560 TCCAAATGAG CGAGAAATTT GTTAGATCTT CATCCGTGGC ATCTTCTCTG GCCCAGGATA 1620 AGCTAAACGC ACTAGGAGCC AAACTCACAG CGTTCCTGAT TGATCCAAAC AACATCATTA 1680 AAGGGGATTC TTTAGCAAAT AAATTTATTA AGGAAATTAG AGAATTTGTC AATGCTTCCA 1740 ACGCAGAACT AGATAGGCGT CAGATTGCCC AAGGGATGGC AGTTGGTTCT CAAAACATCC 1800 AACAATGAAG ATTCTTCAGC TAAATATTCA AAGTCTGACA CACAACAACA ACAAACAGCT 1860 ACTCTCAATG TTCCTAGACC AGAACGCAAT AGATATTGCC ATCCTCTCAG AGATTTGGGT 1920 GTTGAACAAC GCGGACTGCA GCATTTTAGA CTATAACTTT TTCTGCATGC CCAGAGAGGA 1980 CGGCTACGGC GGGGTTGGCA TCTACGTCAG GAAAAACATT CAATTCAGCA TTTTAAAAAT 2040 AGAGAACGAC TTGGAAATTT TAGGAATAAA AACATTAAAC CTCAAGAGCA ATTTCAATAT 2100 TTTTAGCATA TATGCACCGC CATCAACAAC AGTTTCACAG TTTAAATCGG GCACAAAAAA 2160 GTTTTTGGAA TTCGCGGCTT CGCTTGACAT ACCCTCAATA GTGGGCGGGG ACTTCAACGC 2220 TAGGTCTTCT ATCTGGGGAA GTCCGATCTG TGATCGCAAT GGTCGCAGCA TTGAGAATTC 2280 CACCCGGGAG GCCGGATTTA TCTGCCTAAA CGACGGTTCG CCAACCTTCT CCAGGAGCCC 2340 ATCTCAATTC TCCGTTCTTG ACCTTTCTTT TTCGAACTAC AGAAATGGAA CTATTAACTG 2400 GCAAGTCCTC AAAACAAAAA TTACGAACAG CAACCACTTC CCAATTGTAT TATCAGTGCA 2460 AGGCCTAAAT GCCCCCCTCC AAGGCAAGAA GATCCTTCAC AACAGAATAG CCCGGATTCT 2520 GGGTGCAAGT GATTTCTCAA ATCTGGAGGA GCTCAATGAA AAAGTGAAAT CCCTGACCTT 2580 AAAAAACACG GTCACATTCC CACGCAAGAA CAAGTACGTT GCTAAGAAAT GGTGGAACGA 2640 GGAAACGGAA AAACTTTTTC GCGTGAGAAA CGCTTGCAGG CAAAAATTCT ACCTCACCAA 2700 AAAATTGGCT GATGCCAGAG CAACCTTAGA AGCTGACAAA AATCTACAAA ATCACGTAAA 2760 AAATCTTCGA AAGCGCAACT TTTCCAAGTT TATAGAAGAG GTCTCCTCAT CTCCCTCTAT 2820 GTGGAGCAGG GTGAAGAACA TCAAAAAATA CGGTCAAATC AAAAATGTAG GGAACAAATG 2880 GACGAACGAA GATGACAAAA ACTTTCTCAA TATGGTTAAA GTAACCCCAT CCACGTCAAA 2940 CAGTAGGCCC CCTAAGAAAA TTCCTGCCCC TTTGTAAGGA CCAGACGACC CGGAACCCCT 3000 GGAACTGGAA GAATTCCTGG CCTTCCTAAA AACCAGGAAC CCCAATTCGG CTAGAGGCCC 3060 TGATGGCATC TCGTTCAGGA TGCTTCAAAA GCTACCAAGT GGGAAAAACA AGACATTTTT 3120 GAAATTATAA ATAAAGTATG GCTGAGTGGC GTCATCCCTG AAGTCTGGCG CAGGATAAAA 3180 GTGGTACCCA TCCCCAAGAA GAATGCAGAC CCTACTTCCT TCCAAAACCA TAGGCCGATT 3240 TGTTTGATTA ACACGCTGTT TAAATCCATA GAGGGGTTGA TAAAGTTGAA GTTGGACGAG 3300 CACATAGCAA ACTGTGACCA GCTGCCGGTC AGGTCCTATG CCTTAAGGAG AAACAGGTCC 3360 ACGGCTCTTT GCATCAACGA CCTTATCAAC AAAGTTCTGG CGCTGAAGGC GAGGGGTTGT 3420 CAGGTTGTCG CAGCTTGCTG GATCTCCAGC TTCCTTAACA GACGAATTCT CATAAAGGGC 3480 AAAAGCGAGG TAGAGGTGAA CAGAGGTGTT AGCCAGGGTA GCTGTCTCAG CCCAACCCTT 3540 TTTAACCTTT ACACAGCCGA ACTACATGAT ATTAACAGTC ATAACTGCTT ACTGTTTCAG 3600 TATGCTGATG ACTTTTTCAT AGTTTGTTTT CATAAAGACG TATCCATTGC GAAAGGTGTT 3660 CTGGAAGACA GTATAGATAA GTTCGAAGAA AAATGCAATA GGCTAAACCT GTCATTCAAT 3720 CCGGTGAAGT CTAAGTGATT TACTTCAACA ATCGTAGAGC TGCGCTAAAT ATCTCGCATC 3780 AAGGAGTTGA GATCAGACAA GTAGAGCAGA TCAAATACCT AGGCAGGACT ATCTCAGCAA 3840 ACAACTCAGC CTCTGGTCAC ATTGATCTGG CCATCTCGGA ATCGAACAGA AACTGTGCGT 3900 TTCTCAGTGG GTGTCACTAT GGTATCAGCC CCAAAAGAGG GCTTATATTT TACAAGGCAT 3960 TTGTCAGAAG TAGGCTAGAG TACGCGTGCT CATCATTCTG CAACCTGTCC AAATCTGCCT 4020 TGGCCAAAAT CAAATCCCAC TGCAACCATC ACCTCAGAAA GTCGCTGGGT CTGATAATCT 4080 GCACTCTCGT TCCTATCATA TATCACATGG CAGGAGAACT TCCCCCGGAC TACAGGATGA 4140 GATTCCTGGC GGCGAAAGAG TTGGTTAAGG TGTTTGCATT TAATCTCCCA GCAAGTGAAA 4200 CAGTGTCAGC AAATAGGGAT TTAAACACGG GCTACGCTAA GGCATATCGG GAGTTTGGCA 4260 GCATAATAGA TAGAGTTGAG GTTTTCGAGA ACACAGCTTC ATTTTGCACT AAAATTAGCT 4320 CCGACTTAGA ATTTTTCAAG GGTTCTGCAC CCAACAAGCA CGCTATAGAC CAGGCAGGTA 4380 TCATGCTGCT CCTCGGGGAA AAGAAGGATC AGCTGACAAA GGGTGGTTTT GAAATCTATT 4440 ACACGGATGG GTCAGTCGCA GATGGGCATT CCAGCGCCGC TTTCCTGCAC GATAGGACAG 4500 GTTTTTTAGA TAGCTATTAC ACGCACAAAA CCCTTTCCTC GTTGTCCGCC GAGCTCCTTG 4560 CTATCGAGAA GGCATGAGAC CATGCTATTT TGTACAATTT TCCTCGTGTC GCGCTCCTGA 4620 CAGACAGCAG AAACGGGGCC CTCATTCTGG CGAAGAACAT TCAGGATAAC TCTATCGCCT 4680 ACAAAATCAG AGAAAAGATC CAACAAAATC CACATCTAAG AACTGTAGAG GTTCACTACA 4740 TTCCCGGACA CGCTACCAAA CAAACGCTGC GTACCAAACA CTAATTGACT TCATCGACGA 4800 AATTGATGTA AAGCTATAAA CAGTACTCCA ACATCTTCTA TTCTTTGACT TGGCAATTTC 4860 CACCTTCAAA AGGCGGGCGG CCAAATAATC CCACGCTACT CACGGGTAGT TTGGTAACTT 4920 AAATCCTAAA AAAAAAAAAA AAAAAAAGGT TT 4952 // ID WORF standard; DNA; INV; 4174 BP. XX AC AY144572; XX DR FLYBASE; FBgn0064494; Dmir\worf. XX FT source AY144572:1..4174 XX CC Derived from AY144572. CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 4174 BP; 1010 A; 1004 C; 752 G; 1408 T; 0 other; TTTTCTTTTG TGTCTCTGCT TTGGTGAGTT TACCGCTCGT TTTCTGCACT TCGTTGGATC 60 ATCTGCTTTG AATTATTGCC GCTGCAGCTG ATTGTCTGGC TCGCTCTGCT GTCTGCTCTC 120 CTACTTTACC TGCGATCTCA CTCCGTTCTC TTATTATTCG TGCACAGTGA TTCTACTGCT 180 GTCAATAATG GCAATCGTTT GTAGTGCAAA GAAATGTGAA TTCGGTGGTG TTATCATTGG 240 TGATTCATTT CTTAGCTGCT GGCTGTGCGA CAATTTTGCC CACATAAAAT GTGCTGGTGG 300 TGGAAATTTT GGCCGGCTGA ATGATCTGAT CTCGAAACGC ATGGGCCTGT CTTGGTCATG 360 TCTGGCTTGT CGGGAGATTG AGGCCGAAAT GCGCACATTT ATGAGACAGA CTCGAACTGG 420 GTTTCTGGAT GTCCGGAAAC AATTTGTGGC TCTCAACGAG AAATTTCTTG CCCTTGAATC 480 ACAGTTTCTT GGGCTCAAAC TCTTGAGCGA ATCGCCTAGA CGGAAAATTC CGCATAATGA 540 TACCAATCTC TTGCAACCTA ATCCGATTGG TACGCCTGCC TCCCACCTTC ATCCATCGGA 600 AGCATTTCCG TGTAGGCATG CCACGCCGTT GTCTGTAAAT CCGCCAACGG TGGCTCCGGA 660 GTTCTTTACA CCGAGCAATG TGCTTCCTTC GAGCACCCAA CTCCCCAACA GCATCAATCC 720 CATCCCTGCA GTTTCTGTGG CCGTCACTTC TGACGCGGTG AGTGCTCCTA GTGCGGGTGT 780 GCTGGTTGCT GACGACGTCT TGCCAATTCC TGCTCCAATT CCTGCTCCAA TTCCTGCTGC 840 AATTCCTACT CCAATTCCTG CTACAGCCAT TGCTGCCAAT TCCTCTGCCC TTGTACCGAG 900 ATCTCTTTTA GGTGTGGCTC CACCATCTCG ATCTCGCTCG GGACTTCAAC TTAAAGCAGT 960 GGTCCCTCGG AAAGCAATAT TTGTTTCTCG TCTTATTCCT GAGGCTACAA CGGAGGATGT 1020 TAAACAACAT CTTGCCACTA AACTTAACAC TTCGCCTGTT GATATAGTTG TGACTAAATT 1080 TTCATTTAAA CATAAGCGCA ACATATCATC ATTTAAAATT CTTCTCCCTG ATTCTTTACT 1140 ATCCTGTTCT CTAGAACCGT CAATATGGCC TGAGCATACA ATTGTGCATG AGTTCCTTCT 1200 CAAAGATTCG AACTCGAATC CAAGAATTAC CGAACATGCT CCAAAAAACT AATGTACTAT 1260 TTTTCCATGC ACTATCAGAA CGTTCGCAGT TTGCTGGGAA AGTTGCGTCA AATTCATACT 1320 AATAGCGCGT CCTTTGATTT CGATGCCATC GCGTTTACCG AAACCTGGCT TAACTCCTCT 1380 GTCAATGATC ACGAAATTTT CATTGATAGT TACACTATTT ATAGAATGGA CCGCCCATCT 1440 TTTGCAGGTG GGGTTCTGAT TGCAGTTAAA TCTGTTTTCT CATCTGAGTT ATTCCCATTC 1500 AATAACATTC ATGGAATTGA ATTTGTTGCA GTCAAAGTTC GTGTTGGCTC CGCATTTTTC 1560 TATTTAACCT GCTCTTACAT TCCTCCCAGG TCTGATGCTG AGCTTTACTT ACACCACCTT 1620 TCAGCAATTA ATAATGTTGT TTCTACATTA GGGTGCAATG ATCGAATTAT TGTCATGGGG 1680 GACTTTAACC TCCCATTTCT TTCTTGGCTG CCTTGTAATG ACGCTAACCT GTTGTTTCCC 1740 AATTGCCATA ATGACTTTAT CAATGGGCTA ACGGATATTT CCCTTGTCCA AATTAATGCC 1800 GTTAAAAACA TTAGGGACAG ACTTCTTGAC CTCGTTTTTG TTAACGATGG TTCTCTTACT 1860 ACTGTATCTA GAGCAAGCCC AATATCTCTC CCTGAAGATC CTTACCATCC AACTTTGTTG 1920 ATATCACTGG AGTGTACACA ATCTGGAGGT GCTGACAGTT CCATGGCTCC TTGTCACATA 1980 AAGTGTTTTC GCAAAACAAA CTTTATTGAT TTAGATCTAC ATCTCTCGCG TGTTGATTGG 2040 TCGTTTCTTT ATTCATTACC GAACATAGAT GCAACTGTTA ACTCGTTTTA TAGTTCTATT 2100 TACTCCGCCC TTAATACGTT TGTTCCAGAT ATCACTGTAC CTGTATCGTC CAAGCCTCCC 2160 TGGTTCTCCA AGTATTTGTC ATACTTAAAA AATAATAAAT CCCGGCTCTA TAAAAAGTAC 2220 CAGAAGTCGG GCTCCACGTT GGCCTTAGCT CTATACTCCT CCGCTCGCTC TCTGTTCCTT 2280 GCCGTCAATA GTCAGTGTTA TAATCATTAC CTTTCACAAT GTAGTAGTAA TTTTCGTAGT 2340 GATCCTAAAA AATTCTATTC GTTCGTTAAT TCTAAGCGCA AGTCAAACGT TTTTCCTCCG 2400 TCCCTTCATT ACCAGAACAA AACAGAAGCT TCTGCTGTTG GTATTGCAAA TTTATTCGCT 2460 AACTTTTTCC AAACAACGTA CTCGTCTCAT ATTTACAACG CATCCACTCC GTATCCGTAC 2520 CAGCTACCTC AAGCTAACAG TATTTTTCTG CCCTTTTTCG AGGAAAGCGT TGTTCTTGAA 2580 GGTTTGTCAT CTATGGACAT ATCGTTTTCT GCGGGTCCAG ATAAGGTACC AAGTTGCATC 2640 TTAAAACACT GTGCCCAGTC CCTTTGCAAG CCCTTGACCT TTCTCTTCAA CCTCTCCTTG 2700 GAACAGTCTT GTCTCCCAGT AATTTGGAAT GAGTCCTACA TCATTCCGCT TCACAAAAAA 2760 GGTTCAAGAT CAAACATTGA AAACTATCGT GGTATCGCAA AGCTTTCCGC CATCCCGAAG 2820 CTTCTTGAGT TCCTGGTCAC CCGGCAACTG CAACATCTTT GTTGCAGCTT GATATCTCCG 2880 TCGCAACACG GTTTTTTCAG ACATCGTTCA ACATCGACTA ACCTTCTTGA GTTTTCTAAC 2940 CTAATCCACC GTGGTTTTCA AATTGGTTTG CAGACGGATG TAGTTTTTAC GGACTTCAGC 3000 AAGGCATTCG ATTCTGTGAA CCATGCTCTG CTTATTCAAA AGCTCTCCTT ATTAGGGTTC 3060 CCAACGAATC TTCTAGATTG GATTTTGTCC TATCTCTCTA ACCGTACTCA ACGTGTTTTG 3120 TTTTCTAACG TGTTGTCAAA TACTGTTAAT GTTACTTCAG GTGTGCCACA GGGAAGTCAT 3180 CTGGGCCCGC TTCTGTTTAT TTTATTTGTG AACGACCTTC CTCAAGTTAT AACATACTCT 3240 ACTACACTAA TGTATGCTGA TGATGTCAAA ATCTGTCTTT CTTACTCTGA TTGGTATTTG 3300 CACACACGCC TTCAACTTGA TCTAAGTGAA CTACTATTGT GGTGTTCAAC TAATCTTCTT 3360 TTTCTGAACC TTTCCAAATG CAAACTTATG ACATTTTACC GTCGCGCTCC TCATTTTGTC 3420 TCATATGTTC TAGGAAATCA TGTCCTTGAG CGAATTTCGA GTTCAAATGA CCTCGGAGTC 3480 CTTTTTGATC ATAAGATGTG TTTCAACACC CATATAGCTG CAACTGTAAA TAAAGCTAAG 3540 GGTGTTTTAG CGTTCATCAA GCGTTGGTCC AAGGAGTTTG ACGACCCGTA CGTTACGAAA 3600 CAATTGTACA TCTCGTTAGT ACGTCCTATA TTGGAGTATT GTTCTTGTGT GTGGAGCCCG 3660 CAGTATAAAG AGCAGCAGGC TGTTATTGAA TCCGTGCAAA AGCAATTTTT AATTTTTGCC 3720 CTTCGGAACT TTAACTGGGA CTCGGGTAGA ATCTTGCCAC CCTACCGGTC TAGGCTAAAT 3780 CTTATTGACC TGCCGTCGTT GCACCATCGC AGAATATGCA ATGGCGTAAT GTTCGTGCAC 3840 AAGCTCCTTC TTGGGACTGT TGACTCCCAA ACTCTCTTGG GTCAGATTGA CTTGGCCGTT 3900 CCATCCAGAC CTACCCGTAC TTTTAGGCCT ATCCGTCTAC CCATATGTAG GTCTAATTAT 3960 GCTGATCATG AACCTTTTAG GGTTTTATGC CATAATTATA ACTCCCTCTG TCTAACCCTA 4020 TCCCCTGAAC TGTCTCTTAA ACTAATTGCA TGCAATATTT ATAATCATTT AAATTTAGCT 4080 AACTTTTAAC TTATCTTTTT CTGCCTTTAG TAACTAAGTA CCATTATTAA CAGATAATTT 4140 GTTAATTTAA TAAATAAATA AATAAATAAA TAAA 4174 // ID VEGE standard; DNA; INV; 884 BP. XX AC AF518730; XX DR FLYBASE; FBgn0066140; Dwil\Vege. XX FT source AF518730:1..884 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..12 FT SO_feature terminal_inverted_repeat ; SO:0000481:873..884 XX CC Derived from AF518730. CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 884 BP; 292 A; 151 C; 133 G; 308 T; 0 other; CAGTGTTGCC AACTTTTGTC GACCAAAAAA GCTGTGTTTG ACGGGAGAAA ATTAAAAAAG 60 AGCTTTTATT TTTACAAAAT GTTTAGTATT TGATCTAAAT CATCACAATA AATTTCCTTG 120 TCGTACGATC CCATAATGCC TTTGTTTGAC AGACGCAGAC CGTATCTATA AATATATACA 180 TTTCATTATT AACTTTTCAA TATTAGTTTT AAATATTAAT TTACCTGACG CATAAAATTG 240 AGTTAAGCAT TTGAGTGTTT ATTTTGTTTC GTAATTTGGT TTTTACGAGA TTGACTTGGC 300 TGAAAATTCT GAAAACAATG TCATTGCAAA AGTGGCCAAG TCCCCAAATT TATTATTTCA 360 ACCCGAATAC TTATGTATGT ATGTATGATC GCACTTCAAT TCAAAACTTT GTAGTGTCGT 420 TTTAAATTCG TCGCTATTAA GCCTCACTTT ACGGCACCTT TGCTTATATT CCTTTGGCAT 480 TTTGCAGACT TAATGGACTT AATGGCACAA TTTTGAAATC ACAAAGTTCA ATAATCGTTT 540 CGTTCACAAT AAGATCAATT GAACTATTTG AGCGAACTAA GCTTCTAGTT CAACTAATCA 600 TATTCAAGTT CAGTTGACTC CGCTCTATTA TCGATAAGTT CGCTTATCAA AAATTGAACT 660 ACTGAAATAT ATCGGTTTAA TTCAGCTACA ATGAACAACT ATTATTGTTA GTACGATTTA 720 TCGCTAAAAT ATTTTAAATT TCTTAGAAAA TGAAAGAGCG GAAAAAAGGC CTTTCATAAA 780 AAAAAATCCA GAATTCCCGC TGCCCGTTGT TTCGAAATTT TTCCCGCAAA TGCATTATCA 840 AAAAAGCCAG ATTTGACGGG AAAAAAGCTG ATTTGGCAAC ACTG 884 // ID MAR standard; DNA; INV; 610 BP. XX AC AF518731; XX DR FLYBASE; FBgn0066141; Dwil\Mar. XX FT source AF518731:1..610 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..11 FT SO_feature terminal_inverted_repeat ; SO:0000481:600..610 XX CC Derived from AF518731. CC Michael Ashburner, 18-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 610 BP; 189 A; 122 C; 111 G; 188 T; 0 other; CAGAGGTAGG CACAAAGAGA GCCAGATCAA ACTGTCAATG TATGCGTGCG CTTGCTTGTG 60 TACGTAAGCT GCTTTACACT GCGCGAATCG TATGTGAAGA AACGAATAAA AAAATTAAAC 120 TGCAAGTGAA ACTGCGGAAT TTTCCCACAA CCCTTTTTTA CCGAGTCGCC TTAAAAAACA 180 AAGTACAGTC ATGAGTCTTG GATTCCTTTT ACCTTAATGT CATAAAAATG AATGCATATT 240 TTGTATCGGC TATTATATGT ATATATTTTT CATACTATAC ATATATGTGC ATACATGTAT 300 ATCTTACATT TATATTACAT TATGCATGTT CGCTTATATT ATAATATACT TTTTACCCTA 360 TAGGGACAAA ATATAAATAA TAGCAAAATA TTTTTAAAGT TGCCATGTAC GAAAGTGAAG 420 CATTAATTTT TCTGCGTGTA GCTGCGTAAC AAAGCGACAT AGTACTGCTT TCGTCAAAAG 480 TCTGTGCGAC ACTAGCTGTT TTTTTACGCT CTCAACTTGT ACTTCGTGCT CACATCGGCA 540 TGCACACAGA CGGCCACACG AGCAATTGCC GAGCAAAAGT GCCCGCACGG GCTATGGGCG 600 CCTACCTCTG 610 // ID HEL standard; DNA; INV; 1317 BP. XX AC nnnnnnnn; XX DR FLYBASE; FBgn0020425; Helena. XX FT source nnnnnnnn:1..1317 XX CC Consensus sequence from Dimitri Petrov, 18-March-2004. XX SQ Sequence 1317 BP; 419 A; 305 C; 264 G; 325 T; 4 other; AAAGCTCCAG GAATTGACAG GATTTGTCAT GCCACGCTAA AGGTTTTACC TATAAAAGCG 60 ATAATATATA TAGCACTAAT CTTTAATGCT ATTTTAAGGA TCCAAGTGTT CCCAAGACAG 120 TGGAAAATGG CTGYTATTTT GATGATCCAC AAGCCTGGAA AACCWGAAGA TGATCCTGAG 180 TCGTATCGGC CTATAAGCCT CTTACCCTCC CTTTCTAAAT TATGGGAGAG ACTTATTGCC 240 AATCGGATCA ACGACATTAT AAGACAAGGC AATATCTTGC CGGATCATCA ATTTGGATTT 300 CGAAAGGGAC ACGGAACTAT TGAACAGGTC CACAGACTGG TGAAACACAT ATTACAGGCT 360 TTTGACGACT GCGAGTACTC CAACGCTGTC TTTATAGATA TGCAACAAGC CTTCGACAAA 420 GTATGGCATG TTGGATTATT ATGCAAGATA AAGACCCTTC TACCTGCGCC CTACTTCTGT 480 ATTTTAAAGT CATATCTGGA AGAACGACAA TTTAAAATCA CGGTGAGAAA TAGCTACTCC 540 TCTATATACC CAATGAGAGC TGGAGTCCCA CAGGGCAGTG TTCTCGGACC GCTACTATAT 600 TCCTTGTACA CTGCTGATAT CCCTTGCCCG AATTTCGAAC ACATGGAAGC ACCGAACAGG 660 ACTCTTATTG CAACCTATGC AGATGACATC GCAGTTGTAT ATAACTCTAG GGACAGCAGA 720 GAGGCAGCTA ACGGACTACA AGAATATATT AATGCTCTGG CAGCCTGGTG TAAACGGTGG 780 AACCTAAAAA TAAACCCACT GAAAACAACA AATCCATGCT TCACATTAAA AACGCTTATC 840 CCGAACACCC CTCCAATTCG GCTAGAAGGA GTTACCCTGA ATCAGCCCCT GCAAGCAACA 900 TATCTAGGTA TCACCCTGGA TAAACGGCTC ACCTTTGGGC CGCATCTCAA AAACACAGTA 960 AAGAAATGTG GTCACAGATC ACAACAGCTG AGATGGCTCA TGAATAGAAG GAGCACTCTT 1020 TCGATGAGGT GCAAAAGAGC TGTGTATGCG CACTGTATCG TACCGATATG GTTATACGGG 1080 ATCCAGATTT GGGGAATTGC AGCCAAATCG AATTATAAWC GTATCCAGGT GATGCAAAAT 1140 CGCGCAYTAC GACAAATAAC CAACTGTCCC TGGTATGTAC GTAACTCTAC ACTCCATAAA 1200 GACCTCAATA TTCACACAGT TGAGACACAA ATTGGGAGAC ATACAAGTCG ATATAGTGAC 1260 AGATTACTGA GCCATAGCAG TCTTCTTGCA AGACGTCTCA TCCCCGCTCG ACCTCTG 1317 // ID P_T standard; DNA; INV; 3329 BP. XX AC AF012414; XX DR FLYBASE; FBgn0020218; Damb\P-element_T. XX FT source AF012414:1..3329 XX CC Derived from AF012414. CC Michael Ashburner, 19-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 3329 BP; 1067 A; 606 C; 695 G; 961 T; 0 other; CATGATGCAA TTATATAAGG TGGTCTCGTC GGCTGCCGAA TCGTTCGGAC CTTAAGCTTC 60 AGTGCGCGCT CGCTTGTTGT AGTAAAAGGT TGGGTACGGA CAAATTTTTT GCGGAAACCT 120 TAACCCTTGT GAAAAAAAAA TGACATATTG CCGCGTTTGC CGCAAGCACG TGCTCCGTGT 180 GAAATTGATA AAGGCACCAA AATGTGTTGT GAGAAGAGAA TTGAGGGAGA AATCTCTAGG 240 CTGCAGCCTG GGAGAGAACT CCAAAATTTG CGACACGCAT TTTAAAGCTT CGGAGTGGAA 300 GTCAGCTCCA AAAAAAGGCC AAGTCTTTAA AAGAAGGCGC CTGAATGACG ATGCTGTTCC 360 GGAAAGAGAC CCAGATCCTG AACCAAATAT TTAAAAATTA GGCTACGCCG ATTCGAGCAC 420 GCAAACAGAG TAAGCTAACC GTGTTTAAAT AAATAAAACA ATAATTTACT TGTAATGTAG 480 GGTAAAATTG ATAAATCGTA ACATAATTAT GGAGAACGAA AGCCTGAGAA AACAACTTCG 540 CCTTATGGAG AAGGAGATGC TATCTTTACG CCAGCAACTT GGATAGTACG AGGAATTAAA 600 CAAATCTTTA GGAAAAATTT TCACCGAGAC GCAAATAAGT ATTTTGAAGA GTGGAGGAAA 660 GAGAGCTGTA TTTAATGCAA CAGACATTTC TCCCGCCATC TGCCTTCATA CCGCGGGACC 720 ATAACTATAC AATCATCTAT ATAGAAAAGG ATTTCCTTTG CCGAGTCGGG CAACATTATA 780 TAGATGGTTG CCAAATGTGA AAATTAATGC CGGTACGCTA GTCTATCACT ACGACCTCAT 840 GGAAAATGAG GGAAATGTCT GAAGTGATAA GCTCTGCGTT TTGTCCTTCG ATGAAATGAA 900 GGTTGCTGCT GCTTTCGAGT ATGACAGCTC AGCGGATGTT GTGTATGAGC CGAGCAACTA 960 TGTTCAACTG GCCATAGCTC GTGGCTTGAA AAAATCTTGG AAGCAACCAG TATACTTTGA 1020 TTTCAATGCT CGAATGGACG CGGATACTTT GCTATCAATT ATAAATAAGC TTCACAAAAG 1080 AGGTTGTTGT TGCCATTGTT TCTGATTTGG GCGCTGGAAA TCAAAGATTA TGAAGGGAGC 1140 TTGGCATATC GGAGAGTAAG TTTCGTATAC AAAATTATAC AAAAGAGATA TTTCAAACAT 1200 TTTTTTTTTA GCAAAGACCT GGTTTAGTCA TCCAGCGGAT GAAGAATTGA AAATATTTGT 1260 TTTTTCGGAT ACTCCACTCC GGATTAGTCA TCAACAGAAA TAGATTGACT AAGACGACAG 1320 TCCAGCAGAC AATCAGCCAC TGTGCTAAAT CAGATGTGTC AATATTGTTC AAGATTACCG 1380 ATAATCACAT CAATATTGGT TCGCTGGCTA AACAGAAGGT CAAATTGGCA ACACAACTGT 1440 TCTCCAGTAC AACCGCTATG AAGTGGAGAA CGCATGCGAA ACCGCTGATC TCTTCAAAAT 1500 ATTCAATGAT TGGTTTGACG TTTTCAATTC GAAATTATCA ACAGCAAATT CCATTGAAGC 1560 GACGCAGCCT TATGGCCAAC AAATCGAAGT TCAACGAAGC ATTTTGGCTA AAATGTCTGA 1620 GATTATGCGC ACGGAGATAG TTGGCAAAGC TCATAAGCTT CCGTTCCAAA AAGGTATTTT 1680 AATTAACAAT GCATCCCTTC CCGGTTTGCA TGGATATTTA TCGGAGAAGT ACGAGATTCG 1740 TTATATTTTA ACAAGTCGTC TTAACCAAGA CATTGTGGAG AATTTTTTTT GGCGCCATGC 1800 GGTCAAAGGT GGACAGTTTG ACCATCCGAC TCCACTTCAA TTTAAGTCTA GGCTAAGAAA 1860 ATATATAACA GGTATGACAA AATTGAAACA CTCTATTAAT TAATGATTGA ATTTAAAAAT 1920 ACTAATTGTC AAATGTTTAG CTCTATGTTT CAGCCAGAAA AGTTTTCATC GTGCTTGTTT 1980 GTAGGTTTGT TCTGTGTTCT GTATTTTGTG TTTTGTTTTG TGATTTACCA TGTTTTACAC 2040 AATCAAAATG TTGAATGAAA GACACTTCTC ACACTTGGTT GAACCTAGAT ATAACTACTC 2100 AAGCAGACCA AAGTCAACAC AACAAGGAGA ACGAATATGA AGATTTTGCG GTAGACATTG 2160 ATAACAACAT AGCACCAGAA ACTGAGCTAG ACGAGCTAAA GGAGGATGCC ATTGAGTACC 2220 TAGCAGGCTA TGTTATCAAA AAACTGCGAC TTTCCAATGA GTTGGCCGAA AATTCGACAT 2280 TTACCTAGGT AGATGAGGTC TCGCAGGGTG GACTCATAAA ACCTTCTGCA CAGTTCAAAA 2340 ATGAACTGAA AGAGCCGGAA ATAATATTTT TAAACATTAT AAAAGAACAT TTTAACATAA 2400 CAAAAAACGT TAAGGAAAAT TTGATAATGG ATTCCGAAAA TGTTAATATA AATTTAGATA 2460 TAACGAGAAG GAACGTGTGA GTCGCTTCTG GGGACAGGAC TCTACTTATA CCCGTACTAA 2520 GTCAGTGTTT ACATTTTGGA GCATAATTTT TTTAGAATTT TAAAATTAAA AAAAAAATTT 2580 TATATTTTAC ATTTTTTGCT ACATCTGTTA TCTACATCTA CATTTACAAT GTATATTATA 2640 CATCTAACAT GTAGTATACA TGTACCATTT AGAATCTCCT CCGCTAGGGG TTGCTCGCGC 2700 ACTGAACGCG GCAGTGTGTG CGAGCGAGAA AGACAATGAG TATCATGGTC GTTGGCTGGA 2760 GGGGGGCAGG GGAAGGCAAG CTGAAAATTA ATTTCTTCAT TCTGGCTATA ATTATGATCC 2820 GATTTGATTC AGATTCGGCA ATCTGGTAGA TATGGGCACT CTCTACATAC TATCTGTCGT 2880 ATCTTAAAAA TGTGCCCTGT CCAACAGATT TTCGTCTTTT GTGATGCGGA AAGGGGGTCA 2940 GCCCGAAATT TTGAAATACA CTTGTAGCAG TGGAATATCA CATGAGTCTG GAAACCAAAT 3000 TTGGTTGCCC TAGCTCTTTT AGTCTCTGAG ATCTAATCGC TCATCGAGAC AGACAGACAG 3060 ACAGACGGAC AGACAGACAT GGCTCTATCG ACTCGACTAT TGATGCTGAT CAAGAATATA 3120 TATACTTTAT GGGGTCGGAA AATTTTCCTT AAACACCTTT TATTATAACC GGCAAGTTTA 3180 AACCCCATGG ACCATCAGAG GGTTAATCCA CAAGCTCTAT TGTTGTTTCA CTCACACTTA 3240 CTGAGGTGAG CCGTTTCACT CGCACTTATT GTAAGTATTG TTAAGGGCCT GCTGCTGCTG 3300 ACGAGACCAC CTTATATAAT TGCATTATG 3329 // ID P_O standard; DNA; INV; 2986 BP. XX AC X71634; XX DR FLYBASE; FBgn0012207; Dbif\P-element_O. XX FT source X71634:1..2986 FT SO_feature CDS ; SO:0000316:join(173..453,542..1221,1288..2013,2198..2781) XX CC Derived from X71634. CC Michael Ashburner, 19-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 2986 BP; 1022 A; 508 C; 609 G; 847 T; 0 other; CATGGCAAAA CGAGTGCCTT CGTAAGGTAG TCTACTCGGC TGCCTTTGCT TTCGGGTTTA 60 CTAGCTCTTT TGCCAGTGCA CTTTGCTTGT CAAAGTGGAA GGTTGGTTAC GGACGATTTT 120 TTCTCGTTTT ATTTTTCGCA TTAACCCTAA CGTGTTCAAA AAAAAGAAAA AAATGAACTA 180 TTGTTATTTT TGCCGCAAGA ATGTACCGGG CGTGAAAATC ATTCATGCCC CGAAATGTGA 240 AATGAAGAGA AAGCTGTGGG AGGAAAGCCT GGGGTGCAGC CTAAGCAAAA ATTCCCAAAT 300 ATGTGATACA CATTTTAATG CGTCTCAGTG GAGGACTGCC CTCAAGGGGA AGATTTATAA 360 GAAGAGACGC TTAAACAATG ATGCCGTTCC GCAAAGAGAG AAAGAGGATG AAAGTGTTAA 420 AGAAGGCTAT GCTAATGCCA GTACGGAAAC TGAGTAAGTT TGGCGTGTGA AATATATATG 480 TACATATATT TCTTTTTGTA CATATGGATG GATATGTAAA CATAATGAAA CTGTCTTGTA 540 GGGACACAGT GATAAATCAT TCAACGTCCA TGGAAATAAA GACTCTGAGG CAAAAGATTC 600 GTGCTTTGGA GGATGAAGTA CAGAGCTTAC GTAAGCTCGT AGAGGACGCA AGCCAGTTAG 660 AGAAATCTTT AAGTACCATC TTCACTCAGA CCCAAATAAA AATTTTAAAG AGTGGTGGAA 720 AGAGGTCAGA ATTTAATTCA GATGACATAT CATGGGCTAT GTGCCTCCAT ACCGCAGGTC 780 CTAGGGCATA TAACCATCTG TACAAAAAAG GATTTCCATT ACCTTGTCGG GCAACATTAT 840 ACAAGTGGTT GTCAAACGTG GAAATACAGA CTGGTTGCCT GGATGTGGTC ATCGATCTTA 900 TGGACAATAT GGACATGGAT ACGGCAGATA AGCTTTGCGT ATTAGCATTC GACGAAATGA 960 AGGTTGCTGG TACATTCGAG TACGACAGCT CGGCGGATCT TGTATACGAG CCAAGCGAAT 1020 ACGTGCAACT TGCGATGGTT CGAGGATTGA AAAAATCGTG GAAGCAACCA GTGTTCTTCG 1080 ACTATGATAC CAGAATGGAC GTGCCAACTC TTTATGAATT AATAAAAAAA CTACACAGAA 1140 GAGGATATTT TGTGGTATCA ATTGTCTCCG ATATGGGTGC TGGAAACCAA AGATTATGGA 1200 GAGAGCTCGG TATATCTGAA GGTAAAGTAT ATTCTTATTA TGAATGAAAT CAAAGATAGT 1260 TGTTAATTTT TTTAAATTTA CAATTAGAAA AAACCTGGTT TGGCCATCCC GAGGATGAAG 1320 ATCTGAAAAT TTTCGTGTTT TCAGATGCAC CACATCTAAT AAAGCTGGTT CGTAACCATT 1380 ATTTGGCTAC AGGTTTACAT ATAAACGGAC AAACATTGAC CAAGTCGACT GTCGAACAAA 1440 CTATAACTCA CTGTTGTAAA ACAGATGTAA CAATATTGTT CAAAGTAAAT GAGAGCCATT 1500 TAAATGTTCG CTCCTTTGCC AAGCAAAAGG TCAAATTAGC AACACAATTA TTTTCGAATA 1560 CAACCGCCAG TGCCATCAGA CGCTGCTACT CTTTAGGCTA TCAAGTTGAA AATGCAGTCG 1620 AAACCTCGGA TTTATTTAAA CTCCTCAACG ATTGGTTCGA CGTTTTCAAT TCAAAGCTGT 1680 CAACGTCCAA CTGCATCGAA ACTACTCAGC CTTATGGCAA GCAGCTCGAA CTGCAAAGAG 1740 ACATTTTAAA ACAGATGTCT CACATTATGA GTAATAGGAT ATGCGGGCAG ACCCATAGGC 1800 TCCCATTCCA AAAAGGGATA CTAATAAACA ATGCATCTCT TGATGGACTG CATGCCTATT 1860 GCAATGAAAA GTACGGAATG GAGTATATTT TGACAAGTCG GCTGAATCAA GATATTGTTG 1920 AAAATTTTTT CGGAGCCATG CGGGCGAAGG GTGGACAACA TGACCATCCG TCACCCTTAC 1980 AATTTAAGTA TAGATTAAGA AAATACATTG TAGGTAAGAC AAAACTAAAA CAATTTGTTA 2040 ATTAGCAAAT AATTGATTTT AATAATAATA ATTGTCAAAT GTCTAGATTT ATGTTTCAGC 2100 AAAGTTTGAA TTGTGAATGT TGGTAGTTAT GTGTTGTCCT GTTTTATGTG TTTATTTGAT 2160 TATTTTCATT TATTAATAAC CCTTAAATAT TATTCAGCCA AGAATACAGA GTTGTTAGCC 2220 GGTAACGGAA ATGTTGACGA GGACAACTGC GACAGCTGGC TCAATCTCAA CATAACTCCA 2280 AATGGCAATA AGGAGAATGA GCCCGATGAA GGGAAGTGGA AAGGGTGGTC GAAAGAGTTC 2340 GAAGAGTTCG AAATAGAGAT GGACAACAAC ATAGCTGCAG AGTACATCAT GGATGAACTA 2400 ACGGAGGACG CTATGGAATA TCTAGCAGGT TATGTTGTTA GAAAATTAAG GTTGTCTAAT 2460 GAATCAACGC AATCTGGATT TACATATGTG GATGAGGTAT CGCACGGCGG GCTTATAAAG 2520 CCGTCAGACC AGTTTACTGC TACATTAAAA CACTTAGAGT CTATATTTAT AAATAATATT 2580 CACAATACTA TTGAAATAAC TAAAGACATA AAAAAAAAAT TATTAATTGC TGCTAAACAT 2640 GTACAAATTG ATAATAATGT AAAACAATTT TATTTTAAAA CTAGAATTTA TTTTAGACTT 2700 AAATATTTAA ACAAAAAATT AGCTATTAAG AATCAAAAAC AACGTTTAGT TGGAAATTCA 2760 AAATTATTAA AAATAAAATT ATAAACACAA AAATAACATC GTCTCAATTC ATTATTGTAG 2820 GCTATTTTCA CCCCAACAGG CAACAAGGGG TTAATCAACT CACAGCAGTG CCTTTCAGTC 2880 GCACTCATGT ACTATGTGCC ATTTCACTCG CACTCATTGC AATCATACAA GAGCTAGTTT 2940 GGCAGCCGAG TAGACTACCT TATACACAAG CACTTATTTT GCCATG 2986 // ID P_M standard; DNA; INV; 2935 BP. XX AC X60990; XX DR FLYBASE; FBgn0012207; Dbif\P-element_M. XX FT source X60990:1..2935 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..31 FT SO_feature terminal_inverted_repeat ; SO:0000481:.. FT SO_feature CDS ; SO:0000316:join(155..447,512..1179,1242..1967,2159..2733) FT SO_feature start_codon ; SO:0000318:1..3 FT /db_xref="FLYBASE:FBgn0044301; Dbif\P-element\T" FT /protein_id="CAA43305.1" FT /translation="MKYCKFCCKVVTAGVSLVHVPKCSVKRKLWEESLGCSLGGNSQI FT CATHFNDSQWKSTENKGQANKRRRLNKDAIPTKEIEPEPENVKEGYTSSSTQTECCSL FT SKENKSLRQTIRAMEYDLQRLRSQLEESRQLEESLGKIFTETQIKILKNGGKRSTFTS FT DDISAAICLHTAGPRAYNHLYKKGFPLPSRTTLYRWLSDVEIKTGCLDVAIDLMENDA FT MDEADKLCVVAFDEMKVAAAFEYDSSADVIYEPSNYVQLAIVRGLKKSWKQPIFFDFS FT TRMDADTLNNIIRKLHTKGYPVVAIVSDLGSGNQRLWSELGVSESKTWFSHPTDEHLK FT IFVFSDTPHLIKLVRNHYVDSGLTLNGKKLTKTTVQQTLNHCTKSDVSILFKISENHL FT NVRSLDKQKVKLATQLFSNTTASSIRRCYSLGYDVENACETSDLFKLLNDWFDVFNSK FT LSTANCIQSTQPYGKQLEFQRDVLEKMTKIMSSEILGKSQKLPFQKGIIVNNASLDGL FT FLYLKDKYNMEYLLTSRLNQDIVENFFGAMRSRGGQFDHPTPLQFKYRLRKYLIAKNT FT ELLRNTGNVEEDNTDSWLNLDFSSKNSENWENNPEDVEPEDDEQTIANNIPADIEMDE FT LTEDAIEYVAGYVIKRLRTSDCVEQSSTFTYVDEVSHGGLIKPSDQFKNKLKELEKIF FT LHYTKENFKITKNLKEKLIIAAQNVELDKFVISFYFKIRIYFRVKYLNKKICIKNQKQ FT RLLGNSKLLKMKL" XX CC Derived from X60990. CC Michael Ashburner, 19-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 2935 BP; 1003 A; 495 C; 589 G; 848 T; 0 other; CATAATGGAA TAACATAAGG TGGTCCCGTC GTGAGGCCGA AGTTTTACGA AGTATCCACT 60 TATTTCAGTG CACGTTTGCT TGTTGAAGAG AAAGGTTGTG TGCGGACGAA TTTTTTTTGA 120 CATTCTTTAA CCCTTAAGCG TGACACATAT TAATATGAAA TATTGCAAAT TTTGCTGCAA 180 AGTTGTGACG GCGGGAGTGA GTCTTGTTCA TGTTCCGAAG TGTAGTGTAA AAAGAAAATT 240 GTGGGAAGAA AGCCTGGGGT GCAGCCTGGG TGGAAACTCG CAAATCTGTG CCACACACTT 300 TAACGATTCG CAGTGGAAGT CTACAGAGAA CAAAGGCCAG GCAAATAAAA GAAGGCGACT 360 TAATAAAGAT GCCATTCCAA CAAAAGAAAT AGAGCCTGAA CCAGAAAATG TAAAAGAAGG 420 CTACACCAGT TCCAGTACGC AAACAGAGTA AGTTAAAATA GTAAAAGAAG CATGTGCAAT 480 AATGTAAACA ATAATGTAAC TGTGTTTGTA GGTGCTGTTC ATTGTCCAAA GAAAATAAGA 540 GCCTGAGGCA GACAATTCGA GCGATGGAGT ACGATTTACA ACGTTTACGA AGTCAGCTTG 600 AGGAGTCTCG CCAATTAGAA GAGTCTCTTG GGAAAATTTT TACAGAGACT CAAATTAAAA 660 TCCTTAAGAA TGGTGGAAAA AGAAGCACAT TTACATCAGA CGACATATCG GCAGCTATTT 720 GTCTCCACAC CGCTGGCCCT CGAGCGTATA ACCATCTATA CAAAAAAGGA TTTCCATTAC 780 CCAGCCGTAC CACGTTGTAT AGATGGTTAT CAGATGTGGA GATAAAGACA GGATGTCTGG 840 ATGTTGCCAT AGATCTTATG GAAAATGATG CAATGGATGA GGCCGACAAG CTTTGCGTAG 900 TGGCCTTTGA CGAGATGAAG GTCGCTGCAG CCTTTGAGTA CGACAGCTCA GCAGATGTGA 960 TTTACGAGCC CAGCAACTAT GTGCAACTGG CTATTGTTCG TGGTCTCAAA AAATCGTGGA 1020 AGCAGCCAAT TTTTTTTGAC TTCAGCACCC GAATGGACGC AGATACCCTG AACAACATAA 1080 TAAGGAAGCT ACACACAAAA GGGTATCCAG TAGTAGCTAT TGTATCCGAT TTGGGTTCTG 1140 GAAACCAAAG ACTTTGGTCA GAGCTTGGTG TATCAGAATG TAAGTTTGTC ACAAGCATTA 1200 AAATTAAAAA TAATCTTTAA TATTTTGTAA TTTTTTTTTA GCAAAAACCT GGTTTAGCCA 1260 TCCAACGGAC GAACATTTGA AAATTTTCGT TTTTTCGGAT ACACCACATT TAATTAAGTT 1320 GGTCCGAAAC CATTACGTGG ATTCCGGATT AACATTAAAT GGAAAAAAGT TGACGAAAAC 1380 GACAGTACAA CAGACACTTA ATCATTGTAC TAAGTCAGAT GTCTCTATTC TGTTTAAAAT 1440 AAGCGAGAAC CATTTAAATG TTCGCTCGCT AGATAAACAA AAGGTAAAAT TGGCAACGCA 1500 GCTTTTTTCC AACACTACCG CCAGCTCCAT CAGACGCTGC TATTCGTTGG GGTATGATGT 1560 GGAGAATGCT TGCGAAACGT CGGATTTATT TAAGTTGCTG AATGATTGGT TCGACGTGTT 1620 TAATTCAAAA TTGTCAACGG CAAATTGCAT CCAATCCACG CAGCCTTATG GGAAGCAACT 1680 TGAATTCCAA AGAGATGTTT TGGAAAAAAT GACAAAAATA ATGAGTTCCG AAATTCTTGG 1740 CAAATCCCAA AAGCTGCCAT TTCAAAAAGG GATTATTGTT AATAATGCAT CCCTGGATGG 1800 ATTGTTCTTA TATTTAAAAG ATAAATACAA CATGGAGTAT TTATTAACTA GCCGTCTTAA 1860 CCAAGACATA GTGGAAAATT TTTTTGGCGC TATGCGATCG AGGGGTGGTC AGTTTGACCA 1920 TCCAACTCCA CTACAGTTTA AGTATAGGTT AAGAAAATAT TTAATAGGTA TGTCAAATTT 1980 AGAAAAATGA ATTACAAATT AATTCATTTT ATTAATTAAA TTTTTTAAAT GTTTAGCTAT 2040 ATGTTTCAGC AAAGTGTGGA TCGAGAATGT AGGTAGTTAT GTGGTGTCTT ATGTGTTTTG 2100 TCTTTTATAT GTTTCTTTTA ATTTTATTAT TTACTAATAA TTCTTATACT TTATCCAGCC 2160 AAGAATACAG AATTGTTGAG AAACACTGGA AATGTGGAAG AAGACAACAC TGATAGCTGG 2220 CTTAATTTAG ACTTTAGCTC TAAAAATTCA GAAAATTGGG AAAATAATCC TGAAGATGTT 2280 GAGCCTGAAG ATGACGAACA AACCATAGCA AACAACATAC CTGCAGACAT CGAAATGGAT 2340 GAGTTGACGG AGGATGCCAT AGAGTATGTC GCCGGCTATG TCATTAAAAG GCTAAGGACG 2400 AGTGACTGCG TCGAACAATC TTCGACATTT ACCTATGTAG ATGAGGTGTC TCACGGCGGT 2460 CTTATTAAAC CGTCGGATCA ATTTAAAAAT AAGCTCAAAG AGCTAGAAAA AATTTTTTTG 2520 CATTATACAA AAGAAAATTT TAAAATAACA AAAAATTTAA AAGAAAAATT AATAATCGCA 2580 GCACAAAATG TAGAGCTGGA TAAGTTTGTA ATTTCTTTTT ATTTTAAAAT AAGAATATAC 2640 TTTAGAGTTA AGTACTTAAA CAAAAAAATT TGTATTAAGA ACCAAAAGCA GCGTTTACTT 2700 GGTAACTCAA AACTATTAAA AATGAAACTT TAAAAAAATT GCTCGTCTCT TTATTATAAA 2760 TGGCATATTC AAAGCCTACG GACATGCTAA GGGTTAATCA ACAATCACAT TGCCGCCTCA 2820 CTCACACTTA CTACGACACT CAGTTTACTA TGTTCCTTTC ACTCGCACTT ATTGCAAGCA 2880 TACAATAAGT GGATGTCTGT TGCCGACGGG ACCACCTTAT GGTTATTCCA TCATG 2935 // ID SGM standard; DNA; INV; 823 BP. XX AC AF043638; XX DR FLYBASE; FBgn0069871; Dsub\SGM. XX FT source AF043638:1..823 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..16 FT SO_feature terminal_inverted_repeat ; SO:0000481:765..780 XX CC Derived from AF043638. CC Michael Ashburner, 19-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 823 BP; 217 A; 189 C; 188 G; 229 T; 0 other; TTTTATAACC CGGTACTCGA AGAGTAAATA GGGTATTTTG TATTTGTGCA CAAAGTGAAT 60 GTATGTAACG CACAGAAGGA AACGTCTCCG ACCCCATAAA GTATATATAT TCTTGATCAG 120 CATCAATAGA CGAGTCGATT GAGCCATGTC TGTCTGTCCG TCTGTCCGTC CGTCTTGTTG 180 AGCGCCTGGA TCTCAGAGAC TATAAGAGCT AGAGCCACCA AATTTTGCAT CCAGACTTCT 240 GTATGCTCAC ACTGTTACAA GTGTATTTCA AAAAAAGAGC GTACGCCCCC TTCCGCCTCC 300 GCAAAAAAGG CGAAAACCTC CCAAGTCTAC AATTTTGAAG ATAGTAGAAA ACTATTCCGT 360 AGGGAGTGAC CATATCTATC AGATCACCAA ATTGGGGCAT TATTGGAGCT TTATTATAGC 420 CACAATGAAG ATATTAATTA GCAGTGGCCA AACCCACCCC GTTCCGCAGC TTATATTTGT 480 TCGCGCAATG TGTGGCGTCT GCCCCTGCGG TTCCTCCGCC TCTGCCATAT TAGCTTATAT 540 TTGGTTTTTT GCACATTCTC TCATTCACAC TTCTCTTCGC GCAGTGGCCG CACACTGGCC 600 GTCTGCCGCT GGCTCTGCCT CTGCCGTGAG TCTGCAGTGT GTGGGCTAGG AAGGGGGCCG 660 AGCTAAAGGA GCGTGTTGGC GAGAGTAGTG TTGTTGATGT AGATGACACA TGAAGAAAAA 720 ATTAAAAATT TGACAAATAA CCGCTATTGT ACAGATGTAG TACTGAGTGC CGGGTATAAA 780 AGTTGTGACG CGTAAGAAGC GTCTCACAAG TCCCTTCTCG TTG 823 // ID DDBARI1 standard; DNA; INV; 1676 BP. XX AC Y13852; XX DR FLYBASE; FBgn0020486; Ddip\Bari1. XX FT source Y13852:1..1676 FT SO_feature terminal_inverted_repeat ; SO:0000481:<1..198 FT SO_feature terminal_inverted_repeat ; SO:0000481:1423..1676 FT SO_feature CDS ; SO:0000316:331..1349 FT SO_feature pseudogene ; SO:0000336:331..1349 XX CC Derived from Y13852. CC Michael Ashburner, 19-March-2004. CC Any changes to original sequence record are annotated in an FT line. XX SQ Sequence 1676 BP; 600 A; 319 C; 348 G; 497 T; 0 other; TCGCTTATTG CTGCTGCTTT TGTACTCAGA AACGAACATA ACGGTATTTG TCGTCCTTCT 60 ACATCTATAC AGAAGCAAAA GAAAGAAGAA GTGGAAAAAA TAACTGTTCA TGGACTAAAA 120 AAAGAAGCAT TGAACTGATG CTTGTCTGTT GTCAGAAGTA TTTACACACA ATTTTTACGC 180 GTCAAAATTA TTTACACAGT GCAAATTGAA GATTAGTCTT TGATTTGCAT TGTGTAAGTG 240 AAGTTCTAGT TGATGTTGAA CGCTTAATTG GAAGTTTAAC TAACTGTTAA CAGTTAAATT 300 AATAGATAAA ACTAAGCATT TAATTTGTAA ATGGCAAACA CAAAGGAGCT ATCAGTTGAG 360 CATCGAGCCG AGATTGTCAC TAAATTTAAG GCTGGTACTT CTGCATCCAA ATTAGCAGAG 420 TTGTACAAAA TATAACGTAA GACTGTGTAC AATTTATTTA AAAAAAGGAC ACACTGGGAA 480 ATTTGGAAAA TAAAAAGAGA ACTGGCCGAA AAGTTGCATT GAACCCAAGT GACTGCAGAC 540 ACTATAGGAG ATTGTACTAT GTAATCCCAC ATAAGTCCCG TTAAAATTGC CGCAGACTTA 600 CGAAAAATTT AATTGGAAAA CATATTAGCG ATAGTTACAC GCGTCGCAGA ATGAAGGAAG 660 TTGACATCAA CACCTATGTT GTTCGTGCAG TAGTTAATAT TACCCCAAGA AACAAGGAAA 720 AACGTCTCGC ATTTGCCTTG GAGTATAAAG AGAAGCCTCT CGAGTTCTGG TTTGATGTTT 780 TATGGACTGA TGAAGTTGCG TTTCAGTTTC AGGGATCCTT TATCAAGCAA TTTATGCATC 840 TCCCAAAACA ACTAAAGGGT AATGCCGACC AGCCTATCAA TCGGTTTGGT GGTGGCACAG 900 TGATTTTCTG GGGCTGTTTA AGCTACTATG GATTTGGAGA CATGGTACCG ATAGAGGGTA 960 CACTAAATCA AACAGGATAC CTCCATATCT TAAATGAGCA CGCATTAACC TCAGGAAAAA 1020 GACTGTTCTC CACGAATGAC TGGATCTTGC AGCAGGACAA TGCTCCATCC GCAAAGGGCC 1080 GAGGTCCAAC CAAGTTCTTG AAAGACATAA ATCAGGCGGT ACTTCCATGG CCCGCACAAA 1140 GCCCCGATCT GAACATTATC GAAAATGTCT GGGCTTATGT TAAGAGTCAG AGGACATTTG 1200 AAAGAGATCG CCAGCGCGAT GAAGCTATAG CCGAAATTAC GAAAAAATGG TCCAAATTAC 1260 CTCTTGAGTT TGCGCACAAC TTGGTCCGGT CAATACCAGC AAGGCTCCAA GCTGTCATTG 1320 ATGCCAGAGG CAGAGTAACT AGATATTAGT TGGCTTATTA AAATAATATA AAATATTTCG 1380 TTTAAATTCA ATAAAAAACG CAAAAATCAA TTAAAAAAGC TGTGTGTAAA TAATTTTGAC 1440 GCGTAAAAAT TATGTGTAAA TACTTCTGAC AACAGACAAG CATCAGTTCA ATGCTTCTTT 1500 TTTTAGCCCA TGAACAGTTA TTTCTTCCAC TTCTTCTTTC TTTTGCTTCT GTATAGATGT 1560 AGAAGGACGA CAAATACCCT TATGTTCGTT TCTGAGTACA AAAGCAGCAG CAATAAGCGA 1620 TCAAAGTTGA CAATTGAAAA AAAAGCACGT TGTGTAAATA CTTTTGACCA CCTCTGT 1676 // ID TC3 standard; DNA; INV; 1743 BP. XX AC AC009537; XX DR FLYBASE; FBgn0061191; Tc3. XX FT source complement(AC009537:133476..135218) FT SO_feature terminal_inverted_repeat ; SO:0000481:1..27 FT SO_feature terminal_inverted_repeat ; SO:0000481:1717..1743 XX CC Casey Bergman & Michael Ashburner, 2-April-2004. XX SQ Sequence 1743 BP; 599 A; 284 C; 297 G; 563 T; 0 other; CAGTGCCGCT CAACTGAATA GGTTTGAAAA ATCTTTCTCA CATTTCTCTT TTTAGTTTTT 60 CTTTGGTCCA ATTGCGTTTT TTTCTATAAT TTTTTTTTTT TTGTTTTGAA CCTTCAATTC 120 CTTACTTATT ATGTGCAAGA AAAAAATAAT GGGAAAAATG CAACGGTAAT TCGGAATTTT 180 GCTCTAATCG TGAAATTTGC TCAATGTGTG GACTATTCTT GGCTCATCTG AATAGGTTTT 240 ACCAAAGACA CTTGAATAAT ATATAATATT AATTATTAAT TAATATTATT ATATATAGTA 300 ATATATTAAT ATTAATATTC TAGTGTCTTT GGTTTTACAC TTTTGACTTC ACTTTTTGTT 360 TGACCGCGTG ATTGTAAAGA GTTAAATTTT TTTTTGTATT CTAAAATTTA AATCATCTTT 420 TATCATCTTT TTCTAAAATG GGCCGCGAAA AACCTTTATC CGATTTTGAA AAAGTTCAAA 480 TCAAAGGCTA TATTGAATCT GGCTTAAAAC ACTTTGTAAT AGCCAAGAAA ATCGGTCGAA 540 GTCAAAACGT TGTGAGTAAT TTTCTCCGCA ATGAAGCCGA CTAATGGAAA AAATGAAAGG 600 AGGAAAAAAA TATTGTATGC AACAATACCA TAAGAGAGGC CACTTATACT ACGAACTGCC 660 TCAAATTCTC ACCTTTCTGC TGGAAAAATT AAGGAAAAAT GCGGTGTAAA TGCTAGTGTG 720 GCCACGGTAA AACGAATTAT TCAAAGTTGC AAATACTTGA ACAGATTACA AATGAAAAAA 780 AAAAACACCT CTCAATCATG CTCGTAAGGA AGCACGACTG CGATTTGCCC GAGAACACAT 840 GACGTGGAGC AAGGAGTGGA AGAAGGTGGT TTTCTCCGAT GAAAAGAAGT TCAACCTCGA 900 TGGACCGGAT GGCTACTATA ACTACTATTT CCATGATATA AGAAAGGAAG ATTGTTTTTT 960 AAGCCGTCAT CACACTTGTG CAGGTGGTGT AATGGTGTGG GGAGCCATAT CTTTTTATGG 1020 AACTTGCGAG TTCCAGTTTG TCACCTCGAA AATGAACGCA AACGTGTATA AGACTGTGCT 1080 TCAAAAGGCT TTTCCAGAGT TTTGTGAGAT TTATGGTCAT ATTCAATAGA CGTACCAACA 1140 TGACAACGTG CCCATCCATA CGGCTCGGAT TATAAAACAG TGGATCACAG ACCAAAACGT 1200 TAAATTGCTC GAATGGCGGC CGCCTTACTC CCCTGACAAT AACATTATTG AAAACATTTA 1260 AGGACTTTTG TCCAGCAGAG TGTACGAAGG AGGAAGACAA TTTAGTGACA CTGAGACCGT 1320 AGTTGAAGCA ATTCAAAAAG CCTGGGCAAC AATTTCACTA AATGAAATTA AAAAATATTA 1380 TGATCCCTTA CCAAACCGTA TGTTTGAAGT AATTAAGAAC AAGGGTGGCC ATACCAAATA 1440 CTAAATAATG CTAAAACAGT GCCTTTTATT ATTTAAACTA TCCAGTGCCT TGGAATTTTG 1500 GTTTATACAA ATAAATGTAT ACTTTATTAA ATACTATTAA ACTTTATTAA ACTTTTTTTG 1560 TGTTTTTGAA AAAATAAAAA TGATATAAAA ACAAATTTAA GAGATCAGAA ACTTTGCTAT 1620 TTAACAAATG AGTGATCTAT TCATACATAC AAATTTATTT CGCTTAAATC GGATTGCATA 1680 TAACTAAGTT ACAACGTTTC AATCCTAAAC AAATAGTCAA ACCTATTCAG TTGAGCGTCA 1740 CTG 1743 // ID Beagle2 standard; DNA; INV; 7220 BP. XX AC AF365402; XX DR FLYBASE; FBgnnnnnnnn; HMS-Beagle2. XX FT source 3R.Release.3:2000950-2008169 XX CC Casey Bergman & Michael Ashburner, 2-April-2004. XX SQ Sequence 7220 BP; 2491 A; 1622 C; 1399 G; 1708 T; 0 other; AGTTATTGCC CTATAAGTTA TCGTCCCACA TCTTATATTT CAACATCATT TCTGGGTAAA 60 CACAAGTGCC CAAAATGCTG ACCAAAGGTC TGCATTCTTT CCGCTGTCAA CGACCAGCTA 120 AAAGTGCGAT CATCTGCACT TCTCCGCCGC GTAGCCGCTG ACGACCACTG ATTCGCTGCC 180 GACGCCTGCT GCGACGCTGC CGACGCCCAC ACTTGATTGC TAGGGACTTA GGGAAATATT 240 TTGTATCTTA GCTTTAGTTT CAAATGAACA ATTACAATAA ACGGTCGCTT GCGATCTACA 300 AAATAAAAAT CAATAAACTG TAATTATTTA CTGGCGCCCG AACAGGGACC AGCGAATAAC 360 GCGTACGACA GACAAAATTC TAAGTCGCGA AGCAAAATAA AATTTTGCAA AAAAAAATTC 420 GTTGGTTAAA TTAGTGCCGA AGAAACTCCC GTGAGTTAAT AAAAAATTCG CGGTCGGCAT 480 TTATAAAAAA AAAAAAAAAA AAAAAAAACT TTTGTTTTCC GGAAGAGGAA AATATTCAAC 540 GGCAGAATAT TGCCCATCGG TGGATCACAT CTTTTGTCAG GCCAGCCCGC AGAAGACCTT 600 ATTTGGAGTG CTGACGTGGA TCCCCAGTAG CACAAGGCAA AAAGGATGAC CTTCACCCAG 660 CCAGGAGGAG TGACCACAGC AATTTTATTG TGAGACAAGA AAGAAATATT TTTTTTTATA 720 GAATTTATTT TAAGATTTAA AAATTTAATA CACTTGTACT TTAACATTTC ATCGACCCCA 780 GATCGGTTTC GACAGTACAT TTAATTTTTT AATTTCCCTT TTAATTTACT TACGGTTTTG 840 GAAAATATAA TTCAAGCTCG AATAGTTCGA ACGATTTAAA TTGCCAAGAT CATAACACTC 900 GTGCCTTTTT GTTTTGGTTA AGATCGTTTT GTGTAAGTGT TGTCCGAGTC AAAAGCCAAC 960 ATGGCAAACC CCAATAATCT AATAAGACCA CCGGTTCTCA GTTCTAATGA GAGACCGGTA 1020 CCCGTAGAAA GACCCGACCT GCCTGAACGG CAGCGCGTAG ATATGAATGT TGAACAGTTG 1080 ACAGGGTTAA TTGGCCAAAC GGTAGCCCAA ATTTTACCTG GACTGATAAA ACAAATGAAC 1140 AATGACACAC TTGACTTTAC TGACGTGAGA GATCAGGTCG TCGAGCCTGA ATACAGAAAC 1200 AATTTAGGGG ATTTTGACAG GGTACCTGAT ATCGTAAAAT CGATCAGGGA ATTCTCTGGG 1260 GACCCAGCAG AATTCGGCTC TTGGAAAAAG AGCGTTGATA GAATCATGGA AACTTATACC 1320 CCATTTGTGG GTACTCCAAA ATATTATGGT ATACTTCATA CCATAAGAAA TAAAATTGTC 1380 GGAAGTGCCG ACGTGGCACT CGAGTCATAC AGCATTCCGC TGGATTGGAA TTCCATGTCA 1440 AGATGTCTCA CTCTACATTA CGCAGACAAA CGAGACATAA CTACACTAGA ATACCAAATG 1500 TCAATTTTGG TCCAAGGTCG CCAACAGTCG GTGGAAGACT TCCACCAAGA CGTTTATAAA 1560 AATCTCTCGC TTATTTTAAA TAAGCTTAGC TGCATGCAAA TGACCAGAGA ATCCGAGCAT 1620 TTCATTACAA AAATGTACAG AGATAAAGCT CTGGACACTT TTATTAGAGG CCTTCGAGGA 1680 GACTTACCTC GCCTCTTGGC TATAAAGGAG CCAGCCGATC TCCCTTCAGC TTTGCATTAT 1740 TGCTTAAAAC TTGAAAACCA AACATACAGA TCAAACCATG CAACAAATAA AGGGCAACAG 1800 TCGAGCTTCA GAGTAAACGA AAAACCCATA CCGGCACCCC GCAATTTTCA GCCATCGAAC 1860 GCCTTTGGCC ATCGACAGCC TCCACCGGTC CCTCCACGAA ACTTCATGAG GGGACCACCA 1920 CAACACTTAG GTCAGCCATT CGCTGCGCCA AGACAACACA TGCTCAATTT CGGTAATGCA 1980 CCGTTCGCCG CACCCCGACA AAATTATCAA CAATTGCAAC AATATAATGC CCCACCCCGT 2040 CCTTTCGCGG CCAAACCGCA ACCAAGACCA GAACCAATGG ACGTGGACCG CAGCATACAG 2100 ACGAGAAATA TCAATTATAT GAACCGACCA CATTTTGACG CCGGCAAACG ACCGAGTGGA 2160 CAGACTACAG ACATAAACAA AAGACAAAGA AATTATAATA TCCAAACTAG GGGCATGGGA 2220 CATCCAAGTG CCAAAATAGC TGGGCCCAGT TCAAGCATGA CAGACTATCA ACGGTCGATG 2280 CAAGAATATG AAACCCAGAA CGACATAAAC GGCACCATGG ACGAATATTG CAACGAACTG 2340 ACGGACTACT TAAATGACGA TGAGCAACAA CAACAGATGC ATTTTTTAGA TTAGAAGACT 2400 CCTCACTGCC ATACTTCGAA TGTAGAGTGA GGAGTGGAAA GGTTCTAAAG GTGTTGATCG 2460 ACACGGGCTC CAACAAAAAT TATATTCAGC CAAAACTGGT GTCGAATGCA ATACCAAATT 2520 ACAAGCCTTT CATAGCTGCC ACTCCGGGTG GCGATATAAA AATAACACAC TACAAAAGAG 2580 CAGACCTTTT TGACTGGGAA ATAAAATTTT TCCTCTTAAC ATCTCTTACG ACATTCGACG 2640 CAATTCTCGG CAAAGACACC CTGAAAGAAA TGGGAGCACA GATAGATTTA GGAAATTTGA 2700 CCATGACACT AGGAAATGGG AAGAGGATTG CTATCAAGGA ACGAAAGTTC GAGGCTGTTA 2760 ATACAATCAG CCCCAGAATA GATCACTTGG GACAAAAACA AAAAGAAAAA CTGAATCGGG 2820 TAATTAATAA TTATCCAGGT CTCTTCGCAG ACCCAAACCA AAAACTGACC TTCACAACAA 2880 ATGTAAAGGC AGCAATTCGA ACCACATCGG ATACACCTGT GTATTCGAAG TTTTATCAGT 2940 ACCCGATGTC CCTAAAAGAT GAAGTCAATA AACAAATAGC GGAGCTTTTG CACGACGGAA 3000 TAATTCGACC ATCAAGGTCA CCCTACAATT CACCAGTGTG GATTGTGCCA AAAAAACTCG 3060 ACTCCTCTGG TAAAAAGAAA TACAGAGTCG TGATTGATTA CCGAAAACTC AATATGGTAA 3120 CAGTAGCAGA TAGGTACCCA ATCCCCGATA TTAACGAGGT TTTAGCCCAA CTTGGTGACA 3180 ACAAAGTTTT CTCGGTGCTC GATCTTAAAA GTGGGTTTCA CCAGATCCCA CTAAAAGACT 3240 CCGATATCGA AAAGACCGCC TTTTCCATAA ATCATGGAAA ATACGAGTTC ACTCGACTTC 3300 CATTCGGTCT GAAAAATGCA CCATCAATAT TCCAACGCGC ACTCGACGAT ATCCTTCATG 3360 AGCATATTGG CAAAATATGT TTCATCTATA TCGACGATAT CATTATCTTT AGCAAAGATG 3420 ATGAAACCCA TTACAAAAAC CTGGACATAA TTTTCAAGAC CTTGCAACAA GCCAACATGA 3480 AATGTCAGTT GGACAAATGC GAGTTTATGA AGAGGAAGGT AGAATTCCTG GGATTCGTCG 3540 TGTCCGACAA GGGCATAGAA ACCAATCCAA TCAAAGTACA GGCAATTTCA GACTTCCCAA 3600 TTCCAAAAAC ACTCAAAGAA CTGAGATCAT TCTTGGGATT ATCTGGATAT TACAGGAGAT 3660 TTATACCCAA CTACGCTAAG TTGGCAAAAC CACTTAGCTC GCTTTTGAGA GGGGAGGATG 3720 GACGAATTTC CAAGACATTA TCATCAAAAA AATCCGTCTC CCTTAATAAC GAAGCAATGG 3780 AAGCCTTCAA GAAATTGAAG AGCAGTTTGA TTTCCCCAGA CGTGATACTC CACTACCCAG 3840 ATTTTAAAAA AGAATTTCAC CTAACAACGG ACGCTTCCAA TTTCGCAGTG GGTGCTGTTC 3900 TTTCACAAGA AAACAGACCC ATTTCATTCT TATCGAGAAC ACTCTCGAAG GCAGAGGAAA 3960 ACTACGCCAC AAATGAGAAA GAAATGCTAG CCATTATCTG GGCTCTAAAA AAGCTCAAAA 4020 CTTACCTTTA CGGTAAAGCA AAGGTGAAAA TCTTTACCGA CCATCAGCCT CTGACCCATT 4080 CCCTCAGCAG CTGGAATGGA AATGCGAGAA TCAAAAGATG GAAGTCATAC CTCGAGGAAT 4140 ACGACTATGA AATTCTCTAT AAACCAGGCA AAGAAAACGC TGTGGCCGAC GCTCTGTCCA 4200 GAGGACCAAC AGCCGCGCAA ATAAACTCGG TAACCTCAAC AATGCACAGC TCTGACAGTT 4260 CGAGCCATGG GTTGATACCT AGCGTTGAAG CTCCAATAAA TGCATTTAAA AATCAAATTT 4320 TCTTCCGGAA AGCAGAGTCG GAGAATTACT CAGTTAGCAT CCCATTCCCG ACATTCCATA 4380 GGCATTTAGT GGACCGTAAA TTGTTCACAC CTGATAGCCT CTTGTCAGAT TTGAAGAAAT 4440 ATCTTAACCC ATCCGTGATA AACGGAATTT TCACATCCCA GGATGTAATG GGGAAAATAC 4500 AAATCCTCTA CCCCATCCAC TTTAAGGGTT TCAAGATTAG ATTCACTCAG ACTGAGGTCA 4560 AAGACCTTGT TACCGAAGCC GAACAGGAGG AAGAAATACT TAGGACACAC AACAGAGCAC 4620 ACAGAAACGC TTTGGAAAAT AAAGCTCAGC TGTCCGAAAG AGTGTACTTT CCTAAAATGA 4680 GGAAAAAAGT TTCAGCTATC GTGAGTCAGT GTTTGGTGTG TAAGACCTCT AAATATGACA 4740 GACACCCCAC ACATCCGGAA ATCAGAGAAA CTCCCTTGCC AGAATACCCC GGACAAATTA 4800 TTCATGTCGA CATCTATTCG ACAGAACGAT ATCTGGTGCT CACAGCAATC GACAAATTCT 4860 CCAAGCTGGC TCTGGGAAGA GTCATTAAGT CGAAAGCTAT AGAGGACATT AGAAAACCCT 4920 TAAGAGATAT CGTATTCTAT TATGGAGTGC CCAAACTAAT AGTAATGGAC AACGAAAAGT 4980 CCCTCAACTC AGCTTCTATC AAATTCATGT TGGCAGACCA GCTGGGCATT GAGCTCTATA 5040 AAGCACCTCC GTACAAGAGT ACGGTAAACG GACAGATAGA AAGATTTCAC TCCACACTCT 5100 CTGAAATAAT GAGATGCTTG AAAGGAGATG GGACACATAG AAGCTTCGAA GAACTTCTTG 5160 ATAGAGCTAT CTATGAATAC AACTTCACTG TCCATTCGGT CACAAAAAAA CGACCCCTAG 5220 AGGTGTTCTT CGGTAGGGGA ACTACCGCGT CACCAGAACA GTATGAACAA GCTAGACTGG 5280 ACAATGTAGA CCGTCTTAGA AAGAAACAGG AAACTGACAT TAAAAATTAC AACCGGACAA 5340 GAAAGCCCAT AAAGACCTAC ATCAAGGGCC AAGAAATTTT CGTTAGGGTT AATACAAGAT 5400 TAGGATCTAA GCTATCAAAC AGATTTAGAA AAGAAATAGT TAAGCAAGAT AGGAATACTA 5460 CAATACTAAC AGAGTCAGGA AAAATAGTGC ACAAAAGTAA CATCAGATCA TGATTACTTT 5520 CAGGATACTT ACTGTGGTCG CATTGACAAA TGGACTCATT GAAATAACTA ACTACACCGA 5580 CGCACAGACT GTGACGATTT ACAACGGGAT AGGACAGATA CAGATAGGAA CAACGAGAAT 5640 TGTCCATATC ATCGACTTGG ACCACGTACA ACTGACCATA GGAAAACTAA CAGACTACAT 5700 CGATCAGGAC TTCAACGACG ACAAGTCATA TCACTTACTG AACTACGAAC TGACACAGAC 5760 GAAGAACTTG CTCGACACTG TGATCCTGGC AAAGACAAGG AAAACTAGAT CGATAAATCT 5820 TATAGGGACG GCCTGGAAGT ATGTTGCCGG TAGCCCGGAT CACGATGACC TTGTAGCTCT 5880 GACCGACGGG ATAAACGACT TGACGGATAA CAACAATAGA CAGGTAATAA TCAATAGACA 5940 GCTGGAAAAT AGAATGAACC TACTAACCGA CGTGACCACG AAAATACAAA ACTCAATCAG 6000 AAAAGATTCT ACCTTAAAGG ATGAGCTGGC CATAAGACTA CAGAACCAGA TTAGGCTCGT 6060 AAAAGAAGAG ATAGTCAACA TTAAATTTGC CATTCAATGG GCAAGGCTGG GTGTTATGAA 6120 CACATTCCTG CTAAACGAGT TCGAGTTGAA AGAGATCGAC AGCCTACTCA AAATCAATAA 6180 TATGCCGACT TCTGCCTTAA CGGTCGAAGA AATGTTAAAA CTCTCAGACG TTTCCGTTTT 6240 ACATAACGGA ACTACAATTT TGTATGTTAT CAAAATCCCA ATTTTAAAAC CAACTAATTT 6300 CCTAAATTAT CAAATAAGGC CAGTAAAGAA AAACAATAAT ATCATTAATT TACCAGCTAA 6360 AGAAATTTTT AAATTTAAAG ACGAAATGTA CGGAATTAAT GCACAGTGAA CTGGAGGGAC 6420 ATCGAGAGAC GATTGAGCGG CACTTACTTG ATAACGTTCC TGAACGAGAC CCTGAAAATC 6480 AACGGCCGAG AGTTCAGTAA CAATGAAGTG GCTTCAACAA AACCTGCGCC ACCACTGATA 6540 CAGCTGACTC CATACGAGGT GGGACGCCTT CGAATTCTCT CGCTAGAAGT ACTCGAGCAA 6600 CTTCATCTGA ATAACACATC TGAGTTACGC AAACTTCGAA CGCACTCAAC AATAAACAGA 6660 GTGTTCGGCG CAACAATACT AATGATGATT GCTGTCCTGA TCTTCGTGAC AAATCGGTGC 6720 TGCCGGAGAG GAAAAAAGGT GGTCCTTCAA ATCGACGCAC CTGGCATTCA AAACATTGAC 6780 GCGGTGCCCA AACCACCACA GATCACCCCA CCAACAATGC GTTTCAACAA CATCCCGTTT 6840 TTTTGATGAC CGCTTGAGGA CAAGCGTCGA TTAAGAGGGG AGGAGTTATT GCCCTATAAG 6900 TTATCGTCCC ACATCTTATA TTTCAACATC ATTTCTGGGT AAACACAAGT GCCCAAAATG 6960 CTGACCAAAG GTCTGCATTC TTTCCGCTGT CAACGACCAG CTAAAAGTGC GATCATCTGC 7020 ACTTCTCCGC CGCGTAGCCG CTGACGACCA CTGATTCGCT GCCGACGCCT GCTGCGACGC 7080 TGCCGACGCC CACACTTGAT TGCTAGGGAC TTAGGGAAAT ATTTTGTATC TTAGCTTTAG 7140 TTTCAAATGA ACAATTACAA TAAACGGTCG CTTGCGATCT ACAAAATAAA AATCAATAAA 7200 CTGTAATTAT TTACTTATAC 7220 // ID DPSEMINIME standard; DNA; INV; 4622 BP. XX AC AC131959; XX DR FLYBASE; FBgnnnnnnnn; Dpse\mini-me. XX FT source AC131959:30620..35241 FT SO_feature CDS ; SO:0000316:2480..3364 FT SO_feature inverted_repeat ; SO:0000294:252.. 693 FT SO_feature inverted_repeat ; SO:0000294:4189..4622 FT SO_feature dinucleotide_repeat_microsatellite_feature ; SO:0000290: FT 100..111 FT /remark="repeat=TA" FT SO_feature tetranucleotide_repeat_microsatellite_feature ; SO:0000641: FT 150..197 FT /remark="repeat=GTCY" FT SO_feature region ; SO:0000001:117..149 FT /remark="Wilder and Hollocher core sequence" XX CC Sequence & annotation from Steve Schaeffer, 2 April 2004. XX SQ Sequence 4622 BP; 1198 A; 1081 C; 895 G; 1448 T; 0 other; TTATACCCGA TACTCAAAAT GAGTATTGGG GTATATTAGA TTTGTGGTAA AAGTGGATGT 60 GTGTAACGTC CAGAAGGAAT CGTTTCCGAC CCCATAAAGT ATATATATAT ATTCTTGATC 120 AGCATCAATA GCCGAGTCGA TTGAGCCATG TCTGTCTGTC CGTCTGTCCG TCCGTCCGTC 180 TGTCCGTCTG TCCGTCCCCT TCAGCGCCTA ATGCTCAAAG ACTATAAGAG CTAGAGCAAC 240 GATGTTTTGG ATCCAGACTT ATGTGATATG TCACTGCTAC AAAAATATTT CAAAACTTTG 300 CCCCGCCCAC TTCCGCCCCC ACAAAGTGCG AAAATCTGTG GCATCCACAA TTTCGACGAT 360 ACGAGAAAAC TAAAAACGCA GAATCGTAGA GAATGACCAT ATCTTTTAGA CTGCAGAATC 420 CGAATTGGAT CGTATTATTA TTATAGCCAG CATCACGAAA ACAATTTAAT TTTTTCTCGC 480 CCTGTCTCTC TCTAACACAC ACGTAGCATA GGCGGCTTTG CTTAGAGTAA AACATTAGCG 540 CCTAGATCTC AGAGACTATA AAAGCCAGAG CAACCAAATT TGGTATCCAC ACTCCTAATA 600 TATCGGACCG AGACGAGTTT GTTTCAAAAT TTCGCCACAC CCCCTTCCGC CCCCGCAAAG 660 GATGCAAATC TGGGGATATT CACAAATCTC AGAGACTATT AAAGCTCGAG TAACCAAATT 720 TAGTATCCGC ACTCCTGTTA GATCTCACTA TTCATTCAAA GCAAAATAAA AGCCGTGGGT 780 TTAAAGGTGG AGAAATTTAA CTTCTCTTAT GCCAGGGAGA TAGCGTCGTT TAAGATAAGC 840 ATCTCCCCAA CTCAATTTGA CACCATTTGC TCCACCAAAT TTTGGCAGGA GCATTTGGTG 900 GTGAAGGAGT TTAAGGCTAA GAAGAAGAAT AGGCCCCCCA TATCGCTTCC AAATCTTTCC 960 AGCGTGCCAC CCTCAACCTC AACTTCCTCT TCCTCAACTT CCCGTCTTGC TCCAACCTTT 1020 CCAAAAAACT AACTTCTCTT TTAGTAACCT ATCAGAATGT AAGAGGCTTG CGTAGTAAGC 1080 TCAGCATTCT TTTCCGGGAT AGTGTTGCAT TTGCTTCCCA CGTTATTGTG TTTACTGAAA 1140 CCTGGTTAAA GCCGGACATT CTTAGTTCCG AGGTTTTGGC AGGTCGGTAC ACAACTTTTA 1200 GAAAGGACCG TTCGTCTCGA CGTGCAGGAG GGGTTCTAAT TGCAGTGGAC TCTTACTTCA 1260 CGTCAGAACA CTTCACAGTC CAAGTTCAAC AGGAACTGGA ATTCCTTTGT GTAAAACTGA 1320 TTCTTCCCGC TTTCGCTATA TTCATTACTT GCTCGTATAT CCCACCTTCT TCGGATATTT 1380 CAATTTATGA GCAGCACTTG TCCGCTTTAA CCGCTGTTGC TTCCTCGCTA TCTGATAAAG 1440 ATCGTATGAT AGTTCTTGGT GACTTCAACT TGCCAGGAAC TGTTTGGTCT TCGGTAAACG 1500 AGTCTAGTAT CCTAGTGCCC ATGTCACGAC ATGACTTTGT TGACGGCTTG CTTGACCTGT 1560 CCCTGTCTCA AGTCAATCAT GTGAAAAATT CCTTGGGTCG ATTGCTTGAT CTATGCTTTG 1620 TATCGGATCC GACCATAGTG TTGTTAACCC GAGCCCTTCC GCTCACTATA CCTGAAGACG 1680 CCTACCACCC TACTTTCGAA GTGTCGCTAG ACATATGACC AACTGTATTG GATCGGTCGA 1740 GTAGACTACC TGAACGTGTC CGCTGCTTTC GTAAAGCCGA GTTTGCGAAG CTTAATACCC 1800 TAATTAGGGA TTTTGATTGG TCCGCTTTGT ACTTGTGCAC TGATGTCACA AAAGGCACAA 1860 ATATTTTTTA CAATGCTCTT GGCACATTTT TTGATTCTTG TGTCCCGCTT TCTTGTCCGA 1920 TTAGATCTGG AAAACCCCCT TGGTTTACCA AAGAGTTATC CAGTCTAAAA AACTTAAAAT 1980 CAAGACTTTA TAACAAATTT CAAGAAGTGG GTTCTCCTAC TTCTCATTCT CGTTATGTAA 2040 TAGCTCGGTC AAACTTTTCA GTTCTTAATG CTCAATGCTA TAAGAACTAC CTATCTCGTT 2100 GCAGGATACG TTTTTCTCAG GACCCTAAAC AGTTTTACTG CTTCGTAAAC AGTAAGCGTA 2160 GAACGTCCGC ACACCCATCC TCGCTATCAT TTTGTAATAC GTCGGCAAAT AATGATCAGG 2220 TAATTGCCGA TCTTTTTGCC CAGTTTTTCC AAACCACCTA TTCTGAGGAA CGCTACTCTG 2280 GTCAACCGTA CCCATACGGA TTACCGAGGT CGAACGGCAT TTTCAGTCCC TTGTTAAATG 2340 AGTATTCCCT ACTTCAAGAT CTTCGACTAG TTAAGCCGGT GTTTTCACCG GGTCCAGACG 2400 GGGTTCCAGG TTGTGTACTC AGGTACTGCG CCGAGGCTCT GTGTGGACCA TTGCTTAAAC 2460 TATTCACCAT GTCCATTGAT TCCTCTTGCT TCCCCCCAAT CTGGAGGAAT CGTTTATAAT 2520 TCCTCTCCAT AAAAAAGGTA GTAAGTCTGA TGCAAAAAAT TATAGAGGTA TAGCAAAGTT 2580 ATCCGCTATT CCCAAATTGT TTGAGAAGGT TTTAACTCCG CACTTGCAAC ATCTCTGCAA 2640 GTCACTTATA TCTCCAACTC AGCATGGATT TATAAGGCGG CGATCAACCA CCACGAACTT 2700 GTTAGAGTTT ACCTCTTTCA TTATTAAAGG CTTTCAAGGT AACTTACAGA CGGATGTTAT 2760 TTACACCGAC TTTAGTAAAG CATTCGACTC TGTAAACCAT TCCCTTTTAG CACATAAACT 2820 TGACCTTTTA GGGTTTCCGC CCAACCTCCT GAGATGGATT TCTAGCTATC TTTGTTCTAG 2880 GTCTCAAAGA GTCCTCTTCA AAAACTCCCT CTCTTTACCA GTAAAGGTTT CTTCGGGAGT 2940 ACCACAGGGC ATCCATCTAG GCCCCTTACT CTTCACACTC TTTATTAATG ACTTGCCTTC 3000 AGTAATAACA TACTCTCGAG TACTTATGTA TGCGGATGAT GTTAAACTCT GTGTCCAGTA 3060 CAAGGACATT TCATTTCATT CTCGCTTGCA ATCCGATCTC AATAGCTTTC AGTCATGGTG 3120 TTGTGCAAAC TTGTTACACC TTAATGCCTC GAAATGCAAA GTTATGACAT TTCATCGTTC 3180 TAGCCCTTTG TTGGCTCCCT ACACCCTATT TGGTGGTTCT CTTGAGAGAA TTACCCTGGT 3240 GGATGATCTG GGTGTTATGT TAGATCCCAA GTTAAAGTTT TCCGAACACA TTTCTACCAT 3300 GGTAAATAAG GCCATGGGCG TGCTTGGGTT TATAAAGAGG TGGTCAAAGC CTCCCATCCT 3360 TAGTTAACCG TACAAAAATC CGACGTGTTG AGTTGCATAA ACTTCACGAT TCCTATTAGA 3420 CTGACTAGAA ATTTTATACC GTTGTTTCTT CCACTTTGTA CATCGAATTA TTGCTTGCAA 3480 GCACCGTTTA GGGTCTTATG CTCGGATTAT AATTCCCTCT ACCATATTAT ATCCTCCACT 3540 AATTCTCTTC CTCTTATTAA ATTACTAATC CTTACACACC TTTCTAGTAA TTAGTAATTG 3600 TAGTGGTACT TGTATTTTGT ATTTTATTGC ATGCTTAATT TCTTAATAAG TTTAGTGCTA 3660 ATTTTCCTCG AATGTTAGTC TAATATCTAT CTTTCTTGCA TGCTCGCGTT TGGTTCGGCT 3720 ACGCACCGCG CGTCATGCGG CAGCGCCCCT CGGTCGGTTG GGCGGGAGGA GGGCTGCGTT 3780 TTGCCTGGGA TCCGCGCGTA ACAGCCTTCT GCTGGTGTCA CACGGGCCAC TTGACGGTGC 3840 AGTAATTGCA TCGCCTCTTG ATAGATGCAG TCATTGCATG TCAACGTCCA AAAAACGTAT 3900 ATCTCAAAAT TTCGCCCCAC CCCCTTCCGC CCACACAAAG GACGAAAATC TGTTGCATCC 3960 ACAGTATTGC AGATTCGAGA AAACTAAAAA CGCAGAATCA TAGATAACGA CCATATCTAT 4020 CAGATTGCTG AATCTGGATC AGATCAGATC ACTTTTATAG CCAGAAGGAA CAAATCAATT 4080 TGCAGTGGCT ACGCTGCGCC CGACGTCACG CTCAGACTGA TTTTCTGTCT CTCTCGCACG 4140 CACTCTTTGT CGTGTCGTTT AATATTAGCG GCGTCTGCCT CTGAGATTTC TGAGATTTTT 4200 GAATATACCC AGATTTTCGT CCTTTGCGGG GGCGGAAGGG GGTGTGGCGA AATTTTAAAA 4260 CAAACTCGTC TCGGTCCGAT ATATTAGGAG TGTGGATACC AAATTTGGTT GCTTTTATAG 4320 TCTCTGAGAT ATAGGCGCTA GTGCTTTACT CTAAGCAAAG CCGCCTATGC TACGTGTGTG 4380 TTAGAGAGAG ACAGGGCGAA AAAAAATAAA ATTGTTTTCT TGATGCTGGC TAATATAATA 4440 ATACGATCCA ATTCAGATTC TGCAGTCTAA AAGATATAGT CATTCTCTAC GATTCTGCGT 4500 TTTTGGTTTT CTCGTATCTT TAAAATTGTG GATGCCACAG ATTTTCGCCC TTTTTGGGGG 4560 CGGAAGTGGG CGGGGCAATG TTTTGAAATA TTTTTGTAGC AGTGACATAA CACAGTAGTC 4620 TG 4622 // ID Q standard; DNA; INV; 759 BP. XX AC AE002612; XX DR FLYBASE; FBgn0063900; Q-element. XX FT source complement(AE002612:6924-6166) XX CC Coordinates from Berezikov et al. 2000. XX SQ Sequence 759 BP; 231 A; 168 C; 116 G; 244 T; 0 other; CTGAAACTGC TCACCTTATC TCTGGAATCT TCACAATTCC CTCATATATG GAATGAGTCC 60 TTTGTGATTC CTCTTCATAA AAAAGGTAGC AAACTGGATG CAAGCAATTA CAGAGGAATC 120 TCTAAATTGT CGGCTATCCC AAAACTTTTT GAAAATGTTA TCACTCTTCA TTTGCAGCAC 180 CTTTGTAGAT CAATCATATC ACCGTGTCAA CACGGTTTTA TGAAACGCAG ATCAACAACC 240 ACTAACCTCT TGGAGCTAAC TTCTTTCGTA ATACAGGGAT TCAAAAATAA TCTTCAAACA 300 GATGTCATCT ATACTGATTT TAGTAAAGCA TTTGACTCTG TTAATCATTA CCTTCTAATA 360 AGAAAACTTG ATCTTTTAGG TTTCCCTATT GATCTTCTAA ATGGGATTTC AAGCTGTCTG 420 AATGGCAGGA CACAACAAGT CCTCTTTAAA AATTCTTTAT CTCGTATTTT ACGAGTCACA 480 TCCGGTGTCC CACAAGGGAG CCATCTTGGT CCGCTTCTTT TTACCTTATT TATTAAAGAC 540 CTCCCATTAA TAATAAAACA TTCGCGTGTA CTTATGTACG CAGACGATGT TAAATTATAT 600 CTCCAGTACA AGGACACTTC GTGCCATTTA GACTTACAAT CCGATCTAGA TCGATTTCAA 660 ATATGGCTCC AAGTGTAAAG TTATGACCTT TTGTCGTGCC AACCCAATAC GCATGACTTA 720 CACTCTAAGT GAGTGCTCCT TGAACAGAAT AACACGAGT 759 // ID BUT1 standard; DNA; INV; 769 BP. XX AC AF162798; XX DR FLYBASE; FBgn0063576; Dbuz\BuT1. XX FT source AF162798:600..1368 XX SQ Sequence 769 BP; 266 A; 117 C; 148 G; 238 T; 0 other; CAGTGATGCC AACTTGTTCT TTTTCCGGAC AGATCTGTCC TGTTTTTAAG CGCTTTGTCC 60 GGAAAATATA TCAAACATCC CCTATCCGGA AATCTGTCCT TTTTTGTGAA TGCGAATCGT 120 GTAAATATGA ACAGTTGGTT GGTTGTTTCC CCAGTTGGTT GGTTGTTTTC CCAGTTGATC 180 GGTTGTTGAT GCCACAGAAT TTTATTAAAT AATATTGCAG AGCAGTTTAA CAGAACTTTG 240 GCATAAAATA TACTAAATTG TGTACCGATA TTTTGTTAAG AACAGCGCCA AAATGAGTAT 300 AACAGTGATA AATTGATAGT GTAGGAGGTT TATATGTATT ATCGATATCG TTTCTTATCT 360 AAAGTTGAAT TTGAAAAGAG TGATAAAATG CCCAAGGAGA TATATAAACA AATATTTTGG 420 GATTGCTGGC TACAGCAGCC AGATTTCAAG CCGTGGATAG CCCGGGATGA GAACGACAAT 480 AATAAGGGGT ACTGTAAATG CAGTATAAAT TGCAAAATTG ACCAAATTCG CGCACATATG 540 GCTACAAAAA AACATATGAA AAACGCATCG TGTTTTCCCA AAAAAATAAA ATATGTTGGA 600 TCCATGGCGA CGTACGTGGC CAGGGAAGAG GAAAAATATA ATAAAAAAGA AAAAAAATAC 660 AAATTTAGAA AAAAAAAAAA TGTTTTCTGT CCTTTTTTCT TAGATTTTCT GCCTTTTTTC 720 CGTATTTTAC GGACAGTTTT GCCTAAAAAA AAAAAAAAAT GGCATCACT 769 // ID BUT2 standard; DNA; INV; 2775 BP. XX AC AF368884; XX DR FLYBASE; FBgn0063575; Dbuz\BuT2. XX FT source AF368884:458..3232 XX SQ Sequence 2775 BP; 866 A; 514 C; 469 G; 926 T; 0 other; CAGTGCTGCC AACAATTTGT TGCATGGACA CGCTAGATCA GCCTTTGAAA AAAGCTAGAA 60 ATAGCTAGAA ATATTTCTTC ATAGCTAAAA AAAAACGTAA AAAAAAACGT TAGATTAAAA 120 TTTTTTTTTT CCGGTTTATT TCAATATTTT ATAAGAATTC GTCAATCACT TCATCTTCAT 180 CGTTTTGATT TGTGTAGGTT TCATTGGTAC CTATCTTTCT TAAACATGCT GCTGGTAATT 240 TGTAATCGTG GCAACATTTT TTTTGGAGGC GTAGTCCGTA TCTACAAATA TAAAATTAAA 300 TGTAAATAAT AATTTTATAA AATATTAACA GAAAAATACC TTATATGTAA AATGGCGTTC 360 AAAGTTCTAT TAGCCATACT ATTTCTTAGC TTTGTTTTTA TTATGTTCAT TTGACTAAAA 420 ACCCTTTCTA CCTCTGCGTT CGACCAAGGC AACGATAAAA GGTTTAGTGC AAATGATGCC 480 AGTGCCGAAA ACCGTTTCGT CCCTGCCCCA TCTCGATATC TTAACACTTC GGCCCAAAAG 540 GGTATTGTCT CGTCTGTATT GCACCACTTA ATTAGGTGGA TGCTCGCGTA CTGCGTCAAA 600 ATGTCCTCAA TATCGGTGGA TTCCACCTTG AAATGCACGA GAATATTCAC AATGTCAGCT 660 TTTTGTGCAC ATAGTATTTT TTCTACAGAA AACAAGTTAA CTTGCTGAAG GATTTCAAAA 720 TTATCAGGCA ACCTGAAACA ATTTATAATT TAATAAAACA AAATATCACA ACTTATTCAT 780 AATTACATAC CTCTGTTTTA ACTCGTTGAT TAATGCTACT ATAAAATTGG AACATTTCAA 840 ACGAATTTCT GTCTCTATTT CATTCGAGAT TTTTTTTTCT TTAAGCTTTG TTTCAAATGC 900 ATATCCCAAA TAGCAGTCAT ACTTAACAAA TTGTTCGAAG TCTTCTTTTA AAATATCAAT 960 ATCGACATCA GGGGGAATAA TTTTACTTTT AAATGAATTT ATCGTATATA CGAGATCACT 1020 TAATAGTTTG CTTGAGTCTC CAGTGCTTGT TTGAAAAATT TTATTGATGT GCTGCATTTG 1080 GGCTAGCACT GGTTGCAAGA ACAGAAGATA CAAATGATTA ACGTCATCAC TATAAAGTTT 1140 TTGCAACGTC TGCGCCATAA GACATTTTTC ATTTGTTGCA GCAATGGTGA AGTGAAGCTT 1200 CAGTTCAATC CACTGCTGTA GAATTCGGCT AACTGCTTTT TCAATGGACA GCCACCTTGT 1260 CTCGCATACT CGAGGTATTT TAAGAGGATC CTAAAGAAAT ACATGTTATT CATATACATT 1320 ATATATATAC ATATCTATAT AAACAAACAT GTACAAAACA CAAACTACTT ACACACGCAT 1380 GTTAAATAGA ACGAAACGCA CACAATTGTT AATGTTGAAA AATGCAAAAA GATGTCTCTT 1440 TCACCTGGTC GGAATTTATA ATTTGATATA TTTTTTTGTA CTCAGTTTGT CTTTTCGTAC 1500 TGCAGGAAAA CCAATAATAT GTTTGAGCAA CAATGTACTC TAAGACCTCG GGAACAGTTT 1560 TGCAGGCATT ACTCACAGCC AATTGCAGTG AATGGCATAC ACATTTGATT AAAACTAAAG 1620 CCGGAACCAT TTGTTTTAAC TCCGTGTAAA CACTTTGTTT GGAACCAACC ATAACACTTG 1680 CGTTGTCCGT GCCGATTCCA ATCAAATTCG CAAGCTTTAA GTTAAAGTTA TTTTGTTCCT 1740 CGCAGAGGCT TTTCACTATC GTTTTTGAGT CGCCTGCCTC GAGCGGGACT ATGGATAAAA 1800 ATGTGGAGAC CATAACGCTC CGTTTTGAGC TAAAATAGTT TATGGCAACA CCTATACAGA 1860 AAGTAATTAT GATTAAGATG GTGTTTCAGA ATGATTAGAA TTATTACTTA CCAAGAGTCT 1920 TGTACACACT TATATCGGTA CTCTCATCAA TCAATAAGCT ATACTTCGCA TCACCGGTGT 1980 CCTCCTTCAG TAGCTCTATG AAGTGAGGAG CTAGAACGTT TTTTATTATG GTTGTGCACT 2040 TTGTCCTATG GAGCTTTAAT TGAGTGGCAT CCTGAAACTT TTCTTTGCAC AGCAATCCTA 2100 AATGGCCCAC AGGTTTGATG GAAGAATGTG TTGCAATATA TAAACACAAT CCCGCTTCTT 2160 GTTCGCAAGT TTTCAATGCG ATTGAAGAAA CAAATTTAAT TTTATTAGGC TGTGCAAAAG 2220 CCGCTGAAGT TGCCTTGTGC TTAGCAGTCT TGGCATGTGC GTGCAAATCT GAAAGTTTAG 2280 CATTAATGCT AGATTTGCAG TATTTGCAAT AGCATTTGCC CTCATCAGCT GCATCTACAC 2340 TTAACCAATC TTTAAATGCA GGATCTTGCA TCCATACTGT GCGCGGTTTT TGTTTATAAA 2400 TTTTCGGCAT TGTACTTGTA AACTTGTAAG CTACGCTCTA CGCTTTAAAA CTTGTGAAAT 2460 ACCCTCTTGG TGATGGCATA AAACACGATT CGGTAGACTT CACAGACATC GATAAGATTG 2520 AATGCTCGTC TGGCTTATCG ATGTTTGTTG TCATTCATAT GAAAAAATAC TATACGTATG 2580 AAAATGTGGC ATCATAGAGC AGAGAGCACA GATAAAATAC ACCAAGTTAC CAGATGTTGC 2640 AAAAATATGA AGTAAAGCTA ATTTTTCATG AAAAATAGCT ATAATTCCCG CTAGCCGCTA 2700 TTTCACAATT TATCCCGCAT TTCCTTGTTG CAACCCACTA GATCTAGTGG GAAAATAGCT 2760 AAATTGGCAG CAGTG 2775 // ID BUT3 standard; DNA; INV; 795 BP. XX AC AF368870; XX DR FLYBASE; FBgn0063575; Dbuz\BuT3. XX FT source AF368870:568..1362 XX SQ Sequence 795 BP; 263 A; 130 C; 145 G; 257 T; 0 other; TAGTGCTGGC AAACTATCGA TAGCACTATC GATATATCGA ATTTTCACAT TTTTGCGCCG 60 CCATCGATAT TTTTTATCCA CTATCGATAG GGATCGAAAA TAACGCTCGC AGTTTGACAC 120 ATCACTCGTT TTGGCGATTT GGTTATACGA AATTTATTAA AATATAATCC GGTATAAGCT 180 TTTTGGTTAG TGGTATAGAC GGTGGTATTG GCATGGACAA ATTTTTGATC GATAATAACA 240 AGGGGAAGTA TTATTTTCCA TTTCATTGTT AGAACCGTAA AACTAATTTT ATAAGTATGT 300 ATTAACATAG ATGGCGACAT TTGCAATGTG GATGTTGAAA ATACGGTGGC CAAAAAACCA 360 CGCATTGGCA AATCTAAAGT TTGCCAATTC TTTACAAGAA TTAATGATGG TAAATTGGGA 420 AAGTGTCGTT CATGTGATAA AACATACAAG ACAAGTGGAA ATACATCTAA TTTATTCGAC 480 CACCTCAAAC GTGTACATCC AACACTGCAA ATAAATGAAT TGCCAATTGT TAACAAAATC 540 GAGAGCTACG TAAGCGCAAC ATGTGATTCT TCAAACAAGC GTAAAAAGGA GCTTGATATA 600 GCGCTTATGG AGTACATTTC ATCAGACATG CAACCATTCT CCGTTGTTAT AATTTGCCAT 660 TTTGTTATTT TTCGTTATTT TCAAATTTAA ATGTAAATAA AGACGGTTTT AGCGATAGTA 720 TCGATAGTAT CGAATTTTTT GCTTAAAACA TCGAAAATAT CGAATGGCGC CACTATCGAT 780 AGTTTGCCAG CTCTA 795 // ID BUT4 standard; DNA; INV; 1447 BP. XX AC AF368868; XX DR FLYBASE; FBgn0063573; Dbuz\BuT4. XX FT source AF368868:207..1653 XX SQ Sequence 1447 BP; 396 A; 312 C; 269 G; 470 T; 0 other; CTATCGATAG CACTATCGAT AAATCGAATT TTCACATTTT TGCGCCGCCA TCGATATTTT 60 TATCCACTAT CGATAGGGAT CGAAAATAAC GCTCGCAGTT TGACACATCA CTCGTTTTGG 120 CGATTTGGTT ATACCACTAA CCATACAACA CATAGACTGG ACAACTCGAA CAAACTTTTT 180 TCACGAACCT GATTTTGATA TCCCGGCCTA GTCATCTTCG TATTTGCTCG GCTCTTTAAT 240 TTACTGTTCG GGCAGCGAAA ATATAGCAGT TGATATTCAG TTCGCTGCCA GCGCTCGAAC 300 GAAATGATTC TATTCTCATA TGAAGCACCG CTGCGTGCTA TTCTTTCTTT ACTCACTCTC 360 TCACTCACTC TCTCTATGCC TCTCATATAT ACATACATAT ATGTATGTAA GTCTGTGTGT 420 GTATGCAGGC ACAAGCCAGC AGCTCACAAA AATTGAAATT TGTCCGCGAG TCGCGATTCG 480 CGACGATCAC ATCTTCGATT CTGAATAAGA GTATGCCGAA CTCTTAAAAA GGTTTGCCAT 540 GCTCATGCAA AATTTTAGTG TTTTCGTGAT TATTTTTTTA AGGGTATGAA AGCGTGATGT 600 ACATAAGCAT TGCTTTTTCG CGATGAGCAT GCACACACAC ACATTTACCA TATCTTGCTA 660 CATGTATGAG CATTTATGTG ACTGTGAGCG AAACTATTGC TGCCTGTGGC AATTGTAGGG 720 CCCAACATTT GCAGTATGTT TTTCTTGTGG ACCATGGCAT TAGAATTGTG AGAACGGTCT 780 TAGATTTGTT CGACCTTCTA TGGCGATTTG TGTTGTATGG TTAGTGGTAG TAGGCTATAG 840 TGCTGGGAAA AAGATCGATC TAAAAACGAT ACATCTATGT TTTGTTAGAT ATCGAAAATA 900 TCGATACTGC TAACTTCGAT ACTCGATATA TCGCCATCTG TAGACTAATA TCACTAACCA 960 TACAACACAT AGATTGGACA ACTGGAACAA ATTAATGTGC ACACTTATTT TTTGACATTA 1020 GAACCCCATC AACTTCGTAT TTGCTCGGGT TTTTACTTTT CGGGTCGGGA GAGAAAAATT 1080 GCTGATGACA AAATCATTCC GCTTCGAGCG CTCGACGGTG ACGGTCGATA TTCCAAAAAG 1140 GCCTGACATG TACCGCTCTT TCCCTTGCTT TCTTTCTCTT TCTGCCCTAT GCTTTTTACT 1200 CTCTGCTCTC TTCTTACAAA AATATATGTA TGCTAGCACA AACAGCAAGC AGCTCACAAA 1260 AATTGAAATT TCTTCTCGAC GCGATCCCAT TGGTTCCCTT CTGAGCGAAG GGTATACCGA 1320 ATTCTTAAAC GGCTTTGCCA TGCTCATGCA AAATGTTAGC GTTTTCATGA TTATTTGTGG 1380 AAAGGGTATG AAAGTGTTGA TGTACATAAA CAATACTATT TCCCGATGAC CATACACACA 1440 CACATTT 1447 // ID BUT5 standard; DNA; INV; 669 BP. XX AC AY187768; XX DR FLYBASE; FBgn0063572; Dbuz\BuT5. XX FT source AY187768:1462..2130 XX SQ Sequence 669 BP; 253 A; 110 C; 116 G; 190 T; 0 other; CACTGTTAAA AGACTCAGTA GGTTACGCAA AGATGTTAAC TAACAGACCC CCCTGTGACG 60 TCACATGCTC ATAAGCAGCC AAAATAAGTT GTTTTGCAAC GAATTCGAAT AGTAAGAACC 120 ATAAAAATGA TCAATCCATA CTTTTATTTA CTTAAAATAC AAAAACTTAT TAGATAATGG 180 AAGAAAGTGT GAGGCCTTCT GGCAGCAGCA ACAGCAGTTC CTAAATGGGT ACTACGTCCG 240 AAAAAATGGT CAGAAGGCAT GATTTTTTTA ATTTTATGCA TTAGTATTGT TTATAAATGC 300 CAATATATAT CAATATTCAA ATAGAGTACA TGTTGAAGTC ATATCAGGTA AGCTGCTGCT 360 CACAGCTGCT TCCATGTCGA AAAGAAAGCC CAAAGACCGA AATAAATTTT ATGTATTAAT 420 TAAAAACCCC AAAAAGGACT GAGATTATAA GTAGTCAAAC ATAAAGAAAC AGATAATCTG 480 GTTCAACCAG CCATCAGAAA TCTTGGTAGA AGCTAACAAT GCCGCACCCA AGCTTTGGCC 540 AACAGATTTC TAATATGATG AATGAATATG ATAGATGTAA TGATGTGGAT TGCTGGGAAA 600 CGATTAGTAT ATGTTATTAC CACTTACTTT TGTAAAATAA AGTCGTTTAA AATAAAGTTT 660 AAGTTATAC 669 // ID BUT6 standard; DNA; INV; 387 BP. XX AC AY187768; XX DR FLYBASE; ; Dbuz\BuT6. XX FT source AY187768:420..806 XX SQ Sequence 387 BP; 87 A; 89 C; 99 G; 112 T; 0 other; CAGTGTTTGA AAAGCAAAGC AAAAACCATG GAAGCTGTCT GCTGTGCTGT GCTGTGCTGT 60 GCTGTGCTGT GCTGTGCTGT GCTGTTCGGC AGTCGACAGT TTTTGCTCTA GCTGCGCTAC 120 GACTGTCGCC AGTGAACGCA GTTCGAAGCG CTGCGCTGCT GATCAAGAAT TGTTTTTTGC 180 TTTTGCTGCG ACTGCGACTG TTGAAAATGC TCGCAGCAAC AGTTCTAGCA TTCTCAAAAC 240 TGCCGAAGCT GTCGATTCCC GAAGCGCTGT GCTGCTGATC AAGAAATGTT TTTGCTTTTG 300 CTGCGACTGC GACTGTCGAA AATGCTCGCA GCAACAGTTC TAGCAATAAC AGAACTGCGA 360 AAACTGTTCG CAGTTTTTTA AACACTG 387 // ID ISBU1 standard; DNA; INV; 1467 BP. XX AC AF368900; XX DR FLYBASE; FBgn0045754; Dbuz\INE-1. XX SY synonym: ISBu1 XX FT source AF368900:125..1591 XX SQ Sequence 1467 BP; 509 A; 282 C; 255 G; 421 T; 0 other; CAAGTAAGAA GGCTCTAGTC GAGACGGCTC GACTACGAGA TACCCTGAAC CCTGCAAAAA 60 AGTGATACTC TTCTTCTTTA CGCACGTATA AGCACGCGCT CTGCTAACCG CGCAGTTGCT 120 CTTGTATCTT CGAAAATCAA ATAGAAAAGA ATAAATAATA ATAAAAAATA AAGAAATAAT 180 AAAAAAATAA AAATAAAAAT AATAAATTAT GAAGGGCTTA GGCGAGGCTA GAACCTCCGA 240 CACCCTAACA TTCGTTGCAT CTGTCTAGCC ATCTACACCA CGGCTCACTT AAACGGCAAA 300 GTAGCGAAAA CGAATGAAAT TTGCCATATA CGAAAGAAAC ACTAAAGCAG TGGCTGAAAA 360 TACAAATACA ATTCAAAAAC AATTCTAACG GAATTCAAAA ACAATTGTGA TTCTTAGAAT 420 ATACACGCAT AGCAACATAG AAAACAGCCC AAGACTTTTT GTGTGTATAC ATACATATGT 480 ACATATGTGG ACACGCATAC GAAAATTGAT TTGCATTACT CAACGTTGCT TGCATTTCCA 540 TTGTTATTTT AATTGTTTTA AAATTTTTAA ATATATAATG GAATGTAATG AGAGTTCAGC 600 ATATGATCTA CATATTCTAT TTAACATTTA GGTTTTTTTT TAGAAATCAT AACTCTTTTA 660 CAATTAGCTT TACATGCACA AATATGTACA TATGTATGTA TGTACGATTG CAGCCACACG 720 CTGATCTACG CTTGCACACA TATACATACA TACGTGTATG TATGTGCGCC CGCAATGCGA 780 CTTGCTGTAT CGCTGCCTTG CTTGCTGCGC CACATGGGTT TTGAATAGAA AACTTTATGG 840 ACTAATAATT CTGTTTCTAG ATGGACAATC ATATTGATAT GTTCAGATTA AGATAATGAT 900 ATTCATATCT ATATGCCAAA CCAAACTGTA TGTCTCTAGT ATTTGAAATG CATTTTTTAC 960 ACATTTGGGG GTGGTGGGGC TAATTGGGGA CCGGAATCCA AAATCTAACA AAAACCGAAA 1020 TTGTATTAAA TTAGTATCTA TATATATGCA GAAGACTTGG AGACTCTAAC TCTAAAGATG 1080 AAATAGCTAG AATTTTAAAA CTGGTTCCCG ATAAATTCGT TTATCGATAA CTTTGCAAAT 1140 ATACAAGTTA TCGGATTTCT GAAACAAACT CGTACTGCGC GTGCAATATA CGAACCTAAC 1200 ACAAAAGGAA TGAACACTCT AGCTTTTGTA TCTTCGGAAA TCGACGCGTT CAAACATACG 1260 GACAGACGGA CAGACGGACG GACGGACAGA CGGACAGACG GACAGACGGA CGGACAGACA 1320 GACGGACATG GCTATATCAA CTTTTCTCGT CACCCTGATC AAGAATATAT ATACTTTATG 1380 GGGTCTGCCA CGCCTCCTTC TGCCTGTTAC ATACTTTTAA CAAAACCATT ATACCCTGAT 1440 TTCCCATTTT CAATGGGCTC AGGGTAT 1467 // ID ISBU2 standard; DNA; INV; 726 BP. XX AC AF368867; XX DR FLYBASE; FBgn0063571; Dbuz\ISBu2 XX FT source AF368867:811..1536 XX SQ Sequence 726 BP; 219 A; 148 C; 140 G; 219 T; 0 other; ATACCCTGAA CCCATTGAAA ATGGGAATTC AGGGTATAAT GGTTTTGTGA AAATGTATGT 60 AACAGGCAGA AGGAGACGTC GCAGACCCCA TAAAGTATAT ATATTCTTGA TCAGCATAAA 120 AAAATATGTA AGTTGATTGA GCCCTGTCCG TCTGTCCGTC CGTCCGTCCG TCCGTATGAA 180 CGTCGAGATT TCGGAAACTA CAAGAGCTAG AGAGCTGAAA TTTCACATGA ACACTCATAT 240 GTTACTACGC AGTACGAGTT TATTTTGTTT TTTCGATAAC TCATCCCATT TCCATCGATA 300 AAAAATCGAA AATTCTTTTC AATTTTTTAT GTTTTTTGAG CATAGCTTCT ACACCGTTTT 360 AGCTAAAGAC TTCAAATTTG TATATTTAGA TAGATATTAT GAATACGCAG ATTATGTAAT 420 TTTTGGATTT TGGATGCCAC GGCCCATTCG GCCCCAATTT GGAAAAAACG CATTTCAAAT 480 ACTAGAGACT TGAAATTCGG CACATAGTTA CATTTAGACA CTCTTAATCT ACTCTGAAGA 540 TCATGCATCT ATAAATGGAG TTATAAGCGT TCAAAGTATA GCCCAACCCA CCCCCAAGCC 600 ACGGCAGCAA CTGCGCGGTT AGCAGAGCTC GCGCTCATGC GATCGTAAAG AAGAAGAATA 660 TCACTTTTTT TGCAGGGTTC AGGGGATCTC GTAGTCGAGC CGTCTCGACT AGAGCCTTCT 720 TACTTG 726 // ID ISBU3 standard; DNA; INV; 993 BP. XX AC AY313771; XX DR FLYBASE; ; Dbuz\ISBu3. XX FT source AY313771:3584..4576 XX SQ Sequence 993 BP; 340 A; 182 C; 176 G; 295 T; 0 other; CAAGTATAAA ATGTTATGAG TAATAAGACT ACGACTACGA GATACCCTGA ACTCTGCATA 60 AAACTAATAC TCTTCTTATT TACGCGCGCT TTGCAAACCG ATCAGATGCT GCCGTAGGTT 120 GGGGGTGGGT TGGGCTATAC TTTGGACGCT TATAACTCCA TTTCTAGCTG AACGACCTTC 180 ATGAAAATTT CGGAATATAT TAATAGCATC CAAATGTATC AATTTGCAAA TTGTCTCTAG 240 TATTTGAAAT TAATTTTTTT TTTTAGTTCC ATTTTATAAA TGTCTGAAAA TATCGCTTCC 300 CATGTCAAAA AGCAGCTGGC CAAAAGTAAC AAACATGTTG AACGCTTTAA AAATGAAAAT 360 GCAATCTGGC TGGCAACTGA AATATGATTT TTCAGAAAAA TTCAAATTTC AAAAACAAGA 420 AAATCCAAAA TTCTAAAAAA ATAATCTAAG AATATATGCA AAATTTCAAG TCTATATCTT 480 TAAATTTGGA GTTTATGCCG CTAAATCGGC ATTTGTGCCC ATAGTGCAAC GCAGCAGCAA 540 CGCATAGAGA GAAAAAAAGA AAAACTGCGT CGCTGACCAA CGCAGCAGCA GTACACACAG 600 AAAAACTGCC GAAGACGGAA AACCCCACGG TTTATGCATG GCTCACTGAG TAAGAGGGAT 660 ATTCCTAAGC ACATTTTGAA AGTGTTATTT TGGGAGTTTT ATTTATAAAC GTCCATGCGA 720 AACACTTATT TGTGGACGTT TTGAAATTCG AAACGGGTAT TCGGGGTCTC AAATTTGAAG 780 TTTCTAGCTC AAACCGTGTT GAAGCTATGT CCAAAAAACG TAAAAAAGTA AACGTATTTT 840 CGATTTCTTA ACGATAATAA GGAAATGGGA CTTATCGAAA CAATCAAGAA TCTATATACT 900 TTTTGGGGTC TGCGACGTCT CCTTCTGCCT GTTAAATACC TTTTCACAAA ACCATTATAC 960 CCTGAATCCC CATTTTCAAT GGCTTCAGGG TAT 993 // ID NEWTON standard; DNA; INV; 1510 BP. XX AC AF368890; XX DR FLYBASE; FBgn0063569; Dbuz\Newton. XX FT source AF368890:291..1800 XX SQ Sequence 1510 BP; 454 A; 313 C; 304 G; 439 T; 0 other; CACTAACCAT ACAACACATA GACTGGACAA CTAGAACAAA CTTTTTTCAC GCACAAAAAT 60 ATGAGCGAAG TGCCGCACAA CTTCGTATTT GTTCGGTTTC GGCACTTCGC GCTCGGGCAG 120 TCAAAATTTG AAACGACGAA TTCAGTGCGC TGTAAGCGCT CAACCGGCAT GGTCGCATTT 180 TCATATGAGC ACATTCTGTG TGCTATTATT CTTTACTCTC CCTCTCTCTT TTTTCTCTCT 240 CTATCTCACT CTTTGCGAGC ACAAGCCAGT TGCCCACAAA AATTGAAATT TGTTCGCGAG 300 TGGCAATTCG CGACGATCCC GTCTGCCATT CTTAGTAAGA GTATGCCGAA CTCTTAAACG 360 GCTTTGCCAT TCTCATGCAA AATTTTAGTG TTTTCGTGAT CATTTTTGGG AAGGGTATGA 420 AAGCGCTGAT GTGCGTCAGT ATAATTTTTT TCGATGAGCA TGCACACACA CATTTACAAT 480 ATCTTGCTAC ATGTATGAGC ATTTATGTGA CTGTGTGCGA AACTATTGCT GCCTGTGGCA 540 ATTGTAGGGC CCAACATTTG CAGTTTTTCT TGTGGACCAC GGCATTAGAA TTGTGAGAAC 600 GGTCTTAGAT TTGTTCGACC TAGTAGACTG TCAAATCTAC TTCCCCCATT CATATGGCAT 660 TAATGTTTAT ATGCATACAT ATGCATTTAT TGTTATACAT GTTTAATAAA TAAAAATGGC 720 TATAATTAAT TTAGTAACAT AGCCACCAAA ATATTTTTGA ACGGGCTATA AACTATTTAA 780 GTGGATCTGG AAATTCATAT GTATTGACAA ATTTCAGAAT ACACCATGTA CTCATGATTG 840 CCAAAATGAC TGCTGCAAGC CGTAAAGGCT CATTTGACAG TCACTCTACA ATTCAAACCA 900 AACAAAATAG ACAAACTACT GGATCGCGCA GGTAGCAAGA AAACAAACTG CAAATGTTGG 960 GCCCTACAAT TGCCACAGGC AGCAATAGTT TCGCACACAG TCACATAAAT GCTCATACAT 1020 GTAGCAAGAT ATTGTAAATG TGTGTGTGCA TGCTCATCGC AAAAAAATTA TACTGACGCA 1080 CATCAGCGCT TTCATACCCT TCCCAAAAAT GATCACGAAA ACACTAAAAT TTTGCATGAG 1140 AATGGCAAAG CCGTTTAAGA GTTCGGCATA CTCTTACTAA GAATGGCAGA CGGGATCGTC 1200 GCGAATTGCC ACTCGCGAAC AAATTTCAAT TTTTGTGGGC AACTGGCTTG TGCTCGCAAA 1260 GAGTGAGATA GAGAGAGAAA AAAGAGAGAG GGAGAGTAAA GAATAATAGC ACACAGAATG 1320 TGCTCATATG AAAATGCGAC CATGCCGGTT GAGCGCTTAC AGCGCACTGA ATTCGTCGTT 1380 TCAAATTTTG ACTGCCCGAG CGCGAAGTGC CGAAACCGAA CAAATACGAT GTTGTGCGGC 1440 ACTTCGCTCA TATTTTTGTG CGTGAAAAAG TTTGTTCTAG TTGTCCAGTC TATGTGTTGT 1500 ATGGTTAGTG 1510 // ID GALILEO standard; DNA; INV; 2304 BP. XX AC AY187769; XX DR FLYBASE; FBgn0027840; Dbuz\Galileo. XX FT source AY187769:3729..6032 XX SQ Sequence 2304 BP; 701 A; 468 C; 452 G; 683 T; 0 other; CACTAACCAT ACAACACATA GACTGGACAA CTGGAACAAA TTAATGTGCA CACTTATTTT 60 TTGACATTAG AACCCCATCA ACTTCGTATT TGCTCGGGTT TTTACTTTTC GGGTCGGGAG 120 AGAAAAATTG CTGATGACAA AATCATTCCG CTTCGAGCGC TCGACGGTGA CGGTCGATAT 180 TCCAAAAAGG CCTGACATGT ACCGCTCTTT CCCTTGCTTT CTTTCTCTTT CTGCCCCATG 240 CTTTTTACTC TCTGCTCTCT TCTCACAAAA ATATATGTAT GCTAGCACAA ACAGCAAGCA 300 GCTCACAAAA ATTGAAATTT CTTCTCGACG CGATCCCATT GGTTCCCTTC TGAGCGAAGG 360 GTATACCGAA TTCTTAAACG GCTTTGCCAT GCTCATGCAA AATGTTAGCG TTTTCATGAT 420 TATTTGTGGA AAGGGTATGA AAGTGTTGAT GTTCATAAAC AATACTATTT CCCGATGACC 480 ATACACACAC ACATTTACAC TTCCTCGCTA TATGTGTAGG CATTTATGTG CTTGTGTGCA 540 ATATTCTTGC TGCCTGTAGA ACTATTAGAA TCCCAGCAGT TGATGTTCGT TCGGTTGGGT 600 AGCGATGTGA GCTTGCATAC ACACACACAT TTACACTTCC TCGCTATATG TATAGGCATT 660 TATGTGCATG TGTGCGATAC ACTTGCTGCA TGTAGCAATT CTAATTTCCG AGCGATTGCT 720 GTTGGGTAGC TGATGTGAGT CTACACACAC ACACATACAT ATGTAAGTGT ACGTCTGTGA 780 TGTCATAATC ACAATCGCCG AGTATCACTT GTATAGGCAC TTGAATGAAT GCTCTGCCTG 840 GGGTGTGCTT ACCTCAACTT TGAGTTAAGT AAACGATTTT CGGTTCAAAT TGCACGAGAG 900 GTCAAATATT TTCAATTACC TGTTCGGTTA TCATTTGTTC TTTAATATTT AGTACCTGTT 960 TTGACCCTAT AGGCAAAAAT GATACATGTT ATTCAAATGC TTGCGCAAAT TCAGGTCACA 1020 TATAACCCTT TTTAATAAAT GCCTACATAT ACAGGTAAAA TTTACCCTTT TATGCAAAAA 1080 ATATGCATCC GTTATGTAGA CATATTTTTC ACTAAATAAC AACAAAAATA TATTATGTTT 1140 TATGTTTTAT TTATTTTAAA TTTCTTATTC ACGAATCATT TTCAGTTTAC TTACAGGTAC 1200 TAAATATTAA AGAACAAATG ATAACCGATC AACAGGTAAT ACAAAATAAT TGACCTCTCG 1260 TGCAATTTGA ACCCAAAATC GTTTACTTAA TTCAAAGTAG GTAGGCACAC GCCCAATATT 1320 CGAGACCCAT GCACGAAACA TGCAAATGCC CCAGGCAGAG CATTCATTCA AGTGCCTATA 1380 CAAGTGATAC TCGGCGATTG TGATTATCAC ATCACAGACG TACACATATG TATGTATGTG 1440 TGTGCGTATG CATGCTGACA TTGGCTAGCC AACAGCAATC GCTCGGAAAT TAGAATTGCT 1500 ACATGCAGCA AGTGTATCGC ACACAGGCAC ATAAATGCCT ACACATATGG CAAGGAAGTG 1560 TAAATGTGTA TGTGTACAGA ACGATCAACT GCTGGGGTTC TAATAGTGCT CTACAGGCGG 1620 CAGCAAGAAT ATTGCACACA GGCACATAAA TGCCTACACA TATAGCGAAG AAGTGTAAAT 1680 GTGTGTGTGT ATGCAAGCTC ACATCGCTAC CCAACCAAAC GAACATCAAC TGCTGGGGTT 1740 CTAATGTGCT ACAGGCAGCA AGCATATTGC ACACAGGCAC ATAAATGCCT ACACATATAG 1800 CGAGGAAGTG TAAATGTGTG TGTGTATGGT CAGCGAGAAA AAATATTCTG TATGCACATC 1860 AACACTTTCA TACCCTTTCC ACAAATAATC ATGAAAACGC TAACATTTTG CATGAGCATG 1920 GCAAAGCTGT TTAAGAATTC GGTATACTCT TCGCTCAGAA GGGAACCAAT GGGATCGCGT 1980 CGAGAAGAAA TTTCAATTTT TGTGAGCTGC TTGCTGGTTG TGCCAGCATA CATATATTTT 2040 TGTGAGAAGA GAGCAGAGAG TAAAAAGCAT GGGGCAGAAA GAGAAAGAAA GCAAGGGAAA 2100 GAGCGGTACA TGTCAGGCCT TTTTGGAATA TCGACCGTCA CCGTCGAGCG CTCGAAGCGG 2160 AATGATTTTG TCATCAGCAA TTTTTCTCTC CCGACCCGAA AAGTAAAAAC CCGAGCAAAT 2220 ACGAAGTTGA TGGGGTTCTA ATGTCAAAAA ATAAGTGTGC ACATTAATTT GTTCCAGTTG 2280 TCCAGTCTAT GTGTTGTATG GTTA 2304 // ID KEPLER standard; DNA; INV; 722 BP. XX AC AF368884; XX DR FLYBASE; FBgn0063570; Dbuz\Kepler. XX FT source AF368884:4070..4791 XX SQ Sequence 722 BP; 241 A; 132 C; 140 G; 209 T; 0 other; CACTAACCAT ACAACACATA GACTGGACAA CTAGAACAAA CTTTTTTCAC GAAGCAGATT 60 ATGATTTCCC GTCCTAGTCA TCTTCGTATT TGCTCGGCTC TTTAATTTAC TGTTCGGGCA 120 GCGAAAATAT AGCAGTTGAT ATTCAGTTCG AGTCACATAA ATGCTCATAC ATGAAGCAAG 180 ACATAGTAAA TGTGTGTGTG CATGCTCATC GCGAAAAAGG AATGCTTATG TACATCACGC 240 TTTCATACCC TTCAAAAAAA TAATCACGAA AACACTAAAA TTTTGCATGA GCATGGCAAA 300 CCTTTTGAAG AGTTCGGCAT ACTCTTATTC AAAAGCGAAC ATGTGATCGT CGCGAATCGC 360 GACTCGCGGA CAAATTTCAA TTTTTGTGAG CTGCTGGCTT GTGCCTGCGT AGACACACAT 420 ACGTATATAC ATATATATAT ATATATATAT ATATATATAT ATATATATAT ATATATATAT 480 AGAGAGGCAT AGAGAGAGTG AGTGAGTAAA GAGAGAATAG CACACATCGG ACTTCATATG 540 AGAATAAAAT CATTTCGTTC GAGCGCTGGC AGCGAACTGA ATATCAACTG CTGTATTTTC 600 GCTGCCCGAA CAGTAAATTA AAGAGCCGAG CAAATACGAA GATGACTAGG ACGAGAAATC 660 ATAATCTGCT TCGTGCAAAA AGTTTGTTCT AGTTGTCCAG TCTATGTGTT GTATGGTTAG 720 TG 722 // ID YAKHETA standard; DNA; INV; 5691 BP. XX AC AF043258; XX DR FLYBASE; FBgn0024768; Dyak\HeT-A. XX FT source AF043258:34..5724 FT SO_feature five_prime_UTR ; SO:0000204:1..587 FT SO_feature three_prime_UTR ; SO:0000205:3276..5691 FT SO_feature CDS ; SO:0000316:588..3275 FT /protein_id="AAC01742.1" FT /translation="MAQVILSDDSSNDEVLSLFSSPESQNTPFYLEISPMSQNSNNSQ FT FNISIINMKKSSSNSAINSLKNPSEAAIKIINSLTYKGKENRENKNAQKDPLFLTNTN FT KETAGAKSSVSNGPVVSLLSVTHTNRGKKLTTAHNTNAAETTNTNMDDKKSGALRNFP FT FPTHEDNSMERNLSSSTKIGSKSISPHSLSPTHTSKVINISTNSRSKSPALANADTLH FT KLANIYDNSRDHNQGETQHKFITRNTFFQNLYPKPDISKLSLKNKATILAKTTKNKCI FT SPQLKGASLCSPVQPNLNFKVTTTHSMGNTSASRTLNRPAAKRDLFNSPPNSTNALPM FT SFSEVVAGTGPGIVASSDPAPITKTPGKRTNTNMDMDSFSYKTPNKKACVPTNFATPN FT PFPPLATPIFKSKAAKSIVEEIKAPRHPVESEKASARSVPAQTEIAPPPPKNSATELP FT PWQIVPQSRRAPPIHIKNVREIVPLLEKLNYSAGVDSFTTKTSIGNGVTIQAKDLTAH FT RIIKDILAKSGIPYYSNQNKSERGFRVVIRHLHPSTPCSWITSELQKLGHQTKFTRNM FT TNPATGGPMRMHEIEIVSAMDGSHLRILSIKQLGGQKVEIERKNRTRELVQCFRCQGF FT RHARNTCMKPPRCMKCAGQHWSSECTKPRSTPATCSNCQGNHISAYKGCPAYKAEKQK FT LAVNRIDFHKIRTIMDAKSNNNERQPRPPFNKTPRLPYSTEMAEARKEAARKSAMNPF FT RQNVKDSRPNLPYLSSHEIAIQKRLNKWRRSTNKASTNSRTNPKVKAPNMTKNPAQRH FT LEKFQNGLRKEQKNVEKAMEHKQGKDDSPPTTSRAALANLKPKIVKETTPSPQNINTF FT PENSQPDDPVIKLANRVDNLEKKIDILMALIIQARNAHE" XX CC Annotation from Mary-Lou Pardue (6 April 2004). XX SQ Sequence 5691 BP; 2041 A; 1425 C; 1025 G; 1199 T; 1 other; CAAACGAAAA ATAAACGGGA CAGTTAAGTG AAAATCTCCG CAAGTGAAAA GACAAAAGCA 60 AAGTTCAAAT AGCAAAGTTA ATAAAAAATA TTTTAAATTG CCTAAAAATT GTTTATAACA 120 ATTACAATTT AAAATACAAT CAAAAGTACA AAGCAAACGA AAAATAAACG GGACATTTAA 180 GTGAAAATCT CCACAAGTGA AAAAACAAAA AATTCCCGAC CGAAAAATTT ATATAGAAAC 240 AATTCGACAA ACAATTAAAA TAAACAATTA AAATAAACAT ATTAAAAAGC ATATAAATTC 300 GGTTTTTTCG GTGAAATACT TTGAGACAAA TCGACAAACT CAGCGAGCTG CAGATTTTAA 360 TAAAGAAGAG GAGCCATAAA GAAGAACATA AAGAGGAGCT ATAAAGAGGA ACCTTAAAGG 420 AGAACCATAA AGAAGAACGC TAACGTCGCA AAGAAGGAAG AACGCAAAAA AGAGGCAAAG 480 AAGGACTAGC AAAGAAAAGG AATCGTACCA AAGGACCCGC CAAAACCAAA GCCAGGGTAT 540 TTATACCACA AAAAGTATCG TTCCTTTACA TATAGTCAGC AAATATTATG GCCCAAGTAA 600 TTCTCTCTGA CGACTCTTCA AATGATGAGG TGCTCTCTCT TTTCTCGAGC CCAGAAAGTC 660 AAAATACCCC TTTCTACCTG GAAATCTCGC CCATGTCCCA AAATTCTAAC AACTCTCAGT 720 TTAATATCAG CATAATAAAC ATGAAAAAAT CGTCTTCCAA CTCTGCAATT AACAGTCTAA 780 AAAACCCTTC CGAGGCTGCT ATAAAAATTA TAAATTCACT TACATATAAG GGGAAAGAGA 840 ATAGAGAAAA CAAAAATGCC CAAAAAGACC CGCTCTTCCT TACTAATACC AATAAAGAAA 900 CAGCTGGCGC CAAAAGTAGC GTCTCAAATG GGCCAGTGGT TTCCCTTCTT TCTGTCACAC 960 ATACAAATAG GGGGAAAAAA TTGACAACAG CTCACAACAC CAATGCAGCT GAAACCACTA 1020 ATACCAACAT GGATGACAAA AAGAGCGGCG CTCTTAGAAA TTTCCCTTTC CCCACACATG 1080 AAGACAACAG CATGGAGAGA AACCTCAGCT CATCTACAAA AATTGGCTCT AAAAGCATTT 1140 CCCCTCACTC TCTCTCACCT ACACACACAA GCAAGGTTAT TAACATAAGC ACAAACAGCC 1200 GCTCAAAAAG TCCCGCGCTT GCAAATGCTG ACACACTACA TAAACTAGCC AATATATATG 1260 ACAATAGCAG GGACCACAAC CAAGGTGAAA CACAACATAA ATTTATAACT CGCAATACTT 1320 TTTTCCAAAA CTTGTATCCT AAACCTGACA TTTCCAAACT AAGCTTAAAA AATAAGGCCA 1380 CCATTCTCGC GAAAACTACA AAAAATAAAT GCATCTCCCC CCAACTGAAA GGCGCTTCTT 1440 TGTGTTCCCC TGTTCAGCCT AATTTAAATT TCAAAGTCAC CACTACACAC TCTATGGGTA 1500 ACACATCTGC ATCCAGAACC CTAAACCGGC CTGCAGCCAA GCGGGACCTT TTTAATTCAC 1560 CCCCCAATAG CACGAACGCA CTGCCTATGA GTTTTTCGGA AGTGGTGGCC GGAACCGGTC 1620 CAGGAATTGT GGCATCCTCT GATCCGGCAC CAATTACGAA AACCCCGGGC AAGCGCACAA 1680 ATACCAACAT GGACATGGAT AGCTTTAGCT ACAAAACGCC CAATAAAAAA GCATGTGTGC 1740 CCACCAATTT TGCGACCCCA AACCCTTTCC CCCCCCTAGC CACCCCCATC TTTAAAAGTA 1800 AGGCGGCCAA AAGTATTGTC GAGGAAATAA AAGCCCCCCG CCACCCAGTC GAGAGTGAAA 1860 AGGCCTCTGC ACGCAGCGTA CCGGCGCAAA CTGAAATTGC CCCCCCTCCC CCTAAAAATT 1920 CGGCTACCGA ACTGCCCCCG TGGCAAATAG TTCCACAGAG CCGCAGGGCC CCCCCTATTC 1980 ATATAAAAAA TGTTAGGGAA ATCGTGCCAC TATTGGAAAA GCTAAATTAC TCAGCAGGGG 2040 TAGACAGTTT TACAACAAAA ACGTCTATAG GAAACGGTGT AACTATACAG GCTAAAGACT 2100 TGACTGCACA TAGAATAATT AAAGACATAC TTGCTAAAAG TGGTATCCCA TACTATTCAA 2160 ACCAGAATAA ATCTGAAAGG GGTTTCAGAG TTGTTATTCG TCACCTGCAC CCCTCTACCC 2220 CTTGCTCGTG GATCACCAGC GAGCTGCAGA AGCTCGGCCA CCAGACTAAG TTCACAAGGA 2280 ACATGACAAA TCCTGCAACT GGTGGTCCGA TGCGGATGCA CGAGATAGAA ATTGTCTCGG 2340 CCATGGACGG AAGCCACCTT AGGATCCTCT CCATTAAACA GCTAGGAGGA CAAAAAGTGG 2400 AAATCGAAAG GAAAAACAGG ACGAGGGAAC TCGTCCAGTG CTTCAGGTGC CAGGGTTTCA 2460 GGCATGCTAG GAACACATGT ATGAAGCCCC CTAGGTGCAT GAAGTGCGCA GGCCAGCATT 2520 GGTCTAGTGA GTGCACTAAG CCAAGATCAA CCCCCGCCAC CTGTTCAAAC TGCCAAGGAA 2580 ACCACATCAG CGCATATAAG GGGTGCCCTG CCTATAAGGC AGAGAAGCAA AAATTAGCAG 2640 TCAACAGGAT AGATTTTCAC AAAATTAGGA CAATAATGGA CGCAAAAAGC AACAATAACG 2700 AACGTCAGCC CCGCCCCCCT TTCAACAAGA CCCCCCGACT ACCCTATAGT ACCGAAATGG 2760 CCGAAGCCCG AAAGGAAGCC GCCAGGAAGT CTGCAATGAA CCCGTTCCGG CAAAATGTGA 2820 AGGATAGTAG GCCAAACCTA CCGTACCTTT CTTCACATGA AATTGCCATC CAAAAACGCC 2880 TAAATAAATG GCGCCGGAGT ACAAACAAGG CCTCCACAAA TAGCAGGACC AATCCTAAGG 2940 TAAAGGCCCC AAATATGACT AAAAACCCTG CGCAAAGGCA TCTGGAAAAA TTCCAGAACG 3000 GGCTTCGAAA GGAACAAAAA AACGTAGAAA AAGCTATGGA GCACAAACAG GGAAAAGACG 3060 ACAGCCCCCC GACAACGAGC AGAGCTGCTT TGGCAAATCT AAAGCCAAAG ATAGTCAAGG 3120 AGACAACGCC CTCACCACAA AATATCAACA CTTTTCCAGA AAATAGCCAA CCCGACGACC 3180 CAGTCATTAA ACTGGCAAAT AGAGTTGACA ACCTGGAAAA GAAAATTGAT ATATTAATGG 3240 CCTTAATTAT ACAAGCAAGA AATGCGCACG AATAAACCTC TTGACGACGT ATTCCTGCTG 3300 ACCCCCACGA TGAAGACGTA AAGGATGCCA CCCACTAGGA CCAGCATATA CGAATCTGGA 3360 AGCTGATGGA CAGGAGAAGG ATCAACGCGC AGCAAAAACA TGATGAATTT AAGTCGACTC 3420 ATTGCAGCAG CGTATCGACA ACGTCACTTA TCTGAATTTT TTGCTGCAAC TCCTTTGAAA 3480 TGTCCCAAAA CACCAGCTGC AATCTCTACG AAAGCCATAC TTGACGACAA AAGACGACCC 3540 GACTGGGCGA AATGGACAAA CTAACTTCTT GACACTCATA ACACAAAAAA CATACAATGT 3600 TTTGATGCAG TTTCCTTATT GGGATTCCCC CTCGTTTTTA CAGACCCTAG GCGTGGTGCT 3660 GCCGACGCGC GAAAAGGTCG AAATCGTAAC CTCTATAATT GACTTTTTTC CTCCAGAAAT 3720 GCAATATCGA CAAAATCTCC GTTAAAGATT GAAATACTTT AATCAAAACA TAAAACAGAG 3780 CAATATATAA TAATATATAT AAAAATCCAC AATGTTTTGA TGCCGTCCCT ATTCCAGGAA 3840 GCCCCCTCGA TTTTATTGAC CCTTGGTGCG CGCTCGCGAC CCGGCAAAAG GTCAAAATCG 3900 TGACCCTTGA ATTTCGACTT TTTCTGGTAG CAACGATAAC ACTGACAATA TCCCTGAAAA 3960 AGAAAAACAT ATTAATAAAA GCATAAAACT ACGCAACCTA CTACTTCCCT GCTAGGCGCA 4020 ACGAAAAACG AAGCTTTTGG CACCCAAAAC ATTAAAATCC TATAATGTTT GGATGCCGTC 4080 TCCATGTTGG GGTGCATCCT CGTCCTTCTG AACCCTAGGT GCGCCGCTGC TGACCCGGTA 4140 AGGGGTCAAT CTCGCGACCC TTGAATTCGA ATTCGACTTT TTCCTGGCAG CAACGGCTTA 4200 CAGACAAAAT CCCTGTAAAA TCATAAAACA AATTAAAAAA CAAATTTATA AAAACATAAA 4260 ATCACGCAAC TTACCTTTTC CCGACTCGGC GAAATGAAAA ACGACGATTT TGGCACATAA 4320 AACATTAAAA ACCTAAAATG TTTGTGCCGT CTCCATGTTG GGATGCCCCC TCGCCCTTAT 4380 GAATTCCAAG TGCGGCGCTG TTGACCCGGC AAAAGGTCAA AATCTTGACC CTTAAATTCC 4440 ACCTTTTCTT GGTAGCAACG GCATACTGAC AATATCCCTG AAAAGAGCAA AATAAACTAA 4500 CAACAACAGA AAACTAAATG CATTTACAAA CTCACTGACA AACGAACCTC CGACGATGCT 4560 CTTTATAAAT CCAAGGCAAT GCATTCCTAC CCAAATCCCA CCGGAAAAAA TGAAAAATCC 4620 TGGACGATTT GGACCTGCAA AAGAAAAGAC AAAACAAAAA TCAGAAACAA AACTAAACAC 4680 ATAGATATAC TTATCTAATC CAATTTTCTG CATCCAATTC CATGGACACT ACGAGCGTCG 4740 GCCCTAAGCA AAGCATCTTG GCCCGGCGTC CACAAGATGC CTCCTTTCCC TGCGAGACGA 4800 CCCACAGAAG CGACGACCCT GGTGACTCCA AAGCAATGAC GACGCTCTCG CCGACACGAT 4860 GAACTCCAAC CGAAGTGAAG CAACTCCCTG GACCCTGGAA TAAGACGGCG ACGCGAATCC 4920 ACTACGCACT GAAGAACTCC AGCTGAAATG CAGCAACTCC TTGGACCTTG GAATAAGACG 4980 GCGGCGCGGG AATCCACTTC GCTTTGGACC ATTTGACGGC TGGCGACGAG ACCCATGGGC 5040 GGGTTGGCAG CCTCCTGCAA ATTAAAATAT AAATTGACGG CGGCGCGGGA TCTGTACACG 5100 AATATAAAAA CACTGACGAC TGGTGAAACT ACTGTAACAA AAATGAAAAC TCAAACGCAA 5160 AATAACGGCC GCTGTGGTGG TAAAAAGTAC CACACTTGTC AGCCGGCCAA AATTTACACC 5220 AGGAATACTT ACCAGAAACG CTGGCCCAAC TTTTTCCTTT CAAATCTGCG CGCAGCCTGC 5280 GGCTCCAAAT AACTGGGACA AACAACTCCC TTTCTCCACC GGCAATGACT CCTGAnGTTC 5340 GATGTTTCCG TCCTTCTGGC GGGGGCATCT GAAAATAGAA ACAATATAAA TGTTAAACTT 5400 AATTTAATTG ACAAATGCAA ATTTCCTAAA CCTTTAAAAT GTTAACAAAC AAAAACCATA 5460 TGTTAATGTT ACCATCCACG CAATGTTTAT AAGTAAGAAA ATACCAAACC ATGTTATACT 5520 TACCACACCT GTCCCAACCC TAACACTCAT CCCCAACAAT GTACAAATTC AAAATCGAAA 5580 ATAATTGTAC CTAGATATTG CATCTGTGTA ATCACAGGCA AATAAATGCG TGGATGCGGG 5640 GCAGAAATCA TCATTCTGTC TCCCGTACTT TCACCAGAAA CGTCAAAAAA A 5691 // ID TARTVIR standard; DNA; INV; 8500 BP. XX AC AY219709; XX DR FLYBASE; FBgn0066148; Dvir\TART. XX FT source AY219709:4651..13210 FT SO_feature five_prime_UTR ; SO:0000204:1..197 FT SO_feature three_prime_UTR ; SO:0000205:7957..8500 FT SO_feature CDS ; SO:0000316:198..3311 FT SO_feature CDS ; SO:0000316:3362..7957 FT /protein_id="AAO67563.1" FT /translation="MQKPTVSKSPPKSPHNASEITANSAVPPHRATEECSISERAKSE FT LSQLQQQLTKNPQKQLSREHRRQQLLRLCEEEERLMPDVPAQQPTLTLKHKPKLKTVL FT ANSSPPSSAPLRMSLTKSSKAKARKESQSSACREKSPEENTYVNVLESDDESDIAANK FT NYDQPQQQTTIAQVHNFNTNTSITSTPKAASHCDPSPNVSAHEVELHNVDNTREASYV FT NNIDELTDVFDDEDDDDDCYRPAAVINSSVADMLNRTTTVEHNSTATANRGEILANVK FT TSDAFAANKINSITFEKIEPIPQTITPTVAKGPKFNCSMLPTTSLSAKKSLKRKGTLP FT QQSKQKRAQTLAVQGNSRPGMKIQDFLKSSTRVTKATNKAAIINKYKKSNRNTKKVNP FT SSSTKLNNSSMDIDLDSSDISANNIGSDDDIVQIQSNAQEVTNHATKVQQQQQHRQQQ FT QHHQQQQQQQQQQPTKSTKIPTIFLPSISDIQQIIDMLNNTVGANKYTTKCTQNDGVR FT VQCSDLLSYNAAIALLGANTEIQMHTHQMRSERGYRVLLKNVHHSTPCEQIRAELAKH FT GHTVRFASVIKHRFERRPLNMFEVELAPNGDTNDKVLELKTLGNQHIEVERQLKRDEP FT VQCHRCQSFGHSKNYCRRPFACLKCGEQHPTTTCTKPRNTPAKCVNCKADHIASFKGC FT SVYKMEREKLAANRVRAAIDRQQQHQQQQQQQQQQQQQQQQQQHQQQQQQQQQQQQQQ FT QQQQQNRQLQQHQQQQQQQHRQLQQHHQQQQQLQHQHQQPQYLAINTSAPSGIKSTKE FT RKQQQQQLRQQQQKQQQQQQQQHQQLKPNNQLTYSQVASGQANTSLSHNPAMQHLKKY FT QQQLQLEQQQQQRQQQQQQQQQLQPEQQQQLQRQQRLRQQQLPEQQKRQQYQRQQEQQ FT SHEQQQRHQQQSQQQQHRQQADSPLLEALQQNVISINEMSKKIDMLISLLLTMAASNT FT NINKQGEQQTIPLTTSTSDTLNNVNKKNNFPNLMSTLISTAQPINNSQSANANCEHIA FT TDSSQYV" FT /protein_id="AAO67562.1" FT /translation="RRGVELLRSIHSNKKLNILATGGATHFPYTSRNRPSAIDIAVYR FT ALTMTGLQTHSTIDLESDHLPIHIGLRVGRFPYKQTTMRLLPINANVKKFQNHLDKNV FT RLNTEIISGPDIEDAIDILNKNIYNASLAATHPPRRHQQQQRQGTNTNRMNHGRFKLT FT NETIRLLAIKRQRKREHMVMRTPLTRSRLSQAQNKLKKALRMDKKKQTNKMFEQIDAT FT DRYKIQKLWRITNNIKRQPEPNWPLKIQATDNNNSRSNRTQWTKTSKEKAEVFAAYLE FT QRFSPIFSNTAEYRLQVNNEINTRQQTNDCDSGAGVAGVIPFRPITYTEVTKEIGCLT FT IKKAAGIDNIDNRVIKALPKKAILYLVMIYNCILRHGHFPRQWKCAAIKTILKPGKPT FT EDVASYRPISLLAGFSKIFERLLMYRLYECSEFAKAVPLHQFGFRKDHGTEQQLARVT FT QFILSAFEKKQYCSAVYIDIREAFDRVWHEGLLLNLARYCLIGCIMVLKSYLLDRTFI FT VHGNEGIKSRIGKVSAGVPQGSVLGPILYIIYTSDMPLPHIATPPNPTTTAATITNTE FT TSITNPYNCSMLLSTFADDTVIMSSADILQTSYSANQTYLRQFTIWCNRWCIQINDSK FT SAHVVHTLRNLNSRMNYLTPRLNGQEIPSKPRQKYLGVHLDRKLHMQHHVTQLRCRLK FT ALYNKLEWLIGNKSVLSVECKITIYKQMIAPIWRYALPIWGAMISDTQLRRIESTQNW FT ILRKIVKASWWTRNKDIRDTYDIGTVDEIFHTTSKRFADSLAIHPNANARKLIASPYV FT PIRLDRQRYSLQLQQHVRPLQQLVQQSQEQQQLPTLLRMELEEEEANSLRMQQQQQQQ FT TPQQQPARFSEMRINSLRRHYREGRITLEELKLAIREQPLVIQQLVLPRELAIQIYQQ FT QQQQQQQQQQQQQQQQQQQQQQQQQQQQQQHQQQQQQQQQQEQQQQQQQQQHQYQQQH FT QQQQQQQIQQQQQQLEGLQHQPNQSPDVEIVLDTPQQQQEQQLERPLPQPQQANLQQQ FT RLTEDEQQLEHRLERLRQQLEREMDEAANNEPTQQQRTPEQQQSNQQQHTIALTENKN FT TNINPSNSVCANITNFSISQSIIIAPYSLINIATDAPWQQNQPTTTQQLRQQHSNLHQ FT QHLQQQREVVEAPSSNNAQSTVRGAKRRRSSSTSCIEPKRRSPAAPDNQQQRQLHTLL FT GKRHSGGPAQHNCWPPKRQKLVQQQQQRQACLRRTRLGTWQLQMPHRLGIWLLLSCFG FT MQTHHKVPQKLVWAPPADYGQSSTNGRHLQIARRRPWHSWQRLQRRWSNCGRQTHRKA FT RPYNILIHET" XX CC Annotation from Elena Casacuberta (6 April 2004). XX SQ Sequence 8500 BP; 3098 A; 2143 C; 1624 G; 1635 T; 0 other; CAATTTAAAA AAAACACAAC TTAAAAATCA GAATTTTTCA ATTCTAATAA TTAACCAAGC 60 TTGCAAATAT ATTTGTGAAG TTATCTGAGT AATTGCTAAA CAGGCATAGT GGCAAGCAAA 120 TACAATAAAT ATTTAAAAGG TGCAATAGTG GTAAAGACAA ACATAACTGC ACCAGAAATA 180 ATATATAAAG TGCATAAATG CAAAAACCAA CTGTGTCTAA AAGCCCGCCT AAAAGTCCGC 240 ATAACGCTTC TGAAATTACC GCAAACTCGG CTGTCCCCCC CCACCGCGCA ACCGAAGAAT 300 GCAGCATAAG TGAAAGAGCG AAAAGTGAAC TTTCACAGTT GCAGCAACAG CTCACAAAAA 360 ATCCGCAAAA GCAGCTCAGC CGAGAACACC GGCGACAGCA GTTGCTAAGG CTATGCGAAG 420 AGGAGGAAAG GTTAATGCCC GATGTCCCAG CGCAGCAGCC GACGCTTACG CTCAAGCATA 480 AGCCGAAGCT GAAAACAGTT CTCGCCAACT CTTCACCGCC CAGCTCGGCA CCGCTGAGAA 540 TGTCTCTGAC GAAGTCGTCG AAAGCTAAGG CAAGGAAAGA GAGCCAATCA TCTGCCTGCA 600 GAGAGAAAAG TCCAGAAGAG AACACCTATG TCAATGTGTT GGAGTCTGAC GATGAAAGCG 660 ACATAGCAGC CAATAAAAAT TATGACCAGC CGCAACAGCA GACAACTATT GCACAGGTAC 720 ATAATTTCAA TACAAATACT AGCATAACTA GCACTCCCAA AGCAGCCTCT CACTGTGATC 780 CATCGCCAAA CGTATCGGCA CACGAAGTTG AACTTCATAA TGTTGATAAC ACAAGAGAAG 840 CTAGCTATGT TAATAACATT GATGAACTAA CAGACGTCTT TGACGACGAA GATGATGATG 900 ATGACTGTTA CCGGCCCGCT GCTGTTATTA ACAGCTCAGT TGCTGACATG CTTAACAGAA 960 CAACTACTGT TGAGCATAAC AGCACAGCCA CTGCTAACAG AGGGGAAATT CTTGCAAATG 1020 TTAAAACATC TGATGCATTT GCTGCCAATA AAATTAATAG TATTACTTTT GAAAAAATCG 1080 AACCAATTCC ACAAACAATA ACGCCAACAG TAGCAAAAGG ACCTAAATTT AACTGCTCTA 1140 TGCTACCTAC AACTTCCCTG TCTGCCAAGA AAAGTTTAAA ACGTAAAGGT ACACTGCCGC 1200 AACAAAGTAA ACAAAAGCGA GCACAGACAC TGGCAGTACA AGGAAACTCA CGACCAGGCA 1260 TGAAGATACA GGACTTCTTG AAGTCATCAA CAAGAGTAAC TAAGGCAACA AATAAAGCAG 1320 CTATAATCAA CAAATATAAA AAAAGCAATC GTAATACTAA AAAAGTAAAC CCCTCTAGCA 1380 GCACTAAACT CAACAACAGC AGTATGGATA TTGACCTGGA TAGCAGCGAC ATATCAGCAA 1440 ATAACATAGG CTCTGATGAT GACATTGTTC AGATCCAGAG CAATGCACAA GAGGTTACAA 1500 ACCACGCCAC CAAAGTCCAG CAACAACAAC AGCATCGACA GCAGCAACAA CATCACCAGC 1560 AGCAGCAACA ACAGCAACAG CAGCAGCCAA CTAAAAGCAC AAAAATTCCA ACAATTTTTC 1620 TACCCAGCAT AAGTGACATA CAACAAATAA TTGATATGCT CAATAATACA GTGGGTGCAA 1680 ATAAATATAC AACAAAGTGC ACACAAAACG ATGGAGTCAG AGTGCAATGC AGCGACTTAC 1740 TATCGTATAA TGCAGCTATA GCACTGCTCG GCGCCAATAC GGAAATACAG ATGCACACCC 1800 ACCAGATGCG AAGCGAGCGA GGATACCGTG TTTTACTTAA AAATGTCCAC CACTCCACGC 1860 CATGCGAGCA AATCCGAGCA GAATTGGCGA AACATGGACA CACGGTTCGC TTTGCCAGCG 1920 TCATTAAACA TCGATTTGAG CGACGGCCAC TTAACATGTT TGAAGTCGAA CTGGCACCAA 1980 ACGGCGACAC CAACGACAAA GTACTGGAAC TGAAAACTCT GGGAAACCAA CACATTGAAG 2040 TAGAAAGACA GCTGAAGCGA GATGAGCCTG TACAGTGTCA TCGGTGCCAA TCGTTTGGGC 2100 ATAGCAAGAA CTACTGTCGT AGGCCTTTTG CATGTCTCAA ATGCGGTGAA CAGCACCCGA 2160 CTACAACGTG TACTAAACCC AGAAACACAC CTGCTAAGTG CGTCAACTGC AAGGCTGACC 2220 ACATAGCCAG CTTTAAGGGA TGCAGTGTGT ACAAAATGGA GCGGGAAAAA CTAGCAGCAA 2280 ACCGTGTCCG GGCGGCAATT GATAGGCAAC AGCAGCACCA ACAACAGCAG CAACAGCAAC 2340 AACAACAGCA GCAGCAGCAG CAGCAACAGC AGCATCAGCA ACAGCAACAG CAACAACAAC 2400 AGCAACAACA GCAGCAGCAA CAACAACAAC AAAATCGGCA GCTACAACAA CATCAGCAGC 2460 AGCAGCAACA ACAACATCGG CAGCTACAAC AACATCATCA GCAGCAGCAG CAACTACAAC 2520 ACCAACACCA GCAGCCACAA TACCTAGCAA TTAATACTTC AGCTCCTAGT GGAATTAAGT 2580 CGACTAAGGA ACGGAAGCAA CAGCAACAGC AACTTCGGCA ACAACAGCAA AAGCAGCAGC 2640 AGCAACAACA GCAGCAACAT CAACAATTGA AGCCAAACAA TCAACTGACT TATAGCCAAG 2700 TAGCCAGTGG ACAAGCAAAT ACGAGCCTCA GTCACAACCC AGCCATGCAA CACCTCAAGA 2760 AGTACCAGCA ACAGCTACAA CTGGAGCAGC AACAGCAGCA ACGACAACAG CAGCAACAAC 2820 AACAGCAGCA GCTACAGCCC GAGCAGCAGC AGCAACTTCA GAGGCAACAG CGACTGAGGC 2880 AACAACAACT ACCTGAGCAG CAAAAACGGC AGCAATATCA ACGACAGCAA GAGCAGCAAT 2940 CACATGAGCA ACAGCAACGA CACCAGCAGC AATCGCAACA ACAACAACAT AGACAACAGG 3000 CGGACTCACC ACTCTTGGAG GCGCTGCAGC AAAATGTGAT AAGTATAAAT GAGATGTCCA 3060 AGAAAATAGA CATGCTAATT AGCCTGTTGC TCACAATGGC AGCAAGCAAC ACTAATATTA 3120 ATAAACAAGG AGAACAACAA ACAATCCCAC TAACAACATC AACCTCTGAT ACCTTGAACA 3180 ATGTCAATAA AAAAAACAAT TTTCCTAATT TAATGAGTAC ATTGATTAGC ACAGCACAGC 3240 CAATAAACAA CAGCCAATCG GCAAACGCCA ATTGTGAACA CATAGCCACT GACAGTAGCC 3300 AGTATGTATA GAGCGAACAA CACCAACAAT GCCAGCGCGA CTGCCACATT GAGTCAAGTC 3360 AATGATGGCA GACACACCAC TCACACTACT AGTGACACCA CAACTACTAC AACAACACAT 3420 TTGTCCAACC TTAACAGTCC ATTGCTTAAC AGGAGAAGGC CAAACATTAT TCCAAATGAC 3480 GAACCACAGC AGCGACCAAT ATGGCAACTC TTTTAAAATT GCATATTGGA ACGCTGCTGG 3540 CGTTAGAAAC AAACTAAATG AATTAGAGGT TTTCATGAAT CGACATAAAA TTGATATTAT 3600 GATGCTCGTC GAGCTACGAT TGGGCCCCAG CGCTCTCAAT CCAGGTCCCC CAATTACTAT 3660 TAACGGCTAC CATACATATG CGGCAACTCG CCCACCACCA CACCATAGAT GCGGGGGTGT 3720 TGCCACTTTA GTAAAACACG GAATCAAACA CATTGCCCTG GAAGCAATTG TGTTTGAATC 3780 CATGCAATCT GCACCAGTAG CTGTGACCCT GGCAAAAAAT GAGACCATTG TCCTAGCTCC 3840 AATTTATTGC CCACCGCAAT ATAATTGGAG CGCTGACATG TTTGCCAAAC TTTTCCAACA 3900 TTTTGAAAAA TTGAGCGGGT CCGGGTTCAT TTTATGTGGC GACTGGAACG CCAAACATTC 3960 CTGGTGGGGC AACCAACGTG CTTGTCGACG AGGGGTAGAG CTGCTAAGAA GCATTCACAG 4020 CAACAAGAAG CTGAACATTC TTGCTACAGG TGGGGCTACA CACTTCCCAT ACACAAGTAG 4080 GAACAGGCCA TCCGCTATCG ATATTGCGGT CTACAGGGCA TTAACAATGA CCGGACTACA 4140 AACACACTCT ACAATCGATC TAGAATCGGA TCATCTACCT ATACATATAG GCCTCAGAGT 4200 AGGCCGATTC CCGTATAAAC AAACAACAAT GCGATTGCTA CCGATAAATG CAAACGTAAA 4260 AAAATTTCAA AACCATTTAG ATAAAAACGT TCGTCTTAAT ACAGAAATAA TATCGGGACC 4320 CGATATTGAA GACGCCATCG ACATATTAAA TAAAAACATA TATAATGCTT CCTTAGCAGC 4380 AACACACCCA CCTCGTAGAC ATCAACAGCA ACAAAGACAA GGCACAAATA CAAATCGCAT 4440 GAACCATGGC CGTTTTAAAT TAACAAACGA AACAATTAGA CTTTTGGCAA TAAAACGGCA 4500 GCGCAAGAGA GAACATATGG TTATGCGAAC ACCTCTCACG CGAAGCCGAC TTAGCCAGGC 4560 CCAAAACAAA CTAAAAAAAG CGCTTCGTAT GGACAAGAAG AAACAAACGA ACAAAATGTT 4620 TGAACAAATA GATGCCACTG ATCGCTACAA GATACAAAAG CTTTGGCGCA TCACAAACAA 4680 CATCAAAAGG CAACCAGAGC CAAATTGGCC ACTTAAAATA CAAGCGACTG ACAACAACAA 4740 CAGTCGCAGC AATAGAACAC AATGGACTAA AACTAGTAAA GAAAAGGCTG AGGTCTTCGC 4800 TGCATACCTG GAGCAACGTT TCAGCCCCAT ATTCTCAAAT ACGGCTGAAT ACAGACTCCA 4860 GGTGAATAAT GAGATAAACA CTAGACAGCA AACTAACGAT TGCGACAGCG GCGCTGGCGT 4920 CGCTGGTGTT ATCCCTTTTC GCCCAATTAC ATATACTGAG GTGACAAAAG AGATTGGCTG 4980 CTTGACAATA AAAAAGGCTG CAGGCATCGA TAACATAGAC AATAGAGTCA TTAAAGCACT 5040 ACCAAAGAAA GCAATACTGT ACCTTGTCAT GATATACAAC TGCATACTCA GACATGGCCA 5100 TTTCCCGCGA CAGTGGAAAT GCGCTGCCAT AAAAACAATT TTAAAGCCTG GAAAACCCAC 5160 AGAAGATGTC GCCTCATATC GACCAATAAG CCTGCTTGCA GGTTTCTCCA AAATCTTTGA 5220 AAGGCTTCTC ATGTACAGGC TCTATGAATG CTCTGAATTT GCAAAGGCCG TGCCACTACA 5280 TCAATTTGGC TTTCGCAAAG ATCATGGCAC TGAGCAGCAA CTAGCACGGG TAACCCAATT 5340 CATCCTGAGT GCATTTGAAA AGAAACAATA CTGCTCAGCC GTCTATATCG ATATACGCGA 5400 AGCATTTGAT CGAGTATGGC ACGAAGGATT GCTCTTGAAC TTGGCAAGAT ACTGCCTAAT 5460 AGGCTGTATA ATGGTTCTCA AGAGCTATCT ACTTGATCGC ACATTCATCG TACATGGAAA 5520 CGAGGGCATT AAATCTAGAA TAGGCAAAGT TAGCGCTGGA GTGCCCCAGG GGAGCGTACT 5580 TGGGCCAATT CTATATATCA TCTACACATC AGATATGCCA TTACCACACA TTGCAACACC 5640 TCCCAATCCA ACAACAACAG CAGCAACAAT AACAAATACA GAAACGTCCA TAACTAACCC 5700 ATACAACTGC AGCATGCTAC TGTCAACGTT TGCCGATGAC ACAGTCATAA TGAGCTCTGC 5760 TGACATACTG CAAACATCAT ACAGTGCAAA TCAAACGTAT CTGCGCCAAT TCACCATTTG 5820 GTGCAATCGC TGGTGCATCC AAATTAATGA CAGCAAATCA GCTCATGTAG TACATACTCT 5880 ACGCAATTTA AACAGCCGAA TGAATTACCT GACTCCACGA CTCAATGGGC AAGAAATCCC 5940 CTCTAAACCA AGGCAAAAGT ACCTAGGTGT CCATCTGGAC CGAAAGCTCC ACATGCAACA 6000 CCATGTTACG CAGCTACGAT GCCGTCTTAA AGCACTCTAT AATAAACTAG AGTGGCTAAT 6060 AGGAAACAAA AGCGTACTCT CTGTAGAATG CAAAATCACT ATCTACAAGC AGATGATTGC 6120 GCCGATATGG AGATATGCTT TGCCGATCTG GGGTGCCATG ATCTCTGACA CACAGCTACG 6180 CCGAATAGAA TCCACACAGA ACTGGATTCT AAGGAAGATA GTGAAGGCCT CCTGGTGGAC 6240 TAGAAACAAA GACATACGTG ATACTTACGA CATAGGCACA GTGGACGAAA TATTCCATAC 6300 TACAAGCAAA AGATTCGCCG ACTCACTTGC TATTCATCCT AATGCCAACG CTCGTAAGCT 6360 AATCGCAAGT CCGTATGTTC CAATTCGTCT TGATCGGCAA AGGTATAGTC TGCAACTACA 6420 ACAACATGTT CGCCCTCTAC AGCAACTAGT GCAGCAATCG CAAGAACAAC AACAGCTGCC 6480 AACTCTGCTT CGCATGGAGC TTGAAGAAGA AGAAGCAAAC TCGCTCAGAA TGCAACAGCA 6540 GCAGCAACAG CAGACCCCAC AACAACAGCC AGCCAGATTC AGCGAAATGA GGATAAATTC 6600 GCTAAGAAGG CACTACAGAG AGGGACGAAT TACGCTCGAA GAGCTAAAAC TAGCAATTAG 6660 GGAACAGCCA TTGGTGATTC AGCAATTAGT ACTGCCTCGC GAGCTGGCTA TTCAAATATA 6720 CCAGCAGCAA CAACAGCAGC AGCAACAGCA ACAACAGCAG CAGCAACAGC AACAACAGCA 6780 GCAGCAACAG CAACAACAGC AGCAGCAACA GCAGCAGCAA CAGCAACAAG AGCAGCAGCA 6840 ACAACAGCAG CAGCAGCAAC ACCAGTACCA ACAGCAGCAT CAGCAACAGC AACAACAGCA 6900 AATACAGCAG CAGCAACAAC AGCTCGAGGG ACTCCAGCAT CAGCCAAATC AGTCGCCTGA 6960 CGTCGAAATA GTGCTAGATA CACCACAGCA GCAACAAGAG CAACAACTAG AACGGCCACT 7020 TCCACAACCT CAGCAAGCCA ATTTACAACA GCAACGGCTA ACTGAGGACG AGCAGCAGCT 7080 TGAGCACCGA CTTGAAAGAC TCAGGCAGCA ACTTGAGCGC GAGATGGACG AGGCTGCAAA 7140 TAACGAGCCA ACTCAACAAC AGCGAACACC AGAGCAGCAG CAATCAAACC AACAGCAACA 7200 CACAATAGCC TTAACTGAAA ATAAAAATAC AAATATTAAT CCAAGCAACT CAGTCTGTGC 7260 CAATATAACA AATTTTTCTA TTTCACAGAG CATAATAATA GCCCCATACA GCCTCATAAA 7320 CATTGCCACA GATGCTCCTT GGCAACAAAA TCAGCCCACA ACCACACAAC AGCTCCGCCA 7380 GCAGCACTCG AACCTTCACC AGCAGCACCT ACAGCAACAA CGGGAGGTAG TGGAGGCGCC 7440 CAGCAGCAAC AACGCACAGT CAACAGTCAG GGGGGCTAAA AGAAGAAGAA GCAGTTCAAC 7500 AAGCTGTATT GAACCCAAAA GACGCTCACC TGCAGCTCCT GACAACCAGC AACAGCGGCA 7560 ACTCCACACT CTACTCGGCA AAAGGCACTC GCAACAACGT CAAGCATGCC TTCGCAGAAC 7620 ACGGCTGGGT ACATGGCAGC TGCAAATGCC CCACAGGCTG GGAATATGGC TCCTACTCTC 7680 CTGCTTTGGC ATGCAAACAC ACCATAAGGT GCCCCAGAAG CTTGTGTGGG CACCTCCAGC 7740 AGACTACGGT CAATCCTCGA CGAACGGCAG ACACCTCCAA ATTGCCAGAA GGCGCCCGTG 7800 GCACTCCTGG CAGCGGCTCC AACGTCGATG GTCCAATTGC GGGCGGCAAA CACACAGGAA 7860 GGCCAGGCCG TATAACATAT TAATCCATGA GACCTGAAAT AGATAAATAA TAAGTAAATT 7920 TGCACCAACT AGAATTATAA ATTCCAACAA ATCCTTAAAT TCGCCCCTCT GGCAAACTAT 7980 GTTTACAATT TAACTCAATC AATATGTTAA CTCAGCTGTT CATAACGCAA TCGAAAGCAG 8040 CCCTGATAAC ATTAACAGAG GTATGTAATA ACAGAACCAT TGCAGTAATT GCTCAACTGT 8100 TCTGAACGCA ATCGCCAACA ACCCTGATAA CGTTAGACAG AGGCACTGTT AATAACAGAA 8160 ATAGTACCGC CAACGTGCTA AAAATTCAAA TAAGCCTGCT CAAGCATAGT ATTAAGTAAT 8220 GTTAAATGTA AAAAATAACT GTTAAATGTT TATTTATCGT AATTCGTCAC AATTTTCTTT 8280 TATGTTGTGA TATGCATGTA AAATTTTGTT AAAATATGTC ATACATGCCA GTAATTTGTC 8340 AGCTAGTCCG TTTCACAATT AGACCTATTT CGTTCCAATT CTGTCTCATA TATTGCATTG 8400 GAACTAGAAT TAAGCAATGA AAAAGACAAT AAAATGCGAC TACTAGCTTA ATCGCTAGAA 8460 GTTCGCCGTA GCGCACGCAT GCGCAGCAAT TTAAAAATTA 8500 // ID HETAVIR standard; DNA; INV; 6610 BP. XX AC AY369259; XX DR FLYBASE; FBgn0067468; Dvir\Het-A. XX FT source AY369259:7671..14072 FT SO_feature five_prime_UTR ; SO:0000204:1-1274 FT SO_feature three_prime_UTR ; SO:0000205:3999..6342 FT SO_feature CDS ; SO:0000316:1274-3998 FT /protein_id="AAQ75089.1" FT /translation="MSDPNTTPTTHLSAEPTISPGLLSNLSGLLLSPILTMPASMNGL FT LSISVDSLAPATALSPQALISTASETPAQKGQEQKRAAGKETENTVSETTTTSSITAK FT TTATNTNIESIHIPADAQAQLAHFKRLIAQLELDTTPPNSQAADNFTSYYDEHSSSDS FT IVNIMDSDDAPCTPKPAQPLPMSTFRHHTPTSVVATKTSTQVISPTYAAILAGNAANQ FT KPKNSTCISSSSDSSSNNAPGIGGPGGKITTTATTGASHYKLLHPTGKTANQIRKNNA FT RKSNRQNSAISRLSQQDAQRRNQFANRFTLLSESDFEAPDTHQSQQLPAKTNNLAVPL FT QAALSPVKQSTTTTTTPTNTRAALTSIHFATAATAVNATPSNSPDCQQQQQAGAINVS FT APAAKHYKPPQICIQHYNQPDQNGIDSVIAKLNGQNPPIIDYGLKLGGPGILRILPKT FT LETYSKIINVLNTDNSINYNTYQRREERSFRVVVRGIHASTSTDAIRNELTCMGYTIR FT NVYCPKYKNRSGPGTYQPNIFFVELAPDSTKNRSIFEIRNLCKYVVRFEWPKIDGKNL FT PQCHRCQRFNHTARYCRHPARCVKCGNEHLTQTCVKPANVPATCANCGSDHTANYKGC FT PLYLDLLQAKLLSLPNSKNISPNVRQPQPKLQRRQKQPQRQQSAQLQLQQQQQQQKQQ FT QVQLQRQLPKPKLINNKNNKKSKQNGGQIGTQPIQNQKLQTSTPAGVPRKSTTPITNN FT NINTANNANNSNPPTVSSSNSLDPRSRWARIQSLQQQQQQQRQLERPDSVQTREQIIS FT RQRQMLENWSIRQQATSANQPVTQQQSTPMQVDVEQRQLLPSPQQQQLYQTPSSNDLR FT NDILLQMANKQQEQLDGLMGSVLILQQQLQGVLRLGADSTPLAVPWSNSSQ" XX CC Annotation from Elena Casacuberta (6 April 2004). XX SQ Sequence 6610 BP; 2218 A; 1712 C; 1013 G; 1667 T; 0 other; TAAATCAATA AATTGCTATA GCTATATATA TTTTCAATTA CTTATTAAAT ATACATATAT 60 ACCTAGCATA TATATTGCTA CAAACTGTCC GTCTACGGAC AATCATACCC TCTTAGCTAA 120 AAAGAGAGCA ATTTAATCAA AAATTTATTA CCTATAAATA AATTAATACA TAACCAAAAA 180 TAACAATTAA CCCTTCAAAT CTCTAAAAAC AAGGTATTAC ATCAACTTTT GCCATAAAAC 240 TTCAAGCCCA AAGGTGACCA ATCAGATTAA CCTAAAAATT TCTAGCAATT TCTCAGCAAA 300 AGTTTATGCC AATATATGAC GTGTGTGTGT GTGGGCAATT GTTCTGATTT TTCTTTGCAC 360 ACCTATACAA TCGCTAAAAT AGCTGCTGTC GCGCAACTTG ACACACACAC GCATACACAC 420 ACTTATTACA GTTCATAAAT TACACATATA TACATCTATC AAAACATATA CATATATACT 480 TTCACATACA TATCTACAAC TCCTGTCAAA ACACATATAT AAACAAACAT ATATATCTAC 540 AGCAAACACA TACAAACTCA CTTATATACA TTACTTCCCC TCTTTTTTGA CGTCCATATA 600 AGTGACGCGC GCATAGAAAC CTGACGAAGC TGAGAGCAGG CGGAACTGTA GCTAGTCGAC 660 ATTCTGGTTT GTGAATTGAC TAAATCGGTC AACTTGGTGC GAAACCAAAA GCACACACTC 720 ACAAACAAAT CACAACAAAG CGCCATTTAG TCGTTTCATC CACCCATCTC CATTTTGCAC 780 TTTTCCTATT TCATATAAAC TCAACCTTTG CAAAAACCAA CAAAGAGCTC AAGCTTTTAC 840 ACGTCCCCTT TGTAGGCTGC TCTTAATTTT TCGTAAAGGT TGACACAAAC AAAATCAAAA 900 CTGGTGCTTC ATCACCAATA AACTAAATAA ATTAAAACTA AATACTTACC TTAATCAAAC 960 CTATTAAAAA ATTGAAGCTT AATCAGGTGT TTCGCCACCA ACAATATTAA GCTACTATCA 1020 CACATACACT TACTTATACT CACCTTATCA CACTTCTCAT ATACTTACCT ATACTTACCT 1080 TATCACACCT TCATATTACA TATATTCATA CTTACCTATA CTTACCTTAT CATACTCATA 1140 TATAATTATA ATTTCATACC GGTGTGCCAT CACCAATCAC AAACTATATA TAATATATAT 1200 ACTTACCTCA ACCATATACT CACCGTATAC CAATACCTAC CTTTCACCCA TATTTACCAA 1260 TTACCACTAT CAATATGTCA GATCCAAACA CTACGCCCAC TACTCACCTG TCCGCCGAGC 1320 CTACAATATC GCCAGGACTC TTGTCAAACT TATCAGGCCT ACTGCTCTCG CCGATACTGA 1380 CAATGCCGGC ATCAATGAAC GGTCTACTAT CAATTTCAGT AGACTCATTG GCGCCAGCAA 1440 CAGCCCTGTC CCCCCAGGCT CTCATATCGA CAGCCAGTGA AACACCTGCG CAAAAAGGAC 1500 AAGAACAGAA ACGCGCTGCG GGCAAAGAAA CAGAAAACAC TGTTTCGGAA ACGACAACAA 1560 CATCATCAAT AACGGCAAAA ACAACAGCTA CAAACACGAA TATCGAGAGT ATTCATATAC 1620 CAGCTGACGC TCAGGCACAA CTAGCACATT TTAAACGCCT AATTGCCCAG CTTGAGTTGG 1680 ATACTACTCC TCCAAACAGT CAAGCTGCCG ACAACTTTAC CAGCTACTAC GATGAACACT 1740 CCTCTAGTGA TTCAATCGTA AATATTATGG ATAGCGATGA TGCCCCTTGC ACACCAAAGC 1800 CAGCTCAACC GCTGCCGATG AGCACCTTTC GCCACCATAC TCCCACTTCG GTTGTTGCTA 1860 CAAAAACCTC CACACAAGTC ATTTCGCCAA CATATGCAGC CATTTTGGCT GGCAATGCTG 1920 CAAATCAAAA ACCCAAAAAT TCAACGTGTA TAAGCAGCAG CAGTGACAGC AGTTCCAACA 1980 ATGCACCCGG TATTGGTGGT CCCGGCGGCA AAATCACCAC CACAGCCACT ACCGGTGCAT 2040 CTCACTATAA GTTACTGCAT CCCACAGGCA AAACAGCCAA TCAAATTCGC AAAAATAATG 2100 CACGAAAGTC AAATCGCCAA AACTCGGCAA TTTCTCGCCT ATCCCAGCAA GATGCTCAAA 2160 GACGCAACCA ATTTGCAAAT CGGTTCACAT TATTATCTGA ATCCGACTTT GAAGCGCCAG 2220 ATACACATCA GTCCCAACAG CTGCCGGCGA AAACTAACAA TTTGGCCGTG CCGCTACAAG 2280 CTGCCCTGTC ACCTGTCAAA CAGTCAACAA CTACGACAAC AACTCCAACT AACACAAGAG 2340 CAGCATTGAC AAGTATCCAT TTCGCCACTG CCGCCACGGC CGTCAATGCT ACTCCTTCTA 2400 ACTCTCCCGA CTGTCAACAA CAACAGCAAG CTGGTGCGAT CAATGTTTCT GCCCCTGCGG 2460 CGAAACACTA CAAACCGCCA CAGATATGCA TTCAGCATTA CAACCAGCCA GATCAAAACG 2520 GCATCGATTC TGTAATTGCC AAATTAAACG GACAAAACCC GCCCATAATT GATTATGGAT 2580 TAAAATTGGG TGGTCCTGGC ATATTAAGAA TCCTGCCAAA AACTTTAGAG ACTTACTCTA 2640 AAATCATAAA CGTCCTCAAT ACGGACAATT CAATCAATTA TAACACGTAC CAGAGGCGTG 2700 AGGAGCGCTC ATTCAGGGTT GTGGTGCGGG GCATTCACGC CTCCACAAGC ACCGACGCAA 2760 TCCGGAACGA GCTGACCTGC ATGGGCTATA CGATTCGCAA CGTATATTGC CCAAAATATA 2820 AGAATCGCAG CGGCCCTGGT ACATATCAAC CGAATATATT CTTTGTGGAA TTGGCTCCCG 2880 ATTCCACAAA GAATCGATCT ATATTCGAAA TCAGAAATCT TTGCAAATAC GTCGTCCGCT 2940 TCGAGTGGCC TAAAATAGAT GGCAAGAATT TGCCACAGTG CCACCGATGC CAGCGCTTTA 3000 ACCACACGGC CCGTTATTGC CGCCACCCTG CTCGCTGTGT GAAGTGCGGT AATGAGCACC 3060 TCACACAGAC TTGCGTCAAA CCGGCAAATG TGCCAGCCAC TTGTGCAAAC TGCGGCTCGG 3120 ACCATACAGC CAATTACAAG GGCTGCCCCT TGTATCTGGA TCTGCTACAA GCAAAACTGT 3180 TGTCCCTGCC AAACAGCAAA AATATCAGTC CGAATGTTCG GCAGCCTCAG CCGAAATTAC 3240 AGCGGCGGCA AAAGCAGCCT CAGCGCCAGC AATCTGCTCA ACTGCAACTG CAACAGCAGC 3300 AGCAACAGCA AAAACAACAA CAAGTTCAGC TGCAGCGCCA GCTGCCAAAG CCTAAGCTAA 3360 TCAACAACAA AAATAACAAG AAATCAAAAC AAAATGGTGG ACAGATAGGA ACGCAACCAA 3420 TACAAAATCA AAAATTACAA ACTAGCACTC CAGCTGGAGT TCCCCGAAAA TCTACCACAC 3480 CAATCACAAA CAACAATATC AATACGGCAA ACAATGCCAA TAACTCCAAT CCCCCCACCG 3540 TCTCCTCATC CAACTCCTTG GATCCACGCT CTAGATGGGC GCGGATCCAA AGTTTACAAC 3600 AACAACAACA GCAGCAACGC CAACTTGAGA GACCTGACTC TGTTCAAACC AGAGAACAGA 3660 TCATCTCAAG GCAGCGCCAG ATGCTAGAAA ATTGGTCCAT CCGGCAACAA GCCACCTCCG 3720 CCAATCAACC CGTCACACAA CAACAATCCA CGCCAATGCA AGTTGACGTT GAGCAGCGAC 3780 AGCTACTACC ATCACCACAA CAACAGCAAC TTTACCAAAC ACCCTCCTCC AACGATCTGC 3840 GAAACGATAT ACTGCTTCAA ATGGCCAACA AGCAGCAAGA GCAGTTAGAC GGTCTGATGG 3900 GCTCGGTGTT GATATTGCAA CAACAATTGC AAGGAGTGCT CCGCCTAGGC GCAGACTCTA 3960 CACCGCTCGC TGTACCATGG TCAAACTCAA GCCAATAATT TGAATGAATT CAGAGCCCGT 4020 TTTCATTGGC TCATCTACTG CTCAATCAAA ATTAAAATCC TGGAAAAAAT AGCAACTTTT 4080 ACTTACCTCC AACTTACCTT TTATTAGTTA ATTTCCCAAA CCAAATCTTT ATTTTAGTAC 4140 TTACCCGTTA CTTACCTCCT TATTATTTTT ACCTACCTTA TTTACTTACC TTTTTAATTT 4200 ATCCTCAAGC AGCTATTATT TTCCTAAAAA TATACCAACC TCATCACAAA TTGCATTTCT 4260 CAATGCAACA CAACACAATC ATCAATGCAA TTGTCAACAA GTTGCAAAAT ACAAAAATGC 4320 CACAACAAAT TGCCAGTCAT GTTCCGCTAT GTGCGCCTCT GTGTCTGCTC GCTCACAACA 4380 CTCCAGCTTT TCCAATGCTC GAATACAGTG AGCTTATGCA CACACACGCG CACAATGAGC 4440 GCGCTTCCTA TTCGTCAATG CGCTGGCGCA CACTAAAACA TAAGCAGACG AAACAAGATG 4500 TCTGTTCCAC ACTTGCCCAC CCCATGACCA ATGGCAACAG AAGCTGCTAT TTGCATCCCT 4560 AACCGGCCTA CGAGGGAAGA CGGGCAACTT CCTTTTCAAG AAAAGCTCCA GAAATCCACA 4620 TTTGGCAGCA TCAAACATCA TCACTCCTTT ACACACACAC ACACACACAA TCACATCGAG 4680 AAAAAGCAAT GAACACAAAC ACAGCACACA ATATGCTTAT CATTGCCTCT TCACACCAAT 4740 GAATCTGCCA ATTCAAAATT CTACTTACTA AAAACTTCTC AATCTCAACC ATCTTTTTTA 4800 TCCCGCATCG CAAATATTTA ATCATAGCCA ACACGCATTT GATAACAGCC AGTGCAAATA 4860 TATTTAAATT ATTGAAATTT AAATGAACTG AGAGAAGGTA CTCGGCTAAA ATAACATCAA 4920 TTTTCTTATC AACTGCGGTG TTATTTATCT GCTTTTGATG TGGGAAACAG AGGATAGGGA 4980 GATTTTTTAA ATTGGGAAAA CAAAAAACCA ACAATCAACA CAGGACGGAC ATGACAAATC 5040 ACAGTTGTGA ATTGTGGAGT GGCCCTGTGT GTTCTCTGTT CATTTGCCAA CCCTATGTCT 5100 CGGGCAATGC AGACACACAC ACCATACAAT TGGGAACACA CACACACTGT ATATTCATGC 5160 GCCGAGTCCA ATTCTCTATC AATAAGAAAC AAGAAAACAT TTCTCCTTAT TTCGTACATC 5220 AAACACAGAC AAATCACAAC GAAAAATTAT TATTTTCATA AACTCCATTA AATAAATTCT 5280 CTCATCATTT TGCTGTACTC TTATCAATAA TGCTGGCTTA TGGCGAACCA GATTTGCTGA 5340 CGCCGCACAC CACTCCAATT GTATACTTCT TCTATACACT TGAATTTTTA TTTACTAAAA 5400 ATATAGAGAC ATATCATTTT AGCTCTTATC ATAAAGTTAA TTTTTTGCAC AAATGGTATG 5460 AATATTAAGT TGGCAAATTC ATATAATTTT GTAAAACATA CACGACACAC ACAAATTCCT 5520 AAACATGTAT ATACACATAA ACATATCACA AAACACAAAA ATTTAAAATG TATAGAATTA 5580 AGGAAGTATA TATCAATAAA CTCTGGAGGG GGGATTGGTT CGGTGTGTGA CGGTGGGAGG 5640 GGGGATACTT TGTACCGCTG CAGGATAGTG GCCTGTGTGT GGGTGTGCAT ACCCGTGATG 5700 GTTTGGAGCG GTTGACGAGC ATGAGCTAGT AGGCACCTTT AATGCTGCCT CGCCATGTCT 5760 CGAGCCCTCC AGGCCACCAC GACCATGCAT ACTCAAACAC TCGCTCGCTC TCTGTGCGAT 5820 GCGAATGCTC GATGCCCGAC GCTCGGTACT CGGTACTCGG TACACGATGC TCGGTGTCCT 5880 CGATCCCGTT TCCCATTCGT CTCATCCGAC CCGGTCTCCT GCTCAACCTA AAACAGAAAA 5940 AATTGAGTCT CTTTTTGGTG CCTGAAGTGA ACGGACGTGC TCGCGGAGAA ATAAAAAGTA 6000 AAAATCATAT AAATTCGGCA GCGCGCGCCA AAACGAACCG AACATAGTAT ATAGGGACAC 6060 GATTGCAAAG TGCAAATTGC AATACATATC CCTCGGTTTT TTCATTCGGT TTTTCACTCG 6120 CTCAATACAG TGGCTTTTTT TTCCTTGTAC ATATATATAG TGCACGTGGC CTGCTGTGAC 6180 TTTTGCACAA AGTGTCGCCA CGTGGACAAT CTAGAACTTT TGCTTTAGTT GTTCTGCTGC 6240 GCATCAGACG CTGGCGCACG GCGTGCAAAA ACACAAAAGT GGGTTATGTA TAACAAATAC 6300 ATAACTGCCT AGGGCCTTAT ATGTACTAAA ATTAACCACT ATAACTCATA CTCTCGTCAA 6360 TAAATCAATA ATAATTGATA TAACCTAAAA TAAAATTACA TATAAATCAA TAAATTGCTA 6420 TAGCTATATA TATTTTCAAT TACTTATTAA ATATACATAT ATACCTAGCA TATATATTGC 6480 TACAAACTGT CCGTCTACGG ACAATCATAC CCTCTTGCTA AAAAGAGAGC AATTTAATCA 6540 AAAATTTATT ACCTATAAAC AAATTAATAC ATAACCAAAA ATAACAATTA ACCCTTCAAA 6600 TCTTTAAAAA 6610 // ID TARTYAK standard; DNA; INV; 8444 BP. XX AC AF468026; XX DR FLYBASE; FBgn0026443; Dyak\TART. XX FT source AF468026:1319..9762 XX SQ Sequence 8444 BP; 2822 A; 2187 C; 1737 G; 1698 T; 0 other; AAAAAACATA TAAACATACC CACACAATAT AACGACACCA ATGCAACCAA AGCATTAAAA 60 ACCGAAAAAG CAGCTTCTCC CTCCCACACA TACTTACGCC AGACAAAACC AATAAAGCCC 120 GCCATAAACG CATTGCATGC CGCCCAAGAC ACAAACCCAA GCCCAGCAAT CAGCGCTGTC 180 ACTTACACAG ACAAACCCAC AGCTACTCAG AATATTTTTC CTGTCAAAAC TTTTGCAGAG 240 CTGATTAGAG AAAATGCAAA ACGCTCACCA ACTCCAATGC AAAATCCCCC TCAAGCAAAA 300 CATGACTCTG CCGCCCTCGG ACGCCCTCCG ACTGCAGCTA GAAAAAATCT AAATAAAACA 360 CTGATTTCTC CTAAAACTCC TGGGAAGCGC TGTGGGGACT GTCTTGATGA AGGCCTACTT 420 CAAACCTCTA ACAAAAAGGT TAGAATACGC GACGACTTCT CTGATGATGA TCTGGGGGTC 480 ACAAACCTAC TCTCTGAAAC ACCCTTATTC AAAAGCAAAG CAGCTATTAA GATTCGGCAA 540 GACTCGAGAA GAGAATCCCT GCAGAAGTCA GCTGAAATGG ACACAGCTCC AGCAATAAGT 600 CCCTCAAACG CAGCAGCCGA TCCCGACCTA CCGCCCTGGA AAACTGTTCC AGCTAGCAGA 660 AAACCACCAT CAATCTTCCT GTCCAATATA CAGCAGATTA TCCCGCTAAT AGAAAAACTA 720 AACTATAAAG CCGGGGTAAA TAGCTTTACT ACCAAGTCTG AACTTGGCAA CAATATTAGA 780 ATCCAGGCTA AAACGATGGA CGCCTACAAT GCAATTCAGA ATGTCCTCCT TGAAGCAAAC 840 ATTCCCCTAC ACTCTCACCA GCCAAAGAAT GCAAAGGGCT TCCAAATTGT AATTAGGCAC 900 CTCCACCAGT CAACCCCGAC CAAATGGATT GAAAGCCAAC TTCAAGACAT CGGTATAGCT 960 ACAAAATTTA TCAGGGCAAT GCAGTTTAGG GACACGAGAA ATCCTATGCG CATCCATGAG 1020 GTTGAGGTTG TACCCAAGGC TGACGGCAGC CATCTTAAGG TCCTGCTAAT AAAATCCCTT 1080 GGAGGACAAA CGGTCAAGGT TGAAAGGAAA CGGGTATCGA AGGATCCTAC ACAATGCCAT 1140 CGCTGCCAAT GCTTTGGACA CACAAAAAAT TATTGCAGAA ACCCGTTTAA ATGTATGAAA 1200 TGTGGCCAGC TGCACGCCAC GGTCTCATGC ACCAAACCCA AAAACCTTCC GGCTACTTGT 1260 GCAAACTGCA ATGGAAGCCA CGTTAGCAGC TATAAAGGAT GTCCTGTTTT CCAAGAAGCA 1320 AAGCAAAGAC TATCTATCAA CAAAATTCAA TCCCTTCACT CACAACCCAC CCACCTTCAG 1380 ACCCCCCGCA ATAAACATCC CTACCCAAAA CCCACCCACA TTCAGACGCC CCTCAATAAG 1440 CAGCCCTACA CAAACCCCCT CCCTCGCACA TTAGTAAACA ACACAAAACT ACCTGCCAAA 1500 AGAATCCAAG GAAAGAAGAT ATCGCAAAGG AATCTATCTA TAAATAAACG CTTAAACAGA 1560 ATCAGGACAT TGGACAGAAA ACCGAGGAAT GAGACAAGCC CGCCGACAAC TAGCAAAAAG 1620 GCCTTGGCCT CTCTAGAAGA AAGCAGAAAA AACCCAAATA GCGCCCTAAA CCCGGCCAAC 1680 ACCCATCTCA CTCATTTCCG CCCACCACCA TTAGCACAAA ATATTCCTAA TGACGAATCT 1740 AAGGAGGTGA GTGGGGAGCA ATACCTTTTA AATCGCATTG AAGGGATGGA AAAGAAGCTC 1800 AACAACCTTC TTGAAATCGT CACCCGCCTA CTAAGCCAAG GAAAAGACTG TCCAAAATCT 1860 CCAAAAAATC CTTTCCGAGA TCCAATCTTC GTTTAAATGC TCTTTCTAGT AACATCAGAA 1920 AGTGACGTCT CCTATGACTC GGGAGTGCAA CAGGGACATC CTTAAAATCG CTTTCTGGAA 1980 TGCTGGTGGG ATCAACAATA AAATAGATGA GCTTAAGCTG TTCATTCTAA ATATTGATGC 2040 CCACATAGTC ATAGTCACCG AAACTAGACT AGACAACAAT TCTACCAAAC TAGAGCTGCC 2100 AGGATATTTC ACATACTTAG CCCAAAATCC TGCCTCTAGC AAGAGAGGAG GAGTCGCCAC 2160 GATAGTAAAC AGTAGTCTCC GCCACATGGC CTTAGAACCG ATTGAAAAGG AATGCATACA 2220 GAGTGCCCCA ATAGTATTAC TGCCTGAAAA CAACAAACGC AGCGAAATGA TTGTAATAGC 2280 ATCTGTCTAC TGTCCGCCTT CGCTAAGCTG GTCGCCCCAC CATTTTACTG ACGTTCTCAA 2340 TTTTGCTGAG AAAACTATGG GAGGGCAGAC TAAGCTCATT CTATGTGGCG ACTGGAACGC 2400 AAAACATAGA CAATGGGGTT GTATACGCGC CTGCCAACGT GGCGCCGCAC TCTACGATGC 2460 AATTCAAGCA GACTCCATGG CTGAAATCGT CGCGACTGGC AGCGCTACAC ATTTCCCGCA 2520 CGATACAAGG AAAAGCCCGT CAGCAATAGA CTTCTCGATA TGTAAACGGC TTGGCAGGTA 2580 TGAAAAAAGA ATCTCCTCAA GTGCACACCT ATCCTCAGAC CATCTTCCCA TCTTACTTGA 2640 GATAAACCTA GATATAAAAA CCATCTCCCT GCAAAAACAA AACAACAATA TCCTCAAGAA 2700 AACAACGAAC ATTGAGCTCT TTAAGAACGT TCTAGAAAGG AAGATACTTC TAAACACTGA 2760 GATAAGGGTA GCAGAAGACA TAAATGACGC CATAAACATC TTTATTAAAA ACATCAAGGA 2820 CTCGGCTGCT GAATCAACTC CCTCCCCAAG AATTCTGATA ACCACAGAAG AAGATATGGG 2880 CAAGCTAACA GAAATAGTCA TACGCTCACA CTAGACGAAA ACACAAGCAG ATTGCTGGAA 2940 GAAAAACGTA TACAAAGTAG AATTTTTAAA GCTACTAGAA CGAACGAGGA CAAAACTAAA 3000 CTAAAAGCAG CTGAAAATCG ACTTAAAAAA GCAATTAAAA TCTTAAGAGA AAAGAGAATC 3060 AATGAGCAAA TTGAAGGAAT TGACACAAAT AACCCGGACA GAATGAGGAA AATTTGGAGG 3120 CTGCTAGATG AAGGGAAAAA AATGAATCAA CCCAACTTTC CCCTCAAATT AGAAACCAAA 3180 AAAGGCCCTA AATGGACTAA AACAATTAAG GAGACAACAG AAGCGTTTGT CTCCCACTTG 3240 GAAGGAAGAT TCAAGCCAAA TAAAATTGTA CCTGATTACC ACATAGATAA GGTTAACACC 3300 GGACTAAGAA TAATTAAGGA AAGCATGCTA ACAGAACGAC ATAATCTAAA CAAAAACCCC 3360 CATAACCAAC CCATTACGCT AAACGAATTA AATGAAGAAA TAAAAAACTT AAAGAATAGC 3420 AAAGCACCTG GTAAAGACCT TATAACAAAC CAGCTCATAA AAACCCTACC GACTAAAGCT 3480 ACCCTGTACC TTATCCTAAT CTATAACTCC ATACTTAGAT TAGGATACTA CCCTGAAGCC 3540 TGGAAACATG CACAGGTAAA AATGATCCTG AAGCCAGGGA AAAGCTCAAA CGAGCCGAAG 3600 TCATACAGGC CGATTAGTCT ACTCTCGGGA CTCTCTAAAA TGTTTGAAAG ACTACTCCTA 3660 AAAAGACTTT TCAGGGTAGA TCTATTCAAA AAAGCCATAC CACTGCACCA ATTTGGCTTC 3720 AGAAAAGAGC ACGGAACTGA GCAGCAAATA GCCAGGGTCA CCCAGTTCAT CCTCGAGGCC 3780 TTCGAGCGGA AGGAATACTG CTCAGCGGTT TTCCTTGACA TCTCTGAGGC CTTTGATAGG 3840 GTATGGCACG AAGGCCTTTT ACTTAAATTA GCTAAGATCC TACCTTACAA CCTATACATT 3900 ATACTGGAGA GCTACCTTAC AAATAGAACG TTCGAAGTTA AAGACCAAGC AGGAGAGACT 3960 TCGAGAACAG GACAAATAGG CGCAGGAGTG CCTCAAGGAA GCAATCTCGG ACCACTACTT 4020 TACTCTATCT TCTCCTCTGA CATGCCCCTC CCATATATCT ACCGCCCTTC ACCAACACAA 4080 AGAATTATGC TCTCAACATA CGCAGACGAC ACTATAGTCC TCAGCTCAGA CACACTAGCA 4140 ACTGCCACCA CAAGAAACAA CGAAAACTAC CTCAAGACAT TTTCGGACTG GGCGGACAAA 4200 TGGGGTATCT CAGTAAACGC TGCTAAAACC GGACATGTCA TTTTTACATT AAAAAACGAC 4260 TTACCTACAA ACTCAATGAA TGTGAAGATC AAGGGTCAAA CAATAAAGAA GGAAGCAGCA 4320 TCATACCTTG GCGTAACCCT TGATAGCAAG CTAACCCTTA GCTCTCACGT CACAAAGCTA 4380 TTGGGTAAAT ACTCTACAGC CTACAGAAAA TTGACATGGA TCCTAAACGG AAGAAGTAAA 4440 CTCCCTACTA AAACTAAGAT ACTGATCCTT AAATCAGTTT TATCACCAAT ATGGCAGTAT 4500 GCCATAGCAG CTTGGGGTCC CCTTGTGACA GATGCACAGA TAAGGAGGGT CCAGGTTGAG 4560 GAAAACAGAA AAATAAGAGA CATATGTAGA GCGGGAAGAT ATACGAGAAA CCAAACTATA 4620 AGGGACCTTT TTGGCGTCAA AACAGTAGAA GAATTCTATC AACAGGCTAT GCACAGGTTC 4680 TCAGAAACTA TAAAATCGCA CCCAAATATA GCTGTTCGCA GGATTCTCTC TAGGCACTAT 4740 ATCCCGAACA GACTAGAAAG AAGCAGGCAG AGGTACTTTA AAATGACAAA TGATCATATC 4800 ACGCAAAAGC AGACTGGACT TACCCTCTCA CCTAAACTCT TAAAAATCCC TGATATAGAT 4860 GACTGCAGAA CCGTAAAAAA GCGTAGCGAG AGAGAGAAAA TAAGACAAAT GCATCTAACT 4920 GAACTCCCCA CCTTGCTGAG ACTAGAGGAA GAGGAGGAAG AGCTCAAAAG AATAAAAAAA 4980 CAGGAAGAAA GGGAAAAAAG AGAAAGGGAA AACCAAAAGT GGCCTCCAGA TAGATGGTGC 5040 GAATTGGAAA TAAACCGATA TAATAAACAA TATAGAAAGG GCGACCTAAC CAGGCAGGAA 5100 GTTATAGAAA AATTCAGAGG GCAACCATTA AATGTACAAC GAATAATCCT ACCCGACTAT 5160 GAAGGGGACA TAAAATTAAA TCAAAACAAA CCAGGACAGG AGCAGAAGGC AGAAATAATC 5220 AAAACTGGCG GAAGGGGTGG CAAAATATTG AAAAGAGAGG AAAGAAATAT AAAAGGCTAA 5280 AGGCTAAGTT ACAGGTTACA TAAAAAGGGA AATCTGCTTA TATATACTAT GGTAAAATTA 5340 ACTTAACTAA TCACCTACTG GTTAACAAAA TAATTATGCC TGCATGGCAC AAGCTGCTTT 5400 CTCAAATCAT TTCTCCTGAC GCTATTGAAA ATCCATCTTT ACTTTCCAAC CGAGGGACTT 5460 GCGACTGCGG TCTTTCCGCC TTATTGGCTC CTTATGGATC CATCTGCTGC CGTATTGGGC 5520 GACACACCAG CGCTCCAACC TAAAAGAGAG ATAACATGTT TTAATTCACT TTCCTTTTCT 5580 CATAAACTAA ATCACAACAA CACCAACAGC GCATCGGGCG ACTGACAAAA GCATTAGCTC 5640 ACCAAGTCAG CAACAACAGC AGCAGCAAGA CCAGAATCAG TTGAGGAGGA GGCTTGGTGG 5700 TGTGCTGAAC ATTTCGCCGC CCACACCCAC CATTTCTGTA GGCCTGTATG ACCCTGAGCC 5760 CAACGCCGGC ATGGCAAATC CGGTCTTCCT GAGGAGGCGG GGCTCTAGTG TGAGGCGCGC 5820 TTTTGCCCCA AAAGCAACAA CGACAGCGGC AGTAGGGTCG GCGCCCCCTG CGTGACCGAG 5880 TCCATCTTAG CAATCGGTCC TTTTGGCGGT GTTATGCCGA CGCGGCGGTC GCGCTTATGA 5940 GGACTGCCTG CAATGCTTGG CCATGAGACG GCGTCATCAG CAACATTTCA ATCACGCTCA 6000 ACTGGTTGAC CGGGGGCAAC ATGCTTGCCA TGCAGCAACA ATGGTGCATG CAGTCCAGCA 6060 TGCAGCAACA ACAACAATAT TGGCAGCAGC AGCATAATGT TTCGCAGCAG ATTCCATCTG 6120 GCAACTTCCG ACGACCAGAC TCCTCGCTTG GCCCCTGGAG ACTCGCTCTG GAAAAAAAAG 6180 AAATAAGCAT GGATTAGTTT TTTCTTTCCC TCTAGCTATT GGATTAAATT TCTTTCCTGG 6240 ACAACCGGCG CCTTCATGCA GATGACGCTG GACTGCGGCG ATTTTTTCCC TCCAGCCACA 6300 GATGGGAAGC TCACATCATG GCATATATGC GGCATCGTGC TAATCAGCTC CCCCATGCTG 6360 AACGAGATCT TGATGCGGCG ATGCACTCAC TAGCGAGCAG CGGGAACAGC ATACAGCGTC 6420 AGCAACAGCA GGAAGAGACT TTGTAGCAGA AGCCTTTCGG CATTCTCCGA CGACCCGACT 6480 CCTCGATCGG GGTCAGGCAA ATTGTCCTGG ACCGACAAAT TTTCCCGGCG TTTCGTCCTG 6540 GAATAAGGAA GGAAGCACGG ATTAATTTCT CCCCTTCAGC TTACGACCCT TCTCACCAAC 6600 AGCAGCAACT ACAACAACAT CGGCAGCAGA CGACTTGCAG CTGCTTGGCT CACCTAATGC 6660 GACAGCAATA ATCCCGGCAA GGCCAGAAAC GGCACCGAGC AGAAGCACGA CAGGGTGAAG 6720 AACAACATGC CGCCCACGTC CACCAATCCG GAAGCCCTGC ATTAGGGGTC TGAAGCGCCA 6780 TTAACTGGAG GATCCGGCAG CGTATGCCTA CGCAGAAGCG GGGCGTCGTA AATCCGGTGG 6840 CCGGAATAGC GCCTTGAACA GCATGCTGTG TATGTTTGCC CCAAAAGCAG AAACTGCCAC 6900 AGCAAAGGAA CCGCAAGCAG GATCATCACT CGTGGCAAAC TGGCGTATTC CTGTTGAAGA 6960 CGCGGGACTG CGGCGCATTC TTCCTTTCCA ACCGGGGGCT TTAGCGCCTG CGGTATTTCT 7020 GGCTTTCCAC GGCTCCATTT GCTGCTGCAT CTGACGCCTC ATCAGCGCTC CTCCCTGAAA 7080 GAAAGAGAAT ATGTAGTAGT TTTCTTTCCA TTCCTTACAA TATCTCTCAC CAAACCTCCA 7140 CGACACAACA ACAGCAGCAG AAACAGGTCT AGCCCACAAT GGCAGCAGCT ACAGCAATCA 7200 GCAGAAGCAA GGCCGGCAAA AGCAACAACA AAAGCCCTCT CTGAAAGCCC GGATGTGGAG 7260 ACTGAAGCGC CGGAAACTTG AGGATCCGCC CCCGATGCAC GCCCACCAAC AATAAGCTGG 7320 AGGAGAAGTT ACTGAAGCAT GGGAAGCAGC ACCAGGGGTC GCATACGCAG TGTCAGGGCT 7380 TTGCGAACCA GATCGAATGA GGAGAGCCTG GAACAGCATA ACATCAGCGG CAGCAACAAC 7440 AGCGCCAGCA GGGTCAGCGG CGTCCTGCAT GGCTGTGTCC AAAACAACTC CAATTCAACC 7500 TGCCTGTATG ATGCAGCGCA CCACCGGCTT TCACAGGGAC CGTCTCAAAC GCTTGCAGGA 7560 GCCGGTGAAG GCCTGGGCGA AGACATGGAG ACACCAATGC GCGTCCTTTC GCCCCGGCGG 7620 CAACAGGCTT GCGCATGCAA TAAGAACAGT AAATGCCTCA AATAACAGCA GAAGATACAG 7680 ACGTCCAAAT TGGGAACACT GACGCCAGCC CTCACCGTGA ACGACTCGAC CTGTGCAGCA 7740 GCAGCAACCA TCAGCACCCA TGCCCTCACG CATCGGCAGC AGAGGGTGTT TTGGGCCCTT 7800 GACAATTCCC TGCGACTTGA CTTCTTCTTG GCACCTGGTC AATCATCACG TGCCAGCAAA 7860 CAGTGGCTTG TCACCCTGGA AGAAAGAAGA CTGGATTAGT TTCTCCCCTT CTAATACATT 7920 TTGTGTGTTG AAATGCATCA GGCGGCGACA CACTCACCAG CAACAGCAGC AACAACACCA 7980 GCGGCACCGG TAGCAGGAAA TGGATCCTCA GCAAAAACCA TCGGCACTTT CAGACGTCCC 8040 GTACTCCTCG CCTGGCCCTG GCCGGTAACA GGGGCTTGTT AGATAGAGAA GACGACGGTC 8100 ATCCGACGGA CAGCAGCCTG AAGATGGAAG CAGGCCTACG CTGCCCACCT CTCCGATGCC 8160 TGCAGCAGCA ACGGCAGCGG CTCATAAATG CAAACTGGCG CCAGCCCTCG GCTCTTCGGG 8220 CTCATGTAGG CGGTGACACA CTCACTAGCA GCTAAAACAG CAACGGCGGA ATTAGCAGAA 8280 GGCAATGTTT TGCCGCAGCT GCCAGATGGC ATACTCCTAG CTTGGCCCCT GCCCAAACCG 8340 TATTGGACCG GCAAATGATT CGACAATCCG ATCGAGTCGA CGAATTCGTC TTTTTTCCCG 8400 GTAACAACAG TATACTGAAA ACTTCCTTAA AAAAAAAAAA AAAG 8444 // ID NEOR1A standard; DNA; INV; 1757 BP. XX AC AY369259; XX DR FLYBASE; FBgn0013854; Dnet\R1A. XX FT source AY369259:1..1757 FT SO_feature start_codon ; SO:0000318:1..3 FT SO_feature CDS ; SO:0000316:<1..1134 FT /protein_id="AAF75688.1" FT /translation="VVGSSRLELETKGAQLMSIVGAWGAEAGVAVSTSKTAVMLLKGI FT LSRNRRPMVRFAGANLPYVDNYRYLGITVSERLNYQRHIASLRERLTGVVGALGRVLR FT VDWGVSPRDKRTIYAGLMMPCALFGASVWYRTTDRGVTAKKRLIRCQRLILLGCLPVC FT RTVSTVALQVLAGAPPFDLAAKKLAIKYKLKRDYPLEEGDWLYGQDLADLTSEQKMAR FT LDECMLSEWQLRWDGDVHGRVTYAFSERKFVYLRRDFRFTLRAGFFLTGHGSLNEFLH FT GRALSETPTCTCGAVSEDSLHVLCECPLYADLRDLDGLGVRNLHGVWDVSGVKETRER FT MQLLDAFADAVFTRRRNAQHVQDGQDGLDGPDGQVRRGLPVPS" FT SO_feature three_prime_UTR ; SO:0000205:1135..1757 XX SQ Sequence 1757 BP; 402 A; 417 C; 516 G; 422 T; 0 other; GTCGTGGGAA GTTCCCGCTT GGAACTGGAG ACCAAAGGCG CGCAGCTGAT GTCCATCGTA 60 GGCGCTTGGG GAGCTGAAGC TGGAGTAGCT GTATCAACCA GCAAGACGGC AGTCATGTTG 120 CTGAAAGGCA TTCTTTCACG CAACAGGCGA CCTATGGTGC GATTTGCTGG AGCAAACTTG 180 CCATATGTTG ACAACTACCG GTACCTTGGC ATCACAGTCA GCGAGCGTTT GAATTATCAA 240 CGCCATATCG CGTCGCTACG CGAACGGCTG ACTGGAGTCG TAGGGGCTTT GGGACGCGTA 300 CTGCGAGTCG ACTGGGGTGT CAGTCCTCGC GACAAGCGGA CCATTTATGC CGGACTCATG 360 ATGCCATGTG CACTCTTTGG TGCCTCGGTA TGGTATCGTA CGACGGATCG AGGTGTAACA 420 GCCAAGAAAC GTCTCATACG ATGCCAGAGA TTGATCCTGT TAGGATGTCT CCCGGTATGC 480 CGAACAGTGT CCACAGTGGC ACTGCAGGTG CTTGCTGGTG CTCCCCCGTT TGACTTGGCT 540 GCCAAGAAAT TGGCAATCAA GTACAAACTC AAGCGTGACT ACCCGCTGGA GGAAGGCGAT 600 TGGCTGTACG GTCAGGATCT GGCGGATCTG ACCTCGGAGC AAAAGATGGC GCGACTGGAT 660 GAGTGTATGC TGAGTGAGTG GCAACTCAGA TGGGATGGCG ATGTTCACGG ACGAGTGACA 720 TACGCGTTTT CCGAACGTAA GTTCGTCTAT CTTAGGCGGG ACTTTCGTTT TACACTCCGC 780 GCAGGATTCT TTTTGACTGG CCACGGATCG CTGAACGAAT TTCTGCATGG AAGAGCTCTG 840 AGCGAAACGC CCACATGCAC TTGTGGTGCT GTTAGCGAAG ACTCGCTTCA CGTGCTATGT 900 GAATGCCCAC TCTATGCAGA TCTTCGAGAC CTCGATGGAC TAGGAGTACG GAACCTACAC 960 GGCGTCTGGG ATGTATCTGG AGTGAAGGAG ACTCGGGAGA GGATGCAGCT GCTGGACGCA 1020 TTTGCCGACG CCGTCTTTAC GAGGCGACGA AATGCGCAGC ATGTCCAGGA CGGACAGGAT 1080 GGACTGGATG GGCCGGATGG ACAGGTGAGA CGTGGGTTGC CAGTCCCGAG TTAATAGCTG 1140 AAACTGACCT GTGTGGTTGG CGGTCTTAGA ATTCCACCAC AGTGCATCTC TGCTTGTCGT 1200 ATGAGACGAC TAAATGAGGT ACCTGGATTG TCAGTTCCGG CTGAAAAGCC AAAACTGACC 1260 CACCCGTGTG TTGGCGGGTC TTAGAATACC AACTCAGTAT GCAACCGCTT GTCGTACGAG 1320 GCGACTAAGG TGCACCCTTC CTGGATTACC CAGCGTCCGA GCCTATCGGG GCTAAATGGG 1380 GAGGGGATAC CGATCCCTCG ATTAATTTTT AATTAATCGA ATTCGGAACC ACGGGTTGAG 1440 ATAGTTTCTC CAAGGCTTCT CATTGAGGTC GGCCCCCTAG TGGGAGTTCG TGGTGGCTGT 1500 GTATAACGTA GCGTTGCGTT TCAACGCCGG AGAGAAACTG CTAGATGTTT CGGCCCTCAC 1560 CAAGGGTGAA TGCATACCCG ATGTATGCAT TCTATTGGCA CCTATGAGAT GGAGGCCTCT 1620 GCTAACATCA CTTAGGGGCG TTAGTTGGCG CTCAAACCAA ACTCGCTGCA CTATCTGCTT 1680 GCAGAAACGT GTACCGTGGT TGTAATCCCT TGTTGGAAGA CGCCACGTTA AATAAACTGG 1740 AGAGATCCTT TGGCAAT 1757 // ID TAKR1A2 standard; DNA; INV; 1753 BP. XX AC U23198; XX DR FLYBASE; FBgn0013903; Dtak\R1A2. XX FT source U23198:1..1753 FT SO_feature start_codon ; SO:0000318:2..3 FT SO_feature CDS ; SO:0000316:<1..1134 FT /protein_id="AAA91003.1" FT /translation="ARSISGPFIWNLLMDVLLQRLEPHCAVSAYADDLFLFVDGNSRA FT DLERKGEQLMGIVGAWGSEVGVSVSTSKTAIMLLKGSLSQQRRPTVRFAGTSLPYVIK FT CRYLGILVGERLSFLPHISALRDRLSGVVAGLARVLRVDWGLSSRAKRTIYRGLMMPC FT ALFGASVWYKAVRRGKSLKLLTSCQRTILSGCLPVCRTVSTVALQVLAGAPPMDLDAH FT RSALKFKLRKEIPLDTNDWLHGLDMTGMNWKDKMALLDERLLNEWQLRWDNAITGHVT FT REFFPEAAFVYKRKDFVFTLKAGFLLTGHGSLNAFLHGRTLSTTTACSCGEEDESWLH FT VLYECRLYNELRDLDALGIVQDQGRWIVAGVVETPERMRLLGVFADAAFSRRRTDAAD FT G" FT SO_feature three_prime_UTR ; SO:0000205:1184..1753 XX SQ Sequence 1753 BP; 390 A; 418 C; 527 G; 418 T; 0 other; CGCAAGGTCC ATTAGTGGTC CATTCATTTG GAACCTTCTG ATGGATGTGC TGCTTCAGCG 60 CTTAGAGCCA CATTGTGCTG TGAGCGCGTA TGCAGATGAT TTGTTCCTTT TCGTCGACGG 120 GAATTCCCGT GCTGATCTGG AGCGAAAAGG CGAGCAGTTG ATGGGCATCG TGGGAGCCTG 180 GGGATCTGAA GTTGGAGTGA GTGTGTCCAC CAGCAAGACG GCAATCATGT TGCTGAAGGG 240 AAGCCTTTCG CAACAAAGGA GACCGACGGT GCGTTTTGCT GGAACAAGCC TACCGTACGT 300 CATCAAATGC CGGTACCTTG GCATCCTAGT CGGCGAGCGG TTGAGCTTTC TACCGCATAT 360 CTCGGCACTT CGAGATCGGC TGTCTGGAGT TGTCGCAGGG CTAGCACGGG TGCTTCGAGT 420 CGATTGGGGA CTCAGTTCCC GCGCAAAGAG GACCATATAT AGGGGACTCA TGATGCCATG 480 TGCACTCTTT GGTGCCTCGG TCTGGTACAA GGCGGTGAGA AGGGGTAAAT CCCTAAAACT 540 CCTCACCTCG TGCCAGAGGA CCATCCTTTC GGGATGCCTA CCGGTATGCC GCACAGTGTC 600 CACGGTGGCA CTGCAGGTGC TTGCTGGTGC TCCTCCAATG GATCTGGACG CTCATCGGTC 660 TGCTTTGAAG TTCAAACTAC GGAAGGAAAT CCCCCTGGAT ACCAACGACT GGCTGCATGG 720 ACTGGATATG ACCGGGATGA ACTGGAAGGA CAAGATGGCT CTGCTAGACG AGCGTCTGCT 780 AAATGAGTGG CAACTCAGAT GGGATAACGC GATCACGGGG CATGTGACTC GGGAATTCTT 840 CCCAGAGGCA GCGTTTGTCT ACAAGAGGAA AGACTTTGTC TTTACTCTAA AAGCCGGATT 900 CTTACTGACA GGACACGGGT CGCTGAACGC ATTCTTGCAC GGCAGGACTC TCAGCACCAC 960 GACCGCATGC TCATGTGGGG AGGAGGACGA GAGCTGGCTT CACGTATTAT ATGAATGCCG 1020 GCTGTACAAT GAATTGCGTG ACCTCGATGC TCTCGGAATT GTTCAGGACC AAGGAAGATG 1080 GATCGTCGCG GGAGTAGTAG AGACCCCAGA ACGGATGCGT CTCCTAGGAG TCTTCGCGGA 1140 TGCTGCCTTT TCAAGGCGTA GGACGGATGC AGCGGATGGA TGAACCACAG GCGCTGGGCT 1200 TAGCCCAATC AACTCCTGCT GTGTGGCTAG CGGCGAAGAA TACTACCACA GCTTGTCATA 1260 GCTTGTCGTA GGAGGCGACT AATATGACAT GGGTGCCCTA TCCGAGCTTG TCGGAGCTTC 1320 AGGGGTGAGG CCTACCGAGC CTGTAATTTC GGTACCACGG GTTGAGCAGC TGTCCAAGGC 1380 TGCTCATTGA GGTTGGCCCC CATGTGGAGT ATCGTGGTGG CTGTGGTTGA TACCATATGC 1440 GGTAGAGCCT CGGTTCGACG TGGAGTTGCG TCATACACTC GGGGTCTGTG ACCCAAAGAT 1500 CAGTAGGGAT TTAGATAGAT CCCGCTCCTC AGCAAGGGGG AATGCTTGCC CGACAAGTAA 1560 GCATTCGAAT TGCTACCGGG GTGGTTGCTA TGTACATAGC TATAGCTTCT AGTCCGGGGC 1620 GTTGGTCTGG CGCTTAACCT AGACACATTG CACTATACAC TCACTTGTGG GTGTATAAGA 1680 GTGCCGTGGT TGTAATCCCT TCAGTGTGGA ACACGCCACG TTAAATAAGC TTCGGAGGGA 1740 TCCGATAGAC ACC 1753 // ID MERCR1A3 standard; DNA; INV; 3772 BP. XX AC AF015277; XX DR FLYBASE; FBgn0013836; Dmer\R1A3. XX FT source AF015277:143..3914 FT SO_feature five_prime_UTR ; SO:0000204:1..468 FT SO_feature start_codon ; SO:0000318:2..2 FT SO_feature CDS ; SO:0000316:<469..1786 FT /protein_id="AAB94026.1" FT /translation="TEEAQPRSCSARAAMKDALIMPPPDSPPRKIVACYEVSEDSDTS FT LAATITGKGSRSRLLLLPLRLYPLLHLLLSLLLPLLLLLLPMRGHRHPPPTAATVAAV FT PDAKVSSNSVVERMQHIERKLRKAILVEDVPNAVALSVLDLAAKYQELVLDMYGAMKE FT LETERRIRPQPVVTLAATTAAPAAPVAAPRIRKVAETWSAIVTSNNPEETPKQVAERV FT RKEVAPALGVRVHEVRELKRGGAIIRTPSSGEMRRVVANPKFKEVGLDVKQNAASKPK FT VMVRDVDSSITAKQFMNGMWNNHFSGRMSNVVFEKSVKITSKPWTAESGPTVNIQLEV FT DQKALDILEDHERIYVEWFSFRWHTVTPTYACYKCVSFDHRVAQCRMNEEICRQCGQA FT GHRASKCSNPVSCRNCSFKGMPSTHRMLSAACPIYGAVLARVASRH" FT SO_feature start_codon ; SO:0000318:2..3 FT SO_feature CDS ; SO:0000316:<1779..3772> FT /protein_id="AAB94027.1" FT /translation="TLIMFRVIQANCGRSRAAVIDLGVRMRDSGVTSALLQEPYVDRG FT GRITGLPAGMRVFSDSRNKAAVIVHQEVVCMPVSSLITEHGVYVSASGNFGSIFLTSV FT YCQFNAELEPYLLYMDAVLLLASRTPVIYGLDANAVSPLWFSKLPERSRGYLNRQRGE FT LLADWIQGSRAGVLNVHSNVYTFDNRRARSDIDVTIVSDSATTWAAYEWSVSEWDLSD FT HNIITIVVTLDPASTVESYAPVPSWKLHSADWRRFGDELRTASMDFPLEQFRAMTSDE FT QVTALRSLVHQVSDAVFGRQQLRAKRRVSWWTAALTDARRELRRARRRLQHARRTHSD FT SATVLASYFRIARKEYERMMLHEKRNWKRYVGEHQRHPWGSVYRICRGRKKCTDLGCL FT RWNNELVVTWAACANVLLHSFFPAAERPVDVPVPPEVPPVLDPIEVDTCIAKVRSRRS FT PGLDGITGDMVKAVCTAIPEHMTTLYTRCLADGYFPTEWKRPRVIALLKGPDKDRSDP FT ASYRGICLLPVFGKVLEGIMVNRIKEVLTDESRWQFGFRPGRCVEDAWSHVVSSVEAS FT AAKYVLGVFIDFRGAFDHVEWDAALRRLSDLGCREVGIWRSFFSDRKASIVSSFGEVN FT VDVTRGCTQGSISGPFIWNILMDVLLHRLETHCTISAYAD" XX SQ Sequence 3772 BP; 842 A; 997 C; 1094 G; 836 T; 3 other; CAGTCGCTTT CTGACGCTCA ACGCGAGCGG TTGTGTTTTT CGGCTGCGCG CATCGCAAAT 60 TCTCGTGAAT TTTGTTGTTG TTGTAAAATC AATTGTTTCG GCTCTTCCTC GAAAAGCAGT 120 GCGCGTGTAT TGGTGTGCAT ATACAGCAAG CACCGTTACC GTGCGTGTGT ATTAGTGCGC 180 GTGTCAAGCA AGCACCGTTA CTGTGCnCnG ATTGGCCAGC CAATCAGAAC TGACTGAACT 240 TTTTGAACTA AATTAATTTT TTGTGTAGCA TACTTTTGTG CGCnGACTAA AAGTAGTGCA 300 TACTTTTAGC GACGGACAAT TTTCAATAGT GAATTTTTTG TTTAACATAC TTTTGGCGGC 360 ACGAATAAAA GTAGTGAATA CTTTTAGTGC AGAATTTCCA ATAAGGAAAT AATGGATAGC 420 GACGCTAGTA GCAGCGCCGC GAGCTCGGTG CACTCTGCGC GTTCGGGTGA ACGGAAGAAG 480 CGCAGCCGCG TTCGTGCAGC GCACGAGCTG CGATGAAGGA CGCGCTCATC ATGCCACCGC 540 CGGACTCTCC GCCGCGTAAA ATCGTCGCAT GTTATGAAGT CAGCGAGGAT AGCGACACCT 600 CCCTGGCCGC AACCATCACT GGTAAAGGTA GCCGCTCCAG GCTGCTGCTG CTGCCCCTAC 660 GGCTGTACCC GCTGCTGCAT CTGCTGCTGT CCCTGCTGCT GCCCCTGCTG CTGTTATTGC 720 TGCCGATGAG AGGCCATCGA CATCCGCCGC CAACCGCCGC TACCGTCGCC GCTGTCCCCG 780 ATGCCAAGGT GAGCTCCAAC TCTGTGGTGG AGCGCATGCA GCACATCGAG AGGAAACTCA 840 GGAAAGCCAT CTTGGTTGAA GATGTGCCCA ACGCCGTTGC GCTCAGCGTG CTGGACCTTG 900 CGGCGAAGTA CCAAGAGCTG GTCCTGGATA TGTATGGAGC AATGAAGGAG CTGGAGACCG 960 AGAGGAGGAT CCGCCCCCAG CCCGTCGTCA CCCTAGCCGC CACTACTGCT GCCCCCGCTG 1020 CTCCAGTTGC TGCACCCCGC ATCCGCAAGG TTGCGGAAAC ATGGTCGGCA ATTGTGACGA 1080 GCAATAACCC GGAGGAAACT CCTAAGCAAG TCGCTGAGCG AGTACGCAAA GAAGTTGCGC 1140 CTGCACTTGG TGTACGTGTA CACGAGGTGC GAGAGCTGAA GCGGGGAGGT GCCATCATCC 1200 GCACCCCATC CTCTGGAGAG ATGCGTCGGG TTGTGGCGAA CCCCAAGTTC AAGGAAGTTG 1260 GACTTGACGT GAAGCAGAAC GCCGCTTCAA AGCCTAAGGT GATGGTCCGC GACGTGGACA 1320 GCAGCATCAC CGCGAAGCAG TTTATGAACG GGATGTGGAA TAACCACTTC TCGGGACGCA 1380 TGTCTAATGT TGTCTTCGAG AAGTCGGTGA AGATCACATC GAAGCCATGG ACAGCCGAGA 1440 GCGGACCCAC GGTCAACATC CAGCTGGAAG TAGACCAGAA GGCACTGGAT ATCCTGGAGG 1500 ACCATGAGAG GATCTACGTG GAGTGGTTCT CTTTCCGCTG GCACACCGTC ACGCCGACGT 1560 ACGCTTGCTA CAAGTGCGTC AGTTTTGACC ACCGAGTAGC ACAGTGCAGG ATGAACGAGG 1620 AGATATGCCG GCAATGCGGA CAAGCCGGAC ACCGCGCGTC GAAGTGCAGC AACCCTGTGT 1680 CCTGCAGGAA CTGCAGCTTC AAGGGCATGC CGTCAACGCA CCGTATGCTT TCGGCAGCGT 1740 GTCCGATATA CGGAGCTGTG CTCGCCAGGG TGGCCTCTAG ACATTAATAA TGTTTAGAGT 1800 CATCCAGGCA AACTGCGGCC GCAGTAGAGC TGCCGTAATT GACCTTGGAG TCCGGATGAG 1860 AGACTCGGGG GTCACGTCTG CGCTGCTCCA GGAGCCATAT GTTGACCGTG GAGGAAGGAT 1920 CACCGGATTA CCGGCAGGCA TGCGAGTGTT CTCCGACAGT CGGAACAAAG CCGCTGTGAT 1980 CGTACACCAG GAGGTCGTCT GCATGCCTGT CTCATCGCTC ATCACCGAGC ATGGCGTATA 2040 CGTGAGTGCG TCGGGAAATT TCGGCTCAAT TTTCCTCACC TCCGTATACT GCCAATTTAA 2100 CGCAGAATTG GAACCATACC TGCTGTATAT GGATGCGGTG CTGCTGCTGG CCAGCCGCAC 2160 GCCCGTCATC TATGGCCTTG ATGCGAACGC AGTATCCCCC CTGTGGTTCA GTAAGCTCCC 2220 CGAGCGCTCT CGGGGCTACT TGAACAGGCA GCGGGGTGAA CTGCTAGCTG ACTGGATTCA 2280 GGGCAGTCGA GCCGGCGTGC TTAATGTCCA CAGCAATGTG TACACGTTTG ATAATCGCAG 2340 GGCGAGGAGC GATATCGACG TCACTATCGT CAGTGATTCA GCGACTACGT GGGCTGCGTA 2400 TGAGTGGAGC GTAAGCGAGT GGGATCTGAG TGATCACAAC ATCATCACTA TTGTTGTGAC 2460 TCTCGATCCG GCAAGCACAG TTGAGAGCTA TGCTCCTGTG CCCTCGTGGA AGCTCCACAG 2520 TGCTGACTGG CGGCGTTTCG GTGATGAGCT TAGGACTGCA TCGATGGATT TTCCTCTGGA 2580 GCAATTTCGC GCGATGACAT CAGACGAACA GGTGACTGCA CTTCGCTCCC TAGTACATCA 2640 GGTGAGCGAC GCTGTGTTTG GACGACAGCA ATTGCGTGCC AAACGTCGTG TGAGCTGGTG 2700 GACTGCCGCT CTCACTGACG CACGCCGCGA GCTCAGGAGA GCCCGGCGCC GGCTGCAGCA 2760 TGCTCGTCGC ACTCACAGCG ATAGTGCTAC TGTCCTGGCC TCGTATTTCA GGATTGCCCG 2820 AAAGGAATAC GAGAGGATGA TGCTGCATGA AAAGAGAAAC TGGAAAAGGT ATGTCGGCGA 2880 GCACCAAAGA CACCCCTGGG GGTCTGTCTA CAGGATATGC CGAGGCCGCA AGAAGTGCAC 2940 CGATCTCGGA TGCCTTCGGT GGAATAACGA ACTGGTCGTA ACCTGGGCAG CCTGCGCGAA 3000 CGTCCTGCTC CACAGCTTTT TCCCTGCTGC GGAGAGGCCA GTGGACGTTC CTGTTCCACC 3060 CGAAGTGCCC CCAGTACTCG ACCCGATTGA GGTCGACACA TGCATCGCCA AGGTTCGGAG 3120 CAGACGCTCA CCTGGCTTAG ATGGCATCAC TGGGGATATG GTTAAAGCGG TTTGCACTGC 3180 CATCCCGGAG CACATGACAA CGTTGTACAC CCGTTGTCTG GCAGATGGGT ATTTCCCCAC 3240 TGAGTGGAAG CGCCCCCGTG TGATTGCGCT TCTCAAAGGC CCCGACAAGG ACAGGAGCGA 3300 TCCAGCGTCC TATCGGGGCA TCTGCCTGCT GCCTGTTTTT GGCAAAGTAC TGGAGGGGAT 3360 CATGGTAAAC CGTATAAAGG AGGTGCTCAC GGATGAAAGT CGATGGCAAT TCGGCTTTCG 3420 TCCCGGACGC TGCGTCGAGG ATGCCTGGAG TCATGTGGTG AGCAGTGTTG AAGCCAGCGC 3480 GGCTAAATAC GTGCTCGGAG TCTTCATTGA CTTCAGAGGA GCTTTCGACC ACGTCGAATG 3540 GGACGCAGCA TTACGCCGAC TATCGGATCT TGGATGCAGG GAAGTTGGGA TCTGGCGCAG 3600 CTTCTTTTCG GACCGAAAAG CTAGCATCGT CAGCAGCTTC GGCGAAGTCA ACGTAGATGT 3660 AACCCGTGGC TGCACGCAGG GGTCCATTAG TGGTCCATTT ATTTGGAATA TATTAATGGA 3720 TGTCCTCCTG CATCGCCTGG AAACGCACTG CACGATCAGT GCATATGCTG AC 3772 // ID NETR1B standard; DNA; INV; 2038 BP. XX AC AF248068; XX DR FLYBASE; FBgn0013854; Dnet\R1B. XX FT source AF248068:1..2038 FT SO_feature three_prime_UTR ; SO:0000205:1078..2038 FT SO_feature start_codon ; SO:0000318:1..3 FT SO_feature CDS ; SO:0000316:<1//1077 FT /protein_id="AAF75689.1" FT /translation="VEGQSRAEIEALAGAHLRTVCDWGNSVGVSLAMDKTTTMLLKGR FT LSASRHPSIGLNGAFLRYVTEVKYLGITFGERMCFTPHFTGLKRRLLGVVGQVRRILR FT NEWGLSRRAVRTIYNGLFVACATYGSSVWCDAVTTVVGRKKVLACQRVTMMGCMPVCR FT TVSTEAMQVLLGVPPLDLEVRRRAVLFKVKRRIPLLQGEWLADRNVESLGLSVCKKLL FT DECVMSDWQVRWDTCLNGRDTYRYIRDVTFVGSRPDFGFNLSLGFLLTGHGSLNAFLH FT QRRLSDTQECHCGLSEETWEHVLCECPSYEDLRSLSAFGVRQVRGGFDVSQALSTSDR FT VRLLNEFARAAFARRRVLTHQGIV" XX SQ Sequence 2038 BP; 429 A; 407 C; 612 G; 590 T; 0 other; GTTGAGGGGC AGTCGCGCGC TGAGATTGAG GCACTTGCGG GGGCGCATTT GCGCACTGTA 60 TGCGATTGGG GCAACAGTGT TGGCGTTAGC TTGGCCATGG ACAAGACGAC GACAATGCTG 120 CTCAAGGGCA GATTGTCGGC CAGTCGACAT CCATCTATTG GTTTGAATGG CGCCTTCTTA 180 AGGTATGTGA CTGAGGTTAA ATACCTTGGC ATTACCTTTG GTGAGAGGAT GTGTTTCACT 240 CCTCATTTCA CCGGTCTCAA ACGCCGGCTG CTTGGAGTGG TTGGACAAGT GCGTCGTATT 300 TTGCGGAATG AGTGGGGCCT CAGCAGGCGT GCCGTTCGCA CCATATACAA CGGTCTGTTC 360 GTTGCATGCG CAACGTATGG CTCGTCTGTG TGGTGTGATG CGGTTACGAC TGTTGTTGGT 420 AGAAAGAAAG TGCTGGCTTG CCAGAGAGTG ACAATGATGG GGTGTATGCC AGTGTGCCGC 480 ACTGTCTCTA CTGAGGCGAT GCAGGTTCTA CTGGGGGTTC CACCTCTTGA CTTGGAAGTT 540 CGGCGTCGGG CCGTTCTTTT TAAGGTTAAG AGGCGGATAC CATTGCTGCA GGGTGAATGG 600 TTAGCGGATA GGAATGTGGA GAGTTTAGGG CTTAGTGTGT GTAAGAAGCT GTTGGATGAG 660 TGTGTTATGT CTGACTGGCA GGTCAGATGG GACACATGTC TGAATGGGCG GGACACTTAC 720 CGATACATTC GTGATGTCAC GTTTGTGGGT AGCCGACCGG ACTTTGGTTT TAACCTAAGT 780 CTCGGGTTCT TGTTGACTGG TCACGGGTCT CTTAATGCTT TTCTGCATCA GAGGCGTCTC 840 AGTGACACAC AGGAATGCCA TTGTGGCTTG AGCGAGGAGA CATGGGAACA TGTTCTCTGT 900 GAGTGTCCTT CGTACGAAGA CCTTCGGAGT CTGAGTGCGT TTGGTGTGAG ACAGGTTAGG 960 GGCGGGTTTG ACGTCAGTCA AGCTCTCTCC ACAAGTGATA GGGTCAGATT GTTGAATGAG 1020 TTCGCACGTG CTGCTTTTGC CAGACGGCGA GTATTAACTC ATCAGGGAAT TGTATGAATC 1080 GGTTGAATGG GTTGGTATTT CGATTGTGAG TTTAAATGGA TTGACTGGTT ATTGGGGTAT 1140 ATGTATGTTG ATGGTTTGGA TCTGGGGTAC GAATGGTGGG GATTTGAATC AGAATGATGA 1200 ATGGTTTGTA TGGATTGCTG GCTTTGTGTT CGACGCAATG GGCCACCAGT CCCGAAGTTT 1260 CATGGAACTT TACTGGTAGT AGCCTTCGGG TTACAGTCTT GACCAGAGGA CGTGTCTGGT 1320 ACCACGGGCT CTTTGGGGTC ACAGAGGCGG TCTCAGCTCT CTGCCCATTT AGCTAAATCC 1380 TTCGGGATTG TTGGGGGCAT ATGCTGGCCG TTCTTTTGAA CGCTTTGATC TTGTATCAAT 1440 TATATTAATG CCACGTCTTG TGGAAGAGGT AGGAATAGAC CTCGCTCCTA ACAGGGAGAT 1500 GGTATGACCA TGAGCATATC ATCGACTGTG CCATCTGAAA ACTGATGTTA TCAGTTTTTC 1560 ATTCCTATTC CTTACAGCTA TGCTCATCTT TCTTACATTG GGCCACCAAC CCGTAACTGT 1620 TTTGGAGTTA AAATTGGAAG TGTCCTTCGT GGCACGAAGC TTGACCGGAG CTTTAATTCT 1680 GGTACCACGG GTGACCAGGT GCCCACGGAA TTTATTCCGT TCTGGTCTTT GGTTGCGGCC 1740 CTTCGGGGAG TATCGTGGTG GCTGTGGTTT ACCGTCCAAA TGCGGACAGA GCTTAGGCTC 1800 GACGTGGACC TGCGTTATAC AACCGGGCGC TGGACCACCA TAGTTCAGTA GAGGTTTTAG 1860 GTAGGCCTCG ATCCTCACCA AGGATGTCGT CACGACCAAA CAGTAGTGAG ACAATTGGCA 1920 CTTGTGGTTC GCCATGAGGG CGCTGATTGA CGCTTAAATT AATCCAGTGA ATGCACCGTG 1980 AATGCACCGT GAAATTAAGC CAATGTGGCA GGAGTTCACG TAAACACACC GACTTTCA 2038 // ID GEM standard; DNA; INV; 1730 BP. XX AC AJ131629; XX DR FLYBASE; FBgn0026463; Dsub\GEM. XX FT source AJ131629:1..1730 FT SO_feature terminal_inverted_repeat ; SO:0000481:1..128 FT SO_feature terminal_inverted_repeat ; SO:0000481:982..1137 XX SQ Sequence 1730 BP; 459 A; 368 C; 400 G; 503 T; 0 other; ACTACATCTG CAACTTAGCG GTTATTTGTC AAATTTTACA TTTTTTCTTC ATCTGTCATC 60 TACATCAACA ACACTACTCA CGCCAACACG CTCCTTTAGC TCGCCACCCT CCCCTAGAAA 120 CACACACTGC AGAGTCAGGG CAGAGTCGCG GCAGAGGCAG CGGCAGAGAC GTGTCAGGGG 180 CAGAGGCCAA TAAACTGAAG TATCTTTTGG GGGCTGCTTT TGTTCTCAAA AGTATAGCAT 240 ACTTTCAGGC TGAAAAGTTC CCAATTACAA AATAGCGTAG GACAGCCATA TCCTGCAGTC 300 CTTGTGGTCC TTGATGGACT CAAGCGATAC AGGAGACTCG TACCTTCGCC AGGAGGCATG 360 CCATGTCCTG ATCCGACAGG AAACAGCCGA GTTATAGCGT TGAAGGGGAT TATATAGAAA 420 AACAATATTC AAATGGCATT ATCCTTAATG TCCTAGCACC CGAAGTGATC CGGGACATTT 480 TCCCGATCCC GAGGAGGCGT GACATGTCCT TTTTCGACAG GAAACAGCTG AGCTATAGCC 540 TTGTAGAGAT TAAAAAGAAA AAACAACGTG AAAATGGCAT TATCCTTAAT GTCCTTGCAC 600 CCGAACTGGT CGCAAAGGAT CCAGGACATT GTGCCGATCA CGAGGGGGCG TGGCACATCC 660 TGGACGGACT ACAAATAAAG GAGAAAACAA CAAAACAAAT CTCTCCTGTT ATTTTTGTCA 720 AATTTTGACA GTCACCTCTT TGCACAGTGG CGCCCACACG ATGAAGACGG TTATTTCTGG 780 AACTAAAATA ACGTAGTTCG AGAATTGGCC GATCTGTGTT GGGACGCCAC TGTGAACTTT 840 TGAAATTGCT TCTAATTTTA GTTTCTTGGC GACTGAAAAA TGAATTGCGG TTTCATAGCA 900 TCCAGATTTT GACACAGTGA AAAAACATGG AAACCAGTTA CCTTCTGCTG ACTTCTGTTC 960 GCCTGGTACT TTGTCTGAAA AGGACAGACA GACATGGCTC AATCGACTCG GCTATTGATG 1020 CTGATCAAGA CTATATTATA CTTATGGGGT CGGAAACGTT TCCTGCTGTG CGTTACATAC 1080 AACCGTTATC CGCACAAATA CAATATACCC TATTTACTCT TCGAGTACCG GGTATAATGA 1140 ACTAATTTAG CGGAATGTTC GTGTTGAGTG GGTGGCTGAT GATGTGGCTG TGACGTCATG 1200 GCCATAAGGT TAACAGCCCA CTTAGCATTA TGCGGCTATT TGGTAATGTG CCACCCAATA 1260 ATAGACCAAC GAAATTGGCC CAACTGACCT TACGCTGCTC TTCGATTGCA ATTGGCTTTT 1320 ATTTCTCTCC TTCCACAATT GTTTGCTTAC TTTGTGTGTG TGTGTGTGTG TGTGTGTGTG 1380 GTGTGCCTTG AAATTGGTTA ATAAATTATT AAAGCCTTAC CGACTAAGAG GGTCATGACT 1440 TATGGTTTAG TTCCTAACTT AAAAGTAGGC GTATGCCACA GCTAACACTA ATTCTAGAGG 1500 AGCTGACGGC AACTGCGGTT AAAACAATAG TAGCCCGGTC GGGCCTTTAA TTATTAATTT 1560 CCTGGTGCAT TTCATTTTTG TAATGCACTT TTTTTGGGCG CTACTTCTAT TACTTTGACC 1620 GCAGTGTCCG GCCCTGTGGG GTTCTGTGGG ATTGTGTATC GATTAGCTTA TGGCACGAGT 1680 GGGTTAGTGT GGGGCCAAGT ATTTGGATTA GTGCAGTGTG TGTTTCTAGG 1730 // ID VIRUVIR standard; DNA; INV; 6564 BP. XX AC AY369259; XX DR FLYBASE; FBgn0067460; Dvir\Uvir. XX FT source AY369259:1107..7670 FT SO_feature five_prime_UTR ; SO:0000204:1..1230 FT SO_feature three_prime_UTR ; SO:0000205:3907..6564 FT SO_feature CDS ; SO:0000316:1231..3906 FT SO_feature start_codon ; SO:0000318:1..3 FT /protein_id="AAQ75088.1" FT /translation="MIGLRIGIWHANGLSNKTDELEQFIIRHNVDLMLISETRFNFNS FT RVQIPGYSIITANSPQTHLCIGGAAILISNHIEYQALPGISLPQLQCAIVRLRTDLGG FT ANIASCYWPPNHAVLADDYVCLFNQLGENFLIGGDWNAKHRLWGNMRRCPRGSILANI FT LMDSNKYNVLTTAEPTHFPSNSNRPSVLDFGIYAGISSHRLAISRVTELHSDHLPLLV FT QLKITINGQSLWNNPNLIHRSRPRRRNRHRLLTERSNLEIFHQKLESSINLNIEIDCI FT NDIDDMLENFMSKLHIAAEASNSRSAMASIIPPHQNSSNNTLNNENNNNVNLDLSSPQ FT PLPLSAFIRTDEHRELMRLKRRLRKRWARTRQLDHWLEWQRVGRQLANQLEQQRRDYV FT DYVLSNADPQKKGAFNLWHATKYLKRQPHAQPSVRNLNGHWCQSAEEQAEAFADELQA FT RFTPFNLAPSGQCERVKRTLNQQVNLYDNTLNAINPQYPILSSHIRHVTLCELNTYIG FT HLQMRKAPGLDQIDNYLIRSLPQKARLFLLLLFNGMLRLRYFPSAWKCAAVRMILKSG FT NRSAANLNSYRPISLLSTISKLFEKVIYERLNDELQSPLESESELECSRVIPNHQFGF FT RSGHGTIQQVHRIVEHINNSFERGHCTSGAFLDLQQAFDRVWHDGLLFKLRTHTSEPL FT YQLLKSFLSNRSFCVLSTNPTTNGNRGADACHSTLRPMVAGVPQGSVLAPLLFNIFVS FT DMPCIATHIQQVFQPPPGQTNTSIIGLTATYADDTAFLCSAMNAVVATTILQGHMHRF FT VKWANNWNIQINDNKSVHIIFTLRRQIANAGSLTTQLTINNNIIPAKSYVKYLGLNLD FT KKLNWARHARLQHKLCAKNCSNTNGYYLPANPDYH" XX SQ Sequence 6564 BP; 2169 A; 1676 C; 1040 G; 1679 T; 0 other; AAATCAATAA ATTGCTATAG CTATATATAT TTTCAATTAC TTATTAAATA TACATATATA 60 CCTAGCATAT ATATTGCTAC AAACTGTCCG TCTACGGACA ATCATACCCT CTTAGCTAAA 120 AAGAGAGCAA TTTAATCAAA AATATATTAC TTATAAATAA AATAATACAT AATCAAAAAT 180 AACAATTAAC CCTTCAAATC TATAAAAACA AGGTATTACA TCAACTTTTG CCATAAAACT 240 TCAAGCCCAA AGGTGACCAA TCAAATTAAC CTAAAAATTT CTAGCAATTT CCAGCAAAAG 300 TGTTGTGCCA ATATATGACA TGTGTGTGTG TGGGCAATTG TTCTGATTTT TCTTTGCACA 360 CCTATACAAT CGCTAAAATA GCTGCTGTCG CGCAACTTGA CACACACACG CATACACACA 420 CTTATTACAG TTCATAAATT ACACATATAT ACATCTATCA AAACATATAC ATATATACTT 480 TCACATACAT ATCTACAACT CCTGTCAAAA CACATATATA AACAAACATA TATATCTACA 540 GCAAACACAT ACAAACTCAC TTATATACAT TACTTCCCCT CTTTTTTGAC GTCCATATAA 600 GTGACGCGCG CATAGAAACC TGACGAAGCT GAGAGCAGGC GCAACCGTAG CTAGTCGACT 660 TTCTGGTTTG TGAATTGACT AAATCGGTCA ACTTGGTGCG AAACCAAAAG CACACACTCA 720 CAAACAAATC ACAACAAAGC GCCATTTAGT CGTTTCATCC ACCCATCGCC ATTTTGCACT 780 TTTCCTATTT CATACAAACT CAACCTTTGC ATAAACCAAC AAAGAGCTCA AGCTTTTACA 840 CGTCCCCTTT GTAGGCTGCT CTTAATTTTT CGTAAAGGTT GACACAAACA AAATCAAAAC 900 TGGTGCTTCA TCACCAATAA ACTAAATAAA TTAAAACTAA ATACTTACCT TAATCAAACC 960 TATTAAAAAA TTGAAGCTTA ATCAGGTGTG TCGCCACCAA CAATATTTAA GCTACAATCA 1020 CACATACACT TACTTATACT TACCTTATCA CACCTTCTCA TATACTTACC TATACTTACC 1080 ATTATCACAC CTTCATATTA CATATATTCA TACTCACCTC ATCACACCTT CATATCTTTT 1140 ATATTTACGC CCTATAAATA TAAATTTATC TCTTTCTGGT GTACCATCAC CAAAAACCAT 1200 ACTTTACTTA CCTTCACACA AACTGCTCGC ATGATTGGCC TCCGTATTGG CATTTGGCAT 1260 GCCAATGGTT TGTCAAACAA AACTGACGAG CTAGAGCAAT TTATAATACG TCATAATGTC 1320 GATCTAATGC TTATAAGCGA GACTCGCTTC AACTTCAACT CTCGTGTCCA AATCCCTGGA 1380 TACTCCATCA TAACGGCCAA CTCACCCCAA ACTCACTTAT GCATTGGTGG TGCAGCCATC 1440 CTAATCTCAA ACCACATCGA ATATCAAGCT CTTCCAGGCA TATCACTGCC ACAATTACAA 1500 TGTGCTATTG TGCGCCTCCG TACAGATCTG GGTGGTGCCA ACATTGCGTC CTGCTACTGG 1560 CCCCCCAATC ACGCAGTCTT AGCAGACGAC TATGTCTGCT TATTTAACCA ATTGGGTGAA 1620 AACTTTCTGA TTGGAGGGGA TTGGAATGCC AAGCATCGCC TCTGGGGCAA CATGCGACGT 1680 TGTCCTCGTG GCTCGATCCT TGCAAACATT CTAATGGACT CCAACAAATA TAACGTACTC 1740 ACAACGGCTG AGCCAACACA CTTCCCATCT AATAGCAATC GTCCAAGTGT ACTCGACTTT 1800 GGCATCTATG CAGGTATCTC GAGTCATAGG CTTGCCATAA GTCGAGTCAC AGAGCTCCAT 1860 AGCGACCATC TGCCCTTGCT AGTCCAACTC AAAATCACCA TTAACGGTCA AAGTCTATGG 1920 AATAATCCTA ACCTAATTCA TCGGTCACGC CCCCGCCGGC GAAACCGTCA CCGTCTACTC 1980 ACCGAACGCT CAAACCTGGA GATTTTTCAT CAAAAATTGG AATCATCCAT CAATCTCAAT 2040 ATTGAGATTG ATTGTATAAA TGACATCGAT GATATGCTCG AAAATTTTAT GTCCAAGCTG 2100 CACATTGCAG CCGAAGCGTC AAATTCTCGA TCAGCCATGG CATCCATTAT ACCCCCGCAT 2160 CAAAACAGCT CCAATAACAC ACTCAATAAC GAAAACAATA ACAATGTCAA TCTCGACTTA 2220 TCGTCACCTC AACCATTACC ACTGTCCGCT TTTATCAGAA CAGACGAACA TCGCGAATTG 2280 ATGCGACTAA AACGCCGTCT CCGTAAACGC TGGGCACGCA CGCGTCAACT TGATCACTGG 2340 CTTGAATGGC AGCGGGTTGG CCGACAACTG GCCAACCAGC TTGAACAACA ACGCCGTGAT 2400 TACGTTGACT ACGTGCTGAG CAATGCAGAC CCTCAAAAGA AGGGCGCCTT CAACCTGTGG 2460 CATGCCACAA AATATTTAAA ACGACAACCC CATGCTCAAC CCAGTGTTCG CAACCTCAAC 2520 GGTCACTGGT GCCAGTCTGC CGAAGAGCAG GCTGAAGCAT TTGCCGATGA GCTACAAGCA 2580 CGTTTCACAC CATTCAACCT TGCCCCATCA GGACAATGCG AACGTGTGAA GCGCACACTC 2640 AATCAACAAG TCAACTTATA TGACAACACT CTAAACGCAA TCAATCCTCA ATATCCCATT 2700 CTATCATCAC ACATACGCCA TGTAACACTA TGCGAGCTAA ATACTTACAT TGGGCATCTA 2760 CAAATGCGTA AAGCACCGGG GCTCGATCAA ATTGATAACT ATTTGATCAG GTCCTTGCCT 2820 CAAAAAGCCA GACTTTTCTT GTTGCTACTA TTCAATGGCA TGCTTCGACT ACGGTATTTT 2880 CCATCAGCAT GGAAATGTGC CGCCGTCCGG ATGATATTGA AAAGTGGCAA TCGGTCAGCT 2940 GCAAACCTCA ACTCTTACCG GCCAATAAGC CTATTATCTA CGATCTCGAA ACTATTTGAG 3000 AAAGTGATAT ATGAGCGGCT AAATGACGAA TTACAATCTC CACTAGAGAG CGAATCTGAA 3060 TTAGAGTGCT CCAGAGTGAT CCCGAACCAT CAATTCGGTT TCCGCTCTGG TCACGGCACC 3120 ATCCAACAAG TGCATCGGAT TGTCGAGCAC ATAAACAACT CATTTGAAAG GGGCCACTGC 3180 ACCTCGGGAG CTTTTCTCGA TCTGCAGCAA GCTTTTGATC GTGTCTGGCA TGACGGTTTA 3240 CTATTCAAAC TGCGCACACA CACGAGTGAA CCCCTCTATC AATTACTCAA ATCATTCCTA 3300 TCAAATCGTA GTTTCTGTGT ATTAAGCACA AATCCGACAA CAAATGGCAA CAGAGGTGCC 3360 GACGCCTGCC ACTCGACACT ACGACCAATG GTGGCCGGTG TGCCGCAGGG CAGCGTGCTG 3420 GCTCCACTAT TATTCAATAT ATTCGTCTCA GACATGCCAT GCATTGCTAC GCACATACAA 3480 CAAGTGTTCC AACCGCCGCC CGGCCAAACA AACACCAGCA TAATTGGTCT AACAGCCACC 3540 TACGCCGATG ACACAGCATT CTTATGCAGT GCCATGAACG CAGTGGTGGC CACCACCATT 3600 CTGCAGGGTC ATATGCATCG ATTTGTTAAA TGGGCCAATA ACTGGAACAT TCAAATCAAT 3660 GACAACAAAT CGGTGCATAT TATCTTTACC CTAAGGCGCC AGATTGCCAA TGCTGGCAGC 3720 CTAACAACAC AACTAACAAT AAACAACAAT ATCATACCGG CCAAGAGCTA TGTCAAATAT 3780 CTGGGTCTGA ATTTGGACAA AAAATTAAAT TGGGCACGTC ATGCCCGTTT ACAGCACAAA 3840 TTGTGCGCAA AAAACTGTTC AAACACAAAT GGCTACTATC TGCCAGCCAA TCCAGATTAT 3900 CATTAAAAAA CAAATTGCTT ATTTATCGAC AGATCATAGC CCCCACCTGG CACTACGGCA 3960 TTTGAACTCT ATGGCTGTGC AGCTGACACG CATCTGCGCC GTCTACAAAC TGCACAGTCG 4020 CGAGCTCTTC GCCTCATTAG CGGTGCACCA TATTATATTA CAAATCGCAC ACTGCACCGA 4080 GTAATGAAAG TGCCATCAAT ATGGGACCAG CTCAATCTAA ACGCCAGTCG GCACAACGAC 4140 CGACTGGAAA AACACACTAA CAAAATGGCT AGCACGCTAT TGCTTTCAGG TGGACACCGA 4200 GCCCAACGCC GTCTCCGACG TATCTGGCCA GGCATTGAAT TAATTCAACG CCAAGCGCCC 4260 GCCAACCTTT TTTAAATAAC TACACTAATA ATCAAAAATG TGACACCCCT CATGGGACAC 4320 ACATTCACCC ACGAAAAATA CTTTTTCCAA ACATCATAAA TAAACACTTT AGTTTAGGTA 4380 ACTCACTCAC CTTATCCCAT GGCGAGCCCA TAAGAAACAC TTGTCACATG GATTCAAAAT 4440 AACGGCAAAT ATATTATGCC GCAATTATTT TTGATGACAG CCGCTAAGCC GCAGCTAATG 4500 ACAACAAGAA ACAGCAGCAG AAGAAGAAGC AGCGGCTAAA ACAGCAGCAG AAGCATTGGT 4560 AGCAGACGAA GACAAAACGG CAGCAGAATG CAACAGCTAC ATCAACATCA ACTATAAGCA 4620 CGCCCAAATA ACCATACCAA TACAATCAAA TCAGAAATTA CACATGTAAT ATACACAATC 4680 ACATTCATAC CAAATATAAT ACAAAAATAA TACATTCTCA ATCAGCTGTA ATACACACAC 4740 TCACAATACT CACATAAGCT ATAATACACC TATTAGATAA AATACACATA TTTACTCTAA 4800 CACTGGCATA CACATATGCT CACACTCGAA TCAAACAAAA AACCAAATTT TACTCACTCA 4860 TCAAAAAAGA CAATGAATAT CAGGTAAAGT TATATCACCC GATTCCCCTC CCCCACCCAC 4920 AAATCATAAT GCCGATTGCT GGCGGCAACC AAATTACTGC CGCGAGAGAA GGCATTTCAA 4980 AAACGGCCTG CTGCCTAGCT ACAAATATAT ATATATATAT GCGTACTCAT ACACATATAT 5040 GTACATATTC TAAAAATAAT CCCCTTTATC CCACCAAAAC AATTGATAAC GCAGTGGCAA 5100 TGTCAATGTG TCAAAATTGC ATAGTCTACG AATGATGACA GCAGTATCAA TTTTTGCTAA 5160 AAACTAATTA ATATGCTGAA TACATCTGGA GACAATTTGC ACGCACACAC ACAAATCCGC 5220 TCGCATCAAT ATACTCCCAC ATATCACAAC AAATTTACAT CTCACAACAT TCTAGCAAAT 5280 ATAAACAATT AATTTCTCTC AAACATACAT TCTTACAAAA AAGATACTCT ATCAAAACTG 5340 CTGGCTTATG GCGAACTAGT TTTGCTGATG CCGCACTTTC ACCCTAAATT ATATATTACA 5400 CTAAACAATA ATCATAAATA ATCTTAACAA AAACTCTACC ACATCAAGCC CATACAAAAA 5460 ATATGGAGCA TTCACTTGAA GACGTGTGCA ATCGCGACAA ATGAATTATT CACTAAGTCA 5520 GTTCGGCCAC TTAATACCCA TATGTATACT CATTTGTCTA TACTGCGCAC ACACACACTC 5580 TCTTCACAAC TTATGCCAAA TTACACACAA CCACCACAAA ATATAACTTT ATAACACTTT 5640 CAAATCAAAA CCTCATGCAC ACATATACAC ATAAAAATTG TATAGCTTAA GAAATACTTT 5700 TTAAGAAATC AAAATTTATA ACGTTATGCA CAAATAGTAA TATATAATTT TAAAGTCAAT 5760 AAACTCTGGA GGGGGGATTG GTTCGGTGTG TGACGGTGGG AAGGGGGATA CTTCGTACCG 5820 CTACAGGATA GTGGCCTGTG TGTGGGTGTG CATACCCGTG ATGGTTGGAG CGGTTGACGA 5880 GCATGAGCTA GTAGGCACCT TTAATGCTGC CTCGCCATGT CTCGAGCCCT CCAGGCCACC 5940 ACGACCATGC ATACTCAAAC ACTCGCTCGC TCTCTGTGCG ATGCGAATGC TCGATGCCCG 6000 ACGCTCGGTA CTCGGTACTC GGTACACGAT GCTCGGTGTC CTCGATCCCG TTTCCCATTC 6060 GTCTCATCCG ACCCGGTCTC CTGCTCAACC TAAAACAGAA AAAATTGAGT CTCTTTTTGG 6120 TGCCTGAAGT GAACGGACGT GCTCGCGGAG AAATAAAAAG TAAAAATCAT ATAAATTCGG 6180 CAGCGCGCGC CAAAACGAAC CGAACATAGT ATATAGGGAC ACGATTGCAA AGTGCAAATT 6240 GCAATACATA TCCCTCGGTT TTTTCATTCG GTTTTTCACT CGCTCAATAC AGTGGCTTTT 6300 TTTTCCTTGT ACATATATAT AGTGCACGTG GCCTGCTGTG ACTTTTGCAC AAAGTGTCGC 6360 CACGTGGACA ATCTAGAACT TTTGCTTTAG TTGTTCTGCT GCGCATCAGA CGCTGGCGCA 6420 CGGCGTGCAA AAACACAAAA GTGGGTTATG TATAACAAAT ACATAACTGC CTAGTGCCTT 6480 ATATGTACTA AAATTAACCA CTATAACTCA TACTCTCGTC AATAAATCAA TAATAATTGA 6540 TATAACCTAA AATAAAATTA CATA 6564 // ID DTEII standard; DNA; INV; 5386 BP. XX AC M28878; XX DR FLYBASE; FBgn0013017; Dtei\I-element. XX FT source M28878:1..5386 FT SO_feature CDS ; SO:0000316:190..1479 FT /protein_id="AAA74494.1" FT /translation="MTDPPDIHNFTAKSYQSQLGNPKFIVIKRKDNNSLERISPFIIK FT KSVDFACGGEVEVCKRTRDGSLLIKTKNELQAIKLLKLTTMADVEVTATEHKTLNFSK FT GVIYCNDLRYIDEDTILQELKPQKVTEVKKIMKRQNPNSNSDTNNTTLIETGLIIITF FT ESHKLPEILRIGYETVRVRDYIPLPLRCKKCLRFGHPAPICKSLETCTNCSEIKHTND FT GETCTNEKNCLNCRNNPEINHQHSPLDRKCPTFIKNQELTAIKTTQKVDHKTAQRIYF FT ERHGFQLRDSYAKTLTNGTTQIKTNTQAPNIDANTHTIQSQNQNSYITPTSTPHTTSS FT KTTTTGPAKTTPLSNQPQQRHHHHCYDEPEDMDTYHPTDHPTFTPYSSQHTEDLKIKI FT YSKDKSNNLSTNLKASKIKAKALNKKKHTNNSDSESI" FT SO_feature CDS ; SO:0000316:1559..5263 FT /protein_id="AAA74495.1" FT /translation="MSLTVIQWNLKGYVNNYSQLLILIKKYSPHIISLQETHIQHTNT FT IPTPINYKLLTNIATNRFGGVRILVHKSIQYTTLNITSDPEAIAINIQSKIKLNIFST FT YISPTKNISDQTLQNTFNIQQTPSLITGDFNGWHPSWGSPTTNTRGKITQRFIDNTHL FT ILLNDKSPTHFSTHNTYSHIDLTLCSPILAPHANWKILNDLHGSDHFPIITTLFPTTN FT TQKFNRPFFKLKEANWDQYSAHTQQINEKYPTSQNVNKEAAQINRIILYSANLSIPQT FT SPNAHPTGFHGGTSTSTNCESKNNLRWKKLNRTITLENIIDYRRKNAKFRYELKKRKK FT EASSSFTSTIHPTTPSSKIWTNIRRFCGLNPIKQIHAISSPENNETTLASNEIANIFA FT QHLSELSGDWNFSEEFRNNKYRNNLHNYTPSPIAQTIEKNITYLELISALQTLKGSAP FT GLNRISYQMIKNSSPTTKHRITKLFNEIFNSHIPQAYKTSLIIPILKPNTDKTKTTSY FT RPISLNCCIAKKLDKIIAKRLWWLVTHSNLLSENQFGFKKGKSTSDCLLYVDYLITKS FT KMHTSLVTLDFSRAFDRVGVHSIIHQLQEWKTGPKIINYIQNFMINRKIIVRVGPHTS FT SPLPLSNGIPLGSPISVILFLIAFNKLSNIISLYKEIKFNAYADDFFLIINFKKNTNI FT SLDNLLNDIGNWGSYSGASLSLSKCQHLHICKKHHCTSKISCNNIQIPTVTSLKILGI FT TINNKYKWNTHIYSLLPKLYNKLNIIKCLSSPKFNCNTLSLLNVAKATVIAKLEYGLF FT LYGHAPKSILNKLKTPFNSAIRLALGAYRSTPINNLLYESNIPSLEMKRDLQIAKLSQ FT NLSFCKNRPIHKFVRHKKYKKKTLSIIDQTIKLSLELNLPYKPIKLHKYKPSWDLPNL FT IDTSLRGHKKQETSPESYRKLFEHTKNKLKPHSFIFTDGSKINCIITFAITTDTNILK FT QGILPPYSSVLTSETIAILEAIELIKTRRGKFGIWFDSLSAIDSIKNPNNNSFYPNRI FT RSLITQLAPKIKIMWIPGHSGIIGNELADQAAKLASNMPLIVTPNINNTDIKRHLKAE FT LATKQKENIINCNQWYQSLNTNNTHTCDYLKQTHQNWTRLDQIKIIRLRLGHTNITHQ FT HYLNPNPITVCPFCQGDLSISHILNSCPSLIQTKQAIFRTLPLDLLSKPNPENIQKIL FT VFLKKTRIIPPNFKKKKQMYNNKHSNHLTN" XX SQ Sequence 5386 BP; 2183 A; 1430 C; 614 G; 1159 T; 0 other; CAGTACAACT TCAACCTCCG AAGAGATAAG TCGTGCCGCT ATGTTTAAAG CCTCGCTTCG 60 GCCACAAGCC CTAAACTCTT ATCAGCAAAA CTTTGCACAA ACAAATATAA ACCACAAAGA 120 TAAACAAAAA AACACAACAA CAAAAACACC AAGACCTATA ACTCGGGCTG AAGCCTTTAA 180 CTAACAATCA TGACAGACCC ACCAGACATT CATAATTTCA CCGCAAAATC TTACCAATCC 240 CAATTAGGCA ACCCAAAATT TATTGTCATT AAAAGGAAAG ATAACAACTC GCTTGAAAGA 300 ATATCACCTT TCATCATTAA AAAATCTGTG GACTTTGCCT GTGGAGGAGA AGTTGAGGTA 360 TGCAAACGTA CAAGAGACGG CAGCCTATTA ATAAAAACTA AAAATGAACT ACAAGCCATA 420 AAACTCCTTA AACTAACAAC AATGGCAGAT GTGGAAGTAA CAGCAACTGA ACATAAAACA 480 TTAAACTTCT CTAAGGGAGT TATCTATTGC AACGACCTTA GATATATCGA CGAAGACACA 540 ATTCTGCAAG AGTTGAAACC ACAAAAAGTA ACTGAAGTAA AAAAAATAAT GAAACGACAA 600 AACCCCAACT CTAACTCTGA TACGAACAAC ACCACCTTAA TTGAAACCGG CCTGATCATT 660 ATAACATTTG AATCACACAA ACTCCCTGAA ATATTACGAA TCGGATACGA AACAGTCCGA 720 GTACGAGACT ATATTCCACT TCCACTCCGA TGCAAAAAAT GCCTTCGCTT CGGCCACCCA 780 GCACCAATAT GCAAAAGCCT AGAAACCTGC ACCAATTGCT CAGAAATAAA ACACACAAAC 840 GACGGAGAAA CATGCACAAA CGAAAAAAAC TGCTTAAACT GCAGAAATAA CCCAGAAATC 900 AATCATCAAC ACAGCCCACT TGATCGCAAA TGCCCCACTT TTATTAAAAA CCAGGAACTA 960 ACAGCCATTA AAACCACACA AAAAGTAGAC CATAAAACTG CTCAACGTAT CTATTTTGAA 1020 CGACACGGCT TCCAATTGCG AGACTCCTAT GCCAAAACAC TCACAAACGG CACAACACAG 1080 ATAAAAACAA ACACTCAAGC ACCTAATATC GACGCAAATA CACACACAAT CCAATCGCAA 1140 AACCAAAATT CGTACATCAC ACCCACATCA ACACCACATA CCACTTCATC GAAGACAACA 1200 ACAACTGGAC CAGCCAAAAC AACTCCACTA TCAAACCAAC CGCAACAACG CCACCATCAT 1260 CACTGCTACG ACGAACCAGA AGACATGGAT ACTTACCACC CCACTGACCA CCCCACATTT 1320 ACGCCATACT CATCACAACA TACAGAGGAC CTAAAAATAA AAATTTACTC AAAAGATAAG 1380 TCTAATAACC TATCTACTAA CCTCAAAGCA TCCAAAATAA AGGCCAAAGC TCTAAACAAA 1440 AAGAAACACA CTAACAACAG CGATAGCGAA TCCATATAGA ATCTTAAGCC CTACACAGAA 1500 CTTAAACTAA ACCGTCAACA CCACCCCTAA ACTAAGTTAT AAGCTTATCT CCTCCCAAAT 1560 GTCCCTTACT GTAATCCAAT GGAACTTAAA AGGATACGTA AACAACTACA GCCAACTCCT 1620 TATTCTCATT AAGAAATACT CCCCACACAT AATTTCCCTC CAAGAAACCC ACATTCAACA 1680 CACTAATACC ATTCCAACTC CAATAAACTA TAAGCTATTA ACCAATATAG CCACCAATAG 1740 ATTTGGGGGC GTACGCATAC TAGTTCATAA GTCAATCCAA TACACTACCC TCAATATCAC 1800 AAGCGATCCA GAAGCAATAG CCATAAACAT ACAATCTAAA ATAAAACTAA ACATATTTTC 1860 CACATACATC TCCCCTACCA AAAACATTTC TGACCAGACA CTCCAAAACA CATTTAACAT 1920 TCAGCAAACA CCATCTCTAA TTACGGGAGA TTTCAATGGA TGGCACCCGT CTTGGGGCTC 1980 CCCAACAACA AATACACGAG GGAAAATAAC TCAAAGATTC ATCGACAATA CACACCTTAT 2040 TCTACTAAAC GACAAATCTC CCACACACTT TTCAACACAC AATACTTACT CACATATCGA 2100 CCTCACACTC TGCTCTCCAA TCCTAGCCCC TCACGCAAAC TGGAAAATTC TAAACGATCT 2160 TCATGGCAGC GACCATTTTC CTATCATTAC GACCCTATTC CCAACGACTA ATACACAAAA 2220 ATTCAACAGA CCCTTTTTTA AACTGAAAGA AGCCAACTGG GACCAGTACA GCGCACATAC 2280 ACAACAAATA AACGAGAAAT ACCCTACCTC CCAAAACGTA AACAAAGAAG CCGCCCAAAT 2340 TAATAGAATT ATTTTATATA GCGCAAACCT TTCAATCCCG CAAACCTCAC CAAACGCACA 2400 TCCTACAGGG TTCCATGGTG GAACAAGCAC CTCAACCAAC TGCGAAAGCA AAAACAACTT 2460 GCGTTGGAAA AAATTAAACC GTACAATCAC CCTAGAAAAC ATTATTGACT ATAGACGTAA 2520 AAACGCAAAA TTTAGGTACG AATTAAAAAA AAGAAAAAAA GAAGCTTCCA GCTCATTCAC 2580 CTCAACCATC CATCCCACTA CCCCCTCATC CAAAATATGG ACCAATATAA GACGTTTCTG 2640 CGGACTTAAC CCAATAAAAC AAATTCATGC CATATCCAGC CCAGAAAACA ACGAAACGAC 2700 ACTAGCTAGC AACGAAATTG CTAACATATT CGCACAACAC CTTTCTGAAC TCTCCGGCGA 2760 CTGGAACTTT TCAGAGGAGT TCCGAAACAA TAAATACAGA AACAACCTAC ACAACTATAC 2820 CCCCTCTCCA ATAGCCCAAA CCATTGAAAA AAACATAACA TATCTCGAAC TCATCTCAGC 2880 GCTACAAACA TTAAAAGGAT CAGCTCCAGG ACTCAATAGA ATTTCATATC AAATGATAAA 2940 AAATAGCTCC CCCACAACCA AACACAGAAT AACAAAACTA TTTAATGAAA TATTTAATAG 3000 CCACATACCA CAAGCCTACA AAACAAGCCT AATCATCCCA ATCCTCAAGC CTAACACCGA 3060 CAAAACTAAA ACTACGTCTT ACCGACCCAT TTCTCTCAAC TGCTGTATAG CAAAAAAACT 3120 AGATAAAATT ATAGCCAAAA GACTCTGGTG GTTGGTGACA CATAGCAACC TACTTAGCGA 3180 AAACCAATTT GGATTTAAAA AAGGCAAATC GACTTCGGAC TGTCTACTCT ATGTAGACTA 3240 CCTCATAACG AAGTCAAAAA TGCACACCTC CCTCGTCACT CTTGACTTTT CAAGAGCCTT 3300 CGATCGAGTA GGTGTGCACT CCATAATCCA CCAGCTGCAG GAATGGAAAA CAGGCCCAAA 3360 AATTATAAAC TACATACAAA ACTTTATGAT TAACAGAAAA ATAATTGTCC GTGTCGGTCC 3420 GCATACATCA AGCCCACTAC CCCTATCCAA CGGAATTCCC CTTGGTTCAC CCATATCCGT 3480 AATACTCTTT CTCATAGCTT TTAATAAATT ATCTAATATC ATATCCCTAT ACAAAGAAAT 3540 AAAATTTAAC GCATACGCCG ACGACTTCTT CCTTATAATA AATTTCAAAA AAAACACAAA 3600 CATCAGCCTA GACAACCTAT TAAATGATAT AGGGAATTGG GGCTCCTACT CAGGGGCATC 3660 GCTATCCCTA TCAAAATGCC AACACCTCCA CATATGTAAA AAACATCACT GTACTTCGAA 3720 GATAAGCTGC AACAACATCC AAATTCCTAC CGTTACTTCC CTAAAAATTC TAGGTATAAC 3780 CATAAACAAC AAATATAAAT GGAACACACA CATATATTCA CTCTTACCCA AACTCTACAA 3840 TAAGCTAAAC ATAATAAAAT GTCTATCCAG TCCCAAATTC AATTGCAACA CGCTCTCTCT 3900 ACTTAATGTA GCAAAGGCAA CGGTCATCGC CAAATTAGAG TATGGCCTGT TTCTATACGG 3960 CCACGCTCCC AAAAGCATTC TAAACAAATT AAAAACACCA TTTAACTCCG CAATCCGTCT 4020 AGCCCTCGGA GCCTATCGCT CTACCCCAAT AAATAATTTA CTCTACGAAT CAAATATTCC 4080 CTCTTTAGAA ATGAAACGAG ACCTTCAAAT AGCCAAGCTA TCCCAAAACC TAAGCTTTTG 4140 CAAAAACAGG CCAATTCACA AATTTGTTAG GCACAAAAAA TATAAAAAGA AAACACTATC 4200 AATAATAGAC CAAACAATCA AGCTCAGCCT AGAACTTAAC CTACCCTACA AACCAATAAA 4260 ACTCCATAAA TACAAGCCAT CATGGGACCT CCCCAACCTA ATAGACACAT CACTCAGAGG 4320 CCACAAGAAA CAAGAAACAT CCCCAGAATC ATACAGAAAA TTATTCGAAC ACACAAAAAA 4380 TAAACTAAAG CCACATAGTT TCATATTCAC CGACGGTTCA AAAATTAACT GCATTATAAC 4440 ATTCGCCATA ACAACTGACA CAAACATCTT GAAACAAGGC ATACTGCCTC CATACTCATC 4500 CGTCCTCACC TCCGAAACAA TTGCCATACT AGAAGCAATA GAACTAATTA AAACCCGAAG 4560 AGGTAAATTT GGTATCTGGT TCGACTCCCT ATCAGCAATA GACTCAATCA AAAACCCGAA 4620 TAACAACAGC TTTTACCCAA ACCGAATAAG ATCCCTAATA ACACAACTTG CCCCTAAAAT 4680 TAAAATAATG TGGATTCCTG GCCATTCAGG AATAATAGGA AATGAACTTG CCGATCAAGC 4740 TGCAAAATTA GCAAGCAATA TGCCACTAAT AGTCACCCCA AATATAAACA ACACAGATAT 4800 AAAAAGACAT CTTAAAGCTG AACTTGCGAC AAAACAAAAA GAAAACATTA TAAACTGCAA 4860 TCAATGGTAC CAATCTCTTA ACACAAATAA CACACACACC TGCGATTACC TTAAACAAAC 4920 TCACCAAAAT TGGACCAGAC TCGACCAAAT AAAAATAATA CGACTTCGAC TAGGACACAC 4980 AAACATAACC CACCAACACT ACCTAAATCC TAATCCTATA ACAGTTTGCC CTTTTTGCCA 5040 AGGAGATCTT TCAATAAGCC ACATACTGAA CTCATGCCCA TCCCTTATAC AAACCAAGCA 5100 AGCCATATTT AGAACACTAC CCCTAGACCT TCTTAGCAAG CCCAACCCAG AAAACATACA 5160 AAAAATCCTA GTTTTCCTAA AAAAAACTAG AATTATACCA CCAAATTTTA AAAAAAAAAA 5220 ACAAATGTAC AACAACAAAC ACAGTAATCA TTTAACCAAT TAGATTTAAA AAATAAATTA 5280 ATAAATAAAA AATTGTAAAC CAACATAGCC ATAGGCCCAG CAGCTAGTGC TATATTACTT 5340 ATAGTTAGTT TAGTTTTGTA AACTGCTCTA TCTATAATAA TAATAA 5386 // ID OSV standard; DNA; INV; 1543 BP. XX AC AY089271; XX DR FLYBASE; FBgn0063755; Osvaldo. XX FT source AY089271:1..1543 FT SO_feature CDS ; SO:0000316:172..1527 FT /protein_id="AAL90009.1" FT /translation="MGVGAATVRFRNLVQKGPAEHRCRRTFKTTSAGDEPKNELEVHE FT QPREQQGCRCFEEMRRKVKQPPQKFPDYLEEDGKLYRHIAHRACNEEVASWKMCIPIG FT QRQRVMTENHDMPTAARYYWPGMHRGVRKYVRNCECCMMYKPSQLQAAGKMLTQVPEE FT PWATVCADFVGPLPRSKHGNSMLLVLVDRFSKWTEIVPMRRATTETMRKAVRERIVAR FT YGVPKVMITDNNVQLTIRAFRKFLEELGVRHQLTAPYTPQENPTERANRTVKTMIAQF FT TGADQRTWDEHWPELQLAVNTSVAETTGYSPAFITQGREPRLPNALFDEKTTGTGKCT FT QTPVENAEKLKEIFELVRRNMEKAVQDQARHYNLRRRPWKPKVRDTVWAKEHHLSKAA FT EGFAAKLAPRFDGPYMIKKFTSPVICVLEHKTSKKEKTAHISDLKPGSAGAGGPGDFE FT E" XX SQ Sequence 1543 BP; 471 A; 359 C; 469 G; 244 T; 0 other; GTAATCTCAT ACTCCAGTAG AACGCTGAAC GCAGCGGAGA GGAACTACTC GGCAACGGAA 60 AAGGAATGTT TAGCAATAGT ATGGGCCGTC GGAAGTTGAG ACCGTACCTC ATCACGGACC 120 ATATGGCTCT GAAATGGCTG AATAACATCG AGAGCCCATC AGGATAGCGA GATGGGCGTT 180 GGAGCTGCAA CAGTACGATT TCGAAATCTC GTACAGAAAG GGCCAGCTGA ACATCGTTGC 240 CGACGCACTT TCAAGACAAC CTCTGCAGGA GATGAGCCGA AGAATGAGCT AGAGGTCCAC 300 GAACAACCGA GAGAGCAGCA AGGATGTAGG TGCTTTGAGG AGATGCGAAG AAAAGTGAAG 360 CAGCCGCCGC AAAAATTTCC GGACTATTTG GAGGAAGACG GCAAGCTGTA CCGACACATA 420 GCTCATCGGG CGTGCAACGA GGAGGTGGCA TCGTGGAAGA TGTGTATACC GATCGGTCAG 480 AGGCAGCGCG TCATGACCGA AAACCACGAC ATGCCGACTG CAGCTCGGTA TTACTGGCCG 540 GGAATGCATC GAGGCGTAAG AAAGTACGTG CGGAACTGCG AGTGCTGCAT GATGTACAAG 600 CCCAGCCAGC TACAGGCGGC CGGAAAAATG TTGACACAGG TGCCGGAAGA ACCATGGGCA 660 ACAGTGTGCG CAGATTTCGT GGGCCCCTTG CCACGGTCGA AGCATGGAAA TTCGATGCTG 720 CTGGTCCTGG TGGACAGATT TTCAAAGTGG ACCGAGATCG TTCCTATGAG AAGGGCGACC 780 ACTGAGACGA TGCGAAAGGC TGTTCGAGAG CGGATAGTGG CCAGATACGG AGTGCCCAAA 840 GTGATGATCA CAGACAACAA CGTGCAGTTG ACAATTAGGG CGTTCAGGAA GTTCCTGGAG 900 GAGCTGGGGG TACGACACCA GCTCACGGCT CCATACACTC CGCAAGAAAA CCCTACGGAA 960 AGGGCCAACA GGACCGTCAA AACAATGATC GCGCAATTCA CAGGTGCCGA TCAGAGGACA 1020 TGGGATGAGC ACTGGCCGGA GCTGCAGCTG GCGGTTAACA CAAGTGTAGC GGAGACCACA 1080 GGATACTCGC CGGCGTTCAT AACGCAGGGA AGGGAGCCGA GGCTGCCCAA TGCGTTGTTC 1140 GACGAAAAGA CGACCGGGAC CGGCAAGTGC ACCCAGACAC CGGTGGAGAA CGCGGAGAAA 1200 TTGAAAGAAA TCTTCGAGCT GGTGCGGAGG AATATGGAAA AGGCGGTCCA GGATCAGGCA 1260 CGCCACTATA ATCTCCGAAG ACGGCCGTGG AAACCCAAGG TGAGGGACAC AGTATGGGCC 1320 AAAGAGCACC ACCTGTCAAA GGCAGCCGAA GGTTTCGCGG CAAAATTGGC CCCGAGATTT 1380 GACGGGCCGT ACATGATAAA GAAATTCACG TCACCAGTAA TATGCGTCCT GGAACACAAA 1440 ACAAGCAAGA AAGAAAAGAC AGCACACATA AGCGATCTGA AGCCGGGAAG CGCGGGCGCC 1500 GGCGGGCCAG GAGACTTCGA GGAATAAAAA AAAAAAAAAA AAA 1543 // LOCUS DME542581 10463bp DNA XX AC AJ542581; XX DR FLYBASE; FBgn0069343; TAHRE. XX FT source AJ542581:1..10463 FT SO_feature five_prime_UTR ; SO:0000425:1..452 FT SO_feature three_prime_UTR ; SO:0000426:6698..10463 FT SO_feature CDS ; SO:0000316:453..3350 FT /db_xref="TAHRE:; TAHRE\gag" FT /protein_id="CAD65868.3" FT /translation="MSTSDHLFSDDEVHSISSSPEQRNSPFHLEISPMSHESDNSQSN FT ISIINLRKLPLKPTNNISKCSSGTAINIFHSLSHKEKENMNTNIAQKDPLSIDNTAAD FT TDGAKSSILKGKLPSPPLSSHTYKGKLPPATTHTNISALTHTDASQREKIPTSVICTN FT AAAATNTNADLGAKTSDALGNFPSLSHSDHSMENNLSSSTKIGPNTNSPSSHILTNTS FT QATNISAESRSKFPAPTNTDARLKKAITSDKGEIHTQIQTNKSKEHNQENKPFNYLSC FT YASWSTSNPKPDISKLSLTRKSTNRTGNSGKRSISPHQKNASLCPSAQGNLNSNLNSN FT SNPKSSATPTEVNLSAARTLSRPAAKRDLFNSSSRSPEEQPMSFSEVVAGTGPDIIAP FT SAPAPLTKTPGKRTNSDLDCSSFKTPNKRLRATPNFETPSLFPPLITPVFKSKAAQSV FT YEESKARNGPPRQPLPCSNNASARSATAPPGIAPLPPQNTDVELPPWKIVPQSRRAPP FT ILVNNVREIVPLLEKLNYTAGVSSYSTRATEGNGVRIQAKDMTAYNKIKEVLTANGFP FT LFTNQPKSERGFRVIIRHLHHSTPCSWIVEELLKLGFQARFARNMTNPATGGPMRMFE FT VEIVMAKDSSHGKIISLKQLGGQRVDIERKNRTREPVQCYRCQGFRHSKNSCMRPPRC FT MKCAGGHLSSCCTKPRTTPATCVNCSGEHISAYKGCPAYKTEKRKLAVNNIDINKIRT FT IKDANITNYGRQGPPSRNNFPRLPFSSSTSNRTTAESRQDTARARRNNPFRQNRNEAR FT PIQPRFSSHDFAIQKRLNKWRRNSDNVSKKGTINPKDKPKPRTPNMTSNPAQKHLEMF FT QEKLRKARCERKEQDPEEKKTNIRMGDDESPPTTSRAARAFLKPRIIDDNIPTPMDTY FT SNPQKSPSDFDSKSLTQRVENIEKKIDNLMELLFKCLESTKESTLAHLMTS" FT SO_feature CDS ; SO:0000316:3386..6687 FT /db_xref="TAHRE:; TAHRE\pol" FT /protein_id="CAD65869.1" FT /translation="MTFNLNNGQASNTLKIGYWNSCGITNKTNELEAYIKKEGIQIML FT VTETRLERNSNALNIKGFHTYLAQNPTSHRKGGTATIVSQNIRHACLNPIETDFMQSA FT PIALIPSSRIRADMTIVAPIYCPPVYKWTTEQFSKLFNHFEMLLDGKSKFILGGDWNC FT KHRLWGNYLSCARGRSLSQSILARKDLDIVATGHATHFPFDKKKQPSALDFAICKGFH FT TQKLKTYSTDELSSDHLPIQIVLDPDDSDWQHNKTNNAIIQKRTDLTKFKKNLENKIL FT LNTEIRTGQDIDDCIDILINNIKSAANEATPPNRPQNNRTYNSSRRSTNLRLDEATKR FT LLEEKKELNRIFRAVGTDEARRWFKNVQNRLSKEIRKLKQKLLNRNLQDIDTTDRYRT FT QKLWKTTNTIKMQPRPCWPIKKDNDDSRQRTDYPWTRTLDEKAEAFASHLESRFRTNQ FT INDAKDREFVRNELNKFKSLNASGESGNSNFKPVTLAELNGLINSLELKKAPGTDNLN FT NKTIINLPTKARIYLILIYNNILRTGHFPNKWKHASISMIPKPGKSPFALNSYRPISL FT LSGLSKLLERILLKRLYDIDSFAKAIPSHQFGFRKDHGAEHQLARVTQFILKAFDEKD FT VCSATFLDITEAFDRVWHDGLLYKLSRLIPRYLFDLLENYLSNRTFSVRIDGETTSRI FT GNIRAGVPQGSILGPVLYSIYSSDMPYPIVKDYMRNISFPDYHPTNIILATYADDTII FT LSRSKYTKLAINLNQNYLNVFCRWSKKWDIAINAKKTGHILFSLKKEQTNIYTPPLIN FT GQRAAKLNKQRYLGLMLDRRLTFCAHMTLLKGKTIAAYKKLEWLIGKNSHLPKNAKIL FT LWKQIVSPIWHYAIAIWGSLVSDTQAKKIQTMENKYIRRIINASRYTRQADIRTKYNI FT KSFDEIFDKASQRYANSLTDHENPLIYDLLINAYKPNRLELSKNRYVKQLSKYILPLQ FT QHRPKPPAEPIYSSIHNYTKKEEAEIVAKMRTKFRQTLPTLLRIADQELEIRQSIAEE FT KRKEKEKSERRKALEKGPPDRWCELQINKYSKLYRKGLRAREEILELMLGQPATVIKI FT VIPDYEPEDDLTKKS" XX SQ Sequence 10463 BP; 3913 A; 2638 C; 1670 G; 2242 T; TTAAAAGCAA AGTTTTTTAA AAGTTCAAAA ATATAATAAA ATAAAGCAAA TTAAATTTAA 60 TTAATAAAAC AATTAATTTT ATTTAATAAA ATTAAACGCG CTTCGTCGGC AAATAACTCT 120 CACGCGCAAA TTTTATTAAA TTCGCCTTTC AAGTTGAAAA ATTAAAAGTT AAAATCGTCT 180 TCCGGCCGCA AAGTTTGAAC CGCGACGATA CAAACATTTA ATAGACAAAC AAAAAGCGAA 240 CAATAAATCA GTGAATTATT TGTGCAAGCT GCCGCCATAA CCAAAAGGAG AAGAAGCCAA 300 AAGACGAAGA GGAGAAAGTA AACCAGGAGA AATTAAGAAG ACAAGGAGGA AAACTTATTT 360 AAAAAGACGA CCTAAACTGG AGGACAAATT ACAAATTAAA AGCCAGGGTA TTTATACCTT 420 ACAAGTATCG ACCTAATATA ATATAAATAA ATATGTCCAC GTCCGACCAC CTATTTTCTG 480 ACGATGAGGT TCACTCAATC TCCTCAAGCC CAGAACAGCG AAATTCACCA TTCCACCTTG 540 AAATATCGCC CATGTCCCAT GAATCTGACA ATTCTCAGTC TAATATAAGC ATTATTAATC 600 TGAGGAAATT GCCCTTAAAA CCAACAAATA ATATTTCAAA ATGCTCTTCT GGGACTGCCA 660 TAAATATTTT TCATTCCCTT TCACACAAGG AGAAAGAGAA CATGAACACC AATATTGCCC 720 AAAAAGACCC CCTCTCGATC GACAATACAG CTGCAGACAC GGACGGCGCC AAAAGTAGCA 780 TCTTGAAGGG GAAATTGCCT TCCCCTCCGC TCTCATCACA CACATACAAG GGGAAACTAC 840 CTCCAGCAAC GACCCACACT AACATATCTG CTCTTACTCA CACTGACGCT TCTCAAAGAG 900 AAAAAATACC CACCTCAGTG ATCTGCACTA ATGCAGCTGC AGCTACTAAT ACAAATGCAG 960 ATCTAGGCGC CAAAACGAGC GACGCCTTGG GAAATTTCCC CTCCCTCTCA CACAGCGATC 1020 ATAGCATGGA GAACAACCTA AGTTCCTCCA CCAAAATTGG ACCCAATACA AATTCCCCTT 1080 CTTCTCACAT ACTCACCAAC ACAAGCCAGG CCACAAACAT AAGCGCAGAA AGCCGCTCAA 1140 AATTTCCCGC GCCCACCAAT ACTGACGCAC GCCTCAAGAA GGCCATTACT AGTGACAAAG 1200 GGGAAATTCA CACACAAATT CAAACAAACA AAAGCAAGGA ACACAACCAA GAAAATAAAC 1260 CATTTAATTA CTTAAGTTGC TACGCTTCAT GGTCAACTTC AAACCCAAAG CCAGACATTT 1320 CCAAACTAAG TTTAACTAGG AAATCCACCA ACCGTACTGG AAATAGTGGG AAAAGAAGCA 1380 TTTCCCCCCA CCAAAAGAAT GCTTCGTTAT GCCCTTCTGC TCAGGGTAAT TTAAATTCAA 1440 ATTTAAATTC AAATTCAAAT CCCAAATCTA GCGCCACTCC CACTGAGGTG AATTTATCAG 1500 CAGCCCGCAC CCTCAGCCGG CCGGCTGCCA AGCGCGATTT ATTTAATTCA TCTTCCAGGA 1560 GCCCAGAAGA GCAGCCTATG AGTTTTTCGG AAGTGGTTGC TGGAACAGGT CCAGATATAA 1620 TAGCACCCTC CGCTCCTGCA CCACTAACGA AAACTCCGGG CAAACGAACA AACAGCGATC 1680 TGGACTGCTC TAGCTTTAAG ACACCTAATA AAAGATTACG TGCGACTCCT AACTTTGAAA 1740 CTCCAAGCCT TTTCCCCCCG CTCATTACAC CCGTTTTTAA AAGTAAGGCG GCTCAATCTG 1800 TTTATGAGGA GTCCAAGGCC AGGAATGGAC CCCCCCGCCA GCCGTTACCC TGCAGCAACA 1860 ATGCTTCTGC TCGCAGCGCA ACAGCACCAC CCGGGATTGC CCCCCTACCC CCTCAGAACA 1920 CAGATGTAGA GCTGCCCCCC TGGAAAATCG TTCCCCAGAG CCGTAGAGCA CCCCCTATAC 1980 TAGTCAACAA TGTGAGGGAA ATTGTCCCAC TGCTGGAAAA GCTGAATTAT ACAGCAGGAG 2040 TCTCCAGCTA CTCCACCAGG GCAACAGAAG GAAACGGGGT CAGGATCCAG GCCAAGGATA 2100 TGACTGCCTA CAACAAAATC AAAGAAGTCC TGACCGCCAA CGGTTTTCCT TTATTCACTA 2160 ACCAGCCCAA ATCCGAGAGG GGCTTCCGAG TCATCATCAG ACACCTCCAT CATTCCACAC 2220 CATGCTCGTG GATAGTCGAG GAGCTGCTGA AGCTCGGATT CCAAGCACGC TTCGCTAGAA 2280 ACATGACGAA TCCAGCTACA GGTGGCCCCA TGCGAATGTT TGAAGTGGAG ATAGTCATGG 2340 CCAAGGACAG CAGCCATGGC AAAATTATCT CATTGAAACA ACTCGGTGGG CAAAGGGTGG 2400 ATATCGAAAG GAAAAACAGG ACTCGGGAGC CGGTCCAGTG CTACAGATGC CAGGGCTTCA 2460 GGCACTCCAA AAATTCATGC ATGAGGCCGC CCAGATGCAT GAAATGCGCT GGCGGTCACC 2520 TGTCATCCTG TTGCACAAAG CCAAGAACCA CCCCTGCCAC CTGCGTCAAC TGCTCTGGTG 2580 AGCATATTAG TGCGTACAAG GGATGCCCCG CTTATAAGAC TGAAAAACGA AAACTGGCGG 2640 TCAACAACAT TGACATCAAT AAAATAAGGA CAATCAAGGA CGCGAACATT ACCAACTATG 2700 GACGACAGGG CCCTCCCTCT CGCAACAACT TTCCCCGGCT ACCATTTAGC TCCTCAACCT 2760 CCAACAGGAC AACAGCCGAA TCCCGCCAAG ACACAGCAAG AGCACGGCGG AATAACCCCT 2820 TCCGGCAAAA CAGAAACGAA GCAAGGCCAA TCCAGCCACG CTTCTCCTCC CACGACTTTG 2880 CCATCCAGAA ACGTCTGAAT AAATGGCGCC GCAACTCTGA CAACGTCTCC AAAAAAGGCA 2940 CGATAAATCC CAAGGACAAG CCAAAGCCAC GAACACCTAA CATGACGAGC AATCCTGCAC 3000 AAAAACACCT GGAAATGTTC CAGGAAAAGC TCCGCAAAGC AAGATGTGAA CGCAAGGAAC 3060 AGGACCCGGA AGAAAAGAAG ACAAACATCA GGATGGGAGA CGATGAAAGC CCGCCAACCA 3120 CCAGCAGAGC TGCCAGAGCA TTCCTCAAGC CAAGGATCAT AGACGACAAC ATCCCCACGC 3180 CTATGGATAC ATACTCTAAC CCGCAAAAGA GTCCCTCTGA CTTCGACAGC AAAAGCCTTA 3240 CACAACGAGT GGAAAATATT GAAAAGAAAA TTGATAATCT AATGGAACTA TTATTCAAGT 3300 GCCTCGAATC CACGAAAGAG TCTACCTTAG CACACCTCAT GACCTCCTAA ATCACTAAAT 3360 TCTCTACGCA AATTAAAATA CTACTATGAC CTTTAATCTA AACAACGGAC AAGCTTCCAA 3420 TACCCTAAAA ATAGGGTACT GGAACTCATG TGGAATTACA AATAAAACAA ATGAACTGGA 3480 GGCCTACATC AAAAAAGAGG GTATTCAAAT TATGTTAGTA ACTGAAACCA GGTTAGAGCG 3540 AAATTCCAAC GCACTTAATA TCAAAGGCTT CCACACCTAC CTAGCACAAA ACCCTACATC 3600 CCACAGAAAA GGCGGCACGG CCACCATAGT CAGTCAAAAC ATTAGACATG CCTGCCTAAA 3660 TCCCATAGAG ACTGACTTCA TGCAGAGCGC ACCCATCGCA CTTATCCCCT CAAGTCGAAT 3720 ACGCGCAGAC ATGACAATTG TTGCCCCAAT ATACTGTCCG CCTGTATATA AATGGACCAC 3780 TGAACAATTC TCGAAGCTCT TCAACCACTT CGAGATGCTT CTCGATGGTA AGTCCAAATT 3840 TATACTTGGC GGCGATTGGA ATTGCAAACA CAGACTCTGG GGTAACTATT TATCCTGCGC 3900 TAGGGGAAGA TCTCTGTCAC AGTCAATCCT TGCAAGGAAA GACCTTGACA TAGTTGCTAC 3960 CGGCCATGCC ACACACTTTC CCTTCGACAA GAAAAAACAG CCCTCTGCAC TGGACTTTGC 4020 AATATGCAAG GGCTTCCACA CTCAAAAACT AAAAACCTAC TCCACAGATG AACTCAGCTC 4080 TGACCACCTT CCCATCCAAA TAGTCCTAGA TCCTGACGAC TCAGACTGGC AACATAATAA 4140 GACAAACAAT GCTATTATAC AAAAAAGAAC AGATCTCACT AAATTTAAGA AAAACCTAGA 4200 AAACAAAATA TTACTAAACA CCGAAATTCG CACAGGACAG GACATCGACG ACTGCATAGA 4260 CATTCTTATC AATAACATAA AATCCGCAGC CAATGAAGCC ACCCCACCTA ATCGCCCCCA 4320 AAATAACCGC ACCTACAATT CTTCAAGAAG ATCCACAAAC CTAAGGCTAG ATGAAGCAAC 4380 TAAAAGGTTA CTTGAGGAGA AAAAGGAGCT AAATCGAATC TTTAGAGCAG TAGGAACTGA 4440 CGAAGCCCGA AGATGGTTTA AGAACGTGCA AAATAGACTG TCTAAAGAAA TCAGAAAGCT 4500 CAAACAAAAA CTACTCAACC GCAACCTACA AGATATTGAC ACTACGGACC GATATAGAAC 4560 GCAAAAACTA TGGAAAACCA CAAATACAAT TAAAATGCAA CCTCGACCCT GCTGGCCGAT 4620 AAAAAAGGAC AATGACGACA GCCGTCAAAG AACTGACTAC CCCTGGACCA GGACTTTAGA 4680 TGAAAAAGCT GAAGCCTTTG CATCTCACCT TGAGTCCCGC TTCAGGACAA ACCAAATTAA 4740 CGACGCCAAA GACAGGGAAT TCGTCAGAAA TGAGCTAAAC AAATTTAAAT CATTAAATGC 4800 GAGCGGCGAA TCCGGAAACA GCAACTTCAA ACCAGTCACT CTGGCTGAAC TAAATGGCCT 4860 GATAAACTCA CTGGAATTAA AGAAAGCCCC AGGAACTGAC AATCTTAACA ACAAGACCAT 4920 AATAAACTTA CCTACAAAGG CCAGAATATA TTTAATACTT ATTTATAACA ACATCCTGAG 4980 AACTGGACAT TTCCCGAACA AATGGAAGCA CGCTAGCATC TCAATGATTC CCAAACCAGG 5040 GAAATCACCA TTTGCTCTAA ATTCATACCG CCCAATCAGC TTACTCTCTG GTCTTTCCAA 5100 ACTACTCGAA AGAATACTAC TGAAACGACT GTATGACATT GACTCTTTTG CCAAAGCAAT 5160 CCCTTCCCAT CAATTTGGTT TCAGAAAGGA TCATGGAGCG GAACATCAGC TGGCCAGGGT 5220 GACCCAATTT ATTCTAAAAG CTTTTGATGA AAAAGATGTC TGTTCTGCCA CATTCCTTGA 5280 CATTACGGAA GCCTTTGACC GAGTATGGCA CGACGGCTTG CTATATAAAC TATCCAGACT 5340 CATCCCCAGA TACCTATTCG ACCTACTTGA AAACTATTTA TCTAATAGAA CCTTCTCAGT 5400 AAGGATCGAC GGTGAAACAA CGTCTAGGAT AGGTAATATT AGAGCAGGAG TGCCCCAGGG 5460 CAGCATACTG GGACCGGTCC TCTACTCAAT ATACTCATCC GACATGCCCT ATCCCATCGT 5520 AAAAGACTAT ATGCGTAACA TATCCTTCCC TGATTACCAC CCAACTAATA TTATCTTAGC 5580 TACATATGCA GATGATACCA TAATTCTTAG CCGGTCCAAA TATACCAAGC TTGCGATCAA 5640 CCTAAATCAA AACTACCTTA ACGTCTTCTG TAGGTGGTCA AAAAAATGGG ACATAGCAAT 5700 TAATGCAAAA AAAACCGGAC ACATTCTTTT CTCCCTAAAA AAAGAACAAA CTAATATATA 5760 CACTCCCCCA CTAATCAACG GACAAAGAGC TGCCAAACTA AACAAACAAC GCTATCTCGG 5820 ACTTATGCTA GACAGAAGAC TGACCTTTTG TGCACACATG ACGCTGCTAA AGGGAAAGAC 5880 TATAGCTGCA TATAAAAAAC TGGAATGGCT AATAGGAAAA AACAGCCACC TACCCAAAAA 5940 TGCAAAAATT CTCCTCTGGA AGCAAATTGT CTCCCCCATC TGGCATTACG CCATAGCAAT 6000 CTGGGGCTCG CTGGTATCTG ACACCCAAGC AAAGAAAATT CAAACAATGG AAAACAAATA 6060 CATCAGACGA ATCATAAACG CCAGCAGATA CACGAGACAA GCAGACATAA GGACAAAATA 6120 TAACATTAAA TCATTTGATG AAATTTTTGA CAAAGCAAGC CAACGCTACG CCAACTCCCT 6180 CACTGACCAT GAAAACCCTT TAATATATGA CCTCCTTATC AACGCCTACA AGCCGAACAG 6240 ACTGGAACTA AGCAAAAACA GATACGTCAA GCAATTATCA AAATATATAC TGCCCCTTCA 6300 ACAACACCGA CCTAAACCAC CCGCAGAACC CATCTACAGC TCCATACATA ATTATACAAA 6360 AAAGGAAGAA GCTGAAATAG TCGCCAAAAT GAGAACCAAA TTCAGACAGA CCCTTCCAAC 6420 CCTCCTCCGG ATCGCTGACC AAGAATTGGA AATAAGACAA TCCATAGCTG AGGAAAAGAG 6480 AAAGGAAAAA GAAAAATCCG AAAGGAGAAA AGCCTTAGAG AAGGGACCGC CAGATAGATG 6540 GTGTGAACTT CAAATAAACA AATACAGCAA ATTATACAGA AAAGGACTAA GGGCCAGAGA 6600 GGAAATCCTT GAATTAATGC TCGGACAACC CGCCACTGTA ATCAAAATAG TAATCCCAGA 6660 CTATGAACCC GAAGACGACC TAACAAAAAA GTCTTGAAAA CTAAATAAAA GCAAATAAAA 6720 CCAAAAGTAC ATGTATTTAC AATAATCATT GATTGTATAA TTGGTGATTA TAATATAACA 6780 ATTATAATTA TTAATATAAT TGATTGTCAT AATTGTTAGC TATTGATTAT AAATAACTAA 6840 TCAAAATACA AACTACAAAC TATGACCGAC GGAAAGACGC ACGCCGACCT GCTTCTCTTC 6900 CTACAATAGC GATACACATC TCCTTCTCCA TAGTCAGCAT CTTTCTGTGG AAAAACAAAC 6960 CAATTAGATG GATGATACAA AAACACAAAT AATAACCACA CCTCAACGCA TCCGGATAAA 7020 AACAACCAAC GACAACGCAT TCCAGCTGAT CATGACGAAG TGATGCGAAT AAAATCACCA 7080 CCTGGACATA AAAGAAGAAT CGGTAGATGG ATATGAAAAG GATTGGTGCG GCGAAAGCAT 7140 GATGAATATA AGGCGACTCG CTGCAGCAAT ATATGCACAA CGTCACTTAC CTGAATCTTC 7200 TTGCCGCACA GTCTTTTGAA GATCCTTATC ACCGCTGCAA TCCACACACA TCGCCGCATT 7260 GCTAAAGACA GGCCATCTAA GCTGACCCAG CGCCGATTAG GACACTCTGT TCGACGAGCG 7320 CCTACAGCGA ATTGCCGCTA AAACCTAAAA ACAAAAAATT TATTAACAAA TGAAATACAA 7380 ATATACAAAA TTCAAATAAA ACAAAGCACT TACCTCACTG ACCGCATCCA ACTGCACGCA 7440 ACACATCCAT ATCCAACATA ACAGACAAGA GGAGACGGGC CCTCAAACGT AAAACAATAT 7500 CGCCAACTTT GCGCTTACAA ACACAAAAAA ATTTACAATT TTATGATGCC GTCTCCTCTT 7560 CCCGATGCCA CTGCCTCAAT ATGGATTACT AGCGCGGAGC CGACAGCACA CTAAAAGGCT 7620 GAAAAATTTG TCCTCAAAAT TAATATTTTT CCTTAGTACC ACTATCCCAA CGAATTTTCC 7680 GCAAACCTGA AATAAAAAGA AAATTAATAA GAAAGTGATA CAAAATTAAC TAAAAACAAA 7740 TAGAAAATAG CAAACCGGAC AAGCAAAGTA ACAGATATAA ATATGCTACT TCATCCTGCT 7800 GAAGACACGC ATCCATGCAT CCTTCTTCCA AGACTGCAAA ACAGAAAGAA GGAAACACAA 7860 GCTATACTGG GAAAAATATA TTAAATCACA ATACTTATCT AATTGCCAAT TTGAAGAATC 7920 CTAAACCACG GCATCCCAGC GGCGATGGCC TTATCTTTAG CTGCTACAAA TGTACCTGAA 7980 AACAAAAATC AAAAAACAAA AAGTAATTCA ACTATAAAAA CAAACATAAT ACTTACCTCC 8040 AGACTATTTT CCTCCCGAAA AACATACCTG GAAAACAAAA ACAATGCAAC TATATAAACA 8100 AATAAATACA CAAATAATAC TTACCTCCGA ACTGCATTCC ACCTAATGTA CCTGAAAAAA 8160 TACAAAAATT ACAGAAATCA CAAAAATAAA TAACAAATGC TATACTTACC TAATTTTAAT 8220 ATTACACCCA TTCCCATGGC CCAATCTTTG GGCGGTCCCC AGCAACAATT CCTGACCCGG 8280 AACATTCTAA AATAATAGGA AAATAAATAA GATTGCGACT CAAAATTAAG CAATAACACA 8340 AAAAAAAAAA CAACAAACCT GGCAGACAAA TTAGCTGACG ATAAATTACA ACACCATCCT 8400 GCTGACGTCA CGCACGAAAA TTCTCTCTCC TAAAACACGG GATCCCAGCG GCGATGTCCT 8460 TAACTTTAGC TGCTACAAAT GTACCTGAAA ACAAAAAAAC AAAAATTAAT TCAATTATAA 8520 AAACAAACAT AATACTTACC TCCAGACTAT TTTTCCTCCC GAAAAACATA CCTGGAAAAC 8580 AAAAACAAGG CAACTATATA AACAAATAAA TACATATATA ATACTTACAT CCAAACTACA 8640 TTCCACCTAA TGTATCTGAG AAATACAAAA ATTAGAGAAA TCACAAAAAT AAATAACAAA 8700 GGTTATACTT ACCTAAAATT TAATATTACA TCCATTCCCA TGGCCCAATC GTTGCGGCGG 8760 TCCTCAGCAA CAATTCCTGA CCCGGAACAT CCTAAAATAA AAGGCAAATA AAATAACACA 8820 CATAAAATAG CAAACCTGAC AGGCAAACTA GCAGATGCTA ATTTGCAACA CCATCCCGCT 8880 GACGTCACGC ACGAAAATCC TTTCTCCAAG AGCGCAAAAA CTGAAAAAAG AAACACAAGC 8940 TAACACTGGG AAATATATAC TTCTTAACAA AATACTTATC TAAATTGCCA ACCGGACGAC 9000 TTCAAAGCTG CGGCCTAATT TTAATATTAC ATCTATCTCC GTGGCCCAAT CTTTGCGGCG 9060 GTCCTCAGCA ACAACTCCTG ACCCGGAACA TCCTAAAATA AAAAGAAAAA CAAATAAGAA 9120 TGCGACTCAA AATTAAGAAA TAACACACAA AAAATAACAA ACCTGACAGG CAAACTAGCA 9180 GATGCTAATT TGCAACACCA TCATGCTGAC GTCACGCACG AAAATCCTTT CTCCTAAGAC 9240 CGCAAAAGCT GAAAAAGGAA ACACAAGCTA ATACTGGGAA ATATATATTT TTAACAAAAT 9300 ACTTATCTAA ACTGCCAAAT TGACGACTTC AAAGCTGCGG CATCATAACT ACAACGACCT 9360 CCAAAATAAC TTTCAAAAAA TGTACCTGAA AAACAGAAAC TAATGAAATC ATAAAAACAA 9420 ATATAATACT TACCCCCAGA TTAGCTATTA CCTGAAAAAC AAAAAATTAA TGCAACTATA 9480 TAAACAAATA AAAACAAATA TACTTACCAT CAAACTATCC ATCACATTGT ACCTGAAAAC 9540 AAAAACAAAT GCAATCATAA AAAACAATCC TAATGCAATA CTTACCTATC TCTAACTTTA 9600 CATTCATTTC CATGCCCGTA TCGCTGCGGC GGACTTCTGC AACAAATCTT GAACCGGCGG 9660 CCCCAAGCTG CCAATCCCGA CGCAATGGCC AATAGACGAG GCGCTCCTGG CAACTCTTGG 9720 CGAACAACTA AGCTGCATTC TACTGACGAC TCCTCTGCCA CAACGAGACA GATTCCTCCA 9780 TTACGATCCC AGCAGCTTTA AGACCTTGTA ACGACGGCTG CGCTGGACCC TCCTTGTTCT 9840 ACCTTCTTTT GATGACCGGC GAAGTGGCCC TGCAAATTAA CAATTTATCA AACAACTGCA 9900 ATCATCTGCC ACTGAGGTTG AAATATATAC CTACTAAATG ACAGCGGCGC GGGATACACT 9960 CACCTATAAT AGGCTGCTTG CAGCGCTGGC CGGACATGCA TATTGCAAGT GGCACGCATC 10020 CAGAGTCCAC AACAAGCCCC AGCCAGAATG CCAAAATTAC TCACCTGCAA TGTTTCCTGA 10080 GGCCTCCAGC GACTCGGTGC TTCCGTCCTT CCGACGGGGG TACCTGAAAA GAATAAATCA 10140 ATAACAATGT TAGTTTTAAA TTTCAATGTT TTCTATAAAA TCATTTCAAT TGTAAATTGT 10200 AAACATACCA TACAATATGA TAATGTTACC TGTCCTAATT ACTGTCAAAA GCCTAAGTCT 10260 ACAAAATACT AACTACTTTT ACATTATTAC TAACTATGTC CAACCCCCAA ACTCACCACA 10320 TGTAATGTTA CATTCAAAAA TGCAAATTAT TGTACCTATA TATTACAAAC TCTGTAATCA 10380 AAGGCAAAAT AAATTGTGGA TGCGGAACAG AATTTATTCT GTCTCCGTAC CTCCACCAGC 10440 AAAGTTAAAA AAAAAAAAAA AAA 10463 //