LOCUS PC175S.TXT 5440 BP DS-DNA CIRCULAR SYN 17-MAY-2003
DEFINITION -
ACCESSION
-
KEYWORDS -
SOURCE -
FEATURES
Location/Qualifiers
misc_RNA complement(>4..12)
/note="SV40 minor late 19S RNA"
promoter <4..>348
/note="SV40 promoter (ÆSELP)"
enhancer 26..168
/note="SV40 enhancer elements"
misc_feature
173..236
/note="SV40 21bp repeats"
rep_origin
193..>328
/note="SV40 Ori"
mRNA
242..248
/note="early-late transcription startpoints"
mutation 262..262
/note="G->A kills SELP ATG"
mRNA
282..288
/note="early-early transcription start points"
CDS
373..1092
/note="EGFP"
polyA_signal
1108..>1329
/note="SV40 Late PolyA"
promoter <1420..>1657
/note="hEF1alpha
promoter core"
5'UTR
1665..1931
/note="HTLV-1 R-U5"
CDS
1963..>3480
/note="MŸller's HPV16 L1"
mutation 2485..2485
/note="T->A makes Cys175->Ser"
polyA_signal
<3529..4191
/note="hEF1a polyA signal"
rep_origin
4192..4925
/note="MB1 Ori"
promoter 4926..5012
/note="EM7 promoter"
CDS
5013..5387
/note="ShBle (ZeoR)"
terminator
5388..5440
/note="terminator (rpmB/G)"
BASE COUNT 1312 A 1538 C 1479 G 1111 T 0 OTHER
ORIGIN -
1 CCCCTGTGGA ATGTGTGTCA
GTTAGGGTGT GGAAAGTCCC CAGGCTCCCC AGCAGGCAGA
61 AGTATGCAAA GCATGCATCT
CAATTAGTCA GCAACCAGGT GTGGAAAGTC CCCAGGCTCC
121 CCAGCAGGCA GAAGTATGCA AAGCATGCAT
CTCAATTAGT CAGCAACCAT AGTCCCGCCC
181 CTAACTCCGC CCATCCCGCC CCTAACTCCG
CCCAGTTCCG CCCATTCTCC GCCCCATGGC
241 TGACTAATTT TTTTTATTTA TACAGAGGCC
GAGGCCGCCT CGGCCTCTGA GCTATTCCAG
301 AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA
GGCTTTTGCA AAAAGCTTGA TTGGGATCCA
361 CCGGTCGCCA CCATGGTGAG CAAGGGCGAG
GAGCTGTTCA CCGGGGTGGT GCCCATCCTG
421 GTCGAGCTGG ACGGCGACGT AAACGGCCAC
AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC
481 GATGCCACCT ACGGCAAGCT GACCCTGAAG
TTCATCTGCA CCACCGGCAA GCTGCCCGTG
541 CCCTGGCCCA CCCTCGTGAC CACCCTGACC
TACGGCGTGC AGTGCTTCAG CCGCTACCCC
601 GACCACATGA AGCAGCACGA CTTCTTCAAG
TCCGCCATGC CCGAAGGCTA CGTCCAGGAG
661 CGCACCATCT TCTTCAAGGA CGACGGCAAC
TACAAGACCC GCGCCGAGGT GAAGTTCGAG
721 GGCGACACCC TGGTGAACCG CATCGAGCTG
AAGGGCATCG ACTTCAAGGA GGACGGCAAC
781 ATCCTGGGGC ACAAGCTGGA GTACAACTAC
AACAGCCACA ACGTCTATAT CATGGCCGAC
841 AAGCAGAAGA ACGGCATCAA GGTGAACTTC
AAGATCCGCC ACAACATCGA GGACGGCAGC
901 GTGCAGCTCG CCGACCACTA CCAGCAGAAC
ACCCCCATCG GCGACGGCCC CGTGCTGCTG
961 CCCGACAACC ACTACCTGAG CACCCAGTCC
GCCCTGAGCA AAGACCCCAA CGAGAAGCGC
1021 GATCACATGG TCCTGCTGGA GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG
1081 CTGTACAAGT AAAGCGGCCG CTTCGAGCAG ACATGATAAG ATACATTGAT GAGTTTGGAC
1141 AAACCACAAC TAGAATGCAG TGAAAAAAAT GCTTTATTTG TGAAATTTGT GATGCTATTG
1201 CTTTATTTGT AACCATTATA AGCTGCAATA AACAAGTTAA CAACAACAAT TGCATTCATT
1261 TTATGTTTCA GGTTCAGGGG GAGGTGTGGG AGGTTTTTTA AAGCAAGTAA AACCTCTACA
1321 AATGTGGTAA AATCGATAAG GATCCGGGCT GGCGTAATAG CGAAGAGGCC CGCACCGATC
1381 GCCCTTCCCA ACAGTTGCGG TGGAGAAGAG CATGCGTGAG GCTCCGGTGC CCGTCAGTGG
1441 GCAGAGCGCA CATCGCCCAC AGTCCCCGAG AAGTTGGGGG GAGGGGTCGG CAATTGAACC
1501 GGTGCCTAGA GAAGGTGGCG CGGGGTAAAC TGGGAAAGTG ATGTCGTGTA CTGGCTCCGC
1561 CTTTTTCCCG AGGGTGGGGG AGAACCGTAT ATAAGTGCAG TAGTCGCCGT GAACGTTCTT
1621 TTTCGCAACG GGTTTGCCGC CAGAACACAG CTGAAGCTTC GAGGGGCTCG CATCTCTCCT
1681 TCACGCGCCC GCCGCCCTAC CTGAGGCCGC CATCCACGCC GGTTGAGTCG CGTTCTGCCG
1741 CCTCCCGCCT GTGGTGCCTC CTGAACTGCG TCCGCCGTCT AGGTAAGTTT AAAGCTCAGG
1801 TCGAGACCGG GCCTTTGTCC GGCGCTCCCT TGGAGCCTAC CTAGACTCAG CCGGCTCTCC
1861 ACGCTTTGCC TGACCCTGCT TGCTCAACTC TACGTCTTTG TTTCGTTTTC TGTTCTGCGC
1921 CGTTACAGAT CCAAGCTGTG ACCGGCCCGC TCTAGAGCCA CCATGAGCCT GTGGCTGCCC
1981 AGCGAGGCCA CCGTGTACCT GCCCCCCGTG CCCGTGAGCA AGGTGGTGAG CACCGACGAG
2041 TACGTGGCCA GGACCAACAT CTACTACCAC GCCGGCACCA GCAGGCTGCT GGCCGTGGGC
2101 CACCCCTACT TCCCCATCAA GAAGCCCAAC AACAACAAGA TCCTGGTGCC CAAGGTGAGC
2161 GGCCTGCAGT ACAGGGTGTT CAGGATCCAC CTGCCCGACC CCAACAAGTT CGGCTTCCCC
2221 GACACCAGCT TCTACAACCC CGACACCCAG AGGCTGGTGT GGGCCTGCGT GGGCGTGGAG
2281 GTGGGCAGGG GCCAGCCCCT GGGCGTGGGC ATCAGCGGCC ACCCCCTGCT GAACAAGCTG
2341 GACGACACCG AGAACGCCAG CGCCTACGCC GCCAACGCCG GCGTGGACAA CAGGGAGTGC
2401 ATCAGCATGG ACTACAAGCA GACCCAGCTG TGCCTGATCG GCTGCAAGCC CCCCATCGGC
2461 GAGCACTGGG GCAAGGGCAG CCCCAGCACC AACGTGGCCG TGAACCCCGG CGACTGCCCC
2521 CCCCTGGAGC TGATCAACAC CGTGATCCAG GACGGCGACA TGGTGGACAC CGGCTTCGGC
2581 GCCATGGACT TCACCACCCT GCAGGCCAAC AAGAGCGAGG TGCCCCTGGA CATCTGCACC
2641 AGCATCTGCA AGTACCCCGA CTACATCAAG ATGGTGAGCG AGCCCTACGG CGACAGCCTG
2701 TTCTTCTACC TGAGGAGGGA GCAGATGTTC GTGAGGCACC TGTTCAACAG GGCCGGCGCC
2761 GTGGGCGAGA ACGTGCCCGA CGACCTGTAC ATCAAGGGCA GCGGCAGCAC CGCCAACCTG
2821 GCCAGCAGCA ACTACTTCCC CACCCCCAGC GGCAGCATGG TGACCAGCGA CGCCCAGATC
2881 TTCAACAAGC CCTACTGGCT GCAGAGGGCC CAGGGCCACA ACAACGGCAT CTGCTGGGGC
2941 AACCAGCTGT TCGTGACCGT GGTGGACACC ACCAGGAGCA CCAACATGAG CCTGTGCGCC
3001 GCCATCAGCA CCAGCGAGAC CACCTACAAG AACACCAACT TCAAGGAGTA CCTGAGGCAC
3061 GGCGAGGAGT ACGACCTGCA GTTCATCTTC CAGCTGTGCA AGATCACCCT GACCGCCGAC
3121 GTGATGACCT ACATCCACAG CATGAACAGC ACCATCCTGG AGGACTGGAA CTTCGGCCTG
3181 CAGCCCCCCC CCGGCGGCAC CCTGGAGGAC ACCTACAGGT TCGTGACCAG CCAGGCCATC
3241 GCCTGCCAGA AGCACACCCC CCCCGCCCCC AAGGAGGACC CCCTGAAGAA GTACACCTTC
3301 TGGGAGGTGA ACCTGAAGGA GAAGTTCAGC GCCGACCTGG ACCAGTTCCC CCTGGGCAGG
3361 AAGTTCCTGC TGCAGGCCGG CCTGAAGGCC AAGCCCAAGT TCACCCTGGG CAAGAGGAAG
3421 GCCACCCCCA CCACCAGCAG CACCAGCACC ACCGCCAAGA GGAAGAAGAG GAAGCTGTGA
3481 AAGCTACCCA CGGCCGAATA GCCGTGAGCC GGAATCCTGC ACGCTAGCAT TATCCCTAAT
3541 ACCTGCCACC CCACTCTTAA TCAGTGGTGG AAGAACGGTC TCAGAACTGT TTGTTTCAAT
3601 TGGCCATTTA AGTTTAGTAG TAAAAGACTG GTTAATGATA ACAATGCATC GTAAAACCTT
3661 CAGAAGGAAA GGAGAATGTT TTGTGGACCA CTTTGGTTTT CTTTTTTGCG TGTGGCAGTT
3721 TTAAGTTATT AGTTTTTAAA ATCAGTACTT TTTAATGGAA ACAACTTGAC CAAAAATTTG
3781 TCACAGAATT TTGAGACCCA TTAAAAAAGT TAAATGAGAA ACCTGTGTGT TCCTTTGGTC
3841 AACACCGAGA CATTTAGGTG AAAGACATCT AATTCTGGTT TTACGAATCT GGAAACTTCT
3901 TGAAAATGTA ATTCTTGAGT TAACACTTCT GGGTGGAGAA TAGGGTTGTT TTCCCCCCAC
3961 ATAATTGGAA GGGGAAGGAA TATCATTTAA AGCTATGGGA GGGTTTCTTT GATTACAACA
4021 CTGGAGAGAA ATGCAGCATG TTGCTGATTG CCTGTCACTA AAACAGGCCA AAAACTGAGT
4081 CCTTGGGTTG CATAGAAAGC TTCATGTTGC TAAACCAATG TTAAGTGAAT CTTTGGAAAC
4141 AAAATGTTTC CAAATTACTG GGATGTGCAT GTTGAAACGT GGGTTAATTA ACTAGCCATG
4201 ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT AGAAAAGATC
4261 AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA AACAAAAAAA
4321 CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT TTTTCCGAAG
4381 GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTTC TTCTAGTGTA GCCGTAGTTA
4441 GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT AATCCTGTTA
4501 CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC AAGACGATAG
4561 TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA GCCCAGCTTG
4621 GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA AAGCGCCACG
4681 CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG AACAGGAGAG
4741 CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT CGGGTTTCGC
4801 CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG CCTATGGAAA
4861 AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT TGCTCACATG
4921 TTCTTAATTA AATTTTTCAA AAGTAGTTGA CAATTAATCA TCGGCATAGT ATATCGGCAT
4981 AGTATAATAC GACTCACTAT AGGAGGGCCA TCATGGCCAA GTTGACCAGT GCTGTCCCAG
5041 TGCTCACAGC CAGGGATGTG GCTGGAGCTG TTGAGTTCTG GACTGACAGG TTGGGGTTCT
5101 CCAGAGATTT TGTGGAGGAT GACTTTGCAG GTGTGGTCAG AGATGATGTC ACCCTGTTCA
5161 TCTCAGCAGT CCAGGACCAG GTGGTGCCTG ACAACACCCT GGCTTGGGTG TGGGTGAGAG
5221 GACTGGATGA GCTGTATGCT GAGTGGAGTG AGGTGGTCTC CACCAACTTC AGGGATGCCA
5281 GTGGCCCTGC CATGACAGAG ATTGGAGAGC AGCCCTGGGG GAGAGAGTTT GCCCTGAGAG
5341 ACCCAGCAGG CAACTGTGTG CACTTTGTGG CAGAGGAGCA GGACTGAGGA TAAGAATTGT
5401 AACAAAAAAC CCCGCCCCGG CGGGGTTTTT TGTTAATTAA
//