[Biojava-l] Ensembl read problems
Schreiber, Mark
mark.schreiber@agresearch.co.nz
Tue, 26 Nov 2002 09:10:51 +1300
Hi -
My guess would be that line where it tries to join on a negative
location. That doesn't seem to make a whole lot of sense and suggests to
me an error in that record.
Can any embl experts confirm that?
- Mark
> -----Original Message-----
> From: saerts [mailto:saerts@mailserv.esat.kuleuven.ac.be]
> Sent: Tuesday, 26 November 2002 12:10 a.m.
> To: biojava-l@biojava.org
> Subject: [Biojava-l] Ensembl read problems
>
>
>
> Hello,
> I'm still experiencing problems when reading the EMBL
> formatted output of the "export data" of Ensembl. Both for
> human and mouse, although the parsing errors are different in
> both cases. I'm using a BioJava CVS-build of last week
>
> In one case, no features can be parsed (human).
> In the other case, some of the locations are out of the
> source location (mouse).
>
> I'm using the code and the sequences below, and the errors of
> both cases are pasted in this mail.
>
> Does anyone have an idea why this goes wrong?
>
> Thanks!
>
> Stein Aerts.
>
>
> ##################################
> code
> ###################################
> BufferedReader br = new BufferedReader(new FileReader("temp.embl"));
> SequenceIterator stream = SeqIOTools.readEmbl(br);
> Sequence seq = stream.nextSequence();
>
> ##################################
> case 1: ccnd1_human.embl
> ###################################
> This line could not be parsed: CDS
> join(-1151..-840,1654..1777,1995..2434)
> This line could not be parsed: 12014..12175)
> This line could not be parsed: CDS
> join(3927..4460,4728..4887,8890..9038,12014..12178)
> This line could not be parsed: exon -1151..-840
> This line could not be parsed: exon 1654..1777
> This line could not be parsed: exon 1995..2434
> This line could not be parsed: exon 2001..2407
> This line could not be parsed: exon 3927..4460
> This line could not be parsed: exon 3927..4142
> This line could not be parsed: exon 4728..4887
> This line could not be parsed: exon 4728..4887
> This line could not be parsed: exon 8890..9038
> This line could not be parsed: exon 8890..9038
> This line could not be parsed: exon 12014..12178
> This line could not be parsed: exon 12014..15370
> This line could not be parsed: variation complement(64..64)
> This line could not be parsed: variation 835..835
> This line could not be parsed: variation 838..838
> This line could not be parsed: variation 862..862
> This line could not be parsed: variation 1235..1235
> This line could not be parsed: variation 1579..1579
> This line could not be parsed: variation 1779..1779
> This line could not be parsed: variation 1948..1948
> This line could not be parsed: variation 2297..2297
> This line could not be parsed: variation 2365..2365
> This line could not be parsed: variation 2605..2605
> This line could not be parsed: variation 2612..2612
> This line could not be parsed: variation 2627..2627
> This line could not be parsed: variation 2634..2634
> This line could not be parsed: variation 2729..2729
> This line could not be parsed: variation 2777..2777
> This line could not be parsed: variation 2875..2875
> This line could not be parsed: variation 2878..2878
> This line could not be parsed: variation 3029..3029
> This line could not be parsed: variation 3421..3421
> This line could not be parsed: variation 4116..4116
> This line could not be parsed: variation 4117..4117
> This line could not be parsed: variation 4161..4161
> This line could not be parsed: variation 4177..4177
> This line could not be parsed: variation 5141..5141
> This line could not be parsed: variation 5174..5174
> This line could not be parsed: variation 5277..5277
> This line could not be parsed: variation 5539..5539
> This line could not be parsed: variation 6708..6708
> This line could not be parsed: variation 7310..7310
> This line could not be parsed: variation 7328..7328
> This line could not be parsed: variation 7341..7341
> This line could not be parsed: variation 7502..7502
> This line could not be parsed: variation 7518..7518
> This line could not be parsed: variation 7522..7522
> This line could not be parsed: variation 7859..7859
> This line could not be parsed: variation 8065..8065
> This line could not be parsed: variation 8118..8120
> This line could not be parsed: variation 8218..8218
> This line could not be parsed: variation 8314..8314
> This line could not be parsed: variation 8330..8330
> This line could not be parsed: variation 8548..8548
> This line could not be parsed: variation 8770..8770
> This line could not be parsed: variation 9038..9038
> This line could not be parsed: variation 9609..9609
> This line could not be parsed: variation 9807..9807
> This line could not be parsed: variation 10191..10191
> This line could not be parsed: variation 10482..10482
> This line could not be parsed: variation 10488..10488
> This line could not be parsed: variation 10513..10513
> This line could not be parsed: variation 10921..10921
> This line could not be parsed: variation 11025..11025
> This line could not be parsed: variation 11112..11112
> This line could not be parsed: variation 11221..11221
> This line could not be parsed: variation 11464..11464
> This line could not be parsed: variation 11534..11534
> This line could not be parsed: variation 11540..11540
> This line could not be parsed: variation 11635..11635
> This line could not be parsed: variation 11809..11809
> This line could not be parsed: variation 11988..11988
> This line could not be parsed: variation 12112..12112
> This line could not be parsed: variation 12243..12243
> This line could not be parsed: variation 12321..12321
> This line could not be parsed: variation 12865..12865
> This line could not be parsed: variation 12884..12884
> This line could not be parsed: variation
> complement(12900..12900)
> This line could not be parsed: variation 13000..13000
> This line could not be parsed: variation 13012..13012
> This line could not be parsed: variation 13090..13090
> This line could not be parsed: variation 13294..13294
> This line could not be parsed: variation 13465..13465
> This line could not be parsed: variation 13779..13779
> This line could not be parsed: variation 14048..14048
> This line could not be parsed: variation 14095..14095
> This line could not be parsed: variation 14358..14358
> This line could not be parsed: variation 14486..14486
> This line could not be parsed: variation 14490..14490
> This line could not be parsed: variation 14504..14504
> This line could not be parsed: variation 14636..14636
> This line could not be parsed: variation 14714..14714
> This line could not be parsed: variation 14796..14796
> This line could not be parsed: variation 14904..14904
> This line could not be parsed: variation 14939..14939
> This line could not be parsed: variation 14966..14966
> This line could not be parsed: variation 15158..15158
> This line could not be parsed: variation 15273..15273
> This line could not be parsed: variation 15338..15338
> This line could not be parsed: variation 15346..15346
> This line could not be parsed: variation 15463..15463
> This line could not be parsed: variation 15556..15556
> This line could not be parsed: variation 15606..15606
> This line could not be parsed: variation 15677..15677
> This line could not be parsed: variation 15778..15782
> This line could not be parsed: variation 15880..15880
> This line could not be parsed: variation 15975..15975
> This line could not be parsed: variation 16031..16031
> This line could not be parsed: variation 16053..16053
> This line could not be parsed: variation 16062..16062
> This line could not be parsed: variation 16114..16114
> This line could not be parsed: variation 16421..16421
> This line could not be parsed: variation 16511..16511
> Chromosome|17370|gcctagtaac
>
> ##################################
> case 2: ccnd1_mus.embl
> ###################################
> D:\JBuilder7\jdk1.3.1\bin\javaw -classpath
> "D:\SAE\Projects\dvl\classes;D:\SAE\Projects\sista.sequence\Mo
> tifFinding\classes;D:\SAE\Projects\sista.sequence\SequenceView
> er\classes;D:\SAE\java\jars\mm.mysql-2.0.14-bin.jar;D:\SAE\jav
a\jars\servlet.jar;D:\SAE\java\data4s.clustering\clustering.jar;D:>
\SAE\java\jars\lapack\classes.zip;D:\SAE\java\jars\webwise\web
wisefree.jar;D:\JBuilder7\lib\xerces.jar;D:\SAE\java\jars\oro\jakarta->
oro-2.0.4\jakarta-oro-2.0.4.jar;D:\SAE\java\ensj\ensj.zip;D:\S
> AE\java\jars\jaxp.jar;D:\JBuilder7\lib\junit.jar;D:\JBuilder7\
> lib\unittest.jar;D:\Program
> Files\jython\jython.jar;D:\Program
> Files;D:\SAE\java\jars\soap.jar;D:\SAE\java\jars\activation.ja
> r;D:\SAE\java\jars\mail.jar;D:\SAE\java\jars\regexp\jakarta-re
gexp-1.2.jar;U:\Projects\biojava\biojava-ensembl\ant-build\bj->
ensembl.jar;U:\Projects\biojava\ensembl\ant-build\ensembl-j.ja
> r;U:\Projects\biojava\biojava-live\ant-build\biojava.jar;D:\JB
> uilder7\jdk1.3.1\demo\jfc\Java2D\Java2Demo.jar;D:\JBuilder7\jd
k1.3.1\jre\lib\i18n.jar;D:\JBuilder7\jdk1.3.1\jre\lib\jaws.jar;D:>
\JBuilder7\jdk1.3.1\jre\lib\rt.jar;D:\JBuilder7\jdk1.3.1\jre\l
> ib\sunrsasign.jar;D:\JBuilder7\jdk1.3.1\lib\dt.jar;D:\JBuilder
> 7\jdk1.3.1\lib\htmlconverter.jar;D:\JBuilder7\jdk1.3.1\lib\tools.jar"
> -Xmx500m -Djdbc.drivers=org.gjt.mm.mysql.Driver ReadEmbl
> java.lang.IllegalArgumentException: Location [15972,16065] is
> outside 1..12647
>
> at
> org.biojava.bio.seq.impl.SimpleFeature.<init>(SimpleFeature.java:306)
>
> at
> org.biojava.bio.seq.impl.SimpleStrandedFeature.<init>(SimpleSt
randedFeature.java:74)
>
> at java.lang.reflect.Constructor.newInstance(Native Method)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:138)
>
> rethrown as org.biojava.bio.BioException: Couldn't realize feature
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:144)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer.realizeFeature(Simpl
eFeatureRealizer.java:94)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.realizeFeature(SimpleS
> equence.java:199)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.createFeature(SimpleSe
> quence.java:205)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderBase.makeSequence(Sequen
> ceBuilderBase.java:168)
>
> at
> org.biojava.bio.seq.io.SimpleSequenceBuilder.makeSequence(Simp
> leSequenceBuilder.java:83)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderFilter.makeSequence(Sequ
> enceBuilderFilter.java:98)
>
> at
> org.biojava.bio.seq.io.StreamReader.nextSequence(StreamReader.
> java:101)
>
> at ReadEmbl.main(ReadEmbl.java:19)
>
> java.lang.IllegalArgumentException: Location 12061, 33547
> {([12061,12216]), ([15972,16065]), ([33387,33547])} is
> outside 1..12647
>
> at
> org.biojava.bio.seq.impl.SimpleFeature.<init>(SimpleFeature.java:306)
>
> at
> org.biojava.bio.seq.impl.SimpleStrandedFeature.<init>(SimpleSt
randedFeature.java:74)
>
> at java.lang.reflect.Constructor.newInstance(Native Method)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:138)
>
> rethrown as org.biojava.bio.BioException: Couldn't realize feature
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:144)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer.realizeFeature(Simpl
eFeatureRealizer.java:94)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.realizeFeature(SimpleS
> equence.java:199)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.createFeature(SimpleSe
> quence.java:205)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderBase.makeSequence(Sequen
> ceBuilderBase.java:168)
>
> at
> org.biojava.bio.seq.io.SimpleSequenceBuilder.makeSequence(Simp
> leSequenceBuilder.java:83)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderFilter.makeSequence(Sequ
> enceBuilderFilter.java:98)
>
> at
> org.biojava.bio.seq.io.StreamReader.nextSequence(StreamReader.
> java:101)
>
> at ReadEmbl.main(ReadEmbl.java:19)
>
> java.lang.IllegalArgumentException: Location [33387,33547] is
> outside 1..12647
>
> at
> org.biojava.bio.seq.impl.SimpleFeature.<init>(SimpleFeature.java:306)
>
> at
> org.biojava.bio.seq.impl.SimpleStrandedFeature.<init>(SimpleSt
randedFeature.java:74)
>
> at java.lang.reflect.Constructor.newInstance(Native Method)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:138)
>
> rethrown as org.biojava.bio.BioException: Couldn't realize feature
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer$TemplateImpl.realize
> (SimpleFeatureRealizer.java:144)
>
> at
> org.biojava.bio.seq.SimpleFeatureRealizer.realizeFeature(Simpl
eFeatureRealizer.java:94)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.realizeFeature(SimpleS
> equence.java:199)
>
> at
> org.biojava.bio.seq.impl.SimpleSequence.createFeature(SimpleSe
> quence.java:205)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderBase.makeSequence(Sequen
> ceBuilderBase.java:168)
>
> at
> org.biojava.bio.seq.io.SimpleSequenceBuilder.makeSequence(Simp
> leSequenceBuilder.java:83)
>
> at
> org.biojava.bio.seq.io.SequenceBuilderFilter.makeSequence(Sequ
> enceBuilderFilter.java:98)
>
> at
> org.biojava.bio.seq.io.StreamReader.nextSequence(StreamReader.
> java:101)
>
> at ReadEmbl.main(ReadEmbl.java:19)
>
> Chromosome|12647|nnnnnnnnnn
>
> ##################################
> ccnd1_human.embl
> ###################################
>
> ID Chromosome 11 71948701 to 71966070 ENSEMBL; DNA; HUM; 17370 BP.
> XX
> AC Chromosome 11 71948701 to 71966070;
> XX
> SV NO_SV_NUMBER
> XX
> DE Reannotated sequence via Ensembl
> XX
> KW HTG.
> XX
> OS Homo sapiens (Human)
> OC Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia;
> Eutheria; Primates;
> OC Catarrhini; Hominidae; Homo.
> XX
> CC This sequence was reannotated via the Ensembl system.
> Please visit the
> CC Ensembl web site, http://www.ensembl.org/ for more information.
> XX
> CC The reference, comment, description and feature table of
> the original
> CC entry can be found in the DDBJ/EMBL/GenBank database
> with the identical
> CC accession number.
> XX
> CC The /gene indicates a unique id for a gene, /cds a
> unique id for a
> CC translation and a /exon a unique id for an exon. These
> ids are maintained
> CC wherever possible between versions. For more information
> on how to
> CC interpret the feature table, please visit
> CC http://www.ensembl.org/Docs/embl.html.
> XX
> CC All the exons and transcripts in Ensembl are confirmed
> by similarity to
> CC either protein or cDNA sequences.
> XX
> CC In unfinished, rough draft DNA sequence gene structures can cross
> CC fragments and, in these cases, the order and orientation
> of the fragments
> CC is likely to be different from the order in the the International
> CC Nucleotide Sequence Databases DDBJ/EMBL/GenBank.
> XX
> FH Key Location/Qualifiers
> FH
> FT source 1..17370
> FT /classification="sapiens, Homo,
> Hominidae, Catarrhini,
> FT Primates, Eutheria, Mammalia,
> Vertebrata, Chordata,
> FT Metazoa, Eukaryota"
> FT /organism="Human"
> FT CDS join(-1151..-840,1654..1777,1995..2434)
> FT /transcript="AP001824.4.1.124888.120532.124117"
> FT
> /translation="MGGAVRLSMCRACGCLGWALGPEFAPAHLAVLGRLGSALRKRSRA
> FT
> AIARGPPRAPQKQAVWGEGELERLRSPGAGFRKRFPGRLKGLTMENSRSLSQVLSSRRQ
> FT
> AACGLPAFLVVPYCRATSTSPPKSRGTHSRRTGPPAPLFPGVTSHGLQGSFVEVAKSWS
> FT
> LQRAVGAVAASSRVRTLRRGAEEREGARGSRSESRARTQPGPTALPSCPGRAPAMEHQL
> FT
> LCCEVETIRRAYPDANLLNDRVLRAMLKAEETCAPSVSYFKCVQKEVLPSMRKIVATWM
> FT LEVRGFGRLS"
> FT CDS join(2210..2407,3927..4142,4728..4887,8890..9038,
> FT 12014..12175)
> FT /gene="ENSG00000110092"
> FT /cds="ENSP00000227507"
> FT /transcript="ENST00000227507"
> FT /db_xref="GO:GO:0000074"
> FT /db_xref="GO:GO:0005634"
> FT /db_xref="GO:GO:0008372"
> FT /db_xref="GO:GO:0016288"
> FT /db_xref="GO:GO:0016538"
> FT /db_xref="GO:GO:0000082"
> FT /db_xref="RefSeq:NM_053056"
> FT /db_xref="MIM:168461"
> FT /db_xref="LocusLink:595"
> FT /db_xref="HUGO:1582"
> FT /db_xref="RefSeq:NM_001758"
> FT /db_xref="LocusLink:893"
> FT /db_xref="SWISSPROT:P24385"
> FT /db_xref="EMBL:X59798"
> FT /db_xref="protein_id:CAA42470"
> FT /db_xref="EMBL:M74092"
> FT /db_xref="EMBL:M64349"
> FT /db_xref="protein_id:AAA52136"
> FT /db_xref="EMBL:M73554"
> FT /db_xref="protein_id:AAA58392"
> FT /db_xref="EMBL:Z23022"
> FT /db_xref="protein_id:CAA80558"
> FT /db_xref="EMBL:BC000076"
> FT /db_xref="protein_id:AAH00076"
> FT /db_xref="EMBL:BC001501"
> FT /db_xref="protein_id:AAH01501"
> FT /db_xref="EMBL:BC014078"
> FT /db_xref="protein_id:AAH14078"
> FT /db_xref="MIM:151400"
> FT
> /translation="MEHQLLCCEVETIRRAYPDANLLNDRVLRAMLKAEETCAPSVSYF
> FT
> KCVQKEVLPSMRKIVATWMLEVCEEQKCEEEVFPLAMNYLDRFLSLEPVKKSRLQLLGA
> FT
> TCMFVASKMKETIPLTAEKLCIYTDNSIRPEELLQMELLLVNKLKWNLAAMTPHDFIEH
> FT
> FLSKMPEAEENKQIIRKHAQTFVALCATDVKFISNPPSMVAAGSVVAAVQGLNLRSPNN
> FT
> FLSYYRLTRFLSRVIKCDPDCLRACQEQIEALLESSLRQAQQNMDPKAAEEEEEEEEEV
> FT DLACTPTDVRDVDI"
> FT CDS
> join(3927..4460,4728..4887,8890..9038,12014..12178)
> FT /transcript="AP001888.4.1.161505.2802.12770"
> FT
> /translation="VCEEQKCEEEVFPLAMNYLDRFLSLEPVKKSRLQLLGATCMFVAS
> FT
> KMKETIPLTAEKLCIYTDNSIRPEELLVTTGPRRPPPPASRTQDHGAGEGAGGGGRPAS
> FT
> DISAPPREGGPAAGRPCPGSGRDPSRPRPAALCALACDSHRVRAPRCGRKVGGARPPAA
> FT
> ARGAPRSALSLQFQVQMELLLVNKLKWNLAAMTPHDFIEHFLSKMPEAEENKQIIRKHA
> FT
> QTFVALCATDVKFISNPPSMVAAGSVVAAVQGLNLRSPNNFLSYYRLTRFLSRVIKCDP
> FT
> DCLRACQEQIEALLESSLRQAQQNMDPKAAEEEEEEEEEVDLACTPTDVRDVDI"
> FT exon -1151..-840
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 1654..1777
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 1995..2434
> FT /exon_id=""
> FT /start_phase=1
> FT /end_phase=1
> FT exon 2001..2407
> FT /exon_id="ENSE00001099159"
> FT /start_phase="-1"
> FT /end_phase="-1"
> FT exon 3927..4460
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 3927..4142
> FT /exon_id="ENSE00001064288"
> FT /start_phase=0
> FT /end_phase=0
> FT exon 4728..4887
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 4728..4887
> FT /exon_id="ENSE00000737399"
> FT /start_phase=0
> FT /end_phase=0
> FT exon 8890..9038
> FT /exon_id=""
> FT /start_phase=1
> FT /end_phase=1
> FT exon 8890..9038
> FT /exon_id="ENSE00000894874"
> FT /start_phase=1
> FT /end_phase=1
> FT exon 12014..12178
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 12014..15370
> FT /exon_id="ENSE00000894873"
> FT /start_phase=0
> FT /end_phase=0
> FT variation complement(64..64)
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1944129"
> FT /db_xref="TSC-CSHL:TSC1016673"
> FT /db_xref="HGBASE:SNP000218375"
> FT variation 835..835
> FT /replace="T|G"
> FT /note="heterozygosity=0.0758316"
> FT /note="heterozygosity_std_error=0.0411453"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212859"
> FT variation 838..838
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:954618"
> FT /db_xref="TSC-CSHL:TSC0069836"
> FT variation 862..862
> FT /replace="C|T"
> FT /note="heterozygosity=0.0878642"
> FT /note="heterozygosity_std_error=0.0436568"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:954619"
> FT /db_xref="TSC-CSHL:TSC0069837"
> FT /db_xref="HGBASE:SNP000493743"
> FT variation 1235..1235
> FT /replace="T|A"
> FT /note="heterozygosity=0.0772231"
> FT /note="heterozygosity_std_error=0.0387438"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212860"
> FT /db_xref="HGBASE:SNP000527181"
> FT variation 1579..1579
> FT /replace="A|G"
> FT /note="heterozygosity=0.0253133"
> FT /note="heterozygosity_std_error=0.0248237"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212861"
> FT /db_xref="HGBASE:SNP001055438"
> FT variation 1779..1779
> FT /replace="T|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017616"
> FT variation 1948..1948
> FT /replace="A|C"
> FT /note="heterozygosity=0.0116949"
> FT /note="heterozygosity_std_error=0.0163939"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212862"
> FT /db_xref="HGBASE:SNP000108556"
> FT variation 2297..2297
> FT /replace="G|T"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2220247"
> FT /db_xref="TSC-CSHL:TSC1283618"
> FT /db_xref="HGBASE:SNP000890098"
> FT variation 2365..2365
> FT /replace="G|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2930976"
> FT variation 2605..2605
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017617"
> FT variation 2612..2612
> FT /replace="C|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2930977"
> FT /db_xref="HGBASE:SNP000454984"
> FT variation 2627..2627
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017618"
> FT /db_xref="HGBASE:SNP000367599"
> FT variation 2634..2634
> FT /replace="C|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017619"
> FT variation 2729..2729
> FT /replace="A|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017620"
> FT /db_xref="HGBASE:SNP000140400"
> FT variation 2777..2777
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2930978"
> FT variation 2875..2875
> FT /replace="T|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2930979"
> FT variation 2878..2878
> FT /replace="C|T"
> FT /note="heterozygosity=0.0342755"
> FT /note="heterozygosity_std_error=0.0272485"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212863"
> FT variation 3029..3029
> FT /replace="C|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3017621"
> FT variation 3421..3421
> FT /replace="C|T"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1352075"
> FT /db_xref="TSC-CSHL:TSC0484521"
> FT /db_xref="HGBASE:SNP000427023"
> FT variation 4116..4116
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1050971"
> FT /db_xref="HGBASE:SNP001438239"
> FT variation 4117..4117
> FT /replace="A|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1131439"
> FT /db_xref="HGBASE:SNP001423091"
> FT variation 4161..4161
> FT /replace="G|C"
> FT /note="heterozygosity=0.0222195"
> FT /note="heterozygosity_std_error=0.0218437"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212864"
> FT /db_xref="HGBASE:SNP000173096"
> FT variation 4177..4177
> FT /replace="CCCCCCG|-"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212865"
> FT /db_xref="HGBASE:SNP000553120"
> FT variation 5141..5141
> FT /replace="A|G"
> FT /note="heterozygosity=0.0713305"
> FT /note="heterozygosity_std_error=0.0388588"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212866"
> FT /db_xref="HGBASE:SNP000220349"
> FT variation 5174..5174
> FT /replace="A|G"
> FT /note="heterozygosity=0.0243872"
> FT /note="heterozygosity_std_error=0.0239333"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:616294"
> FT /db_xref="HGBASE:SNP000366177"
> FT variation 5277..5277
> FT /replace="A|G"
> FT /note="heterozygosity=0.0917408"
> FT /note="heterozygosity_std_error=0.0424857"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212867"
> FT /db_xref="HGBASE:SNP000442103"
> FT variation 5539..5539
> FT /replace="G|C"
> FT /note="heterozygosity=0.0285659"
> FT /note="heterozygosity_std_error=0.0279412"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212868"
> FT /db_xref="HGBASE:SNP000455059"
> FT variation 6708..6708
> FT /replace="A|G"
> FT /note="heterozygosity=0.0570938"
> FT /note="heterozygosity_std_error=0.0344965"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212869"
> FT variation 7310..7310
> FT /replace="T|C"
> FT /note="heterozygosity=0.0665879"
> FT /note="heterozygosity_std_error=0.0364269"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212870"
> FT /db_xref="HGBASE:SNP000890321"
> FT variation 7328..7328
> FT /replace="A|C"
> FT /note="heterozygosity=0.011428"
> FT /note="heterozygosity_std_error=0.0160227"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212871"
> FT /db_xref="HGBASE:SNP000455060"
> FT variation 7341..7341
> FT /replace="T|C"
> FT /note="heterozygosity=0.0227238"
> FT /note="heterozygosity_std_error=0.0223308"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212872"
> FT variation 7502..7502
> FT /replace="G|C"
> FT /note="heterozygosity=0.49596"
> FT /note="heterozygosity_std_error=0.00949059"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:586459"
> FT /db_xref="HGBASE:SNP000137014"
> FT variation 7518..7518
> FT /replace="T|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:646056"
> FT /db_xref="HGBASE:SNP000956048"
> FT variation 7522..7522
> FT /replace="G|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:646064"
> FT /db_xref="HGBASE:SNP000956049"
> FT variation 7859..7859
> FT /replace="T|C"
> FT /note="heterozygosity=0.492742"
> FT /note="heterozygosity_std_error=0.0131291"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:647451"
> FT /db_xref="HGBASE:SNP000137313"
> FT variation 8065..8065
> FT /replace="C|T"
> FT /note="heterozygosity=0.011428"
> FT /note="heterozygosity_std_error=0.0160227"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212873"
> FT variation 8118..8120
> FT /replace="-|GAG"
> FT /note="heterozygosity=0.0227238"
> FT /note="heterozygosity_std_error=0.0223308"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212874"
> FT /db_xref="HGBASE:SNP000494495"
> FT variation 8218..8218
> FT /replace="A|T"
> FT /note="heterozygosity=0.0116949"
> FT /note="heterozygosity_std_error=0.0163939"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212875"
> FT variation 8314..8314
> FT /replace="G|C"
> FT /note="heterozygosity=0.0116949"
> FT /note="heterozygosity_std_error=0.0163939"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212876"
> FT variation 8330..8330
> FT /replace="A|T|C"
> FT /note="heterozygosity=0.0568555"
> FT /note="heterozygosity_std_error=0.0346319"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212877"
> FT variation 8548..8548
> FT /replace="T|C"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212878"
> FT /db_xref="HGBASE:SNP000470651"
> FT variation 8770..8770
> FT /replace="G|A|C"
> FT /note="heterozygosity=0.492188"
> FT /note="heterozygosity_std_error=0.0132213"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:602652"
> FT /db_xref="HGBASE:SNP000214986"
> FT variation 9038..9038
> FT /replace="G|A"
> FT /note="heterozygosity=0.499893"
> FT /note="heterozygosity_std_error=0.000550762"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:603965"
> FT variation 9609..9609
> FT /replace="-|G"
> FT /note="heterozygosity=0.5"
> FT /note="heterozygosity_std_error=0.000141421"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212879"
> FT /db_xref="HGBASE:SNP000494496"
> FT variation 9807..9807
> FT /replace="G|A"
> FT /note="heterozygosity=0.0814845"
> FT /note="heterozygosity_std_error=0.0538736"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212880"
> FT /db_xref="HGBASE:SNP000220350"
> FT variation 10191..10191
> FT /replace="T|C"
> FT /note="heterozygosity=0.0219751"
> FT /note="heterozygosity_std_error=0.0216077"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212881"
> FT variation 10482..10482
> FT /replace="A|G"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0110789"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212882"
> FT variation 10488..10488
> FT /replace="A|G"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212883"
> FT variation 10513..10513
> FT /replace="T|C"
> FT /note="heterozygosity=0.0331398"
> FT /note="heterozygosity_std_error=0.02637"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212884"
> FT variation 10921..10921
> FT /replace="G|A"
> FT /note="heterozygosity=0.499419"
> FT /note="heterozygosity_std_error=0.00363462"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:649392"
> FT /db_xref="HGBASE:SNP000575116"
> FT variation 11025..11025
> FT /replace="T|C"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212885"
> FT variation 11112..11112
> FT /replace="T|C"
> FT /note="heterozygosity=0.0115604"
> FT /note="heterozygosity_std_error=0.0162065"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212886"
> FT /db_xref="HGBASE:SNP000220351"
> FT variation 11221..11221
> FT /replace="T|C"
> FT /note="heterozygosity=0.0219751"
> FT /note="heterozygosity_std_error=0.0216077"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212887"
> FT variation 11464..11464
> FT /replace="A|G"
> FT /note="heterozygosity=0.0224697"
> FT /note="heterozygosity_std_error=0.022085"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212888"
> FT variation 11534..11534
> FT /replace="T|C"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0110789"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212889"
> FT /db_xref="HGBASE:SNP000694930"
> FT variation 11540..11540
> FT /replace="G|A"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212890"
> FT /db_xref="HGBASE:SNP000694931"
> FT variation 11635..11635
> FT /replace="C|A"
> FT /note="heterozygosity=0.485797"
> FT /note="heterozygosity_std_error=0.0176102"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212891"
> FT /db_xref="HGBASE:SNP000956443"
> FT variation 11809..11809
> FT /replace="G|A"
> FT /note="heterozygosity=0.458272"
> FT /note="heterozygosity_std_error=0.0291535"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2510467"
> FT /db_xref="HGBASE:SNP000140079"
> FT variation 11988..11988
> FT /replace="G|A"
> FT /note="heterozygosity=0.24013"
> FT /note="heterozygosity_std_error=0.0538745"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212892"
> FT variation 12112..12112
> FT /replace="G|A"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3206821"
> FT /db_xref="HGBASE:SNP000220346"
> FT variation 12243..12243
> FT /replace="A|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:7177"
> FT /db_xref="HGBASE:SNP000481163"
> FT variation 12321..12321
> FT /replace="C|A"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1803356"
> FT /db_xref="HGBASE:SNP000011291"
> FT /db_xref="HGBASE:SNP000314898"
> FT variation 12865..12865
> FT /replace="C|G"
> FT /note="heterozygosity=0.499937"
> FT /note="heterozygosity_std_error=0.00119942"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:678653"
> FT /db_xref="HGBASE:SNP000586739"
> FT variation 12884..12884
> FT /replace="T|C"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212893"
> FT variation complement(12900..12900)
> FT /replace="G|A"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2510607"
> FT /db_xref="HGBASE:SNP000350269"
> FT variation 13000..13000
> FT /replace="T|C"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212894"
> FT variation 13012..13012
> FT /replace="A|G"
> FT /note="heterozygosity=0.0222195"
> FT /note="heterozygosity_std_error=0.0218437"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212895"
> FT /db_xref="HGBASE:SNP000643371"
> FT variation 13090..13090
> FT /replace="T|G"
> FT /note="heterozygosity=0.0224697"
> FT /note="heterozygosity_std_error=0.022085"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212896"
> FT /db_xref="HGBASE:SNP000494497"
> FT variation 13294..13294
> FT /replace="G|T"
> FT /note="heterozygosity=0.0118332"
> FT /note="heterozygosity_std_error=0.016586"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212897"
> FT variation 13465..13465
> FT /replace="T|C"
> FT /note="heterozygosity=0.0118332"
> FT /note="heterozygosity_std_error=0.016586"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212898"
> FT variation 13779..13779
> FT /replace="A|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:2062445"
> FT /db_xref="TSC-CSHL:TSC1095355"
> FT /db_xref="HGBASE:SNP000172380"
> FT variation 14048..14048
> FT /replace="A|G"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212899"
> FT /db_xref="HGBASE:SNP000254637"
> FT variation 14095..14095
> FT /replace="A|G"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212900"
> FT /db_xref="HGBASE:SNP000334741"
> FT variation 14358..14358
> FT /replace="C|A"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212901"
> FT /db_xref="HGBASE:SNP000587591"
> FT variation 14486..14486
> FT /replace="A|G"
> FT /note="heterozygosity=0.0224697"
> FT /note="heterozygosity_std_error=0.022085"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212902"
> FT variation 14490..14490
> FT /replace="A|G"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212903"
> FT variation 14504..14504
> FT /replace="T|C"
> FT /note="heterozygosity=0.0224697"
> FT /note="heterozygosity_std_error=0.022085"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212904"
> FT variation 14636..14636
> FT /replace="T|A"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1051062"
> FT /db_xref="HGBASE:SNP000011058"
> FT variation 14714..14714
> FT /replace="A|G"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212905"
> FT /db_xref="HGBASE:SNP000334742"
> FT variation 14796..14796
> FT /replace="A|T"
> FT /note="heterozygosity=0.0111728"
> FT /note="heterozygosity_std_error=0.0156679"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212906"
> FT variation 14904..14904
> FT /replace="T|C"
> FT /note="heterozygosity=0.044421"
> FT /note="heterozygosity_std_error=0.0303298"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212907"
> FT /db_xref="HGBASE:SNP000989971"
> FT variation 14939..14939
> FT /replace="T|C"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212908"
> FT /db_xref="HGBASE:SNP000220352"
> FT variation 14966..14966
> FT /replace="G|T"
> FT /note="heterozygosity=0.0112994"
> FT /note="heterozygosity_std_error=0.0158436"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212909"
> FT variation 15158..15158
> FT /replace="A|G"
> FT /note="heterozygosity=0.059117"
> FT /note="heterozygosity_std_error=0.035657"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:7178"
> FT /db_xref="HGBASE:SNP000481164"
> FT variation 15273..15273
> FT /replace="C|A"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1051357"
> FT /db_xref="HGBASE:SNP000954786"
> FT variation 15338..15338
> FT /replace="T|G"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1803191"
> FT /db_xref="HGBASE:SNP000011057"
> FT /db_xref="HGBASE:SNP000172103"
> FT variation 15346..15346
> FT /replace="T|C"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:1803190"
> FT /db_xref="HGBASE:SNP000011056"
> FT variation 15463..15463
> FT /replace="T|C"
> FT /note="heterozygosity=0.0222195"
> FT /note="heterozygosity_std_error=0.0218437"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212910"
> FT variation 15556..15556
> FT /replace="A|G"
> FT /note="heterozygosity=0.0121216"
> FT /note="heterozygosity_std_error=0.0169853"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212911"
> FT variation 15606..15606
> FT /replace="T|C"
> FT /note="heterozygosity=0.0121216"
> FT /note="heterozygosity_std_error=0.0169853"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212912"
> FT /db_xref="HGBASE:SNP000643372"
> FT variation 15677..15677
> FT /replace="C|G"
> FT /note="heterozygosity=0.0359168"
> FT /note="heterozygosity_std_error=0.0285151"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212913"
> FT /db_xref="HGBASE:SNP000643373"
> FT variation 15778..15782
> FT /replace="-|GTGAC"
> FT /note="heterozygosity=0.0354908"
> FT /note="heterozygosity_std_error=0.0281872"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212914"
> FT variation 15880..15880
> FT /replace="T|C"
> FT /note="heterozygosity=0.0121216"
> FT /note="heterozygosity_std_error=0.0169853"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212915"
> FT /db_xref="HGBASE:SNP000470652"
> FT variation 15975..15975
> FT /replace="A|G"
> FT /note="heterozygosity=0.0121216"
> FT /note="heterozygosity_std_error=0.0169853"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212916"
> FT variation 16031..16031
> FT /replace="T|C"
> FT /note="heterozygosity=0.0115604"
> FT /note="heterozygosity_std_error=0.0162065"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212917"
> FT variation 16053..16053
> FT /replace="A|G"
> FT /note="heterozygosity=0.0459508"
> FT /note="heterozygosity_std_error=0.0313345"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212918"
> FT variation 16062..16062
> FT /replace="A|T"
> FT /note="heterozygosity=0.0118332"
> FT /note="heterozygosity_std_error=0.016586"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212919"
> FT /db_xref="HGBASE:SNP000538125"
> FT variation 16114..16114
> FT /replace="G|A"
> FT /note="heterozygosity=0.183844"
> FT /note="heterozygosity_std_error=0.0529259"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212920"
> FT /db_xref="HGBASE:SNP000517568"
> FT variation 16421..16421
> FT /replace="A|G"
> FT /note="heterozygosity=0.0124219"
> FT /note="heterozygosity_std_error=0.0174027"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212921"
> FT /db_xref="HGBASE:SNP000538126"
> FT variation 16511..16511
> FT /replace="G|A"
> FT /note="heterozygosity=0.161513"
> FT /note="heterozygosity_std_error=0.052613"
> FT /evidence="not_experimental"
> FT /db_xref="dbSNP:3212922"
> XX
> SQ Sequence 17370 BP; 3365 A; 4860 C; 5302 G; 3843 T; 0 other;
> gcctagtaac agcgccgcgc ccccattggc tcatgctaat tccagtttcc
> tctgtcttgc 60
> gcctgggatg ggggggtgaa gctccctcct ggacccagag ccggttgtgc
> cggagtgggc 120
> gagcctcttt atgccctgct gcccctagcc gacttcggcc cgcttcgcgc
> ctcgggctgg 180
> gccagggcgc acgcggggct cggggcccct cgccccacgg gatgggagag
> gccgggtgat 240
> agctccgggc cccataaatc atccaggcgg ccgccgggtc gggattttat
> gaatgaaaaa 300
> gcagctgggc cgcccttgtg cgcgggctga tgctctgagg cttggctatg
> cgggggccaa 360
> cgcgattgtg ggtgctcggg gagtgggggg gggcacgacc gtaggtgctc
> cctgctgggg 420
> caacccatcg ctccccatgc ggaatccggg ggtaattacc ccccaggacc
> cggaatatta 480
> gtaatcctaa ttcccggcgg gggagggggc gcgggaggaa ttcaccctga
> aaggtggggg 540
> tggggggggt cgcatcttgc tgtgagcacc ctggcgaagg ggagagggct
> ttttctatca 600
> gttttctttg agcttttact gttaagaggg tacggtggtt tgatgacact
> gaactatatt 660
> caaaaggaag taaatgaaca gttttcttaa tttggggcag gtactgtaaa
> aataaaaaca 720
> aaagttaaga cagtaaaatg tccttttatt ttttaatgca ccaaagagac
> agaacctgta 780
> attttaaaaa ctgtgtattt taatttacat ctgcttaagt ttgcgataat
> attggggacc 840
> ctctcatgta accacgaaca cctatcgatt ttgctaaaaa tcagatcagt
> acactcgttt 900
> gtttaattga taattgttct gaattatgcc ggctcctgcc agccccctca
> cgctcacgaa 960
> ttcagtccca gggcaaattc taaaggtgaa gggacgtcta cacccccaac
> aaaaccaatt 1020
> aggaaccttc ggtggtcttg tcccaggcag aggggactaa tatttccagc
> aatttaattt 1080
> cttttttaat taaaaaaaat gagtcagaat ggagatcact gtttctcagc
> tttccattca 1140
> gaggtgtgtt tctcccggtt aaattgccgg cacgggaagg gagggggtgc
> agttggggac 1200
> ccccgcaagg accgactggt caaggtagga aggcagcccg aagagtctcc
> aggctagaag 1260
> gacaagatga aggaaatgct ggccaccatc ttgggctgct gctggaattt
> tcgggcattt 1320
> attttatttt attttttgag cgagcgcatg ctaagctgaa atccctttaa
> cttttagggt 1380
> tacccccttg ggcatttgca acgacgcccc tgtgcgccgg aatgaaactt
> gcacaggggt 1440
> tgtgtgcccg gtcctccccg tccttgcatg ctaaattagt tcttgcaatt
> tacacgtgtt 1500
> aatgaaaatg aaagaagatg cagtcgctga gattctttgg ccgtctgtcc
> gcccgtgggt 1560
> gccctcgtgg cgttcttgga aatgcgccca ttctgccggc ttggatatgg
> ggtgtcgccg 1620
> cgccccagtc accccttctc gtggtctccc caggctgcgt gtggcctgcc
> ggccttccta 1680
> gttgtcccct actgcagagc cacctccacc tcacccccta aatcccgggg
> gacccactcg 1740
> aggcggacgg ggccccctgc acccctcttc cctggcggtg agaaaggctg
> cagcggggcg 1800
> atttgcattt ctatgaaaac cggactacag gggcaactcc gccgcagggc
> aggcgcggcg 1860
> cctcagggat ggcttttggg ctctgcccct cgctgctccc ggcgtttggc
> gcccgcgccc 1920
> cctccccctg cgcccgcccc cgcccccctc ccgctcccat tctctgccgg
> gctttgatct 1980
> ttgcttaaca acagtaacgt cacacggact acaggggagt tttgttgaag
> ttgcaaagtc 2040
> ctggagcctc cagagggctg tcggcgcagt agcagcgagc agcagagtcc
> gcacgctccg 2100
> gcgaggggca gaagagcgcg agggagcgcg gggcagcaga agcgagagcc
> gagcgcggac 2160
> ccagccagga cccacagccc tccccagctg cccaggaaga gccccagcca
> tggaacacca 2220
> gctcctgtgc tgcgaagtgg aaaccatccg ccgcgcgtac cccgatgcca
> acctcctcaa 2280
> cgaccgggtg ctgcgggcca tgctgaaggc ggaggagacc tgcgcgccct
> cggtgtccta 2340
> cttcaaatgt gtgcagaagg aggtcctgcc gtccatgcgg aagatcgtcg
> ccacctggat 2400
> gctggaggtg cggggcttcg ggcggctctc ttaagacttc cctgcaactt
> gttgcccaga 2460
> cccacgtttc tttgctactc acccccctcc cttctctccc gctagaactt
> tgaagtttgc 2520
> cgtggtgttt ctagggatcc gtattttcaa aataaaaatt gcgggtattt
> tctgaaggag 2580
> gaaggggtgg gggtgggggt gctaaaagta gggtttcgtg ggagggaaga
> aggcggtccg 2640
> ggaggggtgc cttcggagaa ggccagtgcc aggggcaccc caatgggccc
> gagggtgcgg 2700
> gctggcaggc tgggtgcgct ttgtgtccac cgcctgcgcc ccagcccggc
> tgcgcctcag 2760
> cggccgggag ccgccagctc cggggggagg gggcatagat ttgattttta
> aattaatatc 2820
> catggacacg tatgcaaggg ccgctcgtgc cagtattatg cgccatcttt
> gctcgtttat 2880
> tgcaaagcaa aagtgtttat taataattgg gggcagggtg ggggcgggga
> gcggccgccg 2940
> ggcgctgggg ccgcagctaa gggccgcgcg gctgccggga gcccgcggga
> ggggcgcagg 3000
> gacgcggcat gggtagtttt ggggggaccc cgctagggaa gggggggcct
> ttgttcaagc 3060
> agcgagtccc ggggcgcccc gaacgggcag cctgggccgg agagcacggc
> gagctgcaag 3120
> gtcgcgtggc ccccaagacg ccagggcttg atccccgtct gcagggatat
> cggcttggag 3180
> gaccttctcc gagcgagccg ggggcctggg agcacatttt cagaccttcg
> gtgggcgcct 3240
> gaggggcccg caagtatttt aaaataattt ttgaaagtgc ggcgtggtgc
> ccttgcgaga 3300
> gggaaacgcc gcccgcgccc agggggaagg gggggccccg gagtttgaat
> tcctggggct 3360
> ccccccggag cctgtaacga actcccaacc cccggcctgg gtaaagggtc
> gcccgagggt 3420
> cattttcagg gtttttttat gcacttagtt atttttttaa tatttttaaa
> tattttttga 3480
> aaagatgacg tctggggaaa tgcggcgcgg cggcctggga cgccaccttt
> gtgtctcgca 3540
> ggcgcggcgc ccaaccccgc ggcccgttcc gcggccccgc accccagttg
> gtgtcgaccc 3600
> ccagtcagag ggaccacgga gctccagggc gggccagggt cccgggggcc
> ggcagcccgc 3660
> gccgccgcgc acgccgccca gctgtgcccg ctcccgcccc caccgtgcca
> gcctcgcggg 3720
> gactttccct ttcagtttcg gggagggtgg gtactgggga cgcgcggggg
> agggggcgca 3780
> tcacgggaag ctcctgccgc ccccagcccc gacccctcgg cgccctccag
> acctggcggc 3840
> cctgccaagc gcgatggggg gtgcgggggc gtgcgggggg gcggcgcgac
> ctggcggcgg 3900
> cggtcacggg ccccgtgcct ccgtaggtct gcgaggaaca gaagtgcgag
> gaggaggtct 3960
> tcccgctggc catgaactac ctggaccgct tcctgtcgct ggagcccgtg
> aaaaagagcc 4020
> gcctgcagct gctgggggcc acttgcatgt tcgtggcctc taagatgaag
> gagaccatcc 4080
> ccctgacggc cgagaagctg tgcatctaca ccgacaactc catccggccc
> gaggagctgc 4140
> tggtaaccac tggaccccgc cgccccccgc cccccgcgag ccgcacgcag
> gaccacgggg 4200
> ccggggaagg tgcaggcggt ggcggccggc ccgcctctga catatctgct
> cctccgaggg 4260
> agggcggccc cgccgccggg cgtccctgtc cggggagcgg gcgggatcct
> agccgccctc 4320
> gtcccgccgc cctgtgtgcg cttgcctgcg actcccaccg cgttcgcgcc
> ccgcggtgtg 4380
> gccgaaaagt gggcggcgcg cgccctccag cggctgcacg aggagcgccg
> cgctcggcgc 4440
> tgagcctcca gttccaggtg gtgggaggtc tttttgtttc cacttgcaga
> gtcttttcac 4500
> gcggcgggcg ccttttctgt tttgatctgg gattgcgtgt tgccccagct
> cccttgagtc 4560
> cccagcattc gccagccctc ccctccaaca tccaggaccg cacgagacgc
> aggggccagt 4620
> gctctgagcc ggaggtgcgg cgtggcccgg cccccgtgct gccggcttcc
> ccgcgccccc 4680
> gggctggccc gcacctcccc tgatggccgc tcaccctgtg ttcgcagcaa
> atggagctgc 4740
> tcctggtgaa caagctcaag tggaacctgg ccgcaatgac cccgcacgat
> ttcattgaac 4800
> acttcctctc caaaatgcca gaggcggagg agaacaaaca gatcatccgc
> aaacacgcgc 4860
> agaccttcgt tgccctctgt gccacaggta gggcaggccc ggcagccccc
> ggcctcccct 4920
> tgagagccgg ctccttaggt gaccctggcc ggcttcttgc tctccacctg
> ggtgctgtct 4980
> gggaagatgt ccccagaccc cctcctgcgc tggagagcgc tcttccagct
> ctggtgagca 5040
> gaggccctgg attgtttgtc gcgctggatg gagggagatt tgctccctca
> cggccaccat 5100
> gcagtacctt gggcattggt gtggacggct cagcctgcct gtgtcccgtt
> actctggcct 5160
> cgtccttcag gccaggcagc ctgtggccac tccatgctga aaggggttta
> ccttggccac 5220
> agggccgcct cctttctcca cccacctcca gcccttcttg tgtccttaag
> gagcctgagc 5280
> tgcagaggcc ccctcctggc ctctcccagg ctgggccacc tgccagaggc
> gcctccaggg 5340
> gcggggagag ctgtcggcct gcctgcacca cgtgctctgg gcagccgagt
> gcaggggtgt 5400
> ccagcagagg agctcggctg cctgaggccc tgccaggggt gccggcagcc
> agccgggctc 5460
> agctgagccc tgagggggcg cttcagagca ctctcagctt gggccgccac
> cgtgggcagc 5520
> agaagcaccc agtcctcact tcccctggca tggccccaga ggcccctccc
> tgacatggcc 5580
> ttggccccag aacccagtgg ggacagactc gcacatacac agggtgccgc
> ctcctgctgt 5640
> ccccagccct gcctctgacc cccctgtgac cgcctccttc cctggcccag
> gaggcctggt 5700
> taccttcatg ggggagcatg gccccatccc acccagctct gctgtggccc
> acctttggtc 5760
> aagcctcagt tgtcacatct gtttgggggc tcactctggg tgacctaggc
> cacaaggccc 5820
> acggggcatc aaagaggcag tagcatcttc tcccctcccc agagggcaga
> gccccccaag 5880
> cctacttcag agctcccttc tgacaccggt agcccgcagc cggtattcca
> gaatgggttc 5940
> tggtttaggc gtgaggcctc ccccacctcc tccacctgct tggggcatga
> acccctcccc 6000
> cacgtttcca agcgagtccc caaggtgggc agatgaagat gccaaggatg
> tcgaccagtc 6060
> tggatgggtc tggggtgggg gggcatgcgg cagacaggga ggcattctct
> ggctggtgct 6120
> cctcagagga gagaggcctc cggagactcc agacagcctt ttatggagct
> gaaagtggct 6180
> tcagagaaat gcaaagtttc ctggagagaa cgtggggcgt ggttcttgca
> cagcctccct 6240
> acagggtggc tccagcagtg gagctcccct cccaggaccc ctgggtgcta
> gtgggaggca 6300
> gtgggcaggt gcagattctc gtccttccca ctactgcaca ccctttgtct
> gcgaaggcgc 6360
> ccccagcggt gggtgaagga ggagggacac ttggggaccc agctgtgcac
> gtgctctcag 6420
> tgactgtgga gtccactcca gggtgggtcc cgagggaggg gcaggagacc
> aggggaccca 6480
> cccctgcaaa gtgctccggg tcctgacccg tggccacccc atggaacgta
> actgagcagc 6540
> cagtgccttg ttcctgctgg acatctgtgg agacaagagt gacttacggc
> tgcttaaagt 6600
> cagaaacagg ttgaaggagg tggaggcgtg ggaaagagtc taggaaggtg
> tttttgccct 6660
> ccacgtggca aaggttacat ttaaaggtga tgctgggtgt tctccctgca
> ctaggcattc 6720
> ctggccccag gtccccagca ggtgtgcaca tgctgcatac actcacgcat
> gggggtttca 6780
> gggcaggtgc gcccttggct ccgtgggagg ccaggtgagg aacgtccagt
> gccaaggagc 6840
> ttccgggaca gctgtcactt ccctttacaa ccaggcagcg gatagggtca
> aatcctggag 6900
> ctttggtgtc taattctggg tggctcctaa tctaagcaca gacagcacca
> cacactgggg 6960
> tgggggcacg agcttctgaa acaacgtggc cccagtgact ccacgctgtg
> tgtgcccctg 7020
> gagacggggg ggtgcacaag gtgcggagcc agctagaacc tgtcgctccc
> tgcagaagcg 7080
> gtttctgtgt gcggttctga tttgcctcaa tgagaaggtt ttcattcatg
> gctcccggct 7140
> ctcagactgg gtggaactgc tcccatttaa aggggaaaag aggtggctcg
> gctcgttaag 7200
> gatttctttt tctaagttgt tacggcgccc agcagccggc tttgtctccc
> cttcagggtg 7260
> gctgcctttc ttcccggccc ctcgccggcg gccctctctt taacaaggcc
> gaagttgttt 7320
> attctctcgg gatgaagtct cggatgggcc gccacacccc tggcggcccg
> tgggggcccc 7380
> tctccctttg tgcctgggtc ggctcccatt cagctccccc gacccccctt
> gttcccgggc 7440
> gctcagtggc gcgagatgag gcgatggggc cgacaaagat gccacactca
> tccctgccga 7500
> cgtccggctc ccagcccagg gcccctggtt cctgtgcaga attcctcgtg
> ggtgtgacaa 7560
> aaggctgccc ccaggctccg ctggggtggg ggccaggcca agaggcacat
> cccacactgg 7620
> cccacctgtc cacggtaggc gcatgactgc cctgaggagg ggaggccggc
> attccccgcc 7680
> acaaaccagg acgtaattgg tggcagggct ctctgtggaa agagccagtc
> tgctgtttgt 7740
> ctaggaggtc agtcacagag gccccgagac gcccactact gcagcctggc
> aggcggatga 7800
> gcccagtatc tggcagtgac cagagggagt tttgtgcaga ccacaaaggc
> tgatgggccg 7860
> ccctagattg gtgtccctct tggaagtggg cccagatgtg cgggacagtc
> cccaggaagc 7920
> cccaggtgag ggcactggtg ccctcttggg aaagctgctc cctcctgggg
> cccggctccc 7980
> ggcccagtcc tccaggggtg tcccatggtg actggtgcta ggaaccccac
> acctcttccc 8040
> ttacttggga agtcactgga attgttgggc tacatcagac ggcccagaaa
> agtgtttttg 8100
> tcatcggcca gaaataggag agttgtgagt agagggcccg ggtggagttg
> gggtgtactt 8160
> ggtctgtgct ctgaaggtca ctgtgacagt catggtccca tggtaagggg
> catgggttgc 8220
> tggaagagct cttccttccc gagtgagcca agccgggctc tcctggcgcc
> agggcctgag 8280
> ccgcagccac accacagccg ccctgaaggc tgccggccag ggcttacccc
> tcaagggaca 8340
> cggaatggct tcatcagtac cctgcagccc cgtggcctgg cccgggtgga
> ggcctaggct 8400
> tcagccatgc gatgtccctt cagaatatga cttgtctgca atccctgctg
> ctggggggtg 8460
> gcaggtactt ggggtgaggg ttagggtcat agaagcgaca tctctacgtc
> ctcatatttg 8520
> cgtcatctaa ttttgttttt gtgaatacgt gataacattc acaaggctca
> agatgctaaa 8580
> aggatgagaa ggcagtgatg tccccatcac ctgtcctgtg tcttcccgtg
> gctttctctt 8640
> tccttggtta tgtttgagtc aacagtgggg ctgacgttcc aggagggtcc
> gtgggccagg 8700
> ctcttgctct ccgagtgccc agggatggct ggaggctgag gagggcctgg
> atgtggagcc 8760
> tcagataccg agtgcttccc ttcaggccgg gccgcttgct cagagccagc
> acacagggat 8820
> gcccggatca cgggggccct gagagggtcc cctgctcaca gcctccttcc
> ctctctcctt 8880
> ctgcctcaga tgtgaagttc atttccaatc cgccctccat ggtggcagcg
> gggagcgtgg 8940
> tggccgcagt gcaaggcctg aacctgagga gccccaacaa cttcctgtcc
> tactaccgcc 9000
> tcacacgctt cctctccaga gtgatcaagt gtgacccggt aagtgagggt
> gatgtcccag 9060
> gcagccttgc cggggcttac agggggagac acctagtgcc acggaaatgc
> cgaggctggt 9120
> gccaaggccc ccaagggtga caaggttggg gctggggctg ggcccctcgg
> accccaggcc 9180
> acagactgac agggcaccgg cttcttccac tgctcctaga acttactgac
> tggctgggag 9240
> gtcctcacag ccttctcacg tcccctgggg cttccaggag ccgtagagtt
> tctgggcgaa 9300
> gcgtccggga cggaggcccc aggcggcccc agccaatggt ctgtgtggtg
> atggtgtgtg 9360
> gggttaggcc caggcgagct ttgtttgggc cacaatgtgc gtggccaata
> aatagatgct 9420
> tgaaaagggc tcctgtgagg tccgagacac cggacaacgg gcggatagag
> acagccttgt 9480
> tgtttacggc ctctttgaga ggctgctgct gttaaaccct gggatgactg
> tgtctttctt 9540
> cttaaaaatg ccattgtttt attcccgagt cttttcttaa agaaagaatt
> aaaatgacaa 9600
> tcaaaagggt ttgtggcatt taccaaatta gaccagagag gtggccgggt
> cagccgccgg 9660
> ccccgcggtg tgtgagggag tgaccgcctg accccagctt ggggctgggt
> gggcctgcaa 9720
> gacccgtttt ggctctggcc tgggccgcct cttggtggtc tgccctcgag
> cctcccgggg 9780
> actccgcacg ggtctcagca gatgctatct agggtccacc tgcctgtccc
> ctgcctagtg 9840
> gtgcctctgt cccggggaca ctgggagtag cggctgccca gcccatgtgt
> gtctcggaag 9900
> aggaagaagc ttttttgccg tgggacaccg aagttggcag gggcctccct
> tctgtgttct 9960
> cggccatggc ctcccttgca ccctgccccg tgttatcctt tgggggtggt
> gaggtgtcct 10020
> cacccgctgt agggtggagg ccagcagccc gcagctctct caggaaaatg
> gctcagaaac 10080
> accatcgagg cctccagaag cccagcaaag agaaagcccc tccatcaaaa
> tgaaactcgc 10140
> gtctgcactt ttcatttcga actccacgcc ctgagtgaaa accgcttccc
> cgccaggggt 10200
> gactgccctg ggatgttgct gtcttcgggc agttgtggga agttgggcgc
> tggcccttat 10260
> ttgagtagag accatcttaa ctagattgga ggcacacgtc tcacagctga
> cagacacacg 10320
> gggtgaagtt acccgaggcg gagtccactc tgcctgatca gctagtgacc
> aacgtagctg 10380
> agcccagact cagaaaaacc gtccacagca gaggcccctg cattttctag
> ggcgtgttct 10440
> agaattttct ttggtgggtg gaatgtccat ctgtgcaaat cgggtgcgca
> gtgccacaca 10500
> ccagtgactt ttcgcggagg agcgtgctgc ctttttggag cttctggctg
> tgggagaaca 10560
> gctttgtcca ccggggtagc cttgcaggca gctgtggggc cagaggaatg
> aaggaaggtc 10620
> ctggagtcta gctgcatgtg tgaccctgga gtgggtcatg ggcgagggac
> gggccgcagg 10680
> tgaagaatcc ctggatggag ctgccaggcc cctggggctg agaattgaag
> ctggctggtg 10740
> ttttaggttg aacgtcagga gtcttgtatc tcaccccagg cctctggcct
> cagtttcccc 10800
> atctgtacag tgggactgtt tgtgcagcca gcccggccag cttcatttgc
> catgatgaga 10860
> atttatctga ggggcgggag aggaaagccc tccctataaa ggtacaggcg
> ctaaaatgtc 10920
> gtgacctcag tggtccacct aaaagtcgtt ctggcctggg tcatcgcctg
> tcgtgctatg 10980
> cctttgtcca gccccttctg gttgggagtt aagtggcacc tgtgcggcac
> gtggtggggc 11040
> tgtggcccag ccctgctcct tgtggaaggt ctgtttcctg ggctgcctag
> agacttggct 11100
> tgaagcccta gcgtggcttc ctggcagttg ggacacacac agccccaaca
> catggagccg 11160
> gttctccatc cagaagcccc cgggcagtaa gcagccactt caggctgcgt
> gggacttgcc 11220
> cgtggtggag cctaggagag gcccctggct gggcgtggcg ttccagattt
> cacggctgct 11280
> ctttcccact gacagtgtgg tgtggacgct gccaagggag tctggagccc
> cagagggtgg 11340
> aggtgcagga cttccaggag cgtccgtcgc actccacccg agggcgagca
> cctcagtggc 11400
> cgcagtgggt ggatgcatgc tgtgccaggc tgatggctgg ccccggggca
> caggcctgag 11460
> cgggagagga tggaggggag ggatcaatgg tccaggtccc cctggccacc
> cagcattcat 11520
> cctcagtcat gcacggccca aggcttcgac agccattgat catggaaggc
> caggttcacc 11580
> tcaagggctg ccacatggag aggttaagtc tgaaaaggct gaaaaggcag
> ggttcaaagg 11640
> gcctcctgtc cagatcagat ggcactgaat tccccaggga gctggcacgg
> ccagtgggaa 11700
> caggcggtga aggcgctgtt ggacatgggg acgggcaggg ggtgtgcagg
> gtgggcgggc 11760
> aagcatctgg tgtcttgtgg ctccagagac caggtgggag gtggaggcat
> ttggtcctga 11820
> gtgtcctgac aggtgatggc agctcccaca tctcgctcag gttcagagga
> ggcagcatgg 11880
> gccgagggac agtttttggc ttagtcttgc tcttataaag gcttccgggt
> catggcacct 11940
> gggaaggggc cctcgctgca ggccccttct aaggaccccc tcttcccacc
> tctccccacc 12000
> ctctctctct caggactgcc tccgggcctg ccaggagcag atcgaagccc
> tgctggagtc 12060
> aagcctgcgc caggcccagc agaacatgga ccccaaggcc gccgaggagg
> aggaagagga 12120
> ggaggaggag gtggacctgg cttgcacacc caccgacgtg cgggacgtgg
> acatctgagg 12180
> gcgccaggca ggcgggcgcc accgccaccc gcagcgaggg cggagccggc
> cccaggtgct 12240
> cccctgacag tccctcctct ccggagcatt ttgataccag aagggaaagc
> ttcattctcc 12300
> ttgttgttgg ttgttttttc ctttgctctt tcccccttcc atctctgact
> taagcaaaag 12360
> aaaaagatta cccaaaaact gtctttaaaa gagagagaga gaaaaaaaaa
> atagtatttg 12420
> cataaccctg agcggtgggg gaggagggtt gtgctacaga tgatagagga
> ttttataccc 12480
> caataatcaa ctcgttttta tattaatgta cttgtttctc tgttgtaaga
> ataggcatta 12540
> acacaaagga ggcgtctcgg gagaggatta ggttccatcc tttacgtgtt
> taaaaaaaag 12600
> cataaaaaca ttttaaaaac atagaaaaat tcagcaaacc atttttaaag
> tagaagaggg 12660
> ttttaggtag aaaaacatat tcttgtgctt ttcctgataa agcacagctg
> tagtggggtt 12720
> ctaggcatct ctgtactttg cttgctcata tgcatgtagt cactttataa
> gtcattgtat 12780
> gttattatat tccgtaggta gatgtgtaac ctcttcacct tattcatggc
> tgaagtcacc 12840
> tcttggttac agtagcgtag cgtgcccgtg tgcatgtcct ttgcgcctgt
> gaccaccacc 12900
> ccaacaaacc atccagtgac aaaccatcca gtggaggttt gtcgggcacc
> agccagcgta 12960
> gcagggtcgg gaaaggccac ctgtcccact cctacgatac gctactataa
> agagaagacg 13020
> aaatagtgac ataatatatt ctatttttat actcttccta tttttgtagt
> gacctgttta 13080
> tgagatgctg gttttctacc caacggccct gcagccagct cacgtccagg
> ttcaacccac 13140
> agctacttgg tttgtgttct tcttcatatt ctaaaaccat tccatttcca
> agcactttca 13200
> gtccaatagg tgtaggaaat agcgctgttt ttgttgtgtg tgcagggagg
> gcagttttct 13260
> aatggaatgg tttgggaata tccatgtact tgtttgcaag caggactttg
> aggcaagtgt 13320
> gggccactgt ggtggcagtg gaggtggggt gtttgggagg ctgcgtgcca
> gtcaagaaga 13380
> aaaaggtttg cattctcaca ttgccaggat gataagttcc tttccttttc
> tttaaagaag 13440
> ttgaagttta ggaatccttt ggtgccaact ggtgtttgaa agtagggacc
> tcagaggttt 13500
> acctagagaa caggtggttt ttaagggtta tcttagatgt ttcacaccgg
> aaggttttta 13560
> aacactaaaa tatataattt atagttaagg ctaaaaagta tatttattgc
> agaggatgtt 13620
> cataaggcca gtatgattta taaatgcaat ctccccttga tttaaacaca
> cagatacaca 13680
> cacacacaca cacacacaca aaccttctgc ctttgatgtt acagatttaa
> tacagtttat 13740
> ttttaaagat agatcctttt ataggtgaga aaaaaacaat ctggaagaaa
> aaaaccacac 13800
> aaagacattg attcagcctg tttggcgttt cccagagtca tctgattgga
> caggcatggg 13860
> tgcaaggaaa attagggtac tcaacctaag ttcggttccg atgaattctt
> atcccctgcc 13920
> ccttccttta aaaaacttag tgacaaaata gacaatttgc acatcttggc
> tatgtaattc 13980
> ttgtaatttt tatttaggaa gtgttgaagg gaggtggcaa gagtgtggag
> gctgacgtgt 14040
> gagggaggac aggcgggagg aggtgtgagg aggaggctcc cgaggggaag
> gggcggtgcc 14100
> cacaccgggg acaggccgca gctccatttt cttattgcgc tgctaccgtt
> gacttccagg 14160
> cacggtttgg aaatattcac atcgcttctg tgtatctctt tcacattgtt
> tgctgctatt 14220
> ggaggatcag ttttttgttt tacaatgtca tatactgcca tgtactagtt
> ttagttttct 14280
> cttagaacat tgtattacag atgccttttt tgtagttttt ttttttttta
> tgtgatcaat 14340
> tttgacttaa tgtgattact gctctattcc aaaaaggttg ctgtttcaca
> atacctcatg 14400
> cttcacttag ccatggtgga cccagcgggc aggttctgcc tgctttggcg
> ggcagacacg 14460
> cgggcgcgat cccacacagg ctggcggggg ccggccccga ggccgcgtgc
> gtgagaaccg 14520
> cgccggtgtc cccagagacc aggctgtgtc cctcttctct tccctgcgcc
> tgtgatgctg 14580
> ggcacttcat ctgatcgggg gcgtagcatc atagtagttt ttacagctgt
> gttattcttt 14640
> gcgtgtagct atggaagttg cataattatt attattatta ttataacaag
> tgtgtcttac 14700
> gtgccaccac ggcgttgtac ctgtaggact ctcattcggg atgattggaa
> tagcttctgg 14760
> aatttgttca agttttgggt atgtttaatc tgttatgtac tagtgttctg
> tttgttattg 14820
> ttttgttaat tacaccataa tgctaattta aagagactcc aaatctcaat
> gaagccagct 14880
> cacagtgctg tgtgccccgg tcacctagca agctgccgaa ccaaaagaat
> ttgcaccccg 14940
> ctgcgggccc acgtggttgg ggccctgccc tggcagggtc atcctgtgct
> cggaggccat 15000
> ctcgggcaca ggcccacccc gccccacccc tccagaacac ggctcacgct
> tacctcaacc 15060
> atcctggctg cggcgtctgt ctgaaccacg cgggggcctt gagggacgct
> ttgtctgtcg 15120
> tgatggggca agggcacaag tcctggatgt tgtgtgtatc gagaggccaa
> aggctggtgg 15180
> caagtgcacg gggcacagcg gagtctgtcc tgtgacgcgc aagtctgagg
> gtctgggcgg 15240
> cgggcggctg ggtctgtgca tttctggttg caccgcggcg cttcccagca
> ccaacatgta 15300
> accggcatgt ttccagcaga agacaaaaag acaaacatga aagtctagaa
> ataaaactgg 15360
> taaaacccca gcgtggtgcc tgcctctttg cttcctgggc tggccgtgag
> ccagggacgc 15420
> gtgtcctggt gccctagaac cagggcaggg tggcaggctt ggcggatgtg
> ggaggccgca 15480
> gcctgtcctg tgcgctgtgg gaagttcagc agcatcctga cctccatccc
> cgggatgaca 15540
> gtcacgccac ccgccgtgac aaccaagaat gtctcctgac actgccacat
> ccccgggggt 15600
> ggggacagaa tccagccagg agcaggcaca cccctcccaa ctgggaggaa
> gccctcagca 15660
> caggtgtgtg aggtgggagg cggtgtcctg tccccgggag gctccagaga
> ataatttgca 15720
> ggctgcctgg ctgggtgagc ccacctccaa ccacgcgaga caacagctcc
> ggcctgggtg 15780
> acgtgagcgg tgcccattga tggggaacat cttccccctc ttccttgccc
> caccagtttg 15840
> tcttcccggg ttatttgcag ataggaaaat aaataaagcc ggcattcgtt
> aaccctcttc 15900
> tggcgcaaac tgctgtttgc tctggatgaa tcatggtcct ttggcgacgc
> caggctccgg 15960
> gagagcaaag caccgtgtca gggccatgat ccggggtggc ctttcactgg
> gatcgtgggg 16020
> acctggaggc cgccttatag gacacccatg acgcccacct ctggatttca
> ggtgcacgtg 16080
> actggactta acttcaaacc ccagggtgga ggcaggtagt gggagtgccc
> tgggaaggtg 16140
> tcctcggacc ttggtcactg ctcctgaacc catctgtgag gctggtttgt
> cctcatccca 16200
> agctaagtgg aagctcaggt cccaagccac cgatgggtgc tacttgtcag
> ctgcaggttg 16260
> aatctccgtg gcctttatga agcacctgct gtctaccctt cctgccttgt
> agagcactcc 16320
> tcccagggct caacagtggg gccggggtgg tcggtgtgtt ggctccacag
> gcgcctgccc 16380
> tgggaggaag gtggggtgtg gagggaaacg cttggcccct gtaggtctcc
> accagcctct 16440
> cccctgaggg tgggggctcc gggagccttc ctcgagggag tcctatattg
> agtgggtggg 16500
> ggagcctgca aggtgcccct gacaggtcac atcagaaaga gctcaaggga
> cagtcggagc 16560
> cagaggtgac actggtggcc actcgggtgg ctcacaaggc ccagctcctc
> cttgctcctg 16620
> ggcaaattac tctgaaggca gggaccaggt ctgcaccatt gcggctctcc
> agttccaggc 16680
> aatggccagg tcctgtgtca gggctggggt cctagggaag ccatgtcccc
> acccccggcc 16740
> tgcagctggg tttacattca tcccccgaga gcacatgggt gtagcaggag
> gcctgtgcag 16800
> agagctccga ccatcgcaca gggcaccttt ggttgtttca cggagcaggc
> aagggagcca 16860
> tcggatcctg ttaggtttga gcaaggatgt ggggaagaag ctggagagcc
> actttgccat 16920
> gcagggagag gagcacatgg gtctagggat ctactttagt gtttggaagg
> ttttttaaga 16980
> tgaaagaggg atgtgtaggc tgataggtct ggcagagcca aaaggcagcg
> acatgtctac 17040
> tgggagagat ggagctgagc gcggggctca ggcagggtgg cagggcaggg
> ccggggccct 17100
> gggtgggtca ggtgggttca cagccaagtg tgtagagagg gcttgggccc
> agagtgaagc 17160
> agttgcaagc tctcccacaa cccattctct ctgtctcggc atctgtggca
> tcccgtgatg 17220
> ggtgggtctg tacacacccc acccctggct gtgccacaat gggggtgtct
> gtacacccct 17280
> cacccctggc tgtgccacga tgggggggtc tgtacacccc ccatccctgg
> ctgtgccacg 17340
> atggggggtt ctgtacatcc cccagtgatg
> 17370
> //
>
>
>
> ##################################
> ccnd1_mus.embl
> ###################################
>
>
> ID Chromosome 7 135379270 to 135391916 ENSEMBL; DNA; HUM; 12647 BP.
> XX
> AC Chromosome 7 135379270 to 135391916;
> XX
> SV NO_SV_NUMBER
> XX
> DE Reannotated sequence via Ensembl
> XX
> KW HTG.
> XX
> OS Mus musculus (House mouse)
> OC Eukaryota; Metazoa; Chordata; Vertebrata; Mammalia;
> Eutheria; Rodentia;
> OC Sciurognath; Muridae; Murinae; Mus.
> XX
> CC This sequence was reannotated via the Ensembl system.
> Please visit the
> CC Ensembl web site, http://www.ensembl.org/ for more information.
> XX
> CC The reference, comment, description and feature table of
> the original
> CC entry can be found in the DDBJ/EMBL/GenBank database
> with the identical
> CC accession number.
> XX
> CC The /gene indicates a unique id for a gene, /cds a
> unique id for a
> CC translation and a /exon a unique id for an exon. These
> ids are maintained
> CC wherever possible between versions. For more information
> on how to
> CC interpret the feature table, please visit
> CC http://www.ensembl.org/Docs/embl.html.
> XX
> CC All the exons and transcripts in Ensembl are confirmed
> by similarity to
> CC either protein or cDNA sequences.
> XX
> CC In unfinished, rough draft DNA sequence gene structures can cross
> CC fragments and, in these cases, the order and orientation
> of the fragments
> CC is likely to be different from the order in the the International
> CC Nucleotide Sequence Databases DDBJ/EMBL/GenBank.
> XX
> FH Key Location/Qualifiers
> FH
> FT source 1..12647
> FT /classification="musculus, Mus, Murinae, Muridae,
> FT Sciurognath, Rodentia, Eutheria,
> Mammalia, Vertebrata,
> FT Chordata, Metazoa, Eukaryota"
> FT /organism="House mouse"
> FT CDS
> join(complement(10313..10510),complement(8704..8919),
> FT complement(8111..8270),complement(4832..4980),
> FT complement(3423..3587))
> FT /transcript="7.135000001-135793178.382692.389779"
> FT
> /translation="MEHQLLCCEVETIRRAYPDTNLLNDRVLRAMLKTEETCAPSVSYF
> FT
> KCVQKEIVPSMRKIVATWMLEVCEEQKCEEEVFPLAMNYLDRFLSLEPLKKSRLQLLGA
> FT
> TCMFVASKMKETIPLTAEKLCIYTDNSIRPEELLQMELLLVNKLKWNLAAMTPHDFIEH
> FT
> FLSKMPEADENKQTIRKHAQTFVALCATDVKFISNPPSMVAAGSVVAAMQGLNLGSPNN
> FT
> FLSCYRTTHFLSRVIKCDPDCLRACQEQIEALLESSLRQAQQNVDPKATEEEGEVEEEA
> FT GLACTPTDVRDVDI"
> FT CDS
> join(complement(10313..10510),complement(8704..8919),
> FT complement(8111..8270),complement(4832..4980),
> FT complement(3426..3587))
> FT /gene="ENSMUSG00000031071"
> FT /cds="ENSMUSP00000033387"
> FT /transcript="ENSMUST00000033387"
> FT /db_xref="RefSeq:NM_007631"
> FT /db_xref="LocusLink:12443"
> FT /db_xref="SWISSPROT:P25322"
> FT /db_xref="EMBL:M64403"
> FT /db_xref="protein_id:AAA37502"
> FT /db_xref="EMBL:S78355"
> FT /db_xref="protein_id:AAB34495"
> FT /db_xref="MarkerSymbol:MGI:88313"
> FT
> /translation="MEHQLLCCEVETIRRAYPDTNLLNDRVLRAMLKTEETCAPSVSYF
> FT
> KCVQKEIVPSMRKIVATWMLEVCEEQKCEEEVFPLAMNYLDRFLSLEPLKKSRLQLLGA
> FT
> TCMFVASKMKETIPLTAEKLCIYTDNSIRPEELLQMELLLVNKLKWNLAAMTPHDFIEH
> FT
> FLSKMPEADENKQTIRKHAQTFVALCATDVKFISNPPSMVAAGSVVAAMQGLNLGSPNN
> FT
> FLSCYRTTHFLSRVIKCDPDCLRACQEQIEALLESSLRQAQQNVDPKATEEEGEVEEEA
> FT GLACTPTDVRDVDI"
> FT CDS join(12061..12216,15972..16065,33387..33547)
> FT /transcript="7.135000001-135793178.391330.412816"
> FT
> /translation="KKLVKERTASRRHTRRKPLTFRAHAFRSPSPDIGVPKVRTLRDSD
> FT
> STVAPKVMAPLPEGIGQKLLVGLRGSMAPVCPQCTSRSGILRNAIPIIETLTILEQTFG
> FT ALESQAPWCFEACSVFLGILIQPAYILFEFLF"
> FT exon complement(2001..3587)
> FT /exon_id="ENSMUSE00000266344"
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(3423..3587)
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(4832..4980)
> FT /exon_id="ENSMUSE00000205949"
> FT /start_phase=1
> FT /end_phase=1
> FT exon complement(4832..4980)
> FT /exon_id=""
> FT /start_phase=1
> FT /end_phase=1
> FT exon complement(8111..8270)
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(8111..8270)
> FT /exon_id="ENSMUSE00000205951"
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(8704..8919)
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(8704..8919)
> FT /exon_id="ENSMUSE00000205954"
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(10313..10510)
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon complement(10313..10647)
> FT /exon_id="ENSMUSE00000266376"
> FT /start_phase=0
> FT /end_phase=0
> FT exon 12061..12216
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 15972..16065
> FT /exon_id=""
> FT /start_phase=0
> FT /end_phase=0
> FT exon 33387..33547
> FT /exon_id=""
> FT /start_phase=1
> FT /end_phase=1
> XX
> SQ Sequence 12647 BP; 2763 A; 3160 C; 3111 G; 2614 T; 999 other;
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 60
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 120
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 180
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 240
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 300
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 360
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 420
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 480
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 540
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 600
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 660
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 720
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 780
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 840
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnc 900
> gggacaggcc acggctcctc tcatggcgct gctaccgatg actcccagga
> tcccagacgt 960
> tcagaaccag attctcattg ctttgtatct ttcacgttgt tttcgctgct
> attggagggt 1020
> cagttttgtt ttgttttgtt ttacaatgtc agactgccat gttcaagttt
> taatttcctc 1080
> atagagtgta tttacagatg cccttttttg tacttttttt tttaattgtg
> atctattttg 1140
> gcttaatgtg attaccgctg tattccaaaa aaaaaaaaaa aacaggttcc
> tgttcacaat 1200
> acctcatgta tcatctagcc atgcacgagc ctggcaggca ggtgggcggt
> ctgcctccag 1260
> ggatcctggg accctgatgg cgatcgtcct gtcatgctgg gcccttcatt
> tgatctggga 1320
> catagcatca cagcagtcag ggcacctgga ttgttctgtt atcgatattg
> tttcttgtag 1380
> cggcctgttg tgcatgccac catgctgctg gcccgggggg atttgctctg
> agtctccggt 1440
> gcatcattta atctgttagg ttctagtgtt ccgtcttgtt ttgtgttaat
> tacagcattg 1500
> tgctaatgta aagactctgc ctttgcgaag ccagctgcag tgctgtaggc
> ccccaagttc 1560
> cctagcaagc tgccaaacca aaacgggcac caccagctca gctgaggcat
> cccagccagg 1620
> caggaccctt gagggccgct gtatccatgg tgatggggtg aggttttggc
> caaaaggcca 1680
> aagactggtg gtgggtccac ggaatctgcc ctgtgacatg aaaggctttg
> aggggctctg 1740
> gctggtggcc aggttggctt tttgtatttc tggttgacac accatggcgc
> ttcccagcac 1800
> agacatgtga ccagcatggt ccaggaaaaa aaaaaagaca aaaaatctag
> aaaataaaat 1860
> tggtaaaatc tcagctcact gttgtctgtg ttttctggga acaggagtgg
> ggtgatgcca 1920
> cagccatgtg gggtggcatc tctggccctg gcaccaacct gggattccca
> aggggaaggg 1980
> tgtatgcaag tgacagccaa cccccccccc gttgcccaat gaaagaccaa
> tctctcagac 2040
> atggccctaa accttctcca gcagggcaga ggtgtgcgtt tgaatcaagg
> gagatcacat 2100
> tgctttgagt cacactggtc atgggcagcc tttcccataa atactcttct
> gtagccttaa 2160
> gtataaatta gacattttag tgtttaaaag cctcctgtgt gagacttaag
> agttgtcccc 2220
> aatctccttg tccaggtaat gccatcatgg ttcctacttc caaacaccag
> ctggcaccaa 2280
> aggatccctt caactttgca ggacagatcc cggtggtgcg agaacagagt
> tctctcttct 2340
> tgacccaaca aaacttccca agcacctcat actaccagcc ctacaacctg
> ttgtacagcc 2400
> atctgaatgc gtgtgtggac atccccatcc attccattag aacccctccc
> cccaacacac 2460
> accagcaaca ctgcctccag ctagctgacc aaaagtgctt tgaaatggaa
> tggttttgga 2520
> acatagagaa agaaaacaca acaaaacaaa ccacacacaa aagctgtggg
> tttaacctga 2580
> tgtgggcagg ctgcagggcc ttcaggcaaa aaccagcatc tctctaaaca
> gagctacaaa 2640
> ataggaaggt tacaaaaata gaatatatta tgtcactatt tcatccctac
> cgctgtgtgg 2700
> ctggtgctgt gaacctgcag gttgttgggg ggagagggca cattaagctg
> agccctgggg 2760
> aggggggttg tgccacacgc catgagacca ctagaggtcg cactgaccat
> ctataaggtt 2820
> aagaggtctg cccacccctg ggataaagca caggtatcag gtaacagggc
> tgtaggcact 2880
> gagcaagcca gggagtgagg ctgagggcct aagctgtggc ttttcagcaa
> agcagagtac 2940
> atttctcaac ctaaaaactc ttctacttta acaatggttt gctgtatttt
> ttttccttcc 3000
> tttttaaagc aatgtttctt aagctttttt ttaaaaacat ataaaagaat
> caaaactcct 3060
> tctcaagact tcccctgtgt taatcctact cttagaacag acaagcacat
> taatagaaaa 3120
> ctagttgatt actggggtac agaattctat ttttgtagca ccccctggct
> ccctactctc 3180
> agggtgatgc agattctatc tctctctctc tctctttaag acagctttca
> gatatttttt 3240
> tctcttttgc ttaaatcaga tagaaaggag aaagattaag gaaaaacaac
> caacaaagag 3300
> aacaaaactt cctcttctgg taacaacatg tccccaaggg ggacgtcgtc
> aggagcaccc 3360
> gggctggctc cttcctcttt gcgggtgcca ctacttggtg gctcccgcct
> gcccggtggc 3420
> cctcagatgt ccacatctcg cacgtcggtg ggcgtgcagg ccagaccagc
> ctcttcctcc 3480
> acttccccct cctcctcagt ggccttgggg tcgacgttct gctgggcctg
> gcgcaggctt 3540
> gactccagaa gggcttcaat ctgttcctgg caggcacgga ggcagtcctg
> gagaggcggg 3600
> cggagatgga gggggtcctt gtttagccag aggccggtgg ccagggcctt
> gacctgcgag 3660
> gcaggctttg ctaagagcaa ggcttgagga aactccccaa gccaccccta
> aatccagtga 3720
> gagctgccag agactagaaa ggtactgcta ctctctaggt tcctcctccc
> aggctgtacc 3780
> ctgaacttcc ccacaaggag cgtctatctc tcataacttt ttgccttggg
> tggcctctga 3840
> gagaggacag ctccagcaca ggtagggtct ggaaagcatt tgcccaggaa
> taagactcat 3900
> gaactgaagc cctgcctagg caggctgaag ctagggcact gtctatattc
> ctgaagccag 3960
> gaggattcct ggggaacacc tgggaagccc ccacattcaa aacttgtgta
> ttgttccccc 4020
> tttgtttgat gaacaccatg gctagaggat ggaagggcaa tggccccatt
> tcacagatgg 4080
> agaaagtgag gcccaagtag cctagtatgg acccacagac acccttccct
> ctctgaagcc 4140
> tgtgggctcc aacctgccca cataccctat tctcagttct caggagcccc
> agacacttag 4200
> gggaaggggg ctgcatgctc tacaattgag atatgtagga atggggcacg
> gggtgtccct 4260
> gtggctgatg ccaggcaggg cagaacagct gtcacacctg gaagccttcc
> acatgccagt 4320
> ggctctgacc tccattgaac ccccaactga agttcaagcc agaactggac
> ttttgcctgc 4380
> ctatgccaac ttgtgatgcc cttgaggaaa cattcacggc cagagcccca
> agcatgatca 4440
> gctgtactcc ttggcagaaa ggtacagtta gacgtagcac agtctgcctg
> atgcaaaccc 4500
> ctctctcaca caaagaaagt tctagaatgt gtaggtctat accagagtcc
> tttatggatt 4560
> caatccccaa cgtctgcact gagggagata tggtcctatg ctggctgggc
> cagaccgggc 4620
> ttcacatgcc actacccata cacattgcag gggcactgtc accacaccac
> agccctctat 4680
> gcctttggga cccatgactc tgggaagaag ctcgtgggga cccagatttg
> gcacctctca 4740
> gctccagccc agtcacacag tgccttggca ctgcccacca aggttcccag
> gcccttgcca 4800
> gctgtagcct cctttgatga gacttactca ccgggtcaca cttgatgact
> ctggaaagaa 4860
> agtgcgttgt gcggtagcag gagaggaagt tgttggggct gcccaggttc
> aggccttgca 4920
> tcgcagccac cacgctccca gcagctacca tggagggtgg gttggaaatg
> aacttcacat 4980
> ctacagcgag agggacagag acatgcctat gattcagctc tcacccacag
> cctcggggcc 5040
> ggtcttcaag cccttccaat ggatgcccac atcaactcta gctcccttgg
> agacacctaa 5100
> gtggcactag gtgcacacag tgctgcatct ctagacccca gcacatcgga
> ggattacggt 5160
> gtccaggcca gagccttgat gctacctgtc aataaggcct tggctcaaac
> tttaaaatgc 5220
> tgcctggcat acacaccggg ttcaatcccc agcacttcca gcaaacatca
> aggaggggtt 5280
> ggctatgtcg ccctaagagg atggaatcta tgaaggtgtg gtgctttcac
> atctgcatcc 5340
> cagtatttgc gcggctgagg tgggaggcag ggatctctgt ctcttccaga
> gtcagcctgg 5400
> cagcacccca aggactagag acacatctca gacacaggag aaagctgtag
> agatgacagg 5460
> cccaggttcc tcagaggcca ggctagactc cacagaaggg agttagcctt
> cccggcttac 5520
> tgacccaggt tggggaagac atgtgacccc agtccaggaa aacccagtag
> gaatagactc 5580
> agagggagct gggaatagcc acctcagatc tccaggccaa ggtccctcat
> gaggggccct 5640
> ggagtcttag ggactctcat atgtacatgt cccaactggg aagtagtggg
> gtgtcatttt 5700
> aggacttcat ctggggtcac caactccaca cctctcaggt ccactggact
> cagcatcctg 5760
> cagttgtatg tcacaccgat ggaatgcctc tccgcataaa cccaccaaag
> atgagtctga 5820
> aaactgcttc cctgtcacgt ccaacaaaga aatagtgaca gggacagctg
> gtcacttggc 5880
> ctgcgaatca gggcttccag gccagaggag gacacaagaa tgaacagatg
> tcagctccca 5940
> tcaatcatct tgtgtgtcac ccagagcctg ggatcaagga ggctggaagg
> aatgggatca 6000
> ggaaggcaga gagttgctgt agcgaggtgg ccctctgggg accaccctga
> gagaacatac 6060
> agctttggct gttaaaggga tccagcagac aagcaatacc tccaggtggc
> accaaggaca 6120
> gttcatgcta gtgagtccag tgaaccaaaa cctggctatg gagtcaagtt
> caaatcacaa 6180
> acagcccttt tacagggagt gagactccag ccagcaggac caggagtccc
> cagccccttc 6240
> cagagcctgt ttttctccct cccacgggca gtagcaagat ctgttatgtc
> ctataaggat 6300
> ccctagctac taacatctgt gtcttcacac tggttgctga tgacataggt
> gtaccctggc 6360
> tcagtggcat gtctgacacc tacatacttc attaagttca gtttcccact
> gtctgtatcc 6420
> atccacatag cgctctggcc acctgtgtta acaaccccaa ccctagctag
> cacattgttc 6480
> aactgcagtc aggaatccag gttggggggg cggtttgcag aagggaagtt
> atgccttccc 6540
> atctcctata gggggaaccc ctgtcacaca gacgacagat ctcctggcct
> tgatgtcagc 6600
> atctttccat tcacggtgat gtcacactgg ccttttccca aagtggttcc
> tgtgtgacaa 6660
> aggagctggt tgctcctaca gagaaacaaa tgctccagac cacagaacac
> tggcctaccg 6720
> caggtcacca agctgtcctt tacccccacc tgtattccag gggcacagac
> aaaggcagta 6780
> aggagagctg gaagagctga gtccagctgt acccagatat agcgcttact
> ggttactctg 6840
> ctggcatcta ggggtcccgg gaggggcaca ctactgccag aactatcctg
> ggggaggttg 6900
> tgcctctcca tccccaagtt atctctggga aactctgctg cctctctgaa
> ggcactttca 6960
> gctccataaa aggctgcctg gagcttccaa aggctccgct ctcctcagag
> aaaccccagc 7020
> cagctgcctc cctgactgcc agcagggcca cccccacccc ttagatcctt
> ccagcctact 7080
> gatggccttg ctactcccat ctgccacccc cttgtttggt acccttgggg
> cgggctcagc 7140
> actctcggca ggtagttggc aagctagaat ttccaaattc tcaccaccag
> cagcttagaa 7200
> acactggcta tggggctgtc agacataaga agatgtcact gagacagcgg
> gggtgaggtg 7260
> gaggttgggg gtggggggag atactctacc ggcccttctg acatcacagg
> ccaaacagct 7320
> atagcaactg aggtctgatc agggccacag aggaaggggg tcctcaacag
> gtgggagaac 7380
> accacacccc ctcaatattg ctaagcctcc tgggtagggt tctcagttag
> aggtgaggag 7440
> cagctgtcag aagtagtact gaaaccactc agaagtagca gggttgtcta
> tgtgtgtggg 7500
> ttcctgcatg gctgtctacc atgacttatg tcagcctaat ctacctgggg
> tctggggcca 7560
> gggagggtcc cttggaggag tcatagggag gttctagttt ctctgtttcc
> agccaaccct 7620
> cccaggactc agccaaaaag tactggcctg cagtacccct ggtggggcct
> ctaagttcct 7680
> gctctgggac tgtactgtat gtagctcgtg gggctgaggc tggaccattt
> gtcccatcca 7740
> tgtaggaacc cctagcaggt ggctcagcct gggagaagct ggtgtgtgtg
> tgtgtgtgtg 7800
> ctctatagct cgagctcctt aagaacagag gcaaggcagg aagcctagga
> aaagaggcaa 7860
> aaacctcttc agaaagcatg ctgtggccgt accacttcct ggcagtggaa
> aggttggcac 7920
> agggaagctg tgtcattctg ttaggtctcc aagatactag ccaagagaat
> gtgccaccca 7980
> gtatcatcta tctctccaac aaactatctg caccctgggc tcagaataga
> tagcaaggaa 8040
> agaggcctta accccagggg gagtggcaag gagcccgctc ttcctgggaa
> gaatgggggt 8100
> ggagccttac ctgtggcaca gagggccaca aaggtctgtg catgcttgcg
> gatggtctgc 8160
> ttgttctcat ccgcctctgg cattttggag aggaagtgtt cgatgaaatc
> gtggggagtc 8220
> atggcggcca ggttccactt gagcttgttc accagaagca gttccatttg
> ctgcagagaa 8280
> tgaggaacac ggtcaaaagg aaggcaggct gattcgagaa ggtctaaaga
> gttgggtcaa 8340
> cccgggaccc catacaggta aacaacaaaa tgtgcctgta gaaatactaa
> ggggacagac 8400
> tgcggacaaa aagaatctcc caccgccagc gccgtggcct gcacgtcgct
> agggggcgca 8460
> caccgcccac atttcagagt ggtttcacca cagacatggt ggggagtcca
> gtacggaaag 8520
> agggcgggca cagaggtaga atgggttggc attctgctca ctgccatttc
> ggtgtcgtgc 8580
> ggtgcggtgc gggtgccttt cgtggggcag gagatctgtc catggatggc
> tggatctcct 8640
> gccatttgca gctcccacca cccagcataa agagggattg tcggtggtgg
> gttacgtggt 8700
> taccagcagc tcctcgggcc ggatagagtt gtcagtgtag atgcacaact
> tctcggcagt 8760
> caagggaatg gtctccttca tcttagaggc cacgaacatg caggtggccc
> ccagcagctg 8820
> caggcggctc ttcttcaagg gctccaggga caggaagcgg tccaggtagt
> tcatggccag 8880
> cgggaagacc tcctcttcgc acttctgctc ctcacagacc tgcggggatc
> cggaccctga 8940
> ccgtcaacgt gggagggggg cgtcgctcga tacgacccca ttcccttctc
> ctagattgca 9000
> cagagaggga ttccggtggc caactccctg ggatgcgctc ctgctccgag
> ggttcccagt 9060
> gcccacccac cctgaaactg aaagggaaag tctttgggca gctggcgggg
> ccggtcgaag 9120
> caatggtggc cacggagacc ccggtcccct gttgaaatta ggaagccaca
> agccagagaa 9180
> ccccaccgcg gggctggcgg ggcgccgggg gctgggcgcc gctcgagcgc
> acgcgagaca 9240
> caatggtggc gtcccaggcc gccgcgctgc atttcccaga cgtcatcttt
> tcaaaaaata 9300
> tttaaaaata tttaaaaaat aactaggtgc ctaaaaacga cctgaaaata
> accatcggac 9360
> gacccttgat ccaagccgag ggtcaggagc tcagtggaca tcggggggtg
> agctccagca 9420
> actcaaactc cgggggcccc ccttttgcag ggtgcgggcg ctgtttgcct
> ctacaagggc 9480
> attatgcctc actttaaaaa ttattttaaa atattttggg acccctgagt
> ctcctgctga 9540
> acatctgaaa atgtgctttt gtccccctcc gcccccagtt tgacctagac
> aggttcctcc 9600
> aaatccagaa ccgcacgcac aacagtcagt ctcattgact agggggccac
> gcgaccttgc 9660
> agcctggcgt tttctctggg ccaggcagcg tgtgcggggt acccaggtcc
> gctgtttgaa 9720
> caaagaatcc cccaccccaa atattcctca cactgtctta tttagtgatt
> cccggccgcc 9780
> acgaggcccc tcgtgcctgt cccagcgccc ctactgccgc tccctgccct
> gcccccaatt 9840
> attaataaac actttttgct ttgcaataaa ggagcaaaga tggcgcataa
> tattggcacg 9900
> agcggccctt gcatacgtgt ccatggatat taatttaaaa atcaaattta
> tgccccctcc 9960
> ccccttgagc tggcgaaacg cgcaggggcg gcccccgccg aggcgctgca
> ggaatggggc 10020
> acaagcgggg gccacagaag ctcagcctgc accctctgtg gtgtcgctga
> caccgtcctc 10080
> cctcgaaggc acccctcccc gactgccctc caccaggaaa aggaattccc
> atcttcccaa 10140
> ctcccaaacc ttcctcgttc aaaaggaccc aatctattta ttttgtgaat
> accgagtcct 10200
> agcaacgcac ccggaaaact ccgtgacgga agggaagaga agggaggaag
> atgttggtga 10260
> tagcaacaag ttgccagggg gtcccaagct agcagcctgt aaagtctctt
> acctccagca 10320
> tccaggtggc cacgattttc cgcatggatg gcacaatctc cttctgcacg
> cacttgaagt 10380
> aagatacgga gggcgcacag gtctcctccg tcttgagcat ggctcgcagc
> acccggtcgt 10440
> tgaggagatt ggtgtcaggg tacgcgcggc ggatggtctc cacttcgcag
> cacaggagct 10500
> ggtgttccat ggctcttccg tctggggagg gctgtggtct cggttgggct
> ccggtggcgg 10560
> ctgctgacgc gcgctgcctc gcgctgtact gccggtctcc ggagcgcgcg
> gagtctgtag 10620
> ctctctgcta ctgcgccgac agccctctgg aggctgcagg actttgcaac
> ttcaacaaaa 10680
> ctcccctgta gtccgtgtga cgttactgtt gttaagcaga gatcaaagcc
> gggcagagaa 10740
> aagggggagg gggagggccg aggcgggtgc agggggaggg ggctcgggag
> cagtgagggg 10800
> ggggggcggc gggggggnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnnnn 10860
> nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
> nnnnnnnagc 10920
> ccaaaccctt ccccctcagc cttcgtagat atgcaaatcg ctcggactgc
> ttctctccaa 10980
> actgggaaag agcggggcaa gggggagggg ggcttctttc cctaagaggg
> tcctccgaga 11040
> cctgtggagg tggggtagtg gcggctctgg acaggaggac agctaggagg
> gaggccaggc 11100
> cacacgcaag ccaaggaaga atgtatggaa ggggagcctg ggggtgcaag
> gacaccctgt 11160
> tcttaaaccg ggagaatggg tgcgtttccg agtacgccac gaggcaccca
> caggcggacc 11220
> cattgcttag aaatcccagc gtccctgtct tctttcaatt tcatcaacac
> gtgtaaaatt 11280
> caggaactgt gtaacacccg aagatgccag acgagcccta agctctcgtg
> caggctccat 11340
> cctggcgcac tggggtggtt gcaaagggcc agaggggtat ctgaacttta
> aaggatcccc 11400
> agcctagcat gcgctcgctc aaaaataaaa taaaaccccg aaaattccag
> caacagctca 11460
> agatggtggc cattatttca tctattcctc ctcgctgggg gggtggggtg
> ggatctgaga 11520
> tttgtcttca tacagtgacc ggtttatcct ggccagggtc ccccagtatc
> cccctcctcc 11580
> actgcaggcg gtttgcccaa gaaaaataaa ccgttgaaga aaaggttgag
> acacgatagg 11640
> ctccttccta atttattttt taattaagaa aagtaaatcg ctgcaagtta
> ttagtcgccc 11700
> ttccaggaac cagaccaccg gaagcttctt gattggcttc gttgagggtg
> gtggtgtttc 11760
> tccaccttta gaattgcagt cttgagttgg gggagggggg gggcggctgg
> tgagaatcaa 11820
> gataaacttg tgagccgata aggtgatatt taaagagaat ctatagatgg
> tcgtggttaa 11880
> ctgaatggac tcctaagttt ataatttata gattcaaata aaatattttt
> agaataaagc 11940
> ggttccacca ccttcgtcca ttatttagtt ccctagtttt ggaaacgttt
> tcacaatacc 12000
> cgacttcaag ctaggcgaac tattcacaca catttcccac ggaatacttt
> cgtgtcccag 12060
> aagaaactgg taaaggagcg cacagcgtcc aggcggcaca caagacgcaa
> acccctcacc 12120
> ttccgagctc atgccttccg ctccccctcc cccgacattg gggtccctaa
> agtccggacc 12180
> cttcgggaca gtgattccac tgtagctccg aaggtggtgg gtggcccggg
> cagagtgccc 12240
> acgcttgctc tggccacagg ccccgccaag cctgggagcc ttagcctaag
> cacgagggcg 12300
> gccaggctgc cttttcattc ataaaatccc taccgggcgg cagtaagcat
> gatttatggg 12360
> gctgcgcgct atcacccgcg gctgagtgcc tgcgaactga gcaggctagc
> ctagccaggg 12420
> tgcgccacca agggcagcgg ggcataaagg ggctcgccca ccctgcagca
> cgggctagcg 12480
> acccgcagcc accccccacc accgccaccg cgggaaagac caggaaactg
> aaatttgcat 12540
> gagccaatcc ggactcagtg ctgttactag gcggaccggg cagatctcca
> cgaggaactt 12600
> cgttgagctc tgcgctcagg gagaggagca aaagtgcaag tccggtt
> 12647
> //
>
>
>
>
>
>
> _______________________________________________
> Biojava-l mailing list - Biojava-l@biojava.org
> http://biojava.org/mailman/listinfo/biojava-l
>
=======================================================================
Attention: The information contained in this message and/or attachments
from AgResearch Limited is intended only for the persons or entities
to which it is addressed and may contain confidential and/or privileged
material. Any review, retransmission, dissemination or other use of, or
taking of any action in reliance upon, this information by persons or
entities other than the intended recipients is prohibited by AgResearch
Limited. If you have received this message in error, please notify the
sender immediately.
=======================================================================