[Bioperl-l] getting non-seq data from NT_ files

Heikki Lehvaslaiho heikki@ebi.ac.uk
Tue, 12 Feb 2002 17:32:16 +0000


Noah et al,

I was confused. RefSeq documentation (at places) still claims that NT_ files
are part of the RefSeq database. There is pointer at the NCBI FTP server to
genomes section where these files are.

	ftp://ftp.ncbi.nih.gov/genomes/H_sapiens/

There is one NT_ file per human chromosome. That explains why these are not
part of any common database distribution. Each file is megabytes long, so
there is no simple way of displaying them.

If you need them, sequence in various formats or without sequences, they are
all there.

As an exercise, one could try to download one of them try the genbank parser
on them. Make sure you have machine with lots of memory!

	-Heikki