[EMBOSS] Counting the number of sequences in a file
andrew warry (BITS)
andrew.warry at bbsrc.ac.uk
Wed Jul 21 05:28:48 EDT 2010
Watch out for the infoseq header line you should use -noheading to avoid a +1 to your total
infoseq -sformat=genbank gbvrt1.seq -noheading -auto | wc -l
Andrew
-----Original Message-----
From: emboss-bounces at lists.open-bio.org [mailto:emboss-bounces at lists.open-bio.org] On Behalf Of Mahmut Uludag
Sent: 20 July 2010 21:57
To: Peter; Peter Rice
Cc: emboss at emboss.open-bio.org
Subject: Re: [EMBOSS] Counting the number of sequences in a file
>> $ seqret -filter -sformat=genbank gbvrt1.seq | grep -c '^>'
infoseq prints a separate line for each sequence, following command line may
also work.
> $ infoseq -filter -sformat=genbank gbvrt1.seq | wc -l
Mahmut
_______________________________________________
EMBOSS mailing list
EMBOSS at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/emboss
--
Disclaimer: This e-mail and any attachments are confidential and intended solely for the use of the recipient(s) to whom they are addressed. If you have received it in error, please destroy all copies and inform the sender. This email and any attachments are believed to be free from viruses but BBSRC accepts no liability in connection therewith.
More information about the EMBOSS
mailing list