[EMBOSS] Counting the number of sequences in a file

andrew warry (BITS) andrew.warry at bbsrc.ac.uk
Wed Jul 21 05:28:48 EDT 2010


Watch out for the infoseq header line you should use -noheading to avoid a +1 to your total

	infoseq -sformat=genbank gbvrt1.seq -noheading -auto | wc -l



Andrew

-----Original Message-----
From: emboss-bounces at lists.open-bio.org [mailto:emboss-bounces at lists.open-bio.org] On Behalf Of Mahmut Uludag
Sent: 20 July 2010 21:57
To: Peter; Peter Rice
Cc: emboss at emboss.open-bio.org
Subject: Re: [EMBOSS] Counting the number of sequences in a file


>> $ seqret -filter -sformat=genbank gbvrt1.seq | grep -c '^>'

infoseq prints a separate line for each sequence, following command line may 
also work.

  > $ infoseq -filter -sformat=genbank gbvrt1.seq | wc -l


Mahmut
 

_______________________________________________
EMBOSS mailing list
EMBOSS at lists.open-bio.org
http://lists.open-bio.org/mailman/listinfo/emboss

-- 
Disclaimer: This e-mail and any attachments are confidential and intended solely for the use of the recipient(s) to whom they are addressed. If you have received it in error, please destroy all copies and inform the sender. This email and any attachments are believed to be free from viruses but BBSRC accepts no liability in connection therewith. 




More information about the EMBOSS mailing list