compseq: is U an amino acid
    Gary Williams, Tel 01223 494522 
    gwilliam at hgmp.mrc.ac.uk
       
    Wed Aug 21 08:18:06 UTC 2002
    
    
  
U codes for the amino acid selenocysteine.
See the IUPAC documentation for one-letter amino-acids:
http://www.chem.qmul.ac.uk/iupac/AminoAcid/A2021.html
and
http://www.chem.qmul.ac.uk/iubmb/newsletter/1999/item3.html
regards,
Gary
> "JAEN (Jacob Engelbrecht)" wrote:
> 
> I have been using compseq for protein sequences and wondered why 'U'
> is reported as an amino acid?
> I looked in the code (nucleus/embnmer.c) and found it was specifically
> accounted for, whereas 'X' which in many databases  as unknown is not
> specifically accounted for.
> 
> Would it not make sense to have options which made specific symbols
> part of the alphabet or left them out:
> -leaveout XU or -include BZXU
> 
> Jacob Engelbrecht, Phd
> Insulin Research
> Novo Nordisk
> 6A1.038 Novo Alle
> DK-2880 Bagsvaerd
> Denmark
> tel: +45 4442 4403
> mail: jaen at novonordisk.com
-- 
Gary Williams               Tel: +44 1223 494522  Fax: +44 1223 494512
mailto:G.Williams at hgmp.mrc.ac.uk            http://www.hgmp.mrc.ac.uk/
Bioinformatics,MRC HGMP Resource Centre,Hinxton,Cambridge, CB10 1SB,UK
    
    
More information about the EMBOSS
mailing list