Big Chemical Encyclopedia

Chemical substances, components, reactions, process design ...

Articles Figures Tables About

UniProt Archive

UniProt is a central repository of protein sequence and function created by joining the information contained in Swiss-Prot, TrEMBL, and PIR. UniProt is comprised of three components, each optimized for different uses. The UniProt Knowledgebase (UniProt) is the central access point for extensive curated protein information, including function, classification, and cross-reference. The UniProt Non-redundant Reference (UniRef) databases combine closely related sequences into a single record to speed searches. The UniProt Archive (UniParc) is a comprehensive repository, reflecting the history of all protein sequences. [Pg.16]

The UniProt Archive (UniParc) provides a stable, comprehensive, nonredundant sequence collection by storing the complete body of publicly available protein sequence data. Although most protein sequence data are derived from the translation of DDBJ/EMBL/GenBank sequences, primary protein sequence data are also submitted directly to UniProt or derived from the PDB entries. The Archive also captures protein sequence data from other sources such as Ensemble, International Protein Index (IPI), NCBI-RefSeq, FlyBase, and WormBase. Each protein sequence is assigned to a unique UniParc identifier (UPI ) and represented only once in the Archive. In UniParc, the... [Pg.601]

The Universal Protein Resource (UniProt) provides the scientific community with a centralized, authoritative resource for protein sequences and functional information with three database components. (1) The UniProt Knowledgebase (UniProtKB), produced by a combination of automation and over 25 years of human curation, is the central protein sequence database with accurate, consistent, functional annotation and extensive cross-references. (2) The UniProt Reference Clusters (UniRef) provide clustered sets of sequences from UniProtKB (including splice variants and isoforms) in order to obtain complete coverage of sequence space at several resolutions. The UniRef 100 database is particularly useful for Mass Spec identifications as it exposes known sequence variation and splice-form annotation contained in UniProtKB records. (3) The UniProt Archive (UniParc) provides a stable comprehensive sequence collection by storing the complete body of all publicly available protein sequence data. [Pg.204]


See other pages where UniProt Archive is mentioned: [Pg.95]    [Pg.206]   
See also in sourсe #XX -- [ Pg.601 ]

See also in sourсe #XX -- [ Pg.204 ]




SEARCH



Archival

Archiving

UniProt

© 2024 chempedia.info