Sequence collection searching

The identification of peptides and proteins using MS information can be done in three different fashions (i) de novo sequencing, (ii) library searching, and (iii) sequence collection searching. [Pg.213]

The second postprocessing step is the automated enhancement of the TrEMBL annotation to bring TrEMBL entries closer to SWISS-PROT standard. There is an increasing need for reliable automatic functional annotation to cope with the rapidly increasing amount of sequence data. Most of the current approaches are still based on sequence similarity searches against known proteins. Some groups try to collect the results of different prediction tools in a simple way, e.g., PEDANT (Frishman and Mewes, 1997) or GeneQuiz (Scharf et al., 1994). However, several pitfalls of these methods have been reported (Bork and Koonin, 1998). [Pg.57]

Collections of tools for DNA and Protein sequence analysis, searching and presentation. Supports restriction enzyme mappings, open reading frames, primer design, sequence alignment, property calculations. [Pg.104]

A set of amino acid sequences called reference sequences Is collected from the NBRF database for the proteins that have the same function. For the 1-th segment of the target protein, similarity search Is done against one of the reference sequences. When the similarity score of the best local alignment Is above a certain level (maxd score), the similarity score is saved. This step Is repeated against the other one of the reference sequences until all the reference sequences are searched In pairs by the i-th segment. [Pg.114]

The UniProt Reference databases (UniRel) provide nonredundant data collections based on the UniProtKB and UniParc, in order to obtain complete coverage of sequence space at several resolutions. UniRef databases (sequence collections clustered by sequence identity, for performing faster homology searches) are created as representative protein sequence databases witli high information content. [Pg.602]

In the bioinformatics realm, SRS (Sequence Retrieval System) [2] is a popular system, which uses a centralized collection of data resources primarily in flat text file form and, more recently, handles XML (Extensible Markup Language) files as well. Data resources are treated in a federated manner since each is maintained in its original form. However, SRS contains a large number of cross-references between corresponding fields in various data sources, so that keyword searches can be done across them. SRS thus performs more structured searches across the information than what a simple text search provides (such as web indexes perform, for example). Even though the data model implicit in the cross-reference tables is not very deep, SRS provides a useful way for users to browse and do simple queries across a large number of data sources as well as to integrate results from some computational methods. [Pg.242]

Time is the enemy when considering human actions. As a plant ages, minor modifications and changes can individually or collectively cause human performance problems. For instance, in almost every plant over 5 years old, a search would probably reveal a series of control circuits arranged in illogical sequence. The most common reason is due to a minor modification, and the most convenient solution was to rearrange the control switches slightly. [Pg.88]

A collection of known patterns of amino acids has been assembled and is available from the author or from the EMBL, Data Library (Bairoch 1991, EMBL 1991). The data base is called PROSITE and contains in the excess of 500 different motifs that one can search for in a sequence of unknown function. [Pg.271]

Figure 1 illustrates how SELEX is used to select an RNA species that binds tightly to ATR In step (1), a random mixture of RNA polymers is subjected to unnatural selection by passing it through a resin to which ATP is attached. The practical limit for the complexity of an RNA mixture in SELEX is about 1015 different sequences, which allows for the complete randomization of 25 nucleotides (425 = 1015). When longer RNAs are used, the RNA pool used to initiate the search does not include all possible sequences. RNA polymers that pass through the column are discarded those that bind to ATP are washed from the column with salt solution and collected. The collected RNA polymers are amplified by reverse transcriptase to make many DNA complements to the selected RNAs then an RNA polymerase makes many RNA complements of the resulting DNA molecules. This new pool of RNA is subjected to the same selection procedure, and the cycle is repeated a dozen or more times. At the end, only a few aptamers, in this... [Pg.1030]

The computer has become an essential tool in biochemical research. A computer may be used for the routine jobs of word processing and data collection and analysis. In addition, if a computer is connected to the Internet, then it may be used for biochemical literature searching, accessing information about nucleic acid and protein sequences, predicting protein structure, and seeking research methodology. In this experiment, students will be introduced to all of these skills in bioinformatics. [Pg.211]

Big Chemical Encyclopedia

Chemical substances, components, reactions, process design ...