Big Chemical Encyclopedia

Chemical substances, components, reactions, process design ...

Articles Figures Tables About

Connection table canonicalization

The task is now to take one of the numberings as the standard one and to derive a unique code from it, which is called canonicalization. This can be accomplished by numbering the atoms of a molecule so that it is represented later by only one connection table or bond matrix. Such a unique and reproducible numbering or labeling of the atoms is obtained by a set of rules. [Pg.59]

Algorithm I - Registration - Canonicalization of Connection Tables. A connection table for a chemical substance with n atoms can be numbered in as many as n different ways. The problem of generating a canonical form involves selecting a... [Pg.142]

W.T. Wipke H) first achieved the canonicalization of a molecular structure representation in a computer program that includes all aspects of stereochemistry. In some other schemes, stereochemistry is not invoked to aid in the mambering of the atoms. Such schemes are incomplete. If we relegate stereochemistry to a footnote and the numbering of the atoms and the connection tables of two isomers can be the same, then we lose most of the above-stated advantages of canonicalizing. [Pg.131]

For the number of atoms involved, usually 25-40, the straight selection sort was noticeably faster than the tree selection sorts discussed by Knuth. (] ) The optimum sorting procedure in this case is very much an open question. The numbers of the ligands in each row were sorted by the bubble sort (17) because these numbers are usually in order before canonicaliza-tion, since they are usually in the same order in the reactant molecule at hand as they are in the already canonicalized product molecule connection table. The most efficient sort for an almost sorted list is the bubble sort. [Pg.146]

The first solution uses some algorithm that transforms any connection table of a molecule into a unique, canonical, form. The best known of these, the Morgan algorithm, chooses the numbering based on the numbers and properties of the neighbors of each atom of the structure. It is the basis of the Chemical Abstracts System Chemical Registry Service. There is also a canonicalization scheme for the SMILES notation of a chemical structure. ... [Pg.220]

An alternative, and highly efficient, structure search uses a hashing function, a computational procedure that takes some data record, such as a connection table or unique SMILES, and converts it to the computer address at which that record is stored." Although hashing functions may lead to more than one record with the same address, the detailed molecular comparisons need to be carried out for just those few molecules." " Because of canonicalization and hashing functions, identity searching is generally very fast. [Pg.220]


See other pages where Connection table canonicalization is mentioned: [Pg.2821]    [Pg.2821]    [Pg.191]    [Pg.115]    [Pg.378]    [Pg.761]    [Pg.471]    [Pg.136]    [Pg.141]    [Pg.146]    [Pg.2734]    [Pg.2815]   
See also in sourсe #XX -- [ Pg.136 ]




SEARCH



Canonicalization

Connection table

Connectivity table

© 2024 chempedia.info