Add hydrogens

6.9 Add hydrogens

In order to add hydrogens to a molecule or to change the ionization state, you can use the following dialog box that can be shown by Edit Add Hydrogens menu item:

Add hydrogens

6.9.1 Dialog options

In Actions box, you can choose to add the hydrogens and/or to ionize the molecule in the current workspace. You must remember that if you want to ionize, the molecule must have the hydrogens and the algorithm add/remove the hydrogens according to the predicted pKa values and the specified pH.

In Molecule type box, you can select the molecule type (Generic organic, Protein, Nucleic acid), in Position of hydrogens, you can select where the hydrogens will be placed in the molecule file (After each heavy atom or at Residue end) and in Options, you can also choose the nomenclature type used for the hydrogen atoms (normal progressive number or IUPAC nomenclature), the possibility to process the selected atoms only (Consider selected atoms only) and algorithm used to add the hydrogens. More in detail, if you check Use the bond order to assign the atom types, instead of the bond geometry, the bond order is not used to detect the atom hybridization. This feature is useful when the molecule has uncertain geometry (e.g. 2D instead of 3D structure) and the bond order is correctly assigned. When you enable the ionization feature by checking Ionize the molecule, you can also specify the pH of the solution (Ionization pH) in which the molecule is virtually present as solute and the tolerance (Ionization tol., in pH units) used to establish which protonation state is largely present. To this end, the Henderson-Hasselbalch equation is used: if you specify 1 as ionization tolerance, it means that, at the specified pH value, the concentration of the ionized form must be at least 10 times higher than the non-ionized one to consider the molecule as ionized.
Finally, you must click Add to add the hydrogens and/or to ionize the molecule.

6.9.2 About the method to add the hydrogens

To add the hydrogens, the recognition of atom types and valences is needed. The powerful ATDL geometry-independent engine can't be used because the atom valences are incomplete due to the missing hydrogens and so the implemented method is based on the original code included in Babel 1.6 (© 1992-96, W. Patrick Walters, Matthew T. Stahl) that detects the atom valences considering the atomic distances and the bond angles. Unluckily, this is the method limit, because some structures don't have a canonical geometry and so the valence detection can be wrong. VEGA introduces an algorithm that fixes the atom type and the valence detection when the user specify the type of the molecule. You must remember that this check is disabled selecting Generic organic molecule type.
The method can be summarized in the following steps:

All hydrogens are removed in order to allow the correct atom type detection.
Atom type and valence attribution: the atom types are detected using the Meng convention (E. Meng and R. Lewis, J. Comp. Chem., 12, pp 891-898, 1991) updating it with Nim atom type to detect the nitrogens in five member rings.
Optionally, to fix the incorrect atom types, VEGA ZZ uses two templates (NA.hyb for nucleic acids and PROTEIN.hyb for proteins) placed in the Data directory. They are text files witch format is explained in the header.
The required hydrogens are placed and bonded.

6.9.3 About the method to ionize the molecule

To predict the ionization state of a molecule, the knowledge of the pKa values of each moiety included in its structure is needed. At this time, it was implemented a fast method based on the recognition of acid and basic groups whose pKa values are known. The Handerson-Hasselbach equation is used to evaluate the ratio between the acid/base pairs at the specified pH value and the group is considered ionized only if the this ratio is grater than the ionization tolerance defined by the user. The algorithms is so summarized:

Recognition of the acid hydrogens by using PKA.tem ATDL template.
Attribution of the pKa values (see PK.par file in Data directory).
Deletion of the hydrogens if the ratio between non-protonated (negatively charged) and protonated (neutral) state overcomes the ionization tolerance.
Recognition of the basic groups by using PKB.tem ATDL template.
Attribution of the pKa values.
Addition of the hydrogens if the ratio between protonated (positively charged) and protonated (neutral) state overcomes the ionization tolerance.

All these steps are repeated until there are no hydrogens to add/remove.