Fast Sigma: a QSPR method to estimate COSMO sigma-profiles

Introduction

The Fast Sigma program reads a molecule in SMILES format and estimates all of the properties required for a COSMO-RS/-SAC calculation: the HB-/Non-HB-/OT-/OH- \(\sigma\)-profiles, COSMO surface area, and COSMO volume. This code uses QSPR techniques similar to those applied in our Property Prediction program. The efficiency of these techniques means that this program can provide estimates for these essential COSMO-RS/-SAC properties in milliseconds. This allows for quick estimates to be done for a new molecule of interest as well as drastically expedited searches through screening databases of molecular candidates as compared to the full-fledged COSMO-RS strategy, which requires costly DFT calculations.

Input options

A list of the input options and examples of their usage is given below.

Flag Purpose Example
-h [–help] Produces help message $AMSBIN/fast_sigma –help
-s [–smiles] Input molecule as SMILES sting $AMSBIN/fast_sigma –smiles <SMILES> …
-m [–mol] Input molecule as .mol file $AMSBIN/fast_sigma –mol <mol file> …
-d [–display] Display problem results $AMSBIN/fast_sigma -d …
-o [–output] Write output to file $AMSBIN/fast_sigma –o <output.compkf> …
–method Chose a COSMO-RS/-SAC method $AMSBIN/fast_sigma –method COSMO-RS …

There are currently two supported methods: COSMO-RS and COSMOSAC2016. One of these method names must be entered after the –method flag. The default method is COSMO-RS.

<output.compkf>
The fast sigma program writes the output results to a file in .compkf format. The chosen output filename should generally end with .compkf. This suffix helps other parts of the code (COSMO-RS/-SAC/-UNIFAC/Solvent Optimization) recognize the format and use the file accordingly. If no filename is supplied the program writes to a file called CRSKF.
SMILES_string or .mol file
Though COSMO-RS/-SAC can make estimates for many types of molecular species, the fast sigma program currently only supports organic, neutral, closed shell molecules.

GUI Input

The simplest way to use the Fast Sigma program is through the COSMO-RS GUI. There are two ways to do this:

  • SMILES string: Compounds → List of Compounds → Add Compound using QSPR (Fast Sigma) → SMILES and select Add.
  • .xyz file: Compounds → List of Compounds → Add Compound using QSPR (Fast Sigma) → .xyz, and select Add.

A .compkf file will be saved that can be used as input in COSMO-RS calculations.

Examples

This example calculates COSMO-RS (the default) parameters for phenol:

$AMSBIN/fast_sigma --smiles "c1ccccc1(O)" -d
       sigma value       Total profile          HB profile
            -0.025               0.000               0.000
            -0.024               0.000               0.000
            -0.023               0.000               0.000
            -0.022               0.002               0.002
            -0.021               0.054               0.054
            -0.020               0.263               0.263
            -0.019               0.523               0.523
            -0.018               0.684               0.684
            -0.017               0.828               0.828
            -0.016               0.801               0.801
            -0.015               0.732               0.716
            -0.014               0.642               0.597
            -0.013               0.653               0.519
            -0.012               0.678               0.487
            -0.011               0.607               0.423
            -0.010               0.567               0.382
            -0.009               0.646               0.245
            -0.008               4.183               0.023
            -0.007               7.405               0.000
            -0.006               7.912               0.000
            -0.005               6.701               0.000
            -0.004               5.544               0.000
            -0.003               4.658               0.000
            -0.002               3.899               0.000
            -0.001               4.097               0.000
             0.000               6.109               0.000
             0.001               7.854               0.000
             0.002               8.640               0.000
             0.003               9.726               0.000
             0.004              11.175               0.000
             0.005              12.524               0.000
             0.006               8.673               0.000
             0.007               2.255               0.000
             0.008               1.174               0.161
             0.009               1.279               1.159
             0.010               1.442               1.442
             0.011               1.759               1.751
             0.012               1.795               1.788
             0.013               0.838               0.829
             0.014               0.095               0.093
             0.015               0.054               0.054
             0.016               0.030               0.030
             0.017               0.000               0.000
             0.018               0.000               0.000
             0.019               0.000               0.000
             0.020               0.000               0.000
             0.021               0.000               0.000
             0.022               0.000               0.000
             0.023               0.000               0.000
             0.024               0.000               0.000
             0.025               0.000               0.000
       Molecular Mass =        94.0418648120 g/mol
           COSMO Area =       127.5012207186 Angstrom**2
         COSMO Volume =       122.0791950835 Angstrom**3
Gas Phase Bond Energy =        -2.9875007647 Hartree
          Bond Energy =        -2.9968155744 Hartree
           Dispersion =        -4.5319123638 kcal/mol
           Deltaediel =         0.0000000000 Hartree
                Nring =         6
     Chemical Formula =         C6H6O
               SMILES =         c1ccccc1(O)

Additionally, we calculate the COSMOSAC2016 parameters for Ibuprofen as a mol file:

$AMSBIN/fast_sigma --mol Ibuprofen.mol --method COSMOSAC2016 -d
       sigma value       Total profile          OH profile          OT profile
            -0.025               0.000               0.000               0.000
            -0.024               0.000               0.000               0.000
            -0.023               0.000               0.000               0.000
            -0.022               0.000               0.000               0.000
            -0.021               0.009               0.009               0.000
            -0.020               0.062               0.061               0.000
            -0.019               0.395               0.385               0.000
            -0.018               0.914               0.881               0.000
            -0.017               0.925               0.879               0.000
            -0.016               0.840               0.781               0.000
            -0.015               0.652               0.590               0.000
            -0.014               0.697               0.606               0.000
            -0.013               0.604               0.499               0.000
            -0.012               0.561               0.398               0.000
            -0.011               0.725               0.418               0.000
            -0.010               0.833               0.350               0.000
            -0.009               1.282               0.230               0.000
            -0.008               2.141               0.158               0.000
            -0.007               5.133               0.085               0.000
            -0.006              10.428               0.048               0.000
            -0.005              14.386               0.000               0.000
            -0.004              23.816               0.000               0.000
            -0.003              26.081               0.000               0.000
            -0.002              23.295               0.000               0.000
            -0.001              21.443               0.000               0.000
             0.000              22.124               0.000               0.000
             0.001              20.652               0.000               0.000
             0.002              24.315               0.036               0.000
             0.003              15.722               0.086               0.035
             0.004              11.878               0.171               0.092
             0.005              13.670               0.288               0.197
             0.006              10.405               0.381               0.307
             0.007               5.479               0.561               0.413
             0.008               3.525               0.713               0.613
             0.009               3.358               0.823               1.055
             0.010               3.879               0.639               1.840
             0.011               4.503               0.180               3.025
             0.012               2.708               0.083               2.006
             0.013               0.930               0.020               0.745
             0.014               0.061               0.000               0.104
             0.015               0.000               0.000               0.000
             0.016               0.000               0.000               0.000
             0.017               0.000               0.000               0.000
             0.018               0.000               0.000               0.000
             0.019               0.000               0.000               0.000
             0.020               0.000               0.000               0.000
             0.021               0.000               0.000               0.000
             0.022               0.000               0.000               0.000
             0.023               0.000               0.000               0.000
             0.024               0.000               0.000               0.000
             0.025               0.000               0.000               0.000
       Molecular Mass =       206.1306798160 g/mol
           COSMO Area =       278.4276940312 Angstrom**2
         COSMO Volume =       279.3341044098 Angstrom**3
Gas Phase Bond Energy =        -7.1463537624 Hartree
          Bond Energy =        -7.1619486814 Hartree
           Dispersion =        -9.7153055452 kcal/mol
           Deltaediel =         0.0007518662 Hartree
                Nring =         0
     Chemical Formula =         C13H18O2
               SMILES =         CC(C)Cc1ccc(C(C)C(=O)O)cc1