openmolecules.org

 
Home » DataWarrior » Functionality » Suggestion: tautomer-check
Suggestion: tautomer-check [message #886] Wed, 06 May 2020 15:40 Go to next message
nbehrnd is currently offline  nbehrnd
Messages: 208
Registered: June 2019
Senior Member
DataWarrior's model to assign Druglikeness depends on the encoded structure a
tautomer is represented. With Ambit-Tautomer, Kochev et al. published an open
source tool (Java based), to predict tautormers and to rank their likelyhood.
If wanted, their executable may be run without the larger Ambit framework, just
by

java -jar ambit-tautomers-2.0.0-SNAPSHOT.jar

Thus, I would like to suggest DataWarrior could implement a function to check
if the structures to consider could reasonably yield a tautomer worth to probe
equally. This equally could be complementary to your recently published reference
tautomer.dwar.


Out of curiosity, I drew a pyridone, a pyrazole, and a thalomide with ACD ChemSketch
in two tautomeric forms, exported the SMILES strings (as defined by ACD ChemSketch)
into a .smi file:

index.php?t=getfile&id=176&private=0

With openbabel, it was converted into a .sdf accessible for DataWarrior by

obabel -ismi tautomers.smi -osd -O tautomers.sdf

successfully read and used to compute the Druglikeness. At least as the examples
about «hydroxypyridine» and the enol form of the thalomide differ in the results
examined.

index.php?t=getfile&id=182&private=0

To ease replication of the findings, the relevant files are provided below.

Publication about Ambit-Tautomer: https://doi.org/10.1002/minf.201200133
github-entry about Ambit-Tautomer: https://github.com/ideaconsult/apps-ambit/tree/master/tautom ers-example
Re: Suggestion: tautomer-check [message #899 is a reply to message #886] Mon, 11 May 2020 20:59 Go to previous message
thomas is currently offline  thomas
Messages: 648
Registered: June 2014
Senior Member
This is, of course, and important issue. Having the right tautomer to start with is crucial for any reliable prediction.
The intention behind the work on the published tautomer database is to allow others and ourselves to work on algorithms
to predict the most prominent tautomer. If we had such an algorithm with a reasonable performance at hand, we could use
it as input before training any prediction model. It also would allow to normalize pKa databases to improve or develop
algorithms to more reliably predict pKa-values, which is also in big demand. We have it in the pipeline, but it will
take time...
Previous Topic: Black and White structures
Next Topic: Working with Form Views
Goto Forum:
  


Current Time: Thu Apr 18 10:59:17 CEST 2024

Total time taken to generate the page: 0.04936 seconds