openmolecules.org

 
Home » DataWarrior » Functionality » How to perform general R group searches in DW (Substructure searches)
How to perform general R group searches in DW [message #1458] Tue, 28 December 2021 15:49 Go to next message
Jo W
Messages: 34
Registered: July 2021
Member
In the Chemical Editor Concepts section of the online manual, the instructions state to use "?" when using specify an unusual/custom atom label, a specific isotop, an abnormal valence and/or a radical state. when drawing a structure in the strcture editor.

However, when I type"R" to specify any group / atom and then do a substructure search in the dataset or via chembl, no structures are returned. So this is obviously the wrong command/label.
I have searched the manual and the forums but can't find the answer.

So for example, if you wanted to search for a series of alkane alcohols in a dataset. How do you create:
R-CH2OH as opposed to tediously searching for CH3CH2OH, CH3CH2CH2OH

Also how do you search using two different R groups at the same time within the molecule?
e.g., R1-CH2O-R2
Many thanks
Re: How to perform general R group searches in DW [message #1463 is a reply to message #1458] Tue, 11 January 2022 07:06 Go to previous messageGo to next message
nbehrnd is currently offline  nbehrnd
Messages: 204
Registered: June 2019
Senior Member
Dear JonW,

at least the nightly-builds version of DW contains a function to define a substructure query. Follow Chemistry -> From Chemical Structure -> Add Substructure Count to open a new instance of the sketcher, and decide in favour/against the criterion of overlapping substructure matches. In the matrix view, you obtain a new column with the number of occurrences of the pattern; at the right hand side, there will be a new slide ruler to filter the display. By substructure, one may define queries permitting more than one atom type on one site, too:

/forum/index.php?t=getfile&id=488&private=0

Norwid
  • Attachment: 01.png
    (Size: 53.10KB, Downloaded 153 times)
  • Attachment: 02.png
    (Size: 31.65KB, Downloaded 156 times)
  • Attachment: 03.png
    (Size: 25.74KB, Downloaded 150 times)
  • Attachment: 04.png
    (Size: 31.28KB, Downloaded 554 times)
Re: How to perform general R group searches in DW [message #1467 is a reply to message #1463] Tue, 11 January 2022 16:41 Go to previous messageGo to next message
thomas is currently offline  thomas
Messages: 646
Registered: June 2014
Senior Member
DataWarrior uses an atom table derived from MDL's original when defining the Molfile format. This table contains R, R1 till R16, A, the amino acids, e.g. 'Ala' and a few others as pseudo atoms. Therefore, you can use these labels, but there is no logic behind them. If a Chembl molecule would contain an R as pseudo atom, you could find it with a substructure search.

You could use the following substructure definintion to exclusively find alkyl alcohols:
/forum/index.php?t=getfile&id=489&private=0

This is a C-O with the carbon defined to have no pi electrons and the oxygen set to have no further non-H neighbour. Then there are to exclude groups:

1) The '!C' is an ANY atom except carbon. This prevents any matches to molecules with non-carbon atoms in addiction to the required OH.

2) A C=C double bond to prevent double bonds anywhere. Actually this is not quite correct. Better would a carbon atom with 'at least one pi-electron' as exclude group.

Two R-groups at an otherwise specified molecules could be done analogously.
  • Attachment: t.png
    (Size: 6.74KB, Downloaded 417 times)
Re: How to perform general R group searches in DW [message #1546 is a reply to message #1467] Thu, 17 March 2022 22:17 Go to previous messageGo to next message
Jo W
Messages: 34
Registered: July 2021
Member
Thanks Thomas and Norwid
Many thanks for your comments and suggestions.
I didnt respond straight away because, to be honest, I didn't fully understand your answers!
Having now come back to this from time to time over the past few months its been interesting to try out those approaches, but I believe they still don't quite address what I want to do. If you or anyone else has further suggestions, please do post them here or PM me directly. When is the next version of DW going to be available and is there any new instruction videos coming out?
Re: How to perform general R group searches in DW [message #1549 is a reply to message #1546] Thu, 17 March 2022 22:42 Go to previous message
nbehrnd is currently offline  nbehrnd
Messages: 204
Registered: June 2019
Senior Member
Dear Jon,

as soon as you have a running installation of DataWarrior,[1] a click on «I have read and understood the disclaimer» opens a new menu to provide access to the development versions with the .jar to substitute for improved functionality. Depending on the operating system available to you, this either is the archive for Mac/Linux,[2] or for Windows.[3] At present, the current development version was prepared Pi day, 2022-03-14.

Norwid

[1] https://openmolecules.org//datawarrior/download.html
[2] https://openmolecules.org/datawarrior/dw550x.zip
[3] https://openmolecules.org/datawarrior/dw550win.zip
Previous Topic: Functionality request: MPO score
Next Topic: Chemistry in 3D
Goto Forum:
  


Current Time: Fri Mar 29 10:10:34 CET 2024

Total time taken to generate the page: 0.07711 seconds