openmolecules.org

 
Home » DataWarrior » Functionality » Similarity Search
Similarity Search [message #2329] Mon, 28 October 2024 14:55 Go to next message
Rishi is currently offline  Rishi
Messages: 2
Registered: July 2023
Junior Member
Hi,

I want to conduct a similarity search for one compound against ~60K compounds within the same set. I'm only really interested in the similarity between the one specific compound against the 60K set, and not the similarity between all other compounds within the set.

When I run a search such as the FragFP similarity search the estimated time is >50 hours. I assume this is because it's calculating similarity between all compounds with the 60K set.

Is there a way of just conducting the search on the one compound against the set to reduce calculation times of similarity based positions?

Many thanks!
Re: Similarity Search [message #2330 is a reply to message #2329] Mon, 28 October 2024 16:39 Go to previous messageGo to next message
thomas is currently offline  thomas
Messages: 711
Registered: June 2014
Senior Member
If you have an open DataWarrior window with 60k compounds and draw a structure in the structure filter to perform a similarity search, the search should be immediate. What exactly do you do?
Re: Similarity Search [message #2331 is a reply to message #2330] Mon, 28 October 2024 16:50 Go to previous messageGo to next message
Rishi is currently offline  Rishi
Messages: 2
Registered: July 2023
Junior Member
I want to conduct both FragFP searches and also flexophore searches, but on one specific compound against the larger set
Re: Similarity Search [message #2339 is a reply to message #2331] Thu, 31 October 2024 10:18 Go to previous message
thomas is currently offline  thomas
Messages: 711
Registered: June 2014
Senior Member
That should be straight-forward:
- open the file with the 60k compounds
- if you don't see a structure filter, then add a new one from Edit->New Filter
- choose similarity(FragFp) or (Flexophore) in the filter and draw or paste in the query struccture
(if option "similarity (flexophore)" is missing, you need to calculate flexophores (Chemistry->From Chemical Structure->Calculate Descriptor->Flexophore"

FragFp filtering should be immediate, flexophore filtering may take a few seconds. If the filtering doesn't seem to finish, then the filter process crashed. Then, most likely there is something wrong with the data. You can try then to delete the FragFp or Flexophore column and recreate it. Then try again. Or send me the file (ideally just a few rows that have the same problem) and I check...
Previous Topic: polycyclic molecule displaying issue
Next Topic: Can I see all the molecules from a given patent at once?
Goto Forum:
  


Current Time: Thu Oct 31 15:22:09 CET 2024

Total time taken to generate the page: 0.00626 seconds