openmolecules.org

 
Home » DataWarrior » Cheminformatics » AllFragFp
Re: AllFragFp [message #2182 is a reply to message #2165] Wed, 24 April 2024 14:45 Go to previous messageGo to previous message
thomas is currently offline  thomas
Messages: 693
Registered: June 2014
Senior Member
AllFragFp is substantially different from FragFp: it is hashed and uses 2048 bits. AllFragFp internally generates all substructures of a given molecule with up to 6 connected bonds including stereo chemistry. These substructures are converted into a canonical representation from which a hash code between 0 to 2047 is generated, for which the corresponding bit is set. The original idea was to accelerate the substructure search by a more discriminating descriptor than the FragFp. If the AllFragFp descriptor is available in a DataWarrior file, then DataWarrior uses that for substructure pre-screening. Since the sub-structure search is usually fast for mot more than some hundred thousand molecules, one shouldn't bother to use the AllFragFp. For many millions, however, it makes a significant difference.

Regarding the value of similarities calculated by this descriptor, I didn't really investigate it applicability domain. It certainly will produce very fine grained similarity values, but the SkeletonSpheres descriptor is will probably generate more intuitive ones, because by design single atom replacements cause less large losses of similarity compared to other substructure based descriptors.
 
Read Message
Read Message
Read Message
Read Message
Read Message
Read Message
Previous Topic: running a DW macro in batch
Next Topic: Create a macro to split a .dwar file into multiple .dwar files
Goto Forum:
  


Current Time: Sun Aug 25 01:26:29 CEST 2024

Total time taken to generate the page: 0.04008 seconds