alvaDesc is the next generation tool for the calculation of a wide range of molecular descriptors and a number of molecular fingerprints (Mauri, A. (2020). alvaDesc: A Tool to Calculate and Analyze Molecular Descriptors and Fingerprints. In K. Roy (Ed.), Ecotoxicological QSARs (pp. 801–820). Humana Press Inc. https://doi.org/10.1007/978-1-0716-0150-1_32).
It calculates more than 4000 descriptors independent of 3-dimensional information such as constitutional, topological, pharmacophore. It includes ETA and Atom-type E-state indices together with functional groups and fragment counts. Additionally, alvaDesc implements an extensive number of 3-dimensional descriptors such as 3D-autocorrelation, Weighted Holistic Invariant Molecular descriptors (WHIM) and GETAWAY. If needed, alvaDesc can calculate partial charges using the Gasteiger’s “Partial Equalization of Orbital Electronegativity” (PEOE).
Molecular Properties, Drug-like and Lead-like Indices
alvaDesc provides the calculation of several model-based physicochemical properties such as molar refractivity, topological polar surface area (TPSA), molecular volume estimations, three LogP models (Moriguchi, Ghose-Chippen and Wildmann-Crippen octanol-water partition coefficient), a LogP consensus model (LOGPcons) and a LogS aqueous solubility model (ESOL).
In order to get a quantitative estimation of synthetic accessibility of molecules, alvaDesc includes the synthetic accessibility score of drug-like molecules (SAscore).
There is a significative list of drug-like and lead-like alerts including the well-known Lipinski alert index. Considering the importance of drug-likeness when selecting compounds in the early stages of drug discovery, alvaDesc includes the calculation of the quantitative estimate of drug-likeness (QED).
alvaDesc carries out the calculation of MACCS166 fingerprint, Extended Connectivity Fingerprint (ECFP) and Path Fingerprint (PFP) and allows the customisation and calculation of the most used hashed molecular fingerprints. The calculation of hashed fingerprints can be tuned not only with respect to the fingerprint size, fragment type and dimensions but even by defining atom and bond parameters considered during fragment identifications (e.g. atom type, aromaticity, the number of attached hydrogen atoms, connectivity). alvaDesc provides the fragments identified during fingerprint calculation as SMARTS strings (SMARTS is a line notation developed by Daylight Chemical Information Systems for representing molecular substructures).
One of the most innovative features of alvaDesc is its capability to handle both full-connected and non-full-connected molecular structures, such as salts and ionic liquids. All of the molecular descriptor calculation algorithms provide different theoretical approaches for the calculation of molecular descriptors on such structures.
alvaDesc provides different tools to carry out a first exploration of your molecular dataset:
- molecule structure verification using PubChem services
- molecule structure visualisation and filtering
- Principal Component Analysis (PCA), t-SNE and correlation analysis
Due to its capability of calculating large numbers of molecular descriptors, alvaDesc provides variable reduction tools, including the fast V-WSP method (variable reduction method adapted from space-filling designs).
A short video introduction:
The software is 64bit and it’s available for Windows, Linux and macOS. It is provided both as an easy to use command line tool and as an intuitive graphical interface.
- The perfect tool to prepare your molecular dataset for alvaDesc is alvaMolecule
- alvaModel creates QSAR/QSPR models starting from an alvaDesc project
- A tutorial showing how to build a QSAR model using Alvascience tools
alvaDesc is also available as:
- a set of KNIME nodes. Please check the alvaDesc KNIME Plugin page for more information.
- a Python module. Please check the alvaDescCLIWrapper page for more information.