TY - JOUR
T1 - Navigating the Chemical Space and Chemical Multiverse of a Unified Latin American Natural Product Database
T2 - LANaPDB
AU - Gómez-García, Alejandro
AU - Jiménez, Daniel A. Acuña
AU - Zamora, William J.
AU - Barazorda-Ccahuana, Haruna L.
AU - Chávez-Fumagalli, Miguel
AU - Valli, Marilia
AU - Andricopulo, Adriano D.
AU - Bolzani, Vanderlan da S.
AU - Olmedo, Dionisio A.
AU - Solís, Pablo N.
AU - Núñez, Marvin J.
AU - Rodríguez Pérez, Johny R.
AU - Valencia Sánchez, Hoover A.
AU - Cortés Hernández, Héctor F.
AU - Medina-Franco, José L.
N1 - Publisher Copyright: © 2023 by the authors.
PY - 2023/10
Y1 - 2023/10
AB - The number of databases of natural products (NPs) has increased substantially. Latin America is extraordinarily rich in biodiversity, enabling the identification of novel NPs, which has encouraged both the development of databases and the implementation of those that are being created or are under development. In a collective effort from several Latin American countries, herein we introduce the first version of the Latin American Natural Products Database (LANaPDB), a public compound collection that gathers the chemical information of NPs contained in diverse databases from this geographical region. The current version of LANaPDB unifies the information from six countries and contains 12,959 chemical structures. The structural classification showed that the most abundant compounds are the terpenoids (63.2%), phenylpropanoids (18%) and alkaloids (11.8%). From the analysis of the distribution of properties of pharmaceutical interest, it was observed that many LANaPDB compounds satisfy some drug-like rules of thumb for physicochemical properties. The concept of the chemical multiverse was employed to generate multiple chemical spaces from two different fingerprints and two dimensionality reduction techniques. Comparing LANaPDB with FDA-approved drugs and the major open-access repository of NPs, COCONUT, it was concluded that the chemical space covered by LANaPDB completely overlaps with COCONUT and, in some regions, with FDA-approved drugs. LANaPDB will be updated, adding more compounds from each database, plus the addition of databases from other Latin American countries.
KW - Latin America
KW - chemical multiverse
KW - chemical space
KW - chemoinformatics
KW - databases
KW - diversity
KW - drug discovery
KW - natural products
KW - virtual screening
UR - http://www.scopus.com/inward/record.url?scp=85175418518&partnerID=8YFLogxK
U2 - 10.3390/ph16101388
DO - 10.3390/ph16101388
M3 - Article
AN - SCOPUS:85175418518
SN - 1424-8247
VL - 16
JO - Pharmaceuticals
JF - Pharmaceuticals
IS - 10
M1 - 1388
ER -