Minería de Patrones Secuenciales aplicada a la Predicción del Plegamiento de Proteínas

J. Quintana-Zaez, Héctor R. Velarde-Bedregal, Guillermo Calderón-Ruiz, Cosme E. Santiesteban-Toca

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

Resumen

Sequence mining consists of finding statistically relevant patterns in data collections represented sequentially. These, are an important type of data, where it matters the order that occupy the elements in the set and that finds a wide range of applications in Bioinformatics and Computational Biology. The prediction of protein structures is one of these applications. Where, a protein is no more than a sequence of amino acids forming patterns known as alpha helices, beta sheets and turns. For purposes of our investigation, these collections or secondary structures would be the itemsets, while the amino acids that make up the entire sequence, the items. Despite multiple attempts to predict protein folding, the algorithms developed to date only reach a 35% effectiveness. That is why we propose SPMCcm, an algorithm based on the prediction of frequent sequences and a scheme of classifiers. Which uses the information provided by the amino acid sequence, in two stages. Where, the first stage learns of the interactions between the secondary structures of the proteins, which it extracts as frequent sequences or itemsets. Meanwhile, the second stage learns of the interaction between the amino acids present in the interacting structures or items. The experimental evaluation showed that SPMCcm behaves in a similar way, independently of the base classifier used, reaching accuracies in the prediction of up to 48%, higher than the 35% reported by the literature, without using large computational resources and possessing explanatory capacity.

Título traducido de la contribuciónMining of sequential patterns applied to the prediction of protein folding
Idioma originalEspañol
Título de la publicación alojada17th LACCEI International Multi-Conference for Engineering, Education, and Technology
Subtítulo de la publicación alojada"Industry, Innovation, and Infrastructure for Sustainable Cities and Communities", LACCEI 2019
EditorialLatin American and Caribbean Consortium of Engineering Institutions
ISBN (versión digital)9780999344361
DOI
EstadoPublicada - 2019
Evento17th LACCEI International Multi-Conference for Engineering, Education, and Technology, LACCEI 2019 - Montego Bay, Jamaica
Duración: 24 jul. 201926 jul. 2019

Serie de la publicación

NombreProceedings of the LACCEI international Multi-conference for Engineering, Education and Technology
Volumen2019-July
ISSN (versión digital)2414-6390

Conferencia

Conferencia17th LACCEI International Multi-Conference for Engineering, Education, and Technology, LACCEI 2019
País/TerritorioJamaica
CiudadMontego Bay
Período24/07/1926/07/19

Palabras clave

  • Classification schemes
  • Contact maps
  • Mining sequential patterns
  • Protein folding

Huella

Profundice en los temas de investigación de 'Minería de Patrones Secuenciales aplicada a la Predicción del Plegamiento de Proteínas'. En conjunto forman una huella única.

Citar esto