Evolutionary Strategy for Optimizing Neural Models for Compound Similarity Estimation

Authors

DOI:

https://doi.org/10.24215/15146774e066

Keywords:

graph neural networks, evolutionary computation, compound similarity, metabolic pathways

Abstract

Molecular similarity evaluation is a key aspect of bioinformatics and poses a significant challenge when dealing with compounds with unknown structures. In this context, graph neural networks have proven effective in generating representations based on the topology of chemical reactions. However, designing these models and selecting their hyperparameters requires exploring a vast range of options. Evolutionary algorithms naturally arise as a solution for searching these extensive spaces, including the hyperparameter space of neural architectures. This study presents a comparison between a traditional hyperparameter search approach, based on expert knowledge, and a method leveraging evolutionary computation for the same task, specifically in compound similarity estimation. Using a predefined architecture, experiments are conducted to compare both approaches across different datasets. The results indicate that the evolutionary computation-based method successfully identifies suitable hyperparameters for the evaluated architecture, achieving a performance comparable to the expert-driven approach while eliminating the need for human intervention in the selection process.

Downloads

References

Arita, M.: From Metabolic Reactions to Networks and Pathways, pp. 93–106 (2012)

Baeck, T., Fogel, D., Michalewicz, Z.: Evolutionary Computation 1: Basic Algorithms and Operators. CRC Press (2000)

Bajusz, D., Rácz, A., Héberger, K.: Why is tanimoto index an appropriate choice for fingerprint–based similarity calculations? Journal of Cheminformatics 7(1), 20 (2015)

Borzone, E., Di Persia, L.E., Gerard, M.: Neural model-based similarity prediction for compounds with unknown structures. In: Applied Informatics. pp. 75–87 (2022)

Fujimoto, K.J., Minami, S., Yanai, T.: Machine-learning- and knowledge-based scoring functions incorporating ligand and protein fingerprints. ACS Omega 7(22), 19030–19039 (2022)

Gilmer, J., Schoenholz, S.S., Riley, P.F., Vinyals, O., Dahl, G.E.: Neural message passing for quantum chemistry. Proceedings of Machine Learning Research, vol. 70, pp. 1263–1272 (2017)

Goldberg, D.E.: Genetic Algorithms in Search, Optimization, and Machine Learning. Addison-Wesley (1989)

Hutter, M.C.: Differential multimolecule fingerprint for similarity search–making use of active and inactive compound sets in virtual screening. Journal of Chemical Information and Modeling 62(11), 2726–2736 (2022)

Kang, J.S., Kang, J., Kim, J.J., Jeon, K.W., Chung, H.J., Park, B.H.: Neural architecture search survey: A computer vision perspective. Sensors 23(3) (2023)

Kumar, A., Zhang, K.Y.J.: Advances in the development of shape similarity methods and their application in drug discovery. Frontiers in Chemistry 6 (2018)

Kyriakides, G., Margaritis, K.: Evolving graph convolutional networks for neural architecture search. Neural Computing and Applications 34(2), 899–909 (2022)

Rahman, S.A., Advani, P., Schunk, R., Schrader, R., Schomburg, D.: Metabolic pathway analysis web service (Pathway Hunter Tool at CUBIC). Bioinformatics 21(7), 1189–1193 (2004)

Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph attention networks (2018)

Vignolo, L.D., Milone, D.H., Scharcanski, J.: Feature selection for face recognition based on multi-objective evolutionary wrappers. Expert Systems with Applications 40(13), 5077–5084 (2013)

Vignolo, L.D., Rufiner, H.L., Milone, D.H.: Multi-objective optimisation of wavelet features for phoneme recognition. IET Signal Processing 10(6), 685–691 (2016)

Vignolo, L.D., Rufiner, H.L., Milone, D.H., Goddard, J.C.: Evolutionary splines for cepstral filterbank optimization in phoneme classification. EURASIP Journal on Advances in Signal Processing 2011, 8:1–8:14 (2011)

Published

2025-04-01