Bionoi: A Voronoi Diagram-Based Representation of Ligand-Binding Sites in Proteins for Machine Learning Applications
Bionoi is a new software to generate Voronoi representations of ligand-binding sites in proteins for machine learning applications. Unlike many other deep learning models in biomedicine, Bionoi utilizes off-the-shelf convolutional neural network architectures, reducing the development work without sacrificing the performance. When initially generating images of binding sites, users have the option to color the Voronoi cells based on either one of six structural, physicochemical, and evolutionary properties, or a blend of all six individual properties. Encouragingly, after inputting images generated by Bionoi into the convolutional autoencoder, the network was able to effectively learn the most salient features of binding pockets. The accuracy of the generated model is evaluated both visually and numerically through the reconstruction of binding site images from the latent feature space. The generated feature vectors capture well various properties of binding sites and thus can be applied in a multitude of machine learning projects. As a demonstration, we trained the ResNet-18 architecture from Microsoft on Bionoi images to show that it is capable to effectively classify nucleotide- and heme-binding pockets against a large dataset of control pockets binding a variety of small molecules. Bionoi is freely available to the research community at https://github.com/CSBG-LSU/BionoiNet.
Publication Source (Journal or Book title)
Methods in molecular biology (Clifton, N.J.)
Feinstein, J., Shi, W., Ramanujam, J., & Brylinski, M. (2021). Bionoi: A Voronoi Diagram-Based Representation of Ligand-Binding Sites in Proteins for Machine Learning Applications. Methods in molecular biology (Clifton, N.J.), 2266, 299-312. https://doi.org/10.1007/978-1-0716-1209-5_17