Symmetry 2021, 13, 1849. https://doi.org/10.3390/sym13101849 www.mdpi.com/journal/symmetry
Article
An Application of the Eigenproblem for Biochemical Similarity
Dan-Marian Joiţa
1,
*, Mihaela Aurelia Tomescu
2
, Donatella Bàlint
1
and Lorentz Jäntschi
1,3,
*
1
Chemistry Doctoral School, Babeş-Bolyai University, 400084 Cluj, Romania; balintdonna@gmail.com or
donatella.balint@ubbcluj.ro
2
Department of Mathematics and Informatics, University of Petroșani, 332006 Hunedoara, Romania;
MihaelaTomescu@upet.ro
3
Department of Physics and Chemistry, Technical University of Cluj-Napoca, 400641 Cluj, Romania
* Correspondence: djoita@chem.ubbcluj.ro (D.-M.J.); lorentz.jantschi@chem.utcluj.ro (L.J.)
Abstract: Protein alignment finds its application in refining results of sequence alignment and un-
derstanding protein function. A previous study aligned single molecules, making use of the mini-
mization of sums of the squares of eigenvalues, obtained for the antisymmetric Cartesian coordinate
distance matrices Dx and Dy. This is used in our program to search for similarities between amino
acids by comparing the sums of the squares of eigenvalues associated with the Dx, Dy, and Dz dis-
tance matrices. These matrices are obtained by removing atoms that could lead to low similarity.
Candidates are aligned, and trilateration is used to attach all previously striped atoms. A TM-score
is the scoring function that chooses the best alignment from supplied candidates. Twenty essential
amino acids that take many forms in nature are selected for comparison. The correct alignment is
taken into account most of the time by the alignment algorithm. It was numerically detected by the
TM-score 70% of the time, on average, and 15% more cases with close scores can be easily distin-
guished by human observation.
Keywords: eigenproblem; eigenvalues; molecular alignment; orthogonal alignment; biochemical
similarity; antisymmetric matrix
1. Introduction
Just visualizing two simple similar structures leads to an immediate detection of pat-
terns. Similarity is of convenience for humans, but to power automatic decision mecha-
nisms for a PC, it must be measurable. It is mostly used for comparing proteins, but the
growing number of PDB structures (currently over 180,000) is many orders of magnitude
higher than what the human eye can compare. Because of the large number, it takes days
even for current programs to search the database for a query structure. A more reasonable
time can be achieved by developing new algorithms [1].
Protein alignment finds its application in refining results of sequence alignment and
understanding protein function [2,3]. Choosing the alignment that is most geometrically
similar is an easier task compared to evaluating its biological significance [4]. The pursuit
of the best method is in progress, with multiple programs being developed during the
past decades:
• CAB-Align uses the residue–residue contact area to identify regions of similarity [5].
• Caretta uses rotation-invariant technique signals of distances derived from overlap-
ping contiguous stretches of residues to find an initial superposition [6].
• DALI [7].
• LS-align generates fast and accurate atom-level structural alignments of ligand mol-
ecules through an iterative heuristic search of the target function that combines com-
parisons of inter-atom distance with mass and chemical bonds [8].
Citation: Joiţa, D.-M.;
Tomescu, M. A.; Bàlint, D.; Jäntschi,
L. An Application of the
Eigenproblem for Biochemical Simi-
larity. Symmetry 2021, 13, 1849.
https://doi.org/10.3390/sym1310184
9
Academic Editors: Anthony
Harriman and Enrico Bodo
Received: 13 August 2021
Accepted: 23 September 2021
Published: 2 October 2021
Publisher’s Note: MDPI stays neu-
tral with regard to jurisdictional
claims in published maps and insti-
tutional affiliations.
Copyright: © 2021 by the authors.
Licensee MDPI, Basel, Switzerland.
This article is an open access article
distributed under the terms and con-
ditions of the Creative Commons At-
tribution (CC BY) license
(https://creativecommons.org/li-
censes/by/4.0/).