The denominator in Equation 1,P(D), represents the likelihood of the data, and is calculated using standard methodology[62], and the numerator is calculated in a manner similar to that described previously (e.g.,[7],[64],[65]). or not site-specific functional shifts characterize the evolution of a protein and, if so, points to the specific sites and lineages in which these shifts have most likely occurred. Applying RASER to a dataset composed of large samples of HIV-1 sequences from different group M subtypes, we reveal rampant evolutionary shifts throughout the HIV-1 proteome. Most of these rate shifts have occurred during the divergence of the major subtypes, establishing that subtype divergence occurred together with functional diversification. We report further evidence for the emergence of a new sub-subtype, characterized by abundant rate-shifting sites. When focusing on the rate-shifting sites detected, we find that many are associated with known function relating to viral life cycle and drug resistance. Finally, we discuss mechanisms of covariation of rate-shifting sites. == Author Summary == The AIDS epidemic, inflicted by the human immunodeficiency virus (HIV), has already claimed 25 million lives, thus posing a global threat. Since its discovery, several HIV subtypes have emerged, DNAJC15 characterized by distinct genomic sequences and variable geographic locations. Here, we investigate the nature of the genetic differences among the subtypes. The neutral theory of evolution suggests that most genetic differences marginally affect the function of the encoded proteins (hence neutral) and thus occur randomly. Alternatively, changes in protein function are reflected by a pattern of nonrandom genetic differences. To address this issue, we developed a computational method, which studies the differences between sequences of different HIV subtypes, and estimates which of the explanations is more likely. Using a large sample of HIV protein sequences, we discovered that part of the variability among the subtypes is not random and possibly reflects different functional constraints imposed on the subtypes during the course of their evolution. An in-depth inspection of these nonrandom changes revealed a correlation with biological traits, such as drug resistance and mechanisms facilitating viral entry into the host cell. Interestingly, nonrandom changes are also characteristic of a viral strain that recently emerged in the former Soviet Union. == Launch == Genomic variety is normally an integral feature from the Individual Immunodeficiency Trojan type 1 (HIV-1). This high variety has led to the introduction of several distinctive sets of the trojan, characterized by distinctive DNA sequences. HIV-1 is normally traditionally categorized into 3 groupings: M (main), O (outlying), and N (brand-new)[1],[2]. The M group makes up about 90% of reported HIV-1 attacks, and it is further split into nine subtypes: A, B, C, D, F, G, H, J, and K, each which is connected with a particular geographical area roughly. Subtype C makes up about half of most brand-new attacks almost, and predominates in southern and eastern Africa, India, and Nepal. Subtypes A, D, G, H, and K have already been discovered in different parts of Africa. Subtype F is normally common in central Africa, SOUTH USA and P005091 east European countries, whereas subtype J is normally exceptional to Central America. Subtype B is normally predominant under western culture (European countries, the Americas, Japan, and Australia). Therefore, subtype B may be the most examined subtype in the lab broadly, despite being in charge of just 12% of global attacks[3]. Different HIV-1 subtypes screen just as much as 2030% deviation within their Env nucleotide sequences[4]. Alternatively, the Gag and Pol P005091 sequences of different subtypes screen much less variety, given that they encode the three essential enzymes (protease, change transcriptase (RT), and integrase) as well as the viral structural protein, which are much less tolerant to adjustments. Huge distinctions among subtypes can be found in the accessories and regulatory proteins Nef also, Vif, Vpr, Vpu, Rev, and Tat. For instance, subtype C encodes a truncated Rev proteins and an elongated Vpu proteins[5], both which are useful. To time, inconclusive P005091 evidence is available P005091 on the overall aftereffect of the high hereditary variety of HIV-1 subtypes on proteins functionality (analyzed in[1],[3]). Many studies have discovered positive Darwinian selection to have an effect on only specific clades in the Env[6],[7], protease, and RT proteins[8],[9]. Aswell, adaptive coevolutionary occasions were found to describe a number of the variability between subtypes[10]. Lately, differential conservation of placement 31 in the Tat proteins among different subtypes was discovered to correlate with different efficiency from the this proteins in subtype C[11]. Alternatively,.