About Browser

You can use this ontology browser to quickly locate protein kinase proteins and a lot of information related to the proteins, including the sequence, structure, function, mutation and pathway information on kinases.

You may initiate ProKinO browsing by selecting one of the following items in the Browse menu:

  • Organisms, which provides the list of organisms; select an organism of interest and click on it to display the kinase proteins for the organism. Subsequently, you can select and click one of the proteins to explore.
  • Proteins, which provides the list of kinase proteins for many organisms; select a protein of interest and click on it to display the protein data.
  • Kinase Domains, which provides the hierarchy all kinase domains; you will be able to expand the groups into families and subfamilies, eventually finding the domains; again, select one of interest and click on it to display the kinase domain information.
  • Diseases, which provides the list of all diseases (limited to cancers at this time) that are affected by kinase proteins; select one of interest and click on it to display the cancer data.
  • Functional Domains, which provides the list of all functional domains; again, you may select one of interest and click on it to display the functional domain data.

It is also possible to initiate ProKinO browsing by searching for a specific kinase protein, disease, pathway, or for any object. The searching uses a set of terms, possibly separated by OR, AND and AND NOT boolean connectors. It is possible to use wildcards, for example, FGFR*. Phrases should be enclosed within double quotes.

The Query functionality for Proteins, Arbitrary SPARQL queries can be submitted (however, the size of the resulting set is limited to 1500, at this time).

The ProKinO ontology has not been released for downloading, yet.

After you initiate ProKinO browsing by selecting and displaying the data information page for one of the proteins, kinase domains, diseases, or functional domains, you will be able to explore the ontology by activating the hyper-links leading to a variety of related data.

Usually, a data information page includes the properties of the shown entity (for example, cellular location or tissue specificity of the proteins), and links to other related ontology entities (for example, functional domains, sequences, pathways of the proteins).

The protein data is subdivided into sub-areas, including the general information, functional features, mutations, pathways, and external references. These sub-areas of information are available by clicking on the provided tabs. The mutations are further subdivided into substitutions, insertions, deletions, complex mutations, and other.

Schema

The picture below is an outline of the ProKinO schema as a UML class diagram. It shows major ProKinO classes and object properties (relationships connecting ProKinO classes). It is available in PDF, as well

ProKinO Schema

People

ProKinO is a collaborative effort between the Evolutionary Systems Biology Group Lab of Dr. Natarajan Kannan at the Biochemistry and Molecular Biology Department and Dr. Krys J. Kochut's lab at the Computer Science Department, both at University of Georgia, Athens, USA. Gurinder Gosal, also at UGA, created the initial version of the software system to automatically populate ProKinO from the selected data sources.

Current Team Members
Natarajan Kannan
Dr. Natarajan Kannan
PI
BioInformatics
Krzysztof Kochut
Dr. Krzysztof Kochut
Co-PI
Computer Science
Saber Soleymani
Saber Soleymani
Research Assistant
Computer Science
Amitabh Priyadarshi
Nathan Gravel
Research Assistant
Bioinformatics
George Bendzunas
George Bendzunas
Postdoctoral Scholar
Bioinformatics
Amitabh Priyadarshi
Elika Bozorgi
Teaching Assistant
Computer Science
Amitabh Priyadarshi
Amitabh Priyadarshi
Research Assistant
Computer Science
Safal Shrestha
Safal Shrestha
Research Assistant
BioInformatics
Former Team Members
Liang-Chin Huang
Dr. Liang-Chin Huang
Research Assistant
BioInformatics
Abbas Keshavarzi
Dr. Abbas Keshavarzi
Research Assistant
Computer Science
Annie Kwon
Dr.Annie Kwon (IoB)
Postdoctoral Scholar
UCSF
Daniel McSkimming
Dr. Daniel McSkimming (IoB)
Director, Bioinf. &
Comp. Biology, USF
Shima Dastgheib
Dr. Shima Dastgheib (CS)
Technical Lead
Genentech/Roche
Gurinder Gosal
Dr. Gurinder Gosal (CS)
Researcher, Univ. of
British Columbia
Reshmi De
Reshmi De (CS)
Assistant Vice President
Citibank, Canada
Anish Narayanan
Anish Narayanan, M.D. (IoB)
Residency
UTHSC-San Antonio

Data Sources

The following table shows the versions of the datasources used for population of the ProKinO version loaded in this browser.

SourceVersion
COSMIC94
Reactome77
UniProt2021_03
KinBaseAugust, 2012
PseudokinasesSeptember, 2018
MSADecember, 2015
Sub-domainsJuly, 2021

Disease Data COSMIC(Catalogue of Somatic Mutations in Cancer) database ` is used as the source of information regarding kinase cancer mutations. In addition to mutations, other information such as, the primary sites, histology, samples, description and other relevant features is also obtained from COSMIC.

The mutation data was obtained from the Sanger Institute Catalogue Of Somatic Mutations In Cancer web site, http://www.sanger.ac.uk/cosmic Bamford et al (2004) The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer, 91,355-358.

Pathway and Reaction Data The pathway and reaction data in ProKinO is obtained from Reactome.

Reactome project. http://www.reactome.org/ (26th January 2011).

Function Data The information regarding functional domains and functional features associated with kinase domains is obtained from UniProt. The information regarding the functional domains associated with kinase domains, the crystal structures solved for each kinase domain, isoforms, and functional features associated with kinases, such as the modified residue, signal peptide, topological domain, cellular location and tissue specificity, are obtained from UniProt.

The UniProt Consortium, Ongoing and future developments at the Universal Protein Resource Nucleic Acids Res. 39: D214-D219 (2011).

Classification and Sequence Data Data regarding protein kinase sequence and classification is obtained from KinBase.

The Protein Kinase Complement of the Human Genome, G Manning, DB Whyte, R Martinez, T Hunter, S Sudarsanam (2002). Science 298:1912-1934.

Pseudokinase data and their classification Pseudokinase sequences were identified from the Uniprot reference proteomes database and classified into pseudokinase families in the Kannan Lab at the University of Georgia. Pseudokinase protein names were either obtained from UniProt or assigned by the pseudokinase classification if the protein name is not provided by UniProt.

Kinase Sub-domain Data To capture the sub-domain information in ProKinO, we have used a motif model [1, 2] with key motifs corresponding to each of the sub-domains in the kinase domain [3].
  1. Neuwald AF, Liu JS, Lawrence CE. Gibbs motif sampling: detection of bacterial outer membrane protein repeats. Protein Sci. 1995 Aug;4(8):1618-32. PubMed PMID:8520488; PubMed Central PMCID: PMC2143180.
  2. Kannan N, Neuwald AF. Did protein kinase regulatory mechanisms evolve through elaboration of a simple structural component? J Mol Biol. 2005 Sep 2;351(5):956-72. PubMed PMID: 16051269.
  3. Hanks SK, Hunter T. Protein kinases 6. The eukaryotic protein kinase superfamily: kinase (catalytic) domain structure and classification. FASEB J. 1995 May;9(8):576-96. Review. PubMed PMID:7768349.