• Login
    View Item 
    •   Home
    • Computer Science
    • Faculty Research and Publications
    • Articles
    • View Item
    •   Home
    • Computer Science
    • Faculty Research and Publications
    • Articles
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of Effat University RepositoryCommunitiesPublication DateAuthorsTitlesSubjectsPublisherJournalTypeDepartmentThis CollectionPublication DateAuthorsTitlesSubjectsPublisherJournalTypeDepartmentProfilesView

    My Account

    Login

    Statistics

    Display statistics

    Gene Ontology GAN (GOGAN): a novel architecture for protein function prediction

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Author
    Mansoor, Musadaq
    Nauman, Mohammad
    Rehman, Hafeez Ur
    Benso, Alfredo
    Date
    2022-08-01
    
    Metadata
    Show full item record
    Abstract
    One of the most important aspects for a deep interpretation of molecular biology is the precise annotation of protein functions. An overwhelming majority of proteins, across species, do not have sufficient supplementary information available, which causes them to stay uncharacterized. Contrastingly, all known proteins have one key piece of information available: their amino acid sequence. Therefore, for a wider applicability of algorithms, across different species proteins, researchers are motivated to make computational techniques that characterize proteins using their amino acid sequence. However, in case of computational techniques like deep learning algorithms, huge amount of labeled information is required to produce good results. The labeling process of data is time and resource consuming making labeled data scarce. Utilizing the characteristic to address the formerly mentioned issues of uncharacterized proteins and traditional deep learning algorithms, we propose a model called GOGAN, that operates on the amino acid sequence of a protein to predict its functions. Our proposed GOGAN model does not require any handcrafted features, rather it extracts automatically, all the required information from the input sequence. GOGAN model extracts features from the massively large unlabeled protein datasets. The term “Unlabeled data” is used for piece of information that have not been assigned labels to identify their characteristics or properties. The features extracted by GOGAN model can be utilized in other applications like gene variation analysis, gene expression analysis and gene regulation network detection. The proposed model is benchmarked on the Homo sapiens protein dataset extracted from the UniProt database. Experimental results show clear improvements in different evaluation metrics when compared with other methods. Overall, GOGAN achieves an F1 score of 72.1% with Hamming loss of 9.5%, using only the amino acid sequences of protein.
    Department
    Computer Science
    Publisher
    Springer
    Journal title
    Soft Computing
    DOI
    https://doi.org/10.1007/s00500-021-06707-z
    ae974a485f413a2113503eed53cd6c53
    https://doi.org/10.1007/s00500-021-06707-z
    Scopus Count
    Collections
    Articles

    entitlement

     
    DSpace software (copyright © 2002 - 2023)  DuraSpace
    Quick Guide | Contact Us
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.