Machine learning techniques applied to the cleavage site prediction problem

Gloria Ineś Alvarez, Enrique Bravo, Diego Linares, Jheyson Faride Vargas, Jairo Andreś Velasco

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The Genome of the Potyviridae virus family is usually expressed as a polyprotein which can be divided into ten proteins through the action of enzymes or proteases which cut the chain in specific places called cleavage sites. Three different techniques were employed to model each cleavage site: Hidden Markov Models (HMM), grammatical inference OIL algorithm (OIL), and Artificial Neural Networks (ANN). Based on experimentation, the Hidden Markov Model has the best classification performance as well as a high robustness in relation to class imbalance. However, the Order Independent Language (OIL) algorithm is found to exhibit the ability to improve when models are trained using a greater number of samples without regard to their huge imbalance.

Original languageEnglish
Title of host publicationAdvances in Artificial Intelligence and Its Applications - 12th Mexican International Conference on Artificial Intelligence, MICAI 2013, Proceedings
Pages497-507
Number of pages11
EditionPART 1
DOIs
StatePublished - 2013
Event12th Mexican International Conference on Artificial Intelligence, MICAI 2013 - Mexico City, Mexico
Duration: 24 Nov 201330 Nov 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 1
Volume8265 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th Mexican International Conference on Artificial Intelligence, MICAI 2013
Country/TerritoryMexico
CityMexico City
Period24/11/1330/11/13

Fingerprint

Dive into the research topics of 'Machine learning techniques applied to the cleavage site prediction problem'. Together they form a unique fingerprint.

Cite this