TY - JOUR
T1 - Effect of using varying negative examples in transcription factor binding site predictions
AU - Rezwan, Faisal
AU - Sun, Yi
AU - Davey, N.
AU - Adams, Roderick
AU - Rust, A.G.
AU - Robinson, M.
N1 - The original publication is available at www.springerlink.com Copyright Springer
PY - 2011
Y1 - 2011
N2 - Identifying transcription factor binding sites computationally is a hard problem as it produces many false predictions. Combining the predictions from existing predictors can improve the overall predictions by using classification methods like Support Vector Machines (SVM). But conventional negative examples (that is, example of non-binding sites) in this type of problem are highly unreliable. In this study, we have used different types of negative examples. One class of the negative examples has been taken from far away from the promoter regions, where the occurrence of binding sites is very low, and another one has been produced by randomization. Thus we observed the effect of using different negative examples in predicting transcription factor binding sites in mouse. We have also devised a novel cross-validation technique for this type of biological problem.
AB - Identifying transcription factor binding sites computationally is a hard problem as it produces many false predictions. Combining the predictions from existing predictors can improve the overall predictions by using classification methods like Support Vector Machines (SVM). But conventional negative examples (that is, example of non-binding sites) in this type of problem are highly unreliable. In this study, we have used different types of negative examples. One class of the negative examples has been taken from far away from the promoter regions, where the occurrence of binding sites is very low, and another one has been produced by randomization. Thus we observed the effect of using different negative examples in predicting transcription factor binding sites in mouse. We have also devised a novel cross-validation technique for this type of biological problem.
U2 - 10.1007/978-3-642-20389-3_1
DO - 10.1007/978-3-642-20389-3_1
M3 - Article
AN - SCOPUS:79955767455
SN - 0302-9743
VL - 6623
SP - 1
EP - 12
JO - Lecture Notes in Computer Science (LNCS)
JF - Lecture Notes in Computer Science (LNCS)
T2 - 9th European Conference, EvoBIO 2011 Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics
Y2 - 27 April 2011 through 29 April 2011
ER -