
[1] Weng Jianhong, Zhou Tong, Sun Xiao, Lu Zuhong, et al. Support vector machine for prediction of meiotic recombination hotspots and coldspots in Saccharomyces cerevisiae [J]. Journal of Southeast University (English Edition), 2006, 22 (1): 112-116. [doi:10.3969/j.issn.1003-7985.2006.01.024]

Support vector machine for prediction of meiotic recombination hotspots and coldspots in Saccharomyces cerevisiae

Journal of Southeast University (English Edition) [ISSN: 1003-7985 / CN: 32-1325/N]

Volume:
22
Issue:
1 (2006)
Page:
112-116
Research Field:
Biological Science and Medical Engineering
Publishing date:
2006-03-20

Info

Title:
Support vector machine for prediction of meiotic recombination hotspots and coldspots in Saccharomyces cerevisiae
Author(s):
Weng Jianhong, Zhou Tong, Sun Xiao, Lu Zuhong
State Key Laboratory of Bioelectronics, Southeast University, Nanjing 210096, China
翁建洪, 周童, 孙啸, 陆祖宏
Keywords:
meiotic recombination; hotspot; coldspot; dinucleotide abundance; support vector machine
PACS:
Q617
DOI:
10.3969/j.issn.1003-7985.2006.01.024
Abstract:
A novel method for predicting recombination hotspots and coldspots is developed using a support vector machine (SVM), a classifier grounded in statistical learning theory. The method is applied to 303 hot and 48 cold published open reading frames (ORFs) in Saccharomyces cerevisiae. Two kinds of sequence features are extracted: general dinucleotide abundance and dinucleotide abundance based on codon usage. The data sets are then classified with different kernel functions and parameters, evaluated by two-fold cross validation. The results indicate that an accuracy of 87.47% is reached when hot and cold ORF sequences are classified with a radial basis function kernel combined with dinucleotide abundance based on codon usage.
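The abstract names the feature type and the validation scheme but does not give formulas, so the following is only a minimal sketch under stated assumptions. It assumes that "general dinucleotide abundance" means the common odds ratio rho_xy = f_xy / (f_x * f_y) (observed dinucleotide frequency over the product of the two base frequencies); the paper's exact definition, and its codon-usage-based variant, may differ. The names `dinucleotide_abundance` and `two_fold_split` are hypothetical helpers introduced here for illustration.

```python
import random
from collections import Counter
from itertools import product

BASES = "ACGT"
# The 16 dinucleotides in a fixed order: AA, AC, AG, AT, CA, ...
DINUCLEOTIDES = ["".join(pair) for pair in product(BASES, repeat=2)]


def dinucleotide_abundance(seq):
    """Return 16 odds ratios rho_xy = f_xy / (f_x * f_y) for one sequence.

    Assumed definition (not confirmed by the abstract). Dinucleotides whose
    expected frequency is zero are reported as 0.0.
    """
    seq = seq.upper()
    # Count single bases, ignoring ambiguous symbols such as N.
    mono = Counter(base for base in seq if base in BASES)
    total_mono = sum(mono.values())
    # Count overlapping dinucleotides made of unambiguous bases only.
    pairs = (seq[i:i + 2] for i in range(len(seq) - 1))
    di = Counter(p for p in pairs if p[0] in BASES and p[1] in BASES)
    total_di = sum(di.values())

    features = []
    for d in DINUCLEOTIDES:
        f_x = mono[d[0]] / total_mono if total_mono else 0.0
        f_y = mono[d[1]] / total_mono if total_mono else 0.0
        f_xy = di[d] / total_di if total_di else 0.0
        expected = f_x * f_y
        features.append(f_xy / expected if expected else 0.0)
    return features


def two_fold_split(n_items, seed=0):
    """Shuffle item indices and split them into two halves for two-fold CV."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    half = n_items // 2
    return indices[:half], indices[half:]


if __name__ == "__main__":
    vector = dinucleotide_abundance("ATGGCGTACGCTTAG")  # toy sequence, not real data
    print(dict(zip(DINUCLEOTIDES, vector)))
    fold_a, fold_b = two_fold_split(303 + 48)  # 351 ORFs, as in the abstract
    print(len(fold_a), len(fold_b))
```

In a two-fold scheme, a classifier would be trained on one half and tested on the other, then the roles swapped. The feature vectors could be passed to any SVM implementation with a radial basis function kernel (for example `sklearn.svm.SVC(kernel="rbf")`, if scikit-learn is available); which software and parameter values the authors used is not stated in the abstract.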

Memo

Memo:
Biographies: Weng Jianhong (1981-), male, graduate; Lu Zuhong (corresponding author), male, doctor, professor, zhlu@seu.edu.cn.
Last Update: 2006-03-20