|Table of Contents|

[1] Yuan Xiaojie, Yu Shitao, Shi Jianxing, Chen Qiushuang, et al. Question classification in question answeringbased on real-world web data sets [J]. Journal of Southeast University (English Edition), 2008, 24 (3): 272-275. [doi:10.3969/j.issn.1003-7985.2008.03.005]
Copy

Question classification in question answeringbased on real-world web data sets()
真实网络数据集自动问答系统中的问题分类
Share:

Journal of Southeast University (English Edition)[ISSN:1003-7985/CN:32-1325/N]

Volumn:
24
Issue:
2008 3
Page:
272-275
Research Field:
Computer Science and Engineering
Publishing date:
2008-09-30

Info

Title:
Question classification in question answeringbased on real-world web data sets
真实网络数据集自动问答系统中的问题分类
Author(s):
Yuan Xiaojie, Yu Shitao, Shi Jianxing, Chen Qiushuang
College of Information Technical Science, Nankai University, Tianjin 300071, China
袁晓洁, 于士涛, 师建兴, 陈秋双
南开大学信息技术科学学院, 天津 300071
Keywords:
question classification question answering real-world web data sets question and answer web forums re-ranking model
问题分类 自动问答系统 真实网络数据集 问答网络论坛 重新排序模型
PACS:
TP391
DOI:
10.3969/j.issn.1003-7985.2008.03.005
Abstract:
To improve question answering(QA)performance based on real-world web data sets, a new set of question classes and a general answer re-ranking model are defined.With pre-defined dictionary and grammatical analysis, the question classifier draws both semantic and grammatical information into information retrieval and machine learning methods in the form of various training features, including the question word, the main verb of the question, the dependency structure, the position of the main auxiliary verb, the main noun of the question, the top hypernym of the main noun, etc.Then the QA query results are re-ranked by question class information.Experiments show that the questions in real-world web data sets can be accurately classified by the classifier, and the QA results after re-ranking can be obviously improved.It is proved that with both semantic and grammatical information, applications such as QA, built upon real-world web data sets, can be improved, thus showing better performance.
为了改善真实网络数据集上自动问答系统的性能, 定义出新的问题类别集合和通用的答案重新排序模型.问题分类器借助先验词典和语法分析, 将语义和语法信息引入信息检索和机器学习方法, 呈现为多种多样的训练属性, 包括疑问词、中心动词、疑问词与中心动词依赖关系、中心助动词位置、中心名词、中心名词顶级上位词等.进而通过问题类别信息, 对问答查询结果重新排序.实验表明:分类器能够精确实现真实网络数据集的问题分类, 重新排序后的自动问答结果也能得到明显改善.这说明借助语义和语法信息, 真实网络数据集上的自动问答系统等应用可以得到改善, 显示出更好的性能.

References:

[1] Sun R, Jiang J, Tan Y F, et al.Using syntactic and semantic relation analysis in question answering [C]//Proc of the 14th Text REtrieval Conference.Gaithersburg, Maryland, 2005.
[2] Harabagiu S, Moldovan D, Clark C, et al.Employing two question answering systems in TREC 2005 [C]//Proc of the 14th Text REtrieval Conference.Gaithersburg, Maryland, 2005.
[3] Wen Xu, Zhang Yu, Liu Ting, et al.Syntactic structure parsing based Chinese question classification [J].Journal of Chinese Information Processing, 2006, 20(2):33-39.(in Chinese)
[4] Lin Xudong, Peng Hong, Lin Piyuan, et al.Question interpretation and question classification based on dependency relations [J].Computer Science, 2007, 34(7):208-210.(in Chinese)
[5] Lo Ka Kan, Lam Wai.Using semantic relations with world knowledge for question answering [C]//Proc of the 15th Text REtrieval Conference.Gaithersburg, Maryland, 2006.
[6] Soricut R, Brill E.Automatic question answering using the web: beyond the factoid [J].Information Retrieval, 2006, 9(2):191-206.
[7] Bilotti Matthew W, Ogilvie Paul, Callan Jamie, et al.Structured retrieval for question answering [C]//Proc of the 30th ACM SIGIR Conference.Amsterdam, the Netherlands, 2007:351-358.
[8] Jijkoun V, de Rijke M.Retrieving answers from frequently asked questions pages on the web [C]//Proc of the 14th ACM CIKM Conference.Bremen, Germany, 2005:76-83.
[9] Li Xin, Roth Dan.Learning question classifiers [C]//Proc of the 19th International Conference on Computational Linguistics.Taipei, China, 2002:556-562.
[10] Li Xin, Roth Dan, Small Kevin.The role of semantic information in learning question classifiers [C]//Proc of the 1st International Joint Conference on Natural Language Processing.Cambridge University Press, 2006:229-249.

Memo

Memo:
Biography: Yuan Xiaojie(1963—), female, doctor, professor, yuanxj@nankai.edu.cn.
Foundation items: Microsoft Research Asia Internet Services in Academic Research Fund(No.FY07-RES-OPP-116), the Science and Technology Development Program of Tianjin(No.06YFGZGX05900).
Citation: Yuan Xiaojie, Yu Shitao, Shi Jianxing, et al.Question classification in question answering based on real-world web data sets[J].Journal of Southeast University(English Edition), 2008, 24(3):272-275.
Last Update: 2008-09-20