|Table of Contents|

[1] Wu Chen, Zhang Quan, Jia Ning, et al. Concept-based approach for information retrieval [J]. Journal of Southeast University (English Edition), 2006, 22 (3): 324-329. [doi:10.3969/j.issn.1003-7985.2006.03.007]
Copy

Concept-based approach for information retrieval()
Share:

Journal of Southeast University (English Edition)[ISSN:1003-7985/CN:32-1325/N]

Volumn:
22
Issue:
2006 3
Page:
324-329
Research Field:
Computer Science and Engineering
Publishing date:
2006-09-30

Info

Title:
Concept-based approach for information retrieval
Author(s):
Wu Chen1 2 Zhang Quan1 Jia Ning1 2
1Graduate School, Chinese Academy of Sciences, Beijing 100039, China
2 Institute of Acoustics, Chinese Academy of Sciences, Beijing 100080, China
Keywords:
information retrieval concept semantic knowledge content representation
PACS:
TP393
DOI:
10.3969/j.issn.1003-7985.2006.03.007
Abstract:
A concept-based approach is expected to resolve the word sense ambiguities in information retrieval and apply the semantic importance of the concepts, instead of the term frequency, to representing the contents of a document.Consequently, a formalized document framework is proposed.The document framework is used to express the meaning of a document with the concepts which are expressed by high semantic importance.The framework consists of two parts:the “domain” information and the “situation & background” information of a document.A document-extracting algorithm and a two-stage smoothing method are also proposed.The quantification of the similarity between the query and the document framework depends on the smoothing method.The experiments on the TREC6 collection demonstrate the feasibility and effectiveness of the proposed approach in information retrieval tasks.The average recall level precision of the model using the proposed approach is about 10% higher than that of traditional ones.

References:

[1] Salton G.Automatic information organization and retrieval [M].New York:McGraw-Hill, 1968.
[2] Crestani F, Pasi G.Soft computing in information retrieval [M].Germany:Physica Verlag and Co, 2000.102-121.
[3] Lalmas M.Logical models in information retrieval:introduction and overview [J].Information Processing and Management, 1998, 34(1):19-33.
[4] Miyamoto S.Fuzzy sets in information retrieval and clustering analysis [M].Kluwer Academic Press, 1990.
[5] Salton G, Buckley C.Term-weighting approaches in automatic text retrieval [J].Information Processing and Management, 1988, 24(5):513-523.
[6] Huang Zengyang.HNC (hierarchical network concept)theory [M].Beijing:Tsinghua University Press, 1998.(in Chinese)
[7] Schank R.Identification of conceptualizations underlying nature language [A].In:Schank R, Colby K, eds.Computer Models of Thought and Language[C].San Francisco, CA:W H Freeman Company, 1973.187-247.
[8] Schank R.Conceptual information processing[M].Amsterdam:North Holland, 1975.
[9] Huang Zengyang.Mathematics and physics symbol system of language in language concept space [M].Beijing:Ocean Press, 2004.(in Chinese)
[10] Miao Chuanjiang.Guide of HNC (hierarchical network concept)theory [M].Beijing:Tsinghua University Press, 2005.(in Chinese)
[11] Wei Xiangfeng.The software platform for expanded sentence category analysis based on the HNC theory [D].Beijing:Chinese Academy of Sciences, 2005.http://www.hncnlp.com/Abs/absEwxf.htm.(in Chinese)
[12] Zhai C, Lafferty J.Two-stage language models for information retrieval [A].In:Proceedings of the 25th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval[C].Tampere, Finland, 2002.49-56.

Memo

Memo:
Biographies: Wu Chen(1979—), male, graduate;Zhang Quan(corresponding author), male, doctor, professor, zhq@mail.ioa.ac.cn.
Last Update: 2006-09-20