IEICE globals.ieice.org Site

Author Search Result

[Author] Sang-Bum KIM (3 hits)

Simple Weighting Techniques for Query Expansion in Biomedical Document Retrieval
Young-In SONG Kyoung-Soo HAN So-Young PARK Sang-Bum KIM Hae-Chang RIM

LETTER-Contents Technology and Web Information Systems

Vol: E90-D No: 11
Page(s): 1873-1876
In this paper, we propose two weighting techniques to improve the performance of query expansion in biomedical document retrieval, especially when a short biomedical term in a query is expanded with its synonymous multi-word terms. When a query contains synonymous terms of different lengths, a traditional IR model ranks a document containing a longer term highly, because a longer term has more chances to match the query. However, such a preference is clearly inappropriate and often yields unsatisfactory results. To alleviate this biased weighting problem, we devise a method of normalizing the weights of query terms in a long multi-word biomedical term, and a method of discriminating terms by using inverse terminology frequency, a novel statistic estimated in the query domain. Experimental results on the MEDLINE corpus show that our two simple techniques improve retrieval performance by adjusting the inadequate preference for long multi-word terms in an expanded query.

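
The two ideas in the abstract above can be illustrated with a minimal Python sketch. It is not the authors' formulation: the abstract gives no formulas, so the even split of weight across the words of a term and the IDF-style formula log(1 + N / tf) are assumptions made here for illustration, and the function names are hypothetical.

```python
import math
from collections import Counter


def normalized_term_weights(synonyms):
    """Give every synonym a total weight of 1, split evenly over its words.

    Assumption: an even split. The abstract only says that weights of
    query terms in a long multi-word term are normalized.
    """
    weighted = []
    for term in synonyms:
        words = term.lower().split()
        weighted.append([(w, 1.0 / len(words)) for w in words])
    return weighted


def inverse_terminology_frequency(terminologies):
    """IDF-like score computed over a list of domain terms, not documents.

    Assumption: itf(w) = log(1 + N / tf(w)), where N is the number of terms
    and tf(w) is the number of terms that contain the word w.
    """
    n = len(terminologies)
    tf = Counter()
    for term in terminologies:
        tf.update(set(term.lower().split()))
    return {w: math.log(1 + n / c) for w, c in tf.items()}


weights = normalized_term_weights(["MI", "myocardial infarction"])
itf = inverse_terminology_frequency(
    ["myocardial infarction", "cerebral infarction", "myocardial ischemia"]
)
```

With this split, the short term "MI" and its two-word expansion each contribute the same total weight to the query, and a word that occurs in many domain terms (here "infarction") receives a lower score than a rarer one (here "cerebral").
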
A Definitional Question Answering System Based on Phrase Extraction Using Syntactic Patterns
Kyoung-Soo HAN Young-In SONG Sang-Bum KIM Hae-Chang RIM

LETTER-Natural Language Processing

Vol: E89-D No: 4
Page(s): 1601-1605
We propose a definitional question answering system that extracts phrases using syntactic patterns, which are easy to construct manually and can mitigate the coverage problem. Experimental results show that our phrase extraction system outperforms a sentence extraction system in terms of recall and precision, especially for selecting concise answers, and indicate that choosing the proper text unit for answer candidates and the final answer has a significant effect on system performance.

Topic Document Model Approach for Naive Bayes Text Classification
Sang-Bum KIM Hae-Chang RIM Jin-Dong KIM

LETTER-Natural Language Processing

Vol: E88-D No: 5
Page(s): 1091-1094
The multinomial naive Bayes model has been widely used for probabilistic text classification. However, parameter estimation for this model sometimes generates inappropriate probabilities. In this paper, we propose a topic document model for multinomial naive Bayes text classification, in which the parameters are estimated from the normalized term frequencies of each training document. Experiments conducted on the Reuters-21578 and 20 Newsgroups collections show that the proposed approach obtains a significant improvement in performance over the traditional multinomial naive Bayes.
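
As a rough illustration of estimating parameters from per-document normalized term frequencies, the following Python sketch accumulates each training document's length-normalized counts instead of raw counts. It is only a sketch under stated assumptions, not the paper's model: the additive smoothing constant `alpha`, the handling of unseen words, and the names `train` and `classify` are all choices made here.

```python
import math
from collections import Counter, defaultdict


def train(docs, labels, alpha=1.0):
    """Estimate class-conditional word probabilities from normalized counts.

    Each document contributes count / len(doc) per word, so a long document
    has no more influence than a short one. `alpha` is an assumed additive
    smoothing constant; the abstract does not specify a smoothing scheme.
    """
    vocab = sorted({w for doc in docs for w in doc})
    mass = defaultdict(Counter)
    n_docs = Counter(labels)
    for doc, label in zip(docs, labels):
        for word, count in Counter(doc).items():
            mass[label][word] += count / len(doc)
    model = {}
    for label in n_docs:
        denom = sum(mass[label].values()) + alpha * len(vocab)
        log_prob = {w: math.log((mass[label][w] + alpha) / denom) for w in vocab}
        model[label] = (math.log(n_docs[label] / len(docs)), log_prob)
    return model


def classify(model, doc):
    """Return the label with the highest log posterior; unseen words are skipped."""
    def score(label):
        prior, log_prob = model[label]
        return prior + sum(log_prob[w] for w in doc if w in log_prob)
    return max(model, key=score)


docs = [
    ["grain", "wheat", "wheat"],
    ["oil", "crude"],
    ["wheat", "corn"],
    ["oil", "barrel", "oil"],
]
labels = ["grain", "crude", "grain", "crude"]
model = train(docs, labels)
```

In a traditional multinomial estimate the raw counts would be summed directly, so the only change in this sketch is the division by document length before accumulation.
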
