Fuzzy Matching of Semantic Class in Chinese Spoken Language Understanding

Yanling LI, Qingwei ZHAO, Yonghong YAN

  • Full Text Views

    0

  • Cite this

Summary :

Semantic concept in an utterance is obtained by a fuzzy matching methods to solve problems such as words' variation induced by automatic speech recognition (ASR), or missing field of key information by users in the process of spoken language understanding (SLU). A two-stage method is proposed: first, we adopt conditional random field (CRF) for building probabilistic models to segment and label entity names from an input sentence. Second, fuzzy matching based on similarity function is conducted between the named entities labeled by a CRF model and the reference characters of a dictionary. The experiments compare the performances in terms of accuracy and processing speed. Dice similarity and cosine similarity based on TF score can achieve better accuracy performance among four similarity measures, which equal to and greater than 93% in F1-measure. Especially the latter one improved by 8.8% and 9% respectively compared to q-gram and improved edit-distance, which are two conventional methods for string fuzzy matching.

Publication
IEICE TRANSACTIONS on Information Vol.E96-D No.8 pp.1845-1852
Publication Date
2013/08/01
Publicized
Online ISSN
1745-1361
DOI
10.1587/transinf.E96.D.1845
Type of Manuscript
PAPER
Category
Natural Language Processing

Authors

Yanling LI
  Chinese Academy of Sciences,Inner Mongolia Normal University
Qingwei ZHAO
  Chinese Academy of Sciences
Yonghong YAN
  Chinese Academy of Sciences

Keyword

FlyerIEICE has prepared a flyer regarding multilingual services. Please use the one in your native language.