计算机科学与技术

基于视觉语义与RSSVM的图像检索

展开
  • 1.西北大学 信息科学与技术学院, 陕西 西安 710069;2.西北工业大学 电子信息学院, 陕西 西安 710072
李大湘(1974-),男,博士,工程师,主要从事图像检索、图像标注与图像分类研究.

收稿日期: 2009-05-11

  修回日期: 2009-10-21

  网络出版日期: 2010-04-25

基金资助

教育部新世纪优秀人才支持计划资助项目(NCET-07-0693)

Image Retrieval Based on Visual Semantics and RSSVM

Expand
  • 1.School of Information Science and Technology,Northwest University,Xi'an 710069,Shaanxi,China;2.School of Electronics Information,Northwestern Polytechnical University,Xi'an 710072,Shaanxi,China
李大湘(1974-),男,博士,工程师,主要从事图像检索、图像标注与图像分类研究.

Received date: 2009-05-11

  Revised date: 2009-10-21

  Online published: 2010-04-25

Supported by

教育部新世纪优秀人才支持计划资助项目(NCET-07-0693)

摘要

以图像的视觉语义为基础,设计了一种新的空间转换模型,提出了一种新的图像语义描述方法.首先,采用NCut方法对图像进行分割,提取每个区域的颜色、纹理与形状等视觉特征;再用K-Means聚类方法对训练集中所有的视觉特征进行聚类,称聚类中心为视觉语义(Visual Semantic,VS),用来构造投影空间;然后通过所定义的非线性函数,将每幅图像向投影空间作映射,得到图像的投影特征;最后,为了提高分类器的训练效率与性能,先采用RS(粗糙集)方法对投影特征进行属性约简,再用支持向量机(SVM)进行学习和分类.基于Corel图像集的对比实验结果表明,该方法性能受聚类数的影响不大,鲁棒性强,且性能优于其它方法.

本文引用格式

李大湘 彭进业 贺进芳 . 基于视觉语义与RSSVM的图像检索[J]. 华南理工大学学报(自然科学版), 2010 , 38(4) : 156 -161,166 . DOI: 10.3969/j.issn.1000-565X.2010.04.028

Abstract

Based on the visual semantics of images,a novel space-transforming model is designed,and a new semantics description method is proposed.In the investigation,first,the method of Normalized Cut(NCut) is used to segment each image into several regions,and such visual features as color,texture and shape,etc.of each region are extracted.Next,all the visual features in the training set are clustered by using the K-Means method,and each cluster center is regarded as a "visual semantic" to construct a projection space.Then,a nonlinear function is defined to map each image into a point in the projection space.Thus,all the projection features of the image are obtained.Moreover,in order to improve the training efficiency and performance of classifiers,projection features are reduced in attributes via the rough set(RS) method,and are trained and classified by the support vector machine.The results are finally compared with those based on the Corel image set.It is found that the proposed method is robust to cluster number,and is superior to the other methods in terms of retrieval performance.

文章导航

/