
CMIB: unsupervised image object categorization in multiple visual contexts

Research output: Contribution to journal (Article)

Object categorization in images is fundamental to various industrial areas, such as automated visual inspection, fast image retrieval and intelligent surveillance. Most existing methods treat visual features (e.g., scale-invariant feature transform, SIFT) as the content information of the objects, while regarding image tags as their contextual information. However, image tags can hardly be acquired in completely unsupervised settings, especially when the image collection is too large to be annotated. In this work, we propose a novel and effective method called contextual multivariate information bottleneck (CMIB) to discover object categories in entirely unlabeled images. Rather than treating image tags as the object’s context, CMIB adopts one feature representation of the images to characterize the object’s content information, while regarding the auxiliary clusterings obtained from multiple other related features as its visual contexts. In the proposed CMIB framework, we borrow the idea of data compression for object category discovery: the aim is to compress the source image collection as much as possible, while maximally preserving the correlative information between the content and the visual contexts. Specifically, two Bayesian networks are built to characterize the relationships between data compression and information preservation. Moreover, a sequential information-theoretic optimization is proposed to ensure the convergence of the CMIB objective function. Extensive experiments on five real-world image data sets show that the proposed method significantly outperforms state-of-the-art baselines.
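
The "information preservation" idea in the abstract can be illustrated with a small sketch. This is not the CMIB objective or the authors' optimization, which the abstract does not specify. It only shows, under assumed definitions, how a candidate categorization could be scored by the empirical mutual information it shares with auxiliary clusterings built from other features. The function names `mutual_information` and `context_preservation` and the toy labels are hypothetical, and the compression term is omitted.

```python
import math
from collections import Counter


def mutual_information(a, b):
    """Empirical mutual information (in nats) between two label sequences."""
    if len(a) != len(b):
        raise ValueError("label sequences must have equal length")
    n = len(a)
    joint = Counter(zip(a, b))  # counts of co-occurring label pairs
    pa = Counter(a)             # marginal counts for the first labeling
    pb = Counter(b)             # marginal counts for the second labeling
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log(p(x, y) / (p(x) * p(y))), written in terms of counts
        mi += (c / n) * math.log(c * n / (pa[x] * pb[y]))
    return mi


def context_preservation(candidate, contexts):
    """Hypothetical score: sum of I(T; Y_k) over auxiliary clusterings Y_k."""
    return sum(mutual_information(candidate, y) for y in contexts)


# Toy data (assumed): six images and two auxiliary clusterings that
# stand in for clusterings obtained from other feature types.
contexts = [[0, 0, 0, 1, 1, 1], [0, 0, 1, 1, 1, 1]]
aligned = [0, 0, 0, 1, 1, 1]    # candidate that agrees with the contexts
scrambled = [0, 1, 0, 1, 0, 1]  # candidate that largely ignores them

print(context_preservation(aligned, contexts))
print(context_preservation(scrambled, contexts))
```

In this toy setting the candidate that agrees with the auxiliary clusterings scores higher than the scrambled one. A full method would also have to trade this score against how strongly the candidate compresses the content representation.
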
Original language: English
Journal: IEEE Transactions on Industrial Informatics
Publication status: Accepted for publication - 11 Aug 2019


  • Unsupervised Image Object Categorization_pp

    Rights statement: The embargo end date of 2050 is a temporary measure until we know the publication date. Once we know the publication date, the full text of this article will be available to view shortly afterwards.

    Accepted author manuscript (Post-print), 1.12 MB, PDF document

    Due to publisher’s copyright restrictions, this document is not freely available to download from this website until: 1/01/50


ID: 15859476