Rough-fuzzy based scene categorization for text detection and recognition in video

Document Type

Article

Publication Date

1-1-2018

Abstract

Scene image or video understanding is a challenging task, especially when the number of video types increases drastically with high variations in background and foreground. This paper proposes a new method for categorizing scene videos into different classes, namely Animation, Outlet, Sports, e-Learning, Medical, Weather, Defense, Economics, Animal Planet and Technology, to improve the performance of text detection and recognition, which is an effective approach for scene image or video understanding. For this purpose, we first present a new combination of rough and fuzzy concepts to study irregular shapes of edge components in input scene videos, which helps to classify edge components into several groups. Next, the proposed method explores the gradient direction information of each pixel in each edge component group to extract stroke-based features by dividing each group into several intra and inter planes. We further extract correlation and covariance features to encode semantic features located inside planes or between planes. Features of intra and inter planes of groups are then concatenated to get a feature matrix. Finally, the feature matrix is verified with temporal frames and fed to a neural network for categorization. Experimental results show that the proposed method outperforms the existing state-of-the-art methods. At the same time, the performance of text detection and recognition methods also improves significantly due to categorization.
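
The feature steps named in the abstract (per-pixel gradient direction, then correlation and covariance inside and between planes, then concatenation) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not define how planes are formed, so the sketch assumes, purely for illustration, that a "plane" is a horizontal band of the gradient direction map. The function names `gradient_direction`, `cov_corr_features` and `plane_feature_vector` are hypothetical, and the rough-fuzzy grouping, temporal verification and neural network stages are omitted.

```python
import numpy as np

def gradient_direction(gray):
    """Per-pixel gradient direction in radians, in [-pi, pi]."""
    # np.gradient returns derivatives along axis 0 (rows) then axis 1 (columns).
    gy, gx = np.gradient(gray.astype(float))
    return np.arctan2(gy, gx)

def cov_corr_features(a, b):
    """Sample covariance and Pearson correlation of two equally sized arrays."""
    a = np.asarray(a, dtype=float).ravel()
    b = np.asarray(b, dtype=float).ravel()
    cov = np.cov(a, b)[0, 1]
    denom = a.std(ddof=1) * b.std(ddof=1)
    # Guard against constant inputs, where correlation is undefined.
    corr = cov / denom if denom > 0 else 0.0
    return np.array([cov, corr])

def plane_feature_vector(direction, n_planes=2):
    """Concatenate within-plane and between-plane statistics.

    ASSUMPTION: a 'plane' is a horizontal band of the direction map.
    The paper's actual plane construction is not given in the abstract.
    """
    h = direction.shape[0] // n_planes
    planes = [direction[i * h:(i + 1) * h] for i in range(n_planes)]
    # Within-plane ("intra") statistics: mean and standard deviation.
    intra = [np.array([p.mean(), p.std()]) for p in planes]
    # Between-plane ("inter") statistics: covariance and correlation per pair.
    inter = [cov_corr_features(planes[i], planes[j])
             for i in range(n_planes) for j in range(i + 1, n_planes)]
    return np.concatenate(intra + inter)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frame = rng.random((8, 8))  # stand-in for a grayscale video frame
    features = plane_feature_vector(gradient_direction(frame))
    print(features.shape)
```

In a fuller pipeline, vectors of this kind computed per edge component group would be stacked into the feature matrix that the abstract describes before classification.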

Keywords

Rough set, Fuzzy set, Video categorization, Scene image classification, Video text detection, Video text recognition

Divisions

fsktm

Funders

National Natural Science Foundation of China under Grant No. 61672273, No. 61272218 and No. 61321491; Science Foundation for Distinguished Young Scholars of Jiangsu under Grant No. BK20160021; University of Malaya HIR under Grant No. M.C/625/1/HIR/210

Publication Title

Pattern Recognition

Volume

80

Publisher

Elsevier
