999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Rare Bird Sparse Recognition via Part-Based Gist Feature Fusion and Regularized Intraclass Dictionary Learning

2018-07-12 10:54:54JixinLiuNingSunXiaofeiLiGuangHanHaigenYangandQuansenSun
Computers Materials&Continua 2018年6期

Jixin Liu , Ning Sun , Xiaofei Li Guang Han Haigen Yangand Quansen Sun

Abstract:Rare bird has long been considered an important in the field of airport security,biological conservation, environmental monitoring, and so on. With the development and popularization of IOT-based video surveillance, all day and weather unattended bird monitoring becomes possible. However, the current mainstream bird recognition methods are mostly based on deep learning. These will be appropriate for big data applications,but the training sample size for rare bird is usually very short. Therefore, this paper presents a new sparse recognition model via improved part detection and our previous dictionary learning. There are two achievements in our work: (1) after the part localization with selective search, the gist feature of all bird image parts will be fused as data description; (2) the fused gist feature needs to be learned through our proposed intraclass dictionary learning with regularized K-singular value decomposition.According to above two innovations, the rare bird sparse recognition will be implemented by solving one l1-norm optimization. In the experiment with Caltech-UCSD Birds-200-2011 dataset, results show the proposed method can have better recognition performance than other SR methods for rare bird task with small sample size.

Keywords:Rare bird, sparse recognition, part detection, gist feature fusion, regularized intraclass dictionary learning.

1 Introduction

In the research field of bird monitoring and preservation, rare bird is undoubtedly one of the most valuable topic. However, it is also the most difficult to implement regulations.The reason is that, unlike human face or action, bird behavior is complicated and uncontrollable. In other words, the traditional manual observation can not be suitable for bird object, let alone use for rare bird. With the gradual popularization of the IOT(Internet of Things)-based video surveillance, all day and weather unattended bird monitoring becomes possible. Due to the above, new requirements for rare bird intelligent identification have been put forward.

As one application of pattern recognition, bird recognition has always been focused by researchers in the field of airport security, biological conservation, environmental monitoring, and so on. At present, for bird recognition, most achievements look at the aspect of audio data [Evangelista, Priolli, Silla Jr. et al. (2014); Ventura, Oliveira, Ganchev et al. (2015); Boulmaiz, Messadeg, Doghmane et al. (2016); Raghuram, Chavan, Belur et al.(2016); Chakraborty, Mukker, Rajan et al. (2017)]. But the study of bird image recognition might be relatively few [Li, Zhang and Yan (2014); Marini, Turatti, Britto et al. (2015);Karmaker, Schiffner, Strydom et al. (2017)]. In practical application, the audio recognition is not a ideal choice for bird monitoring. Because the real environment is easy to be influenced by noise interference. Therefore, the image data under video surveillance will be more suitable for bird recognition. For this purpose, it become necessary and urgent to carry out research in bird recognition for image or video data.

Image bird recognition is a kind of typical fine-grained recognition. For this kind of problem, CNN (convolutional neural network) [Han, Quan, Zhang et al. (2018)] is the most popular solution. Zhang et al. [Zhang, Donahue, Girshick et al. (2014)] propose a model for fine-grained categorization that overcomes these limitations by leveraging deep convolutional features computed on bottom-up region proposals. Lin et al. [Lin,Roychowdhury and Maji (2015)] present bilinear CNNs, an architecture that efficiently represents an image as a pooled outer product of two CNN features, that is effective at fine-grained recognition tasks. Wei et al. [Wei, Xie and Wu (2016)] propose a novel endto-end Mask-CNN model without the fully connected layers for fine-grained recognition.Although these studies have yielded some results, the limitation of CNN is undeniable.That is due to the fact that deep learning with CNN will be more appropriate for big data applications. But the rare bird recognition task is usually very difficult to have a large enough training samples for CNN modeling. Hence, for rare bird recognition with small size, we need to select new ways to ensure high robustness under natural scene.

According to the above requirements, this paper present a new SR (sparse recognition)method for rare bird recognition. Fig. 1 shows the processing flow of this method. There are two innovation points in our work: Firstly, the local (such head as torso) and global(the whole object) image patches, based on part detection, will be fused as feature description under GIST [Oliva and Torralba (2001)] space. Secondly, by introducing regularized K-singular value decomposition, our previous work [Liu and Sun (2016)] will be improved as a new classifier in the solving performance ofl1optimization. This paper will be organized as follows: Section 2 gives a brief introduction for SR method. In Section 3 the proposed SR for rare bird fine-grained recognition is detailed. Experimental results are analyzed in Sections 4 and Section 5 concludes the paper with a discussion.

Figure 1: System flow of the proposed SR method

2 Sparse recognition and the related works

In the study of SR, there are two mainstream approaches at present. One classical method is SRC (sparse representation-based classification). It is derived from the theory of compressed sensing which is presented by Candes et al. [Candes and Tao (2006)] and Donoho [Donoho (2006)]. In this method, any test samplebcan be sparsely measured through the global recognition matrix from the training sample setAnd the process will be implemented as

From this the recognition task can be accomplished by the following judgment

Unfortunately, the performance of SRC will rely on some preprocessing (such as alignment [Ma, Luong, Philips et al. (2012)] or registration [Mohammadi, Fatemizadeh and Mahoor (2014)]).

Considering the limitation of SRC, another SR idea is presented. That is so-called DSR(dictionary-based sparse recognition) [Patel, Wu, Biswas et al. (2012); Zhang, Sun,Porikli et al. (2017)]. The key of DSR is based on one dictionary learning process as

In the second part, under a fixed sparse representation, the dictionary will be replaced as

So the recognition task in DSR will be changed from Eq. (2) as

Patel et al. [Patel, Wu, Biswas et al. (2012)] indicates that, DSR could be more robust than SRC without any preprocessing. But, it is easy to be local optimum because of a lack of global measurement.

In order to integrate the superiority of SRC and DSR, we propose the concept of intraclass dictionary learning (IDL) [Liu and Sun (2016)]. In this method, the global recognition matrix like SRC framework will be replaced with the IDL (not DSR) result from each class training sample set. Thus the SR under IDL can be improved from Eq. (1)

By the experiment under some data sets, such as LFW [Huang, Ramesh, Berg et al.(2007)], Caltech101 [Li, Fergus and Perona (2007)] and ISR [Quattoni and Torralba(2001)], the proposed IDL shows the preferable recognition performance for image object in natural scene. Hence this paper will try to use it for rare bird fine-grained sparse recognition.

3 The proposed SR method for rare bird recognition

3.1 Challenges in rare bird image data

The major diversity of bird image is in the size, color and texture of bird parts. Take the popular Caltech-UCSD Birds-200-2011 [Wah, Branson, Welinder et al. (2011)](CUB200-2011) as one example. In this database, each class has at least three orientations for bird head. There is no doubt that other parts will be more complicated. So the SRC for human face [Wright, Yang, Ganesh et al. (2009)] will be inadvisable.Because this model usually depends on the preprocessing. Besides, the small sample size for rare bird can easily affect the sparsity precondition in SRC framework. Thus it can beseen, SR via dictionary learning should be taken seriously.

In the selection between DSR and IDL, we think that the latter is better. The reason is that, our previous work [Liu and Sun (2016)] shows that IDL has better robustness for object recognition under natural scene. When SR model can be determined, the new problem is how to realize feature description for dictionary learning.

From the current research achievement for bird recognition [Lin, Roychowdhury and Maji (2015); Wei, Xie and Wu (2016)], it is not hard to see that the part detection is one mainstream critical processing. So this paper, inspired by these studies, needs to select some part localization methods to generate suitable data representation. For this, we have some representative methods [Han, Quan, Zhang et al. (2018)] to leverage. Bourdev and Malik [Bourdev (2009)] propose a two-layer classification/regression model for detecting people and localizing body components; Felzenszwalb et al. [Felzenszwalb, Girshick,Mcallester et al. (2010)] described an object detection system based on mixtures of multiscale deformable part models; Uijlings et al. [Uijlings, Sande, Gevers et al. (2013)]introduce selective search which combines the strength of both an exhaustive search and segmentation; Long et al. [Long, Shelhamer and Darrell (2017)] show that a fully convolutional network trained end-to-end, pixels-to-pixels on semantic segmentation exceeds the state-of-the-art without further machinery.

Considering the lack of training samples for rare bird, Uijlings' selective search [Uijlings,Sande, Gevers et al. (2013)] will be very attractive. For CUB200-2011, this paper use selective search as part localization to extract the head, torso and object for each image sample. Fig. 2 shows the basic process.

Figure 2: Selective search for bird image

3.2 Gist feature fusion based on part detection

When the main parts have been acquired, feature description becomes critical step. From the view of bird recognition [Zhang, Donahue, Girshick et al. (2014); Wei, Xie and Wu(2016)], HOG (histogram of oriented gradients) [Dalal and Triggs (2005)] is one common filter for feature representation. Although it might a good choice for deformable parts model (DPM) [Felzenszwalb, Girshick, Mcallester et al. (2010); Azizpour and Laptev (2012)], our experiments show that HOG can not make it work to its advantage under SR system. In contrast, gist descriptor seems more appropriate for this paper.

About the gist feature, the original goal of Oliva et al. [Oliva and Torralba (2001)] is to build a computational model of the recognition of real world scenes that bypasses thesegmentation and the processing of individual objects or regions. The core of gist is Gabor filter. Assume one image isI(x,y), its 2D Gabor function can be as

On this basis, self-similarity Gabor can be structured as

From this, the gist feature can be extracted as

When the size of image grid unit is 4×4 under four scales and eight orientations, the gist feature dimensionality will be 512 (=4×4×4×8). In this paper, our fusion strategy is to cascading all parts' gist features as one data representation. Theoretically, the gist descriptor belongs to a kind of global feature. But the feature fusion in our work is derived from various local patches. This makes our gist feature having both local and global superiority in image description. The subsequent experiment will prove this point.

3.3 Regularized IDL for rare bird sparse recognition

As the comparison in Section 2, IDL could be an appropriate choice for rare bird recognition.If the gist feature with all parts has been generated, the SR classier can be set as

From Eq. (4) to Eq. (6), we can see the basic process for K-SVD. However, in each iteration, it implies that the update of dictionary and sparse representation would be not at the same time. So it might be likely to produce singular point. For solving this problem,Wei et al. [Wei, Xu and Wang (2012)] try to change the objective function as

Although this improvement could prevent the singular point, it is a pity that its performance will decline dramatically when the size of training sample is not enough.

The latest solution is presented by Dumitrescu et al. [Dumitrescu and Irofti (2017)]. In their so-called regularized K-SVD (RK-SVD), the signal error during sparserepresentation update will be changed as

Then the optimal measurement will be inferred as

Based on this RK-SVD, our IDL could be improved as Tab. 1. And we name it RIDL(regularized intraclass dictionary learning). Finally, the SR result for rare bird will be judged from the following criterion

Table 1: Algorithm of the proposed RIDL

4 Experiment and analysis

In this section, the rare bird training sample set comes from the CUB200-2011. Thisdataset has 200 bird classes with about 60 images in each class. In China, rare birds under the key state protection list are 58 species, and 16 species in it are endangered.Unfortunately, there is no complete correspondence category in CUB200-2011 for these birds. For this reason, we can only use some similar family or genus in CUB200-2011 instead. Our experiment will select 11 classes (such as Parakeet Auklet, Belted Kingfisher, White Pelican, and so on) from CUB200-2011 with 30 image samples in each class randomly. About each sample, three parts (head, torso and the whole object) will be segmented by selective search. And the feature fusion strategy is cascade mode. Fig. 3 shows some samples in CUB200-2011 and some results with part detection.

Figure 3: (a) Some samples and (b) results with part detection.

4.1 Experiment 1

For comparison of gist and other feature descriptors, RGB color histogram and HOG will be studied. Considering the possible way of feature fusion, five compound modes should be set including head, torso, object, head+torso, and head+torso+object. Recognition system will run 300 times with 25 training samples in each class, and recognition rate would be counted as evaluation index. Fig. 4 is the result for this experiment.

Figure 4: Five part compound modes under different feature space

From the Fig. 4, it can be seen that: (1) In the three representative feature descriptors, the gist fusion has better recognition rate than other two method; (2) Through thecomparison in the five fusion patterns, the head+torso+object shows the best application effect. These results means that the proposed gist feature fusion based on part detection could mix the global description from gist feature and the local segmentation from selective search.

4.2 Experiment 2

Because one innovation in this paper is to use the RK-SVD to improve our previous IDL as a new SR classier. For comparing the application effect of the proposed RIDL, three typical SR methods (SRC, DSR and IDL) will be tested. And another aim in this experiment is to study how the small sample size problem of rare bird influences the SR modes. So the recognition rate for these four SR modes will be contrasted under five training sample sizes (5, 10, 15, 20 and 25). Fig. 5 shows the result of above experiment.

Figure 5: SR methods with five training sample sizes

Fig. 5 shows two aspects of this experiment: (1) From the view of SR methods, the recognition rate of SRC is far less than other dictionary learning approaches; (2) With the change of training sample size, our IDL and RIDL will be more robust and stable than other classic methods. So it is not hard to see that the proposed RIDL could be more appropriate for rare bird recognition with small sample size.

5 Conclusion

For rare bird recognition, this paper proposes a new SR method based on gist feature fusion and regularized IDL. In our SR system, there are two key steps. One is that three parts (head, torso and object) of each bird image sample will be extracted through selective search before the feature fusion is implemented. Another is the proposed RIDL which can be considered as the improvement of our previous IDL via RK-SVD. The experimental results, under CUB200-2011, show the feasibility of our work for rare bird intelligence monitoring.

Acknowledgement:This work was supported by the China National Natural Science Funds (Grant No. 61401220 and No. 61471206) and the Scientific Research Foundation of Nanjing University of Posts and Telecommunications (Grant No. NY218066).

主站蜘蛛池模板: 国产精彩视频在线观看| A级毛片高清免费视频就| 青青草原国产免费av观看| 欧美视频在线不卡| 国产一区二区三区在线无码| 一级毛片在线播放免费| 亚洲成人黄色在线| 麻豆国产在线不卡一区二区| 波多野结衣一区二区三视频| 日韩毛片在线视频| 国产亚洲男人的天堂在线观看 | 亚洲区第一页| 国产精品分类视频分类一区| 亚洲一区第一页| 国产h视频在线观看视频| 国产精品jizz在线观看软件| 日本免费一区视频| 欧美一级一级做性视频| 韩国v欧美v亚洲v日本v| 毛片a级毛片免费观看免下载| 日韩国产亚洲一区二区在线观看 | 美臀人妻中出中文字幕在线| 成人免费一级片| 精品一区二区三区水蜜桃| 免费jjzz在在线播放国产| 久久美女精品| A级毛片无码久久精品免费| 亚洲最黄视频| 国产 在线视频无码| 国产青青草视频| 91人人妻人人做人人爽男同 | 亚洲精品第一页不卡| 永久成人无码激情视频免费| 国产91丝袜在线播放动漫| 玖玖精品视频在线观看| 爽爽影院十八禁在线观看| 亚洲制服丝袜第一页| 亚洲视频欧美不卡| 久久亚洲国产一区二区| 国产高清在线丝袜精品一区| 国产农村精品一级毛片视频| 国内精品手机在线观看视频| 91福利在线观看视频| 国产女人在线观看| 国产亚洲精| 青草娱乐极品免费视频| 午夜免费小视频| 四虎免费视频网站| 五月婷婷伊人网| 免费一级毛片完整版在线看| 99视频免费观看| 久久久久久久蜜桃| 免费国产小视频在线观看| 亚洲日韩AV无码一区二区三区人| 国产成人精品亚洲77美色| 久久精品国产91久久综合麻豆自制| 伊人91在线| 亚洲区第一页| 国产精品九九视频| 性色一区| 午夜精品久久久久久久无码软件| 亚洲成人精品在线| 国产成人精品免费视频大全五级| 国产极品粉嫩小泬免费看| 成人一级黄色毛片| 欧美日韩国产在线观看一区二区三区 | 国产丝袜91| 国产午夜一级毛片| 欧美人人干| 六月婷婷精品视频在线观看| 国产精品55夜色66夜色| 国内自拍久第一页| 国产毛片久久国产| 色婷婷在线影院| 国产精品太粉嫩高中在线观看| 欧美不卡在线视频| 中文字幕无码制服中字| 亚洲狼网站狼狼鲁亚洲下载| 国产在线观看一区二区三区| 无码乱人伦一区二区亚洲一| 久久久久国产精品嫩草影院| 国产一级小视频|