999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Mining potential social relationship with active learning in LBSN①

2017-06-27 08:09:23WangHaiping王海平ZhangHongWangYongBingJia
High Technology Letters 2017年2期

Wang Haiping (王海平), Zhang Hong, Wang Yong, Bing Jia

(*Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, P.R.China) (**National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, P.R. China) (***Henan Worker’s Cultural Palace, Zhengzhou 450007, P.R.China)

Mining potential social relationship with active learning in LBSN①

Wang Haiping (王海平)*, Zhang Hong②**, Wang Yong②**, Bing Jia***

(*Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, P.R.China) (**National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing 100029, P.R. China) (***Henan Worker’s Cultural Palace, Zhengzhou 450007, P.R.China)

Rapid development of local-based social network (LBSN) makes it more convenient for researchers to carry out studies related to social network. Mining potential social relationship in LBSN is the most important one. Traditionally, researchers use topological relation of social network or telecommunication network to mine potential social relationship. But the effect is unsatisfactory as the network can not provide complete information of topological relation. In this work, a new model called PSRMAL is proposed for mining potential social relationships with LBSN. With the model, better performance is obtained and guaranteed, and experiments verify the effectiveness.

data preprocessing, feature fusion, active learning

0 Introduction

Local-based social network (LBSN) is a new kind of social network where people could mark their positions information, and it developed rapidly in recent years. But LBSN differs from traditional social network as people could mark their positions information in the network. With the extensive use of smart phones, a large number of local-based social networks like Foursquares and Gowalla have emerged and have been drawing people’s attention. Besides, traditional social networks like Facebook and Twitter also add the position information in their products to improve their popularity. In this way, people could publish their statuses in the form of text or picture marked with geographical information in LBSN.

Nowadays, millions of check-ins appear in LBSN every day, which provide sufficient information for the study of social network, including social relationship mining, recommendation of goods and services, community detection, etc. As mining of social relationship is the basis of many studies, it has been drawing wide attention of researchers. Traditionally, researchers use topological relation of social network or telecommunication network to mine potential social relationship[1]. As LBSN could be viewed as the combination of traditional social relationship and marks with position information, potential social relationships could also be mined by traditional methods, however, in which obvious disadvantage exists. As people do not always use a certain local-based social network to communicate with their friends, the relational network extracted from the LBSN could not fully cover their relationships. In other word, features extracted from the existing relational network could not describe the attribute completely. Therefore, researchers have studied mining the potential social relationships by using geographical position information.

Ref.[2] discovered the relation between relationships and geographical position information, and verified the effectiveness to infer potential relationships with geographical information. Ref.[3] defined computation methods to extract features from LBSN and mined potential relationships later. Ref.[4] also studied the problem of inferring links with geographical information to be proved the effective Ref.[5] proposed an entropy-based model (EBM) to infer social connections,and further estimate the strength of social connections with spatial information. However, the studies mentioned above mainly focused on designing features, and less considered about preprocessing data and improving the prediction model. In this paper, a new model called PSRMAL is brought out for mining potential social relationships to predict people’s potential social relational network, combined with geographical information extracted from LBSN.

The rest of the paper is organized as follows: In Section 1, a method for designing PSRMAL is explained. The experiment is described in Section 2. Finally, conclusion is given in Section 3.

1 Design of PSRMAL

In this section, a method for devising model PSRMAL will be introduced in detail. The model can be viewed as two parts. The first is to extract features from LBSN, and the second is to train the model and further improve its performance. Fig.1 shows the structure for mining potential social relationships from LBSN, and the process could be divided into four steps, i.e. region partition, feature computation, feature fusion and active learning. The detailed description is as follows.

Fig.1 The structure for mining potential social relationships

1.1 Region partition

As each check-in corresponds to a GPS record, regions could be partitioned by clustering people’s GPS records of check-in. Generally, there are four methods for clustering, i.e. partitioning methods, hierarchical methods, density-based methods and grid-based methods. Partitioning, hierarchical and grid-based methods are designed to find spherical-shaped clusters, while positions where people check in are not always in regular shapes. Therefore, density-based methods are used to obtain segment regions in this work.Although density-based cluster is suitable for partitioning regions, it is still insufficient to ensure the rationality. In this paper, partition regions are brought out with three density-based cluster methods, i.e. DBSCAN (density-based spatial clustering of applications with noise), OPTICS (ordering points to identify the clustering structure), and DENCLUE (clustering based on density distribution functions). After region partition, each check-in record corresponds to three region marks to guarantee the rationality of partition.

1.2 Feature computation

Generally, people appearing at common positions are likely to be friends[6]. The more frequently they do, the more likely to be. Therefore, three methods that have usually been used for similarity computation in social network are applied, i.e common neighbors, Jaccard index and Cosine index[7]. Here positions are used that people check in to replace the nodes in network. Then features could be computed with three methods and features ComP, JacP and CosP could be obtained respectively. Here ComP denotes the common positions that two persons have checked in, JacP denotes the value that computed with Jaccard index for two persons, and CosP denotes values computed with cosine index. The computational formulas are shown as follows.

ComPij=φi∩φj

(1)

JacPij=φi∩φj/φi∪φj

(2)

(3)

1.3Featurefusionwithlogistic

Withthefeaturedefinitionmethodsmentionedinsubsection1.2,threedifferentkindsoffeaturesetsXN, XS, XE∈n×dcould be extracted, and XN, XS, XEdenote the feature sets extracted from LBSN with DBSCAN, OPTICS and DENCLUE respectively. Then letxNijdenotes theipair of persons’ position features extracted withjfeature computation method, while positions are obtained by DBSCAN. Similarly, letxSijandxEijdenote features when positions are obtained by OPTICS and DENCLUE correspondingly.

Then the fusion feature could be computed as

xij=αjxNij+βjxSij+γjxEij

(4)

whereαj,βjandγjrepresenttheweightsofdifferentfeatureswithcomputationj,and3nvalues will be obtained. To calculate the weighted values, the feature sets are united as

(5)

where R=[1,1…1]T∈1×d, and each column of XUis

xUi=(xNi1,xNi2…,xNin,xSi1,xSi2,…,xSin,xEi1,xEi2, …,xEin, 1)T∈(3n+1)×1

Correspondingly, W can be expressed as W=[αT,βT,γT,c]T∈(3n+1)×1,whereα=[α1,α2…αn]T∈n×1, β=[β1,β2,βn]T∈n×1andγ=[γ1,γ2…γn]T∈n×1are the weight sets of different features, andcisaconstantargument.

Forconvenience,formW=(w1,w2…w3n,c)T∈(3n+1)×1isusedtodenotetheweightset,andtheweightsetsofdifferentfeaturesareα=[w1,w2…wn]T∈n×1, β=[wn+1,wn+2…w2n]T∈n×1andγ=[w1,w2…wn]T∈n×1.

Inthiswork,logisticregressionisappliedtocalculatetheparameters.Theprobabilitythattwopersonshaverelationshipornotcanbeexpressedas

(6)

(7)

LethW(xU)=g(WTxU), and combine the two equations above to obtainp(y|xU, W)=hW(xU)y(1-hW(xU)1-y)

The likelihood function could be expressed as

(8)

And the log-likelihood is

(9)

Whenl(W) reaches the maximum, W is the weight set. In this study, gradient decent is used to solve it. The elements of W can be got as

(10)

Thenwjcouldbegotwhenthepartialderivateconverges,andtheresultis

(11)

1.4Activelearning

Inthiswork,activelearningisusedtoimprovetheperformanceforminingofpotentialsocialrelationship.Activelearningisproposedrelativetopassivelearning[8-13].Passivelearningreferstoselectingsamplesrandomlyfromthedatasetandlabelthem,thentrainsamodelwiththelabeledsamplesandclassifyunlabeledoneswiththemodel.However,theremayexistproblemssuchasinformationredundancy,excessivenoise,asthesamplesfortrainingmodelsarefetchedrandomly,whichwouldseriouslyaffecttheeffectivenessofclassification[14-19].Activelearningistodividethesamplelabelingworkintotwosteps.Firstly,itistolabelafewsamplesastheinitialtrainingset,fortrainingabasicclassifier,andlabeltheothersampleswiththeclassifier.Secondly,itistoselectacertainnumberofsamplesthatarehardtoconfirmtheirclassesaccordingtotheresult,andlabeltheseonesmanually.Thenewlabeledsamplestotheinitialtrainingsetareadded,andthefinalclassifierwiththenewtrainingsetistrained[20-25].Asthenewtrainingsetfetchedinthiswaycontainsmorecomprehensiveinformationofthedataset,amorerobustmodelwouldbeobtained.

2 Experiment

Inthissection,theperformanceofPSRMALisevaluatedwithaclassicalgorithm,supportvectormachine(SVM).Besides,socialrelationshipsarealsominedwiththreesinglefeaturesascontrastexperiments.Firstly,thefeaturesshouldbeextractedwiththreedifferentmethods.Secondly,thefeaturesarefusedwithlogisticregression,whiletheparameterW for fusion is shown in Table1.

Table 1 weight set

With the value of weight set W, each member of fusion feature X could be obtained as

Xij=αjxNij+βjxsij

(12)

Lastly,thepotentialsocialrelationshipswillbeminedbyusingXN, XS, XEand X respectively. In Fig.2(a)~(c), it is to express the experiment results of different methods for dividing positions, including DBSCAN, OPTICS, DENCLUE and the fusion. Let N denote the positions divided with DBSCAN,S denote OPTICS, E denote DENCLUE and F denote the fusion. As can be seen in the figures, the performance of fusion feature outperforms single features in almost all cases. The fusion is crucial to guarantee the stability of model and achieve high performance. Besides, active learning also contributes to enhance the performance. As the comprehensive assessment, F-measure is more persuasive. The values obtained by using XN, XS, XEand X are 42.18%, 41.08%, 38.43% and 44.34%, and the F-measure of fusion feature is boosted by 2.16%, 3.26% and 5.92% respectively.

(a) Index of precision

(b) Index of recall

(c) Index of F-measure

3 Conclusion and future work

In this study, a new model PSRMAL is proposed for mining potential social relationships with geographical information in LBSN. The importance of region partition is emphasized and the region is segmented with three different cluster methods, in which the rationality of partition is ensured. Then the features are fused with logistical and the performance of model is further improved with active learning mothed. Experiments prove the effectiveness of PSRMAL. In the future, more energy will be put to the efficiency of model to make it more suitable for real-time processing.

Reference

[ 1] Adamic L A, Adar E. Friends and neighbors on the web.SocialNetworks, 2001, 25(3): 211-230

[ 2] Cho E, Myers S A, Leskovec J. Friendship and mobility user movement in location-based social networks. In: Proceeding of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1082-1090

[ 3] Wang D, Pedreschi D, Song C, et al. Human mobility, social ties, and link prediction. In: Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1100-1108

[ 4] Scellato S, Noulas A, Mascolo C. Exploiting place features in link prediction on location-based social networks. In: Proceedings the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, USA, 2011. 1046-1054

[ 5] Pham H, Shahabi C, Liu Y. EBM: an entropy-based model to infer social strength from spatiotemporal data. In: Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, New York, USA, 2013. 265-276

[ 6] Pham H, Hu L, Shahabi C. Towards integrating real-world spatiotemporal data with social networks. In: Proceedings of the 19th ACM SIGSPATIAL, New York, USA, 2011. 453-457

[ 7] Moyano L G, Thomae O R M, Frias-Martinez E. Uncovering the spatio-temporal structure of social networks using cell phone records. In: Proceedings the 12th International Conference on Data Mining Workshops (ICDMW 2012), Brussels, Belgium, 2012. 242-249

[ 8] Zhang X Y, Wang S, Yun X. Bidirectional active learning: a two-way exploration into unlabeled and labeled dataset.IEEETransactionsonNeuralNetworksandLearningSystems, 2015, 26(12): 3034-3044

[ 9] Zhang X Y, Wang S, Zhu X, et al. Update vs. upgrade: modeling with indeterminate multi-class active learning.Neurocomputing, 2015, 162: 163-170

[10] Zhang X. Interactive patent classification based on multi-classifier fusion and active learning.Neurocomputing, 2014, 127(3): 200-205

[11] Zhang X Y, Cheng J, Xu C, et al. Multi-view multi-label active learning for image classification. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Cancun, Mexico, 2009. 258-261

[12] Zhang X Y, Xu C, Cheng J, et al. Automatic semantic annotation for video blogs. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), Hannover, Germany, 2008. 121-124

[13] Zhang X Y, Cheng J, Lu H, et al. Selective sampling based on dynamic certainty propagation for image retrieval. In: Proceedings of the Advances in Multimedia Modeling (MMM), Kyoto, Japan, 2008. 425-435

[14] Zhang X Y, Cheng J, Lu H, et al. Weighted co-SVM for image retrieval with MVB strategy. In: Proceedings of the IEEE International Conference on Image Processing (ICIP), San Antonio, USA, 2007. 517-520

[15] Wang S, Zhang X Y, Yun X, et al. Joint recovery and representation learning for robust correlation estimation based on partially observed data. In: Proceedings of the IEEE International Conference on Data Mining Workshop, Atlantic City, USA, 2015. 1-7

[16] Zhang X Y. Preference modeling for personalized retrieval based on browsing history analysis.IEEJTransactionsonElectricalandElectronicEngineering, 2013, 8 (S1): 81-87

[17] Zhu X B, Jin X, Zhang X Y, et al. Context-aware local abnormality detection in crowded scene.ScienceChinaInformationSciences(SCIS), 2015, 58(5): 1-11

[18] Zhu G, Wang J, Wu Y, et al. MC-HOG correlation tracking with saliency proposal. In: Proceedings of the AAAI Conference on Artificial Intelligence, Phoenix, USA, 2016.1-7

[19] Zhang Y, Xu C, Zhang X, et al. Personalized retrieval of sports video based on multi-modal analysis and user preference acquisition.MultimediaToolsandApplications, 2009, 44(2): 305-330

[20] Zhang X Y, Hou Z, Zhu X, et al. Robust malware detection with dual-lane AdaBoost. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 1051-1052

[21] Zhang X Y, Zhang K, Yun X, et al. Location-based correlation estimation in social network via collaborative learning. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 1073-1074

[22] Zhang X Y, Wang S, Zhang L, et al. Ensemble feature selection with discriminative and representative properties for malware detection. In: Proceedings of the IEEE International Conference on Computer Communications, San Francisco, USA, 2016. 674-675

[23] Zhang Y, Zhang X, Xu C, et al. Personalized retrieval of sports video. In: Proceedings of the ACM Multimedia Workshop, Augsburg, Germany, 2007. 313-322

[24] Zhang X Y. Effective search with saliency-based matching and cluster-based browsing.HighTechnologyLetters, 2013, 19(1): 105-109

[25] Zhang X Y. Dynamic batch selective sampling based on version space analysis.HighTechnologyLetters, 2012, 18(2): 208-213

Wang Haiping, born in 1987. He is in pursuit of Ph.D degree, and is currently an engineer in Institute of Information Engineering, Chinese Academy of Sciences. He received his Master degree from College of Information of Renmin University of China in 2012. He also received his B.S. degree from Beijing Technology and Business University in 2009. His research interests include the design of algorithms for parallel processing, big data analysis and text mining.

10.3772/j.issn.1006-6748.2017.02.012

①Supported by the National Natural Science Foundation of China (No. 61501457).

②To whom correspondence should be addressed. E-mail: zhangh@isc.org.cn, wangyong@cert.org.cn

on Apr. 20, 2016

主站蜘蛛池模板: 久久永久精品免费视频| 精品国产成人高清在线| 日本欧美成人免费| 亚洲女同欧美在线| 华人在线亚洲欧美精品| 欧美区一区| 伊人久久精品无码麻豆精品| 中文字幕2区| 国产美女免费网站| 亚洲天堂网站在线| 国产午夜一级淫片| 97精品伊人久久大香线蕉| 免费av一区二区三区在线| 国产亚洲欧美在线专区| 在线观看无码av五月花| 久久综合丝袜长腿丝袜| 亚洲乱码在线播放| 欧美激情第一欧美在线| 日韩欧美综合在线制服| 久久不卡国产精品无码| 欧美国产成人在线| 国产精品亚洲一区二区三区在线观看| 亚洲视频免费播放| 欧美日韩高清在线| 国产91精品调教在线播放| 亚洲 欧美 中文 AⅤ在线视频| 国语少妇高潮| 国产福利免费视频| 精品久久久久久久久久久| 囯产av无码片毛片一级| 亚洲欧美日韩视频一区| 亚洲国产精品一区二区第一页免 | 国产精品嫩草影院视频| 亚洲欧洲日韩综合色天使| 亚洲欧美在线看片AI| 91福利免费视频| 中文字幕一区二区人妻电影| 黄色网页在线播放| 久久这里只有精品23| 午夜福利亚洲精品| 日韩性网站| 在线看片中文字幕| 真人免费一级毛片一区二区| 国产真实自在自线免费精品| 日日拍夜夜操| 自慰高潮喷白浆在线观看| 久久动漫精品| 国产欧美日韩免费| 99这里只有精品在线| 日本爱爱精品一区二区| 操操操综合网| 91麻豆国产视频| 久久久久久久久亚洲精品| av色爱 天堂网| 思思99思思久久最新精品| 精品一区二区三区四区五区| 99久视频| 2021国产乱人伦在线播放| 国产视频久久久久| 国产精品自拍合集| 就去色综合| 亚洲国产中文欧美在线人成大黄瓜| 精品视频91| 久久久成年黄色视频| 日韩不卡高清视频| 大陆精大陆国产国语精品1024| 午夜视频在线观看区二区| 亚洲国产欧美目韩成人综合| 国产乱人伦偷精品视频AAA| 毛片基地视频| 手机精品福利在线观看| 国产aaaaa一级毛片| 国产高清又黄又嫩的免费视频网站| 国产aⅴ无码专区亚洲av综合网| 青青国产在线| 538国产视频| 国产亚洲现在一区二区中文| 国产国语一级毛片| 成人毛片免费观看| 成人国产精品网站在线看| 亚洲一级毛片在线观| 人人91人人澡人人妻人人爽|