
Barge Database

2014-10-27 23:46:49 By Wu Jiang
KNOWLEDGE IS POWER, 2014, Issue 10


September 2014 saw the official listing of Alibaba (a Chinese Internet giant) on the New York Stock Exchange (NYSE: BABA). The initial public offering (IPO) was the largest in history, and it also marked the Internet's evolution into a new era: a big data era that belongs to China's domestic Internet enterprises.

The past and present of big data

Big data, or mass data, refers to data sets so large that they cannot be extracted, managed, and processed into information that humans can interpret within a reasonable amount of time. Under the same conditions, analyzing many small data sets together yields more additional information, and more relationships between the data, than analyzing each small, independent data set on its own. This approach can be applied to forecast commercial trends, judge the quality of research, prevent the spread of disease, fight crime, predict real-time traffic, and more.

Although big data seems far removed from our daily lives, it is in fact closely tied to them. For example, Douban Music (a Chinese social network) can infer which songs a given user likes most by analyzing the behavior of its whole user population, and it can even infer the user's favorite movies. By pooling and analyzing sales data from its retail stores, Adidas can learn exactly which products consumers prefer in different regional cultures, and so plan its inventory more sensibly. A love and marriage website in China is trying to introduce a system that can identify facial resemblance; based on usage data, the company can work out which facial features its users like most and then offer this popular service to them. Taobao (the biggest C2C shopping website in mainland China) can predict which goods each consumer is likely to be interested in and produce individualized recommendations for each user, which is what most people see in the commodity recommendation sidebar. By analyzing classified commodity information with a large database model, Taobao can answer interesting questions that most people would find hard, such as: what is the favorite T-shirt color in the 18-year-old age group, or how do people in South and North China differ in their preference for sports beverages?

Analyzing the behavior of a few users does not produce much value. If the analysis is carried out on a large enough scale, however, valuable predictions can be drawn from the trends it reveals, particularly for business decision-making. Take the well-known NongFu Spring (a Chinese drinking water producer) as an example. In the past, the company wanted market data to help it make decisions: How should bottles be stacked in a display to promote sales? Which age group spends the most time in front of the display? How much do they buy each time? How does purchasing behavior change with temperature? How does a competitor's new packaging affect its own sales? Though they seem simple, these questions are hard to answer convincingly.

To answer these questions, a lot of data needs to be collected. NongFu Spring's salesmen have to visit local supermarkets and take ten pictures at each one: the stacking of the bottles, changes in their location, the height of the stack, and so on. Every day each salesman has to cover 15 sites and upload 150 pictures, producing about 10 MB of data, which is not a large figure. But there are 10,000 salesmen across China, which means about 100 GB of data per day, or 3 TB each month. Though the data itself looks simple, such an analysis could not be carried out without the support of big data technology.
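The arithmetic behind these figures can be checked with a short sketch. The per-site, per-salesman, and headcount numbers come from the paragraph above; the use of decimal units (1 GB = 1,000 MB) and a 30-day month are assumptions made here for illustration.

```python
# Figures from the article; unit conversions and month length are assumptions.
PHOTOS_PER_SITE = 10
SITES_PER_DAY = 15
MB_PER_SALESMAN_PER_DAY = 10   # "about 10 MB" per salesman per day
SALESMEN = 10_000
DAYS_PER_MONTH = 30            # assumption: a 30-day month

# Photos uploaded by one salesman in one day.
photos_per_salesman = PHOTOS_PER_SITE * SITES_PER_DAY

# Company-wide volume, using decimal units (1 GB = 1,000 MB, 1 TB = 1,000 GB).
daily_total_gb = MB_PER_SALESMAN_PER_DAY * SALESMEN / 1_000
monthly_total_tb = daily_total_gb * DAYS_PER_MONTH / 1_000

print(photos_per_salesman, "photos per salesman per day")
print(daily_total_gb, "GB per day")
print(monthly_total_tb, "TB per month")
```

The point of the example is that a small per-person figure becomes a large one only after it is multiplied across the whole sales force and accumulated over time.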

Someone at Google once pointed out: "What really matters is not what we can do, but at what scale we can do it."

If you only need to analyze 100 lines of data a day, a few sheets of paper and a pen will do. If you want to analyze 100,000 lines, then given the processing capacity of modern computers, one computer and a purpose-written program are enough. But once the data reaches 1,000,000,000 lines (1 TB), even a powerful server will not meet your needs, especially when you want real-time or near-real-time processing. As a result, the field of computing and numerical calculation has seen a trend emerge: distributed computing. In distributed computing, a cluster of computers is connected through a network into one system. A job that needs massive calculation is divided into small pieces, each piece is processed by one computer in the network, and the partial results are uploaded and combined into a final result. To make full use of distributed computing, however, several problems have to be solved: How should the data be divided? How can the load be balanced across the computers? How can the partial results be combined into a final result efficiently? Many computing models and concepts have been designed to solve these problems, on both the hardware and the software side. Some of the most representative are cloud computing, MapReduce (Hadoop), and virtualization. And this may only be the beginning of the computing tide. As Jack Ma has said: "We are moving from an era of information science and technology to an era of data science and technology."
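The divide, process, and combine steps described above can be sketched on a single machine. This is a minimal illustration and not how Hadoop itself is implemented: a thread pool stands in for the networked computers, the function names are hypothetical, and the order data is invented for the example.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split(records, n_chunks):
    """Divide the records into n_chunks nearly equal slices (load balancing)."""
    size, extra = divmod(len(records), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        # The first `extra` chunks take one additional record each.
        end = start + size + (1 if i < extra else 0)
        chunks.append(records[start:end])
        start = end
    return chunks


def map_chunk(chunk):
    """Work done by one 'node': count the values in its own slice only."""
    return Counter(chunk)


def reduce_counts(partials):
    """Combine the partial tallies from every node into one final result."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def distributed_count(records, n_workers=4):
    """Split, process in parallel, then combine. Threads stand in for a cluster."""
    chunks = split(records, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(map_chunk, chunks))
    return reduce_counts(partials)


# Hypothetical sample data: the T-shirt color of each order.
orders = ["white", "black", "white", "blue", "black", "white", "grey"]
result = distributed_count(orders, n_workers=3)
print(result.most_common(1))
```

Each of the three questions in the text maps to one function: `split` decides how the data is divided and keeps the slices balanced, `map_chunk` is the independent work per computer, and `reduce_counts` merges the partial results.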

Mass data and the new occupations of the Internet

To do well with mass data, the first and most vital step is to obtain massive amounts of valuable data, and this is an advantage that most native Chinese Internet enterprises have. China has a large population, a dynamic economy, and millions of Internet users, and the richness of user behavior data is directly determined by the richness of these user resources. Taobao has 300 million registered users, and Tencent's registered users have already exceeded 1 billion. All of this user data is a goldmine.

A new generation of technology is bound to create demand for a new generation of technicians. In the era of big data, data scientist and data engineer have become two of the hottest occupations in Silicon Valley. Compared with traditional software engineers, data scientists are researchers who stand between mathematics (statistics) and computer science. Their job covers software design and development as well as data modeling and statistical analysis, and they are able to turn data processing models into workable software solutions. Native Chinese Internet enterprises therefore also attach great importance to building up a reserve of talent in data science, and for the foreseeable future data science practitioners are likely to be in high demand on the job market.
