X Informatics Unit 3 Data Deluge 5 Internet


All right, now we come to examples. The first set of examples
comes from the Internet. And so this is data deluge
examples from the Internet. Here is a nice slide I
found at an Oracle talk. And it tells you what
happens every 60 seconds. It says that, what does it say? It says there’s 700,000
Facebook updates. It says there are are 168
million emails are sent. 700,000 search queries, 13,000 music streaming on Pandora,
12,000 ads on Craig’slist, and so on, so
this is pretty impressive. What happens every minute,
and it’s this. it’s this driving internet activity and of course the things like
blogs which are higher value. There are fewer blogs per minute,
just 1,500 posts, and 60 new blogs every minute. So that’s one blog a second, you won’t be able to
read all of that those. Skype already has 370,000
[COUGH] minute voice calls going on every minute, so that’s quite impressive. So this says,
this is actually from the IBM talk, from the Berkley Big Data
conference, it discusses where
the data is coming from. We have on the left the basic
Internet, 12 terabytes of tweet everyday,
25 terabytes of Facebook log data. And it doesn’t actually know
how much data Google has. Other important data sources
are RFID tags which are basically subscribing the inventory and
location of entities and industry. There are 30 billion of
such tags already today. There were only 1.3 billion in 2005. We have 4.6 billion
camera phones available. So that’s responsible for
the explosion in uploaded images. And of course, we have hundreds of
millions of GPS devices that GPS data is driving the data deluge. In the energy area, we have 76
million smart meters in 2009, 200 million smart meters in 2014. So this is driving the field
of energy informatics. Which is how we produce more
efficient, responsive power and do not have brown-outs and
things like that. Using the data on usage
in intelligent fashions. Here’s a nice clock from
a company called JESS, called the Geosocial Universe. And it has in the middle
5.3 billion cell phones. Then around there we have,
floating around, we have various sources of data, of which the
largest is Skype at 663 million and it goes to Gmail and
Hotmail and Facebook and Yahoo! And all sorts of other LinkedIn and
MySpace and so on. So this is the universe of data that’s
driving social interactions.

Leave a Reply

Your email address will not be published. Required fields are marked *