Blind Date (More Unauthorized Online Experimenting)

Following up from news a couple of weeks ago about Facebook manipulating its users, this week news abounds regarding a dating agency that has been conducting some experiments on its users.

The New York Times reports that the online dating agency OK Cupid has been manipulating the data it gives to its clients to find out how compatibility and looks affect the dating process. The company conducted three different experiments: in one it hid profile pictures; in another it hid profile text to see how that affected personality ratings; and in a third it told some hopeful daters that they were a better or worse potential match with someone than the company’s software actually determined.

So, as we might imagine, they came up with a series of findings that we could loosely interpret as follows:

1. If you are told that the person is more compatible you are more likely to contact them.

2. Users are likely to equate “looks” with “personality,” even in profiles that featured attractive photos and little if any substantive profile information.

3. When the site obscured all profile photos one day, users engaged in more meaningful conversations, exchanged more contact details and responded to first messages more often. They got to know each other. But when pictures were reintroduced on the site, many of those conversations stopped cold.

Well, as far as I can see, number 1 is pretty self-evident. If you send me a note saying that a person is not compatible, then I probably won’t bother them with my personal issues. Number 2 is quite interesting: if I like the looks of someone, I am more likely to think that they are an interesting person, maybe fun, and without doubt the perfect match for me. The third is also quite obvious: if I don’t know what a person looks like, I might imagine their looks and would be more likely to want to get to know them.

The OK Cupid blog will fill you in on the details.

One interesting line from the blog states that “guess what, everybody: if you use the Internet, you’re the subject of hundreds of experiments at any given time, on every site. That’s how websites work”. Wise words, but I wonder if everybody realizes that. And what power they wield!

Now I would like to raise the issue of how someone can design an algorithm to measure my compatibility with another person. What will make us more compatible? Height? Interests? Worldview (and if so how can you put that into numbers)?
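To make the question a little more concrete, here is a toy sketch of what "putting compatibility into numbers" might look like. It is purely my own illustration and not how OK Cupid or any real dating site works: the question names, the 1 to 5 answer scale, the importance weights and the `compatibility` function are all hypothetical.

```python
def compatibility(answers_a, answers_b, weights):
    """Toy score in [0, 1]; 1.0 means identical answers on every weighted question.

    answers_a / answers_b: dict of question -> answer on a 1-5 scale (assumed scale).
    weights: dict of question -> importance (hypothetical, chosen by hand).
    """
    total_weight = sum(weights.values())
    if total_weight == 0:
        raise ValueError("at least one question needs a non-zero weight")
    # Weighted average of how far apart the two answers are.
    # 4 is the largest possible gap on a 1-5 scale.
    penalty = sum(
        weights[q] * abs(answers_a[q] - answers_b[q]) / 4
        for q in weights
    )
    return 1 - penalty / total_weight


# Hypothetical people and questions, for illustration only.
alice = {"likes_hiking": 5, "wants_children": 1, "religion_matters": 2}
bob = {"likes_hiking": 5, "wants_children": 3, "religion_matters": 2}
weights = {"likes_hiking": 1, "wants_children": 3, "religion_matters": 2}

print(compatibility(alice, bob, weights))  # 0.75
```

Even this tiny example shows the problem: someone has to decide which questions count, how much each one matters, and what a "distance" between two worldviews even means.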

There is an interesting book by Hubert Dreyfus called “What Computers Can’t Do”, in which he argues that there are some areas and situations in which computers cannot fully function. A computer program is based on expertise, on experience that can be categorized. If there are subject matters that are impossible to completely formalize, then they cannot be fully captured in computer programs (such as the one used to find my perfect partner, if they exist).

As humans, I think we make decisions based upon generalizations of a situation. Characteristics are judged based upon experience: I once knew someone with those characteristics and they were great, or stubborn, or nasty, etc. Research suggests that we play games such as chess in this way. We do not think through a long series of possible moves in the way a computer does; instead we see a situation, it reminds us of another situation that we have confronted in the past, and we act according to our experience of action in similar situations.

I am sure some readers have experience in this field, and I would be very happy to get some comments and expand my understanding.

The size of the Internet – and the human brain

How many human brains would it take to store the Internet?

Last September I asked: if the human brain were a hard drive, how much data could it hold?

I concluded that approximately 300 exabytes (or 300 million terabytes) of data can be stored in the memory of the average person. Interesting stuff, right?

Now that I know how much computer data the human brain can potentially hold, I want to know how many people’s brains would be needed to store the Internet.

To do this I need to know how big the Internet is. That can’t be too hard to find out, right?

It sounds like a simple question, but it’s almost like asking how big is the Universe!

Eric Schmidt

In 2005, Eric Schmidt, then CEO of Google, famously wrote regarding the size of the Internet:

“A study that was done last year indicated roughly five million terabytes. How much is indexable, searchable today? Current estimate: about 170 terabytes.”

So in 2004, the Internet was estimated to be 5 exabytes (or 5,000,000,000,000,000,000 bytes).

The Journal Science

In early 2011, the journal Science calculated that the amount of data in the world in 2007 was equivalent to around 300 exabytes. That’s a lot of data, and most would have been stored in such a way that it was accessible via the Internet – whether publicly accessible or not.

So in 2007, the average memory capacity of just one person could have stored all the virtual data in the world. Technology has some catching up to do. Mother Nature is walking all over it!

The Impossible Question

In 2013, the size of the Internet is unknown. Without mass global collaboration, I don’t think we will ever know how big it is. The problem is defining what is the Internet and what isn’t. Is a business’s intranet which is accessible from external locations (so an extranet) part of the Internet? Arguably yes, it is.

A map of the known and indexed Internet, developed by Ruslan Enikeev using Alexa rank

I could try to work out how many sites there are, and then multiply this by the average site size. However, what’s the average size of a website? YouTube is petabytes in size, whilst my personal website is just kilobytes. How do you average that out?

See the red circle? That is pointing at Technology Bloggers! Yes we are on the Internet map.

The Internet is now too big to try and quantify, so I can’t determine its size. My best chance is a rough estimate.

How Big Is The Internet?

What is the size of the Internet in 2013? Or to put it another way, how many bytes is the Internet? Well, if in 2004 Google had indexed around 170 terabytes of an estimated 5 million terabyte net, then it had indexed around 0.0034% of the web at that time.

On Google’s how search works feature, the company boasts that its index is well over 100,000,000 gigabytes. That’s 100,000 terabytes or 100 petabytes. I am going to assume, as a very rough guess rather than a measurement, that Google has now indexed around 0.000001% of the web. On that assumption, 0.000001% of the web would be 100 petabytes.

100 petabytes multiplied by 1,000,000 is equal to 100 zettabytes, meaning 1% of the net is equal to around 100 zettabytes. Multiply 100 zettabytes by 100 and you get 10 yottabytes, which is (by my calculations) equivalent to the size of the web.

So the Internet is 10 yottabytes! Or 10,000,000,000,000 (ten thousand billion) terabytes.
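For anyone who wants to check the sums, here is a minimal sketch of the extrapolation above. It uses decimal (powers of 1,000) units, and the 0.000001% share is the rough guess from this post, not a measured figure; the variable names are my own.

```python
# Decimal (SI) byte units, matching the round figures used in this post.
GB = 10**9
TB = 10**12
PB = 10**15
ZB = 10**21
YB = 10**24

index_bytes = 100_000_000 * GB        # Google's stated index size: 100,000,000 GB

# Assumed share of the web that is indexed: 0.000001% = 1 / 100,000,000.
# This is the post's guess, not a measurement.
SHARE_DENOMINATOR = 100_000_000

internet_bytes = index_bytes * SHARE_DENOMINATOR

print(index_bytes // PB, "PB indexed")            # 100 PB indexed
print(internet_bytes // YB, "YB estimated total")  # 10 YB estimated total
print(internet_bytes // TB, "TB estimated total")  # 10000000000000 TB estimated total
```

Change `SHARE_DENOMINATOR` and the estimate moves in direct proportion, which shows how much the final figure depends on that one guess.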

How Many People Would It Take To Memorise The Internet?

If the web is equivalent to 10 yottabytes (or 10,000,000,000,000,000,000,000,000 bytes) and the memory capacity of a person is 0.0003 yottabytes, (0.3 zettabytes) then currently, in 2013, it would take around 33,333 people to store the Internet – in their heads.
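The division can be sketched in a few lines, assuming the two estimates from this post (a 10 yottabyte Internet and 300 exabytes per brain) and decimal units. Strictly speaking, 33,333 brains fall just short, so rounding up gives 33,334.

```python
# Decimal (SI) byte units.
EB = 10**18
YB = 10**24

internet_bytes = 10 * YB   # estimated size of the Internet (from this post)
brain_bytes = 300 * EB     # estimated capacity of one brain (0.3 ZB, from this post)

full_brains = internet_bytes // brain_bytes        # brains filled completely
people_needed = -(-internet_bytes // brain_bytes)  # ceiling division: enough to hold everything

print(full_brains)     # 33333
print(people_needed)   # 33334
```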

A Human Internet

The population of earth is currently 7.09 billion. So if there was a human Internet, whereby all people on earth were connected, how much data could we all hold?

The calculation: 0.0003 yottabytes x 7,090,000,000 = 2,127,000 yottabytes.

A yottabyte is currently the biggest officially recognised unit of data; however, the next step (which isn’t currently recognised) is a brontobyte. So if mankind were to max out its memory, we could store 2,127 brontobytes of data.

By my estimate, the Internet would take up a tiny 0.00047% of humanity’s memory capacity.
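Here is a short sketch of the "human Internet" sums, again under this post’s assumptions: 300 exabytes per brain, a population of 7.09 billion, a 10 yottabyte Internet, and the unofficial brontobyte taken to be 1,000 yottabytes.

```python
# Decimal (SI) byte units; BRONTOBYTE is an unofficial unit, assumed to be 1,000 YB.
EB = 10**18
YB = 10**24
BRONTOBYTE = 1000 * YB

population = 7_090_000_000   # world population figure used in this post
brain_bytes = 300 * EB       # estimated capacity of one brain
internet_bytes = 10 * YB     # estimated size of the Internet

humanity_bytes = population * brain_bytes
share_percent = 100 * internet_bytes / humanity_bytes

print(humanity_bytes // YB, "yottabytes")           # 2127000 yottabytes
print(humanity_bytes // BRONTOBYTE, "brontobytes")  # 2127 brontobytes
print(f"{share_percent:.5f}% of humanity's capacity")  # 0.00047% of humanity's capacity
```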

The conclusion of my post on how much data the human brain can hold was that we won’t ever be able to technically match the amazing feats that nature has achieved. Have I changed my mind? Not really, no.

File Sharing: Is Your Business Getting the Most Out of It?

Has your business ever reached a point where it’s run out of spare capacity for important files on its hard drives? If so, you might wonder what you could do to make sure it never happens again. The same might go for moving large files around, which can be fiddly at the best of times when email accounts cannot cope.

More room

A business can never have enough spare capacity for files. A growing number of businesses throughout the world have turned to cloud computing to help with both storing and moving files, for a variety of reasons, which include:

  • Being able to store files online in a ‘cloud’, an online space where they can be accessed securely.
  • Being able to share files from the cloud with clients and colleagues – collaboration is also possible.
  • Providing a viable and far more cost-effective alternative to a traditional server.

While all this helps to make using the cloud palatable, there may be a possibility that businesses aren’t getting what they expect from some cloud storage providers, and that’s where enterprise cloud computing services like Egnyte come in.

Value for money

As with everything else they buy, businesses should make sure they get the most from their cloud storage and online file sharing package. There are a number of pitfalls facing companies who turn to the cloud for some of their IT solutions which they would do well to avoid.

The main one is the limits placed on the amount of file space you have to work with and the size of files which you can share. Many providers have limits in place, so it’s important to get as much space for as little money as possible. Also, consider what your business needs – how big are the files which you share and how much space do your files take up?
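If you are not sure how much space your files take up or how big your largest file is, a rough check like the sketch below can help. It is only an illustration: the function names and the example limits are hypothetical, and real providers may count usage differently (for example, by including file versions).

```python
from pathlib import Path


def storage_summary(root):
    """Return (total_bytes, largest_file_bytes, file_count) for regular files under root."""
    sizes = [p.stat().st_size for p in Path(root).rglob("*") if p.is_file()]
    return sum(sizes), max(sizes, default=0), len(sizes)


def fits_plan(root, quota_bytes, max_file_bytes):
    """Check a folder against a plan's total quota and per-file size limit (both hypothetical)."""
    total, largest, _ = storage_summary(root)
    return total <= quota_bytes and largest <= max_file_bytes


if __name__ == "__main__":
    GB = 10**9
    total, largest, count = storage_summary(".")
    print(f"{count} files, {total / GB:.2f} GB in total, largest file {largest / GB:.2f} GB")
    # Example limits only: a 100 GB quota with a 2 GB per-file cap.
    print("Fits example plan:", fits_plan(".", 100 * GB, 2 * GB))
```

Running it against your shared folders before comparing packages gives you two concrete numbers to hold up against each provider’s limits.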

As this article states, the way in which your file sharing vendor affects your internet connection is also important. If you have several file sharing accounts, they could slow your internet speed down, so take that into consideration before choosing the right service provider.

Safety first

Another factor that should influence choice of a cloud file sharing provider is the security of their services. Most providers have security software which limits opportunities for accounts to get hacked, while a few have taken extra steps to make users’ accounts practically impervious to even the most sophisticated malware.