Blogs

Gary's Blog

Has Google Been out Googled by Realtime Web Search?

     

In this edition of Outside the DXperience we’re going to take a look at the Realtime web; find out what it is and how search differs between the Traditional Web and the Realtime Web; we’ll look at who the players are in realtime search and who amongst them, if any, has the upper hand; before looking at the question on everyone’s lips… has Google been out Googled?

What is the Realtime Web?
Well put simply, the Realtime Web is the here and now. Where the traditional web consists of pages from commercial sites, blog posts – both amateur and professional – news sites, weather sites and pages of information of every conceivable topic; they all have one thing in common: they are in the past tense. Someone has conceived them, written them, polished them and finally published them. By the time you read it the information contained in the pages of the Traditional Web is at least hours old.

The Realtime Web, on the other hand, is the web of the here and now. Fuelled by social media tools like Facebook and Twitter that allow users to post their thoughts at any given time, on any given topic, the Realtime Web is a window on the life of the global citizenry. Posts on the Realtime Web are never really conceived but are more off the cuff remarks; the thinking out loud of any given, ordinary, person. These posts are certainly never polished but, instead, are a measure of the writer’s emotion at that time. Posts on the Realtime Web are usually what we call “soft” posts, containing opinion rather than fact; emotion rather than measured response. They are useful because they are what people think without the “white lie” that most of us use to survive in our day to day social interactions.

Of course, saying posts on the Realtime web are “soft” is a generalisation, one that holds true in most circumstances certainly, but a generalisation nevertheless. One area where this is not true of course is in the area of citizen journalism. When some “Johnny on the spot” can post words and sometimes pictures too of events as they happen, in a way that no news corporation could do, unless they were very, very fortunate. One example of this has been the coverage of the Iranian election, where most, if not all, of the most interesting coverage has come from people inside the country getting the word out via Realtime Web tools like Twitter and YouTube. Another place where you see the Realtime Web excel compared to the Traditional Web is during a natural disaster, where news of earthquakes are commonly first heard on Twitter or FriendFeed before you hear about them on the mainstream news channels.

What is the difference Between Traditional Web Search and Realtime Web Search?
Traditional Web search works by having a “crawler” (a software agent) examine as many web pages as it can and index those pages in large server farms. Then, when someone makes a query, the index is examined and matching results are returned, based on an algorithm that each search engine company hopes will out perform the other. At the moment, the undisputed king of Traditional Web search is Google.

Searching the Realtime web works differently. You don’t have to be a genius to realise that it would not be possible to store and index every piece of Realtime Web information as it is generated. Instead Realtime Web search relies on making API calls to social media service providers and have them provide a list of “what just happened now and in the recent past”. Twitter, for example, limit the results of any search to the previous seven days. 

Who are the Players in RealTime Web Search?
Until recently, the three major players in Realtime Web Search were Twitter, Facebook and Friendfeed. This, of course, has been whittled down to just two now that Friendfeed have been bought by Facebook. There are, of course, a number of smaller players, here are a few in their own words:

Scoopler is a real-time search engine. We aggregate and organize content being shared on the internet as it happens, like eye-witness reports of breaking news, photos and videos from big events, and links to the hottest memes of the day. We do this by constantly indexing live updates from services including Twitter, Flickr, Digg, Delicious and more. When you search for a topic on Scoopler, we give you the most relevant results, updated in real-time.

OneRiot crawls the links people share on Twitter, Digg and other social sharing services, then indexes the content on those pages in seconds. The end result is a search experience that allows users to find the freshest, most socially-relevant content from across the realtime web.

TweetMeme is a service which aggregates all the popular links on twitter to determine which links are popular. TweetMeme is able to categorize these links into categories and subcategories, making it easy to filter out the noise to find what your interested in.

Who has the Upper Hand?
Of the two big players, Twitter and Facebook, it is hard to say who has the upper hand at the moment as both have their advantages, which I’ll summarise here for you now:

Twitter

It was Twitter who was first to market and that gives them the all important “first mover advantage”. There are millions of people who know Twitter search and reach for it as those in the realm of the Traditional Web reach for Google. Add to that the fact that Twitter has more experience, they’ve been in the game for a year now, learning what works and – more importantly perhaps – what doesn’t. This means that they have the knowledge to respond quickly to whatever Facebook can bring to the market place. But, to my mind, Twitter’s main advantage is that it is an open platform.

Facebook

Despite what you might have heard, size does matter and, in those terms, Facebook is way ahead. Facebook has over 250 million users. With those people to draw on, Facebook can provide a more accurate picture of what people are talking about at any given moment. You are also, statistically, more likely to have a friend on Facebook than on Twitter. Facebook’s search covers a wider range of media; as well as statuses Facebook search can deliver video and pictures as well which, by the way, can be filtered.

Has Google Been Out Googled?
Let’s be honest, Google was a fluke. Don’t look at me like that, it was! They started out with the idea of indexing the web and somewhere along the road they found that they could make money by placing adverts on the result pages of people’s searches. Fair play to the lads at Google, they exploited that discovery to the max and made a fortune doing it. What Google did was to shift the “value base” of content. Before Google came along (and to a certain extend before the web) the “value base” was with the content providers. “He who writes the words earns the money” as it were. However, Google shifted that “value base” to be “he who finds the content earns the money”.

With the advent of the Realtime Web, that “value base” has shifted again to be “he who provides the tools earns the money”. We established above that no company – not even the mighty Google – can index all the Realtime Web content that is produced every second of every day, and we are going to become heavily dependant on the tool vendors to provide an API which we, or search engine companies, can access to gain results for a search query. Successful social media tool vendors can charge what they like to search engine companies like Google, Yahoo and Microsoft to have access to that API. This, I feel, is the most likely revenue stream for providers such as Twitter and if Google are not very careful here, they could very well find themselves out Googled.

Published Aug 24 2009, 06:40 PM by Gary Short (DevExpress)
Filed under:
Technorati tags: Outside DXperience
Bookmark and Share

Comments

 

Ben Hall said:

I think you should take a look at http://www.cuil.com.

I would say that companies are indexing in real-time, plus if twitter are going to make their money off their streaming API - who are they going to sell it too? The most obvious would be Google.

The question is - do we even need real-time? Apart from news (earthquakes, Michael Jackson etc)...  What is real-time? Is a 5 minute delay considered real-time? Do we need it any quicker?

August 24, 2009 2:42 PM
 

Matthew Bender said:

You can't be serious. None of the "Realtime" search engines you mentioned are even remotely positioned in opposition to Google. They might be more appropriate for finding information on Twitter and such but they're not even trying to compete with Google on it's territory. As for Google being a "fluke", you must have access to a different history book (or alternat reality) than I have. They set out to build the best search engine of the day and did. It doesn't take a genius to connect the dots after that and make money from advertising.

August 26, 2009 2:51 PM
 

Gary Short (DevExpress) said:

I'm perfectly serious Matthew. You say that these search engines are not "positioned in opposition to Google" and that they are "more appropriate for finding information on Twitter and such". That is exactly my point. They have moved into realtime search of the "here and now" whilst Google are using an "index and query" model of the past which can't hope to cope with the enormity of information that is created on a second by second basis.

With regard to Google being being a fluke I'm sure I'm reading the same history books that you are. You say "it doesn't take a genius to connect the dots after that and make money from advertising" but it is well known that Larry and Sergey had an instinctive aversion to advertising, they also feared it would corrupt the search results, this was made clear in a paper that they published on their search engine in the early days. It was a meeting with their first investor, Bechtolsheim, that first made them see the possibilities. If he had been too busy to take that meeting...

Also, Google owes much of it's success to the fact that some outstanding engineers became available for hiring due to the downturn in the tech. industry at that time. I don't feel it is unreasonable to label these events as flukes.

August 26, 2009 3:32 PM
 

Matthew Bender said:

My point was that Google could incorporate "realtime" search but that the reverse will certainly never be true. If that still means they've "out Google'd Google" in your view then I don't have anything else to say on the topic.

With regard to the "fluke" comments, I suppose everyone is entitled to their own opinion. Mine is that commercial success of Google was a natural extension of the effectiveness of the search engine - which I don't see as a fluke - availability of programming talent aside.

August 27, 2009 12:27 PM
 

Gary Short (DevExpress) said:

Hi Matthew, well I really can't agree that Goolge could "incorporate realtime search" any time they want. With the traditional web all Google had to do was issue an http GET request and, bam, they had the content. Not so with the realtime web, the content isn't available via an http GET. The content is behind an "API wall" that the tool vendors own. The same way Google out outpaced the competition with better software and their own "secret sauce" hardware. That's what I mean by "out Googled Google".

With regard to the "fluke" comment I say a chance meeting and a change downturn are flukes but hey, you're right, everyone's entitled to their own opinion and it all makes for a good conversation, right?

August 27, 2009 1:44 PM
More from DevExpress
Live Chat
Have a pre-sales question?
Need assistance with your evaluation?
We are here to help.
Chat is one of the many ways you can contact members of the DevExpress Team. We are available Monday-Friday between 8:30am and 5:00pm Pacific Time.
If you need additional product information, require pre-sales assistance, or want help with your order, write to us at info@devexpress.com or call us at
+1 (818) 844-3383.