Archive for Search Engines

In case you don’t know what I mean by “delisted” from Google, it essentially means removing one of your website or a portion of your website from the Google search result.

Now why would I want to do that?  Doesn’t that seem kinda silly in the realm of SEO where you want as much exposure as possible?  The answer is that there may be times where you have sensitive material you want to “hide” from the public.

In my case, it was a bunch of SEO Networker content Fernando and I put together that we didn’t want released into the public.  Well, unfortunately, Google bot got a hold of it and was more than happy to list its location.  Now anybody can search for it and get their hands on it.

So I went on a mission – how do you remove certain elements of your website from Google without actually removing it from your server?  After all, you still want it available to a select group of people.

Three things are required for this to happen:

  1. Get a Google Webmaster account and verify your website
  2. Setup a (proper) robots.txt file and upload it to your server (your top folder, where you main files reside)
  3. Submit a delist request to Google via Google Webmaster

Signing Up for Google Webmasters

Signing up for a webmaster account is a process well explained by… well, the process itself so I won’t bother listing out what needs to be done here.  Just head over to Google Webmaster and click on “Sign in to Webmaster Tools”, then follow the instructions provided (i.e. log into your Google account).

Google Webmasters

Google Webmasters

Your Robots. txt…

Okay, this is the fun part :)

Robots.txt is a file you put on your server (associated with your domain) that tells the search engine bots what they can/should and can’t/shouldn’t crawl.  If that was greek to you, let me explain another it in simpler terms.  Search engines have to know what part of your website they can show in the search results.  They look to your robots.txt file for permission.

You can go to Robotsxtx.org for more details.

The real question is… what do you put in this file so Google (and other search engines) know what NOT to crawl?

So this is what you do, first create a file called robots.txt (all in small letters) on your desktop.  Open it in an editor like notepad.  And then copy and paste the following snippet of code in there:

User-agent:  *
Disallow: /example/
Disallow: /sec

The first line says “what is stated in the following lines every single search engine robot“.  Then the ensuing lines tell them each of the folders you DON’T want listed.

For example, http://www.yourdomain.com/example/ would NOT be listed.  And neither would any of your subdomains that start with “sec” such as http://www.yourdomain.com/seconds and http://www.yourdomain.com/secrets.

Then you save the robots.txt file and upload it to your domain – at the very top folder (i.e. http://www.yourdomain.com)

Telling Google to Bug Off… Nicely & Strategically of Course :)

Okay, the last step is a snap and it takes place right inside your Google Webmaster account.  You simply log in, and on the menu to the left, click on “Tools” and then “Remove URLs”.

Remove URLs from Google

And all the remains now is to click on the button that says newremoverequest and then following the instructions on the screen (selecting an entire website, just a directory within your website, specific files, or some outdated info still residing in the Google’s database – this would force them to update what your webpage looks like in their database.Confused?»

Voila, That’s It Folks!

Google’s response was fairly quick, they had my requested folder delisted by the next time, when I checked on it.  Which folder got delisted…?  That’s for me to know and you to will… never find out :)

Raymond Fong

You can learn more by going here: Cached Pages.
Categories : Blog, Search Engines
Comments (3)
Dec
14

Google Trying to Bury Microsoft?

Posted by: Raymond Fong | Comments (2)

So I just went into panic mode for a bit there… My blog wouldn’t load on Internet Explorer (IE)! Yep, the very same blog I’ve been slaving away for many hours just poops out (yes, that’s a technical term) IE.

So I disabled all my plugins thinking maybe the culprit is one of the many I’ve installed. Nope.

No cigar – IE still crashes.

So I ran my blot through the HTML validator and it told me it was something wrong with my page… Great. Long story short, after suspecting it may have been one of my videos and RSS, I disabled Google Friend Connect (GFC) and voila! IE loads my blog just fine.

I re-enabled all my plugins and everything is still dandy.

And just for sanity check… I looked up some of my friend’s blogs that has GFC installed and confirmed that their blogs crash too.  Given the fact that approximately 50% of surfers online still use IE, I’ve decided to disable Google Friend ConnectGFC is fixed»

It just seems so unlike them, those geniuses working at Google, to make a product that blatantly crashes the very tool that half of the online users use.

UNLESS, they are trying to bury Microsoft while promoting their own browser, Chrome…  Hmm, I wonder how long it’ll be before GFC crashes FireFox too.  :)

In all seriousness, I hope this is just a bug and in that case, we should all blog about it and make them aware of the situation.Google Knows?»

With that said, given the fact ~that 40~50% of surfers (to the websites I run analytics on) use IE, I am disabling GFC until the bug is fixed – this is for all you IE users.

Raymond Fong

Okay, Google fixed it so it’s back on :)
According to some insider information, it sounds like Google’s aware of the problem. We have to remember that GFC is still in Beta.
Comments (2)

You know, Google seems, wait no… Google IS everywhere. And the latest buzz now is how they are going to take over social media as we discuss in SEO Networker – how they are implementing Google Friend Connect, OpenSocial, FOAF, XFN and how they are following publicly declared connections, social network activity, etc.

My question is this, what are MSN and Yahoo doing? I can’t imagine them laying low in this battle of the search engines, can they?

google-yahoo-msn-logos

And if they are, think of the implications, can you say utter and complete Google takeover *ahem* monopoly? After all, social media is the wave of the future and if MSN and Yahoo don’t get in that game, they may not be able to compete with Google period.

And what would THAT mean?

And if they ARE taking measures to ensure Google doesn’t get TOO far of them, then what are MSN and Yahoo (quietly) brewing?

Can you imagine them snickering in their underground labs creating some social media tool that will absolutely blow Google away, all the while all the spotlight is focused on Google? How awesome would that be?

For MSN and Yahoo’s sake, I hope that’s what they are doing…

Comments (1)

Socialize With Me!