Linode Forum
Linode Community Forums
 FAQFAQ    SearchSearch    MembersMembers      Register Register 
 LoginLogin [ Anonymous ] 
Post new topic  Reply to topic
Author Message
 Post subject: Adsense and robots.tx
PostPosted: Wed Jun 15, 2011 8:56 am 
Offline
Senior Newbie

Joined: Wed Jun 15, 2011 8:37 am
Posts: 13
Website: http://aplawrence.com
My Adsense crawler errors report shows various files supposedly restricted by my robots.txt

They take the form of

http://webcache.googleusercontent.com/s ... google.com

with the details varying.

I don't see why my robots.txt would match that (or any of the others):

User-Agent: *
Disallow: /UNIXIART
Disallow: /pub
Disallow: /Anatests
Disallow: /Personal
Disallow: /MissDow
Disallow: /var/ftp/pub/psst-sample.pdf
Disallow: /cgi-bin/deepindexget.pl
Disallow: /cgi-bin/printer.pl
Disallow: /cgi-bin/newcomm.pl
Disallow: /cgi-bin/auth.pl
Disallow: /cgi-bin/countad.pl
Disallow: /cgi-bin/fatal.pl
Disallow: /cgi-bin/forumpost.pl
Disallow: /cgi-bin/freprint.pl
Disallow: /cgi-bin/showrelated.pl?
Disallow: /cgi-bin/getauthart.pl
Disallow: /cgi-bin/mkltest.pl
Disallow: /cgi-bin/mkpost.pl
Disallow: /cgi-bin/nav.pl
Disallow: /cgi-bin/randad.pl?
Disallow: /cgi-bin/snav.pl
Disallow: /cgi-bin/search3.pl
Disallow: /cgi-bin/supersearch.pl
Disallow: /cgi-bin/ta.pl
Disallow: /cgi-bin/tester.pl
Disallow: /Bela
Disallow: /Maps
Disallow: /errors
Disallow: /itech
Disallow: /Consultants/
Disallow: /prepub
Disallow: /buytests.html
Disallow: /download.html
Disallow: /contrib.html
Disallow: /Unix/sample
Disallow: /visitreport.html
Disallow: /SCOFAQ/upgrade.txt

In Googling around, I found a suggestion to add these lines:


User-agent: Mediapartners-Google
Allow: /

User-agent: Adsbot-Google
Allow: /

User-agent: Googlebot-Image
Allow: /

User-agent: Googlebot-Mobile
Allow: /

I don't know why I'd need to, but I'll try it - are the last two really Adsense related?


Top
   
 Post subject:
PostPosted: Wed Jun 22, 2011 11:44 am 
Offline
Senior Member

Joined: Sat Jun 12, 2010 4:53 pm
Posts: 77
Mine has those, fwiw. (A wordpress site) And google adsense says it can index ok without error.

Code:
User-agent: *
Disallow: /cgi-bin
Disallow: /wp-admin
Disallow: /wp-includes
Disallow: /wp-content
Disallow: /tag
Disallow: /author
Disallow: /wget/
Disallow: /httpd/
Disallow: /i/
Disallow: /f/
Disallow: /t/
Disallow: /c/
Disallow: /j/
 
User-agent: Mediapartners-Google
Allow: /
 
User-agent: Adsbot-Google
Allow: /
 
User-agent: Googlebot-Image
Allow: /
 
User-agent: Googlebot-Mobile
Allow: /
 



Top
   
Display posts from previous:  Sort by  
Post new topic  Reply to topic


Who is online

Users browsing this forum: No registered users and 3 guests


You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
RSS

Powered by phpBB® Forum Software © phpBB Group