Does anyone know what effect https would have on PageRank? Most of my SEO tools don't display much, if anything, when a given page is served over https. Just curious.
Yeah, I was thinking the same thing, that spiders may have difficulty crawling it, although I can crawl it using various sitemap software. Hmmmm....
If a page is password protected then the spider will not index it. Usually you see https on e-commerce checkout pages and secure forms on banking websites.
What information are you talking about? The thread is about PageRank, which https URLs can get... mine certainly did. The page won't be cached, but it will show backlinks and PageRank.
Google does crawl SSL; at times it will index and display the SSL pages instead of the non-SSL pages on my e-commerce sites.
Yes, it will index https. If you have an https version of your site, you should always exclude Google from it, or you will end up with big duplicate-content problems.
Never had a duplicate-content problem, plus the last time I looked there was no real way of excluding just the SSL version. Is there such a method now? Either way, never had a problem...
Very good advice! I do have the .htaccess set to redirect to https so the spider and visitors will continue on the https path. I found something on Google about it too.
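For what it's worth, the redirect itself is only a few lines of mod_rewrite. A minimal sketch, assuming Apache with mod_rewrite enabled (on some hosts you may have to test %{SERVER_PORT} against 443 instead of the %{HTTPS} variable):

    RewriteEngine On
    # Send any plain-HTTP request to the same URL over HTTPS (301 = permanent)
    RewriteCond %{HTTPS} off
    RewriteRule ^(.*)$ https://%{HTTP_HOST}/$1 [R=301,L]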
The above only works if you use different directories, or am I wrong? All of my SSL-enabled sites use symbolic links, with the same content in each folder, so I am unable to serve a different robots.txt.
A common SSL setup uses two different directories, so in a general setup it would be no problem to have two different robots.txt files. I, however, use symbolic links, so everything is served out of one directory, which is just much easier. I'm sure there is still a way, though, maybe an .htaccess rewrite rule that detects SSL and forwards to a different version... If I get time and am not overly lazy I might look into it.
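Actually, roughly what I have in mind (an untested sketch, assuming Apache with mod_rewrite; robots_ssl.txt is just a name I made up, call it whatever you like):

    RewriteEngine On
    # On HTTPS requests, silently serve robots_ssl.txt in place of robots.txt
    RewriteCond %{HTTPS} on
    RewriteRule ^robots\.txt$ robots_ssl.txt [L]

Then robots_ssl.txt just blocks everything:

    User-agent: *
    Disallow: /

That way the non-SSL robots.txt stays as it is, and spiders hitting the https side get told to stay out, even though both versions share one directory.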
In order to add a website URL to its index, a search engine must be able to access the site. Accessibility roadblocks are technologies or page elements past which a search engine spider cannot crawl. Robots exclusion and redirects are also important ways to manage how search engines access and index websites.

* Dynamic URLs and query strings: URLs containing query-string elements such as & or ? to dynamically retrieve data may not be accessible to search engine crawlers, so the content of pages with dynamically generated URLs may not be searchable.
* Secure Sockets Layer (SSL): Search engine crawlers are unable to access web pages encrypted using SSL protocols.
* JavaScript: Search engine crawlers do not follow links or page navigation written in JavaScript.
* Cookies and session IDs: Search engine crawlers do not accept cookies or work with session identifiers. Web pages requiring cookies or session IDs for access will not be searchable.
* Testing for roadblocks: View your pages in a text-only browser such as Lynx (or in Firefox with scripts and styles disabled) to see them roughly as a crawler does.
* Spider limits: Most search engine crawlers limit the page size or number of characters they will crawl. Decrease the size of large web pages by moving JavaScript and CSS to external files.
* Broken links: Search engine crawlers don't crawl past broken links.
* Sitemaps: A sitemap is a web page that lists and links to all of the pages of a site. Search engine crawlers can easily and effectively index a site using a sitemap. Sitemaps are especially useful for sites with content that is otherwise inaccessible due to dynamic URLs, SSL, or other roadblocks. Limit the number of links in a sitemap to fewer than 100, or build sitemaps around groups of pages.
* Non-HTML documents: Documents such as Word, Excel, PowerPoint, and Adobe PDF can be indexed by search engine crawlers. Assign a metadata title to Adobe and Microsoft documents using the File > Properties dialog.
* Canonical URLs: A search engine will consider http://utah.edu and http://www.utah.edu to be different websites. If both URL forms serve up the same pages, search engines will consider them duplicate content and dramatically reduce the relevance score of both. Use server (301) redirects to point the alternate URL forms to the canonical URL without a relevance penalty (see the sketch after this list).
* Redirects: Redirect instructions tell web browsers and crawlers to move on to a new or revised URL. Server 301 redirects are server-side permanent redirect instructions, which search engine spiders will follow. Server 302 redirects are server-side temporary redirect instructions, and most search engines will ignore them. Meta-refresh and JavaScript redirects are often used unethically to "cloak" content, and most search engine crawlers ignore them.
* Robots exclusion: The Robots Exclusion Protocol is a method that allows site administrators to indicate to visiting robots which parts of their site should not be visited. Robots can be specifically admitted or excluded on a site-wide, directory-by-directory, or page-by-page basis, using the robots.txt file or the robots meta tag.

I hope this solves your problem. Regards!
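On the canonical URL point, the 301 is just another couple of .htaccess lines. A minimal sketch, again assuming Apache with mod_rewrite (utah.edu is just the example domain from the list above):

    RewriteEngine On
    # Permanently redirect the bare domain to the www form
    # so the two don't compete as duplicate content
    RewriteCond %{HTTP_HOST} ^utah\.edu$ [NC]
    RewriteRule ^(.*)$ http://www.utah.edu/$1 [R=301,L]

The same pattern works in the other direction if you prefer the non-www form as your canonical URL.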