<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Natural Search Blog &#187; On-Page-Factors</title>
	<atom:link href="http://www.naturalsearchblog.com/tag/on-page-factors/rss2" rel="self" type="application/rss+xml" />
	<link>http://www.naturalsearchblog.com</link>
	<description>Thought leaders in search engine optimization weigh in with the latest SEO news and commentary</description>
	<pubDate>Thu, 09 Oct 2008 19:12:37 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.1</generator>
	<language>en</language>
		<!-- podcast_generator="podPress/8.8" -->
		<copyright>&#xA9; </copyright>
		<managingEditor>chris@netconcepts.com ()</managingEditor>
		<webMaster>chris@netconcepts.com()</webMaster>
		<category></category>
		<itunes:keywords></itunes:keywords>
		<itunes:subtitle></itunes:subtitle>
		<itunes:summary>Thought leaders in search engine optimization weigh in with the latest SEO news and commentary</itunes:summary>
		<itunes:author></itunes:author>
		<itunes:category text="Society &amp; Culture"/>
		<itunes:owner>
			<itunes:name></itunes:name>
			<itunes:email>chris@netconcepts.com</itunes:email>
		</itunes:owner>
		<itunes:block>No</itunes:block>
		<itunes:explicit>no</itunes:explicit>
		<itunes:image href="http://www.naturalsearchblog.com/wp-content/plugins/podpress/images/powered_by_podpress_large.jpg" />
		<image>
			<url>http://www.naturalsearchblog.com/wp-content/plugins/podpress/images/powered_by_podpress.jpg</url>
			<title>Natural Search Blog</title>
			<link>http://www.naturalsearchblog.com</link>
			<width>144</width>
			<height>144</height>
		</image>
		<item>
		<title>Google Sitemaps Reveal Some of the Black Box</title>
		<link>http://www.naturalsearchblog.com/archives/2006/07/11/google-sitemaps-reveal-some-of-the-black-box/</link>
		<comments>http://www.naturalsearchblog.com/archives/2006/07/11/google-sitemaps-reveal-some-of-the-black-box/#comments</comments>
		<pubDate>Wed, 12 Jul 2006 04:54:44 +0000</pubDate>
		<dc:creator>Chris Silver Smith</dc:creator>
		
		<category><![CDATA[Google]]></category>

		<category><![CDATA[Tools]]></category>

		<category><![CDATA[Algorithms]]></category>

		<category><![CDATA[Authoritative-Hubs]]></category>

		<category><![CDATA[ExpertRank]]></category>

		<category><![CDATA[Hubs]]></category>

		<category><![CDATA[Keyword-Classification]]></category>

		<category><![CDATA[On-Page-Factors]]></category>

		<category><![CDATA[PageRank]]></category>

		<category><![CDATA[Search Engine Optimization]]></category>

		<category><![CDATA[SEO]]></category>

		<category><![CDATA[Sitemaps]]></category>

		<guid isPermaLink="false">http://www.naturalsearchblog.com/archives/2006/07/11/google-sitemaps-reveal-some-of-the-black-box/</guid>
		<description><![CDATA[I earlier mentioned the recent Sitemaps upgrades which were announced in June, and how I thought these were useful for webmasters. But, the Sitemaps tools may also be useful in other ways beyond the obvious/intended ones.
The information that Google has made available in Sitemaps is providing a cool bit of intel on yet another one [...]]]></description>
			<content:encoded><![CDATA[<p><font size="2">I <a href="http://www.naturalsearchblog.com/archives/2006/06/26/google-sitemaps-upgrades-help-webmasters/">earlier mentioned</a> the recent Sitemaps upgrades which were <a href="http://sitemaps.blogspot.com/2006/06/get-more-from-latest-release.html">announced</a> in June, and how I thought these were useful for webmasters. But, the Sitemaps tools may also be useful in other ways beyond the obvious/intended ones.</font></p>
<p><font size="2">The information that Google has made available in Sitemaps is providing a cool bit of intel on yet another one of the <a href="http://blog.searchenginewatch.com/blog/060510-123802">200+ parameters</a> or &#8220;signals&#8221; that they&#8217;re using to rank pages for SERPs.</font></p>
<p><font size="2">For reference, check out the Page Analysis Statistics that are provided in Sitemaps for my &#8220;Acme&#8221; products and services experimental site:</font><font size="2"> </font></p>
<p align="center"><font size="2"><img src="http://static.flickr.com/60/187763732_1e173c0fe4.jpg" alt="Google Sitemaps Page Analysis" height="500" width="399" /></font></p>
<p><font size="2">It seems unlikely to me that these stats on &#8220;Common Words&#8221; found &#8220;In your site&#8217;s content&#8221; were generated just for the sake of providing nice tools for us in Sitemaps. No, the more likely scenario would seem to be that Google was already collating the most-common words found on your site for their own uses, and then they later chose to provide some of these stats to us in Sitemaps.</font></p>
<p><font size="2">This is significant, because we&#8217;ve already known that Google tracks keyword content for each page in order to assess its relevancy for search queries made with that term. But, why would Google be tracking your most-common keywords in a site-wide context?</font></p>
<p><font size="2">One good explanation presents itself: Google might be tracking common terms used throughout a site in order to assess if that site should be considered authoritative for particular keywords or thematic categories.</font></p>
<p><font size="2">Early on, algorithmic researchers such as Jon Kleinberg <a href="http://www.cs.cornell.edu/home/kleinber/auth.pdf" target="new">worked on methods</a> by which &#8220;authoritative&#8221; sites and &#8220;hubs&#8221; could be identified. <a href="http://domino.research.ibm.com/comm/wwwr_thinkresearch.nsf/pages/webhounds399.html" target="new">IBM and others</a> did further research on authority/hub identification, and I heard engineers from Teoma speak on the importance of these approaches a few times at SES conferences when <a href="http://about.ask.com/en/docs/about/webmasters.shtml" target="new">explaining the ExpertRank system</a> their algorithms were based upon.</font></p>
<p><font size="2">So, it&#8217;s not all that surprising that Google may be trying to use commonly-occuring text to help identify Authoritative sites for various themes. This would be one good automated method for classifying sites for subject matter categories and keywords.</font></p>
<p><font size="2">The take-away concept is that Google may be using words found in the visible text throughout your site to assess whether you&#8217;re authoritative for particular themes or not.</font></p>
<p><font size="2">Â </font></p>
]]></content:encoded>
			<wfw:commentRss>http://www.naturalsearchblog.com/archives/2006/07/11/google-sitemaps-reveal-some-of-the-black-box/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
