<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Natural Search Blog &#187; Supervised-Multiclass-Labeling</title>
	<atom:link href="http://www.naturalsearchblog.com/tag/supervised-multiclass-labeling/rss2" rel="self" type="application/rss+xml" />
	<link>http://www.naturalsearchblog.com</link>
	<description>Thought leaders in search engine optimization weigh in with the latest SEO news and commentary</description>
	<pubDate>Thu, 28 Aug 2008 19:21:10 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.1</generator>
	<language>en</language>
		<!-- podcast_generator="podPress/8.8" -->
		<copyright>&#xA9; </copyright>
		<managingEditor>chris@netconcepts.com ()</managingEditor>
		<webMaster>chris@netconcepts.com()</webMaster>
		<category></category>
		<itunes:keywords></itunes:keywords>
		<itunes:subtitle></itunes:subtitle>
		<itunes:summary>Thought leaders in search engine optimization weigh in with the latest SEO news and commentary</itunes:summary>
		<itunes:author></itunes:author>
		<itunes:category text="Society &amp; Culture"/>
		<itunes:owner>
			<itunes:name></itunes:name>
			<itunes:email>chris@netconcepts.com</itunes:email>
		</itunes:owner>
		<itunes:block>No</itunes:block>
		<itunes:explicit>no</itunes:explicit>
		<itunes:image href="http://www.naturalsearchblog.com/wp-content/plugins/podpress/images/powered_by_podpress_large.jpg" />
		<image>
			<url>http://www.naturalsearchblog.com/wp-content/plugins/podpress/images/powered_by_podpress.jpg</url>
			<title>Natural Search Blog</title>
			<link>http://www.naturalsearchblog.com</link>
			<width>144</width>
			<height>144</height>
		</image>
		<item>
		<title>New Research Could Improve Google Image Search</title>
		<link>http://www.naturalsearchblog.com/archives/2007/04/05/new-research-could-improve-google-image-search/</link>
		<comments>http://www.naturalsearchblog.com/archives/2007/04/05/new-research-could-improve-google-image-search/#comments</comments>
		<pubDate>Thu, 05 Apr 2007 13:14:16 +0000</pubDate>
		<dc:creator>Chris Silver Smith</dc:creator>
		
		<category><![CDATA[Google]]></category>

		<category><![CDATA[Image Optimization]]></category>

		<category><![CDATA[Research and Development]]></category>

		<category><![CDATA[Searching]]></category>

		<category><![CDATA[Google-Image-Labeler]]></category>

		<category><![CDATA[Google-Image-Search]]></category>

		<category><![CDATA[image-search]]></category>

		<category><![CDATA[Supervised-Multiclass-Labeling]]></category>

		<guid isPermaLink="false">http://www.naturalsearchblog.com/archives/2007/04/05/new-research-could-improve-google-image-search/</guid>
		<description><![CDATA[New research recently published out of the University of California, San Diego could allow Google&#8217;s Image Search to easily begin using elements from &#8220;true image search&#8221;: that is, the ability for software to detect and identify elements appearing within the image itself rather than just relying upon external text metadata to associate keywords with the [...]]]></description>
			<content:encoded><![CDATA[<p>New research recently published out of the University of California, San Diego could allow Google&#8217;s Image Search to easily begin using elements from &#8220;true image search&#8221;: that is, the ability for software to detect and identify elements appearing within the image itself rather than just relying upon external text metadata to associate keywords with the images. Read on for more details.</p>
<p><span id="more-223"></span></p>
<p>This new method is called &#8220;Supervised Multiclass Labeling (SML)&#8221;. In an article published in the IEEE (Institute of Electrical and Electronics Engineers) Computer Society journal TPAMI (Transactions on Pattern Analysis and Machine Intelligence) called &#8220;<a target="_blank" href="http://csdl2.computer.org/persagen/DLAbsToc.jsp?resourcePath=/dl/trans/tp/&amp;toc=comp/trans/tp/2007/03/i3toc.xml&amp;DOI=10.1109/TPAMI.2007.61" title="Supervised Learning of Semantic Classes for Image Annotation and Retrieval">Supervised Learning of Semantic Classes for Image Annotation and Retrieval</a>&#8221;, researchers <span class="articleauthor">Gustavo Carneiro, Antoni B. Chan, Pedro J. Moreno, and Nuno Vasconcelos describe a system that is trained on a seed set of photos which humans have labeled with keywords for the things seen in them. The trained system is then applied to a database of unlabeled photos. It calculates the probability that each of the objects or &#8220;classes&#8221; it has been trained to recognize is present, and labels the images accordingly. After labeling, images could then be retrieved via keyword searches using the newly generated metadata.</span></p>
<p><span class="articleauthor">The accuracy of this new method has apparently been superior to that of other previously published content-based image labeling systems developed by information retrieval specialists. The SML system can also split up images based on their identifiable regions of content, a process which has historically been quite difficult for software systems to accomplish. For example, this method could separate a landscape photo into mountain, sky and lake regions and then identify those things based on the training data.</span></p>
<p align="center"><a href="http://www.flickr.com/photos/silvery/113203407/" title="Mountains on Catalina Island"><img border="0" width="240" src="http://farm1.static.flickr.com/56/113203407_ef716f90d3_m.jpg" alt="Coast of Santa Catalina Island, facing San Clemente" height="180" /></a></p>
<p><span class="articleauthor">One of the engineers who contributed to the development of the method and the associated published paper, Pedro Moreno, is a researcher at Google who sometimes contributes to the <a href="http://googleresearch.blogspot.com/" title="Google Research Blog">Google Research Blog</a>. Google just happens to have large quantities of images to use for such research, of course.</span></p>
<p><span class="articleauthor">John Battelle <a href="http://battellemedia.com/archives/003453.php" title="Image Search - What Will Happen">recently mentioned</a> the dream of being able to truly search by an image&#8217;s content, and it would appear that the concept is really close to fruition through this SML method.</span></p>
<p><span class="articleauthor">Now, one must ask oneself, how might a search engine like Google first develop a good sample set of human-labeled images in order to &#8220;train&#8221; this sort of new algorithmic labeling program? But, oh, wait: Google already has an <a href="http://images.google.com/imagelabeler/" title="Google Image Labeler">Image Labeling</a> program in beta release, which invites users to come in and submit keywords to be associated with images through a sort of game. And Vanessa Fox told us in the SES Images &amp; Search panel discussion in Chicago last year that some of their users really enjoy participating in the Image Labeler program. So it&#8217;s not at all hard to connect the dots: if the automated Supervised Multiclass Labeling system were hooked up with the rich, trustworthy data developed through the Image Labeler program, Google would almost overnight have the ability to perform true image search based on images&#8217; graphic content.</span></p>
<p><span class="articleauthor">If that Image Labeler user-tagging method were associated with this new algorithmic method, those users could become live &#8220;trainers&#8221; for the software. As time progresses, the software could become steadily more accurate and more efficient at adding appropriate words as associated metadata for images.</span></p>
<p><span class="articleauthor">Using this method, Google might only need a relatively small seed set of images to be tagged by humans in order to train their software to identify millions of other images. The end result would be a fantastically better Image Search, using more accurate data for matching users&#8217; keyword search requests with images appropriate for the keyword. This sort of advantage would put them ahead of nearly all the other image search services out there.</span></p>
<p><span class="articleauthor">Of course, there are other true image search services, but I would bet that none of them has Google&#8217;s user following.</span></p>
<p><span class="articleauthor">A <a target="_blank" href="http://www.jacobsschool.ucsd.edu/news/news_releases/release.sfe?id=650" title="Supervised Multiclass Labeling System press release from UCSD">press release from UCSD</a> on the research paper includes a nice video of UCSD professor Nuno Vasconcelos talking about the SML method if you&#8217;re interested.</span></p>
<p><span class="articleauthor">(Nota bene: I&#8217;ll be speaking on increasing website traffic through image optimization and on optimizing for <a href="http://www.searchenginestrategies.com/sew/ny07/agenda3.html#ise" title="Images and Search SES Conference">Image Search at SES Conference</a> next week. I&#8217;ve also previously blogged on the subject of <a href="http://www.naturalsearchblog.com/archives/2006/03/22/need-more-traffic-try-image-search-optimization/" title="Image Optimization">image optimization</a>.)</span></p>
]]></content:encoded>
			<wfw:commentRss>http://www.naturalsearchblog.com/archives/2007/04/05/new-research-could-improve-google-image-search/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
