<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Boston Media Domain &#187; Search Engine Results Pages</title>
	<atom:link href="http://www.bostonmediadomain.com/tag/search-engine-results-pages/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bostonmediadomain.com</link>
	<description>Search, Social and Online Media for Domains</description>
	<lastBuildDate>Mon, 25 Apr 2011 15:18:01 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Bayesian Algorithm for Search</title>
		<link>http://www.bostonmediadomain.com/bayesian-algorithm-search/</link>
		<comments>http://www.bostonmediadomain.com/bayesian-algorithm-search/#comments</comments>
		<pubDate>Tue, 26 Jan 2010 01:06:48 +0000</pubDate>
		<dc:creator>jeff selig</dc:creator>
				<category><![CDATA[Commentary & Analysis]]></category>
		<category><![CDATA[Search Marketing PPC]]></category>
		<category><![CDATA[SEO Analysis]]></category>
		<category><![CDATA[Social Media Metrics]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[Search Engine Results Pages]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://bostonmediadomain.com/?p=1672</guid>
		<description><![CDATA[Is there a better search methodology than linking structures and authority of those links used by Google? The Bayesian Optimization Algorithm (BOA) is an algorithm based on the estimation of distributions. It uses techniques from modeling data by Bayesian networks to estimating the joint distribution of promising solutions. To obtain the structure of Bayesian network, different search algorithms can be used. The key point that BOA addresses is whether the constructed Bayesian network could generate new and useful solutions (strings), which could lead the algorithm in the right direction to solve the problem. Undoubtedly, this ability is a crucial factor of the efficiency of BOA. There has been much recent work on optimization algorithms that build probability models based on good solutions found so far and use the constructed models to guide further search. This class of algorithms is called Estimation of distribution algorithms (EDAs). The general scheme of EDAs works as follows: (1) Develop a probability distribution model by using initial population (2) Sampling step: generate a data set by sampling from the probability model (3) Testing step: test the data as solutions to the problem (4) Selection step: create an improved data set by selecting the better solutions [...]]]></description>
			<content:encoded><![CDATA[<!-- Start Shareaholic LikeButtonSetTop --><!-- End Shareaholic LikeButtonSetTop --><div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fbayesian-algorithm-search%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fbayesian-algorithm-search%2F&amp;source=seosem&amp;style=normal&amp;service=is.gd&amp;hashtags=image+search,Search+Engine+Results+Pages,SEO&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>Is there a better search methodology than linking structures and authority of those links used by Google?</p>
<p>The Bayesian Optimization Algorithm (BOA) is an algorithm based on the estimation of distributions. It uses techniques from modeling data by Bayesian networks to estimating the joint distribution of promising solutions. To obtain the structure of Bayesian network, different search algorithms can be used. The key point that BOA addresses is whether the constructed Bayesian network could generate new and useful solutions (strings), which could lead the algorithm in the right direction to solve the problem. Undoubtedly, this ability is a crucial factor of the efficiency of BOA.</p>
<p><img class="alignright size-full wp-image-1673" title="BayesProbSolnAlgorithm" src="http://www.bostonmediadomain.com/wp-content/uploads/2010/01/BayesProbSolnAlgorithm.jpg" alt="BayesProbSolnAlgorithm" width="649" height="517" />There has been much recent work on optimization algorithms that build probability models based on good solutions found so far and use the constructed models to guide further search. This class of algorithms is called Estimation of distribution algorithms (EDAs). The general scheme of EDAs works as follows:<br />
(1) Develop a probability distribution model by using initial population<br />
(2) Sampling step: generate a data set by sampling from the probability model<br />
(3) Testing step: test the data as solutions to the problem<br />
(4) Selection step: create an improved data set by selecting the better solutions and removing the worse ones<br />
(5) Learning step: create a new probability model from the old model and the improved data set<br />
(6) If the termination criteria are not met, go to (2)</p>
<p>&#8220;Naive Bayes&#8221; machine learning algorithm. It is a well-studied probabilistic algorithm often used in automatic text categorization. Compared to other algorithms (kNN, SVM, Decision Trees), it&#8217;s pretty fast and reasonably competitive in the quality of its results.</p>
<p>BOA Procedure<br />
The procedure of the BOA follows:<br />
(1) Set t=0, randomly generate an initial population P (0)<br />
(2) Select a set of promising strings S (t) from P (t)<br />
(3) Construct a Bayesian network B using a chosen metric<br />
and constraints<br />
(4) Generate a set of new strings O (t) according to the joint<br />
distribution encoded by B<br />
(5) Create a new population P (t+1) by replacing some<br />
strings from P (t) with O (t), set t=t+1<br />
(6) If the termination criteria are not met, go to (2)</p>
<p>Here is the algo in a bit more detail</p>
<dl>
<dd>Creates a new <code>Algorithm::NaiveBayes</code> object and returns it. The following parameters are accepted:</p>
<dl>
<dt><a name="purge">purge</a></dt>
<dd>If set to a true value, the <code>do_purge()</code> method will be invoked during <code>train()</code>. The default is true. Set this to a false value if you&#8217;d like to be able to add additional instances after training and then call <code>train()</code> again.</p>
</dd>
</dl>
</dd>
<dt><a name="add_instance"></a><a name="add_instance(_attributes_=&gt;_HASH,_label_=&gt;_STRING|ARRAY_)">add_instance( attributes =&gt; HASH, label =&gt; STRING|ARRAY )</a></dt>
<dd>Adds a training instance to the categorizer. The <code>attributes</code> parameter contains a hash reference whose keys are string attributes and whose values are the weights of those attributes. For instance, if you&#8217;re categorizing text documents, the attributes might be the words of the document, and the weights might be the number of times each word occurs in the document.</p>
<p>The <code>label</code> parameter can contain a single string or an array of strings, with each string representing a label for this instance. The labels can be any arbitrary strings. To indicate that a document has no applicable labels, pass an empty array reference.</p>
</dd>
<dt><a name="train"></a><a name="train()">train()</a></dt>
<dd>Calculates the probabilities that will be necessary for categorization using the <code>predict()</code> method.</p>
</dd>
<dt><a name="predict"></a><a name="predict(_attributes_=&gt;_HASH_)">predict( attributes =&gt; HASH )</a></dt>
<dd>Use this method to predict the label of an unknown instance. The attributes should be of the same format as you passed to <code>add_instance()</code>. <code>predict()</code> returns a hash reference whose keys are the names of labels, and whose values are the score for each label. Scores are between 0 and 1, where 0 means the label doesn&#8217;t seem to apply to this instance, and 1 means it does.</p>
<p>In practice, scores using Naive Bayes tend to be very close to 0 or 1 because of the way normalization is performed. I might try to alleviate this in future versions of the code.</p>
</dd>
<dt><a name="labels"></a><a name="labels()">labels()</a></dt>
<dd>Returns a list of all the labels the object knows about (in no particular order), or the number of labels if called in a scalar context.</p>
</dd>
<dt><a name="do_purge"></a><a name="do_purge()">do_purge()</a></dt>
<dd>Purges training instances and their associated information from the NaiveBayes object. This can save memory after training.</p>
</dd>
<dt><a name="purge"></a><a name="purge()">purge()</a></dt>
<dd>Returns true or false depending on the value of the object&#8217;s <code>purge</code> property. An optional boolean argument sets the property.</p>
</dd>
<dt><a name="save_state"></a><a name="save_state($path)">save_state($path)</a></dt>
<dd>This object method saves the object to disk for later use. The <code>$path</code> argument indicates the place on disk where the object should be saved:</p>
<pre>  $nb-&gt;save_state($path);</pre>
</dd>
<dt><a name="restore_state"></a><a name="restore_state($path)">restore_state($path)</a></dt>
<dd>This class method reads the file specified by <code>$path</code> and returns the object that was previously stored there using <code>save_state()</code>:</p>
<pre>  $nb = Algorithm::NaiveBayes-&gt;restore_state($path);<a title="click to go to top of document" name="THEORY" href="http://search.cpan.org/%7Ekwilliams/Algorithm-NaiveBayes-0.04/lib/Algorithm/NaiveBayes.pm#___top">
</a></pre>
</dd>
</dl>
<p>Bayes&#8217; Theorem is a way of inverting a conditional probability. It states:</p>
<pre>                P(y|x) P(x)
      P(x|y) = -------------
                   P(y)</pre>
<p>The notation <code>P(x|y)</code> means &#8220;the probability of <code>x</code> given <code>y</code>.&#8221;</p>
<p>In this case, we want to know the probability of a given category given a certain string of words in a document, so we have:</p>
<pre>                    P(words | cat) P(cat)
  P(cat | words) = --------------------
                           P(words)</pre>
<p>We have applied Bayes&#8217; Theorem because <code>P(cat | words)</code> is a difficult quantity to compute directly, but <code>P(words | cat)</code> and <code>P(cat)</code> are accessible (see below).</p>
<p>The greater the expression above, the greater the probability that the given document belongs to the given category. So we want to find the maximum value. We write this as</p>
<pre>                                 P(words | cat) P(cat)
  Best category =   ArgMax      -----------------------
                   cat in cats          P(words)</pre>
<p>Since <code>P(words)</code> doesn&#8217;t change over the range of categories, we can get rid of it. That&#8217;s good, because we didn&#8217;t want to have to compute these values anyway. So our new formula is:</p>
<pre>  Best category =   ArgMax      P(words | cat) P(cat)
                   cat in cats</pre>
<p>Finally, we note that if <code>w1, w2, ... wn</code> are the words in the document, then this expression is equivalent to:</p>
<pre>  Best category =   ArgMax      P(w1|cat)*P(w2|cat)*...*P(wn|cat)*P(cat)
                   cat in cats</pre>
<p>That&#8217;s the formula I use in my document categorization code. The last step is the only non-rigorous one in the derivation, and this is the &#8220;naive&#8221; part of the Naive Bayes technique. It assumes that the probability of each word appearing in a document is unaffected by the presence or absence of each other word in the document. We assume this even though we know this isn&#8217;t true: for example, the word &#8220;iodized&#8221; is far more likely to appear in a document that contains the word &#8220;salt&#8221; than it is to appear in a document that contains the word &#8220;subroutine&#8221;. Luckily, as it turns out, making this assumption even when it isn&#8217;t true may have little effect on our results.</p>
<input id="gwProxy" type="hidden" />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<input id="gwProxy" type="hidden" />
<input id="jsProxy" onclick="jsCall();" type="hidden" />
<div class="shr-publisher-1672"></div><!-- Start Shareaholic LikeButtonSetBottom --><!-- End Shareaholic LikeButtonSetBottom -->]]></content:encoded>
			<wfw:commentRss>http://www.bostonmediadomain.com/bayesian-algorithm-search/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Search Engine Results Pages (SERP&#039;s)</title>
		<link>http://www.bostonmediadomain.com/search-engine-results-pages-serps/</link>
		<comments>http://www.bostonmediadomain.com/search-engine-results-pages-serps/#comments</comments>
		<pubDate>Thu, 12 Mar 2009 01:33:49 +0000</pubDate>
		<dc:creator>jeff selig</dc:creator>
				<category><![CDATA[Commentary & Analysis]]></category>
		<category><![CDATA[Domain News]]></category>
		<category><![CDATA[Search Marketing PPC]]></category>
		<category><![CDATA[SEO Analysis]]></category>
		<category><![CDATA[ORGANIC]]></category>
		<category><![CDATA[paid inclusion]]></category>
		<category><![CDATA[Paid Search]]></category>
		<category><![CDATA[pay per click]]></category>
		<category><![CDATA[PPC]]></category>
		<category><![CDATA[search engine marketing]]></category>
		<category><![CDATA[Search Engine Results Pages]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[SERP]]></category>
		<category><![CDATA[SERP's]]></category>

		<guid isPermaLink="false">http://bostonmediadomain.com/?p=604</guid>
		<description><![CDATA[Organic Listings (also referred to as &#8220;natural&#8221; or &#8220;free&#8221; listings), and Paid Advertisements bought on a &#8220;Pay Per Click&#8221; basis (PPC) or Cost Per Thousand (CPM). When you think of SERP&#8217;s think organic results, where 80 percent of the internet users click first. Most searchers decide in a matter of seconds which search listings to click on. 36% of internet users think that if your website is top in the SERP&#8217;s you are the top brand in your field, (iProspect 2006) and 92% higher click through rate when paid and organic search are done together. When looking at your site&#8217;s rankings in top search engines, what you really want to see is: 1. the most relevant page from your site (for a given search query), 2. high on the organic search results (SERP&#8217;s), 3. clear and concise site description (the NOSNIPPET meta tag tells Google not to show a snippet (desc) on SERP&#8217;s), 4. pages that relate to the query Dominating the SERP&#8217;s has everything to do with connections! Not showing up in the results you query? Then you need to develop better content and links.]]></description>
			<content:encoded><![CDATA[<!-- Start Shareaholic LikeButtonSetTop --><!-- End Shareaholic LikeButtonSetTop --><div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fsearch-engine-results-pages-serps%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fsearch-engine-results-pages-serps%2F&amp;source=seosem&amp;style=normal&amp;service=is.gd&amp;hashtags=ORGANIC,paid+inclusion,Paid+Search,pay+per+click,PPC,search+engine+marketing,Search+Engine+Results+Pages,SEO,SERP,SERP%27s&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>Organic Listings (also referred to as &#8220;natural&#8221; or &#8220;free&#8221; listings), and Paid Advertisements bought on a &#8220;Pay Per Click&#8221; basis (PPC) or Cost Per Thousand (CPM).</p>
<p>When you think of SERP&#8217;s think organic results, where 80 percent of the internet users click first.<br />
Most searchers decide in a matter of seconds which search listings to click on. 36% of internet users think that if your website is top in the SERP&#8217;s you are the top brand in your field, (iProspect 2006) and 92% higher click through rate when paid and organic search are done together.</p>
<p>When looking at your site&#8217;s rankings in top search engines, what you really want to see is:</p>
<p>   1. the most relevant page from your site (for a given search query),<br />
   2. high on the organic search results (SERP&#8217;s),<br />
   3. clear and concise site description (the NOSNIPPET meta tag tells Google not to show a snippet (desc) on SERP&#8217;s),<br />
   4. pages that relate to the query</p>
<p>Dominating the SERP&#8217;s has everything to do with connections! Not showing up in the results you query? Then you need to develop better content and links.</p>
<div class="shr-publisher-604"></div><!-- Start Shareaholic LikeButtonSetBottom --><!-- End Shareaholic LikeButtonSetBottom -->]]></content:encoded>
			<wfw:commentRss>http://www.bostonmediadomain.com/search-engine-results-pages-serps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

