<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Boston Media Domain &#187; google patent</title>
	<atom:link href="http://www.bostonmediadomain.com/tag/google-patent/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bostonmediadomain.com</link>
	<description>Search, Social and Online Media for Domains</description>
	<lastBuildDate>Mon, 25 Apr 2011 15:18:01 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google Patent on Anchor Based Google Sitelinks</title>
		<link>http://www.bostonmediadomain.com/google-patent-anchor-based-google-sitelinks/</link>
		<comments>http://www.bostonmediadomain.com/google-patent-anchor-based-google-sitelinks/#comments</comments>
		<pubDate>Mon, 30 Nov 2009 20:50:09 +0000</pubDate>
		<dc:creator>jeff selig</dc:creator>
				<category><![CDATA[SEO Analysis]]></category>
		<category><![CDATA[ARTIFICIAL ANCHOR FOR A DOCUMENT]]></category>
		<category><![CDATA[google patent]]></category>

		<guid isPermaLink="false">http://bostonmediadomain.com/?p=1322</guid>
		<description><![CDATA[Photo by Andrea [bah! la realtà!]Methods, systems, and apparatus, including computer program products, for linking to an intra-document portion of a target document includes receiving an address for a target document identified by a search engine in response to a query, the target document including query-relevant text that identifies an intra-document portion of the target document, the intra-document portion including the query relevant text. An artificial anchor is generated, the artificial anchor corresponding to the intra-document portion. The artificial anchor is appended the address. United States Patent Application 20090287698 Kind Code A1 Marmaros; David P. ;   et al. November 19, 2009 ARTIFICIAL ANCHOR FOR A DOCUMENT Abstract Methods, systems, and apparatus, including computer program products, for linking to an intra-document portion of a target document includes receiving an address for a target document identified by a search engine in response to a query, the target document including query-relevant text that identifies an intra-document portion of the target document, the intra-document portion including the query relevant text. An artificial anchor is generated, the artificial anchor corresponding to the intra-document portion. The artificial anchor is appended the address. Inventors: Marmaros; David P.; (Mountain View, CA) ; Gomes; Benedict A.; (Mountain View, [...]]]></description>
			<content:encoded><![CDATA[<!-- Start Shareaholic LikeButtonSetTop --><!-- End Shareaholic LikeButtonSetTop --><div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fgoogle-patent-anchor-based-google-sitelinks%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fgoogle-patent-anchor-based-google-sitelinks%2F&amp;source=seosem&amp;style=normal&amp;service=is.gd&amp;hashtags=ARTIFICIAL+ANCHOR+FOR+A+DOCUMENT,google+patent&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><span class="wp-decoratr-image"><img src="http://farm3.static.flickr.com/2016/2293760162_7769144459_m.jpg" alt="Brooklyn bridge" /><br />
<a rel="external nofollow" href="http://www.flickr.com/photos/94185697@N00/2293760162">Photo by Andrea [bah! la realtà!]</a></span>Methods, systems, and apparatus, including computer program products, for linking to an intra-document portion of a target document includes receiving an address for a target document identified by a search engine in response to a query, the target document including query-relevant text that identifies an intra-document portion of the target document, the intra-document portion including the query relevant text. An artificial anchor is generated, the artificial anchor corresponding to the intra-document portion. The artificial anchor is appended the address.</p>
<hr />
<table border="0" width="100%">
<tbody>
<tr>
<td width="50%" align="LEFT"><strong>United States Patent Application</strong></td>
<td width="50%" align="RIGHT"><strong><strong><em>20090287698</em></strong> </strong></td>
</tr>
<tr>
<td width="50%" align="LEFT" valign="TOP"><strong>Kind Code</strong></td>
<td width="50%" align="RIGHT"><strong>A1 </strong></td>
</tr>
<tr>
<td width="50%" align="LEFT"><strong> Marmaros; David P. ;   et al.</strong></td>
<td width="50%" align="RIGHT"><strong> November 19, 2009 </strong></td>
</tr>
</tbody>
</table>
<hr /><span style="font-size: xx-small;">ARTIFICIAL ANCHOR FOR A DOCUMENT </span></p>
<p><strong>Abstract</strong></p>
<p>Methods, systems, and apparatus, including computer program products, for      linking to an intra-document portion of a target document includes      receiving an address for a target document identified by a search engine      in response to a query, the target document including query-relevant text      that identifies an intra-document portion of the target document, the      intra-document portion including the query relevant text. An artificial      anchor is generated, the artificial anchor corresponding to the      intra-document portion. The artificial anchor is appended the address.</p>
<hr />
<table border="0" width="100%">
<tbody>
<tr>
<td width="10%" align="LEFT" valign="TOP">Inventors:</td>
<td width="90%" align="LEFT"><strong>Marmaros; David P.</strong>; <em>(Mountain View, CA)</em> <strong>; Gomes; Benedict A.</strong>; <em>(Mountain View, CA)</em> <strong>; Bharat; Krishna</strong>; <em>(Palo Alto, CA)</em></td>
</tr>
<tr>
<td width="10%" align="LEFT" valign="TOP">Correspondence Name and Address:</td>
<td width="90%" align="LEFT"><strong> </strong></p>
<pre><strong>    FISH &amp; RICHARDSON P.C.
    PO BOX 1022
    MINNEAPOLIS
    MN
    55440-1022
    US</strong></pre>
<p><strong> </strong></td>
</tr>
<tr>
<td width="10%" align="LEFT" valign="TOP">Assignee Name and Adress:</td>
<td width="90%" align="LEFT"><strong>GOOGLE INC., a Delaware corporation</strong><br />
<strong>Mountain View</strong><br />
<strong>CA</strong></td>
</tr>
<tr>
<td width="10%" align="LEFT" valign="TOP">Serial No.:</td>
<td width="90%" align="LEFT"><strong>508936</strong></td>
</tr>
<tr>
<td width="10%" align="LEFT" valign="TOP">Series Code:</td>
<td width="90%" align="LEFT"><strong>12 </strong></td>
</tr>
<tr>
<td width="10%" align="LEFT" valign="TOP">Filed:</td>
<td width="90%" align="LEFT"><strong>July 24, 2009</strong></td>
</tr>
</tbody>
</table>
<table border="0" width="100%">
<tbody>
<tr>
<td width="40%" align="LEFT" valign="TOP"><strong>U.S. Current Class:</strong></td>
<td width="60%" align="RIGHT" valign="TOP"><strong>707/5</strong>; 707/E17.108</td>
</tr>
<tr>
<td width="40%" align="LEFT" valign="TOP"><strong>U.S. Class at Publication:</strong></td>
<td width="60%" align="RIGHT" valign="TOP"><strong>707/5</strong>; 707/E17.108</td>
</tr>
<tr>
<td width="40%" align="LEFT" valign="TOP"><strong>Intern&#8217;l Class: </strong></td>
<td width="60%" align="RIGHT" valign="TOP">G06F 17/30 20060101 G06F017/30</td>
</tr>
</tbody>
</table>
<hr /><strong><em>Claims</em></strong></p>
<hr />1. A computer-implemented method, comprising:receiving an address for a      target document identified by a search engine in response to a query, the      target document including query-relevant text that identifies an      intra-document portion of the target document, the intra-document portion      including the query-relevant text;generating an artificial anchor, the      artificial anchor corresponding to the intra-document portion;      andappending the artificial anchor to the address.</p>
<p>2. The method of claim 1, wherein the artificial anchor comprises an      instruction for an artificial anchor module in a document browser on a      client device to navigate directly to the intra-document portion of the      target document when the target document is displayed on the client      device.</p>
<p>3. The method of claim 1, wherein appending the artificial anchor      comprises appending the artificial anchor to a uniform resource locator      that specifies the location of the target document.</p>
<p>4. The method of claim 1, wherein the artificial anchor includes a      preassigned artificial anchor designator designating the artificial      anchor to an artificial anchor module in a client device.</p>
<p>5. The method of claim 4, wherein the artificial anchor includes the      preassigned artificial anchor designator as either a prefix or a suffix      and wherein the preassigned artificial anchor designator includes a      preassigned set of text characters.</p>
<p>6. The method of claim 1, wherein the artificial anchor includes the      query-relevant text.</p>
<p>7. A computer-implemented method, comprising:receiving a plurality of      search results at a client device in response to a search query, each of      the search results including a uniform resource locator identifying a      corresponding target document and an artificial anchor corresponding to      an intra-document portion of the corresponding target document, the      intra-document portion including query-relevant text that identifies the      intra-document portion of the corresponding target document;in response      to a selection of a search result from the search results at the client      device, transmitting from the client device a request for the      corresponding target document identified by the uniform resource locator      included in the selected search result and receiving at the client device      the corresponding target document in response to the request;      andprocessing the artificial anchor included in the selected search      result at the client device to cause the client device to display the      corresponding intra-document portion of the corresponding target      document.</p>
<p>8. The method of claim 7, wherein each of the artificial anchors comprise      a preassigned artificial anchor designator designating the artificial      anchor to an artificial anchor module in the client device.</p>
<p>9. The method of claim 8, wherein each of the artificial anchors includes      the preassigned artificial anchor designator as either a prefix or a      suffix and wherein the preassigned artificial anchor designator includes      a preassigned set of text characters.</p>
<p>10. The method of claim 8, wherein processing the artificial anchor      included in the selected search result at the client device to cause the      client device to display the corresponding intra-document portion of the      corresponding target document comprises highlighting the query-relevant      text that identifies the intra-document portion of the corresponding      target document.</p>
<p>11. The method of claim 7, wherein each of the search results includes the      query-relevant text, and wherein the query-relevant text for each search      result is extracted from the corresponding intra-document portion of the      corresponding target document.</p>
<p>12. A computer-implemented method, comprising:receiving a search result      page including a plurality of search results, each search result      including a search result link pointing to a corresponding target      document and a snippet extracted from an intra-document portion of the      corresponding target document;for each search result, generating an      intra-document link based on the snippet included in the search result by      appending an artificial anchor to the search result link, the      intra-document link pointing to the intra-document portion of the      corresponding target document; andin response to a selection of one of      the intra-document links, receiving the corresponding target document and      navigating directly to the intra-document portion of the corresponding      target document.</p>
<p>13. The method of claim 12, wherein each artificial anchor includes a      preassigned artificial anchor designator designating the anchor as      artificial to an artificial anchor module in a client device.</p>
<p>14. The method of claim 13, wherein each artificial anchor includes a      portion of the snippet extracted from the intra-document portion of the      corresponding target document.</p>
<p>15. A client device, comprising:a client browser configured to display a      search result that includes an artificial anchor appended to a uniform      resource locator for a target document, the artificial anchor      corresponding to an intra-document portion of the target document, and      wherein in response to the search result being selected, the client      browser transmits a request for the target document and receives the      target document in response to the request; andan artificial anchor      module configured to process the artificial anchor and to cause the      client browser to display the intra-document portion of the target      document corresponding to the artificial anchor.</p>
<p>16. The client device of claim 15, wherein the artificial anchor comprises      a preassigned artificial anchor designator designating the artificial      anchor to the artificial anchor module in the client device.</p>
<p>17. The client device of claim 16, wherein the search results includes at      least one snippet extracted from the intra-document portion of the target      document.</p>
<p>18. A computer program product embodied on a computer-readable medium, the      computer program product including instructions, which when executed by a      computer system, are operable to cause the computer system to perform      acts comprising:receiving an address for a target document, the target      document including query-relevant text that identifies an intra-document      portion of the target document;generating an artificial anchor      corresponding to the intra-document portion of the target document;      andappending the artificial anchor to the address.</p>
<p>19. The computer program product of 18, wherein the artificial anchor      includes the preassigned artificial anchor designator as either a prefix      or a suffix and wherein the preassigned artificial anchor designator      includes a preassigned set of text characters.</p>
<p>20. The computer program product of claim 18, wherein the artificial      anchor includes the query-relevant text.</p>
<p>21. A computer program product embodied on a computer-readable medium, the      computer program product including instructions, which when executed by a      client device, are operable to cause the client device to perform acts      comprising:receiving a plurality of search results at a client device in      response to a search query, each of the search results including a      uniform resource locator identifying a corresponding target document and      an artificial anchor corresponding to an intra-document portion of the      corresponding target document, the intra-document portion including      query-relevant text that identifies the intra-document portion of the      corresponding target document;in response to a selection of a search      result from the search results at the client device, transmitting from      the client device a request for the corresponding target document      identified by the uniform resource locator included in the selected      search result and receiving at the client device the corresponding target      document in response to the request; andprocessing the artificial anchor      included in the selected search result at the client device to cause the      client device to display the corresponding intra-document portion of the      corresponding target document.</p>
<hr /><strong><em>Description</em></strong></p>
<hr />RELATED APPLICATION</p>
<p>[0001]This application claims the benefit of U.S. application Ser. No.      10/750,183, filed on Dec. 31, 2003 and entitled &#8220;Systems and Methods for      Direct Navigation to Specific Portion of Target Document,&#8221; the entirety      of which is hereby incorporated herein by reference.</p>
<p>BACKGROUND OF THE INVENTION</p>
<p>[0002]1. Field of the Invention</p>
<p>[0003]Systems and methods for direct navigation to and/or highlighting a      specific portion of a target document such as query-relevant portion of      the document are disclosed.</p>
<p>[0004]2. Description of Related Art</p>
<p>[0005]When a user searches for information on the Internet by submitting a      query to a search engine, the search engine returns a results page that      often provides several hyperlinks to web pages that may be relevant to      the user&#8217;s query. Under each hyperlink, the results page typically also      provides query-relevant information or text, often referred to as      &#8220;snippets,&#8221; extracted from the webpage to which the corresponding      hyperlink refers. When the user clicks on a desired hyperlink on the      search results page, the client browser typically navigates to the top of      the target or destination webpage (or other document such as a Word,      Excel or Portable Document Format (PDF) document, for example) to which      the hyperlink refers. However, the target webpage may be long and/or the      search query terms may be buried deep within the webpage. To locate the      desired search terms, the user may perform a search for the desired text      from within the web browser and/or manually (visually) search for the      relevant information by scrolling through the webpage.</p>
<p>[0006]As an example, to find pizza places in or near zip code 94043, the      user may search for &#8220;pizza&#8221; at zip code 94043 using Google&#8217;s &#8220;Searching      by Location&#8221; search engine currently located at      http:Nlabs.google.com/location (i.e.,      http://labs.google.com/location?q-pizza&amp;geo.about.near=94043&amp;Search=Googl-     e+Search) as shown in FIG. 1. An exemplary results page returned by the      search engine is shown in FIG. 2 in which the first hyperlink 20 points      to http://www.waiter.corn/roundtable, which contains a list of 35 pizza      places in or near zip code 94043. In addition to the various hyperlinks,      the search engine also provides a snippet 26 extracted from the webpage      to which each hyperlink refers. As shown, certain text of each snippet 26      is in bold font so as to highlight certain text that may be useful as      determined by the search engine. For the first hyperlink 20, the search      engine highlights the text &#8220;Pizza 570 N. Shoreline Blvd. MOUNTAIN VIEW,      CA.&#8221; of the snippet 26 in bold.</p>
<p>[0007]If the user clicks on the first-listed hyperlink 20, the client      browser loads and displays the destination webpage 22 to which the      hyperlink 20 refers at the top of the webpage 22, as shown in FIG. 3.      However, the target document or webpage is typically not hosted or      authored by the same entity as that of the search engine or other source      or origin document such that the desired information is typically not      propagated from the source document to the target document. Thus, for      example, because the browser typically displays the webpage 22 at the top      of the webpage 22, the restaurant that is in or closest to zip code 94043      may or may not be initially displayed within the browser. In the current      example, the restaurant that is in or closest to zip code 94043, namely,      the Round Table Pizza at 570 N. Shoreline Blvd. in Mountain View, Calif.      is the tenth out of 35 listings on the destination webpage 22.</p>
<p>[0008]In order for the user to determine which of the listings is the most      query relevant listing, the user can determine as the user scrolls      through the 35 listings on the destination webpage 22 which of the      listings is the one that is most relevant to the user&#8217;s query, i.e., in      or closest to zip code 94043. In the current example, the user must      scroll down a number of screens before the query-relevant listing 24 is      even displayed within the browser, as shown in FIG. 4. Alternatively, the      user may take note of the content of the snippet 26 before navigating or      clicking away from the results page (FIG. 2) and then search for (either      visually by scrolling or by performing a text search from within the      browser) the content of the snippet.</p>
<p>[0009]As is evident, each result on the search results page generated by      the search engine only points to a webpage and not to anything more      granular. In other words, the snippet generated by the search engine is      only displayed to the user on the search results page and is not      propagated or otherwise utilized to directly navigate to the desired      location on the destination webpage.</p>
<p>SUMMARY OF THE INVENTION</p>
<p>[0010]Disclosed herein are apparatus, methods and systems for linking to      an intra-document portion of a target document. In an implementation, a      method includes receiving an address for a target document identified by      a search engine in response to a query, the target document including      query-relevant text that identifies an intra document portion of the      target document, the intra-document portion including the query relevant      text; generating an artificial anchor, the artificial anchor      corresponding to the intra document portion; and appending the artificial      anchor to the address. The artificial anchor can be undefined in the      target document. Other embodiments of this aspect include corresponding      systems, apparatus, and computer program products.</p>
<p>[0011]In another implementation, a method includes receiving a plurality      of search results at a client device in response to a search query, each      of the search results including a uniform resource locator identifying a      corresponding target document and an artificial anchor corresponding to      an intra-document portion of the corresponding target document, the      intra-document portion including query-relevant text that identifies the      intra document portion of the corresponding target document; in response      to a selection of a search result from the search results at the client      device, transmitting from the client device a request for the      corresponding target document identified by the uniform resource locator      included in the selected search result and receiving at the client device      the corresponding target document in response to the request; and      processing the artificial anchor included in the selected search result      at the client device to cause the client device to display the      corresponding intra-document portion of the corresponding target      document. The artificial anchor can be undefined in the target document.      Other embodiments of this aspect include corresponding systems,      apparatus, and computer program products.</p>
<p>[0012]In still another implementation, a method includes receiving a      search result page including a plurality of search results, each search      result including a search result link pointing to a corresponding target      document and a snippet extracted from an intra-document portion of the      corresponding target document; for each search result, generating an      intra-document link based on the snippet included in the search result by      appending an artificial anchor to the search result link, the      intra-document link pointing to the intra-document portion of the      corresponding target document; and in response to a selection of one of      the intra-document links, receiving the corresponding target document and      navigating directly to the intra-document portion of the corresponding      target document. Other embodiments of this aspect include corresponding      systems, apparatus, and computer program products.</p>
<p>[0013]The details of one or more embodiments of the subject matter      described in this specification are set forth in the accompanying      drawings and the description below. Other features, aspects, and      advantages of the subject matter will become apparent from the      description, the drawings, and the claims.</p>
<p>BRIEF DESCRIPTION OF THE DRAWINGS</p>
<p>[0014]The present invention will be readily understood by the following      detailed description in conjunction with the accompanying drawings,      wherein like reference numerals designate like structural elements.</p>
<p>[0015]FIG. 1 is an illustrative client browser window displaying a search      engine interface for entering a query.</p>
<p>[0016]FIG. 2 is an illustrative client browser window displaying search      results.</p>
<p>[0017]FIG. 3 is an illustrative client browser window displaying a webpage      to which a search result hyperlink refers at the top of the webpage.</p>
<p>[0018]FIG. 4 is an illustrative client browser window displaying the      webpage of FIG. 3 at a location that displays query-relevant information      within the browser.</p>
<p>[0019]FIG. 5 is an illustrative client browser window displaying the      webpage of FIG. 3 in which the display is automatically scrolled to the      query-relevant information of the webpage and the query-relevant      information highlighted.</p>
<p>[0020]FIG. 6 is a flowchart of an illustrative client-side process for      automatically scrolling to and highlighting the query-relevant      information of the webpage.</p>
<p>[0021]FIG. 7 is a flowchart of an illustrative server-side process for      generating and inserting an artificial named anchor in the URL of each      search result.</p>
<p>[0022]FIG. 8 is a flowchart of a modified portion of the process shown in      FIG. 7 in which the search engine also determines if the target webpage      already contains a named anchor at or near the snippet.</p>
<p>[0023]FIG. 9 is a block diagram of an illustrative network system.</p>
<p>[0024]FIG. 10 is a block diagram of an illustrative client device.</p>
<p>DESCRIPTION OF THE SPECIFIC EMBODIMENTS</p>
<p>[0025]Systems and methods for direct navigation to and/or highlighting a      specific portion of a target document such as query-relevant portion of      the document are disclosed. The following description is presented to      enable any person skilled in the art to make and use the invention.      Descriptions of specific embodiments and applications are provided only      as examples and various modifications will be readily apparent to those      skilled in the art. The general principles defined herein may be applied      to other embodiments and applications without departing from the spirit      and scope of the invention. Thus, the present invention is to be accorded      the widest scope encompassing numerous alternatives, modifications and      equivalents consistent with the principles and features disclosed herein.      For purpose of clarity, details relating to technical material that is      known in the technical fields related to the invention have not been      described in detail so as not to unnecessarily obscure the present      invention.</p>
<p>[0026]As discussed above, FIG. 1 is an illustrative client browser window      displaying the interface for a Google search engine that searches by      location. FIG. 2 is an illustrative client browser window displaying the      search results in response to a query for &#8220;pizza&#8221; and &#8220;94043.&#8221; in which      the first hyperlink 20 points to http://www.waiter.com/roundtable/ that      contains a list of pizza restaurants. In addition to the various      hyperlinks, the search engine also provides a snippet 26 extracted from      the webpage to which each hyperlink refers. As shown, certain text of      each snippet 26 is in bold font so as to highlight certain text that may      be useful as determined by the search engine. As an example, for the      first hyperlink 20, the search engine highlights the query relevant text      &#8220;Pizza 570 N. Shoreline Blvd. MOUNTAIN VIEW, CA.&#8221; of the snippet 26 in      bold. Conventionally, the user may click on the first hyperlink 20 to      cause the browser to navigate to the top of the webpage to which the      hyperlink 20 refers, as shown in FIG. 3. The user may then scroll down to      find and determine which of the 35 listed pizza restaurants is the one      closest to or located in zip code 94043 within the webpage 22 as shown in      FIG. 4. Alternatively, the user may take note of the content of the      snippet in the results page (FIG. 2) before navigating from the results      page to the destination webpage and then search for (either by scrolling      or by performing a text search from within the browser) the content of      the snippet.</p>
<p>[0027]Conventional named anchors are added by webpage authors to allow      users to navigate directly to a specific part or location within the      webpage. For example, the webpage author may create a named anchor with      &lt;A NAME=location&gt; Location &lt;/A&gt; at the specific part of the      webpage to which the reader can navigate directly from another part of      the same webpage or from a different webpage. The NAME attribute names a      section, e.g., text or image, so that other links can reference it via a      clickable HREF link containing a pound sign &#8220;#&#8221; followed by the anchor      name. In particular, the same or different webpage may include a      clickable HREF link: &lt;A href=&#8221;[URL]#location&gt; Jump to Location      &lt;/A&gt;, where [URL] is the URL of the target webpage. The HREF      attribute specifies the URL address to which the browser is to navigate      when the user clicks on the HREF link, i.e., &#8220;Jump to Location.&#8221; Note      that using the HREF link containing an anchor name requires that the      author of the webpage to have created the named anchor. If a named anchor      does not exist for the specific part of the webpage, then the HREF link      cannot be used to allow the reader to navigate directly to that specific      part of the webpage. If an HREF link refers to a non-existent anchor, the      browser simply navigates to the top of the target webpage.</p>
<p>[0028]With systems and methods described herein, mechanisms are provided      to generate or simulate links with artificial named anchors and to allow      the browser to recognize the artificial named anchor and navigate      directly to the desired specific part of the target webpage even when the      author of the webpage has not created a named anchor at the specific part      of the webpage. In particular, the systems and methods described herein      simulate the general functionality of the named anchor and the HREF link      to provide links containing artificial named anchors that allow      navigation directly to a specific part of the target webpage even when a      named anchor does not exist at the specific part of the target webpage.      Such links can be utilized by any webpage to provide a link to a specific      part of another target webpage. In particular, such links containing      artificial named anchors can be particularly useful for search result      pages returned by search engines.</p>
<p>[0029]The systems and methods described herein may be utilized to navigate      any document that may be provided from any suitable source as the      Internet, an intranet, or local memory, for example. The term URL as used      herein generally refers to a locator or address for any document, not      necessarily only those available on the Internet. Suitable target or      destination documents are typically those displayable by a client      browser, typically a client web browser, such as an Internet Explorer,      Netscape, Opera, or Mozilla browser.</p>
<p>[0030]Referring again to the example described above with reference to      FIGS. 1-4, the search results page returned by the search engine may      provide or otherwise simulate links to a specific part of a target      webpage such as the part of the target webpage that includes at least a      portion of the snippet 26 or to a portion that the server or search      engine determined to be similar or otherwise relevant. Thus when the user      clicks on a portion of the snippet 26, for example, the browser may      navigate directly to the part of the target webpage 22 that includes the      portion of the snippet 26, as shown in FIG. 5.</p>
<p>[0031]To further draw the user&#8217;s attention to the query-relevant listing      24 in the target webpage 22, at least some of the snippet, e.g., &#8220;570 N.      Shoreline,&#8221; may be highlighted. For example, a specific portion may be      highlighted, i.e., draw attention to or otherwise emphasize, by modifying      the format of the specific portion such as by underlining, bolding,      italicizing, foreground and/or background color changing, font and/or      size changing, border drawing, text animating (e.g., &#8220;marching red ants&#8221;,      etc.), aligning, kerning, style editing/adding/removing. The formatting      change may help draw the user&#8217;s attention to the specific portion by      rendering the specific portion inconsistent or distinct from other parts      of the document, site, etc. Additionally or alternatively, information in      the target document may be modified, for example, by adding, removing or      editing relevant (or irrelevant) information, such as by scrolling to,      altering or adding a link, adding an image, deleting surrounding aspect,      adding new text, adding a popup or hover window, adding and/or executing      JavaScript instructions and/or other computing instructions in other      languages or methods. The information change may add links to relevant      information known to the source page, to provide a method of returning to      the source page, to add known links to the target page, etc.</p>
<p>[0032]In one embodiment of the present invention, navigation by the      browser directly to the specified query-relevant intra-document portion      is implemented on the client side while an instruction to the client      browser to navigate directly to the intra-document portion is provided by      a source, e.g., a server or search engine, providing the clickable link.      In particular, in one embodiment of the invention the client browser has      installed therein an artificial named anchor module that may be      implemented via a toolbar such as the Google toolbar, a dynamic link      library (DLL) or any other type of plug-in, a browser helper object      (BHO), or any other suitable mechanism to implement the desired      functionality in the browser. Note that the functionality need not be      implemented using a module and that the browser, shell or Word document      viewer, etc. can implement the functionality natively without an add-on.      The artificial named anchor module enables the browser to recognize when      an URL includes an artificial named anchor. In one embodiment, the      artificial named anchor module may be implemented to recognize as      artificial any named anchor that begins with a preassigned, artificial      named anchor prefix. For example, any set of suitable preassigned text      characters may be utilized as the preassigned anchor prefix. In one      embodiment of the invention, the preassigned anchor prefix is relatively      obscure so as to reduce the likelihood of conflicts with actual anchors      in the destination webpage. Any other suitable mechanism for recognizing      an artificial named anchor may be utilized. As an example, the      preassigned text characters may appear anywhere within the name of the      artificial anchor, e.g., as a suffix or anywhere within the artificial      anchor.</p>
<p>[0033]In the example shown in FIG. 5, _g_&#8221; is used as the pre-assigned      artificial named anchor prefix. As shown in the address bar 28 in FIG. 5,      the URL the client browser received is      http://www.waiter.com/roundtable#_g.sub.&#8211;570+N.+Shoreline. As is      evident, the URL references an artificial named anchor &#8220;_g.sub.&#8211;570 N.      Shoreline&#8221; although the target webpage may or may not include such an      anchor. Upon recognizing that the named anchor is artificial, the browser      parses or strips the preassigned artificial prefix, e.g., _g_, from the      artificial anchor and searches for the remaining artificial anchor text      (e.g., &#8220;570 N. Shoreline&#8221;) in the target webpage. If the remaining      artificial anchor text is not located, the browser may display the      webpage at the top. Alternatively, if the remaining anchor text is      located, the browser may navigate directly to the portion of the webpage      containing the remaining anchor text and may also highlight the text. In      the above example, the remaining anchor text is the same as the      intra-document portion of the target webpage. It is noted the artificial      anchor text may optionally be encoded such that the client browser may      unencode the artificial anchor text to extract the artificial anchor text      before navigating directly to the intra-document portion of the target      webpage. In other implementations, different or no encoding mechanisms      can be employed.</p>
<p>[0034]As is evident, the client-side process may be implemented via the      artificial named anchor module installed in the browser. FIG. 6 is a      flowchart of an illustrative client-side process 200 for recognizing and      processing artificial named anchors and navigating directly to and      highlighting the query-relevant portion of the webpage. The process 200      begins with the user entering a query via a search engine interface at      block 202. At block 204, the search engine returns the search results      typically with one or more snippets for each hyperlink. At block 206, the      user selects from the search results and clicks on a selected hyperlink      to navigate to a target or destination webpage to which the hyperlink      refers. It is noted that for each given search result, the search results      page may provide any number of hyperlinks, optionally with reference to      an artificial anchor. It is noted that certain search results may not      provide a hyperlink that references an artificial named anchor as may be      the case where the target is a PDF document, for example.</p>
<p>Currently, the browser only displays a PDF document at the top of the      document although the browser may be similarly modified so as to be able      to search and navigate within the PDF document. A similar mechanism may      also be implemented for Word, Excel, and/or various other documents with      segmentable and/or searchable pieces.</p>
<p>[0035]At block 208, the browser determines if the selected hyperlink      references a named anchor. If not, then the navigation proceeds as normal      at block 210. Alternatively, if the selected hyperlink references a named      anchor, then the browser determines if the named anchor is an artificial      anchor at block 212. As noted above, the browser may determine that an      anchor is artificial if the anchor begins with a preassigned artificial      anchor prefix e.g., -g-. If the browser determines that the anchor is not      artificial, then the navigation proceeds as normal at block 210.      Alternatively, if the browser determines that the anchor is artificial,      then the browser may parse and optionally unencode the artificial anchor      at block 214. In other implementations, different or no encoding      mechanisms can be employed. For example, the browser may strip the      artificial anchor of the preassigned anchor prefix, e.g., -g-. The      browser then loads the destination webpage and directly navigates to and      may also highlight, such as by modifying the formatting of and/or      information in the portion of the destination webpage specified by the      artificial anchor at block 216. In effect, the browser typically would      find, highlight and scroll to the first instance of the portion of the      destination webpage specified by the artificial anchor.</p>
<p>[0036]It is noted that in one embodiment of the present invention if the      client web browser does not have artificial named anchor module      installed, then the browser will default to ignoring the artificial named      anchor as such an anchor would not exist in the actual target webpage.      However, if the client web browser (or the client device in general) does      have the appropriate artificial named anchor module installed, the      artificial named anchor module will process artificial anchors according      to the exemplary client-side process 200 described above. Thus, the      artificial anchor serves as an instruction to the artificial named anchor      module or to the client browser in general.</p>
<p>[0037]The client-side process 200 may allow any source (referral webpage      or document) to utilize the artificial named anchor mechanism such that      any client device with the artificial named anchor module installed would      perform the artificial anchor recognition and processing process 200 to      navigate directly to the portion of the target webpage referenced by the      artificial anchor. Alternatively, the artificial named anchor module may      additionally examine the source (referral webpage) of a hyperlink with an      artificial named anchor and only process those whose source is      authorized. For example, the client process may determine whether a      source is authorized by sniffing an appropriate cookie or by use of      headers, for example.</p>
<p>[0038]Artificial anchors are generally most useful for searching for text      within the target webpage. However, other items to be displayed in the      browser and/or otherwise defined by the target webpage may also be      searched. Examples include image name, tag identification (ID), tag name,      location in the document object model (DOM), HTML byte offset, a general      offset or location definition within the search result document, etc. In      addition to using artificial anchors to find, highlight and scroll to the      referenced text or other portion of the target webpage, the client-side      process may alternatively or additionally perform other functions. For      example, the client-side process may be implemented to find and highlight      occurrences of specific words within the target webpage, e.g., by using a      different preassigned artificial anchor prefix such as _gh_.&#8221; Thus,      multiple occurrences of the specific word(s) or phrase(s) being searched      would be highlighted. As another example, the client-side process may be      implemented to find and scroll to a specified image based on, for      example, the image name, within the target webpage, e.g., by using a      different preassigned artificial anchor prefix such as &#8220;_gi_.&#8221; In      addition, the artificial anchor module of the client device may      alternatively perform a fuzzy (best effort) search rather than an      absolute or strict search when searching for the specific portion      referred to by the artificial anchor.</p>
<p>[0039]While the artificial anchors are recognized and processed by the      client-side process 200, in one embodiment the artificial anchors are      generated by a server-side process although a client-side process may be      similarly implemented. FIG. 7 is a flowchart of an illustrative search      engine server-side process 220 for generating and appending an artificial      anchor to the URL of the search result. In particular, at block 222, the      search engine receives a query from the user. At block 224, the search      engine performs the search and generates the search result. Each search      result may include an URL or hyperlink referencing a target or      destination webpage and one or more snippets extracted from the target      webpage to which the corresponding hyperlink refers. Any suitable search      methodology may be employed in block 224.</p>
<p>[0040]For each search result 226, the search engine may determine whether      the search result hyperlink already includes or references an anchor at      block 228. Search engines typically do not return hyperlinks with anchors      as anchors are generally not indexed. If such is the case, block 228 may      be eliminated. If the URL already includes an anchor, then the search      engine may continue onto the next search result at 226. Alternatively,      the search engine may strip the anchor from the URL at block 230.</p>
<p>[0041]If the search result URL already includes an anchor and the search      engine performs block 230 or if the search result URL does not include an      anchor, then the search engine generates a modified search result URL by      appending an artificial named anchor to the search result URL at block      232. For example, the search engine may use a preassigned artificial      anchor prefix such as &#8220;_g_&#8221; or any other mechanism to signal to the      client device that the named anchor is artificially generated.</p>
<p>[0042]As noted above, each search result may include a search result      hyperlink referencing the target webpage and one or more snippets      extracted from the target webpage to which the hyperlink refers. When      there are multiple snippets associated with a given search result, the      snippets are typically separated an ellipsis, such as a three-dot      ellipsis. The search engine may transform each snippet into at least one      active snippet in which each active snippet is a hyperlink with an      artificial anchor that references the portion of the target webpage      containing the corresponding snippet or portion of the snippet. For      example, each snippet may optionally be parsed into multiple snippet      segments in which the search engine transforms each snippet segment into      a separate hyperlink each with a corresponding artificial anchor. In one      embodiment, the snippet may be segmented in accordance with punctuation      marks that the snippet contains. In an alternative embodiment, the active      snippet may link to an artificial anchor that simply points to the search      term or phrase, for example.</p>
<p>[0043]After the search engine generates the hyperlink with the artificial      anchor, the process 220 then continues with the next search result at      226. After all search results have been processed, the search engine      server transmits the search results page to the client device at block      234.</p>
<p>[0044]In one alternative embodiment, the search engine may also determine      if the target webpage already contains a named anchor at or near the      snippet, either before or instead of performing block 232 as shown in the      flowchart of FIG. 8 illustrating a modified portion of the process 220.      As shown, after determining that the URL does not already include an      anchor at block 228, the search engine may determine if the target      webpage already contains a named anchor at or near the snippet at block      240. Block 240 may be performed before or instead of block 232. If the      target webpage already contains a named anchor at or near the snippet as      determined in block 240, then the search engine utilizes that anchor by      appending the anchor to the URL of the search result at block 242 and the      process continues with the next search result.</p>
<p>[0045]Process 220 may be implemented on the server side although it may be      alternatively or additionally implemented on the client side such as via      the client-side artificial anchor module or other suitable mechanism.      When the process of generating artificial anchors is implemented on the      client side and/or when the artificial anchor generation process is not      implemented by the search engine utilized by the user, the client-side      process may examine each snippet in each search result and generate a      hyperlink referencing an artificial named anchor.</p>
<p>[0046]As is evident, the combination of generating artificial anchors and      recognizing and processing artificial anchors improves the user&#8217;s web      browsing experience by allowing the user to navigate directly to a      specific intra-document portion of the target document or webpage      corresponding to the relevant snippet. The artificial anchors can be used      with any suitable search results such as the Google&#8217;s standard search,      Geo-Search, Froogle search, etc.</p>
<p>[0047]The use of artificial anchors is merely one embodiment. For example,      the server may provide hints or instructions to the client browser by      inserting hidden tags in the results page or attributes on the A tags.      The installed browser module on the client side skims the DOM of each      results page and extracts the relevant information from these hints or      instructions. Thus, when the user navigates to a target webpage or      document, the installed browser module applies the desired result of      these instructions to achieve similar functions. Client web browsers that      do not expect such instructions, i.e., without the appropriate module      installed, for example, would simply ignore the instructions. In such an      embodiment, the user would not see an artificial anchor appended to the      URL in the address bar of the client browser.</p>
<p>[0048]In one embodiment, to minimize the impact of transmitting extra      information from the server that will only be ignored by the client      device or more specifically the client browser if the client device is      not installed with the artificial anchor module, the server may sniff the      relevant cookie on the client device and determine if the artificial      anchor module is installed. To facilitate such a process, installation of      the artificial anchor module may cause the relevant cookie to be created      and modified. If the artificial anchor module is not installed, then the      server, e.g., search engine, is informed that the artificial anchor      module is not installed when the query is submitted and the search engine      may proceed with the search without generating references to artificial      links, for example.</p>
<p>[0049]In an alternative embodiment, the mechanism for automatically      scrolling to a specific portion of a target page may be implemented on      the server side. With server side implementation, a client side      artificial anchor module may not be needed. For example, in response to      the user clicking on a particular snippet line, the server returns a      cached page directly scrolled to a portion of the target document      corresponding to the selected snipped. In one embodiment, the search      query itself and the particular snippet line number are provided to the      server when the user clicking on a particular snippet line. The server      then regenerates the snippets, i.e., rerun the snippet generation      algorithm, and return the cached page using, for example, JavaScript or      other mechanism to automatically scroll to the portion of the cached page      that corresponds to the selected snippet line. As another example, each      snippet for a given target page is a link to a portion corresponding to      the snippet in a cached page. In particular, the link to the cached page      may include the query as well as additional information, e.g., a hint      such as in the form of an anchor, to appropriately position or scroll the      cached page in the browser using a named anchor. The cached page in turn      may include named anchors each prefixing a portion of the cached page      corresponding to the snippet. In general, with server side      implementation, a server may proxy the target page or an intermediary,      e.g., a web proxy or IBMYs Web Intermediary (WBI), may implement the      server side functionality.</p>
<p>[0050]In one embodiment, the system may be configured such that the      browser may simultaneously display all the snippet lines, such as with      the use of multiple frames each with at least one of the snippet lines      visible within the browser. Alternatively, the server may be configured      so as to return a cached page within one or more frames within a frame      set with each frame scrolled to display one of the snippet lines. For      example, where a snippet generation engine returns three snippet lines,      the browser may contain three frames within a frame set, with each frame      displaying the same target or cached page and each scrolled to a      different corresponding one of the snippet lines so as to simultaneously      display all of the snippet lines generated by the snippet engine.</p>
<p>[0051]FIG. 9 illustrates an exemplary networked system 100 in which      systems and methods described herein may be implemented. The networked      system 100 may include client devices 102 in communication with servers      104 and 106 via a network 108. The network 108 may be a local area      network (LAN), a wide area network (WAN), a telephone network, such as      the Public Switched Telephone Network (PSTN), an intranet, the Internet,      or any suitable combination of networks. For purposes of clarity, two      client devices 102 and three servers 104 and 106 are illustrated as      connected to the network 140. However, any suitable number of client      devices 102 and servers 104, 106 may be connected via the network 140. In      addition, a given client device may perform the functions of a server and      a server may perform the functions of a client device. The client devices      102 may include devices, such mainframes, minicomputers, personal      computers, laptops, personal digital assistants, or the like, capable of      connecting to the network 108. The client devices 102 may transmit data      over the network 108 and/or receive data from the network 108 via a wired      (e.g., copper, optical, etc.) and/or wireless connection.</p>
<p>[0052]FIG. 10 illustrates an exemplary client device 102 suitable for      implementation in the networked system 100 of FIG. 9. The client device      102 may include a bus 120, a processor 122, a main memory 124, a read      only memory (ROM) 126, a storage device 128, an input device 130, an      output device 132, and a communication interface 134. The bus 120 may      include one or more conventional buses that permit communication among      the components of the client device 102. The processor 122 may include      any type of conventional processor or microprocessor that interprets and      executes instructions. The main memory 124 may include a random access      memory (RAM) or another type of dynamic storage device that stores      information and instructions for execution by the processor 122. The ROM      126 may include a conventional ROM device or another type of static      storage device that stores static information and instructions for use by      the processor 122. The storage device 128 may include a magnetic and/or      optical recording medium, for example, and its corresponding drive.</p>
<p>[0053]The input device 130 may include one or more conventional mechanisms      that permit a user to input information to the client device 102 such as      a keyboard, a mouse, a pen, voice recognition and/or biometric      mechanisms, etc. The output device 132 may include one or more      conventional mechanisms that output information to the user, including a      display, a printer, a speaker, etc. The communication interface 134 may      include any transceiver-like mechanism that enables the client device 102      to communicate with other devices and/or systems. For example, the      communication interface 134 may include mechanisms for communicating with      another device or system via a network, such as network 108.</p>
<p>[0054]The client devices 102 perform certain searching-related operations      such as those described above. The client devices 102 may perform these      operations in response to the processor 122 executing software      instructions contained in a computer-readable medium, such as memory 124.      A computer-readable medium may be defined as one or more memory devices      and/or carrier waves. The software instructions may be read into memory      124 from another computer-readable medium such as the data storage device      128 or from another device via the communication interface 134. The      software instructions contained in memory 124 causes processor 122 to      perform search-related activities described below. Alternatively,      hardwired circuitry may be used in place of or in combination with      software instructions to implement search-related processes described      herein. Thus, the present invention is not limited to any specific      combination of hardware circuitry and software.</p>
<p>[0055]The servers 104 and 106 may include one or more types of computer      systems, such as a mainframe, minicomputer, or personal computer capable      of connecting to the network 108 to enable servers 104, 106 to      communicate with the client devices 102. In alternative implementations,      the servers 104, 106 may include mechanisms for directly connecting to      one or more client devices 102. The servers 104, 106 may transmit data      over the network 108 or receive data from the network 108 via a wired or      wireless connection.</p>
<p>[0056]The servers may be configured in a manner similar to that described      above in reference to FIG. 10 for client device 102. In one      implementation, the server 106 may include a search engine 110 usable by      the client devices 102. The servers 104 may store documents (e.g., web      pages) accessible by the client devices 102.</p>
<p>[0057]While the exemplary embodiments of the present invention are      described and illustrated herein, it will be appreciated that they are      merely illustrative and that modifications can be made to these      embodiments without departing from the spirit and scope of the invention.      Thus, the scope of the invention is intended to be defined only in terms      of the following claims as may be amended, with each claim being      expressly incorporated into this Description of Specific Embodiments as      an embodiment of the invention.</p>
<div class="shr-publisher-1322"></div><!-- Start Shareaholic LikeButtonSetBottom --><!-- End Shareaholic LikeButtonSetBottom -->]]></content:encoded>
			<wfw:commentRss>http://www.bostonmediadomain.com/google-patent-anchor-based-google-sitelinks/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google Patent application 20050071741</title>
		<link>http://www.bostonmediadomain.com/google-patent-application-20050071741/</link>
		<comments>http://www.bostonmediadomain.com/google-patent-application-20050071741/#comments</comments>
		<pubDate>Mon, 15 Jun 2009 15:10:04 +0000</pubDate>
		<dc:creator>jeff selig</dc:creator>
				<category><![CDATA[Commentary & Analysis]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google patent]]></category>
		<category><![CDATA[PPC]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://bostonmediadomain.com/?p=693</guid>
		<description><![CDATA[Abstract A system identifies a document and obtains one or more types of history data associated with the document. The system may generate a score for the document based, at least in part, on the one or more types of history data The presentation order of this document has been reversed to show the characteristics of the invention first and the legal information secondly. To see the original document please click here Description BACKGROUND OF THE INVENTION [0002] 1. Field of the Invention [0003] The present invention relates generally to information retrieval systems and, more particularly, to systems and methods for generating search results based, at least in part, on historical data associated with relevant documents. [0004] 2. Description of Related Art [0005] The World Wide Web (&#8220;web&#8221;) contains a vast amount of information. Search engines assist users in locating desired portions of this information by cataloging web documents. Typically, in response to a user&#8217;s request, a search engine returns links to documents relevant to the request. [0006] Search engines may base their determination of the user&#8217;s interest on search terms (called a search query) provided by the user. The goal of a search engine is to identify links to [...]]]></description>
			<content:encoded><![CDATA[<!-- Start Shareaholic LikeButtonSetTop --><!-- End Shareaholic LikeButtonSetTop --><div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fgoogle-patent-application-20050071741%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fgoogle-patent-application-20050071741%2F&amp;source=seosem&amp;style=normal&amp;service=is.gd&amp;hashtags=google,google+patent,PPC,search,SEO&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<h4 class="style34">Abstract</h4>
<p class="style37"><strong>A system identifies a document and obtains one or more types of history data associated with the document. The system may generate a score for the document based, at least in part, on the one or more types of history data</strong></p>
<p align="left">The presentation order of this document has been reversed to show the characteristics of the invention first and the legal information secondly. <span class="style3">To see the original document please <a href="http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;Sect2=HITOFF&amp;p=1&amp;u=/netahtml/PTO/search-bool.html&amp;r=1&amp;f=G&amp;l=50&amp;co1=AND&amp;d=PG01&amp;s1=20050071741&amp;OS=20050071741&amp;RS=20050071741">click here</a></span></p>
<h4 class="style14">Description</h4>
<div>
<p align="center"><span class="style10">BACKGROUND OF THE INVENTION</span></p>
<p>[0002] <span class="style10">1. Field of the Invention</span></p>
<p>[0003] The present invention relates generally to information retrieval systems and, more particularly, to systems and methods for generating search results based, at least in part, on historical data associated with relevant documents.</p>
<p>[0004]<span class="style10"> 2. Description of Related Art </span></p>
<p>[0005] The World Wide Web (&#8220;web&#8221;) contains a vast amount of information. Search engines assist users in locating desired portions of this information by cataloging web documents. Typically, in response to a user&#8217;s request, a search engine returns links to documents relevant to the request.</p>
<p>[0006] Search engines may base their determination of the user&#8217;s interest on search terms (called a search query) provided by the user. The goal of a search engine is to identify links to high quality relevant results based on the search query. Typically, the search engine accomplishes this by matching the terms in the search query to a corpus of pre-stored web documents. Web documents that contain the user&#8217;s search terms are considered &#8220;hits&#8221; and are returned to the user.</p>
<p>[0007] Ideally, a search engine, in response to a given user&#8217;s search query, will provide the user with the most relevant results. One category of search engines identifies relevant documents based on a comparison of the search query terms to the words contained in the documents. Another category of search engines identifies relevant documents using factors other than, or in addition to, the presence of the search query terms in the documents. One such search engine uses information associated with links to or from the documents to determine the relative importance of the documents.</p>
<p>[0008] Both categories of search engines strive to provide high quality results for a search query. There are several factors that may affect the quality of the results generated by a search engine. For example, some web site producers use spamming techniques to artificially inflate their rank. Also, &#8220;stale&#8221; documents (i.e., those documents that have not been updated for a period of time and, thus, contain stale data) may be ranked higher than &#8220;fresher&#8221; documents (i.e., those documents that have been more recently updated and, thus, contain more recent data). In some particular contexts, the higher ranking stale documents degrade the search results.</p>
<p>[0009] Thus, there remains a need to improve the quality of results generated by search engines.</p>
<p align="center"><span class="style10">SUMMARY OF THE INVENTION</span></p>
<p>[0010] Systems and methods consistent with the principles of the invention may score documents based, at least in part, on history data associated with the documents. This scoring may be used to improve search results generated in connection with a search query.</p>
<p>[0011] According to one aspect consistent with the principles of the invention, a method for scoring a document is provided. The method may include identifying a document and obtaining one or more types of history data associated with the document. The method may further include generating a score for the document based, at least in part, on the one or more types of history data.</p>
<p>[0012] According to another aspect, a method for scoring documents is provided. The method may include determining an age of linkage data associated with a linked document and ranking the linked document based on a decaying function of the age of the linkage data.</p>
<p align="center"><span class="style10">BRIEF DESCRIPTION OF THE DRAWINGS </span></p>
<p>[0013] The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an embodiment of the invention and, together with the description, explain the invention. In the drawings,</p>
<p>[0014] FIG. 1 is a diagram of an exemplary network in which systems and methods consistent with the principles of the invention may be implemented;</p>
<p>[0015] FIG. 2 is an exemplary diagram of a client and/or server of FIG. 1 according to an implementation consistent with the principles of the invention;</p>
<p>[0016] FIG. 3 is an exemplary functional block diagram of the search engine of FIG. 1 according to an implementation consistent with the principles of the invention; and</p>
<p>[0017] FIGS. 4 is a flowchart of exemplary processing for scoring documents according to an implementation consistent with the principles of the invention.</p>
<p align="center"><span class="style10">DETAILED DESCRIPTION </span></p>
<p>[0018] The following detailed description of the invention refers to the accompanying drawings. The same reference numbers in different drawings may identify the same or similar elements. Also, the following detailed description does not limit the invention.</p>
<p>[0019] Systems and methods consistent with the principles of the invention may score documents using, for example, history data associated with the documents. The systems and methods may use these scores to provide high quality search results.</p>
<p>[0020] A &#8220;<strong>document</strong>,&#8221; as the term is used herein, is to be broadly interpreted to include any machine-readable and machine-storable work product. A document may include an e-mail, a web site, a file, a combination of files, one or more files with embedded links to other files, a news group posting, a blog, a web advertisement, etc. In the context of the Internet, a common document is a web page. Web pages often include textual information and may include embedded information (such as meta information, images, hyperlinks, etc.) and/or embedded instructions (such as Javascript, etc.). A page may correspond to a document or a portion of a document. Therefore, the words &#8220;page&#8221; and &#8220;document&#8221; may be used interchangeably in some cases. In other cases, a page may refer to a portion of a document, such as a sub-document. It may also be possible for a page to correspond to more than a single document.</p>
<p>[0021] In the description to follow, documents may be described as having links to other documents and/or links from other documents. For example, when a document includes a link to another document, the link may be referred to as a &#8220;<strong>forward link</strong>.&#8221; When a document includes a link from another document, the link may be referred to as a &#8220;<strong>back link</strong>.&#8221; When the term &#8220;link&#8221; is used, it may refer to either a back link or a forward link.</p>
<p align="center"><span class="style10">EXEMPLARY NETWORK CONFIGURATION </span></p>
<p>[0022] FIG. 1 is an exemplary diagram of a network 100 in which systems and methods consistent with the principles of the invention may be implemented. Network 100 may include multiple clients 110 connected to multiple servers 120-140 via a network 150. Network 150 may include a local area network (LAN), a wide area network (WAN), a telephone network, such as the Public Switched Telephone Network (PSTN), an intranet, the Internet, a memory device, another type of network, or a combination of networks. Two clients 110 and three servers 120-140 have been illustrated as connected to <strong>network 150</strong> for simplicity. In practice, there may be more or fewer clients and servers. Also, in some instances, a client may perform the functions of a server and a server may perform the functions of a client.</p>
<p>[0023] <strong>Clients 110</strong> may include client entities. An entity may be defined as a device, such as a wireless telephone, a personal computer, a personal digital assistant (PDA), a lap top, or another type of computation or communication device, a thread or process running on one of these devices, and/or an object executable by one of these device. <strong>Servers 120-140</strong> may include server entities that gather, process, search, and/or maintain documents in a manner consistent with the principles of the invention. Clients 110 and servers 120-140 may connect to network 150 via wired, wireless, and/or optical connections.</p>
<p>[0024] In an implementation consistent with the principles of the invention, server 120 may include a <strong>search engine 125</strong> usable by clients 110. Server 120 may crawl a corpus of documents (e.g., web pages), index the documents, and store information associated with the documents in a repository of crawled documents. Servers 130 and 140 may store or maintain documents that may be crawled by server 120. While servers 120-140 are shown as separate entities, it may be possible for one or more of servers 120-140 to perform one or more of the functions of another one or more of servers 120-140. For example, it may be possible that two or more of servers 120-140 are implemented as a single server. It may also be possible for a single one of servers 120-140 to be implemented as two or more separate (and possibly distributed) devices.</p>
<p align="center"><span class="style10">EXEMPLARY CLIENT/SERVER ARCHITECTURE</span></p>
<p>[0025] FIG. 2 is an exemplary diagram of a client or server entity (hereinafter called &#8220;client/server entity&#8221;), which may correspond to one or more of clients 110 and servers 120-140, according to an implementation consistent with the principles of the invention. The client/server entity may include a bus 210, a processor 220, a main <strong>memory 230</strong>, a read only memory (ROM) 240, a storage device 250, one or more input devices 260, one or more output devices 270, and a communication <strong>interface 280</strong>. Bus 210 may include one or more conductors that permit communication among the components of the client/server entity.</p>
<p>[0026] <strong>Processor 220</strong> may include one or more conventional processors or microprocessors that interpret and execute instructions. Main memory 230 may include a random access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by processor 220. ROM 240 may include a conventional ROM device or another type of static storage device that stores static information and instructions for use by processor 220. Storage device 250 may include a magnetic and/or optical recording medium and its corresponding drive.</p>
<p>[0027] <strong>Input device(s) 260</strong> may include one or more conventional mechanisms that permit an operator to input information to the client/server entity, such as a keyboard, a mouse, a pen, voice recognition and/or biometric mechanisms, etc. <strong>Output device(s) 270</strong> may include one or more conventional mechanisms that output information to the operator, including a display, a printer, a speaker, etc. Communication interface 280 may include any transceiver-like mechanism that enables the client/server entity to communicate with other devices and/or systems. For example, communication interface 280 may include mechanisms for communicating with another device or system via a network, such as network 150.</p>
<p>[0028] As will be described in detail below, the client/server entity, consistent with the principles of the invention, perform certain searching-related operations. The client/server entity may perform these operations in response to processor 220 executing software instructions contained in a computer-readable medium, such as memory 230. A computer-readable medium may be defined as one or more physical or logical memory devices and/or carrier waves.</p>
<p>[0029] The software instructions may be read into memory 230 from another computer-readable medium, such as data storage <strong>device 250</strong>, or from another device via communication interface 280. The software instructions contained in memory 230 may cause processor 220 to perform processes that will be described later. Alternatively, hardwired circuitry may be used in place of or in combination with software instructions to implement processes consistent with the principles of the invention. Thus, implementations consistent with the principles of the invention are not limited to any specific combination of hardware circuitry and software.</p>
<p align="center"><span class="style10">EXEMPLARY SEARCH ENGINE </span></p>
<p>[0030] FIG. 3 is an exemplary functional block diagram of search engine 125 according to an implementation consistent with the principles of the invention.<span class="style12"> Search engine 125 may include document locator 310, history component 320, and ranking component 330</span>. As shown in FIG. 3, one or more of document locator 310 and history component 320 may connect to a document corpus 340. Document corpus 340 may include information associated with documents that were previously crawled, indexed, and stored, for example, in a database accessible by search engine 125. History data, as will be described in more detail below, may be associated with each of the documents in document corpus 340. The history data may be stored in document corpus 340 or elsewhere.</p>
<p>[0031] <span class="style13">Document locator</span> 310 may identify a set of documents whose contents match a user search query. Document locator 310 may initially locate documents from document corpus 340 by comparing the terms in the user&#8217;s search query to the documents in the corpus. In general, processes for indexing documents and searching the indexed collection to return a set of documents containing the searched terms are well known in the art. Accordingly, this functionality of document locator 310 will not be described further herein.</p>
<p>[0032] <span class="style13">History component</span> 320 may gather history data associated with the documents in document corpus 340. In implementations consistent with the principles of the invention, the history data may include data relating to: document inception dates; document content updates/changes; query analysis; link-based criteria; anchor text (e.g., the text in which a hyperlink is embedded, typically underlined or otherwise highlighted in a document); traffic; user behavior; domain-related information; ranking history; user maintained/generated data (e.g., bookmarks); unique words, bigrams, and phrases in anchor text; linkage of independent peers; and/or document topics. These different types of history data are described in additional detail below. In other implementations, the history data may include additional or different kinds of data.</p>
<p>[0033] <span class="style13">Ranking component</span> 330 may assign a ranking score (also called simply a &#8220;score&#8221; herein) to one or more documents in document corpus 340. Ranking component 330 may assign the ranking scores prior to, independent of, or in connection with a search query. When the documents are associated with a search query (e.g., identified as relevant to the search query), search engine 125 may sort the documents based on the ranking score and return the sorted set of documents to the client that submitted the search query. Consistent with aspects of the invention, the ranking score is a value that attempts to quantify the quality of the documents. In implementations consistent with the principles of the invention, the score is based, at least in part, on the history data from history component 320.</p>
<p align="center"><span class="style10">EXEMPLARY HISTORY DATA </span></p>
<p>[0034] <span class="style10">Document Inception Date </span></p>
<p>[0035] According to an implementation consistent with the principles of the invention, a document&#8217;s inception date may be used to generate (or alter) a score associated with that document. The term &#8220;date&#8221; is used broadly here and may, thus, include time and date measurements. As described below, there are several techniques that can be used to determine a document&#8217;s inception date. Some of these techniques are &#8220;biased&#8221; in the sense that they can be influenced by third parties desiring to improve the score associated with a document. Other techniques are not biased. Any of these techniques, combinations of these techniques, or yet other techniques may be used to determine a document&#8217;s inception date.</p>
<p>[0036] According to one implementation, the inception date of a document may be determined from the date that search engine 125 first learns of or indexes the document. Search engine 125 may discover the document through crawling, submission of the document (or a representation/summary thereof) to search engine 125 from an &#8220;outside&#8221; source, a combination of crawl or submission-based indexing techniques, or in other ways. Alternatively, the inception date of a document may be determined from the date that search engine 125 first discovers a link to the document.</p>
<p>[0037] According to another implementation, the date that a domain with which a document is registered may be used as an indication of the inception date of the document. According to yet another implementation, the first time that a document is referenced in another document, such as a news article, newsgroup, mailing list, or a combination of one or more such documents, may be used to infer an inception date of the document. According to a further implementation, the date that a document includes at least a threshold number of pages may be used as an indication of the inception date of the document. According to another implementation, the inception date of a document may be equal to a time stamp associated with the document by the server hosting the document. Other techniques, not specifically mentioned herein, or combinations of techniques could be used to determine or infer a document&#8217;s inception date.</p>
<p>[0038] Search engine 125 may use the inception date of a document for scoring of the document. For example, it may be assumed that a document with a fairly recent inception date will not have a significant number of links from other documents (i.e., back links). For existing link-based scoring techniques that score based on the number of links to/from a document, this recent document may be scored lower than an older document that has a larger number of links (e.g., back links). When the inception date of the documents are considered, however, the scores of the documents may be modified (either positively or negatively) based on the documents&#8217; inception dates.</p>
<p>[0039] Consider the example of a document with an inception date of yesterday that is referenced by 10 back links. This document may be scored higher by search engine 125 than a document with an inception date of 10 years ago that is referenced by 100 back links because the rate of link growth for the former is relatively higher than the latter. While a spiky rate of growth in the number of back links may be a factor used by search engine 125 to score documents, it may also signal an attempt to spam search engine 125. Accordingly, in this situation, search engine 125 may actually lower the score of a document(s) to reduce the effect of spamming.</p>
<p>[0040] Thus, according to an implementation consistent with the principles of the invention, search engine 125 may use the inception date of a document to determine a rate at which links to the document are created (e.g., as an average per unit time based on the number of links created since the inception date or some window in that period). This rate can then be used to score the document, for example, giving more weight to documents to which links are generated more often.</p>
<p>[0041] In one implementation, search engine 125 may modify the link-based score of a document as follows:</p>
<p>H=L/log(F+2),</p>
<p>[0042] where H may refer to the history-adjusted link score, L may refer to the link score given to the document, which can be derived using any known link scoring technique (e.g., the scoring technique described in U.S. Pat. No. 6,285,999) that assigns a score to a document based on links to/from the document, and F may refer to elapsed time measured from the inception date associated with the document (or a window within this period).</p>
<p>[0043] For some queries, older documents may be more favorable than newer ones. As a result, it may be beneficial to adjust the score of a document based on the difference (in age) from the average age of the result set. In other words, search engine 125 may determine the age of each of the documents in a result set (e.g., using their inception dates), determine the average age of the documents, and modify the scores of the documents (either positively or negatively) based on a difference between the documents&#8217; age and the average age.</p>
<p>[0044] <span class="style12">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to the inception date of the document</span>.</p>
<p>[0045] <span class="style10">Content Updates/Changes </span></p>
<p>[0046] According to an implementation consistent with the principles of the invention, information relating to a manner in which a document&#8217;s content changes over time may be used to generate (or alter) a score associated with that document. For example, a document whose content is edited often may be scored differently than a document whose content remains static over time. Also, a document having a relatively large amount of its content updated over time might be scored differently than a document having a relatively small amount of its content updated over time.</p>
<p>[0047] In one implementation, search engine 125 may generate a content update score (U) as follows:</p>
<p>[0048] U=f(UF, UA),</p>
<p>[0049] where f may refer to a function, such as a sum or weighted sum, UF may refer to an update frequency score that represents how often a document (or page) is updated, and UA may refer to an update amount score that represents how much the document (or page) has changed over time. UF may be determined in a number of ways, including as an average time between updates, the number of updates in a given time period, etc.</p>
<p>[0050] UA may also be determined as a function of one or more factors, such as the number of &#8220;new&#8221; or unique pages associated with a document over a period of time. Another factor might include the ratio of the number of new or unique pages associated with a document over a period of time versus the total number of pages associated with that document. Yet another factor may include the amount that the document is updated over one or more periods of time (e.g., n % of a document&#8217;s visible content may change over a period t (e.g., last m months)), which might be an average value. A further factor might include the amount that the document (or page) has changed in one or more periods of time (e.g., within the last x days).</p>
<p>[0051] According to one exemplary implementation, UA may be determined as a function of differently weighted portions of document content. For instance, content deemed to be unimportant if updated/changed, such as Javascript, comments, advertisements, navigational elements, boilerplate material, or date/time tags, may be given relatively little weight or even ignored altogether when determining UA. On the other hand, content deemed to be important if updated/changed (e.g., more often, more recently, more extensively, etc.), such as the title or anchor text associated with the forward links, could be given more weight than changes to other content when determining UA.</p>
<p>[0052] UF and UA may be used in other ways to influence the score assigned to a document. For example, the rate of change in a current time period can be compared to the rate of change in another (e.g., previous) time period to determine whether there is an acceleration or deceleration trend. Documents for which there is an increase in the rate of change might be scored higher than those documents for which there is a steady rate of change, even if that rate of change is relatively high. The amount of change may also be a factor in this scoring. For example, documents for which there is an increase in the rate of change when that amount of change is greater than some threshold might be scored higher than those documents for which there is a steady rate of change or an amount of change is less than the threshold.</p>
<p>[0053] In some situations, data storage resources may be insufficient to store the documents when monitoring the documents for content changes. In this case, search engine 125 may store representations of the documents and monitor these representations for changes. For example, search engine 125 may store &#8220;signatures&#8221; of documents instead of the (entire) documents themselves to detect changes to document content. In this case, search engine 125 may store a term vector for a document (or page) and monitor it for relatively large changes. According to another implementation, search engine 125 may store and monitor a relatively small portion (e.g., a few terms) of the documents that are determined to be important or the most frequently occurring (excluding &#8220;stop words&#8221;).</p>
<p>[0054] According to yet another implementation, search engine 125 may store a summary or other representation of a document and monitor this information for changes. According to a further implementation, search engine 125 may generate a similarity hash (which may be used to detect near-duplication of a document) for the document and monitor it for changes. A change in a similarity hash may be considered to indicate a relatively large change in its associated document. In other implementations, yet other techniques may be used to monitor documents for changes. In situations where adequate data storage resources exist, the full documents may be stored and used to determine changes rather than some representation of the documents.</p>
<p>[0055] For some queries, documents with content that has not recently changed may be more favorable than documents with content that has recently changed. As a result, it may be beneficial to adjust the score of a document based on the difference from the average date-of-change of the result set. In other words, search engine 125 may determine a date when the content of each of the documents in a result set last changed, determine the average date of change for the documents, and modify the scores of the documents (either positively or negatively) based on a difference between the documents&#8217; date-of-change and the average date-of-change.</p>
<p>[0056] <span class="style12">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to a manner in which the document&#8217;s content changes over time.</span> For very large documents that include content belonging to multiple individuals or organizations, the score may correspond to each of the sub-documents (i.e., that content belonging to or updated by a single individual or organization).</p>
<p>[0057] <span class="style10">Query Analysis </span></p>
<p>[0058] According to an implementation consistent with the principles of the invention, one or more query-based factors may be used to generate (or alter) a score associated with a document. For example, one query-based factor may relate to the extent to which a document is selected over time when the document is included in a set of search results. In this case, search engine 125 might score documents selected relatively more often/increasingly by users higher than other documents.</p>
<p>[0059] Another query-based factor may relate to the occurrence of certain search terms appearing in queries over time. A particular set of search terms may increasingly appear in queries over a period of time. For example, terms relating to a &#8220;hot&#8221; topic that is gaining/has gained popularity or a breaking news event would conceivably appear frequently over a period of time. In this case, search engine 125 may score documents associated with these search terms (or queries) higher than documents not associated with these terms.</p>
<p>[0060] A further query-based factor may relate to a change over time in the number of search results generated by similar queries. A significant increase in the number of search results generated by similar queries, for example, might indicate a hot topic or breaking news and cause search engine 125 to increase the scores of documents related to such queries.</p>
<p>[0061] Another query-based factor may relate to queries that remain relatively constant over time but lead to results that change over time. For example, a query relating to &#8220;world series champion&#8221; leads to search results that change over time (e.g., documents relating to a particular team dominate search results in a given year or time of year). This change can be monitored and used to score documents accordingly.</p>
<p>[0062] Yet another query-based factor might relate to the &#8220;staleness&#8221; of documents returned as search results. The staleness of a document may be based on factors, such as document creation date, anchor growth, traffic, content change, forward/back link growth, etc. For some queries, recent documents are very important (e.g., if searching for Frequently Asked Questions (FAQ) files, the most recent version would be highly desirable). Search engine 125 may learn which queries recent changes are most important for by analyzing which documents in search results are selected by users. More specifically, search engine 125 may consider how often users favor a more recent document that is ranked lower than an older document in the search results. Additionally, if over time a particular document is included in mostly topical queries (e.g., &#8220;World Series Champions&#8221;) versus more specific queries (e.g., &#8220;New York Yankees&#8221;), then this query-based factor&#8211;by itself or with others mentioned herein&#8211;may be used to lower a score for a document that appears to be stale.</p>
<p>[0063] In some situations, a stale document may be considered more favorable than more recent documents. As a result, search engine 125 may consider the extent to which a document is selected over time when generating a score for the document. For example, if for a given query, users over time tend to select a lower ranked, relatively stale, document over a higher ranked, relatively recent document, this may be used by search engine 125 as an indication to adjust a score of the stale document.</p>
<p>[0064] Yet another query-based factor may relate to the extent to which a document appears in results for different queries. In other words, the entropy of queries for one or more documents may be monitored and used as a basis for scoring. For example, if a particular document appears as a hit for a discordant set of queries, this may (though not necessarily) be considered a signal that the document is spam, in which case search engine 125 may score the document relatively lower.</p>
<p>[0065] In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on one or more query-based factors.</p>
<p>[0066] <span class="style10">Link-Based Criteria </span></p>
<p>[0067] According to an implementation consistent with the principles of the invention, one or more link-based factors may be used to generate (or alter) a score associated with a document. In one implementation, the link-based factors may relate to the dates that new links appear to a document and that existing links disappear. The appearance date of a link may be the first date that search engine 125 finds the link or the date of the document that contains the link (e.g., the date that the document was found with the link or the date that it was last updated). The disappearance date of a link may be the first date that the document containing the link either dropped the link or disappeared itself.</p>
<p>[0068] These dates may be determined by search engine 125 during a crawl or index update operation. Using this date as a reference, search engine 125 may then monitor the time-varying behavior of links to the document, such as when links appear or disappear, the rate at which links appear or disappear over time, how many links appear or disappear during a given time period, whether there is trend toward appearance of new links versus disappearance of existing links to the document, etc.</p>
<p>[0069] Using the time-varying behavior of links to (and/or from) a document, search engine 125 may score the document accordingly. For example, a downward trend in the number or rate of new links (e.g., based on a comparison of the number or rate of new links in a recent time period versus an older time period) over time could signal to search engine 125 that a document is stale, in which case search engine 125 may decrease the document&#8217;s score. Conversely, an upward trend may signal a &#8220;fresh&#8221; document (e.g., a document whose content is fresh&#8211;recently created or updated) that might be considered more relevant, depending on the particular situation and implementation.</p>
<p>[0070] By analyzing the change in the number or rate of increase/decrease of back links to a document (or page) over time, search engine 125 may derive a valuable signal of how fresh the document is. For example, if such analysis is reflected by a curve that is dropping off, this may signal that the document may be stale (e.g., no longer updated, diminished in importance, superceded by another document, etc.).</p>
<p>[0071] According to one implementation, the analysis may depend on the number of new links to a document. For example, search engine 125 may monitor the number of new links to a document in the last n days compared to the number of new links since the document was first found. Alternatively, search engine 125 may determine the oldest age of the most recent y % of links compared to the age of the first link found.</p>
<p>[0072] For the purpose of illustration, consider y=10 and two documents (web sites in this example) that were both first found 100 days ago. For the first site, 10% of the links were found less than 10 days ago, while for the second site 0% of the links were found less than 10 days ago (in other words, they were all found earlier). In this case, the metric results in 0.1 for site A and 0 for site B. The metric may be scaled appropriately. In another exemplary implementation, the metric may be modified by performing a relatively more detailed analysis of the distribution of link dates. For example, models may be built that predict if a particular distribution signifies a particular type of site (e.g., a site that is no longer updated, increasing or decreasing in popularity, superceded, etc.).</p>
<p>[0073] According to another implementation, the analysis may depend on weights assigned to the links. In this case, each link may be weighted by a function that increases with the freshness of the link. The freshness of a link may be determined by the date of appearance/change of the link, the date of appearance/change of anchor text associated with the link, date of appearance/change of the document containing the link. The date of appearance/change of the document containing a link may be a better indicator of the freshness of the link based on the theory that a good link may go unchanged when a document gets updated if it is still relevant and good. In order to not update every link&#8217;s freshness from a minor edit of a tiny unrelated part of a document, each updated document may be tested for significant changes (e.g., changes to a large portion of the document or changes to many different portions of the document) and a link&#8217;s freshness may be updated (or not updated) accordingly.</p>
<p>[0074] Links may be weighted in other ways. For example, links may be weighted based on how much the documents containing the links are trusted (e.g., government documents can be given high trust). Links may also, or alternatively, be weighted based on how authoritative the documents containing the links are (e.g., authoritative documents may be determined in a manner similar to that described in U.S. Pat. No. 6,285,999). Links may also, or alternatively, be weighted based on the freshness of the documents containing the links using some other features to establish freshness (e.g., a document that is updated frequently (e.g., the Yahoo home page) suddenly drops a link to a document).</p>
<p>[0075] Search engine 125 may raise or lower the score of a document to which there are links as a function of the sum of the weights of the links pointing to it. This technique may be employed recursively. For example, assume that a document S is 2 years olds. Document S may be considered fresh if n % of the links to S are fresh or if the documents containing forward links to S are considered fresh. The latter can be checked by using the creation date of the document and applying this technique recursively.</p>
<p>[0076] According to yet another technique, the analysis may depend on an age distribution associated with the links pointing to a document. In other words, the dates that the links to a document were created may be determined and input to a function that determines the age distribution. It may be assumed that the age distribution of a stale document will be very different from the age distribution of a fresh document. Search engine 125 may then score documents based, at least in part, on the age distributions associated with the documents.</p>
<p>[0077] The dates that links appear can also be used to detect &#8220;spam,&#8221; where owners of documents or their colleagues create links to their own document for the purpose of boosting the score assigned by a search engine. A typical, &#8220;legitimate&#8221; document attracts back links slowly. A large spike in the quantity of back links may signal a topical phenomenon (e.g., the CDC web site may develop many links quickly after an outbreak, such as SARS), or signal attempts to spam a search engine (to obtain a higher ranking and, thus, better placement in search results) by exchanging links, purchasing links, or gaining links from documents without editorial discretion on making links. Examples of documents that give links without editorial discretion include guest books, referrer logs, and &#8220;free for all&#8221; pages that let anyone add a link to a document.</p>
<p>[0078] According to a further implementation, the analysis may depend on the date that links disappear. The disappearance of many links can mean that the document to which these links point is stale (e.g., no longer being updated or has been superseded by another document). For example, search engine 125 may monitor the date at which one or more links to a document disappear, the number of links that disappear in a given window of time, or some other time-varying decrease in the number of links (or links/updates to the documents containing such links) to a document to identify documents that may be considered stale. Once a document has been determined to be stale, the links contained in that document may be discounted or ignored by search engine 125 when determining scores for documents pointed to by the links.</p>
<p>[0079] According to another implementation, the analysis may depend, not only on the age of the links to a document, but also on the dynamic-ness of the links. As such, search engine 125 may weight documents that have a different featured link each day, despite having a very fresh link, differently (e.g., lower) than documents that are consistently updated and consistently link to a given target document. In one exemplary implementation, search engine 125 may generate a score for a document based on the scores of the documents with links to the document for all versions of the documents within a window of time. Another version of this may factor a discount/decay into the integration based on the major update times of the document.</p>
<p>[0080] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on one or more link-based factors.</span></p>
<p>[0081] <span class="style10">Anchor Text </span></p>
<p>[0082] According to an implementation consistent with the principles of the invention, information relating to a manner in which anchor text changes over time may be used to generate (or alter) a score associated with a document. For example, changes over time in anchor text associated with links to a document may be used as an indication that there has been an update or even a change of focus in the document.</p>
<p>[0083] Alternatively, if the content of a document changes such that it differs significantly from the anchor text associated with its back links, then the domain associated with the document may have changed significantly (completely) from a previous incarnation. This may occur when a domain expires and a different party purchases the domain. Because anchor text is often considered to be part of the document to which its associated link points, the domain may show up in search results for queries that are no longer on topic. This is an undesirable result.</p>
<p>[0084] One way to address this problem is to estimate the date that a domain changed its focus. This may be done by determining a date when the text of a document changes significantly or when the text of the anchor text changes significantly. All links and/or anchor text prior to that date may then be ignored or discounted.</p>
<p>[0085] The freshness of anchor text may also be used as a factor in scoring documents. The freshness of an anchor text may be determined, for example, by the date of appearance/change of the anchor text, the date of appearance/change of the link associated with the anchor text, and/or the date of appearance/change of the document to which the associated link points. The date of appearance/change of the document pointed to by the link may be a good indicator of the freshness of the anchor text based on the theory that good anchor text may go unchanged when a document gets updated if it is still relevant and good. In order to not update an anchor text&#8217;s freshness from a minor edit of a tiny unrelated part of a document, each updated document may be tested for significant changes (e.g., changes to a large portion of the document or changes to many different portions of the document) and an anchor text&#8217;s freshness may be updated (or not updated) accordingly.</p>
<p>[0086] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to a manner in which anchor text changes over time.<br />
</span><br />
[0087] <span class="style10">Traffic </span></p>
<p>[0088] According to an implementation consistent with the principles of the invention, information relating to traffic associated with a document over time may be used to generate (or alter) a score associated with the document. For example, search engine 125 may monitor the time-varying characteristics of traffic to, or other &#8220;use&#8221; of, a document by one or more users. A large reduction in traffic may indicate that a document may be stale (e.g., no longer be updated or may be superseded by another document).</p>
<p>[0089] In one implementation, search engine 125 may compare the average traffic for a document over the last j days (e.g., where j=30) to the average traffic during the month where the document received the most traffic, optionally adjusted for seasonal changes, or during the last k days (e.g., where k=365). Optionally, search engine 125 may identify repeating traffic patterns or perhaps a change in traffic patterns over time. It may be discovered that there are periods when a document is more or less popular (i.e., has more or less traffic), such as during the summer months, on weekends, or during some other seasonal time period. By identifying repeating traffic patterns or changes in traffic patterns, search engine 125 may appropriately adjust its scoring of the document during and outside of these periods.</p>
<p>[0090] Additionally, or alternatively, search engine 125 may monitor time-varying characteristics relating to &#8220;advertising traffic&#8221; for a particular document. For example, search engine 125 may monitor one or a combination of the following factors: (1) the extent to and rate at which advertisements are presented or updated by a given document over time; (2) the quality of the advertisers (e.g., a document whose advertisements refer/link to documents known to search engine 125 over time to have relatively high traffic and trust, such as amazon.com, may be given relatively more weight than those documents whose advertisements refer to low traffic/untrustworthy documents, such as a pornographic site); and (3) the extent to which the advertisements generate user traffic to the documents to which they relate (e.g., their click-through rate). Search engine 125 may use these time-varying characteristics relating to advertising traffic to score the document.</p>
<p>[0091] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to traffic associated with the document over time.</span></p>
<p>[0092] <span class="style11">User Behavior </span></p>
<p>[0093] According to an implementation consistent with the principles of the invention, information corresponding to individual or aggregate user behavior relating to a document over time may be used to generate (or alter) a score associated with the document. For example, search engine 125 may monitor the number of times that a document is selected from a set of search results and/or the amount of time one or more users spend accessing the document. Search engine 125 may then score the document based, at least in part, on this information.</p>
<p>[0094] If a document is returned for a certain query and over time, or within a given time window, users spend either more or less time on average on the document given the same or similar query, then this may be used as an indication that the document is fresh or stale, respectively. For example, assume that the query &#8220;Riverview swimming schedule&#8221; returns a document with the title &#8220;Riverview Swimming Schedule.&#8221; Assume further that users used to spend 30 seconds accessing it, but now every user that selects the document only spends a few seconds accessing it. Search engine 125 may use this information to determine that the document is stale (i.e., contains an outdated swimming schedule) and score the document accordingly.</p>
<p>[0095] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information corresponding to individual or aggregate user behavior relating to the document over time</span>.</p>
<p>[0096] <span class="style10">Domain-Related Information </span></p>
<p>[0097] According to an implementation consistent with the principles of the invention, information relating to a domain associated with a document may be used to generate (or alter) a score associated with the document. For example, search engine 125 may monitor information relating to how a document is hosted within a computer network (e.g., the Internet, an intranet or other network or database of documents) and use this information to score the document.</p>
<p>[0098] Individuals who attempt to deceive (spam) search engines often use throwaway or &#8220;doorway&#8221; domains and attempt to obtain as much traffic as possible before being caught. Information regarding the legitimacy of the domains may be used by search engine 125 when scoring the documents associated with these domains.</p>
<p>[0099] Certain signals may be used to distinguish between illegitimate and legitimate domains. For example, domains can be renewed up to a period of 10 years. Valuable (legitimate) domains are often paid for several years in advance, while doorway (illegitimate) domains rarely are used for more than a year. Therefore, the date when a domain expires in the future can be used as a factor in predicting the legitimacy of a domain and, thus, the documents associated therewith.</p>
<p>[0100] Also, or alternatively, the domain name server (DNS) record for a domain may be monitored to predict whether a domain is legitimate. The DNS record contains details of who registered the domain, administrative and technical addresses, and the addresses of name servers (i.e., servers that resolve the domain name into an IP address). By analyzing this data over time for a domain, illegitimate domains may be identified. For instance, search engine 125 may monitor whether physically correct address information exists over a period of time, whether contact information for the domain changes relatively often, whether there is a relatively high number of changes between different name servers and hosting companies, etc. In one implementation, a list of known-bad contact information, name servers, and/or IP addresses may be identified, stored, and used in predicting the legitimacy of a domain and, thus, the documents associated therewith.</p>
<p>[0101] Also, or alternatively, the age, or other information, regarding a name server associated with a domain may be used to predict the legitimacy of the domain. A &#8220;good&#8221; name server may have a mix of different domains from different registrars and have a history of hosting those domains, while a &#8220;bad&#8221; name server might host mainly pornography or doorway domains, domains with commercial words (a common indicator of spam), or primarily bulk domains from a single registrar, or might be brand new. The newness of a name server might not automatically be a negative factor in determining the legitimacy of the associated domain, but in combination with other factors, such as ones described herein, it could be.</p>
<p>[0102] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to a legitimacy of a domain associated with the document. </span></p>
<p>[0103] <span class="style10">Ranking History </span></p>
<p>[0104] According to an implementation consistent with the principles of the invention, information relating to prior rankings of a document may be used to generate (or alter) a score associated with the document. For example, search engine 125 may monitor the time-varying ranking of a document in response to search queries provided to search engine 125. Search engine 125 may determine that a document that jumps in rankings across many queries might be a topical document or it could signal an attempt to spam search engine 125.</p>
<p>[0105] Thus, the quantity or rate that a document moves in rankings over a period of time might be used to influence future scores assigned to that document. In one implementation, for each set of search results, a document may be weighted according to its position in the top N search results. For N=30, one example function might be [((N+1)-SLOT)/N].sup.4. In this case, a top result may receive a score of 1.0, down to a score near 0 for the Nth result.</p>
<p>[0106] A query set (e.g., of commercial queries) can be repeated, and documents that gained more than M % in the rankings may be flagged or the percentage growth in ranking may be used as a signal in determining scores for the documents. For example, search engine 125 may determine that a query is likely commercial if the average (median) score of the top results is relatively high and there is a significant amount of change in the top results from month to month. Search engine 125 may also monitor churn as an indication of a commercial query. For commercial queries, the likelihood of spam is higher, so search engine 125 may treat documents associated therewith accordingly.</p>
<p>[0107] In addition to history of positions (or rankings) of documents for a given query, search engine 125 may monitor (on a page, host, document, and/or domain basis) one or more other factors, such as the number of queries for which, and the rate at which (increasing/decreasing), a document is selected as a search result over time; seasonality, burstiness, and other patterns over time that a document is selected as a search result; and/or changes in scores over time for a URL-query pair.</p>
<p>[0108] In addition, or alternatively, search engine 125 may monitor a number of document (e.g., URL) independent query-based criteria over time. For example, search engine 125 may monitor the average score among a top set of results generated in response to a given query or set of queries and adjust the score of that set of results and/or other results generated in response to the given query or set of queries. Moreover, search engine 125 may monitor the number of results generated for a particular query or set of queries over time. If search engine 125 determines that the number of results increases or that there is a change in the rate of increase (e.g., such an increase may be an indication of a &#8220;hot topic&#8221; or other phenomenon), search engine 125 may score those results higher in the future.</p>
<p>[0109] In addition, or alternatively, search engine 125 may monitor the ranks of documents over time to detect sudden spikes in the ranks of the documents. A spike may indicate either a topical phenomenon (e.g., a hot topic) or an attempt to spam search engine 125 by, for example, trading or purchasing links. Search engine 125 may take measures to prevent spam attempts by, for example, employing hysteresis to allow a rank to grow at a certain rate. In another implementation, the rank for a given document may be allowed a certain maximum threshold of growth over a predefined window of time. As a further measure to differentiate a document related to a topical phenomenon from a spam document, search engine 125 may consider mentions of the document in news articles, discussion groups, etc. on the theory that spam documents will not be mentioned, for example, in the news. Any or a combination of these techniques may be used to curtail spamming attempts.</p>
<p>[0110] It may be possible for search engine 125 to make exceptions for documents that are determined to be authoritative in some respect, such as government documents, web directories (e.g., Yahoo), and documents that have shown a relatively steady and high rank over time. For example, if an unusual spike in the number or rate of increase of links to an authoritative document occurs, then search engine 125 may consider such a document not to be spam and, thus, allow a relatively high or even no threshold for (growth of) its rank (over time).</p>
<p>[0111] In addition, or alternatively, search engine 125 may consider significant drops in ranks of documents as an indication that these documents are &#8220;out of favor&#8221; or outdated. For example, if the rank of a document over time drops significantly, then search engine 125 may consider the document as outdated and score the document accordingly.</p>
<p>[0112] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information relating to prior rankings of the document. </span></p>
<p>[0113] <span class="style10">User Maintained/Generated Data </span></p>
<p>[0114] According to an implementation consistent with the principles of the invention, user maintained or generated data may be used to generate (or alter) a score associated with a document. For example, search engine 125 may monitor data maintained or generated by a user, such as &#8220;bookmarks,&#8221; &#8220;favorites,&#8221; or other types of data that may provide some indication of documents favored by, or of interest to, the user. Search engine 125 may obtain this data either directly (e.g., via a browser assistant) or indirectly (e.g., via a browser). Search engine 125 may then analyze over time a number of bookmarks/favorites to which a document is associated to determine the importance of the document.</p>
<p>[0115] Search engine 125 may also analyze upward and downward trends to add or remove the document (or more specifically, a path to the document) from the bookmarks/favorites lists, the rate at which the document is added to or removed from the bookmarks/favorites lists, and/or whether the document is added to, deleted from, or accessed through the bookmarks/favorites lists. If a number of users are adding a particular document to their bookmarks/favorites lists or often accessing the document through such lists over time, this may be considered an indication that the document is relatively important. On the other hand, if a number of users are decreasingly accessing a document indicated in their bookmarks/favorites list or are increasingly deleting/replacing the path to such document from their lists, this may be taken as an indication that the document is outdated, unpopular, etc. Search engine 125 may then score the documents accordingly.</p>
<p>[0116] In an alternative implementation, other types of user data that may indicate an increase or decrease in user interest in a particular document over time may be used by search engine 125 to score the document. For example, the &#8220;temp&#8221; or cache files associated with users could be monitored by search engine 125 to identify whether there is an increase or decrease in a document being added over time. Similarly, cookies associated with a particular document might be monitored by search engine 125 to determine whether there is an upward or downward trend in interest in the document.</p>
<p>[0117] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on user maintained or generated data.<br />
</span><br />
[0118] <span class="style10">Unique Words, Bigrams, Phrases in Anchor Text </span></p>
<p>[0119] According to an implementation consistent with the principles of the invention, information regarding unique words, bigrams, and phrases in anchor text may be used to generate (or alter) a score associated with a document. For example, search engine 125 may monitor web (or link) graphs and their behavior over time and use this information for scoring, spam detection, or other purposes. Naturally developed web graphs typically involve independent decisions. Synthetically generated web graphs, which are usually indicative of an intent to spam, are based on coordinated decisions, causing the profile of growth in anchor words/bigrams/phrases to likely be relatively spiky.</p>
<p>[0120] One reason for such spikiness may be the addition of a large number of identical anchors from many documents. Another possibility may be the addition of deliberately different anchors from a lot of documents. Search engine 125 may monitor the anchors and factor them into scoring a document to which their associated links point. For example, search engine 125 may cap the impact of suspect anchors on the score of the associated document. Alternatively, search engine 125 may use a continuous scale for the likelihood of synthetic generation and derive a multiplicative factor to scale the score for the document.</p>
<p>[0121] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information regarding unique words, bigrams, and phrases in anchor text associated with one or more links pointing to the document.<br />
</span><br />
[0122] <span class="style10">Linkage of Independent Peers </span></p>
<p>[0123] According to an implementation consistent with the principles of the invention, information regarding linkage of independent peers (e.g., unrelated documents) may be used to generate (or alter) a score associated with a document.</p>
<p>[0124] A sudden growth in the number of apparently independent peers, incoming and/or outgoing, with a large number of links to individual documents may indicate a potentially synthetic web graph, which is an indicator of an attempt to spam. This indication may be strengthened if the growth corresponds to anchor text that is unusually coherent or discordant. This information can be used to demote the impact of such links, when used with a link-based scoring technique, either as a binary decision item (e.g., demote the score by a fixed amount) or a multiplicative factor.</p>
<p>[0125]<span class="style13"> In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on information regarding linkage of independent peers. </span></p>
<p>[0126] <span class="style10">Document Topics </span></p>
<p>[0127] According to an implementation consistent with the principles of the invention, information regarding document topics may be used to generate (or alter) a score associated with a document. For example, search engine 125 may perform topic extraction (e.g., through categorization, URL analysis, content analysis, clustering, summarization, a set of unique low frequency words, or some other type of topic extraction). Search engine 125 may then monitor the topic(s) of a document over time and use this information for scoring purposes.</p>
<p>[0128] A significant change over time in the set of topics associated with a document may indicate that the document has changed owners and previous document indicators, such as score, anchor text, etc., are no longer reliable. Similarly, a spike in the number of topics could indicate spam. For example, if a particular document is associated with a set of one or more topics over what may be considered a &#8220;stable&#8221; period of time and then a (sudden) spike occurs in the number of topics associated with the document, this may be an indication that the document has been taken over as a &#8220;doorway&#8221; document. Another indication may include the disappearance of the original topics associated with the document. If one or more of these situations are detected, then search engine 125 may reduce the relative score of such documents and/or the links, anchor text, or other data associated the document.</p>
<p>[0129] <span class="style13">In summary, search engine 125 may generate (or alter) a score associated with a document based, at least in part, on changes in one or more topics associated with the document.</span></p>
<p align="center"><span class="style10">EXEMPLARY PROCESSING </span></p>
<p>[0130] FIG. 4 is a flowchart of exemplary processing for scoring documents according to an implementation consistent with the principles of the invention. Processing may begin with server 120 identifying documents (act 410). The documents may include, for example, one or more documents associated with a search query, such as documents identified as relevant to the search query. Alternatively, the documents may include one or more documents in a corpus or repository of documents that are independent of any search query (e.g., documents that are identified by crawling a network and stored in a repository).</p>
<p>[0131] Search engine 125 may obtain history data associated with the identified documents (act 420). As described above, the history data may take different forms. For example, the history data may include data relating to document inception dates; document content updates/changes; query analysis; link-based criteria; anchor text; traffic; user behavior; domain-related information; ranking history; user maintained/generated data (e.g., bookmarks and/or favorites); unique words, bigrams, and phrases in anchor text; linkage of independent peers; and/or document topics. Search engine 125 may obtain one, or a combination, of these kinds of history data.</p>
<p>[0132] Search engine 125 may then score the identified documents based, at least in part, on the history data (act 430). When the identified documents are associated with a search query, search engine 125 may also generate relevancy scores for the documents based, for example, on how relevant they are to the search query. Search engine 125 may then combine the history scores with the relevancy scores to obtain overall scores for the documents. Instead of combining the scores, search engine 125 may alter the relevancy scores for the documents based on the history data, thereby raising or lowering the scores or, in some cases, leaving the scores the same. Alternatively, search engine 125 may score the documents based on the history data without generating relevancy scores. In any event, search engine 125 may score the documents using one, or a combination, of the types of history data.</p>
<p>[0133] When the identified documents are associated with a search query, search engine 125 may also form search results from the scored documents. For example, search engine 125 may sort the documents based on their scores. Search engine 125 may then form references to the documents, where a reference might include a title of the document (which may contain a hypertext link that will direct the user, when selected, to the actual document) and a snippet (i.e., a text excerpt) from the document. In other implementations, the references are formed differently. Search engine 125 may present references corresponding to a number of the top-scoring documents (e.g., a predetermined number of the documents, documents with scores above a threshold, all documents, etc.) to a user who submitted the search query.</p>
<p align="center"><span class="style10">CONCLUSION </span></p>
<p>[0134] Systems and methods consistent with the principles of the invention may use history data to score documents and form high quality search results.</p>
<p>[0135] The foregoing description of preferred embodiments of the present invention provides illustration and description, but is not intended to be exhaustive or to limit the invention to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of the invention. For example, while a series of acts has been described with regard to FIG. 4, the order of the acts may be modified in other implementations consistent with the principles of the invention. Also, non-dependent acts may be performed in parallel.</p>
<p>[0136] Further, it has generally been described that server 120 performs most, if not all, of the acts described with regard to the processing of FIG. 4. In another implementation consistent with the principles of the invention, one or more, or all, of the acts may be performed by another entity, such as another server 130 and/or 140 or client 110.</p>
<p>[0137] It will also be apparent to one of ordinary skill in the art that aspects of the invention, as described above, may be implemented in many different forms of software, firmware, and hardware in the implementations illustrated in the figures. The actual software code or specialized control hardware used to implement aspects consistent with the principles of the invention is not limiting of the present invention. Thus, the operation and behavior of the aspects were described without reference to the specific software code&#8211;it being understood that one of ordinary skill in the art would be able to design software and control hardware to implement the aspects based on the description herein.</p></div>
<p><strong>* * * * *</strong></p>
<p><strong>Abstract</strong></p>
<p class="style37"><strong>A system identifies a document and obtains one or more types of history data associated with the document. The system may generate a score for the document based, at least in part, on the one or more types of history data. </strong></p>
<p>United States Patent Application 20050071741<br />
Kind Code: A1 Acharya, Anurag<br />
et al.: March 31, 2005</p>
<p><span class="style36">Inventors:</span><br />
<span class="style3"><strong>Acharya, Anurag</strong>; <em>(Campbell, CA)</em> <strong>; Cutts, Matt</strong>; <em>(Mountain View, CA)</em> <strong>; Dean, Jeffrey</strong>; <em>(Palo Alto, CA)</em> <strong>; Haahr, Paul</strong>; <em>(San Francisco, CA)</em> <strong>; Henzinger, Monika</strong>; <em>(Lausanne, CH)</em> <strong>; Hoelzle, Urs</strong>; <em>(Palo Alto, CA)</em> <strong>; Lawrence, Steve</strong>; <em>(Mountain View, CA)</em> <strong>; Pfleger, Karl</strong>; <em>(Mountain View, CA)</em> <strong>; Sercinoglu, Olcan</strong>; <em>(Mountain View, CA)</em> <strong>; Tong, Simon</strong>; <em>(Mountain View, CA)</em></span></p>
<p><span class="style3">Correspondence Name and Address:<br />
HARRITY &amp; SNYDER, LLP 11240, WAPLES MILL ROAD, SUITE 300 FAIRFAX, VA 22030 US</span></p>
<p class="style9 style3 style3">Serial No.: 748664<strong><br />
</strong>Series Code: 10<strong><strong><br />
<span class="style37">Filed: <strong>December 31, 2003</strong></span></strong></strong></p>
<p class="style3">U.S. Current Class: 715/500<br />
Intern&#8217;l Class: G06F 017/00</p>
<p class="style3">To see the original document please <a href="http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&amp;Sect2=HITOFF&amp;p=1&amp;u=/netahtml/PTO/search-bool.html&amp;r=1&amp;f=G&amp;l=50&amp;co1=AND&amp;d=PG01&amp;s1=20050071741&amp;OS=20050071741&amp;RS=20050071741">click here</a></p>
<h4 class="style34"><em> Claims</em></h4>
<div>
<p><span class="style37"><strong>What is claimed is:</strong></span></p>
<p>1. A method for scoring a document, comprising: identifying a document; obtaining one or more types of history data associated with the document; and generating a score for the document based on the one or more types of history data.</p>
<p>2. The method of claim 1, wherein the one or more types of history data includes information relating to an inception date; and wherein the generating a score includes: determining an inception date corresponding to the document, and scoring the document based, at least in part, on the inception date corresponding to the document.</p>
<p>3. The method of claim 2, wherein the document includes a plurality of documents; and wherein the scoring the document includes: determining an age of each of the documents based on the inception dates corresponding to the documents, determining an average age of the documents based on the ages of the documents, and scoring the documents based, at least in part, on a difference between the ages of the documents and the average age.</p>
<p>4. The method of claim 2, wherein the generating a score for the document includes scoring the document based, at least in part, on an elapsed time measured from the inception date corresponding to the document.</p>
<p>5. The method of claim 2, wherein the inception date corresponding to the document is based on at least one of a date when a search engine first discovers the document, a date when a search engine first discovers a link to the document, and a date when the document includes at least a predetermined number of pages.</p>
<p>6. The method of claim 1, wherein the one or more types of history data includes information relating to a manner in which a content of the document changes over time; and wherein the generating a score includes: determining a frequency at which the content of the document changes over time, and scoring the document based, at least in part, on the frequency at which the content of the document changes over time.</p>
<p>7. The method of claim 6, wherein the frequency at which the content of the document changes is based on at least one of an average time between the changes, a number of changes in a time period, and a comparison of a rate of change in a current time period with a rate of change in a previous time period.</p>
<p>8. The method of claim 6, wherein the generating a score further includes: determining an amount by which the content of the document changes over time, and scoring the document based, at least in part, on the frequency at which and the amount by which the content of the document changes over time.</p>
<p>9. The method of claim 8, wherein the amount by which the content of the document changes is based on at least one of a number of new pages associated with the document within a time period, a ratio of a number of new pages associated with the document versus a total number of pages associated with the document, and a percentage of the content of the document that has changed during a time period.</p>
<p>10. The method of claim 8, wherein the determining an amount by which the content of the document changes includes: weighting different portions of the content of the document differently based on a perceived importance of the portions, and determining the amount by which the content of the document changes as a function of the differently weighted portions of the content.</p>
<p>11. The method of claim 6, wherein the document includes a plurality of documents; and wherein the scoring the document includes: determining a date on which the content of each of the documents last changed, determining an average date of change based on the determined dates on which the contents of the documents last changed, and scoring the documents based, at least in part, on a difference between the dates on which the contents of the documents last changed and the average date of change.</p>
<p>12. The method of claim 1, wherein the one or more types of history data includes information relating to a manner in which a content of the document changes over time; and wherein the generating a score includes: determining an amount by which the content of the document changes over time, and scoring the document based, at least in part, on the amount by which the content of the document changes over time.</p>
<p>13. The method of claim 12, wherein the amount by which the content of the document changes is based on at least one of a number of new pages associated with the document within a time period, a ratio of a number of new pages associated with the document versus a total number of pages associated with the document, and a percentage of the content of the document that has changed during a time period.</p>
<p>14. The method of claim 12, wherein the determining an amount by which the content of the document changes includes: weighting different portions of the content of the document differently based on a perceived importance of the portions, and determining the amount by which the content of the document changes as a function of the differently weighted portions of the content.</p>
<p>15. The method of claim 1, wherein the one or more types of history data includes information relating to how often the document is selected when the document is included in a set of search results; and wherein the generating a score includes: determining an extent to which the document is selected over time when the document is included in a set of search results, and scoring the document based, at least in part, on the extent to which the document is selected over time when the document is included in the set of search results.</p>
<p>16. The method of claim 15, wherein the scoring the document includes assigning a higher score to the document when the document is selected more often than other documents in the set of search results over a time period.</p>
<p>17. The method of claim 1, wherein the one or more types of history data includes information relating to search terms that increasingly appear in search queries over time; and wherein the generating a score includes: determining whether the document is associated with the search terms, and scoring the document based, at least in part, on whether the document is associated with the search terms.</p>
<p>18. The method of claim 1, wherein the one or more types of history data includes information relating to queries that remain approximately constant over time but lead to results that change over time; and wherein the generating a score includes: determining whether the document is associated with queries that lead to results that change over time, and scoring the document based, at least in part, on whether the document is associated with queries that lead to results that change over time.</p>
<p>19. The method of claim 1, wherein the one or more types of history data includes information relating to staleness of documents; and wherein the generating a score includes: determining whether the document is stale, and scoring the document based, at least in part, on whether the document is stale.</p>
<p>20. The method of claim 19, wherein the scoring the document includes: determining whether stale documents are considered favorable for a search query when the document is determined to be stale, and scoring the document based, at least in part, on whether stale documents are considered favorable for the search query when the document is determined to be stale.</p>
<p>21. The method of claim 20, wherein the determining whether stale documents are considered favorable for the search query is based, at least in part, on how often stale documents were selected over recent documents over time for the search query.</p>
<p>22. The method of claim 1, wherein the one or more types of history data includes information relating to behavior of links over time; and wherein the generating a score includes: determining behavior of links associated with the document, and scoring the document based, at least in part, on the behavior of links associated with the document.</p>
<p>23. The method of claim 22, wherein the behavior of links relate to at least one of appearance and disappearance of one or more links pointing to the document.</p>
<p>24. The method of claim 23, wherein the appearance of one or more links relates to at least one of a date that a new link to the document appears, a rate at which the one or more links appear over time, and a number of the one or more links that appear during a time period, and the disappearance of one or more links relates to at least one of a date that an existing link to the document disappears, a rate at which the one or more links disappear over time, and a number of the one or more links that disappear during a time period.</p>
<p>25. The method of claim 22, wherein the determining behavior of links associated with the document includes monitoring at least one of time-varying behavior of links associated with the document, how many links associated with the document appear or disappear during a time period, and whether there is a trend toward appearance of new links associated with the document versus disappearance of existing links associated with the document.</p>
<p>26. The method of claim 1, wherein the one or more types of history data includes information relating to freshness of links; and wherein the generating a score includes: determining freshness of links associated with the document, assigning weights to the links based on the determined freshness, and scoring the document based, at least in part, on the weights assigned to the links associated with the document.</p>
<p>27. The method of claim 26, wherein the freshness of a link associated with the document is based on at least one of a date of appearance of the link, a date of a change to the link, a date of appearance of anchor text associated with the link, a date of a change to anchor text associated with the link, a date of appearance of a linking document containing the link, and a date of a change to a linking document containing the link.</p>
<p>28. The method of claim 26, wherein the weight assigned to a link is based on at least one of how much a document containing the link is trusted, how authoritative a document containing the link is, and a freshness of a document containing the link.</p>
<p>29. The method of claim 26, wherein the scoring the document includes: determining an age of each link pointing to the document, determining an age distribution associated with the links based on the ages of the links, and scoring the document based, at least in part, on the age distribution associated with the links.</p>
<p>30. The method of claim 1, wherein the one or more types of history data includes information relating to a manner in which anchor text changes over time; and wherein the generating a score includes: identifying a change in anchor text associated with a link to the document, and scoring the document based, at least in part, on the change in anchor text associated with a link to the document.</p>
<p>31. The method of claim 1, wherein the one or more types of history data includes information relating to differences in documents and anchor text associated with links to the documents; and wherein the generating a score includes: determining whether a content of the document changes such that the content differs from anchor text associated with one or more links to the document, and scoring the document based, at least in part, on whether the content of the document changes such that the content differs from the anchor text associated with one or more links to the document.</p>
<p>32. The method of claim 1, wherein the one or more types of history data includes information relating to freshness of anchor text; and wherein the generating a score includes: determining freshness of anchor text associated with one or more links to the document, and scoring the document based, at least in part, on the freshness of anchor text associated with one or more links to the document.</p>
<p>33. The method of claim 32, wherein the freshness of anchor text associated with a link to the document is based on at least one of a date of appearance of the anchor text, a date of a change to the anchor text, a date of appearance of a link associated with the anchor text, a date of a change to a link associated with the anchor text, a date of appearance of the document, and a date of a change to the document.</p>
<p>34. The method of claim 1, wherein the one or more types of history data includes information relating to traffic associated with documents; and wherein the generating a score includes: determining characteristics of traffic associated with the document, and scoring the document based, at least in part, on the characteristics of traffic associated with the document.</p>
<p>35. The method of claim 34, wherein the determining characteristics of traffic associated with the document includes analyzing a traffic pattern associated with the document to identify changes in the traffic pattern over time.</p>
<p>36. The method of claim 1, wherein the one or more types of history data includes information relating to user behavior associated with documents; and wherein the generating a score includes: determining user behavior associated with the document, and scoring the document based, at least in part, on the user behavior associated with the document.</p>
<p>37. The method of claim 36, wherein the user behavior relates to at least one of a number of times that the document is selected within a set of search results and an amount of time that one or more users spend accessing the document.</p>
<p>38. The method of claim 1, wherein the one or more types of history data includes domain-related information corresponding to domains associated with documents; and wherein the generating a score includes: analyzing domain-related information corresponding to a domain associated with the document over time, and scoring the document based, at least in part, on a result of the analyzing.</p>
<p>39. The method of claim 38, wherein the scoring the document includes: determining whether the domain associated with the document is legitimate, and scoring the document based, at least in part, on whether the domain associated with the document is legitimate.</p>
<p>40. The method of claim 38, wherein the domain-related information is related to at least one of an expiration date of the domain, a domain name server record associated with the domain, and a name server associated with the domain.</p>
<p>41. The method of claim 1, wherein the one or more types of history data includes information relating to a prior ranking history of documents; and wherein the generating a score includes: determining a prior ranking history of the document, and scoring the document based, at least in part, on the prior ranking history of the document.</p>
<p>42. The method of claim 41, wherein the scoring the document includes: determining a quantity or rate that the document moves in rankings over a time period, and scoring the document based, at least in part, on the quantity or rate that the document moves in the rankings.</p>
<p>43. The method of claim 41, wherein the prior ranking history is based on at least one of a number of queries for which the document is selected as a search result over time, a rate at which the document is selected as a search result over time, seasonality, burstiness, and changes in scores over time for a URL-query pair.</p>
<p>44. The method of claim 41, wherein the determining a prior ranking history of the document includes monitoring a rank of the document over time for spikes in the rank.</p>
<p>45. The method of claim 1, wherein the one or more types of history data includes information relating to user maintained or generated data; and wherein the generating a score includes: determining whether user maintained or generated data indicates that the document is of interest to a user, and scoring the document based, at least in part, on whether the user maintained or generated data indicates that the document is of interest to a user.</p>
<p>46. The method of claim 45, wherein the user maintained or generated data relates to at least one of favorites lists, bookmarks, temp files, and cache files associated with one or a plurality of users.</p>
<p>47. The method of claim 45, wherein the scoring the document includes: analyzing the user maintained or generated data over time to identify at least one of trends to add or remove the document, a rate at which the document is added to or removed from the user maintained or generated data, and whether the document is added to, deleted from, or accessed through the user maintained or generated data, and scoring the document based, at least in part, on a result of the analyzing.</p>
<p>48. The method of claim 1, wherein the one or more types of history data includes information relating to growth profiles of anchor text; and wherein the generating a score includes: determining a growth profile of anchor text associated with one or more links to the document, and scoring the document based, at least in part, on the growth profile of anchor text associated with one or more links to the document.</p>
<p>49. The method of claim 1, wherein the one or more types of history data includes information relating to linkage of independent peers; and wherein the generating a score includes: determining a growth in a number of independent peers that include the document, and scoring the document based, at least in part, on the number of independent peers.</p>
<p>50. The method of claim 1, wherein the one or more types of history data includes information relating to document topics; and wherein the generating a score includes: performing topic extraction relating to the document, monitoring a topic of the document for changes over time, and scoring the document based, at least in part, on changes to the topic of the document.</p>
<p>51. The method of claim 1, further comprising: obtaining a search query, where the identified document is identified as relevant to the search query; and generating a relevancy score for the document based on how relevant the document is to the search query; and wherein the generating a score for the document is based, at least in part, on the one or more types of history data and the relevancy score.</p>
<p>52. A system for scoring a document, comprising: means for identifying a document; means for obtaining a plurality of types of history data associated with the document; and means for generating a score for the document based, at least in part, on the plurality of types of history data.</p>
<p>53. A system for scoring a document, comprising: a history component configured to obtain one or more types of history data associated with a document; and a ranking component configured to: generate a score for the document based, at least in part, on the one or more types of history data.</p>
<p>54. A method for ranking a linked document, comprising: determining an age of linkage data associated with the linked document; and ranking the linked document based on a decaying function of the age of the linkage data.</p>
<p>55. The method of claim 54, wherein the linkage data includes at least one link.</p>
<p>56. The method of claim 54, wherein the linkage data includes anchor text.</p>
<p>57. The method of claim 54, wherein the linkage data includes a rank based, at least in part, on links and anchor text provided by one or more linking documents and related to the linked document.</p>
<p>58. The method of claim 57, further comprising: determining longevity of the linkage data; deriving an indication of content update for a linking document providing the linkage data; and adjusting the ranking of the linked document based on the longevity of the linkage data and the indication of content update for the linking document.</p>
<p>59. The method of claim 58, wherein the adjusting the ranking includes penalizing the ranking if the longevity indicates a short life for the linkage data and boosting the ranking if the longevity indicates a long life for the linkage data.</p>
<p>60. The method of claim 59, wherein the adjusting the ranking further includes penalizing the ranking if at least a portion of content from the linking document is considered stale over a period of time and boosting the ranking if the portion of content from the linking document is considered updated over the period of time.</p>
<p>61. The method of claim 54, further comprising: determining an indication of link churn for a linking document providing the linkage data; and based on the link churn, adjusting the ranking of the linked document.</p>
<p>62. The method of claim 61, wherein the indication of link churn is computed as a function of an extent to which one or more links provided by the linking document change over time.</p>
<p>63. The method of claim 62, wherein adjusting the ranking includes penalizing the ranking if the link churn is above a threshold.</p>
<p align="center"><span class="style10">RELATED APPLICATION </span></p>
<p>[0001] This application claims priority under 35 U.S.C. .sctn. 119 based on U.S. Provisional Application No. 60/507,617, filed Sep. 30, 2003, the disclosure of which is incorporated herein by reference.</p></div>
<div class="shr-publisher-693"></div><!-- Start Shareaholic LikeButtonSetBottom --><!-- End Shareaholic LikeButtonSetBottom -->]]></content:encoded>
			<wfw:commentRss>http://www.bostonmediadomain.com/google-patent-application-20050071741/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Importance of Backlinks</title>
		<link>http://www.bostonmediadomain.com/the-importance-of-backlinks/</link>
		<comments>http://www.bostonmediadomain.com/the-importance-of-backlinks/#comments</comments>
		<pubDate>Mon, 09 Jun 2008 11:30:58 +0000</pubDate>
		<dc:creator>jeff selig</dc:creator>
				<category><![CDATA[Commentary & Analysis]]></category>
		<category><![CDATA[Algorithm]]></category>
		<category><![CDATA[Anchor Text]]></category>
		<category><![CDATA[app]]></category>
		<category><![CDATA[Backlink]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[Directory]]></category>
		<category><![CDATA[Domain]]></category>
		<category><![CDATA[Filter]]></category>
		<category><![CDATA[GA]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google patent]]></category>
		<category><![CDATA[Hyperlink]]></category>
		<category><![CDATA[Inbound Link]]></category>
		<category><![CDATA[Index]]></category>
		<category><![CDATA[ip]]></category>
		<category><![CDATA[IP Address]]></category>
		<category><![CDATA[isp]]></category>
		<category><![CDATA[keyword]]></category>
		<category><![CDATA[Keyword Phrase]]></category>
		<category><![CDATA[Link]]></category>
		<category><![CDATA[Link Building]]></category>
		<category><![CDATA[Link Farm]]></category>
		<category><![CDATA[links]]></category>
		<category><![CDATA[Matt]]></category>
		<category><![CDATA[Need]]></category>
		<category><![CDATA[optimization]]></category>
		<category><![CDATA[Outbound Link]]></category>
		<category><![CDATA[Page]]></category>
		<category><![CDATA[Ping]]></category>
		<category><![CDATA[Query]]></category>
		<category><![CDATA[ranking]]></category>
		<category><![CDATA[Reciprocal Links]]></category>
		<category><![CDATA[Relevancy]]></category>
		<category><![CDATA[Rob]]></category>
		<category><![CDATA[ROS]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Search Engine]]></category>
		<category><![CDATA[search engine optimization]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[stats]]></category>
		<category><![CDATA[themes]]></category>
		<category><![CDATA[Traffic]]></category>
		<category><![CDATA[trust]]></category>
		<category><![CDATA[Update]]></category>
		<category><![CDATA[Visit]]></category>
		<category><![CDATA[web]]></category>
		<category><![CDATA[webmaster]]></category>

		<guid isPermaLink="false">http://bostonmediadomain.com/2008/06/09/the-importance-of-backlinks/</guid>
		<description><![CDATA[If you&#8217;ve read anything about or studied Search Engine Optimization, you&#8217;ve come across the term &#8220;backlink&#8221; at least once. For those of you new to SEO, you may be wondering what a backlink is, and why they are important. Backlinks have become so important to the scope of Search Engine Optimization, that they have become some of the main building blocks to good SEO. In this article, we will explain to you what a backlink is, why they are important, and what you can do to help gain them while avoiding getting into trouble with the Search Engines. What are &#8220;backlinks&#8221;? Backlinks are links that are directed towards your website. Also knows as Inbound links (IBL&#8217;s). The number of backlinks is an indication of the popularity or importance of that website. Backlinks are important for SEO because some search engines, especially Google, will give more credit to websites that have a good number of quality backlinks, and consider those websites more relevant than others in their results pages for a search query. When search engines calculate the relevance of a site to a keyword, they consider the number of QUALITY inbound links to that site. So we should not be [...]]]></description>
			<content:encoded><![CDATA[<!-- Start Shareaholic LikeButtonSetTop --><!-- End Shareaholic LikeButtonSetTop --><div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fthe-importance-of-backlinks%2F"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.bostonmediadomain.com%2Fthe-importance-of-backlinks%2F&amp;source=seosem&amp;style=normal&amp;service=is.gd&amp;hashtags=Algorithm,Anchor+Text,app,Backlink,content,Directory,Domain,Filter,GA,google,google+patent,Hyperlink,Inbound+Link,Index,ip,IP+Address,isp,keyword,Keyword+Phrase,Link,Link+Building,Link+Farm,links,Matt,Need,optimization,Outbound+Link,Page,Ping,Query,ranking,Reciprocal+Links,Relevancy,Rob,ROS,search,Search+Engine,search+engine+optimization,search+engines,SEO,stats,themes,Traffic,trust,Update,Visit,web,webmaster&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><font class="defaultfont"></p>
<p align="justify"> If you&#8217;ve read anything about or studied Search Engine Optimization, you&#8217;ve come across the term &#8220;backlink&#8221; at least once. For those of you new to SEO, you may be wondering what a backlink is, and why they are important. Backlinks have become so important to the scope of Search Engine Optimization, that they have become some of the main building blocks to good SEO. In this article, we will explain to you what a backlink is, why they are important, and what you can do to help gain them while avoiding getting into trouble with the Search Engines.</p>
<p align="justify">What are &#8220;backlinks&#8221;? Backlinks are links that are directed towards your website. Also knows as Inbound links (IBL&#8217;s). The number of backlinks is an indication of the popularity or importance of that website. Backlinks are important for SEO because some search engines, especially Google, will give more credit to websites that have a good number of quality backlinks, and consider those websites more relevant than others in their results pages for a search query.</p>
<p align="justify">When search engines calculate the relevance of a site to a keyword, they consider the number of QUALITY inbound links to that site. So we should not be satisfied with merely getting inbound links, it is the quality of the inbound link that matters.<br />
A search engine considers the content of the sites to determine the QUALITY of a link. When inbound links to your site come from other sites, and those sites have content related to your site, these inbound links are considered more relevant to your site. If inbound links are found on sites with unrelated content, they are considered less relevant. The higher the relevance of inbound links, the greater their quality.</p>
<p align="justify">For example, if a webmaster has a website about how to rescue orphaned kittens, and received a backlink from another website about kittens, then that would be more relevant in a search engine&#8217;s assessment than say a link from a site about car racing. The more relevant the site is that is linking back to your website, the better the quality of the backlink.</p>
<p align="justify">Search engines want websites to have a level playing field, and look for natural links built slowly over time. While it is fairly easy to manipulate links on a web page to try to achieve a higher ranking, it is a lot harder to influence a search engine with external backlinks from other websites. This is also a reason why backlinks factor in so highly into a search engine&#8217;s algorithm. Lately, however, a search engine&#8217;s criteria for quality inbound links has gotten even tougher, thanks to unscrupulous webmasters trying to achieve these inbound links by deceptive or sneaky techniques, such as with hidden links, or automatically generated pages whose sole purpose is to provide inbound links to websites. These pages are called link farms, and they are not only disregarded by search engines, but linking to a link farm could get your site banned entirely.</p>
<p align="justify">Another reason to achieve quality backlinks is to entice visitors to come to your website. You can&#8217;t build a website, and then expect that people will find your website without pointing the way. You will probably have to get the word out there about your site. One way webmasters got the word out used to be through reciprocal linking. Let&#8217;s talk about reciprocal linking for a moment.</p>
<p align="justify">There is much discussion in these last few months about reciprocal linking. In the last Google update, reciprocal links were one of the targets of the search engine&#8217;s latest filter. Many webmasters had agreed upon reciprocal link exchanges, in order to boost their site&#8217;s rankings with the sheer number of inbound links. In a link exchange, one webmaster places a link on his website that points to another webmasters website, and vice versa. Many of these links were simply not relevant, and were just discounted. So while the irrelevant inbound link was ignored, the outbound links still got counted, diluting the relevancy score of many websites. This caused a great many websites to drop off the Google map.</p>
<p align="justify">We must be careful with our reciprocal links. There is a Google patent in the works that will deal with not only the popularity of the sites being linked to, but also how trustworthy a site is that you link to from your own website. This will mean that you could get into trouble with the search engine just for linking to a bad apple. We could begin preparing for this future change in the search engine algorithm by being choosier with which we exchange links right now. By choosing only relevant sites to link with, and sites that don&#8217;t have tons of outbound links on a page, or sites that don&#8217;t practice black-hat SEO techniques, we will have a better chance that our reciprocal links won&#8217;t be discounted.</p>
<p align="justify">Many webmasters have more than one website. Sometimes these websites are related, sometimes they are not. You have to also be careful about interlinking multiple websites on the same IP. If you own seven related websites, then a link to each of those websites on a page could hurt you, as it may look like to a search engine that you are trying to do something fishy. Many webmasters have tried to manipulate backlinks in this way; and too many links to sites with the same IP address is referred to as backlink bombing.</p>
<p align="justify">One thing is certain: interlinking sites doesn&#8217;t help you from a search engine standpoint. The only reason you may want to interlink your sites in the first place might be to provide your visitors with extra resources to visit. In this case, it would probably be okay to provide visitors with a link to another of your websites, but try to keep many instances of linking to the same IP address to a bare minimum. One or two links on a page here and there probably won&#8217;t hurt you.</p>
<p align="justify">There are a few things to consider when beginning your backlink building campaign. It is helpful to keep track of your backlinks, to know which sites are linking back to you, and how the anchor text of the backlink incorporates keywords relating to your site. A tool to help you keep track of your backlinks is the Domain Stats Tool. This tool displays the backlinks of a domain in Google, Yahoo, and MSN. It will also tell you a few other details about your website, like your listings in the Open Directory, or DMOZ, from which Google regards backlinks highly important; Alexa traffic rank, and how many pages from your site that have been indexed, to name just a few.</p>
<p align="justify"> Another tool to help you with your link building campaign is the Backlink Builder Tool. It is not enough just to have a large number of inbound links pointing to your site. Rather, you need to have a large number of QUALITY inbound links. This tool searches for websites that have a related theme to your website which are likely to add your link to their website. You specify a particular keyword or keyword phrase, and then the tool seeks out related sites for you. This helps to simplify your backlink building efforts by helping you create quality, relevant backlinks to your site, and making the job easier in the process.</p>
<p align="justify">There is another way to gain quality backlinks to your site, in addition to related site themes: anchor text. When a link incorporates a keyword into the text of the hyperlink, we call this quality anchor text. A link&#8217;s anchor text may be one of the under-estimated resources a webmaster has. Instead of using words like &#8220;click here&#8221; which probably won&#8217;t relate in any way to your website, using the words &#8220;Please visit our tips page for how to nurse an orphaned kitten&#8221; is a far better way to utilize a hyperlink. A good tool for helping you find your backlinks and what text is being used to link to your site is the Backlink Anchor Text Analysis Tool. If you find that your site is being linked to from another website, but the anchor text is not being utilized properly, you should request that the website change the anchor text to something incorporating relevant keywords. This will also help boost your quality backlinks score.</p>
<p align="justify">Building quality backlinks is extremely important to Search Engine Optimization, and because of their importance, it should be very high on your priority list in your SEO efforts. We hope you have a better understanding of why you need good quality inbound links to your site, and have a handle on a few helpful tools to gain those links.</p>
<p></font></p>
<div class="shr-publisher-415"></div><!-- Start Shareaholic LikeButtonSetBottom --><!-- End Shareaholic LikeButtonSetBottom -->]]></content:encoded>
			<wfw:commentRss>http://www.bostonmediadomain.com/the-importance-of-backlinks/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

