Sophie

Sophie

distrib > Fedora > 18 > i386 > by-pkgid > d0983343df85ecf7d844c2cfc3a0597a > files > 486

python-whoosh-2.5.1-1.fc18.noarch.rpm



<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
  "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">


<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
    
    <title>Introduction to Whoosh &mdash; Whoosh 2.5.1 documentation</title>
    
    <link rel="stylesheet" href="_static/default.css" type="text/css" />
    <link rel="stylesheet" href="_static/pygments.css" type="text/css" />
    
    <script type="text/javascript">
      var DOCUMENTATION_OPTIONS = {
        URL_ROOT:    '',
        VERSION:     '2.5.1',
        COLLAPSE_INDEX: false,
        FILE_SUFFIX: '.html',
        HAS_SOURCE:  true
      };
    </script>
    <script type="text/javascript" src="_static/jquery.js"></script>
    <script type="text/javascript" src="_static/underscore.js"></script>
    <script type="text/javascript" src="_static/doctools.js"></script>
    <link rel="top" title="Whoosh 2.5.1 documentation" href="index.html" />
    <link rel="next" title="Glossary" href="glossary.html" />
    <link rel="prev" title="Quick start" href="quickstart.html" /> 
  </head>
  <body>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="genindex.html" title="General Index"
             accesskey="I">index</a></li>
        <li class="right" >
          <a href="py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="glossary.html" title="Glossary"
             accesskey="N">next</a> |</li>
        <li class="right" >
          <a href="quickstart.html" title="Quick start"
             accesskey="P">previous</a> |</li>
        <li><a href="index.html">Whoosh 2.5.1 documentation</a> &raquo;</li> 
      </ul>
    </div>  

    <div class="document">
      <div class="documentwrapper">
        <div class="bodywrapper">
          <div class="body">
            
  <div class="section" id="introduction-to-whoosh">
<h1>Introduction to Whoosh<a class="headerlink" href="#introduction-to-whoosh" title="Permalink to this headline">¶</a></h1>
<div class="section" id="about-whoosh">
<h2>About Whoosh<a class="headerlink" href="#about-whoosh" title="Permalink to this headline">¶</a></h2>
<p>Whoosh was created by <a class="reference external" href="mailto:matt&#37;&#52;&#48;whoosh&#46;ca">Matt Chaput</a>. It started as a quick and dirty
search server for the online documentation of the <a class="reference external" href="http://www.sidefx.com/">Houdini</a>
3D animation software package. Side Effects Software generously allowed Matt to open source
the code in case it might be useful to anyone else who needs a very flexible or pure-Python
search engine (or both!).</p>
<ul class="simple">
<li>Whoosh is fast, but uses only pure Python, so it will run anywhere Python runs,
without requiring a compiler.</li>
<li>By default, Whoosh uses the <a class="reference external" href="http://en.wikipedia.com/wiki/BM25">Okapi BM25F</a> ranking
function, but like most things the ranking function can be easily customized.</li>
<li>Whoosh creates fairly small indexes compared to many other search libraries.</li>
<li>All indexed text in Whoosh must be <em>unicode</em>.</li>
<li>Whoosh lets you store arbitrary Python objects with indexed documents.</li>
</ul>
</div>
<div class="section" id="what-is-whoosh">
<h2>What is Whoosh?<a class="headerlink" href="#what-is-whoosh" title="Permalink to this headline">¶</a></h2>
<p>Whoosh is a fast, pure Python search engine library.</p>
<p>The primary design impetus of Whoosh is that it is pure Python. You should be able to
use Whoosh anywhere you can use Python, no compiler or Java required.</p>
<p>Like one of its ancestors, Lucene, Whoosh is not really a search engine, it&#8217;s a programmer
library for creating a search engine <a class="footnote-reference" href="#id2" id="id1">[1]</a>.</p>
<p>Practically no important behavior of Whoosh is hard-coded. Indexing
of text, the level of information stored for each term in each field, parsing of search queries,
the types of queries allowed, scoring algorithms, etc. are all customizable, replaceable, and
extensible.</p>
<table class="docutils footnote" frame="void" id="id2" rules="none">
<colgroup><col class="label" /><col /></colgroup>
<tbody valign="top">
<tr><td class="label"><a class="fn-backref" href="#id1">[1]</a></td><td>It would of course be possible to build a turnkey search engine on top of Whoosh,
like Nutch and Solr use Lucene.</td></tr>
</tbody>
</table>
</div>
<div class="section" id="what-can-whoosh-do-for-you">
<h2>What can Whoosh do for you?<a class="headerlink" href="#what-can-whoosh-do-for-you" title="Permalink to this headline">¶</a></h2>
<p>Whoosh lets you index free-form or structured text and then quickly find matching
documents based on simple or complex search criteria.</p>
</div>
<div class="section" id="getting-help-with-whoosh">
<h2>Getting help with Whoosh<a class="headerlink" href="#getting-help-with-whoosh" title="Permalink to this headline">¶</a></h2>
<p>You can view outstanding issues on the
<a class="reference external" href="http://bitbucket.org/mchaput/whoosh">Whoosh Bitbucket page</a>
and get help on the <a class="reference external" href="http://groups.google.com/group/whoosh">Whoosh mailing list</a>.</p>
</div>
</div>


          </div>
        </div>
      </div>
      <div class="sphinxsidebar">
        <div class="sphinxsidebarwrapper">
  <h3><a href="index.html">Table Of Contents</a></h3>
  <ul>
<li><a class="reference internal" href="#">Introduction to Whoosh</a><ul>
<li><a class="reference internal" href="#about-whoosh">About Whoosh</a></li>
<li><a class="reference internal" href="#what-is-whoosh">What is Whoosh?</a></li>
<li><a class="reference internal" href="#what-can-whoosh-do-for-you">What can Whoosh do for you?</a></li>
<li><a class="reference internal" href="#getting-help-with-whoosh">Getting help with Whoosh</a></li>
</ul>
</li>
</ul>

  <h4>Previous topic</h4>
  <p class="topless"><a href="quickstart.html"
                        title="previous chapter">Quick start</a></p>
  <h4>Next topic</h4>
  <p class="topless"><a href="glossary.html"
                        title="next chapter">Glossary</a></p>
  <h3>This Page</h3>
  <ul class="this-page-menu">
    <li><a href="_sources/intro.txt"
           rel="nofollow">Show Source</a></li>
  </ul>
<div id="searchbox" style="display: none">
  <h3>Quick search</h3>
    <form class="search" action="search.html" method="get">
      <input type="text" name="q" />
      <input type="submit" value="Go" />
      <input type="hidden" name="check_keywords" value="yes" />
      <input type="hidden" name="area" value="default" />
    </form>
    <p class="searchtip" style="font-size: 90%">
    Enter search terms or a module, class or function name.
    </p>
</div>
<script type="text/javascript">$('#searchbox').show(0);</script>
        </div>
      </div>
      <div class="clearer"></div>
    </div>
    <div class="related">
      <h3>Navigation</h3>
      <ul>
        <li class="right" style="margin-right: 10px">
          <a href="genindex.html" title="General Index"
             >index</a></li>
        <li class="right" >
          <a href="py-modindex.html" title="Python Module Index"
             >modules</a> |</li>
        <li class="right" >
          <a href="glossary.html" title="Glossary"
             >next</a> |</li>
        <li class="right" >
          <a href="quickstart.html" title="Quick start"
             >previous</a> |</li>
        <li><a href="index.html">Whoosh 2.5.1 documentation</a> &raquo;</li> 
      </ul>
    </div>
    <div class="footer">
        &copy; Copyright 2007-2012 Matt Chaput.
      Created using <a href="http://sphinx.pocoo.org/">Sphinx</a> 1.1.3.
    </div>
  </body>
</html>