Change the future

Friday 5:10 p.m.–5:40 p.m.

Whoosh, the open-source Python search library

Matt Chaput

Audience level:
Intermediate
Category:
Useful Libraries

Description

From humble beginnings when I first learned Python just to write a search engine to make online help searchable, Whoosh has grown and matured to match the capabilities of much larger projects such as Lucene. This talk will explain simple uses of Whoosh to index and search documents, and demonstrate more advanced features such as faceting.

Abstract

  1. Introduction
  2. Who am I?
  3. A bit of history
  4. Who is using Whoosh?

  5. Using Whoosh (live demo)

  6. Examples of indexing documents
  7. Examples of searching
  8. Simple tweaks to analysis and query parsing for custom results

  9. Feature tour

  10. Highlighting
  11. Date/time parsing
  12. Sorting
  13. Spell checking
  14. Faceting
  15. Hierarchical searching

  16. Overview of architecture and algorithms

  17. Future directions (time permitting)