Lucene

Apache Lucene is indexing and search software. It is particularly suitable for indexing text in a variety of file formats. It can also carry out other text analysis functions. Lucene is often used in conjunction with other, closely-related Apache things such as the open-source Solr search server, Hadoop and Mahout.

Although Lucene is written in Java, ports are available for other programming language, such as PyLucene, a Pyhon port of the Core Lucene project.

Category: 
Text mining
Availability: 
Offline
Other software required: 
Other software required
Difficulty: 
Advanced
User Community: 
Mailing list; wiki
Active Development: 
Active development
Purpose: 
Single purpose
Operating System: 
Windows
Operating System: 
Mac
Operating System: 
Unix
System Requirements: 
Java, or other software if using a version ported to that software.