Plucene Search Engine Add-On
TWiki's original search engine is a simple yet powerful tool. However, it cannot search within attached documents. This has been discussed in many topics in the Codev web:
Usage
Indexing with plucindex
The plucindex script indexes all the public webs. It uses some TWiki::Func code to retrieve the list of available webs and the topic list of each web. For each topic, the metadata is inspected and indexed, as is the text body. If the topic has attachments, those are indexed too (see below for more details).
For now, you should run this script manually after installation to create the index files used by plucsearch. If you want, you can also schedule a weekly or monthly cron job to rebuild the index files, or run the script manually when you take your server down for maintenance. To prevent browser access, the script has been placed outside the public bin folder.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev.
Updating with plucupdate
The plucupdate script uses each web's .changes file to find out about topic modifications, much as the old mailnotify did. In addition, a .plucupdate file in each web directory stores the timestamp of the last time the script was run on that web. When the script is executed, it first checks whether any topics have been updated since the last execution. The updated topics are removed from the index and then reindexed (the same goes for their attachments).
This script should be run from an hourly cron job. Like plucindex, it has been placed outside the public bin folder.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev.
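The incremental update pass described in this section can be illustrated with a small sketch. It is written in Python purely for illustration (the add-on itself is Perl) and is not the plucupdate code. It assumes that each line of a .changes file is tab-separated with the topic name in the first field and an epoch timestamp in the third, and that .plucupdate holds a single epoch timestamp; both layouts, and the function names, are assumptions.

```python
from pathlib import Path
from typing import List


def read_last_run(stamp_file: Path) -> int:
    """Return the epoch timestamp saved by the previous run, or 0 if there is none."""
    try:
        return int(stamp_file.read_text().strip())
    except (FileNotFoundError, ValueError):
        return 0


def changed_topics(changes_text: str, last_run: int) -> List[str]:
    """Return each topic modified after last_run, listed once, in file order.

    Assumed line layout (hypothetical): topic <TAB> user <TAB> epoch <TAB> revision
    """
    topics: List[str] = []
    for line in changes_text.splitlines():
        fields = line.split("\t")
        if len(fields) < 3 or not fields[2].isdigit():
            continue  # ignore lines that do not match the assumed layout
        topic, timestamp = fields[0], int(fields[2])
        if timestamp > last_run and topic not in topics:
            topics.append(topic)
    return topics
```

Each topic returned by changed_topics would then be removed from the index and indexed again, and the current time written back to .plucupdate so that the next run starts from there.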
Attachment file types to be indexed
All PDF, HTML and text attachments are indexed by default. To override this setting, use the TWiki preference PLUCENEINDEXEXTENSIONS. The dot before each extension is required. You can copy and paste the following lines into your TWiki.TWikiPreferences topic:
   * Plucene settings
      * Set PLUCENEINDEXEXTENSIONS = .pdf, .html, .txt, .doc
or whatever extensions you want. By default, Plucene comes with PDF, HTML and TXT file support. However, PDF needs additional software to be installed (see the installation instructions). You may also need additional CPAN:Plucene::SearchEngine::Index libraries and additional third-party tools, such as antiword or xlhtml, which provide the required text extraction capabilities. You can find or post additional CPAN:Plucene::SearchEngine::Index libraries for many file types at TWiki:Plugins/SearchEnginePluceneAddOnDev. Thanks again to TWiki:Main/SopanShewale for his contributions.
Searching with plucsearch
The plucsearch script uses the template plucsearch.tmpl (which can easily be adapted to your site skin) or plucsearch.pattern.tmpl (if you use the pattern skin). There is also a PluceneSearch topic with a form ready to use with the plucsearch script.
The query syntax has been improved
PluceneSearch site topic)
Other features
This new version provides some extra functionality:
Search form
The following form submits text to the plucsearch script. The installation instructions are detailed below.
Add-On Installation Instructions
Note: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
   * Plucene settings
      * Set PLUCENEINDEXEXTENSIONS = .pdf, .htm, .html, .txt, .doc
      * Set PLUCENEINDEXPATH = /srv/www/twiki/plucene/index
      * Set PLUCENEATTACHMENTSPATH = /srv/www/twiki/pub
      * Set PLUCENESEARCHATTACHMENTSONLY = 1
      * Set PLUCENESEARCHATTACHMENTSONLYLABEL = Display only attachments
      * Set PLUCENEINDEXVARIABLES = CONTACTINFO, JUSTANOTHERONE
      * Set PLUCENEINDEXSKIPWEBS = Trash, Sandbox
      * Set PLUCENEINDEXSKIPATTACHMENTS = Web.SomeTopic.AnAttachment.txt, Web.OtherTopic.OtherAttachment.pdf
      * Set PLUCENEDEBUG = 1
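As a rough illustration of how the comma-separated settings above might be interpreted when deciding whether an attachment gets indexed, here is a sketch in Python (the add-on itself is Perl, and this is not its code). The function names are hypothetical, and the case-insensitive extension match is an assumption rather than documented behaviour.

```python
from typing import List


def parse_list(value: str) -> List[str]:
    """Split a comma-separated preference value into trimmed, non-empty items."""
    return [item.strip() for item in value.split(",") if item.strip()]


def should_index(web: str, topic: str, filename: str,
                 extensions: List[str],
                 skip_webs: List[str],
                 skip_attachments: List[str]) -> bool:
    """Decide whether one attachment would be indexed under the given settings."""
    if web in skip_webs:
        return False  # the whole web is excluded (PLUCENEINDEXSKIPWEBS)
    if f"{web}.{topic}.{filename}" in skip_attachments:
        return False  # this attachment is excluded (PLUCENEINDEXSKIPATTACHMENTS)
    # Extensions are written with their leading dot, e.g. ".pdf".
    # Matching without regard to case is an assumption of this sketch.
    name = filename.lower()
    return any(name.endswith(ext.lower()) for ext in extensions)
```

With the example values above, parse_list(".pdf, .htm, .html, .txt, .doc") yields the five extensions with their dots, and an attachment in the Sandbox web is rejected regardless of its type.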
Add-On Info
-- TWiki:Main/JoanMVigo - 27 Jun 2006
Some time ago I found Plucene, which is a Perl port of the Java library Lucene. This plugin/add-on is intended to be a topic and attachment search engine with Plucene as its backend.
I would like to thank TWiki:Main.SopanShewale for his many suggestions and contributions.
Note that this plugin has a release for each major TWiki version, namely Cairo and Dakar.
Usage
Indexing with plucindex
The plucindex script indexes all the public webs, using TWiki::Func code to retrieve the list of available webs and the topic list of each. For each topic, the metadata is inspected and indexed, as is the text body. If the topic has attachments, those are indexed as well (see below for more details).
For now, you should run this script manually after installation to create the index files used by plucsearch. You can also schedule a weekly or monthly cron job to rebuild the index files, or run it manually when you take your server down for maintenance. To prevent browser access, the script has been placed outside the public bin folder.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Updating with plucupdate
The plucupdate script uses each web's .changes file to find out about topic modifications, much as the old mailnotify did. A .plucupdate file in each web directory stores the timestamp of the script's last run on that web. When the script is executed, it first checks whether any topics have been updated since the last execution. The most recently updated topics are removed from the index and then reindexed (the same goes for attachments).
This script should be run from an hourly cron job. As before, it has been placed outside the public bin folder.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
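The update cycle described above can be sketched as follows. This is a minimal Python illustration, not the add-on's Perl implementation. It assumes that each .changes line is tab-separated with the topic name in the first field and an epoch timestamp in the third, and that .plucupdate holds a single epoch timestamp. The function names and the `reindex` callback are hypothetical.

```python
import os
import time


def read_last_run(web_dir):
    """Return the timestamp stored in .plucupdate, or 0 if the file is missing."""
    try:
        with open(os.path.join(web_dir, ".plucupdate")) as fh:
            return int(fh.read().strip() or 0)
    except FileNotFoundError:
        return 0


def changed_topics(web_dir, since):
    """List topics whose .changes entries are newer than `since` (no duplicates)."""
    topics = []
    try:
        with open(os.path.join(web_dir, ".changes")) as fh:
            for line in fh:
                fields = line.rstrip("\n").split("\t")
                # Assumed layout: topic, user, epoch time, revision.
                if len(fields) < 3 or not fields[2].isdigit():
                    continue
                if int(fields[2]) > since and fields[0] not in topics:
                    topics.append(fields[0])
    except FileNotFoundError:
        pass
    return topics


def update_web(web_dir, reindex, now=None):
    """Reindex topics changed since the last run, then record the new timestamp."""
    since = read_last_run(web_dir)
    topics = changed_topics(web_dir, since)
    for topic in topics:
        # The callback is expected to drop the old index entry and add the new one.
        reindex(topic)
    stamp = int(now if now is not None else time.time())
    with open(os.path.join(web_dir, ".plucupdate"), "w") as fh:
        fh.write(str(stamp))
    return topics
```

Storing the timestamp per web keeps each run cheap: a web with no entries newer than its .plucupdate value triggers no reindexing at all, which is what makes an hourly schedule practical.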
Attachment file types to be indexed
All PDF, HTML and text attachments are also indexed by default. To override this setting, use the TWiki preference PLUCENEINDEXEXTENSIONS. You can copy and paste the following lines into your TWiki.TWikiPreferences topic:
   * Plucene settings
      * Set PLUCENEINDEXEXTENSIONS = pdf, html, txt, doc
or whatever extensions you want. Remember that you may need additional CPAN:Plucene::SearchEngine::Index libraries and may have to install the required third-party tools, such as antiword or xlhtml. You can find or post additional CPAN:Plucene::SearchEngine::Index libraries for many file types at TWiki:Plugins/SearchEnginePluceneAddOnDev. Thanks again to TWiki:Main/SopanShewale for his contributions.
Searching with plucsearch
The plucsearch script uses the template plucsearch.tmpl (which can easily be adapted to your site skin) or plucsearch.pattern.tmpl (if you use the pattern skin). There is also a PluceneSearch topic with a form ready to use with the plucsearch script.
The query syntax has been improved
Query examples (just type it in your PluceneSearch site topic)
Other features
This new version provides some extra functionality:
Add-On Installation Instructions
Note: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
| |||||||||||||||||||||||||||||||||
Deleted: | |||||||||||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||||||||||
* Plucene settings * Set PLUCENEINDEXEXTENSIONS = pdf, htm, html, txt, doc * Set PLUCENEINDEXPATH = /srv/www/twiki/plucene/index _or whatever path your index folder is located_ * Set PLUCENEATTACHMENTSPATH = /srv/www/twiki/pub _or whatever path your pub folder is located_ * Set PLUCENESEARCHATTACHMENTSONLY = 1 * Set PLUCENESEARCHATTACHMENTSONLYLABEL = Display only attachments * Set PLUCENEINDEXVARIABLES = CONTACTINFO, JUSTANOTHERONE * Set PLUCENEINDEXSKIPWEBS = Trash, Sandbox | |||||||||||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||||||||||
Add-On Info
| |||||||||||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||||||||||
< < | Related Topic: TWikiAddOns | ||||||||||||||||||||||||||||||||
> > | -- TWiki:Main/JoanMVigo - 02 Mar 2006 | ||||||||||||||||||||||||||||||||
Deleted: | |||||||||||||||||||||||||||||||||
< < | -- TWiki:Main/JoanMVigo - 03 Mar 2006 | ||||||||||||||||||||||||||||||||
Plucene Search Engine Add-On
TWiki's original search engine is a simple yet powerful tool. However, it cannot search within attached documents. This has been discussed in many topics in the Codev web: | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | |||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Deleted: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | I'm not a Perl guru, however I found Plucene, which is a Perl port of the java library Lucene, so I tried to implement a new search engine, using Plucene as its backend. | ||||||||||||||||||||||||
> > | Time ago I found Plucene, which is a Perl port of the java library Lucene. So this plugin/addon intends to be a new search engine, with Plucene as its backend. | ||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||
> > | I would like to thank TWiki:Main.SopanShewale for his many suggestions and contributions.
![]() | ||||||||||||||||||||||||
UsageIndexing with plucindex | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | The plucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (see below for more details). | ||||||||||||||||||||||||
> > | The plucindex script indexes all the public webs, and it uses some TWiki::Func code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (see below for more details). | ||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | By now, you should run this script manually after installation to create the index files used by plucsearch . If you want, you can also schedule a weekly or monthly crontab job to create the index files again, or maybe execute it manually when you take down your server for maintenance tasks. It should not be invoked by browser. | ||||||||||||||||||||||||
> > | By now, you should run this script manually after installation to create the index files used by plucsearch . If you want, you can also schedule a weekly or monthly crontab job to create the index files again, or maybe execute it manually when you take down your server for maintenance tasks. To prevent browser access, it has been placed out of the public bin folder. | ||||||||||||||||||||||||
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev | |||||||||||||||||||||||||
Deleted: | |||||||||||||||||||||||||
< < | Searching with plucsearchTheplucsearch script uses one of the templates plucsearh.tmpl (that can be adapted to your site skin easily) or the plucsearch.pattern.tmpl (if you use the pattern skin). There is also a PluceneSearch topic with a form ready to use with plucsearch script.
However, the query syntax is quite different:
PluceneSearch site topic)
| ||||||||||||||||||||||||
Updating with plucupdate | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | The plucupdate script uses the web's .changes files to know about topic modifications, in a way such mailnotify works. Also, a .plucupdate file is used on each web directory storing the last timestamp the script was run on it. So when this script is executed, first checks if there are any topic updates since last execution. The most recent topic updates are removed from the index and then reindexed again (the same goes for attachments). | ||||||||||||||||||||||||
> > | The plucupdate script uses the web's .changes files to know about topic modifications, in a way such old mailnotify worked. Also, a .plucupdate file is used on each web directory storing the last timestamp the script was run on it. So when this script is executed, first checks if there are any topic updates since last execution. The most recent topic updates are removed from the index and then reindexed again (the same goes for attachments). | ||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | This script should be executed by an hourly crontab. It should not be invoked by browser. | ||||||||||||||||||||||||
> > | This script should be executed by an hourly crontab. As before, this script has been placed out of the public bin folder. | ||||||||||||||||||||||||
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Attachment file types to be indexedAll the PDF, HTML and text attachments are also indexed by default. If you want to override this setting you can use a TWiki preferencePLUCENEINDEXEXTENSIONS . You can copy & paste the next lines in your TWiki.TWikiPreferences topic | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
or whatever extensions you want. Remember that you may need additional CPAN:Plucene::SearchEngine::Index libraries and install required third party tools such as antiword or xlhtml. | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | You can find/post additional CPAN:Plucene::SearchEngine::Index libraries for many file types at TWiki:Plugins/SearchEnginePluceneAddOnDev. Thanks to TWiki:Main/SopanShewale for his contributions. | ||||||||||||||||||||||||
> > | You can find/post additional CPAN:Plucene::SearchEngine::Index libraries for many file types at TWiki:Plugins/SearchEnginePluceneAddOnDev. Thanks again to TWiki:Main/SopanShewale for his contributions. | ||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||
> > | Searching with plucsearchTheplucsearch script uses a template plucsearch.tmpl (that can be adapted to your site skin easily) or the plucsearch.pattern.tmpl (if you use the pattern skin). There is also a PluceneSearch topic with a form ready to use with the plucsearch script.
The query syntax has been improved
PluceneSearch site topic)
Other featuresThis new version provides some extra functionality:
| ||||||||||||||||||||||||
Add-On Installation InstructionsNote: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running. | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
Add-On Info | |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < |
| ||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
| |||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||
> > |
| ||||||||||||||||||||||||
| |||||||||||||||||||||||||
Changed: | |||||||||||||||||||||||||
< < | -- TWiki:Main/JoanMVigo - 15 Dec 2004 | ||||||||||||||||||||||||
> > | -- TWiki:Main/JoanMVigo - 03 Mar 2006 | ||||||||||||||||||||||||
Plucene Search Engine Add-On
TWiki's original search engine is a simple yet powerful tool. However, it cannot search within attached documents. This has been discussed in many topics in the Codev web:
UsageIndexing with plucindex | |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < | The plucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (only PDF/HTML/TXT). | ||||||||||||||||||||
> > | The plucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (see below for more details). | ||||||||||||||||||||
For now, you should run this script manually after installation to create the index files used by plucsearch. You can also schedule a weekly or monthly cron job to rebuild the index files, or run it manually when you take your server down for maintenance. It should not be invoked from a browser.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Searching with plucsearch | |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < | The plucsearch script uses the plucsearh.tmpl template that can be adapted to your site skin easily. I've also attached a PluceneSearch topic with a form ready to use with plucsearch script. | ||||||||||||||||||||
> > | The plucsearch script uses one of the templates plucsearh.tmpl (that can be adapted to your site skin easily) or the plucsearch.pattern.tmpl (if you use the pattern skin). There is also a PluceneSearch topic with a form ready to use with plucsearch script. | ||||||||||||||||||||
However, the query syntax is quite different:
| |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < |
| ||||||||||||||||||||
> > |
| ||||||||||||||||||||
| |||||||||||||||||||||
Added: | |||||||||||||||||||||
> > |
| ||||||||||||||||||||
Query examples (just type it in your PluceneSearch site topic)
| |||||||||||||||||||||
Added: | |||||||||||||||||||||
> > |
| ||||||||||||||||||||
| |||||||||||||||||||||
Added: | |||||||||||||||||||||
> > |
| ||||||||||||||||||||
To suggest searching improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Updating with plucupdate
The plucupdate script uses each web's .changes file to find out about topic modifications, much as mailnotify does. A .plucupdate file in each web directory stores the timestamp of the script's last run on that web. When the script is executed, it first checks whether any topics have been updated since the last execution. The most recently updated topics are removed from the index and then reindexed (the same goes for attachments).
This script should be run from an hourly cron job. It should not be invoked from a browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev | |||||||||||||||||||||
Added: | |||||||||||||||||||||
> > | Attachment file types to be indexed
All PDF, HTML and text attachments are also indexed by default. To override this setting, use the TWiki preference PLUCENEINDEXEXTENSIONS. You can copy and paste the following lines into your TWiki.TWikiPreferences topic:
   * Plucene settings
      * Set PLUCENEINDEXEXTENSIONS = .pdf,.html,.txt,.doc
or whatever extensions you want. Remember that you may need additional CPAN:Plucene::SearchEngine::Index libraries and may have to install the required third-party tools, such as antiword or xlhtml. You can find or post additional CPAN:Plucene::SearchEngine::Index libraries for many file types at TWiki:Plugins/SearchEnginePluceneAddOnDev. Thanks to TWiki:Main/SopanShewale for his contributions. | ||||||||||||||||||||
Add-On Installation Instructions
Note: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
| |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < |
| ||||||||||||||||||||
> > |
| ||||||||||||||||||||
Added: | |||||||||||||||||||||
> > |
| ||||||||||||||||||||
| |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < |
| ||||||||||||||||||||
> > |
| ||||||||||||||||||||
Added: | |||||||||||||||||||||
> > | * Plucene settings * Set PLUCENEINDEXPATH = /srv/www/personal/index or whatever path your index folder is located | ||||||||||||||||||||
Add-On Info
| |||||||||||||||||||||
Added: | |||||||||||||||||||||
> > |
| ||||||||||||||||||||
| |||||||||||||||||||||
Changed: | |||||||||||||||||||||
< < | -- TWiki:Main/JoanMVigo - 26 Nov 2004 | ||||||||||||||||||||
> > | -- TWiki:Main/JoanMVigo - 15 Dec 2004 | ||||||||||||||||||||
Changed: | |||||||||||||||||||||||||||||||||||||||||||||||
< < | Plucene Search Engine Add-OnTWiki original search engine is a simple yet powerful tool. However, it can not search within attached documents. That has been discused in many topics in the Codev web:
UsageIndexing with plucindexTheplucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (only PDF/HTML/TXT).
By now, you should run this script manually after installation to create the index files used by plucsearch . If you want, you can also schedule a weekly or monthly crontab job to create the index files again, or maybe execute it manually when you take down your server for maintenance tasks. It should not be invoked by browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Searching with plucsearchTheplucsearch script uses the plucsearh.tmpl template that can be adapted to your site skin easily. I've also attached a PluceneSearch topic with a form ready to use with plucsearch script.
However, the query syntax is quite different:
PluceneSearch site topic)
Updating with plucupdateTheplucupdate script uses the web's .changes files to know about topic modifications, in a way such mailnotify works. Also, a .plucupdate file is used on each web directory storing the last timestamp the script was run on it. So when this script is executed, first checks if there are any topic updates since last execution. The most recent topic updates are removed from the index and then reindexed again (the same goes for attachments).
This script should be executed by an hourly crontab. It should not be invoked by browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Add-On Installation InstructionsNote: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
Add-On Info
| ||||||||||||||||||||||||||||||||||||||||||||||
> > | Plucene Search Engine Add-OnTWiki original search engine is a simple yet powerful tool. However, it can not search within attached documents. That has been discused in many topics in the Codev web:
UsageIndexing with plucindexTheplucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (only PDF/HTML/TXT).
By now, you should run this script manually after installation to create the index files used by plucsearch . If you want, you can also schedule a weekly or monthly crontab job to create the index files again, or maybe execute it manually when you take down your server for maintenance tasks. It should not be invoked by browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Searching with plucsearchTheplucsearch script uses the plucsearh.tmpl template that can be adapted to your site skin easily. I've also attached a PluceneSearch topic with a form ready to use with plucsearch script.
However, the query syntax is quite different:
PluceneSearch site topic)
Updating with plucupdateTheplucupdate script uses the web's .changes files to know about topic modifications, in a way such mailnotify works. Also, a .plucupdate file is used on each web directory storing the last timestamp the script was run on it. So when this script is executed, first checks if there are any topic updates since last execution. The most recent topic updates are removed from the index and then reindexed again (the same goes for attachments).
This script should be executed by an hourly crontab. It should not be invoked by browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Add-On Installation InstructionsNote: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
Add-On Info
| ||||||||||||||||||||||||||||||||||||||||||||||
Added: | |||||||||||||||||||||||||||||||||||||||||||||||
> > | -- TWiki:Main/JoanMVigo - 26 Nov 2004 | ||||||||||||||||||||||||||||||||||||||||||||||
Plucene Search Engine Add-On
TWiki's original search engine is a simple yet powerful tool. However, it cannot search within attached documents. This has been discussed in many topics in the Codev web: | |||||||||||||||
Changed: | |||||||||||||||
< < |
| ||||||||||||||
> > |
| ||||||||||||||
I'm not a Perl guru; however, I found Plucene, a Perl port of the Java library Lucene, so I tried to implement a new search engine using Plucene as its backend.
UsageIndexing with plucindexTheplucindex script indexes all the content of your data folder, and it uses some TWiki code to retrieve the list of available webs and to retrieve their topic list. For each topic, the meta data is inspected and indexed, as the text body. Also, if the topic has attachments, those are indexed (only PDF/HTML/TXT). | |||||||||||||||
Changed: | |||||||||||||||
< < | By now, you should run this script manually each time you want the index files to be updated, or just add an hourly or daily crontab to run it automatically. It should not be invoked by browser. | ||||||||||||||
> > | By now, you should run this script manually after installation to create the index files used by plucsearch . If you want, you can also schedule a weekly or monthly crontab job to create the index files again, or maybe execute it manually when you take down your server for maintenance tasks. It should not be invoked by browser. | ||||||||||||||
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Searching with plucsearch
The plucsearch script uses the plucsearh.tmpl template, which can easily be adapted to your site skin. I've also attached a PluceneSearch topic with a form ready to use with the plucsearch script.
However, the query syntax is quite different:
| |||||||||||||||
Deleted: | |||||||||||||||
< < | |||||||||||||||
PluceneSearch site topic)
| |||||||||||||||
Added: | |||||||||||||||
> > | Updating with plucupdateTheplucupdate script uses the web's .changes files to know about topic modifications, in a way such mailnotify works. Also, a .plucupdate file is used on each web directory storing the last timestamp the script was run on it. So when this script is executed, first checks if there are any topic updates since last execution. The most recent topic updates are removed from the index and then reindexed again (the same goes for attachments).
This script should be executed by an hourly crontab. It should not be invoked by browser.
Please, to suggest indexing improvements read/post to TWiki:Plugins/SearchEnginePluceneAddOnDev | ||||||||||||||
Add-On Installation InstructionsNote: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
| |||||||||||||||
Added: | |||||||||||||||
> > |
| ||||||||||||||
| |||||||||||||||
Changed: | |||||||||||||||
< < |
| ||||||||||||||
> > |
| ||||||||||||||
| |||||||||||||||
Added: | |||||||||||||||
> > |
| ||||||||||||||
Add-On Info
| |||||||||||||||
Changed: | |||||||||||||||
< < |
| ||||||||||||||
> > |
| ||||||||||||||
| |||||||||||||||
Changed: | |||||||||||||||
< < |
| ||||||||||||||
> > |
| ||||||||||||||
Added: | |||||||||||||||
> > |
| ||||||||||||||
| |||||||||||||||
Changed: | |||||||||||||||
< < |
| ||||||||||||||
> > |
| ||||||||||||||
| |||||||||||||||
Changed: | |||||||||||||||
< < | -- TWiki:Main/JoanMVigo - 18 Nov 2004 | ||||||||||||||
> > | -- TWiki:Main/JoanMVigo - 23 Nov 2004 | ||||||||||||||
Deleted: | |||||||||||||||
< < | |||||||||||||||
Plucene Search Engine Add-On
TWiki's original search engine is a simple yet powerful tool. However, it cannot search within attached documents. This has been discussed in many topics in the Codev web:
Usage
Indexing with plucindex
The plucindex script indexes all the content of your data folder, using some TWiki code to retrieve the list of available webs and the topic list of each. For each topic, the metadata is inspected and indexed, as is the text body. If the topic has attachments, those are indexed as well (only PDF/HTML/TXT).
For now, you should run this script manually each time you want the index files to be updated, or add an hourly or daily cron job to run it automatically. It should not be invoked from a browser.
To suggest indexing improvements, please read or post to TWiki:Plugins/SearchEnginePluceneAddOnDev
Searching with plucsearch
The plucsearch script uses the plucsearh.tmpl template, which can easily be adapted to your site skin. I've also attached a PluceneSearch topic with a form ready to use with the plucsearch script.
However, the query syntax is quite different:
PluceneSearch site topic)
Add-On Installation Instructions
Note: You do not need to install anything on the browser to use this add-on. The following instructions are for the administrator who installs the add-on on the server where TWiki is running.
Add-On Info
|