Apps

Module Indexation des fichiers 2.3.1

Description

Ce module gère dans JCMS l'indexation textuelle de la plupart des formats de fichiers bureautiques, et permet donc de faire une recherche textuelle à l'intérieur.

Ce module peut-être complété du module de conversion PDF pour offrir une alternative à JCMS Universal.

Attention ! La licence de production de ce module est payante. Ce module peut être déployé en production par les utilisateurs disposant d'une license JCMS Universal.


Captures d'écran

1. Onglet Statuts des fichiers
2. Onglet Processeurs
3. Onglet Extensions

Installation

Suivez la procédure d'installation dans le Gestionnaire de Modules et redémarrez le site.


Changements

File Processor Plugin - Version 2.3.1

Bug

  • FPP-76 : Plugin version 2.3 was compiled for Java 1.5 only

File Processor Plugin - Version 2.3

Bug

  • FPP-74 : File indexing is stopped with LockObtainFailedException on WEB-INF\data\lucene\FilesIndex\write.lock
  • FPP-72 : Some Words in indexed files are not searchable

Enhancement * If .doc files are RTF files they are parsed by the RTF parser

File Processor Plugin - Version 2.2

Bug

  • FPP-72 : Some Words in indexed files are not searchable

Enhancement

  • If .doc files are RTF files they are parsed by the RTF parser

New Feature

  • FileProcessor and FileParser may have now an availability

==> Warning : - method "isAvailable" now added for FileActionComponant. - Plugin PDFConverter 1.1 or more needed

  • Admin interface in OPERATION instead of MONITORING
  • Size of FileStatus is displayed in interface
  • When a file is globally blacklisted, it is no longer considered as OK (nor NOK).
  • Lib Update : tm-extractors-0.4 to tm-extractors-1.0
    • Fast save format support for Word document added
    • Old Word for Windows :
      • Word 4.0 : 1987 for PC (!)
      • Word 1.0 : 1989 for Windows 1.0;
      • Word 2.0 : 1991 for Windows 3.1;

(cf : http://fr.wikipedia.org/wiki/Word) See also : http://www.textmining.org/ and http://code.google.com/p/text-mining/

  • Copyrights concerns and references added in documentation for :
    • jxl : LGPL
    • TextMining Extractor : LGPL
    • POI : Apache Licence v2.0
    • PDFBox : provided "as is"

File Processor Plugin - Version 2.1

Bug

  • [FPP-24] - Support link or junction that make a cyclic reference in upload directory.
  • [FPP-47] - When there is a problem for a repository, it must appear not only in logs but also in admin interface

New Feature

  • [FPP-3] - [performance] Unitary Processing while a FileDocument is created
  • [FPP-9] - [robustness,performance] Set a producer-consumer mecanism
  • [FPP-52] - Add an ability to re process a file, whatever happend with it

File Processor Plugin - Version 2.0.1

Bug

  • [FPP-43] - Keep some parameter when a redirect is done in AdminHandler
  • [FPP-49] - Some DELETE operation on files are not taken in account
  • [FPP-53] - When a file is modified, and was previously indexed, it is indicated as indexed

Improvement

  • [FPP-41] - Add parser/processor filter in file list
  • [FPP-42] - Schedule processings for files never processed at first
  • [FPP-44] - Improve unknown exception logging
  • [FPP-45] - Processor thread's name is way too long and clutters jcms.log
  • [FPP-46] - Output log message "File processing performed in XX ms" only when some files have been processed
  • [FPP-48] - Reduce FPP Alarm Manager thread name
  • [FPP-50] - In Admin interface keep search parameter when blacklisting action is done

New Feature

  • [FPP-51] - Add a way to blacklist many files

File Processor Plugin - Version 2.0

Bug

  • [FPP-28] - [POI] Bug POI on secure word document cause an OutOfMemoryException

Improvement

  • [FPP-22] - [library] upgrade POI 3.0.1 (specific packaging cvs extracted)
  • [FPP-23] - [properties management] List of FileProcessors becomes a list (map) of properties instead of a property of a list
  • [FPP-37] - Partially BlackList a file for a processor if processing cause an unknown exception and duration is over a specified duration
  • [FPP-40] - Differenciate Parser and Processors, making two distinct interfaces

New Feature

  • Partially - [FPP-34] - [IHM] Admin interface - Gives an admin interface to : - allow to launch/stop the alarm listener by a admin act;

(missing ability to modify properties repository by repository)

  • [FPP-16] - [IHM] Display the list of file type treated and processor descriptions
  • [FPP-26] - Allow using FileProcessor and FPP architecture to index differently and in another directory
  • [FPP-29] - File Repository differenciation
  • [FPP-30] - JDring use for FFP
  • [FPP-31] - BlackList
  • [FPP-32] - Typing Processing Exception to blacklist or not files for a FP
  • [FPP-35] - Store the time of a process or index, to display it to an admin
  • [FPP-36] - Allow an admin to manually globally or partially blacklist a file
  • [FPP-39] - Put a Target in admin interface to allow sub plugin to add something in FPP admin IHM

File Processor Plugin - Version 1.1

Bug

  • [FPP-2] - [logs] DEBUG level for logs is used while TRACE level should be used
  • [FPP-8] - [IHM] While indexation is done, the indexation date in IHM is incorrect

The correction impacts JCMS too, the complete correction of [FPP-8] require JCMS-5.7.2

New Feature

  • [FPP-5] - [security,cluster] Set witness file under WEB-INF/data/plugin/FileProcessor instead of upload
  • [FPP-6] - [logs] Use log4j NDC ability
  • [FPP-10] - [caractere encoding] witness file in UTF-8
  • [FPP-15] - [robustness] Make Processing Thread priority as a property

Task

  • [FPP-1] - [default values] change processing periodicity
  • [FPP-7] - [logs] Set corrects logs message (language pb)
  • [FPP-13] - [properties]Set fr and en properties to some properties
  • [FPP-18] - [javadoc] Correct comments
  • [FPP-19] - Improve plugin description
  • [FPP-21] - [Plugin] Resize the preview image

Indexation Plugin v1.0

1. Main new features

  • File Processors Management
  • File Processors Parser for documents type :
    • Html;
    • MS Excel;
    • MS Powerpoint;
    • MS Word;
    • Open Office;
    • PDF;
    • Text;
    • XML.
  • No parser for MS Office 2007 (Open XML).

2. Main updates

3. Bugs fixed

FAQ

1. Quels sont les formats de fichiers supportés ?

Le module d'indexation des fichiers traites les formats suivants :

  • Microsoft Office (Word, Excel, PowerPoint), jusqu'à Microsoft Office 2003
  • OpenOffice
  • PDF
  • RTF
  • HTML
  • Texte

Le format OpenXML, utilisé par Microsoft Office 2007 sera pris en charge dans une prochaine version.

2. Qui peut utiliser ce module ?

Ce module est réservé aux clients du module JCMS Universal.

Informations

Version
  • 2.3.1
Stabilité
  • Stable
Compatibilité
  • JCMS 5.7.4
    JCMS 5.7.5
Certifié Jalios
  • Oui
Prix
  • Module payant
Support
  • Jalios Support
Auteur
  • Jalios SA
Licence
  • Jalios
Taille
  • 7,05 Mo
Mis-à-jour
  • 15/02/11
Téléchargements
  • 8