Class AsciiFoldingDataProcessor

  • All Implemented Interfaces:
    DocumentPreProcessor

    public class AsciiFoldingDataProcessor
    extends Object
    implements DocumentPreProcessor
    For every searchable field, converts alphabetic, numeric, and symbolic Unicode characters that are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, where one exists.
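
The kind of conversion described above can be illustrated with a minimal sketch. It is not this class's implementation: it assumes a simple approach based on the JDK's java.text.Normalizer, and the names AsciiFoldingSketch and foldToAscii are hypothetical. It only handles characters that decompose into a base letter plus combining marks; a full folding table covers many more characters.

```java
import java.text.Normalizer;

public class AsciiFoldingSketch {

    // Assumption: folding is approximated by canonical decomposition (NFD)
    // followed by removal of all combining marks. Characters that have no
    // canonical decomposition are left unchanged by this sketch.
    public static String foldToAscii(String input) {
        String decomposed = Normalizer.normalize(input, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "");
    }

    public static void main(String[] args) {
        // "Caf\u00e9 M\u00fcnster" is "Café Münster" written with Unicode escapes.
        System.out.println(foldToAscii("Caf\u00e9 M\u00fcnster")); // prints "Cafe Munster"
    }
}
```

Folding both the indexed field values and the incoming search terms in the same way lets a query typed without accents match accented source text.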
    • Constructor Detail

      • AsciiFoldingDataProcessor

        public AsciiFoldingDataProcessor()
    • Method Detail

      • initialize

        public void initialize(FieldConfigAccess fieldConfig,
                               Map<String,String> preProcessorConfig)
        Description copied from interface: DocumentPreProcessor
        Implementations MUST have a no-args constructor. This method is called afterwards to configure the instance.
        Specified by:
        initialize in interface DocumentPreProcessor
        Parameters:
        preProcessorConfig - a custom string-to-string map that can be configured per DocumentPreProcessor.
      • process

        public boolean process(Document sourceDocument,
                               boolean visible)
        Description copied from interface: DocumentPreProcessor
        Called for each source document.
        Specified by:
        process in interface DocumentPreProcessor
        Parameters:
        visible - whether or not the record is currently marked for indexing.
        Returns:
        true if the record should be indexed, false otherwise.
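
The lifecycle described above (no-args construction, then initialize, then one process call per source document) can be sketched as follows. This is an illustration under stated assumptions, not the real API: SimplePreProcessor is a hypothetical, simplified stand-in for DocumentPreProcessor that replaces Document with a plain map and omits FieldConfigAccess, and TrimmingProcessor and its "field" setting are invented for the example.

```java
import java.util.HashMap;
import java.util.Map;

public class PreProcessorLifecycleSketch {

    // Hypothetical stand-in for DocumentPreProcessor. The real interface takes
    // a FieldConfigAccess and a Document, which are not modeled here.
    public interface SimplePreProcessor {
        void initialize(Map<String, String> preProcessorConfig);
        boolean process(Map<String, String> sourceDocument, boolean visible);
    }

    // Invented example processor: trims the value of one configurable field.
    public static class TrimmingProcessor implements SimplePreProcessor {
        private String field = "title";

        // The no-args constructor is what allows instantiation before configuration.
        public TrimmingProcessor() { }

        @Override
        public void initialize(Map<String, String> preProcessorConfig) {
            // "field" is a hypothetical configuration key.
            field = preProcessorConfig.getOrDefault("field", field);
        }

        @Override
        public boolean process(Map<String, String> sourceDocument, boolean visible) {
            sourceDocument.computeIfPresent(field, (key, value) -> value.trim());
            // Assumption: this processor keeps the incoming indexing decision.
            return visible;
        }
    }

    public static void main(String[] args) {
        SimplePreProcessor processor = new TrimmingProcessor();
        processor.initialize(Map.of("field", "name"));

        Map<String, String> document = new HashMap<>();
        document.put("name", "  Ada  ");

        boolean index = processor.process(document, true);
        System.out.println(index + " [" + document.get("name") + "]"); // prints "true [Ada]"
    }
}
```

A processor that wants to exclude a record from the index would return false from process instead of passing visible through.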