Package it.unimi.di.big.mg4j.document.tika

This package contains classes that expose Tika parsers as MG4J factories.

Package it.unimi.di.big.mg4j.document.tika Description

This package contains classes that expose Tika parsers as MG4J factories. Each type of Tika metadata is mapped, when possible, to an MG4J field. However, with an AutoDetectDocumentFactory, or any other factory whose metadata fields are user-definable or otherwise variable, no static listing of all available fields can be provided, because the fields depend on the actual factory used to parse the document. In this case, an instance of GreedyTikaField returns some useful data to the caller by (essentially) concatenating the string representations of all metadata fields.
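
The concatenation idea behind the greedy field can be illustrated with a minimal, dependency-free sketch. It is not the actual GreedyTikaField implementation: the class name GreedyFieldSketch, the method concatenate, the single-space separator, and the use of a plain Map in place of Tika's metadata object are all assumptions made for illustration only.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.StringJoiner;

/**
 * Hypothetical illustration of a "greedy" field: every metadata value is
 * appended to one string. This is NOT the MG4J implementation.
 */
public class GreedyFieldSketch {

    /**
     * Concatenates all non-empty metadata values, in the map's iteration
     * order, separated by a single space (the separator is an assumption).
     */
    public static String concatenate(Map<String, String[]> metadata) {
        StringJoiner joiner = new StringJoiner(" ");
        for (String[] values : metadata.values()) {
            for (String value : values) {
                // Skip missing or empty values so they add no stray separators.
                if (value != null && !value.isEmpty()) {
                    joiner.add(value);
                }
            }
        }
        return joiner.toString();
    }

    public static void main(String[] args) {
        // A plain map stands in for parser-produced metadata; a field may
        // carry several values, hence the String[] per name.
        Map<String, String[]> metadata = new LinkedHashMap<>();
        metadata.put("title", new String[] { "Annual Report" });
        metadata.put("creator", new String[] { "Alice", "Bob" });

        System.out.println(concatenate(metadata)); // prints "Annual Report Alice Bob"
    }
}
```

A single catch-all string of this kind loses the distinction between fields, but it lets a caller index or search whatever metadata a document happens to carry when the field names are not known in advance.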