MG4J: The Manual

Table of Contents

1. A Quick Tour of MG4J
    Building your first index
    Building a compressed collection
    More options
    Querying MG4J
    More sophisticated queries
    A semantic index
    A TREC index
2. Behind the scenes: The indexing process
    Preamble: terms, dictionaries and term-related maps
    Scan: Building batches
    Time/space requirements
    Combining batches
    Splitting indices
    Virtual fields in MG4J
    Virtual fields and virtual fragments
    Document resolvers
    What is a document resolver actually doing: virtual texts and gaps
    Payload-based indices
3. Performance
    Indexing Time
    Setting up the index structure
    Setup Time
    Query Time
4. Clusters & Partitioning
    Documental vs. Lexical
    Partitioning vs. Clustering
    Creating a Cluster
5. Accessing MG4J indices programmatically