|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object it.unimi.di.mg4j.search.score.AbstractScorer it.unimi.di.mg4j.search.score.AbstractWeightedScorer it.unimi.di.mg4j.search.score.TfIdfScorer
public class TfIdfScorer
A scorer that implements the TF/IDF ranking formula.
There are a number of incarnations with small variations of the formula itself. Here, the weight assigned to a term which appears in f documents out of a collection of N documents w.r.t. to a document of length l in which the term appears c times is
This class uses a CounterCollectionVisitor
and related classes to take into consideration only terms that are actually involved
in the current document.
Field Summary |
---|
Fields inherited from class it.unimi.di.mg4j.search.score.AbstractWeightedScorer |
---|
index2Weight |
Fields inherited from class it.unimi.di.mg4j.search.score.AbstractScorer |
---|
documentIterator, indexIterator |
Constructor Summary | |
---|---|
TfIdfScorer()
|
Method Summary | |
---|---|
TfIdfScorer |
copy()
|
double |
score()
Computes a score by calling Scorer.score(Index) for
each index in the current document iterator, and adding the weighted results. |
double |
score(Index index)
Returns a score for the current document of the last document iterator given to Scorer.wrap(DocumentIterator) , but
considering only a given index (optional operation). |
boolean |
usesIntervals()
Whether this scorer uses intervals. |
void |
wrap(DocumentIterator d)
Wraps the given document iterator. |
Methods inherited from class it.unimi.di.mg4j.search.score.AbstractWeightedScorer |
---|
getWeights, setWeights |
Methods inherited from class it.unimi.di.mg4j.search.score.AbstractScorer |
---|
nextDocument |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Methods inherited from interface it.unimi.di.mg4j.search.score.Scorer |
---|
getWeights, nextDocument, setWeights |
Constructor Detail |
---|
public TfIdfScorer()
Method Detail |
---|
public TfIdfScorer copy()
copy
in interface DelegatingScorer
copy
in interface Scorer
copy
in interface FlyweightPrototype<Scorer>
public double score() throws IOException
AbstractWeightedScorer
Scorer.score(Index)
for
each index in the current document iterator, and adding the weighted results.
score
in interface Scorer
score
in class AbstractWeightedScorer
IOException
public double score(Index index)
Scorer
Scorer.wrap(DocumentIterator)
, but
considering only a given index (optional operation).
score
in interface Scorer
index
- the only index to be considered.
public void wrap(DocumentIterator d) throws IOException
AbstractScorer
This method records internally the provided iterator.
wrap
in interface Scorer
wrap
in class AbstractWeightedScorer
d
- the document iterator that will be used in subsequent calls to
Scorer.score()
and Scorer.score(Index)
.
IOException
public boolean usesIntervals()
Scorer
This method is essential when aggregating scorers,
because if several scores need intervals, a CachingDocumentIterator
will be necessary.
usesIntervals
in interface Scorer
|
|||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |