Discussion :: Hadoop MCQs

  1. The tokens are passed through a Lucene ____________ to produce NGrams of the desired length.

    A. ShngleFil
    B. ShingleFilter
    C. SingleFilter
    D. Collfilter

    Answer : Option B

    Explanation :

    The tools in which the collocation identification algorithm is embedded either consume tokenized text as input or let you specify an implementation of the Lucene Analyzer class to perform tokenization. The resulting tokens are then passed through a Lucene ShingleFilter to form n-grams.
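
To make the idea of shingling concrete, here is a minimal pure-Python sketch of how a stream of tokens can be turned into word n-grams. It does not call Lucene and is not Lucene's implementation; the function name `shingles` and its `min_size` / `max_size` parameters are hypothetical and chosen only for illustration.

```python
def shingles(tokens, min_size=2, max_size=2):
    """Return word n-grams (shingles) of min_size..max_size tokens.

    Illustrative only: this mimics the general idea of a shingle
    filter and is not the Lucene API.
    """
    out = []
    for start in range(len(tokens)):
        for size in range(min_size, max_size + 1):
            end = start + size
            # Only emit n-grams that fit entirely inside the token list.
            if end <= len(tokens):
                out.append(" ".join(tokens[start:end]))
    return out


# Tokenization here is a plain whitespace split; in practice an
# analyzer would produce the tokens.
tokens = "the quick brown fox".split()
bigrams = shingles(tokens)
bi_and_trigrams = shingles(tokens, min_size=2, max_size=3)
```

With the four tokens above, `bigrams` holds the three adjacent word pairs, and `bi_and_trigrams` adds the two three-word sequences. Candidate collocations would then be scored from n-grams like these.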

