Lucene++ - a full-featured, c++ search engine
API Documentation


Loading...
Searching...
No Matches
Tokenizer.h
Go to the documentation of this file.
1
2// Copyright (c) 2009-2014 Alan Wright. All rights reserved.
3// Distributable under the terms of either the Apache License (Version 2.0)
4// or the GNU Lesser General Public License.
6
7#ifndef TOKENIZER_H
8#define TOKENIZER_H
9
10#include "TokenStream.h"
11
12namespace Lucene {
13
20class LPPAPI Tokenizer : public TokenStream {
21protected:
24
26 Tokenizer(const ReaderPtr& input);
27
30
32 Tokenizer(const AttributeFactoryPtr& factory, const ReaderPtr& input);
33
36
38 Tokenizer(const AttributeSourcePtr& source, const ReaderPtr& input);
39
40public:
41 virtual ~Tokenizer();
42
44
45protected:
49
50public:
52 virtual void close();
53
59 virtual int32_t correctOffset(int32_t currentOff);
60
61 using TokenStream::reset;
62
65 virtual void reset(const ReaderPtr& input);
66};
67
68}
69
70#endif
#define LUCENE_CLASS(Name)
Definition LuceneObject.h:24
A TokenStream enumerates the sequence of tokens, either from Fields of a Document or from query text.
Definition TokenStream.h:44
A Tokenizer is a TokenStream whose input is a Reader.
Definition Tokenizer.h:20
virtual ~Tokenizer()
CharStreamPtr charStream
Definition Tokenizer.h:48
Tokenizer()
Construct a tokenizer with null input.
virtual int32_t correctOffset(int32_t currentOff)
Return the corrected offset. If input is a CharStream subclass this method calls CharStream#correctOf...
virtual void close()
By default, closes the input Reader.
Tokenizer(const ReaderPtr &input)
Construct a token stream processing the given input.
Tokenizer(const AttributeSourcePtr &source)
Construct a token stream processing the given input using the given AttributeSource.
Tokenizer(const AttributeFactoryPtr &factory)
Construct a tokenizer with null input using the given AttributeFactory.
ReaderPtr input
The text source for this Tokenizer.
Definition Tokenizer.h:47
virtual void reset(const ReaderPtr &input)
Reset the tokenizer to a new reader. Typically, an analyzer (in its reusableTokenStream method) will ...
Tokenizer(const AttributeSourcePtr &source, const ReaderPtr &input)
Construct a token stream processing the given input using the given AttributeSource.
Tokenizer(const AttributeFactoryPtr &factory, const ReaderPtr &input)
Construct a token stream processing the given input using the given AttributeFactory.
Definition AbstractAllTermDocs.h:12
boost::shared_ptr< AttributeSource > AttributeSourcePtr
Definition LuceneTypes.h:520
boost::shared_ptr< Reader > ReaderPtr
Definition LuceneTypes.h:547
boost::shared_ptr< AttributeFactory > AttributeFactoryPtr
Definition LuceneTypes.h:519
boost::shared_ptr< CharStream > CharStreamPtr
Definition LuceneTypes.h:27

clucene.sourceforge.net