Lucene++ - a full-featured, c++ search engine
API Documentation


Loading...
Searching...
No Matches
LetterTokenizer.h
Go to the documentation of this file.
1
2// Copyright (c) 2009-2014 Alan Wright. All rights reserved.
3// Distributable under the terms of either the Apache License (Version 2.0)
4// or the GNU Lesser General Public License.
6
7#ifndef LETTERTOKENIZER_H
8#define LETTERTOKENIZER_H
9
10#include "CharTokenizer.h"
11
12namespace Lucene {
13
19class LPPAPI LetterTokenizer : public CharTokenizer {
20public:
23
25 LetterTokenizer(const AttributeSourcePtr& source, const ReaderPtr& input);
26
28 LetterTokenizer(const AttributeFactoryPtr& factory, const ReaderPtr& input);
29
31
33
34public:
36 virtual bool isTokenChar(wchar_t c);
37};
38
39}
40
41#endif
#define LUCENE_CLASS(Name)
Definition LuceneObject.h:24
An abstract base class for simple, character-oriented tokenizers.
Definition CharTokenizer.h:15
A LetterTokenizer is a tokenizer that divides text at non-letters. That's to say, it defines tokens a...
Definition LetterTokenizer.h:19
virtual bool isTokenChar(wchar_t c)
Collects only characters which satisfy UnicodeUtil::isAlpha(c).
LetterTokenizer(const ReaderPtr &input)
Construct a new LetterTokenizer.
LetterTokenizer(const AttributeSourcePtr &source, const ReaderPtr &input)
Construct a new LetterTokenizer using a given AttributeSource.
LetterTokenizer(const AttributeFactoryPtr &factory, const ReaderPtr &input)
Construct a new LetterTokenizer using a given AttributeFactory.
Definition AbstractAllTermDocs.h:12
boost::shared_ptr< AttributeSource > AttributeSourcePtr
Definition LuceneTypes.h:520
boost::shared_ptr< Reader > ReaderPtr
Definition LuceneTypes.h:547
boost::shared_ptr< AttributeFactory > AttributeFactoryPtr
Definition LuceneTypes.h:519

clucene.sourceforge.net