public class TdbTextSeparators
extends java.lang.Object
implements com.tietoenator.trip.jxp.internal.utils.DOMSink
Constructor and Description |
---|
TdbTextSeparators()
Constructor; creates a usable blank container
|
TdbTextSeparators(TdbTextSeparators src)
Copy constructor
|
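
The relationship between the blank constructor, the copy constructor, `clear()` and `copyFrom()` can be sketched with a stand-in class. `SeparatorsSketch` below is hypothetical: it borrows only method names listed on this page, and both its default values (empty string, `false`) and the way the copy constructor delegates to `copyFrom` are assumptions, not documented behaviour of `TdbTextSeparators`.

```java
// Hypothetical stand-in for TdbTextSeparators; NOT the TRIP JXP class itself.
public class SeparatorsSketch {
    private String sentenceEndChars;
    private boolean requiresSentenceCheck;

    // Blank container: assumed here to start from the cleared state.
    public SeparatorsSketch() {
        clear();
    }

    // Copy constructor: assumed here to delegate to copyFrom.
    public SeparatorsSketch(SeparatorsSketch src) {
        copyFrom(src);
    }

    // Removes any pre-existing state (defaults are an assumption).
    public void clear() {
        sentenceEndChars = "";
        requiresSentenceCheck = false;
    }

    // Copies member data from another instance.
    public void copyFrom(SeparatorsSketch src) {
        sentenceEndChars = src.sentenceEndChars;
        requiresSentenceCheck = src.requiresSentenceCheck;
    }

    public String getSentenceEndChars() { return sentenceEndChars; }
    public void setSentenceEndChars(String chars) { sentenceEndChars = chars; }
    public boolean getRequiresSentenceCheck() { return requiresSentenceCheck; }
    public void setRequiresSentenceCheck(boolean mode) { requiresSentenceCheck = mode; }

    public static void main(String[] args) {
        SeparatorsSketch original = new SeparatorsSketch();
        original.setRequiresSentenceCheck(true);
        original.setSentenceEndChars(".!?");

        // The copy holds its own state, so clearing the original leaves it intact.
        SeparatorsSketch copy = new SeparatorsSketch(original);
        original.clear();

        System.out.println(copy.getSentenceEndChars());
        System.out.println(original.getSentenceEndChars().isEmpty());
    }
}
```

In this sketch the copy is independent of its source, which is the usual reason to offer a copy constructor alongside `copyFrom`.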
Modifier and Type | Method and Description |
---|---|
void |
clear()
Initializes the instance, removes any pre-existing state
|
void |
copyFrom(TdbTextSeparators src)
Copy member data from another instance
|
java.lang.String |
getIgnoreChars()
Retrieve any characters that should be ignored whilst parsing sentence boundaries.
|
boolean |
getParagraphNeedsBeginOfSentence()
Check if paragraphs are only considered valid if they begin with a validly
formed start of sentence.
|
boolean |
getParagraphNeedsEndOfSentence()
Check if a paragraph is only considered to have completed if
that completion occurred along with a valid end of sentence.
|
java.lang.String |
getParagraphSeparatorSpec()
Retrieve the specification (in class terms) for the minimum boundary
between paragraphs.
|
boolean |
getRequiresParagraphCheck()
Check if this database uses paragraph parsing when indexing.
|
boolean |
getRequiresSentenceCheck()
Check if this database uses sentence parsing when indexing.
|
java.lang.String |
getSentenceBeginChars()
Retrieve any "special" characters that can constitute a valid introduction
to a new sentence.
|
java.lang.String |
getSentenceBeginSpec()
Retrieve the specification (in class terms) for the beginning of a
sentence, i.e. the character types that trigger the beginning of a
new sentence.
|
java.lang.String |
getSentenceEndChars()
Retrieve any characters that can constitute a valid ending to a sentence.
|
java.lang.String |
getSentenceSeparatorSpec()
Retrieve the specification (in class terms) for sentence separators,
i.e. character types that are skipped after the end of a sentence when
looking for the beginning of the next.
|
void |
setIgnoreChars(java.lang.String chars)
Establish any characters that should be ignored whilst parsing sentence boundaries.
|
void |
setParagraphNeedsBeginOfSentence(boolean mode)
Establish whether paragraphs are only to be considered valid if they begin with a validly
formed start of sentence.
|
void |
setParagraphNeedsEndOfSentence(boolean mode)
Establish whether a paragraph is only considered to have completed if
that completion occurred along with a valid end of sentence.
|
void |
setParagraphSeparatorSpec(java.lang.String spec)
Establish the specification (in class terms) for the minimum boundary
between paragraphs.
|
void |
setRequiresParagraphCheck(boolean mode)
Establish whether this database will use paragraph parsing when indexing.
|
void |
setRequiresSentenceCheck(boolean mode)
Establish whether this database is to use sentence parsing when indexing.
|
void |
setSentenceBeginChars(java.lang.String chars)
Establish any "special" characters that can constitute a valid introduction
to a new sentence.
|
void |
setSentenceBeginSpec(java.lang.String spec)
Establish the specification (in class terms) for the beginning of a
sentence, i.e. the character types that trigger the beginning of a
new sentence.
|
void |
setSentenceEndChars(java.lang.String chars)
Establish any characters that can constitute a valid ending to a sentence.
|
void |
setSentenceSeparatorSpec(java.lang.String spec)
Establish the specification (in class terms) for sentence separators,
i.e. character types that are skipped after the end of a sentence when
looking for the beginning of the next.
|
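
To make the roles of the sentence-ending characters and the ignored characters more concrete, here is a toy sentence splitter. It is an illustration only and does not reproduce TRIP's indexing algorithm. The class `SentenceSplitSketch`, its `split` method, and the rule that a boundary needs trailing whitespace (a crude stand-in for a sentence separator specification) are all assumptions made for this sketch. Ignored characters are interpreted here as characters that may follow a sentence-ending character without cancelling the boundary.

```java
import java.util.ArrayList;
import java.util.List;

// Toy illustration; NOT the algorithm used by TRIP when indexing.
public class SentenceSplitSketch {

    // endChars: characters that may end a sentence (cf. getSentenceEndChars).
    // ignoreChars: characters skipped when testing for a boundary (cf. getIgnoreChars).
    static List<String> split(String text, String endChars, String ignoreChars) {
        List<String> sentences = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            char c = text.charAt(i++);
            current.append(c);
            if (endChars.indexOf(c) < 0) {
                continue; // not a sentence-ending character
            }
            // Absorb ignored characters (for example a closing quote) into the sentence.
            while (i < text.length() && ignoreChars.indexOf(text.charAt(i)) >= 0) {
                current.append(text.charAt(i++));
            }
            // Assumed separator rule: whitespace or end of text must follow.
            if (i == text.length() || Character.isWhitespace(text.charAt(i))) {
                sentences.add(current.toString().trim());
                current.setLength(0);
            }
        }
        String rest = current.toString().trim();
        if (!rest.isEmpty()) {
            sentences.add(rest); // trailing text without a sentence-ending character
        }
        return sentences;
    }

    public static void main(String[] args) {
        String text = "He said \"stop.\" Then he left. 3.5 is a number";
        // With the quote ignored, the first full stop still closes a sentence.
        System.out.println(split(text, ".!?", "\""));
        // Without ignored characters, the quote blocks that first boundary.
        System.out.println(split(text, ".!?", ""));
    }
}
```

Because the full stop in `3.5` is not followed by whitespace, the sketch does not treat it as a sentence boundary.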
public TdbTextSeparators()

public TdbTextSeparators(TdbTextSeparators src)
src - Source from which to copy

public boolean getRequiresSentenceCheck()

public void setRequiresSentenceCheck(boolean mode)
mode - true if sentence parsing is to be used

public java.lang.String getSentenceBeginSpec()
See also: getSentenceBeginChars()

public void setSentenceBeginSpec(java.lang.String spec)
See also: setSentenceBeginChars(java.lang.String)
spec - Sentence begin specification

public java.lang.String getSentenceBeginChars()

public void setSentenceBeginChars(java.lang.String chars)
chars - Special sentence introductory characters

public java.lang.String getSentenceSeparatorSpec()

public void setSentenceSeparatorSpec(java.lang.String spec)
spec - Sentence separation specification

public java.lang.String getSentenceEndChars()

public void setSentenceEndChars(java.lang.String chars)
chars - Sentence ending characters

public java.lang.String getIgnoreChars()

public void setIgnoreChars(java.lang.String chars)
chars - Ignored characters

public boolean getRequiresParagraphCheck()

public void setRequiresParagraphCheck(boolean mode)
mode - true if the database is to parse paragraphs

public boolean getParagraphNeedsBeginOfSentence()

public void setParagraphNeedsBeginOfSentence(boolean mode)
mode - true if a valid start of sentence is required to start a new paragraph

public java.lang.String getParagraphSeparatorSpec()

public void setParagraphSeparatorSpec(java.lang.String spec)
spec - Paragraph separator spec

public boolean getParagraphNeedsEndOfSentence()

public void setParagraphNeedsEndOfSentence(boolean mode)
mode - true if paragraph endings require valid sentence endings

public void clear()

public void copyFrom(TdbTextSeparators src)
src - Source of copy operation