getSnowballWordSet

Reads stopwords from a stopword list in Snowball format.

The Snowball format is as follows:

  • Lines may contain multiple words separated by whitespace.

  • The comment character is the vertical line (|).

  • Lines may contain trailing comments.
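The three rules above can be sketched in plain Java. This is only an illustration of the format, not the library's implementation: the class name `SnowballFormatSketch` and the method `parse` are hypothetical, and a `Set<String>` stands in for `CharArraySet` so the example has no dependencies.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper illustrating the Snowball stopword format rules.
class SnowballFormatSketch {

    // Collects the words of a Snowball-format list into a plain Set<String>.
    static Set<String> parse(Reader reader) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        try (BufferedReader br = new BufferedReader(reader)) {
            String line;
            while ((line = br.readLine()) != null) {
                // Everything from the first '|' to the end of the line is a comment.
                int comment = line.indexOf('|');
                if (comment >= 0) {
                    line = line.substring(0, comment);
                }
                // A line may hold several words separated by whitespace.
                for (String word : line.trim().split("\\s+")) {
                    if (!word.isEmpty()) {
                        words.add(word);
                    }
                }
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        String list = "| a comment-only line\n"
                + "the and   | two words, then a trailing comment\n"
                + "\n"
                + "of\n";
        System.out.println(parse(new StringReader(list))); // prints [the, and, of]
    }
}
```

Comment-only and blank lines contribute no words, so they can be used freely to annotate or group entries in a list.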

Return

The given CharArraySet, filled with the reader's words

Parameters

reader

Reader containing a Snowball stopword list

result

The CharArraySet to fill with the reader's words


Reads stopwords from a stopword list in Snowball format.

The Snowball format is as follows:

  • Lines may contain multiple words separated by whitespace.

  • The comment character is the vertical line (|).

  • Lines may contain trailing comments.

Return

An unmodifiable CharArraySet with the reader's words

Parameters

reader

Reader containing a Snowball stopword list


Reads stopwords from a stopword list in Snowball format.

The Snowball format is as follows:

  • Lines may contain multiple words separated by whitespace.

  • The comment character is the vertical line (|).

  • Lines may contain trailing comments.

Return

An unmodifiable CharArraySet with the stream's words

Parameters

stream

InputStream in UTF-8 encoding containing a Snowball stopword list


Reads stopwords from a stopword list in Snowball format.

The Snowball format is as follows:

  • Lines may contain multiple words separated by whitespace.

  • The comment character is the vertical line (|).

  • Lines may contain trailing comments.

Return

An unmodifiable CharArraySet with the stream's words

Parameters

stream

InputStream containing a Snowball stopword list

charset

Charset of the stopword list
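The charset matters because the stream carries raw bytes, and the same bytes can decode to different words. The sketch below shows this with standard JDK classes only. The class `CharsetDecodingSketch` and the method `firstLine` are hypothetical names used for illustration and are not part of the documented API.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical helper showing how the charset affects the decoded words.
class CharsetDecodingSketch {

    // Decodes the stream with the given charset and returns its first line.
    static String firstLine(InputStream stream, Charset charset) throws IOException {
        try (BufferedReader br = new BufferedReader(new InputStreamReader(stream, charset))) {
            return br.readLine();
        }
    }

    public static void main(String[] args) throws IOException {
        // The German stopword "für", encoded as ISO-8859-1 bytes.
        byte[] latin1 = "f\u00fcr".getBytes(StandardCharsets.ISO_8859_1);

        String right = firstLine(new ByteArrayInputStream(latin1), StandardCharsets.ISO_8859_1);
        String wrong = firstLine(new ByteArrayInputStream(latin1), StandardCharsets.UTF_8);

        System.out.println(right.equals("f\u00fcr")); // true: the word is decoded correctly
        System.out.println(wrong.equals("f\u00fcr")); // false: the byte for "ü" is not valid UTF-8
    }
}
```

A stopword decoded with the wrong charset will not match the tokens produced at analysis time, so the charset passed here should be the one the list was actually saved in.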