SimpleCxxLib
packageclass TokenScanner
TokenScanner
class is illustrated by the
following pattern, which reads the tokens in the string variable
input
:
TokenScanner scanner(input); while (scanner.hasMoreTokens()) { string token = scanner.nextToken(); ... process the token ... }The
TokenScanner
class exports several additional methods
that give clients more control over its behavior. Those methods are
described individually in the documentation.
Constructor | |
TokenScanner(str) TokenScanner(infile) | Initializes a scanner object. |
Methods | |
Defines a new multicharacter operator. | |
Adds the characters in str to the set of characters legal in a WORD token. | |
Reads the next character from the scanner input stream. | |
Returns the current position of the scanner in the input stream. | |
Returns the string value of a token. | |
Returns the type of this token. | |
Returns true if there are additional tokens for this scanner to read. | |
Tells the scanner to ignore comments. | |
Tells the scanner to ignore whitespace characters. | |
Returns true if the character is valid in a word. | |
Returns the next token from this scanner. | |
Pushes the specified token back into this scanner's input stream. | |
Controls how the scanner treats tokens that begin with a digit. | |
Controls how the scanner treats tokens enclosed in quotation marks. | |
setInput(infile) | Sets the token stream for this scanner to the specified string or input stream. |
Pushes the character ch back into the scanner stream. | |
Reads the next token and makes sure it matches the string expected . |
TokenScanner(); TokenScanner(string str); TokenScanner(istream & infile);
Usage:
TokenScanner scanner; TokenScanner scanner(str); TokenScanner scanner(infile);
void setInput(string str); void setInput(istream & infile);
Usage:
scanner.setInput(str); scanner.setInput(infile);
bool hasMoreTokens();
true
if there are additional tokens for this
scanner to read.
Usage:
if (scanner.hasMoreTokens()) ...
string nextToken();
nextToken
is called when no tokens are available, it returns the empty string.
Usage:
token = scanner.nextToken();
void saveToken(string token);
nextToken
, the scanner will return
the saved token without reading any additional characters from the
token stream.
Usage:
scanner.saveToken(token);
int getPosition() const;
saveToken
has been called, this position corresponds
to the beginning of the saved token. If saveToken
is
called more than once, getPosition
returns -1.
Usage:
int pos = scanner.getPosition();
void ignoreWhitespace();
nextToken
method treats whitespace characters
(typically spaces and tabs) just like any other punctuation mark
and returns them as single-character tokens.
Calling
scanner.ignoreWhitespace();changes this behavior so that the scanner ignores whitespace characters.
Usage:
scanner.ignoreWhitespace();
void ignoreComments();
scanner.ignoreComments();sets the parser to ignore comments.
Usage:
scanner.ignoreComments();
void scanNumbers();
nextToken
method treats numbers and letters
identically and therefore does not provide any special processing for
numbers. Calling
scanner.scanNumbers();changes this behavior so that
nextToken
returns the
longest substring that can be interpreted as a real number.
Usage:
scanner.scanNumbers();
void scanStrings();
scanner.scanStrings();changes this assumption so that
nextToken
returns a single
token consisting of all characters through the matching quotation mark.
The quotation marks are returned as part of the scanned token so that
clients can differentiate strings from other token types.
Usage:
scanner.scanStrings();
void addWordCharacters(string str);
str
to the set of characters
legal in a WORD
token. For example, calling
addWordCharacters("_")
adds the underscore to the
set of characters that are accepted as part of a word.
Usage:
scanner.addWordCharacters(str);
bool isWordCharacter(char ch) const;
true
if the character is valid in a word.
Usage:
if (scanner.isWordCharacter(ch)) ...
void addOperator(string op);
nextToken
when the input stream contains operator
characters, the scanner returns the longest possible operator
string that can be read at that point.
Usage:
scanner.addOperator(op);
void verifyToken(string expected);
expected
. If it does not, verifyToken
throws an error.
Usage:
scanner.verifyToken(expected);
TokenType getTokenType(string token) const;
EOF
,
SEPARATOR
, WORD
, NUMBER
,
STRING
, or OPERATOR
.
Usage:
TokenType type = scanner.getTokenType(token);
int getChar();
Usage:
int ch = scanner.getChar();
void ungetChar(int ch);
ch
back into the scanner stream.
The character must match the one that was read.
Usage:
scanner.ungetChar(ch);
string getStringValue(string token) const;
Usage:
string str = scanner.getStringValue(token);