1996-12-10 09:06:26 +03:00
|
|
|
Changes between release 2.5.4 (11Sep96) and release 2.5.3:
|
|
|
|
|
|
|
|
- Fixed a bug introduced in 2.5.3 that blew it when a call
|
|
|
|
to input() occurred at the end of an input file.
|
|
|
|
|
|
|
|
- Fixed scanner skeleton so the example in the man page of
|
|
|
|
scanning strings using exclusive start conditions works.
|
|
|
|
|
|
|
|
- Minor Makefile tweaks.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.5.3 (29May96) and release 2.5.2:
|
|
|
|
|
|
|
|
- Some serious bugs in yymore() have been fixed. In particular,
|
|
|
|
when using AT&T-lex-compatibility or %array, you can intermix
|
|
|
|
calls to input(), unput(), and yymore(). (This still doesn't
|
|
|
|
work for %pointer, and isn't likely to in the future.)
|
|
|
|
|
|
|
|
- A bug in handling NUL's in the input stream of scanners using
|
|
|
|
REJECT has been fixed.
|
|
|
|
|
|
|
|
- The default main() in libfl.a now repeatedly calls yylex() until
|
|
|
|
it returns 0, rather than just calling it once.
|
|
|
|
|
|
|
|
- Minor tweak for Windows NT Makefile, MISC/NT/Makefile.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.5.2 (25Apr95) and release 2.5.1:
|
|
|
|
|
|
|
|
- The --prefix configuration option now works.
|
|
|
|
|
|
|
|
- A bug that completely broke the "-Cf" table compression
|
|
|
|
option has been fixed.
|
|
|
|
|
|
|
|
- A major headache involving "const" declarators and Solaris
|
|
|
|
systems has been fixed.
|
|
|
|
|
|
|
|
- An octal escape sequence in a flex regular expression must
|
|
|
|
now contain only the digits 0-7.
|
|
|
|
|
|
|
|
- You can now use "--" on the flex command line to mark the
|
|
|
|
end of flex options.
|
|
|
|
|
|
|
|
- You can now specify the filename '-' as a synonym for stdin.
|
|
|
|
|
|
|
|
- By default, the scanners generated by flex no longer
|
|
|
|
statically initialize yyin and yyout to stdin and stdout.
|
|
|
|
This change is necessary because in some ANSI environments,
|
|
|
|
stdin and stdout are not compile-time constant. You can
|
|
|
|
force the initialization using "%option stdinit" in the first
|
|
|
|
section of your flex input.
|
|
|
|
|
|
|
|
- "%option nounput" now correctly omits the unput() routine
|
|
|
|
from the output.
|
|
|
|
|
|
|
|
- "make clean" now removes config.log, config.cache, and the
|
|
|
|
flex binary. The fact that it removes the flex binary means
|
|
|
|
you should take care if making changes to scan.l, to make
|
|
|
|
sure you don't wind up in a bootstrap problem.
|
|
|
|
|
|
|
|
- In general, the Makefile has been reworked somewhat (thanks
|
|
|
|
to Francois Pinard) for added flexibility - more changes will
|
|
|
|
follow in subsequent releases.
|
|
|
|
|
|
|
|
- The .texi and .info files in MISC/texinfo/ have been updated,
|
|
|
|
thanks also to Francois Pinard.
|
|
|
|
|
2003-11-19 00:37:39 +03:00
|
|
|
- The FlexLexer::yylex(std::istream* new_in, std::ostream* new_out) method
|
1996-12-10 09:06:26 +03:00
|
|
|
now does not have a default for the first argument, to disambiguate
|
|
|
|
it from FlexLexer::yylex().
|
|
|
|
|
|
|
|
- A bug in destructing a FlexLexer object before doing any scanning
|
|
|
|
with it has been fixed.
|
|
|
|
|
|
|
|
- A problem with including FlexLexer.h multiple times has been fixed.
|
|
|
|
|
|
|
|
- The alloca() chud necessary to accommodate bison has grown
|
|
|
|
even uglier, but hopefully more correct.
|
|
|
|
|
|
|
|
- A portability tweak has been added to accommodate compilers that
|
|
|
|
use char* generic pointers.
|
|
|
|
|
|
|
|
- EBCDIC contact information in the file MISC/EBCDIC has been updated.
|
|
|
|
|
|
|
|
- An OS/2 Makefile and config.h for flex 2.5 is now available in
|
|
|
|
MISC/OS2/, contributed by Kai Uwe Rommel.
|
|
|
|
|
|
|
|
- The descrip.mms file for building flex under VMS has been updated,
|
|
|
|
thanks to Pat Rankin.
|
|
|
|
|
|
|
|
- The notes on building flex for the Amiga have been updated for
|
|
|
|
flex 2.5, contributed by Andreas Scherer.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.5.1 (28Mar95) and release 2.4.7:
|
|
|
|
|
|
|
|
- A new concept of "start condition" scope has been introduced.
|
|
|
|
A start condition scope is begun with:
|
|
|
|
|
|
|
|
<SCs>{
|
|
|
|
|
|
|
|
where SCs is a list of one or more start conditions. Inside
|
|
|
|
the start condition scope, every rule automatically has the
|
|
|
|
prefix <SCs> applied to it, until a '}' which matches the
|
|
|
|
initial '{'. So, for example:
|
|
|
|
|
|
|
|
<ESC>{
|
|
|
|
"\\n" return '\n';
|
|
|
|
"\\r" return '\r';
|
|
|
|
"\\f" return '\f';
|
|
|
|
"\\0" return '\0';
|
|
|
|
}
|
|
|
|
|
|
|
|
is equivalent to:
|
|
|
|
|
|
|
|
<ESC>"\\n" return '\n';
|
|
|
|
<ESC>"\\r" return '\r';
|
|
|
|
<ESC>"\\f" return '\f';
|
|
|
|
<ESC>"\\0" return '\0';
|
|
|
|
|
|
|
|
As indicated in this example, rules inside start condition scopes
|
|
|
|
(and any rule, actually, other than the first) can be indented,
|
|
|
|
to better show the extent of the scope.
|
|
|
|
|
|
|
|
Start condition scopes may be nested.
|
|
|
|
|
|
|
|
- The new %option directive can be used in the first section of
|
|
|
|
a flex scanner to control scanner-generation options. Most
|
|
|
|
options are given simply as names, optionally preceded by the
|
|
|
|
word "no" (with no intervening whitespace) to negate their
|
|
|
|
meaning. Some are equivalent to flex flags, so putting them
|
|
|
|
in your scanner source is equivalent to always specifying
|
|
|
|
the flag (%option's take precedence over flags):
|
|
|
|
|
|
|
|
7bit -7 option
|
|
|
|
8bit -8 option
|
|
|
|
align -Ca option
|
|
|
|
backup -b option
|
|
|
|
batch -B option
|
|
|
|
c++ -+ option
|
|
|
|
caseful opposite of -i option (caseful is the default);
|
|
|
|
case-sensitive same as above
|
|
|
|
caseless -i option;
|
|
|
|
case-insensitive same as above
|
|
|
|
debug -d option
|
|
|
|
default opposite of -s option
|
|
|
|
ecs -Ce option
|
|
|
|
fast -F option
|
|
|
|
full -f option
|
|
|
|
interactive -I option
|
|
|
|
lex-compat -l option
|
|
|
|
meta-ecs -Cm option
|
|
|
|
perf-report -p option
|
|
|
|
read -Cr option
|
|
|
|
stdout -t option
|
|
|
|
verbose -v option
|
|
|
|
warn opposite of -w option (so use "%option nowarn" for -w)
|
|
|
|
|
|
|
|
array equivalent to "%array"
|
|
|
|
pointer equivalent to "%pointer" (default)
|
|
|
|
|
|
|
|
Some provide new features:
|
|
|
|
|
|
|
|
always-interactive generate a scanner which always
|
|
|
|
considers its input "interactive" (no call to isatty()
|
|
|
|
will be made when the scanner runs)
|
|
|
|
main supply a main program for the scanner, which
|
|
|
|
simply calls yylex(). Implies %option noyywrap.
|
|
|
|
never-interactive generate a scanner which never
|
|
|
|
considers its input "interactive" (no call to isatty()
|
|
|
|
will be made when the scanner runs)
|
|
|
|
stack if set, enable start condition stacks (see below)
|
|
|
|
stdinit if unset ("%option nostdinit"), initialize yyin
|
|
|
|
and yyout statically to nil FILE* pointers, instead
|
|
|
|
of stdin and stdout
|
|
|
|
yylineno if set, keep track of the current line
|
|
|
|
number in global yylineno (this option is expensive
|
|
|
|
in terms of performance). The line number is available
|
|
|
|
to C++ scanning objects via the new member function
|
|
|
|
lineno().
|
|
|
|
yywrap if unset ("%option noyywrap"), scanner does not
|
|
|
|
call yywrap() upon EOF but simply assumes there
|
|
|
|
are no more files to scan
|
|
|
|
|
|
|
|
Flex scans your rule actions to determine whether you use the
|
|
|
|
REJECT or yymore features (this is not new). Two %options can be
|
|
|
|
used to override its decision, either by setting them to indicate
|
|
|
|
the feature is indeed used, or unsetting them to indicate it
|
|
|
|
actually is not used:
|
|
|
|
|
|
|
|
reject
|
|
|
|
yymore
|
|
|
|
|
|
|
|
Three %option's take string-delimited values, offset with '=':
|
|
|
|
|
|
|
|
outfile="<name>" equivalent to -o<name>
|
|
|
|
prefix="<name>" equivalent to -P<name>
|
|
|
|
yyclass="<name>" set the name of the C++ scanning class
|
|
|
|
(see below)
|
|
|
|
|
|
|
|
A number of %option's are available for lint purists who
|
|
|
|
want to suppress the appearance of unneeded routines in
|
|
|
|
the generated scanner. Each of the following, if unset,
|
|
|
|
results in the corresponding routine not appearing in the
|
|
|
|
generated scanner:
|
|
|
|
|
|
|
|
input, unput
|
|
|
|
yy_push_state, yy_pop_state, yy_top_state
|
|
|
|
yy_scan_buffer, yy_scan_bytes, yy_scan_string
|
|
|
|
|
|
|
|
You can specify multiple options with a single %option directive,
|
|
|
|
and multiple directives in the first section of your flex input file.
|
|
|
|
|
|
|
|
- The new function:
|
|
|
|
|
|
|
|
YY_BUFFER_STATE yy_scan_string( const char *str )
|
|
|
|
|
|
|
|
returns a YY_BUFFER_STATE (which also becomes the current input
|
|
|
|
buffer) for scanning the given string, which occurs starting
|
|
|
|
with the next call to yylex(). The string must be NUL-terminated.
|
|
|
|
A related function:
|
|
|
|
|
|
|
|
YY_BUFFER_STATE yy_scan_bytes( const char *bytes, int len )
|
|
|
|
|
|
|
|
creates a buffer for scanning "len" bytes (including possibly NUL's)
|
|
|
|
starting at location "bytes".
|
|
|
|
|
|
|
|
Note that both of these functions create and scan a *copy* of
|
|
|
|
the string/bytes. (This may be desirable, since yylex() modifies
|
|
|
|
the contents of the buffer it is scanning.) You can avoid the
|
|
|
|
copy by using:
|
|
|
|
|
|
|
|
YY_BUFFER_STATE yy_scan_buffer( char *base, yy_size_t size )
|
|
|
|
|
|
|
|
which scans in place the buffer starting at "base", consisting
|
|
|
|
of "size" bytes, the last two bytes of which *must* be
|
|
|
|
YY_END_OF_BUFFER_CHAR (these bytes are not scanned; thus, scanning
|
|
|
|
consists of base[0] through base[size-2], inclusive). If you
|
|
|
|
fail to set up "base" in this manner, yy_scan_buffer returns a
|
|
|
|
nil pointer instead of creating a new input buffer.
|
|
|
|
|
|
|
|
The type yy_size_t is an integral type to which you can cast
|
|
|
|
an integer expression reflecting the size of the buffer.
|
|
|
|
|
|
|
|
- Three new routines are available for manipulating stacks of
|
|
|
|
start conditions:
|
|
|
|
|
|
|
|
void yy_push_state( int new_state )
|
|
|
|
|
|
|
|
pushes the current start condition onto the top of the stack
|
|
|
|
and BEGIN's "new_state" (recall that start condition names are
|
|
|
|
also integers).
|
|
|
|
|
|
|
|
void yy_pop_state()
|
|
|
|
|
|
|
|
pops the top of the stack and BEGIN's to it, and
|
|
|
|
|
|
|
|
int yy_top_state()
|
|
|
|
|
|
|
|
returns the top of the stack without altering the stack's
|
|
|
|
contents.
|
|
|
|
|
|
|
|
The start condition stack grows dynamically and so has no built-in
|
|
|
|
size limitation. If memory is exhausted, program execution
|
|
|
|
is aborted.
|
|
|
|
|
|
|
|
To use start condition stacks, your scanner must include
|
|
|
|
a "%option stack" directive.
|
|
|
|
|
|
|
|
- flex now supports POSIX character class expressions. These
|
|
|
|
are expressions enclosed inside "[:" and ":]" delimiters (which
|
|
|
|
themselves must appear between the '[' and ']' of a character
|
|
|
|
class; other elements may occur inside the character class, too).
|
|
|
|
The expressions flex recognizes are:
|
|
|
|
|
|
|
|
[:alnum:] [:alpha:] [:blank:] [:cntrl:] [:digit:] [:graph:]
|
|
|
|
[:lower:] [:print:] [:punct:] [:space:] [:upper:] [:xdigit:]
|
|
|
|
|
|
|
|
These expressions all designate a set of characters equivalent to
|
|
|
|
the corresponding isXXX function (for example, [:alnum:] designates
|
|
|
|
those characters for which isalnum() returns true - i.e., any
|
|
|
|
alphabetic or numeric). Some systems don't provide isblank(),
|
|
|
|
so flex defines [:blank:] as a blank or a tab.
|
|
|
|
|
|
|
|
For example, the following character classes are all equivalent:
|
|
|
|
|
|
|
|
[[:alnum:]]
|
|
|
|
[[:alpha:][:digit:]
|
|
|
|
[[:alpha:]0-9]
|
|
|
|
[a-zA-Z0-9]
|
|
|
|
|
|
|
|
If your scanner is case-insensitive (-i flag), then [:upper:]
|
|
|
|
and [:lower:] are equivalent to [:alpha:].
|
|
|
|
|
|
|
|
- The promised rewrite of the C++ FlexLexer class has not yet
|
|
|
|
been done. Support for FlexLexer is limited at the moment to
|
|
|
|
fixing show-stopper bugs, so, for example, the new functions
|
|
|
|
yy_scan_string() & friends are not available to FlexLexer
|
|
|
|
objects.
|
|
|
|
|
|
|
|
- The new macro
|
|
|
|
|
|
|
|
yy_set_interactive(is_interactive)
|
|
|
|
|
|
|
|
can be used to control whether the current buffer is considered
|
|
|
|
"interactive". An interactive buffer is processed more slowly,
|
|
|
|
but must be used when the scanner's input source is indeed
|
|
|
|
interactive to avoid problems due to waiting to fill buffers
|
|
|
|
(see the discussion of the -I flag in flex.1). A non-zero value
|
|
|
|
in the macro invocation marks the buffer as interactive, a zero
|
|
|
|
value as non-interactive. Note that use of this macro overrides
|
|
|
|
"%option always-interactive" or "%option never-interactive".
|
|
|
|
|
|
|
|
yy_set_interactive() must be invoked prior to beginning to
|
|
|
|
scan the buffer.
|
|
|
|
|
|
|
|
- The new macro
|
|
|
|
|
|
|
|
yy_set_bol(at_bol)
|
|
|
|
|
|
|
|
can be used to control whether the current buffer's scanning
|
|
|
|
context for the next token match is done as though at the
|
|
|
|
beginning of a line (non-zero macro argument; makes '^' anchored
|
|
|
|
rules active) or not at the beginning of a line (zero argument,
|
|
|
|
'^' rules inactive).
|
|
|
|
|
|
|
|
- Related to this change, the mechanism for determining when a scan is
|
|
|
|
starting at the beginning of a line has changed. It used to be
|
|
|
|
that '^' was active iff the character prior to that at which the
|
|
|
|
scan started was a newline. The mechanism now is that '^' is
|
|
|
|
active iff the last token ended in a newline (or the last call to
|
|
|
|
input() returned a newline). For most users, the difference in
|
|
|
|
mechanisms is negligible. Where it will make a difference,
|
|
|
|
however, is if unput() or yyless() is used to alter the input
|
|
|
|
stream. When in doubt, use yy_set_bol().
|
|
|
|
|
|
|
|
- The new beginning-of-line mechanism involved changing some fairly
|
|
|
|
twisted code, so it may have introduced bugs - beware ...
|
|
|
|
|
|
|
|
- The macro YY_AT_BOL() returns true if the next token scanned from
|
|
|
|
the current buffer will have '^' rules active, false otherwise.
|
|
|
|
|
|
|
|
- The new function
|
|
|
|
|
|
|
|
void yy_flush_buffer( struct yy_buffer_state* b )
|
|
|
|
|
|
|
|
flushes the contents of the current buffer (i.e., next time
|
|
|
|
the scanner attempts to match a token using b as the current
|
|
|
|
buffer, it will begin by invoking YY_INPUT to fill the buffer).
|
|
|
|
This routine is also available to C++ scanners (unlike some
|
|
|
|
of the other new routines).
|
|
|
|
|
|
|
|
The related macro
|
|
|
|
|
|
|
|
YY_FLUSH_BUFFER
|
|
|
|
|
|
|
|
flushes the contents of the current buffer.
|
|
|
|
|
|
|
|
- A new "-ooutput" option writes the generated scanner to "output".
|
|
|
|
If used with -t, the scanner is still written to stdout, but
|
|
|
|
its internal #line directives (see previous item) use "output".
|
|
|
|
|
|
|
|
- Flex now generates #line directives relating the code it
|
|
|
|
produces to the output file; this means that error messages
|
|
|
|
in the flex-generated code should be correctly pinpointed.
|
|
|
|
|
|
|
|
- When generating #line directives, filenames with embedded '\'s
|
|
|
|
have those characters escaped (i.e., turned into '\\'). This
|
|
|
|
feature helps with reporting filenames for some MS-DOS and OS/2
|
|
|
|
systems.
|
|
|
|
|
|
|
|
- The FlexLexer class includes two new public member functions:
|
|
|
|
|
2003-11-19 00:37:39 +03:00
|
|
|
virtual void switch_streams( std::istream* new_in = 0,
|
|
|
|
std::ostream* new_out = 0 )
|
1996-12-10 09:06:26 +03:00
|
|
|
|
|
|
|
reassigns yyin to new_in (if non-nil) and yyout to new_out
|
|
|
|
(ditto), deleting the previous input buffer if yyin is
|
|
|
|
reassigned. It is used by:
|
|
|
|
|
2003-11-19 00:37:39 +03:00
|
|
|
int yylex( std::istream* new_in = 0, std::ostream* new_out = 0 )
|
1996-12-10 09:06:26 +03:00
|
|
|
|
|
|
|
which first calls switch_streams() and then returns the value
|
|
|
|
of calling yylex().
|
|
|
|
|
|
|
|
- C++ scanners now have yy_flex_debug as a member variable of
|
|
|
|
FlexLexer rather than a global, and member functions for testing
|
|
|
|
and setting it.
|
|
|
|
|
|
|
|
- When generating a C++ scanning class, you can now use
|
|
|
|
|
|
|
|
%option yyclass="foo"
|
|
|
|
|
|
|
|
to inform flex that you have derived "foo" as a subclass of
|
|
|
|
yyFlexLexer, so flex will place your actions in the member
|
|
|
|
function foo::yylex() instead of yyFlexLexer::yylex(). It also
|
|
|
|
generates a yyFlexLexer::yylex() member function that generates a
|
|
|
|
run-time error if called (by invoking yyFlexLexer::LexerError()).
|
|
|
|
This feature is necessary if your subclass "foo" introduces some
|
|
|
|
additional member functions or variables that you need to access
|
|
|
|
from yylex().
|
|
|
|
|
|
|
|
- Current texinfo files in MISC/texinfo, contributed by Francois
|
|
|
|
Pinard.
|
|
|
|
|
|
|
|
- You can now change the name "flex" to something else (e.g., "lex")
|
|
|
|
by redefining $(FLEX) in the Makefile.
|
|
|
|
|
|
|
|
- Two bugs (one serious) that could cause "bigcheck" to fail have
|
|
|
|
been fixed.
|
|
|
|
|
|
|
|
- A number of portability/configuration changes have been made
|
|
|
|
for easier portability.
|
|
|
|
|
|
|
|
- You can use "YYSTATE" in your scanner as an alias for YY_START
|
|
|
|
(for AT&T lex compatibility).
|
|
|
|
|
|
|
|
- input() now maintains yylineno.
|
|
|
|
|
|
|
|
- input() no longer trashes yytext.
|
|
|
|
|
|
|
|
- interactive scanners now read characters in YY_INPUT up to a
|
|
|
|
newline, a large performance gain.
|
|
|
|
|
|
|
|
- C++ scanner objects now work with the -P option. You include
|
|
|
|
<FlexLexer.h> once per scanner - see comments in <FlexLexer.h>
|
|
|
|
(or flex.1) for details.
|
|
|
|
|
|
|
|
- C++ FlexLexer objects now use the "cerr" stream to report -d output
|
|
|
|
instead of stdio.
|
|
|
|
|
|
|
|
- The -c flag now has its full glorious POSIX interpretation (do
|
|
|
|
nothing), rather than being interpreted as an old-style -C flag.
|
|
|
|
|
|
|
|
- Scanners generated by flex now include two #define's giving
|
|
|
|
the major and minor version numbers (YY_FLEX_MAJOR_VERSION,
|
|
|
|
YY_FLEX_MINOR_VERSION). These can then be tested to see
|
|
|
|
whether certain flex features are available.
|
|
|
|
|
|
|
|
- Scanners generated using -l lex compatibility now have the symbol
|
|
|
|
YY_FLEX_LEX_COMPAT #define'd.
|
|
|
|
|
|
|
|
- When initializing (i.e., yy_init is non-zero on entry to yylex()),
|
|
|
|
generated scanners now set yy_init to zero before executing
|
|
|
|
YY_USER_INIT. This means that you can set yy_init back to a
|
|
|
|
non-zero value in YY_USER_INIT if you need the scanner to be
|
|
|
|
reinitialized on the next call.
|
|
|
|
|
|
|
|
- You can now use "#line" directives in the first section of your
|
|
|
|
scanner specification.
|
|
|
|
|
|
|
|
- When generating full-table scanners (-Cf), flex now puts braces
|
|
|
|
around each row of the 2-d array initialization, to silence warnings
|
|
|
|
on over-zealous compilers.
|
|
|
|
|
|
|
|
- Improved support for MS-DOS. The flex sources have been successfully
|
|
|
|
built, unmodified, for Borland 4.02 (all that's required is a
|
|
|
|
Borland Makefile and config.h file, which are supplied in
|
|
|
|
MISC/Borland - contributed by Terrence O Kane).
|
|
|
|
|
|
|
|
- Improved support for Macintosh using Think C - the sources should
|
|
|
|
build for this platform "out of the box". Contributed by Scott
|
|
|
|
Hofmann.
|
|
|
|
|
|
|
|
- Improved support for VMS, in MISC/VMS/, contributed by Pat Rankin.
|
|
|
|
|
|
|
|
- Support for the Amiga, in MISC/Amiga/, contributed by Andreas
|
|
|
|
Scherer. Note that the contributed files were developed for
|
|
|
|
flex 2.4 and have not been tested with flex 2.5.
|
|
|
|
|
|
|
|
- Some notes on support for the NeXT, in MISC/NeXT, contributed
|
|
|
|
by Raf Schietekat.
|
|
|
|
|
|
|
|
- The MISC/ directory now includes a preformatted version of flex.1
|
|
|
|
in flex.man, and pre-yacc'd versions of parse.y in parse.{c,h}.
|
|
|
|
|
|
|
|
- The flex.1 and flexdoc.1 manual pages have been merged. There
|
|
|
|
is now just one document, flex.1, which includes an overview
|
|
|
|
at the beginning to help you find the section you need.
|
|
|
|
|
|
|
|
- Documentation now clarifies that start conditions persist across
|
|
|
|
switches to new input files or different input buffers. If you
|
|
|
|
want to e.g., return to INITIAL, you must explicitly do so.
|
|
|
|
|
|
|
|
- The "Performance Considerations" section of the manual has been
|
|
|
|
updated.
|
|
|
|
|
|
|
|
- Documented the "yy_act" variable, which when YY_USER_ACTION is
|
|
|
|
invoked holds the number of the matched rule, and added an
|
|
|
|
example of using yy_act to profile how often each rule is matched.
|
|
|
|
|
|
|
|
- Added YY_NUM_RULES, a definition that gives the total number
|
|
|
|
of rules in the file, including the default rule (even if you
|
|
|
|
use -s).
|
|
|
|
|
|
|
|
- Documentation now clarifies that you can pass a nil FILE* pointer
|
|
|
|
to yy_create_buffer() or yyrestart() if you've arrange YY_INPUT
|
|
|
|
to not need yyin.
|
|
|
|
|
|
|
|
- Documentation now clarifies that YY_BUFFER_STATE is a pointer to
|
|
|
|
an opaque "struct yy_buffer_state".
|
|
|
|
|
|
|
|
- Documentation now stresses that you gain the benefits of removing
|
|
|
|
backing-up states only if you remove *all* of them.
|
|
|
|
|
|
|
|
- Documentation now points out that traditional lex allows you
|
|
|
|
to put the action on a separate line from the rule pattern if
|
|
|
|
the pattern has trailing whitespace (ugh!), but flex doesn't
|
|
|
|
support this.
|
|
|
|
|
|
|
|
- A broken example in documentation of the difference between
|
|
|
|
inclusive and exclusive start conditions is now fixed.
|
|
|
|
|
|
|
|
- Usage (-h) report now goes to stdout.
|
|
|
|
|
|
|
|
- Version (-V) info now goes to stdout.
|
|
|
|
|
|
|
|
- More #ifdef chud has been added to the parser in attempt to
|
|
|
|
deal with bison's use of alloca().
|
|
|
|
|
|
|
|
- "make clean" no longer deletes emacs backup files (*~).
|
|
|
|
|
|
|
|
- Some memory leaks have been fixed.
|
|
|
|
|
|
|
|
- A bug was fixed in which dynamically-expanded buffers were
|
|
|
|
reallocated a couple of bytes too small.
|
|
|
|
|
|
|
|
- A bug was fixed which could cause flex to read and write beyond
|
|
|
|
the end of the input buffer.
|
|
|
|
|
|
|
|
- -S will not be going away.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.4.7 (03Aug94) and release 2.4.6:
|
|
|
|
|
|
|
|
- Fixed serious bug in reading multiple files.
|
|
|
|
|
|
|
|
- Fixed bug in scanning NUL's.
|
|
|
|
|
|
|
|
- Fixed bug in input() returning 8-bit characters.
|
|
|
|
|
|
|
|
- Fixed bug in matching text with embedded NUL's when
|
|
|
|
using %array or lex compatibility.
|
|
|
|
|
|
|
|
- Fixed multiple invocations of YY_USER_ACTION when using '|'
|
|
|
|
continuation action.
|
|
|
|
|
|
|
|
- Minor prototyping fixes.
|
|
|
|
|
|
|
|
Changes between release 2.4.6 (04Jan94) and release 2.4.5:
|
|
|
|
|
|
|
|
- Linking with -lfl no longer required if your program includes
|
|
|
|
its own yywrap() and main() functions. (This change will cause
|
|
|
|
problems if you have a non-ANSI compiler on a system for which
|
|
|
|
sizeof(int) != sizeof(void*) or sizeof(int) != sizeof(size_t).)
|
|
|
|
|
|
|
|
- The use of 'extern "C++"' in FlexLexer.h has been modified to
|
|
|
|
get around an incompatibility with g++'s header files.
|
|
|
|
|
|
|
|
Changes between release 2.4.5 (11Dec93) and release 2.4.4:
|
|
|
|
|
|
|
|
- Fixed bug breaking C++ scanners that use REJECT or variable
|
|
|
|
trailing context.
|
|
|
|
|
|
|
|
- Fixed serious input problem for interactive scanners on
|
|
|
|
systems for which char is unsigned.
|
|
|
|
|
|
|
|
- Fixed bug in incorrectly treating '$' operator as variable
|
|
|
|
trailing context.
|
|
|
|
|
|
|
|
- Fixed bug in -CF table representation that could lead to
|
|
|
|
corrupt tables.
|
|
|
|
|
|
|
|
- Fixed fairly benign memory leak.
|
|
|
|
|
|
|
|
- Added `extern "C++"' wrapper to FlexLexer.h header. This
|
|
|
|
should overcome the g++ 2.5.X problems mentioned in the
|
|
|
|
NEWS for release 2.4.3.
|
|
|
|
|
|
|
|
- Changed #include of FlexLexer.h to use <> instead of "".
|
|
|
|
|
|
|
|
- Added feature to control whether the scanner attempts to
|
|
|
|
refill the input buffer once it's exhausted. This feature
|
|
|
|
will be documented in the 2.5 release.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.4.4 (07Dec93) and release 2.4.3:
|
|
|
|
|
|
|
|
- Fixed two serious bugs in scanning 8-bit characters.
|
|
|
|
|
|
|
|
- Fixed bug in YY_USER_ACTION that caused it to be executed
|
|
|
|
inappropriately (on the scanner's own internal actions, and
|
|
|
|
with incorrect yytext/yyleng values).
|
|
|
|
|
|
|
|
- Fixed bug in pointing yyin at a new file and resuming scanning.
|
|
|
|
|
|
|
|
- Portability fix regarding min/max/abs macros conflicting with
|
|
|
|
function definitions in standard header files.
|
|
|
|
|
|
|
|
- Added a virtual LexerError() method to the C++ yyFlexLexer class
|
|
|
|
for reporting error messages instead of always using cerr.
|
|
|
|
|
|
|
|
- Added warning in flexdoc that the C++ scanning class is presently
|
|
|
|
experimental and subject to considerable change between major
|
|
|
|
releases.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.4.3 (03Dec93) and release 2.4.2:
|
|
|
|
|
|
|
|
- Fixed bug causing fatal scanner messages to fail to print.
|
|
|
|
|
|
|
|
- Fixed things so FlexLexer.h can be included in other C++
|
|
|
|
sources. One side-effect of this change is that -+ and -CF
|
|
|
|
are now incompatible.
|
|
|
|
|
2000-03-14 01:55:05 +03:00
|
|
|
- libfl.a now supplies private versions of the <string.h>/
|
1996-12-10 09:06:26 +03:00
|
|
|
<strings.h> string routines needed by flex and the scanners
|
|
|
|
it generates, to enhance portability to some BSD systems.
|
|
|
|
|
|
|
|
- More robust solution to 2.4.2's flexfatal() bug fix.
|
|
|
|
|
|
|
|
- Added ranlib of installed libfl.a.
|
|
|
|
|
|
|
|
- Some lint tweaks.
|
|
|
|
|
|
|
|
- NOTE: problems have been encountered attempting to build flex
|
|
|
|
C++ scanners using g++ version 2.5.X. The problem is due to an
|
|
|
|
unfortunate heuristic in g++ 2.5.X that attempts to discern between
|
|
|
|
C and C++ headers. Because FlexLexer.h is installed (by default)
|
|
|
|
in /usr/local/include and not /usr/local/lib/g++-include, g++ 2.5.X
|
|
|
|
decides that it's a C header :-(. So if you have problems, install
|
|
|
|
the header in /usr/local/lib/g++-include instead.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.4.2 (01Dec93) and release 2.4.1:
|
|
|
|
|
|
|
|
- Fixed bug in libfl.a referring to non-existent "flexfatal" function.
|
|
|
|
|
|
|
|
- Modified to produce both compress'd and gzip'd tar files for
|
|
|
|
distributions (you probably don't care about this change!).
|
|
|
|
|
|
|
|
|
|
|
|
Changes between release 2.4.1 (30Nov93) and release 2.3.8:
|
|
|
|
|
|
|
|
- The new '-+' flag instructs flex to generate a C++ scanner class
|
|
|
|
(thanks to Kent Williams). flex writes an implementation of the
|
|
|
|
class defined in FlexLexer.h to lex.yy.cc. You may include
|
|
|
|
multiple scanner classes in your program using the -P flag. Note
|
|
|
|
that the scanner class also provides a mechanism for creating
|
|
|
|
reentrant scanners. The scanner class uses C++ streams for I/O
|
|
|
|
instead of FILE*'s (thanks to Tom Epperly). If the flex executable's
|
|
|
|
name ends in '+' then the '-+' flag is automatically on, so creating
|
|
|
|
a symlink or copy of "flex" to "flex++" results in a version of
|
|
|
|
flex that can be used exclusively for C++ scanners.
|
|
|
|
|
|
|
|
Note that without the '-+' flag, flex-generated scanners can still
|
|
|
|
be compiled using C++ compilers, though they use FILE*'s for I/O
|
|
|
|
instead of streams.
|
|
|
|
|
|
|
|
See the "GENERATING C++ SCANNERS" section of flexdoc for details.
|
|
|
|
|
|
|
|
- The new '-l' flag turns on maximum AT&T lex compatibility. In
|
|
|
|
particular, -l includes support for "yylineno" and makes yytext
|
|
|
|
be an array instead of a pointer. It does not, however, do away
|
|
|
|
with all incompatibilities. See the "INCOMPATIBILITIES WITH LEX
|
|
|
|
AND POSIX" section of flexdoc for details.
|
|
|
|
|
|
|
|
- The new '-P' option specifies a prefix to use other than "yy"
|
|
|
|
for the scanner's globally-visible variables, and for the
|
|
|
|
"lex.yy.c" filename. Using -P you can link together multiple
|
|
|
|
flex scanners in the same executable.
|
|
|
|
|
|
|
|
- The distribution includes a "texinfo" version of flexdoc.1,
|
|
|
|
contributed by Roland Pesch (thanks also to Marq Kole, who
|
|
|
|
contributed another version). It has not been brought up to
|
|
|
|
date, but reflects version 2.3. See MISC/flex.texinfo.
|
|
|
|
|
|
|
|
The flex distribution will soon include G.T. Nicol's flex
|
|
|
|
manual; he is presently bringing it up-to-date for version 2.4.
|
|
|
|
|
|
|
|
- yywrap() is now a function, and you now *must* link flex scanners
|
|
|
|
with libfl.a.
|
|
|
|
|
|
|
|
- Site-configuration is now done via an autoconf-generated
|
|
|
|
"configure" script contributed by Francois Pinard.
|
|
|
|
|
|
|
|
- Scanners now use fread() (or getc(), if interactive) and not
|
|
|
|
read() for input. A new "table compression" option, -Cr,
|
|
|
|
overrides this change and causes the scanner to use read()
|
|
|
|
(because read() is a bit faster than fread()). -f and -F
|
|
|
|
are now equivalent to -Cfr and -CFr; i.e., they imply the
|
|
|
|
-Cr option.
|
|
|
|
|
|
|
|
- In the blessed name of POSIX compliance, flex supports "%array"
|
|
|
|
and "%pointer" directives in the definitions (first) section of
|
|
|
|
the scanner specification. The former specifies that yytext
|
|
|
|
should be an array (of size YYLMAX), the latter, that it should
|
|
|
|
be a pointer. The array version of yytext is universally slower
|
|
|
|
than the pointer version, but has the advantage that its contents
|
|
|
|
remain unmodified across calls to input() and unput() (the pointer
|
|
|
|
version of yytext is, still, trashed by such calls).
|
|
|
|
|
|
|
|
"%array" cannot be used with the '-+' C++ scanner class option.
|
|
|
|
|
|
|
|
- The new '-Ca' option directs flex to trade off memory for
|
|
|
|
natural alignment when generating a scanner's tables. In
|
|
|
|
particular, table entries that would otherwise be "short"
|
|
|
|
become "long".
|
|
|
|
|
|
|
|
- The new '-h' option produces a summary of the flex flags.
|
|
|
|
|
|
|
|
- The new '-V' option reports the flex version number and exits.
|
|
|
|
|
|
|
|
- The new scanner macro YY_START returns an integer value
|
|
|
|
corresponding to the current start condition. You can return
|
|
|
|
to that start condition by passing the value to a subsequent
|
|
|
|
"BEGIN" action. You also can implement "start condition stacks"
|
|
|
|
by storing the values in an integer stack.
|
|
|
|
|
|
|
|
- You can now redefine macros such as YY_INPUT by just #define'ing
|
|
|
|
them to some other value in the first section of the flex input;
|
|
|
|
no need to first #undef them.
|
|
|
|
|
|
|
|
- flex now generates warnings for rules that can't be matched.
|
|
|
|
These warnings can be turned off using the new '-w' flag. If
|
|
|
|
your scanner uses REJECT then you will not get these warnings.
|
|
|
|
|
|
|
|
- If you specify the '-s' flag but the default rule can be matched,
|
|
|
|
flex now generates a warning.
|
|
|
|
|
|
|
|
- "yyleng" is now a global, and may be modified by the user (though
|
|
|
|
doing so and then using yymore() will yield weird results).
|
|
|
|
|
|
|
|
- Name definitions in the first section of a scanner specification
|
|
|
|
can now include a leading '^' or trailing '$' operator. In this
|
|
|
|
case, the definition is *not* pushed back inside of parentheses.
|
|
|
|
|
|
|
|
- Scanners with compressed tables are now "interactive" (-I option)
|
|
|
|
by default. You can suppress this attribute (which makes them
|
|
|
|
run slightly slower) using the new '-B' flag.
|
|
|
|
|
|
|
|
- Flex now generates 8-bit scanners by default, unless you use the
|
|
|
|
-Cf or -CF compression options (-Cfe and -CFe result in 8-bit
|
|
|
|
scanners). You can force it to generate a 7-bit scanner using
|
|
|
|
the new '-7' flag. You can build flex to generate 8-bit scanners
|
|
|
|
for -Cf and -CF, too, by adding -DDEFAULT_CSIZE=256 to CFLAGS
|
|
|
|
in the Makefile.
|
|
|
|
|
|
|
|
- You no longer need to call the scanner routine yyrestart() to
|
|
|
|
inform the scanner that you have switched to a new file after
|
|
|
|
having seen an EOF on the current input file. Instead, just
|
|
|
|
point yyin at the new file and continue scanning.
|
|
|
|
|
|
|
|
- You no longer need to invoke YY_NEW_FILE in an <<EOF>> action
|
|
|
|
to indicate you wish to continue scanning. Simply point yyin
|
|
|
|
at a new file.
|
|
|
|
|
|
|
|
- A leading '#' no longer introduces a comment in a flex input.
|
|
|
|
|
|
|
|
- flex no longer considers formfeed ('\f') a whitespace character.
|
|
|
|
|
|
|
|
- %t, I'm happy to report, has been nuked.
|
|
|
|
|
|
|
|
- The '-p' option may be given twice ('-pp') to instruct flex to
|
|
|
|
report minor performance problems as well as major ones.
|
|
|
|
|
|
|
|
- The '-v' verbose output no longer includes start/finish time
|
|
|
|
information.
|
|
|
|
|
|
|
|
- Newlines in flex inputs can optionally include leading or
|
|
|
|
trailing carriage-returns ('\r'), in support of several PC/Mac
|
|
|
|
run-time libraries that automatically include these.
|
|
|
|
|
|
|
|
- A start condition of the form "<*>" makes the following rule
|
|
|
|
active in every start condition, whether exclusive or inclusive.
|
|
|
|
|
|
|
|
- The following items have been corrected in the flex documentation:
|
|
|
|
|
|
|
|
- '-C' table compression options *are* cumulative.
|
|
|
|
|
|
|
|
- You may modify yytext but not lengthen it by appending
|
|
|
|
characters to the end. Modifying its final character
|
|
|
|
will affect '^' anchoring for the next rule matched
|
|
|
|
if the character is changed to or from a newline.
|
|
|
|
|
|
|
|
- The term "backtracking" has been renamed "backing up",
|
|
|
|
since it is a one-time repositioning and not a repeated
|
|
|
|
search. What used to be the "lex.backtrack" file is now
|
|
|
|
"lex.backup".
|
|
|
|
|
|
|
|
- Unindented "/* ... */" comments are allowed in the first
|
|
|
|
flex input section, but not in the second.
|
|
|
|
|
|
|
|
- yyless() can only be used in the flex input source, not
|
|
|
|
externally.
|
|
|
|
|
|
|
|
- You can use "yyrestart(yyin)" to throw away the
|
|
|
|
current contents of the input buffer.
|
|
|
|
|
|
|
|
- To write high-speed scanners, attempt to match as much
|
|
|
|
text as possible with each rule. See MISC/fastwc/README
|
|
|
|
for more information.
|
|
|
|
|
|
|
|
- Using the beginning-of-line operator ('^') is fairly
|
|
|
|
cheap. Using unput() is expensive. Using yyless() is
|
|
|
|
cheap.
|
|
|
|
|
|
|
|
- An example of scanning strings with embedded escape
|
|
|
|
sequences has been added.
|
|
|
|
|
|
|
|
- The example of backing-up in flexdoc was erroneous; it
|
|
|
|
has been corrected.
|
|
|
|
|
|
|
|
- A flex scanner's internal buffer now dynamically grows if needed
|
|
|
|
to match large tokens. Note that growing the buffer presently
|
|
|
|
requires rescanning the (large) token, so consuming a lot of
|
|
|
|
text this way is a slow process. Also note that presently the
|
|
|
|
buffer does *not* grow if you unput() more text than can fit
|
|
|
|
into the buffer.
|
|
|
|
|
|
|
|
- The MISC/ directory has been reorganized; see MISC/README for
|
|
|
|
details.
|
|
|
|
|
|
|
|
- yyless() can now be used in the third (user action) section
|
|
|
|
of a scanner specification, thanks to Ceriel Jacobs. yyless()
|
|
|
|
remains a macro and cannot be used outside of the scanner source.
|
|
|
|
|
|
|
|
- The skeleton file is no longer opened at run-time, but instead
|
|
|
|
compiled into a large string array (thanks to John Gilmore and
|
|
|
|
friends at Cygnus). You can still use the -S flag to point flex
|
|
|
|
at a different skeleton file.
|
|
|
|
|
|
|
|
- flex no longer uses a temporary file to store the scanner's
|
|
|
|
actions.
|
|
|
|
|
|
|
|
- A number of changes have been made to decrease porting headaches.
|
|
|
|
In particular, flex no longer uses memset() or ctime(), and
|
|
|
|
provides a single simple mechanism for dealing with C compilers
|
|
|
|
that still define malloc() as returning char* instead of void*.
|
|
|
|
|
|
|
|
- Flex now detects if the scanner specification requires the -8 flag
|
|
|
|
but the flag was not given or on by default.
|
|
|
|
|
|
|
|
- A number of table-expansion fencepost bugs have been fixed,
|
|
|
|
making flex more robust for generating large scanners.
|
|
|
|
|
|
|
|
- flex more consistently identifies the location of errors in
|
|
|
|
its input.
|
|
|
|
|
|
|
|
- YY_USER_ACTION is now invoked only for "real" actions, not for
|
|
|
|
internal actions used by the scanner for things like filling
|
|
|
|
the buffer or handling EOF.
|
|
|
|
|
|
|
|
- The rule "[^]]" now matches any character other than a ']';
|
|
|
|
formerly it matched any character at all followed by a ']'.
|
|
|
|
This change was made for compatibility with AT&T lex.
|
|
|
|
|
|
|
|
- A large number of miscellaneous bugs have been found and fixed
|
|
|
|
thanks to Gerhard Wilhelms.
|
|
|
|
|
|
|
|
- The source code has been heavily reformatted, making patches
|
|
|
|
relative to previous flex releases no longer accurate.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #8 (21Feb93) and 2.3 Patch #7:
|
|
|
|
|
|
|
|
- Fixed bugs in dynamic memory allocation leading to grievous
|
|
|
|
fencepost problems when generating large scanners.
|
|
|
|
- Fixed bug causing infinite loops on character classes with 8-bit
|
|
|
|
characters in them.
|
|
|
|
- Fixed bug in matching repetitions with a lower bound of 0.
|
|
|
|
- Fixed bug in scanning NUL characters using an "interactive" scanner.
|
|
|
|
- Fixed bug in using yymore() at the end of a file.
|
|
|
|
- Fixed bug in misrecognizing rules with variable trailing context.
|
|
|
|
- Fixed bug compiling flex on Suns using gcc 2.
|
|
|
|
- Fixed bug in not recognizing that input files with the character
|
|
|
|
ASCII 128 in them require the -8 flag.
|
|
|
|
- Fixed bug that could cause an infinite loop writing out
|
|
|
|
error messages.
|
|
|
|
- Fixed bug in not recognizing old-style lex % declarations if
|
|
|
|
followed by a tab instead of a space.
|
|
|
|
- Fixed potential crash when flex terminated early (usually due
|
|
|
|
to a bad flag) and the -v flag had been given.
|
|
|
|
- Added some missing declarations of void functions.
|
|
|
|
- Changed to only use '\a' for __STDC__ compilers.
|
|
|
|
- Updated mailing addresses.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #7 (28Mar91) and 2.3 Patch #6:
|
|
|
|
|
|
|
|
- Fixed out-of-bounds array access that caused bad tables
|
|
|
|
to be produced on machines where the bad reference happened
|
|
|
|
to yield a 1. This caused problems installing or running
|
|
|
|
flex on some Suns, in particular.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #6 (29Aug90) and 2.3 Patch #5:
|
|
|
|
|
|
|
|
- Fixed a serious bug in yymore() which basically made it
|
|
|
|
completely broken. Thanks goes to Jean Christophe of
|
|
|
|
the Nethack development team for finding the problem
|
|
|
|
and passing along the fix.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #5 (16Aug90) and 2.3 Patch #4:
|
|
|
|
|
|
|
|
- An up-to-date version of initscan.c so "make test" will
|
|
|
|
work after applying the previous patches
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #4 (14Aug90) and 2.3 Patch #3:
|
|
|
|
|
|
|
|
- Fixed bug in hexadecimal escapes which allowed only digits,
|
|
|
|
not letters, in escapes
|
|
|
|
- Fixed bug in previous "Changes" file!
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #3 (03Aug90) and 2.3 Patch #2:
|
|
|
|
|
|
|
|
- Correction to patch #2 for gcc compilation; thanks goes to
|
|
|
|
Paul Eggert for catching this.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 Patch #2 (02Aug90) and original 2.3 release:
|
|
|
|
|
|
|
|
- Fixed (hopefully) headaches involving declaring malloc()
|
|
|
|
and free() for gcc, which defines __STDC__ but (often) doesn't
|
|
|
|
come with the standard include files such as <stdlib.h>.
|
|
|
|
Reordered #ifdef maze in the scanner skeleton in the hope of
|
|
|
|
getting the declarations right for cfront and g++, too.
|
|
|
|
|
|
|
|
- Note that this patch supercedes patch #1 for release 2.3,
|
|
|
|
which was never announced but was available briefly for
|
|
|
|
anonymous ftp.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.3 (full) release of 28Jun90 and 2.2 (alpha) release:
|
|
|
|
|
|
|
|
User-visible:
|
|
|
|
|
|
|
|
- A lone <<EOF>> rule (that is, one which is not qualified with
|
|
|
|
a list of start conditions) now specifies the EOF action for
|
|
|
|
*all* start conditions which haven't already had <<EOF>> actions
|
|
|
|
given. To specify an end-of-file action for just the initial
|
|
|
|
state, use <INITIAL><<EOF>>.
|
|
|
|
|
|
|
|
- -d debug output is now contigent on the global yy_flex_debug
|
|
|
|
being set to a non-zero value, which it is by default.
|
|
|
|
|
|
|
|
- A new macro, YY_USER_INIT, is provided for the user to specify
|
|
|
|
initialization action to be taken on the first call to the
|
|
|
|
scanner. This action is done before the scanner does its
|
|
|
|
own initialization.
|
|
|
|
|
|
|
|
- yy_new_buffer() has been added as an alias for yy_create_buffer()
|
|
|
|
|
|
|
|
- Comments beginning with '#' and extending to the end of the line
|
|
|
|
now work, but have been deprecated (in anticipation of making
|
|
|
|
flex recognize #line directives).
|
|
|
|
|
|
|
|
- The funky restrictions on when semi-colons could follow the
|
|
|
|
YY_NEW_FILE and yyless macros have been removed. They now
|
|
|
|
behave identically to functions.
|
|
|
|
|
|
|
|
- A bug in the sample redefinition of YY_INPUT in the documentation
|
|
|
|
has been corrected.
|
|
|
|
|
|
|
|
- A bug in the sample simple tokener in the documentation has
|
|
|
|
been corrected.
|
|
|
|
|
|
|
|
- The documentation on the incompatibilities between flex and
|
|
|
|
lex has been reordered so that the discussion of yylineno
|
|
|
|
and input() come first, as it's anticipated that these will
|
|
|
|
be the most common source of headaches.
|
|
|
|
|
|
|
|
|
|
|
|
Things which didn't used to be documented but now are:
|
|
|
|
|
|
|
|
- flex interprets "^foo|bar" differently from lex. flex interprets
|
|
|
|
it as "match either a 'foo' or a 'bar', providing it comes at the
|
|
|
|
beginning of a line", whereas lex interprets it as "match either
|
|
|
|
a 'foo' at the beginning of a line, or a 'bar' anywhere".
|
|
|
|
|
|
|
|
- flex initializes the global "yyin" on the first call to the
|
|
|
|
scanner, while lex initializes it at compile-time.
|
|
|
|
|
|
|
|
- yy_switch_to_buffer() can be used in the yywrap() macro/routine.
|
|
|
|
|
|
|
|
- flex scanners do not use stdio for their input, and hence when
|
2006-02-25 05:28:55 +03:00
|
|
|
writing an interactive scanner one must explicitly call fflush()
|
1996-12-10 09:06:26 +03:00
|
|
|
after writing out a prompt.
|
|
|
|
|
|
|
|
- flex scanner can be made reentrant (after a fashion) by using
|
|
|
|
"yyrestart( yyin );". This is useful for interactive scanners
|
|
|
|
which have interrupt handlers that long-jump out of the scanner.
|
|
|
|
|
|
|
|
- a defense of why yylineno is not supported is included, along
|
|
|
|
with a suggestion on how to convert scanners which rely on it.
|
|
|
|
|
|
|
|
|
|
|
|
Other changes:
|
|
|
|
|
|
|
|
- Prototypes and proper declarations of void routines have
|
|
|
|
been added to the flex source code, courtesy of Kevin B. Kenny.
|
|
|
|
|
|
|
|
- Routines dealing with memory allocation now use void* pointers
|
|
|
|
instead of char* - see Makefile for porting implications.
|
|
|
|
|
|
|
|
- Error-checking is now done when flex closes a file.
|
|
|
|
|
|
|
|
- Various lint tweaks were added to reduce the number of gripes.
|
|
|
|
|
|
|
|
- Makefile has been further parameterized to aid in porting.
|
|
|
|
|
|
|
|
- Support for SCO Unix added.
|
|
|
|
|
|
|
|
- Flex now sports the latest & greatest UC copyright notice
|
|
|
|
(which is only slightly different from the previous one).
|
|
|
|
|
|
|
|
- A note has been added to flexdoc.1 mentioning work in progress
|
|
|
|
on modifying flex to generate straight C code rather than a
|
|
|
|
table-driven automaton, with an email address of whom to contact
|
|
|
|
if you are working along similar lines.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.2 Patch #3 (30Mar90) and 2.2 Patch #2:
|
|
|
|
|
|
|
|
- fixed bug which caused -I scanners to bomb
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.2 Patch #2 (27Mar90) and 2.2 Patch #1:
|
|
|
|
|
|
|
|
- fixed bug writing past end of input buffer in yyunput()
|
|
|
|
- fixed bug detecting NUL's at the end of a buffer
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.2 Patch #1 (23Mar90) and 2.2 (alpha) release:
|
|
|
|
|
|
|
|
- Makefile fixes: definition of MAKE variable for systems
|
|
|
|
which don't have it; installation of flexdoc.1 along with
|
|
|
|
flex.1; fixed two bugs which could cause "bigtest" to fail.
|
|
|
|
|
|
|
|
- flex.skel fix for compiling with g++.
|
|
|
|
|
|
|
|
- README and flexdoc.1 no longer list an out-of-date BITNET address
|
|
|
|
for contacting me.
|
|
|
|
|
|
|
|
- minor typos and formatting changes to flex.1 and flexdoc.1.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.2 (alpha) release of March '90 and previous release:
|
|
|
|
|
|
|
|
User-visible:
|
|
|
|
|
|
|
|
- Full user documentation now available.
|
|
|
|
|
|
|
|
- Support for 8-bit scanners.
|
|
|
|
|
|
|
|
- Scanners now accept NUL's.
|
|
|
|
|
|
|
|
- A facility has been added for dealing with multiple
|
|
|
|
input buffers.
|
|
|
|
|
|
|
|
- Two manual entries now. One which fully describes flex
|
|
|
|
(rather than just its differences from lex), and the
|
|
|
|
other for quick(er) reference.
|
|
|
|
|
|
|
|
- A number of changes to bring flex closer into compliance
|
|
|
|
with the latest POSIX lex draft:
|
|
|
|
|
|
|
|
%t support
|
|
|
|
flex now accepts multiple input files and concatenates
|
|
|
|
them together to form its input
|
|
|
|
previous -c (compress) flag renamed -C
|
|
|
|
do-nothing -c and -n flags added
|
|
|
|
Any indented code or code within %{}'s in section 2 is
|
|
|
|
now copied to the output
|
|
|
|
|
|
|
|
- yyleng is now a bona fide global integer.
|
|
|
|
|
|
|
|
- -d debug information now gives the line number of the
|
|
|
|
matched rule instead of which number rule it was from
|
|
|
|
the beginning of the file.
|
|
|
|
|
|
|
|
- -v output now includes a summary of the flags used to generate
|
|
|
|
the scanner.
|
|
|
|
|
|
|
|
- unput() and yyrestart() are now globally callable.
|
|
|
|
|
|
|
|
- yyrestart() no longer closes the previous value of yyin.
|
|
|
|
|
|
|
|
- C++ support; generated scanners can be compiled with C++ compiler.
|
|
|
|
|
|
|
|
- Primitive -lfl library added, containing default main()
|
|
|
|
which calls yylex(). A number of routines currently living
|
|
|
|
in the scanner skeleton will probably migrate to here
|
|
|
|
in the future (in particular, yywrap() will probably cease
|
|
|
|
to be a macro and instead be a function in the -lfl library).
|
|
|
|
|
|
|
|
- Hexadecimal (\x) escape sequences added.
|
|
|
|
|
|
|
|
- Support for MS-DOS, VMS, and Turbo-C integrated.
|
|
|
|
|
|
|
|
- The %used/%unused operators have been deprecated. They
|
|
|
|
may go away soon.
|
|
|
|
|
|
|
|
|
|
|
|
Other changes:
|
|
|
|
|
|
|
|
- Makefile enhanced for easier testing and installation.
|
|
|
|
- The parser has been tweaked to detect some erroneous
|
|
|
|
constructions which previously were missed.
|
|
|
|
- Scanner input buffer overflow is now detected.
|
|
|
|
- Bugs with missing "const" declarations fixed.
|
|
|
|
- Out-of-date Minix/Atari patches provided.
|
|
|
|
- Scanners no longer require printf() unless FLEX_DEBUG is being used.
|
|
|
|
- A subtle input() bug has been fixed.
|
|
|
|
- Line numbers for "continued action" rules (those following
|
|
|
|
the special '|' action) are now correct.
|
|
|
|
- unput() bug fixed; had been causing problems porting flex to VMS.
|
|
|
|
- yymore() handling rewritten to fix bug with interaction
|
|
|
|
between yymore() and trailing context.
|
|
|
|
- EOF in actions now generates an error message.
|
|
|
|
- Bug involving -CFe and generating equivalence classes fixed.
|
|
|
|
- Bug which made -CF be treated as -Cf fixed.
|
|
|
|
- Support for SysV tmpnam() added.
|
|
|
|
- Unused #define's for scanner no longer generated.
|
|
|
|
- Error messages which are associated with a particular input
|
|
|
|
line are now all identified with their input line in standard
|
|
|
|
format.
|
|
|
|
- % directives which are valid to lex but not to flex are
|
|
|
|
now ignored instead of generating warnings.
|
|
|
|
- -DSYS_V flag can now also be specified -DUSG for System V
|
|
|
|
compilation.
|
|
|
|
|
|
|
|
|
|
|
|
Changes between 2.1 beta-test release of June '89 and previous release:
|
|
|
|
|
|
|
|
User-visible:
|
|
|
|
|
|
|
|
- -p flag generates a performance report to stderr. The report
|
|
|
|
consists of comments regarding features of the scanner rules
|
|
|
|
which result in slower scanners.
|
|
|
|
|
|
|
|
- -b flag generates backtracking information to lex.backtrack.
|
|
|
|
This is a list of scanner states which require backtracking
|
|
|
|
and the characters on which they do so. By adding rules
|
|
|
|
one can remove backtracking states. If all backtracking states
|
|
|
|
are eliminated, the generated scanner will run faster.
|
|
|
|
Backtracking is not yet documented in the manual entry.
|
|
|
|
|
|
|
|
- Variable trailing context now works, i.e., one can have
|
|
|
|
rules like "(foo)*/[ \t]*bletch". Some trailing context
|
|
|
|
patterns still cannot be properly matched and generate
|
|
|
|
error messages. These are patterns where the ending of the
|
|
|
|
first part of the rule matches the beginning of the second
|
|
|
|
part, such as "zx*/xy*", where the 'x*' matches the 'x' at
|
|
|
|
the beginning of the trailing context. Lex won't get these
|
|
|
|
patterns right either.
|
|
|
|
|
|
|
|
- Faster scanners.
|
|
|
|
|
|
|
|
- End-of-file rules. The special rule "<<EOF>>" indicates
|
|
|
|
actions which are to be taken when an end-of-file is
|
|
|
|
encountered and yywrap() returns non-zero (i.e., indicates
|
|
|
|
no further files to process). See manual entry for example.
|
|
|
|
|
|
|
|
- The -r (reject used) flag is gone. flex now scans the input
|
|
|
|
for occurrences of the string "REJECT" to determine if the
|
|
|
|
action is needed. It tries to be intelligent about this but
|
|
|
|
can be fooled. One can force the presence or absence of
|
|
|
|
REJECT by adding a line in the first section of the form
|
|
|
|
"%used REJECT" or "%unused REJECT".
|
|
|
|
|
|
|
|
- yymore() has been implemented. Similarly to REJECT, flex
|
|
|
|
detects the use of yymore(), which can be overridden using
|
|
|
|
"%used" or "%unused".
|
|
|
|
|
|
|
|
- Patterns like "x{0,3}" now work (i.e., with lower-limit == 0).
|
|
|
|
|
|
|
|
- Removed '\^x' for ctrl-x misfeature.
|
|
|
|
|
|
|
|
- Added '\a' and '\v' escape sequences.
|
|
|
|
|
|
|
|
- \<digits> now works for octal escape sequences; previously
|
|
|
|
\0<digits> was required.
|
|
|
|
|
|
|
|
- Better error reporting; line numbers are associated with rules.
|
|
|
|
|
|
|
|
- yyleng is a macro; it cannot be accessed outside of the
|
|
|
|
scanner source file.
|
|
|
|
|
|
|
|
- yytext and yyleng should not be modified within a flex action.
|
|
|
|
|
|
|
|
- Generated scanners #define the name FLEX_SCANNER.
|
|
|
|
|
|
|
|
- Rules are internally separated by YY_BREAK in lex.yy.c rather
|
|
|
|
than break, to allow redefinition.
|
|
|
|
|
|
|
|
- The macro YY_USER_ACTION can be redefined to provide an action
|
|
|
|
which is always executed prior to the matched rule's action.
|
|
|
|
|
|
|
|
- yyrestart() is a new action which can be used to restart
|
|
|
|
the scanner after it has seen an end-of-file (a "real" one,
|
|
|
|
that is, one for which yywrap() returned non-zero). It takes
|
|
|
|
a FILE* argument indicating a new file to scan and sets
|
|
|
|
things up so that a subsequent call to yylex() will start
|
|
|
|
scanning that file.
|
|
|
|
|
|
|
|
- Internal scanner names all preceded by "yy_"
|
|
|
|
|
|
|
|
- lex.yy.c is deleted if errors are encountered during processing.
|
|
|
|
|
|
|
|
- Comments may be put in the first section of the input by preceding
|
|
|
|
them with '#'.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Other changes:
|
|
|
|
|
|
|
|
- Some portability-related bugs fixed, in particular for machines
|
|
|
|
with unsigned characters or sizeof( int* ) != sizeof( int ).
|
|
|
|
Also, tweaks for VMS and Microsoft C (MS-DOS), and identifiers all
|
|
|
|
trimmed to be 31 or fewer characters. Shortened file names
|
|
|
|
for dinosaur OS's. Checks for allocating > 64K memory
|
|
|
|
on 16 bit'ers. Amiga tweaks. Compiles using gcc on a Sun-3.
|
|
|
|
- Compressed and fast scanner skeletons merged.
|
|
|
|
- Skeleton header files done away with.
|
|
|
|
- Generated scanner uses prototypes and "const" for __STDC__.
|
|
|
|
- -DSV flag is now -DSYS_V for System V compilation.
|
|
|
|
- Removed all references to FTL language.
|
|
|
|
- Software now covered by BSD Copyright.
|
|
|
|
- flex will replace lex in subsequent BSD releases.
|