src - FreeBSD source tree


author	Peter Wemm <peter@FreeBSD.org>	2013-08-11 20:03:12 +0000
committer	Peter Wemm <peter@FreeBSD.org>	2013-08-11 20:03:12 +0000
commit	f0957ccae4f402b93cf27b125542343d28b53109 (patch)
tree	7c1ae67d07b93aea05bfea51c590c1112b65042b /contrib/nvi/regex
parent	ebe2785690e3a82421eac98f089a934901731af5 (diff)
parent	be3e4646eef6a3abcf58590dac24a5dfe54540f6 (diff)

Update nvi-1.79 to 2.1.1-4334a8297f

This is the gsoc-2011 project to clean up and backport multibyte support
from other nvi forks in a form we can use.

USE_WIDECHAR is on unless building for the rescue crunchgen. This should
allow editing in the native locale encoding.

USE_ICONV depends on make.conf having 'WITH_ICONV=YES' for now. This
adds the ability to do things like edit a KOI8-R file while having $LANG
set to (say) en_US.UTF-8. iconv is used to transcode the characters for
display.

Other points:
* It uses gencat and catopen/etc instead of homegrown msg catalog stuff.
* A lot of stuff has been trimmed out, eg: the perl and tcl bindings which
  we could never use in base anyway.
* It uses ncursesw when in widechar mode. This could be interesting.

GSoC info: http://www.google-melange.com/gsoc/proposal/review/google/gsoc2011/zy/1
Repo at: https://github.com/lichray/nvi2

Obtained from: Zhihao Yuan <lichray@gmail.com>

Notes: svn path=/head/; revision=254225
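
The USE_ICONV paragraph of the commit message describes transcoding file contents for display. As a rough illustration of that idea only, the sketch below uses the standard iconv(3) interface to convert a KOI8-R byte string to UTF-8. It is not code from nvi: the helper name koi8r_to_utf8 and its buffer conventions are assumptions made for this example, and it relies on the local iconv implementation knowing the "KOI8-R" and "UTF-8" encoding names.

```c
#include <assert.h>
#include <iconv.h>
#include <stddef.h>

/*
 * Hypothetical helper (not part of nvi): transcode srclen bytes of
 * KOI8-R text into UTF-8.  Returns the number of bytes written to dst,
 * not counting the terminating NUL, or (size_t)-1 on any error.
 */
static size_t
koi8r_to_utf8(const char *src, size_t srclen, char *dst, size_t dstlen)
{
	iconv_t cd;
	char *in = (char *)src;	/* iconv(3) takes a non-const char ** */
	char *out = dst;
	size_t inleft = srclen;
	size_t outleft;

	assert(src != NULL && dst != NULL);
	if (dstlen == 0)
		return ((size_t)-1);
	outleft = dstlen - 1;	/* reserve one byte for the NUL */

	/* Argument order is (tocode, fromcode). */
	cd = iconv_open("UTF-8", "KOI8-R");
	if (cd == (iconv_t)-1)
		return ((size_t)-1);

	/* Fails on invalid input or when dst is too small. */
	if (iconv(cd, &in, &inleft, &out, &outleft) == (size_t)-1) {
		iconv_close(cd);
		return ((size_t)-1);
	}
	iconv_close(cd);

	*out = '\0';
	return ((size_t)(out - dst));
}
```

A caller would pass the raw bytes read from the file and a destination buffer sized for the worst case; every Cyrillic letter in KOI8-R occupies one byte and becomes two bytes in UTF-8. An editor would more plausibly convert incrementally and keep the conversion descriptor open, which this sketch does not attempt.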

Diffstat (limited to 'contrib/nvi/regex')

-rw-r--r--	contrib/nvi/regex/COPYRIGHT
-rw-r--r--	contrib/nvi/regex/WHATSNEW
-rw-r--r--	contrib/nvi/regex/cclass.h
-rw-r--r--	contrib/nvi/regex/cname.h	143
-rw-r--r--	contrib/nvi/regex/engine.c	1102
-rw-r--r--	contrib/nvi/regex/re_format.7	271
-rw-r--r--	contrib/nvi/regex/regcomp.c	1737
-rw-r--r--	contrib/nvi/regex/regerror.c	176
-rw-r--r--	contrib/nvi/regex/regex.3	540
-rw-r--r--	contrib/nvi/regex/regex.h	109
-rw-r--r--	contrib/nvi/regex/regex2.h	174
-rw-r--r--	contrib/nvi/regex/regexec.c	180
-rw-r--r--	contrib/nvi/regex/regfree.c
-rw-r--r--	contrib/nvi/regex/utils.h

14 files changed, 4809 insertions, 0 deletions

diff --git a/contrib/nvi/regex/COPYRIGHT b/contrib/nvi/regex/COPYRIGHT
new file mode 100644
index 000000000000..574f6bcec6c7
--- /dev/null
+++ b/contrib/nvi/regex/COPYRIGHT

@@ -0,0 +1,56 @@
+This software is not subject to any license of the American Telephone
+and Telegraph Company or of the Regents of the University of California.
+Permission is granted to anyone to use this software for any purpose on
+any computer system, and to alter it and redistribute it, subject
+to the following restrictions:
+1. The author is not responsible for the consequences of use of this
+ software, no matter how awful, even if they arise from flaws in it.
+2. The origin of this software must not be misrepresented, either by
+ explicit claim or by omission. Since few users ever read sources,
+ credits must appear in the documentation.
+3. Altered versions must be plainly marked as such, and must not be
+ misrepresented as being the original software. Since few users
+ ever read sources, credits must appear in the documentation.
+4. This notice may not be removed or altered.
+=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
+/*-
+ *
+ * Redistribution and use in source and binary forms, with or without
+ * modification, are permitted provided that the following conditions
+ * are met:
+ * 1. Redistributions of source code must retain the above copyright
+ * notice, this list of conditions and the following disclaimer.
+ * 2. Redistributions in binary form must reproduce the above copyright
+ * notice, this list of conditions and the following disclaimer in the
+ * documentation and/or other materials provided with the distribution.
+ * 3. All advertising materials mentioning features or use of this software
+ * must display the following acknowledgement:
+ * This product includes software developed by the University of
+ * California, Berkeley and its contributors.
+ * 4. Neither the name of the University nor the names of its contributors
+ * may be used to endorse or promote products derived from this software
+ * without specific prior written permission.
+ *
+ * THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
+ * ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
+ * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
+ * ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
+ * FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
+ * DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
+ * OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
+ * HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
+ * LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
+ * OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
+ * SUCH DAMAGE.
+ *
+ * @(#)COPYRIGHT 8.1 (Berkeley) 3/16/94
+ */

diff --git a/contrib/nvi/regex/WHATSNEW b/contrib/nvi/regex/WHATSNEW
new file mode 100644
index 000000000000..f4301d300dd3
--- /dev/null
+++ b/contrib/nvi/regex/WHATSNEW

@@ -0,0 +1,94 @@
+# @(#)WHATSNEW 8.3 (Berkeley) 3/18/94
+New in alpha3.4: The complex bug alluded to below has been fixed (in a
+slightly kludgey temporary way that may hurt efficiency a bit; this is
+another "get it out the door for 4.4" release). The tests at the end of
+the tests file have accordingly been uncommented. The primary sign of
+the bug was that something like a?b matching ab matched b rather than ab.
+(The bug was essentially specific to this exact situation, else it would
+have shown up earlier.)
+New in alpha3.3: The definition of word boundaries has been altered
+slightly, to more closely match the usual programming notion that "_"
+is an alphabetic. Stuff used for pre-ANSI systems is now in a subdir,
+and the makefile no longer alludes to it in mysterious ways. The
+makefile has generally been cleaned up some. Fixes have been made
+(again!) so that the regression test will run without -DREDEBUG, at
+the cost of weaker checking. A workaround for a bug in some folks'
+<assert.h> has been added. And some more things have been added to
+tests, including a couple right at the end which are commented out
+because the code currently flunks them (complex bug; fix coming).
+Plus the usual minor cleanup.
+New in alpha3.2: Assorted bits of cleanup and portability improvement
+(the development base is now a BSDI system using GCC instead of an ancient
+Sun system, and the newer compiler exposed some glitches). Fix for a
+serious bug that affected REs using many [] (including REG_ICASE REs
+because of the way they are implemented), *sometimes*, depending on
+memory-allocation patterns. The header-file prototypes no longer name
+the parameters, avoiding possible name conflicts. The possibility that
+some clot has defined CHAR_MIN as (say) `-128' instead of `(-128)' is
+now handled gracefully. "uchar" is no longer used as an internal type
+name (too many people have the same idea). Still the same old lousy
+performance, alas.
+New in alpha3.1: Basically nothing, this release is just a bookkeeping
+convenience. Stay tuned.
+New in alpha3.0: Performance is no better, alas, but some fixes have been
+made and some functionality has been added. (This is basically the "get
+it out the door in time for 4.4" release.) One bug fix: regfree() didn't
+free the main internal structure (how embarrassing). It is now possible
+to put NULs in either the RE or the target string, using (resp.) a new
+REG_PEND flag and the old REG_STARTEND flag. The REG_NOSPEC flag to
+regcomp() makes all characters ordinary, so you can match a literal
+string easily (this will become more useful when performance improves!).
+There are now primitives to match beginnings and ends of words, although
+the syntax is disgusting and so is the implementation. The REG_ATOI
+debugging interface has changed a bit. And there has been considerable
+internal cleanup of various kinds.
+New in alpha2.3: Split change list out of README, and moved flags notes
+into Makefile. Macro-ized the name of regex(7) in regex(3), since it has
+to change for 4.4BSD. Cleanup work in engine.c, and some new regression
+tests to catch tricky cases thereof.
+New in alpha2.2: Out-of-date manpages updated. Regerror() acquires two
+small extensions -- REG_ITOA and REG_ATOI -- which avoid debugging kludges
+in my own test program and might be useful to others for similar purposes.
+The regression test will now compile (and run) without REDEBUG. The
+BRE \$ bug is fixed. Most uses of "uchar" are gone; it's all chars now.
+Char/uchar parameters are now written int/unsigned, to avoid possible
+portability problems with unpromoted parameters. Some unsigned casts have
+been introduced to minimize portability problems with shifting into sign
+bits.
+New in alpha2.1: Lots of little stuff, cleanup and fixes. The one big
+thing is that regex.h is now generated, using mkh, rather than being
+supplied in the distribution; due to circularities in dependencies,
+you have to build regex.h explicitly by "make h". The two known bugs
+have been fixed (and the regression test now checks for them), as has a
+problem with assertions not being suppressed in the absence of REDEBUG.
+No performance work yet.
+New in alpha2: Backslash-anything is an ordinary character, not an
+error (except, of course, for the handful of backslashed metacharacters
+in BREs), which should reduce script breakage. The regression test
+checks *where* null strings are supposed to match, and has generally
+been tightened up somewhat. Small bug fixes in parameter passing (not
+harmful, but technically errors) and some other areas. Debugging
+invoked by defining REDEBUG rather than not defining NDEBUG.
+New in alpha+3: full prototyping for internal routines, using a little
+helper program, mkh, which extracts prototypes given in stylized comments.
+More minor cleanup. Buglet fix: it's CHAR_BIT, not CHAR_BITS. Simple
+pre-screening of input when a literal string is known to be part of the
+RE; this does wonders for performance.
+New in alpha+2: minor bits of cleanup. Notably, the number "32" for the
+word width isn't hardwired into regexec.c any more, the public header
+file prototypes the functions if __STDC__ is defined, and some small typos
+in the manpages have been fixed.
+New in alpha+1: improvements to the manual pages, and an important
+extension, the REG_STARTEND option to regexec().
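
The WHATSNEW entries above mention the REG_STARTEND option to regexec() and note that it allows NULs in the target string. The sketch below illustrates that usage under some assumptions: match_in_range is a hypothetical helper written for this example, not a function from the imported sources, and REG_STARTEND is a non-POSIX extension (present in the BSD regex library and also defined by glibc), so the code will not build against every libc.

```c
#include <assert.h>
#include <regex.h>
#include <stddef.h>

/*
 * Hypothetical helper: search only bytes [start, end) of buf for the
 * extended RE `pattern`.  On a match, store the match offsets (measured
 * from buf, not from buf + start) in *so and *eo and return 0.
 * Returns REG_NOMATCH if nothing matched, or -1 if regcomp() failed.
 */
static int
match_in_range(const char *pattern, const char *buf, size_t start,
    size_t end, regoff_t *so, regoff_t *eo)
{
	regex_t re;
	regmatch_t pm[1];
	int rc;

	assert(pattern != NULL && buf != NULL && start <= end);
	assert(so != NULL && eo != NULL);

	if (regcomp(&re, pattern, REG_EXTENDED) != 0)
		return (-1);

	/*
	 * With REG_STARTEND, pm[0] is an input as well as an output:
	 * it delimits the region to search, so the string need not be
	 * NUL-terminated and may contain NULs.
	 */
	pm[0].rm_so = (regoff_t)start;
	pm[0].rm_eo = (regoff_t)end;
	rc = regexec(&re, buf, 1, pm, REG_STARTEND);
	regfree(&re);
	if (rc != 0)
		return (rc);

	*so = pm[0].rm_so;
	*eo = pm[0].rm_eo;
	return (0);
}
```

For instance, given the five-byte buffer "ab\0ab", a call with start 3 and end 5 confines the search to the second "ab", past the embedded NUL that would end an ordinary regexec() search. The pattern a?b from the alpha3.4 entry should then match the whole of that "ab" rather than only the "b".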

diff --git a/contrib/nvi/regex/cclass.h b/contrib/nvi/regex/cclass.h
new file mode 100644
index 000000000000..f28bccdfafcc
--- /dev/null
+++ b/contrib/nvi/regex/cclass.h

@@ -0,0 +1,85 @@
+/* $NetBSD: cclass.h,v 1.2 2008/12/05 22:51:42 christos Exp $ */
+/*-
+ *
+ * This code is derived from software contributed to Berkeley by
+ * Henry Spencer of the University of Toronto.