NetBSD

Commit Graph

Author	SHA1	Message	Date
mrg	366296f5a8	add a description about what was being attempted to failed writes messages.	2015-08-05 07:10:03 +00:00
joerg	2b8a053617	Retire __SCCSID. It has only archeological value now. Also retire lint conditional around __RCSID, lint can handle that fine.	2009-11-06 18:34:22 +00:00
dsl	8b6ec7b129	long align records written to temporary files.	2009-10-07 21:03:29 +00:00
dsl	eab2f96cf5	Fix borked fix for sort relying on realloc() changing the buffer end. Sorts of more than 8MB data now probably work again.	2009-09-28 20:30:01 +00:00
dsl	6458ae9cdf	Move all the fopen() calls out of the record read routines into the callers. Split the merge sort so that fsort() can pass the 'FILE *' of the temporary files to be merged into the merge code. Don't rely on realloc() not moving the end address of a buffer! Rework merge sort so that it sorts pointers to 'struct mfile' and only copies about sort record descriptors. No functional change intended.	2009-09-26 21:16:55 +00:00
dsl	1310aa04b4	Save length of key instead of relying of the weight of the record sep. This frees a byte value to use for 'end of key' (to correctly sort short keys) while still having a weight assigned to the field sep. (Unless -t is given, the field sep is in the field data.) Do reverse sorts by writing the output file in reverse order (rather than reversing the sort - apart from merges). All key compares are now unweighted. For 'sort -u' mark duplicates keys during the sort and don't write to the output. Use -S to mean a posix sort - where equal keys are sorted using the raw record (rather than being kept in the original order). For 'sort -f' (no keys) generate a key of the folded data (as for -n -i and -d), simplifies the code and allows a 'posix' sort.	2009-09-10 22:02:40 +00:00
dsl	2abdfb3907	Now we have our own radix_sort() change the interface so that we pass an array of 'RECHEADER *' and remove all the crappy stuff that backed up by REC_DATA_OFFSET (etc). Also change radix_sort() to return the number of elements, soon to be used to drop duplicate keys (for sort -u).	2009-09-05 12:00:25 +00:00
dsl	7b4a02befd	Rework the way sort generates sort keys: - If we generate a key, it is always sortable using memcmp() - If we are sorting the whole record, then a weight-table must be used during compares. - Major surgery to encoding of numbers to ensure unique keys for equal numeric values. Reverse numerics are handled by inverting the sign. - Case folding (-f) is handled when the sort keys are generated. No other code has to care at all. - Key uniqueness (-u) is done during merge for large datasets. It only has to be done when writing the output file for small files. Since the file is in key order this is simple! Probably fixes all of: PR/27257 PR/25551 PR/22182 PR/31095 PR/30504 PR/36816 PR/37860 PR/39308 Also PR/18614 should no longer die, but a little more work needs to be done on the merging for very large files.	2009-08-22 10:53:28 +00:00
dsl	f155f3b8b9	The code that attempted to sort large files by sorting each chunk by the first key byte and writing to a temp file, then sorting the records from each temp file that had the same first key byte (and repeating for upto 4 key bytes) was a nice idea, but completely doomed to failure. Eg PR/9308 where a 70MB file has all but one record the same and short keys. Not only does the code not work, it is rather guaranteed to be slow. Instead always use a merge sort for fully sorted chunk of records (each temporary file contains one lot of sorted records). The -H option already did this, so just rip out all the code and variables that can't be used when -H was specified. Further cleanup to come ...	2009-08-18 18:00:28 +00:00
dsl	9ab8b68075	Replace all uses of sizeof(TRECHEADER) with REC_DATA_OFFSET - which is defined as offsetof(RECHEADER, data). Delete TRECHEADER.	2009-08-16 19:53:43 +00:00
dsl	9987745061	Remove reference to db.h by using separate ptr+len fields for the only structure that used it. Pass end of keybuf area, not size to enterkey() - largely to remove a variable who'se use isn't obvious from the name! The structute of this code sucks.	2009-08-15 18:40:01 +00:00
dsl	477a33f936	linebuf and linebuf_size are only used inside seq() - which also not only has its own static variable, but will also extend the buffer. Remove linebuf/size and change seq() to use a private, locally managed buffer.	2009-08-15 16:50:29 +00:00
dsl	5e8c7b5dbd	Remove the unused 'DBT *key' parameter from seq().	2009-08-15 16:10:40 +00:00
dsl	a3b5c4400f	In makeline() change 'pos' from 'char ' to 'u_char ' and remove all the casts associated with its use. None of the uses can possibly care about the signedness of the pointer.	2009-08-15 14:31:48 +00:00
dsl	2a0ab276a2	Ansify. I'm looking at fixing the 'sort -n' fubars, but this code is an inpeneterable mess - which needs some fixing first!	2009-08-15 09:48:46 +00:00
lukem	64d3192b1d	Fix WARNS=4 issues (-Wcast-qual -Wsign-compare)	2009-04-13 11:07:59 +00:00
martin	ce099b4099	Remove clause 3 and 4 from TNF licenses	2008-04-28 20:22:51 +00:00
mrg	f066626ffb	char -> u_char in a couple of places to match other variables.	2006-05-11 19:16:42 +00:00
he	90d4762740	Initialize a local variable to appease -Wuninitialized. Marked with XXXGCC for pmppc (found while compiling for it). Reviewed by lukem.	2005-06-07 09:51:34 +00:00
jdolecek	33f354551b	fix some cases of use of unitialized variables	2004-02-15 11:52:12 +00:00
itojun	ba02466ff9	KNF (mostly whitespace)	2003-10-18 03:03:20 +00:00
itojun	50847da5c5	safer use of realloc	2003-10-16 06:56:17 +00:00
jdolecek	f84513a754	add TNF copyright	2003-08-07 11:32:34 +00:00
agc	89aaa1bb64	Move UCB-licensed code from 4-clause to 3-clause licence. Patches provided by Joel Baker in PR 22365, verified by myself.	2003-08-07 11:13:06 +00:00
jdolecek	0f5341a33d	max_o in struct tempfile needs to be off_t use fseeko() rather than fseek() when changing file offset using max_o	2002-12-24 15:09:27 +00:00
jdolecek	266fc04d19	Make compilable with -Wshadow	2001-05-15 11:18:23 +00:00
jdolecek	affba8f2e9	Pull up various cosmetic (mostly whitespace) changes from OpenBSD. This is primarily to ease syncing the two versions.	2001-02-19 20:50:17 +00:00
jdolecek	5347005ed0	resurrect old ftmp() - it supports alternative directory for temporary file, which is needed for -T support	2001-02-19 15:45:45 +00:00
jdolecek	966f1aeec3	makeline(): make the overflow handling code safe vs. buffer realloc, add a comment explaining what we do here	2001-01-18 21:02:47 +00:00
jdolecek	b36ae8b14a	makeline(): put back the memmove(3) removed in rev 1.5 in belief it's been redundant. "Oops" This fixes bug reported to me by Simon Burge.	2001-01-13 20:10:52 +00:00
itojun	8dd4895415	fix few confusing indentation. XXX still broken	2001-01-13 17:27:21 +00:00
jdolecek	16b90fdb48	one more warning to kill	2001-01-13 10:17:43 +00:00
jdolecek	7102161857	Since SUS explicitly specifies sort(1) should append a record delimiter to file if it doesn't end with one, don't warn when this happens.	2001-01-13 10:07:06 +00:00
jdolecek	43de9457c0	remove #if 0 part	2001-01-12 19:24:42 +00:00
jdolecek	1c216f18ea	general cleanup of file list passing: * get rid of union f_handle, replace by passing explicit int parameter and (new) struct filelist * add new typedefs gen_func_t and put_func_t and use where appropriate	2001-01-11 14:05:24 +00:00
jdolecek	d3a4171066	make ftmp() wrapper aroung tmpfile(), there is no need to reimplement it move ftmp() from tmp.c to files.c g/c no longer needed stuff	2001-01-08 19:16:49 +00:00
jdolecek	c477768e0b	fix bugs caused by implicit assumption that 'length' and 'offset' members of struct recheader/trecheader are shorts - they are size_t now this makes sort pass all tests in TEST/stests again after my last change other misc cosmetic changes	2000-10-17 15:22:57 +00:00
jdolecek	ab259a291a	enlarge line buffer as necessary, so that it's possible to process lines longer than 65522 characters constify, rename MAXLLEN to DEFLLEN	2000-10-16 21:53:19 +00:00
jdolecek	681fb9cb36	don't use register declarations	2000-10-15 20:46:33 +00:00
bjh21	e5218d1719	Two classes of changes from the initial OpenBSD commit of this sort(1): FILE * variables are called "fp" rather than "fd". Better (safer) temporary-file handling.	2000-10-07 20:37:06 +00:00
bjh21	6029888a3a	Hit sort(1) with a hammer till it compiles. Also add RCSIDs.	2000-10-07 18:37:09 +00:00
bjh21	1d5d9b5b60	4.4BSD-Lite2 contrib/sort	2000-10-07 16:39:34 +00:00

42 Commits