NetBSD

Commit Graph

Author	SHA1	Message	Date
oster	bceb7a2778	bufpool must be accessed at splbio().	2004-03-23 02:34:10 +00:00
oster	7e8ad96008	If the DAG failed, need to make sure we wipe the dagList structures too.	2004-03-22 20:28:57 +00:00
oster	43ccce7d13	Why start a timer, and then just ignore it? punt	2004-03-21 21:20:46 +00:00
oster	78d093eaf5	Yesterday's fix to rf_disks.c (rev 1.51) was necessary, but not sufficient to clobber this nasty little bug. The behaviour observed was a panic when doing a 'raidctl -f' on a component when DAGs were in flight for the given RAID set. Unfortunatly, the faulty behaviour was very intermittent, and it was difficult to not only reliably reproduce the bug (nor determine when it was fixed!) but also to even figure out what might be the cause of the problem. The real issue was that ci_vp for the failed component was being set to NULL in rf_FailDisk(), but with DAGs still in flight, some of them were still expecting to use ci_vp to determine where to read to/write from! The fix is to call rf_SuspendNewRequestsAndWait() from rf_FailDisk() to make sure the RAID set is quiet and all IOs have completed before mucking with ci_vp and other data structures. rf_ResumeNewRequests() is then used to continue on as usual.	2004-03-21 21:08:08 +00:00
oster	3dd7f5503f	Fix a nastly little bug that I've been chasing over the past 12 hours. If raidPtr->numFailures isn't initialized properly, then all sorts of whacky things can happen, including incorrect DAGs being generated. (Triggering this problem is a little esoteric, which is why this bug has been in hiding for so long -- I only saw it after rebooting with a degraded RAID 5 set that was autoconfigured, rebuilding the failed componennt, and then failing the component while IO was happening to the RAID set.)	2004-03-21 06:32:03 +00:00
oster	492aa07868	Doesn't hurt much to zero this before we start mucking with it.	2004-03-21 06:16:49 +00:00
oster	01e44f9df5	Add in a couple of missed foo=foo->next's.	2004-03-21 03:22:08 +00:00
oster	ac19c32ed5	Can't conditionalize cleanup on numStripeUnitsBailed -- have to cleanup regardless. More importantly, we can't free any of the AccessStripeMaps here!	2004-03-20 21:25:55 +00:00
oster	06f16f554f	NO_STRIPE_LOCKS is never set, so this code will always execute. Remove conditionals, and left-shift code.	2004-03-20 17:30:40 +00:00
oster	1966e6afbb	Cleanup function prototypes.	2004-03-20 16:48:05 +00:00
oster	a7f8d0aef6	[bah.. specifying rf_dagutils.c twice on a checkin doesn't get you rf_dagutils.h... missed this one from yesterday. sorry folks :( ] Change signature of rf_AllocBuffer() to take a dag_h and buffer size instead of an PDA and an alloclist. This lets us do the vple dance inside of rf_AllocBuffer(). Cleanup usage of rf_AllocIOBuffer() and use rf_AllocBuffer() instead. Fix all uses of rf_AllocBuffer() to conform to the new way of doing things.	2004-03-20 15:56:21 +00:00
oster	9aa1b6b7c0	Change signature of rf_AllocBuffer() to take a dag_h and buffer size instead of an PDA and an alloclist. This lets us do the vple dance inside of rf_AllocBuffer(). Cleanup usage of rf_AllocIOBuffer() and use rf_AllocBuffer() instead. Fix all uses of rf_AllocBuffer() to conform to the new way of doing things.	2004-03-20 05:21:53 +00:00
oster	0ff2145648	For each RAID set, pre-allocate a number of "emergency buffers" to be used in the event that we can't malloc a buffer of the appropriate size in the traditional way. rf_AllocIOBuffer() and rf_FreeIOBuffer() deal with allocating/freeing these structures. These buffers are stored in a list on the 'iobuf' list. iobuf_count keeps track of how many buffers are available, and numEmergencyBuffers is the effective "high-water" mark for the freelist. The buffers allocated by rf_AllocIOBuffer() are stripe-unit sized, which is the maximum size requested by any of the callers. Add an iobufs entry to RF_DagHeader_s. Use it for keeping track of buffers that get allocated from the free-list. Add a "generic list" pool (VoidPointerListElement Pool) for elements used to maintain a list of allocated memory. [It is somewhat less than ideal to add another little pool to handle this...] Teach rf_AllocBuffer() to use the new rf_AllocIOBuffer(). Modify other Mallocs to use rf_AllocIOBuffer(), and to update dag_h->iobufs as appropriate. Update rf_FreeDAG() to handle cleanup of dag_h->iobufs. While here, add some missing pool_destroy() calls for a number of pools. With these changes, it should (in theory) be possible to swap on RAID 5 sets again. That said, I've not had any success there yet -- but the last issue I saw at least wasn't in RAIDframe. :-} [There is room for this code to become a bit more consise, but I wanted to do a checkpoint here with something known to work :) ]	2004-03-20 04:22:05 +00:00
oster	29c6e63ebb	dag_node_pool never did get used here. Turf.	2004-03-19 17:04:35 +00:00
oster	1a3e20d5d9	Introduce a dual-purpose pool for providing pointer and param "caches" for RF_DagNode_t's. Scale the structure size based on RF_MAXCOL. Use the new allocation method in InitNode(). Note that we can't get rid of the mallocs in there until we can prove that this new allocation method is a strict upper bound. Unless someone tries running a RAID set with 40 components, the mallocs here shouldn't shouldn't be an issue. (and if someone does make a set with 40 components they will run into other issues with other constants long before then)	2004-03-19 17:01:26 +00:00
oster	b2c52e1175	Take care of six more mallocs: - Pull rf_FreePhysDiskAddr() out from under a #ifdef, since we're now going to use it. - Add a pda_cleanup_list into the DAG header. Use it in rf_FreeDAG() to cleanup any PDA's that get allocated but have no "easy" way of being located and freed when the DAG completes. - numStripeUnitsAccessed is a per-stripe value, and has a maximum value equal to the number of colums (thus limited by RF_MAXCOL). Use this knowledge to set a high-bound on overlappingPDAs, and stuff it on the stack instead of malloc'ing it all the time! This costs us a whopping 40 bytes on the stack, but saves a malloc() and a free().	2004-03-19 15:16:18 +00:00
oster	5ac8fbad7f	Add a comment. Will hopefully save time next time someone tries to figure out where the allocated memory is freed.	2004-03-19 02:57:34 +00:00
oster	d3810da59b	Add a few comments to explain what some of these new structures are, and where they are used.	2004-03-19 02:34:30 +00:00
oster	208b461a96	Introduce 3 more pools and 6 functions to handle allocating/freeing elements from the pools. Re-work rf_SelectAlgorithm() to get rid of all the 8 malloc's, and to use the new functions to get/put these 'support structures'. I'm not overly happy with some of the variable names, but them's the breaks. In the process of changing things, fix a bug: - in the case where we can't create a dag, free asmh_b and blockFuncs too!! [if you were able to look at the source code related to these changes, and comprehend what was going on without having your eyes bleed or getting dizzy, please contact me... I'm sure I'll have more code which would benefit by you having a look at it before I commit it :) ]	2004-03-19 02:27:44 +00:00
oster	997983060e	Re-work rf_State_Quiesce() so that we don't have to hold a lock while doing a pool_get().	2004-03-19 01:56:03 +00:00
oster	b69e81af97	Remove a debugging line that was accidentally left in.	2004-03-18 17:46:22 +00:00
oster	ba5bdf0048	Use rf_AllocDAGNode() to get new DAG nodes.	2004-03-18 17:26:36 +00:00
oster	1051cc745f	Re-work the locking mechanisms for reconstruct and PSS structures such that we don't actually hold a simplelock while we are doing a pool_get(), but that we still effectively protecting critical code. This should fix all of the outstanding LOCKDEBUG warnings related to rebuilding RAID sets.	2004-03-18 16:54:54 +00:00
oster	d4fe1a2103	- Introduce a 'dagnode' pool. Initialize it and allow for cleanup. Provide rf_AllocDAGNode() and rf_FreeDAGNode() to handle allocation/freeing. - Introduce a "nodes" linked list of RF_DagNode_t's into the DAG header. Initialize nodes in InitHdrNode(). Arrange for nodes cleanup in rf_FreeDAG(). - Add a "list_next" to RF_DagNode_t to keep track of nodes on the above "nodes" list. (This is distinct from the "next" field of RF_DagNode_t, which keeps track of the firing order of nodes.) "list_next" gets used in the cleanup routines, and in traversing through a set of nodes that belong to a particular set of nodes (e.g. those belonging to xorNodes for a given DAG). - use rf_AllocDAGNode() instead of mallocs of variable-sized arrays of RF_DagNode_t's. Mostly mechanical changes to convert the DAG construction from "access nodes via an array index" to "access nodes via a 'nextnode' pointer". - rework a couple of tricky spots where assumptions about the node order was being abused. - performance remains consistent with performance before these changes. [Thanks to Simon Burge (simonb at you.know.where) for looking over the mechanical changes to make sure I didn't biff anything.]	2004-03-18 16:40:05 +00:00
oster	5f5c148f74	raidPtr->num_spare is NOT sufficient here. We must allocate at least an additional RF_MAXSPARE spare units, just in case.	2004-03-13 03:32:08 +00:00
oster	8e82e43e0e	This desc->mutex is only ever initialized -- never used. toss	2004-03-13 02:31:12 +00:00
oster	ae179d21e5	paramDAG and paramASM are only ever set, but never used. toss	2004-03-13 02:20:40 +00:00
oster	cb1bb8925a	Cleanup function prototypes.	2004-03-13 02:04:02 +00:00
oster	8150ff6fbd	- don't use rf_PrintUserStats() for recon statistics. rf_PrintUserStats() was mean for the simulator, and doesn't provide any real info in kernel-space, especially for reconstructs. Reconstructing actually renders the stats even more useless, since it resets them all to zero before the reconstruct starts! - since rf_PrintUserStats() is no longer used, nuke it along with the routines that feed it. Nothing was using this code, and if we ever need it again, we know where to find it.	2004-03-13 02:00:15 +00:00
oster	37533dd249	Introduce RF_DEBUG_ENGINE and use it to disable a number of debugging bits in rf_engine.c	2004-03-09 03:10:26 +00:00
oster	82c3abf8e3	Use RF_DEBUG_QUIESCE to #if-out some printfs that really only should be seen when debugging.	2004-03-09 02:41:21 +00:00
oster	71371823e7	rf_ShutdownList() isn't returning anything useful, and doesn't need to return anything. Cleanup. Cleanup function prototypes in rf_shutdown.h Use #if RF_DEBUG_SHUTDOWN to #if-out more code.	2004-03-09 02:15:33 +00:00
oster	bce42a3095	Move pss_pool to rf_pools. Will save a bit of extra memory at run-time, and we can only do one reconstruction at a time anyway. Nuke pss_issued_pool - move it to an internal structure in pss.	2004-03-08 02:25:27 +00:00
oster	8cbd53826e	A few more cleanups missed in last commit.	2004-03-08 01:59:26 +00:00
oster	acce07be62	Minor cleanup. No functional change.	2004-03-08 01:55:14 +00:00
oster	c25748c684	Rename rf_rad_pool_lock to rf_rad_lock. The latter is far more accurate.	2004-03-07 23:17:44 +00:00
oster	f95359dd19	- Introduce rf_pools which contains all of the various global pools used by RAIDframe. Convert all other RAIDframe global pools to use pools defined within this new structure. - Introduce rf_pool_init(), used for initializing a single pool in RAIDframe. Teach each of the configuration routines to use rf_pool_init(). - Cleanup a few pool-related comments. - Cleanup revent initialization and #defines. - Add a missing pool_destroy() for the reconbuffer pool. (Saves another 1K off of an i386 GENERIC kernel, and makes stuff a lot more readable)	2004-03-07 22:15:19 +00:00
oster	834d494bb2	- Introduce rf_pools which contains all of the various global pools used by RAIDframe. Convert all other RAIDframe global pools to use pools defined within this new structure. - Introduce rf_pool_init(), used for initializing a single pool in RAIDframe. Teach each of the configuration routines to use rf_pool_init(). - Cleanup a few pool-related comments. - Cleanup revent initialization and #defines. - Add a missing pool_destroy() for the reconbuffer pool. (Saves another 1K off of an i386 GENERIC kernel, and makes stuff a lot more readable)	2004-03-07 21:57:44 +00:00
oster	c31c7c5652	Cleanup function prototypes.	2004-03-07 05:01:15 +00:00
oster	dd0716f291	We don't need 2-fault tollerant stuff here in the default case. Use #if's to conditionalize the code compilation. (Saves another 1K on i386 GENERIC kernels)	2004-03-07 02:59:25 +00:00
oster	43648c8525	Minor cleanups. No functional changes.	2004-03-07 02:49:23 +00:00
oster	d02f580adf	- fix up initialization of rf_recond_pool - introduce rf_reconbuffer_pool and teach rf_MakeReconBuffer() to use it	2004-03-07 02:46:58 +00:00
oster	3f36d142d8	Re-work rf_GenerateFailedAccessASMs() to simplify things a bit. rf_AllocBuffer() is available, so use it to get buffer space instead of the previous RF_Malloc() bits. Saves a few bytes, but more importantly makes the code much more readable.	2004-03-07 02:25:36 +00:00
oster	9e4fc614fc	Pretty up a bit of unused code.	2004-03-06 23:53:31 +00:00
oster	967b00b4ec	rf_AllocBuffer() doesn't do anything with its dag_h parameter. Nuke it, and adjust callers.	2004-03-06 23:52:20 +00:00
oster	54eab15188	Sprinkle a few #if's to ignore some bits that are only used for RAID 6 or PQ stuff. Saves another 3K on i386 GENERIC.	2004-03-06 23:34:27 +00:00
oster	dc76c4b111	Minor tabbing cleanup. No functional change.	2004-03-06 22:59:42 +00:00
oster	bfeeabba13	Use RF_INCLUDE_PARITY_DECLUSTERING_DS to #if-out more unneeded bits. (We can't do RF_DISTRIBUTE_SPARE bits without the parity declustering stuff.)	2004-03-05 03:58:21 +00:00
oster	7bd09b0f2b	Introduce RF_DEBUG_DAG and use it to #if-out rf_dagDebug sections. (i386 GENERIC kernel shrinks by 1.6K)	2004-03-05 03:22:05 +00:00
oster	a0413593bc	- remove the RF__INC's, as necessary. They are not needed any more. - introduce RF_MIN_'s, as necessary. These will indicate the low-water mark for pools as well as the pool_prime() value. - add pool_setlowat() for the critical pools. - pool_prime() and pool_setlowat() the raidframe_cbufpool. - re-order some pool_prime()'s and pool_sethiwat()'s for clarity.	2004-03-05 02:53:55 +00:00
oster	a57d045808	Remove a comment that is no longer relevant.	2004-03-04 03:27:30 +00:00
oster	b6ec2bc708	Cleanup function prototypes.	2004-03-04 03:14:02 +00:00
oster	516b22a972	Another variable that was only ever initialized. Turf.	2004-03-04 03:02:01 +00:00
oster	5d768c003f	Cleanup function prototypes.	2004-03-04 02:55:58 +00:00
oster	708c19adaf	Fix mcpair locking issues -- we really don't need the lock held the entire time we spend in calling rf_DiskIOEnqueue().	2004-03-04 02:49:58 +00:00
oster	409f2e3222	More function prototype cleanups.	2004-03-04 02:26:50 +00:00
oster	5102542bf2	Cleanup function prototype.	2004-03-04 02:25:13 +00:00
oster	65f967653e	More leftover 'row' bits now removed.	2004-03-04 01:57:54 +00:00
oster	55cb2c9a0d	Cleanup one more leftover 'row' variable. Cleanup function prototypes in rf_diskqueue.h	2004-03-04 01:53:25 +00:00
oster	6e2928d6d5	resultNum isn't used anywhere. Good-bye.	2004-03-04 00:56:13 +00:00
oster	8b515e1496	rf_bwd1 and rf_bwd2 are holdovers from the "backward" error recovery. Nuke them, and the little bit of code associated with them.	2004-03-04 00:54:30 +00:00
oster	2fb9f8db54	Nuke some unnecessary casts. No functional changes.	2004-03-03 17:14:46 +00:00
oster	28bd6c8ea2	Introduce RF_REVENT_READ_FAILED, RF_REVENT_WRITE_FAILED and RF_REVENT_FORCEREAD_FAILED. This removes 3 more RF_PANIC()'s (but we'll currently still panic if any of these cases occur). fix up a few printf's. XXX: still needs more cleanup and testing (and be taught to not panic).	2004-03-03 16:59:54 +00:00
oster	b2784ea2d7	Cleanup function prototypes.	2004-03-03 16:12:28 +00:00
oster	992f7c8269	- cleanup unneeded 'row' from RF_CallbackDesc_s. - remove callbackArg2 from RF_CallbackDesc_s -- it is only ever set, never read. - now that this is done, all callbacks should only take a single argument, and we can simplify things further.	2004-03-03 13:29:00 +00:00
oster	dc851b8e23	Improve comments on rf_LookupRUStatus(). No functional changes.	2004-03-03 01:02:44 +00:00
oster	edf3711f4e	Cleanup function prototypes.	2004-03-03 00:58:03 +00:00
oster	b3eae139eb	- cleanup memory allocation in rf_AllocPSStatus() - change function signature of rf_LookupRUStatus(). The last argument is now a pointer to a new PSS, in case one is needed. Rather than having rf_LookupRUStatus() allocate a new PSS, we pre-allocate one beforehand, where necessary, just in case. - change callers of rf_lookupRUStatus() to deal with the new way of calling rf_lookupRUStatus(). [no improvement or worsening of parity rebuild/initialization performance.]	2004-03-03 00:45:20 +00:00
oster	564980b994	Another spot where we're willing to wait for pool_get to give us something useful.	2004-03-02 19:36:21 +00:00
oster	a37cdad459	A few more cases where RF_DEBUG_PSS can be used.	2004-03-02 16:40:10 +00:00
oster	79a5cbaf41	DO_WAIT and DO_SIGNAL macros are a bit much, as they are only each used in one place. Simplify things a bit. No functional change.	2004-03-02 16:03:00 +00:00
oster	a7e422ac7f	We're willing to wait, so we should always get something valid from pool_get().	2004-03-02 15:55:56 +00:00
oster	d2accf5991	Fix more cases where mutex locks were being held longer than necessary. [For the record: The mcpair mutex is being used to protect mcpair->flag. mcpair gets allocated before each call to rf_DispatchDAG(), so there is no other process/thread that could be mucking with it. It is only used to detect the completion of a given parity unit, and rf_DispatchDAG() only uses it to setup the callback argument for rf_MCPairWakeupFunc() which will be called when the IO completes. The code after the call to rf_DispatchDAG() sits and waits for a 'wakeup' on mcpair->cond (rf_MCPairWakeupFunc() does that). If mcpair->flag is 0 when rf_DispatchDAG() completes, then rf_MCPairWakeupFunc() hasn't been called yet (the IO hasn't completed). If it is 1, then the IO is already done, and we continue on our merry way without sleeping. Thus, we don't need to hold any lock on mcpair while calling rf_DispatchDAG().]	2004-03-02 15:47:35 +00:00
oster	c7eaad6a14	Use RF_ACC_TRACE to #if out more chunks of code related only to access tracing. (not turned on yet)	2004-03-01 23:30:57 +00:00
oster	e35e293611	We always wait until we get memory, so the NULL check doesn't buy us anything.	2004-03-01 01:12:22 +00:00
oster	63e07285f8	Bah. RF_ACC_TRACE bits need a bit more work before they can be turned off. Re-enable until all parts can be disabled at the same time.	2004-02-29 23:27:24 +00:00
oster	b5e62ef069	As a LOCK_DEBUG kernel complains, we should really initialze rf_tracing_mutex.	2004-02-29 23:10:58 +00:00
oster	4b8d5fc8b6	Unlock the mcpair mutex before we call rf_DispatchDAG(), and re-lock it once we are back. Makes things much happier on SMP boxen. Problem noted by SimonB. (Thanks!)	2004-02-29 23:03:30 +00:00
oster	e9ec029612	Oops.. might still need this mutex in a few places.	2004-02-29 22:32:37 +00:00
oster	65ba8db490	Minor shuffling of variable declarations to clean RF_ACC_TRACE #defines up a bit. No functional changes.	2004-02-29 22:11:54 +00:00
oster	0d2f3606aa	Add "RF_ACC_TRACE" as a new #define to rf_archs.h. Use it to conditionalize some of the access tracing and tracerec bits. Chops about 4 K off of an i386 GENERIC kernel.	2004-02-29 21:38:41 +00:00
oster	cf01eb78b2	Minor comment reformatting.	2004-02-29 20:47:37 +00:00
oster	d272af6ed2	When the 'if' condition is true, the function will return, so there is no need for this chunk of code to be in the 'else' part. Remove the 'else', and left-shift the code. No functional changes.	2004-02-29 20:11:26 +00:00
oster	5a02af5b21	Adjust _rf_ShutdownCreate() so that it is willing to wait for more memory. Since we only now ever "return(0)", just return (void) instead. Cleanup all uses of rf_ShutdownCreate() to not worry about it ever failing. Shaves another 600 bytes off of an i386 GENERIC kernel.	2004-02-29 04:03:50 +00:00
oster	da96c7f542	InitHdrNode() might as well return 'void'. Nothing ever pays attention to what it returns anyway!	2004-02-29 01:50:23 +00:00
oster	4907fd699b	rf_MakeAllocList() will always get memory. No point in checking for a case that won't ever occur.	2004-02-29 01:49:13 +00:00
oster	d4a8e64b16	We'd better have gotten a dag header from the pool. In any event, callers arn't checking what we return anyway. (Cleanup memory allocations.)	2004-02-29 01:47:45 +00:00
oster	2e0cff190a	TransferDagMemory() doesn't exist, so these lines are just wasting space.	2004-02-29 01:37:04 +00:00
oster	5e23bfcf9b	Stripe functions are now handled by a linked-list instead of a runtime-variable array. Fix a bug where stripeFuncs was being freed, and then being used after (in the case of numStripesBailed > 0).	2004-02-29 01:24:34 +00:00
oster	873f4a20f5	Add forgotten pool_destroy().	2004-02-27 03:18:02 +00:00
oster	24099528e9	Use a dynamically allocated linked list of dagLists instead of using a dynamically allocated variable-sized array (dagArray). Convert code to use the new linked list stuff instead of the array stuff (the ratio of one dagList per stripe still applies). The big advantage is in being able to more efficiently allocate the dagLists on-the-fly, and not have to know the size(s) of the array beforehand.	2004-02-27 02:55:17 +00:00
oster	d3f3b25e8d	Poolify alloclist stuff. Saves a whopping 32 bytes while decreasing code complexity.	2004-02-25 02:15:52 +00:00
wiz	f05e6f1a3a	occured -> occurred. From Peter Postma.	2004-02-24 15:12:51 +00:00
oster	8bf9613af8	unlockQNodes, unlockDataNodes, and unlockParityNodes are not used. Turf.	2004-02-21 20:06:29 +00:00
oster	2eaec068c9	Relegate some printf's to DEBUG land.	2004-02-08 04:37:56 +00:00
hannken	3db4e2acd8	Make VOP_STRATEGY(bp) a real VOP as discussed on tech-kern. VOP_STRATEGY(bp) is replaced by one of two new functions: - VOP_STRATEGY(vp, bp) Call the strategy routine of vp for bp. - DEV_STRATEGY(bp) Call the d_strategy routine of bp->b_dev for bp. DEV_STRATEGY(bp) is used only for block-to-block device situations.	2004-01-25 18:06:48 +00:00
oster	b68ecaf95f	Always ask for a new RF_StripeLockDesc_t "just in case", and then give it back if we don't need it. If we don't allocate it before we take our lock, LOCKDEBUG (rightfully) complains that we're trying to grab something from the pool with PR_WAITOK. This code (and the PR_WAITOK in particular) really needs to be revisited at some point.	2004-01-23 01:57:08 +00:00
oster	1a5bc3eb34	Add missing simple_lock_init(). Noted by skrueger-at-europe-dot-com.	2004-01-15 20:27:27 +00:00
oster	7ab3e4e2e3	Add 2 missing RF_LOCK_MUTEX()es. Problem pointed out by skrueger-at-europe-dot-com. (It turns out that the mutex used to serve two different purposes, not just one, and for its current use, it's actually miss-named. Will fix that some other time.)	2004-01-14 15:10:55 +00:00
oster	6297e36a60	Since the LOCK and UNLOCK flags are never used, no need in extracting them. Collapse the related variables down to zero. That means 'flags' is 0 as well. Nuke the extraction macros, a bunch of the variables, and replace 'flags' as well.	2004-01-10 17:04:44 +00:00
yamt	7266a95907	store a i/o priority hint in struct buf for buffer queue discipline.	2004-01-10 14:39:50 +00:00
oster	2e19186660	iCleanup the RF_CREATE_PARAM3(). Middle two "arguments" were nothing but 0 in all cases.	2004-01-10 00:56:27 +00:00
oster	5b7e949cbf	More lu_flags disappear into oblivion.	2004-01-10 00:53:08 +00:00
oster	7edb3013bb	Nuke lu_flag and code associated with it being non-zero. The compiler already knew that these chunks of code could never be reached (since lu_flag was always 0), so it already ignored them. No functional changes.	2004-01-09 23:35:59 +00:00
oster	ac6b86acbb	[This file should have been committed with the other rf_enableAtomicRMW changes.] Cleanup rf_enableAtomicRMW and its use. According to the comments, we can't set this to anything other than zero anyway. Shaves off another 900 bytes. lu_flag's days are numbered now, as are the middle parameters of RF_CREATE_PARAM3.	2004-01-09 23:31:37 +00:00
oster	35d7c1ee7e	Cleanup rf_enableAtomicRMW and its use. According to the comments, we can't set this to anything other than zero anyway. Shaves off another 900 bytes. lu_flag's days are numbered now, as are the middle parameters of RF_CREATE_PARAM3.	2004-01-09 23:26:17 +00:00
oster	d9ff8342a0	Clean up a few comments, and improve code formatting somewhat. Makes things a bit more readable. No functional changes.	2004-01-06 03:27:13 +00:00
oster	181f2eebee	remove terminate_disk_queues from RF_Raid_s. The hist_diskreq[] stuff is only used for the CHAINDECLUSTERING, so hide it with an #if.	2004-01-05 01:19:07 +00:00
oster	290ef1fca7	raidPtr->reconControl->percentCompleted only gets used in one debugging printf, and in rf_netbsdkintf.c. We can do the calculations inside of RF_DEBUG_RECON for the one debugging printf, and only perform the percentCompleted calculation "on demand" in the rf_netbsdkintf.c case. Shaves a few more bytes off an i386 GENERIC kernel, and ever-so-slightly decreases the amount of work performed during a reconstruct.	2004-01-04 21:06:04 +00:00
oster	87e6ffb752	As noted by Thierry Deval in a posting to misc/at/openbsd.org, rf_DecrAccessesCountState wasn't in the correct spot in RF_AccessState_e. Following up on that has resulted in one other correction. Changing orderings of these states is tricky, and shouldn't be attempted without some thorough analysis. For the changes committed, the following analysis is offerred: 1) RAIDframe uses a little state machine to take care of building, executing, and processing the DAGs used to direct IO. 2) The rf_DecrAccessesCountState state is handled by the function rf_State_DecrAccessCount(). The purpose of this state is to decrement the number of "accesses-in-flight". 3) rf_Cleanup_State is handled by rf_State_Cleanup(). Its job is to do general cleanup of DAG arrays and any stripe locks. 4) DefaultStates[] in rf_layout.c indicates that the right spot for rf_DecrAccessesCountState is just before rf_Cleanup_State. Analysis of code for both states indicates that the order doesn't matter too much, although rf_State_DecrAccessCount() should probably take place after rf_State_Cleanup() to be more correct. 5) Comments in rf_State_ProcessDAG() indicates that the next state should be rf_Cleanup_State. However: it attempts to get there by using desc->state++; which actually takes it to just rf_DecrAccessesCountState! This turned out to be OK before, since rf_Cleanup_State would follow right after, and all would be taken careof (albeit in arguably the "less correct" order). 6) With the current ordering, if we head directly to rf_Cleanup_State (as we do, for example, if multiple components fail in a RAID 5 set), then we'll actually miss going trough rf_DecrAccessesCountState), and could end up never being able to reach quiescence! Perhaps not too big of a deal, given that the RAID set is pretty much toast by that point at which such a drastic state change happens, but might as well have this correct. The changes made are: 1) Since having rf_State_DecrAccessCount() come after rf_State_Cleanup() is just fine, change rf_layout.c to reflect that rf_DecrAccessesCountState comes after rf_Cleanup_State (i.e. they swap positions in the state list). This means that going to rf_Cleanup_State after bailing on a failed DAG access will do all the right things -- the state will get cleaned up, and then the access counts will get decremented properly. The comment in rf_State_ProcessDAG() is now actually correct -- the next state will be rf_Cleanup_State. 2) Move rf_DecrAccessesCountState in RF_AccessState_e to just after rf_CleanupState. This puts RF_AccessState_e in sync with DefaultStates[]. Fortunately, these states are rarely referred to by name, and so this change ends up being mostly cosmetic -- it really only fixes cleanup behaviour for the recent "Failed to create a DAG" changes.	2004-01-04 06:37:16 +00:00
oster	24b034a69e	Fix the "We panic if we can't create a DAG" problem that's existed ~forever. This requires a number of things: 1) If we can't create a DAG, set desc->numStripes to 0 in rf_SelectAlgorithm. This will ensure that we don't attempt to free any dagArray[] elements in rf_StateCleanup. 2) Modify rf_State_CreateDAG() to not panic in the event of a DAG failure. Instead, set the bp->b_flags and bp->b_error, and set things up to skip to rf_State_Cleanup(). 3) Need to mark desc->status as "bad" so that we actually stop looking for a different DAG. (which we won't find... no matter how many times we try). 4) rf_State_LastState() will then do the biodone(), and return EIO for the IO in question. 5) Remove some " \|\| 1 "'s from ProcessNode(). These were for debugging, and we don't need the failure notices spewing over and over again as the failing DAGs are processed. 6) Needed to change if (asmap->numDataFailed + asmap->numParityFailed > 1) to if ((asmap->numDataFailed + asmap->numParityFailed > 1) \|\| (raidPtr->numFailures > 1)){ in rf_raid5.c so that it doesn't try to return rf_CreateNonRedundantWriteDAG as the creation function. 7) Note that we can't apply the above change to the RAID 1 code as with the silly "fake 2-D" RAID 1 sets, it is possible to have 2 failed components in the RAID 1 set, and that would stop them from working. (I really don't know why/how those "fake 2-D" RAID 1 sets even work with all the "single-fault" assumptions present in the rest of the code.) 8) Needed to protect rf_RAID0DagSelect() in a similar way -- it should return NULL as the createFunc. 9) No point printing out "Multiple disks failed..." a zillion times.	2004-01-02 21:41:08 +00:00
hannken	897a1fc008	Remove "row" from DEBUG statements.	2004-01-02 11:52:15 +00:00
oster	5b70d1d979	Cleanup some unused desc->flags: RF_DAG_RETURN_DAG RF_DAG_RETURN_ASM RF_DAG_TEST_ACCESS and the code that goes with them. A couple more of these can probably go too, but I might need them in a bit.	2004-01-01 23:35:08 +00:00
oster	f025a0425c	Finally cleanup RF_BZERO. Turns out that 'bp' was only used to obtain bp->b_proc for mapping userspace buffers to kernelspace in the original rf_kintf.c. That means bp isn't of any use in RF_BZERO() for us, and the macro can be replaced with just the memset(). No functional changes.	2004-01-01 20:39:58 +00:00
oster	da4ed88ce1	Tidy up the #includes a bit.	2004-01-01 19:37:17 +00:00
oster	f2512ce061	Nuke a few more 'row'-related bits that were missed in the row-removal.	2004-01-01 19:32:55 +00:00
oster	d3aa064253	Nuke a bunch of unused variables: - node_queue_cond - quiescent_cond - eq_cond - desc->cond - desc->head - diskqueue->numWaiting Nuke rf_print_unable_to_init_cond(). Nuke rf_TerminateDiskQueues prototype from rf_diskqueue.h.	2004-01-01 19:27:35 +00:00
oster	2c490b5588	Remove 3 unused debugging options: RF_DBG_OPTION(accessTraceBufSize, 0) RF_DBG_OPTION(disableAsyncAccs, 0) RF_DBG_OPTION(suppressTraceDelays, 0)	2003-12-31 22:25:38 +00:00
oster	7ea5de8a08	Clean up a bunch of comments.	2003-12-31 17:47:53 +00:00
oster	7f9bbe325c	Replace the DO_FREE() macro with just RF_Free(). No functional changes, just less confusion.	2003-12-31 16:32:50 +00:00
oster	8ff567e586	Fix comments and make function declarations conform. No functional changes.	2003-12-31 16:23:50 +00:00
oster	a5c20197a4	Add in a bunch of RF_SIGNAL_COND()'s that were missing. Tidy up a few lines.	2003-12-31 04:13:52 +00:00
oster	abbecc25a2	Left-shift another else{} chunk. No functional changes.	2003-12-31 04:00:01 +00:00
oster	b7bb3e3c62	left-shift the "else" part of the if(!lp_SubmitReconBuffer) condition. Cleanup. No real functional changes, just more readable.	2003-12-31 03:51:28 +00:00
oster	824f2ed930	Negate a condition, and flip if/else parts. Preparation for left-shifting the (now) else part. No real functional change.	2003-12-31 03:29:11 +00:00
oster	00515ece62	Remove a couple of incorrect comments. (rf_recon_req needs to be passed to rf_ReconThread(), and there's no way around that for now.)	2003-12-31 03:02:57 +00:00
oster	4283fdd054	init_dqd and clean_dqd don't do enough to be their own functions, and just serve to cause confusion. Back-merge their contents to their (only) calling spots and nuke them.	2003-12-31 02:47:58 +00:00
oster	7c465d22a7	Increase the scope of RF_DEBUG_PSS to cover a bit more stuff.	2003-12-31 00:42:46 +00:00
oster	4635d077cf	nuke the 'bp' argument to rf_bxor and friends since it is no longer used. In fact, bp was being set to "NULL" when passed rf_bxor() in rf_parityscan.c:rf_VerifyParityBasic().	2003-12-31 00:00:06 +00:00
oster	6e2bbdfcc9	Very carefully remove REMAP_VA() macro. It's only doing a simple assignement, and the 'bp' argument isn't used. (Guess what's going to disappear next?) No functional changes.	2003-12-30 23:40:20 +00:00
oster	f5e96bf53e	Nuke a forgotten 'row' from struct RF_PerDiskReconCtrl_s.	2003-12-30 23:23:17 +00:00
oster	dd05ba5c10	sparediskids isn't used anywhere. turf.	2003-12-30 22:56:40 +00:00
oster	0b4f5c68b8	No point keeping 'pt' around. It's only ever assigned to.	2003-12-30 22:12:10 +00:00
oster	5a41c1d7ce	rf_FreePDAList() and rf_FreeASMList() wern't using their l_end or count arguments. Nuke the unused args, and fix up callers and fn prototypes. Rename l_start arguments.	2003-12-30 22:11:14 +00:00
oster	c43fc67d7d	Some days you wonder if some of the function declaration consistency was just an accident in the first place. Cleanup function decls and a few comments. [ok.. so I wasn't going to fix this many.. but once you're on a roll....]	2003-12-30 21:59:03 +00:00
oster	01635d963b	Make function declarations consistent a) with each other and b) with KNF. Clean up comments.	2003-12-30 19:28:26 +00:00
oster	34ad01b703	Fix slight bogon from row removal. 'r' would have been 0 here, not 1, which means r*raidPtr->numCol would have always been 0, not raidPtr->numCol.	2003-12-30 17:46:59 +00:00
oster	ed1155b792	access_tracebuf is declared, allocated, and freed. No freeloading datastructures allowed! Punt. accessTraceBufCount, rf_accessTraceBufSize, and rf_stopCollectingTraces are similarly declared, initialized, and then never changed. Punt. rf_ShutdownAccessTrace() now does nothing. Remove it, and the callback setup stuff from rf_ConfigureAccessTrace().	2003-12-30 17:29:41 +00:00
oster	5c71699f0b	Clean up comments, and generally make more readable. No functional changes.	2003-12-29 17:13:36 +00:00
oster	f7e3fe7a56	Clean up code/comments. No functional changes. Just a lot easier to read now.	2003-12-29 16:57:35 +00:00
oster	86c2a9ed65	The ugly wonderment of appropriately placed ASCII characters formerly known as rf_threadstuff.c is no more.	2003-12-29 06:37:47 +00:00
oster	ee0afdc3dc	rf_lkmgr_mutex_init() is only called from one spot, and it really can't fail. Simplify life in rf_BootRaidframe(), and then nuke rf_lkmgr_mutex_init(). Cleanup rf_threadstuff.h a bit more too. rf_threadstuff.c is about to Go Away.	2003-12-29 06:30:42 +00:00
oster	e276d553e6	Garbage-collect a whole mess of this RF_THREADGROUP_* stuff that isn't being used. Then, nuke rf_init_managed_threadgroup() and all descendants and relations.	2003-12-29 06:19:28 +00:00
oster	8b09b35f45	Let's see... raidPtr->recon_done_procs is never set to anything (other than NULL when raidPtr is initialized). That means SignalReconDone() never does anything useful. Bye-bye! Say good-bye to recon_done_procs and recon_done_procs_mutex (and its initializer) as well.	2003-12-29 05:58:34 +00:00
oster	6a080d56e0	IO_BUF_ERR really doesn't need to be a macro. Replace the macro.	2003-12-29 05:52:58 +00:00
oster	fc5966383f	_rf_create_managed_cond() is now left doing nothing. Convert callers. Mash DO_RAID_COND in rf_driver.c out of existance. - Nuke (already #if 0'ed) _rf_create_managed_lkmgr_mutex() while we're busy here. simplify DO_INIT in rf_engine.c	2003-12-29 05:48:13 +00:00
oster	2f4066f82e	cond_destroyer joins mutex_destroyer in the "nothing useful is done here" department. remove _rf_init_threadgroup() and rf_destroy_threadgroup() which were already #if 0'ed. rf_cond_destroy() does nothing. Nuke it, and all callers. rf_cond_init() doesn't deserve to be a separate function any more. Fix up the remaining 3 callers, and nuke rf_cond_init(). Another 0.4K goes "poof", but still no functionality lost!	2003-12-29 05:36:19 +00:00
oster	c629932636	_rf_create_managed_mutex() is doing just a simple: rf_mutex_init(m) now. The rest of the fluff is no longer needed. It also cannot fail, so error checking on rf_create_managed_mutex() is just wasting space. Nuke the #define's associated with rf_create_managed_mutex(). Convert rf_create_managed_mutex(listp,m) to just rf_mutex_init(m). Remove wasteful "error checking" and simplify all instances where this is called. (another 0.3K saved in the binary, but the real savings is in code readability!)	2003-12-29 05:22:16 +00:00
oster	f7db986b43	mutex_destroyer() has been reduced to nothing. Turf it. That means there is no point in having any of the shutdown stuff call mutex_destroyer() on any of the mutexes. Nuke related code.	2003-12-29 05:09:08 +00:00
oster	9a5cad32eb	Since rf_mutex_destroy() is effectively a NOP now, nuke it.	2003-12-29 05:01:14 +00:00

1 2 3 4 5 ...

765 Commits