NetBSD

Author	SHA1	Message	Date
chris	747fcfc089	Fix PTE_FLUSH_RANGE macro, it should have had a cnt parameter.	2002-11-12 09:46:37 +00:00
thorpej	508637429c	Fix pasto. (Man, it's not my day today, is it...)	2002-11-08 02:35:23 +00:00
thorpej	c05e648e83	Ensure that some integer constants are always unsigned.	2002-11-08 01:31:23 +00:00
thorpej	c2c9021d7d	Fix goof-ups in last (compiler used to test previously used a different file).	2002-11-08 00:19:51 +00:00
thorpej	c138531637	Adjust stdargs/varargs for GCC 3.x.	2002-11-08 00:08:02 +00:00
bsh	7b6639153c	make atomic_{set,clear}_bit() inline for arm32 ports, and add <machine/atomic.h> for them.	2002-10-19 12:22:33 +00:00
bjh21	a531a4ae8e	Undo recent cpu_switch register usage changes in order to decrease nathanw_sa merge pain.	2002-10-19 00:10:53 +00:00
bjh21	3d1b6867f0	In cpu_switch(), stack more registers at the start of the function, and hence save fewer into the PCB. This should give me enough free registers in cpu_switch to tidy things up and support MULTIPROCESSOR properly. While we're here, make the stacked registers into an APCS stack frame, so that DDB backtraces through cpu_switch() will work. This also affects cpu_fork(), which has to fabricate a switchframe and PCB for the new process.	2002-10-18 21:32:57 +00:00
bjh21	441e8907fe	Switch to using the MI C versions of setrunqueue() and remrunqueue(). GCC produces almost exactly the same instructions as the hand-assembled versions, albeit in a different order. It even found one place where it could shave one off. Its insistence on creating a stack frame might slow things down marginally, but not, I think, enough to matter.	2002-10-15 20:53:38 +00:00
bjh21	75248cc7a1	It appears that MI code requires ci_cpuid to be the CPU number of the CPU in question, whereas the ARM code was using it to hold the model identification. To fix this, rename: ci_cpuid -> ci_arm_cpuid ci_cputype -> ci_arm_cputype (for consistency) ci_cpurev -> ci_arm_cpurev (ditto) ci_cpunum -> ci_cpuid This makes top(1) give correct CPU numbers in its "STATE" column (all 0 for now).	2002-10-13 12:24:57 +00:00
bjh21	d8fd346734	Remember the location of each CPU's idle PCB in struct cpu_info. Move allocation of the idle PCB from hydra.c to cpu.c and add some extra initialisation from cpu_fork().	2002-10-12 21:06:46 +00:00
bjh21	a7385c575f	Move curpcb into struct cpu_info in MULTIPROCESSOR kernels.	2002-10-12 12:20:08 +00:00
bjh21	5a9767e3de	Minor tidy-up, mostly to improve readability. The SWP instruction is now in its own little inline function, and this allows us to get rid of all the automatic variables elsewhere. This subtly changes the semantics of __cpu_simple_lock() such that the loop ends up one instruction longer, but I'm not sure that's a particularly bad thing.	2002-10-07 23:19:49 +00:00
thorpej	7bbf61fd89	Add support for restartable atomic sequences on 26-bit ARM. Compile tested only. Now that all ARM systems have RAS, move __HAVE_RAS from arm/arm32/types.h to arm/types.h.	2002-10-07 02:48:38 +00:00
bjh21	3832819227	Minimal changes to allow a kernel with "options MULTIPROCESSOR" to compile and boot multi-user on a single-processor machine. Many of these changes are wildly inappropriate for actual multi-processor operation, and correcting this will be my next task.	2002-10-05 13:46:57 +00:00
chs	c081614ea2	it really helps to get the stub right before cutting + pasting it 27 times. alas, I did not. doh.	2002-09-22 07:53:39 +00:00
chs	55e1f79335	add pmap_remove_all() hook (empty on most platforms so far).	2002-09-22 07:17:08 +00:00
simonb	eb4524608c	Only need to define __HAVE_MD_RUNQUEUE once here...	2002-09-22 05:56:32 +00:00
gmcgarry	dca80f08fd	Add __HAVE_MD_RUNQUEUE flag for MD code to override MI run queue primitives.	2002-09-22 04:11:32 +00:00
gehenna	77a6b82b27	Merge the gehenna-devsw branch into the trunk. This merge changes the device switch tables from static array to dynamically generated by config(8). - All device switches is defined as a constant structure in device drivers. - The new grammer ``device-major'' is introduced to ``files''. device-major <prefix> char <num> [block <num>] [<rules>] - All device major numbers must be listed up in port dependent majors.<arch> by using this grammer. - Added the new naming convention. The name of the device switch must be <prefix>_[bc]devsw for auto-generation of device switch tables. - The backward compatibility of loading block/character device switch by LKM framework is broken. This is necessary to convert from block/character device major to device name in runtime and vice versa. - The restriction to assign device major by LKM is completely removed. We don't need to reserve LKM entries for dynamic loading of device switch. - In compile time, device major numbers list is packed into the kernel and the LKM framework will refer it to assign device major number dynamically.	2002-09-06 13:18:43 +00:00
thorpej	212cb9f78d	Add machine-dependent bits of RAS for arm32.	2002-08-31 03:07:32 +00:00
thorpej	aafe6e006c	Define macros describing the 4M super-sections that our pmap actually uses (since we allocate PT pages in 4K chunks, rather than 1K chunks).	2002-08-24 02:48:50 +00:00
thorpej	77a6866508	Enable caching on kernel and user page tables. This saves having to do uncached memory access during VM operations (which can be quite expensive on some CPUs). We currently write-back PTEs as soon as they're modified; there is some room for optimization (to write them back in larger chunks). For PTEs in the APTE space (i.e. PTEs for pmaps that describe another process's address space), PTEs must also be evicted from the cache complete (PTEs in PTE space will be evicted durint a context switch).	2002-08-24 02:16:30 +00:00
thorpej	6cc7c1c1ff	* Add PTE_SYNC() and PTE_SYNC_RANGE() macros. These don't actually do anything yet. * Use PTE_SYNC() and PTE_SYNC_RANGE() in some obvious places, i.e. where vtopte() is used.	2002-08-22 01:13:53 +00:00
thorpej	a7d44c2503	Use separate function pointers for dmamap_sync pre- vs post- operations. Change the bus_dmamap_sync() macro to test the ops argument against pre- and post- constants. The compiler will optimize out dead code because of the constants. Since post- operations are not needed on ARM (except for ISA bounce buffers), this eliminate a large number of function calls which are noops, each of which cost at least 6 cycles just in the call and return overhead (not to mention whatever other useless work the compiler decides to do in the callee).	2002-08-17 20:46:26 +00:00
thorpej	ebff575bc3	* Add a new machdep.powersave sysctl, which controls the use of the CPU's "sleep" function in the idle loop. * Default all CPUs to not use powersave, except for the PDA processors (SA11x0 and PXA2x0). This significantly reduces inteterrupt latency in high-performance applications (and was good to squeeze another ~10% out of an XScale IOP on a Gig-E benchmark).	2002-08-16 15:25:53 +00:00
thorpej	4706ae8670	Use cpsr_c rather then cpsr_all where appropriate.	2002-08-14 23:33:11 +00:00
thorpej	323a5902ee	Garbage-collect some unused routines.	2002-08-14 23:24:46 +00:00
briggs	4bb5ae3d09	Inline SetCPSR calls where it seems prudent to do so. This avoids two branches and allows the compiler to better utilize registers around calls to disable/enable/restore_interrupts().	2002-08-14 21:55:52 +00:00
thorpej	203dd6b325	* Add an ARM32_DMAMAP_COHERENT flag to indicate that a loaded DMA map contains "coherent" (non-cached in ARM-land) mappings. * Set ARM32_DMAMAP_COHERENT in the map at the start of a load operation, and clear it in _bus_dmamap_load_buffer() if we encounter any cacheable mappings. * In _bus_dmamap_sync(), if the map is marked COHERENT, skip any cache flushing.	2002-08-14 20:50:37 +00:00
thorpej	201e41fc31	* Rename "word" -> 16, and "long" -> 32, as suggested by Ben Harris. * Replace __byte_swap_32_variable() with a C version from Richard Earnshaw that generates nearly identical assembly (and it would be exactly identical with the addition of another peephole to GCC ARM back-end).	2002-08-14 15:08:57 +00:00
thorpej	da5ef20b1a	Byte-swapping optimizations, enabled if compiling with GCC: * Byte-swap 16-bit and 32-bit constants at compile-time. * Inline 16-bit and 32-bit variable byte-swaps. These take 3 and 4 insns, respectively, and inlining saves the minimum 6 cycle penalty to call/return from the byte swap function.	2002-08-13 22:41:36 +00:00
thorpej	19227e620e	Add a PVF_EXEC -- we don't use it yet, though.	2002-08-09 23:08:39 +00:00
thorpej	884bc64586	Add some code, conditional on PMAP_ALIAS_DEBUG, that can be used to hunt for virtual aliases between managed (pmap_enter) and non-managed (pmap_kenter_pa) mappings.	2002-08-09 18:22:59 +00:00
thorpej	0291ab61ec	* PMC_TYPE_I80200 -> PMC_CLASS_I80200 to reflect the terminology used in pmc(3). * Some minor namespace cleanup.	2002-08-09 05:27:09 +00:00
thorpej	f91adb85ce	* XSCALE_PMC_TYPE_I80200 -> PMC_TYPE_I80200 * XSCALE_PMC_TYPE_CCNT -> PMC_TYPE_I80200_CCNT * XSCALE_PMC_TYPE_PMCx -> PMC_TYPE_I80200_PMCx Per discussion with Allen Briggs.	2002-08-07 21:11:35 +00:00
briggs	0b956d0b8b	Implement pmc(9) -- An interface to hardware performance monitoring counters. These counters do not exist on all CPUs, but where they do exist, can be used for counting events such as dcache misses that would otherwise be difficult or impossible to instrument by code inspection or hardware simulation. pmc(9) is meant to be a general interface. Initially, the Intel XScale counters are the only ones supported.	2002-08-07 05:14:47 +00:00
thorpej	dce4476374	Overhaul how DMA ranges work in the ARM bus_dma implementation. A new "arm32_dma_range" structure now describes a DMA window, with a system address base, bus address base, and length. In addition to providing info about which memory regions are legal for DMA, the new structure provides address translation support, as well. As before, if a tag does not list any ranges, then all addresses are considered valid, and no DMA address translation is performed. This allows us to remove a large chunk of code which was duplicated and tweaked slightly (to do the address translation) from the stock ARM bus_dma in the XScale IOP and ARM Integrator ports. Test compiled on all ARM platforms, test booted on Intel IQ80321 and Shark.	2002-07-31 17:34:23 +00:00
thorpej	79af00bddb	Move the calls to uvm_page_physload() out of pmap_bootstrap() and into platform-specific initialization code, giving platform-specific code control over which free list a given chunk of memory gets put onto. Changes are essentially mechanical. Test compiled for all ARM platforms, test booted on Intel IQ80321 and Shark. Discussed some time ago on port-arm.	2002-07-31 00:20:51 +00:00
thorpej	7b652cb939	Change the way that DMA map syncs are done. Instead of remembering the virtual address for each DMA segment, just cache a pointer to the original buffer/buftype used to load the DMA map, and use that. This lets us shrink the bus_dma_segment_t down from 12 bytes to 8, and the cache flushing is also more efficient. Tested on an i80321 -- changes to others are mechanical.	2002-07-28 17:54:05 +00:00
briggs	c13ee269dd	Handle i80200 step D0 and i80321 step B0	2002-07-22 18:17:42 +00:00
ichiro	2255ed4ecb	add ixpcom to cdevsw	2002-07-16 14:20:04 +00:00
ichiro	7374c0afee	add support for ixp12x0	2002-07-15 16:27:15 +00:00
ichiro	83c0b66d47	add cpu id for "PXA250/210 3rd version CPUcore". for using many PDA/xscale-core.	2002-07-10 07:00:50 +00:00
thorpej	31404c3f2e	When delivering a signal, there is no need to push the signal number, code, context pointer, or handler onto the stack, so don't do so.	2002-06-23 00:16:20 +00:00
thorpej	ffe1440f29	Add the CPU ID for the 600MHz i80321 part.	2002-06-07 18:25:28 +00:00
thorpej	dada8613e1	Let machine-dependent code specify how to enumerate the bus. Currently, everyone uses pci_enumerate_bus_generic().	2002-05-15 19:23:51 +00:00
thorpej	7d3e137a0c	Hard-wire CLKF_BASEPRI() to 0 on the ARM, since spllowersoftclock() might not actually be able to unblock the interrupt, which would cause us to run the softclock interrupts with hardclock blocked. Per discussion w/ Charles Hannum.	2002-05-08 22:22:46 +00:00
rjs	767d5585e0	Use processor specific versions of ARM cache control functions for SA1100 and SA1110 instead of using SA110 ones. Rename common StrongARM functions from sa110_* to sa1_*. Reviewed by Jason Thorpe.	2002-05-03 16:45:21 +00:00
thorpej	860fe83065	Add support for the Intel PXA210 and PXA250. From Hiroyuki Bessho, PR 16617.	2002-05-03 03:28:48 +00:00

1 2 3 4 5 ...

280 Commits