Bochs

Author	SHA1	Message	Date
Stanislav Shwartsman	fd60a984a0	Instructions that should not check pending FPU exceptions	2003-12-28 18:58:15 +00:00
Stanislav Shwartsman	9ccb363ec3	Bochs-style decode/execute of FPU instructions. With this coding style each instruction can be implemented separately, even apart from the current Bochs FPU emulator. Step by step I am going to transfer all FPU instructions from the current Bochs FPU emulator to the new style and remove the old, buggy emulator. Meanwhile, I can now implement all currently missing FPU instructions without hacking wm-fpu-emu.	2003-12-27 13:50:06 +00:00
Stanislav Shwartsman	d51aece0c1	Change BX_PANIC messages to BX_INFO when the behaviour is consistent with Intel/AMD docs. Instructions MOV_CxRx and MOV_RxCx are not supported in v8086 mode according to Intel manuals. Also, these instructions are treated as register-to-register regardless of the MODRM byte fields (according to AMD manuals). Also committed fix for MOV_EwSw by Kevin.	2003-11-13 21:17:31 +00:00
Stanislav Shwartsman	ac50ab3760	Implemented RCPSS/RCPPS SSE instructions	2003-11-07 20:53:27 +00:00
Stanislav Shwartsman	4e74efdf0c	Fast fxsave/fxrstor	2003-10-24 20:44:43 +00:00
Stanislav Shwartsman	ac20b6405a	- FXSAVE/FXRSTOR instructions should be available in P6 mode - Added second UD2 opcode to fetchdecode - Added RDPMC instruction to fetchdecode - 'changes' updated	2003-10-24 18:34:16 +00:00
Stanislav Shwartsman	7f570b0150	Added PNI new streaming extensions instructions. PNI can be enabled by setting BX_SUPPORT_PNI in config.h. After the feature is fully validated I'll also add a configure option. The implementation is ~complete. I've omitted only the three new FPU opcodes of the FISTTP instruction and the MONITOR/MWAIT instructions. Enjoy ! ;)	2003-08-29 21:20:52 +00:00
Stanislav Shwartsman	254ad17328	Changed the method of resolving opcode/attributes from the group table. The new method is more flexible and easier to understand. Reorganized the fetchdecode code to make it simpler and more understandable.	2003-08-28 19:25:23 +00:00
Stanislav Shwartsman	79f46df971	separate APIC from CPU	2003-08-17 18:55:16 +00:00
Alexander Krisak	8559551001	iretd cpu instruction in real mode implemented; I hope this closes bugs 537047, 603410, 637822, 664544, 687619.	2003-08-17 18:15:04 +00:00
Stanislav Shwartsman	1616539667	additional FPU changes	2003-08-01 09:32:33 +00:00
Volker Ruppert	2ef0c43c7d	- description of ldtr fixed	2003-06-08 09:55:50 +00:00
Stanislav Shwartsman	1d45167e5b	Merged NEW-INSTRUCTIONS branch	2003-05-15 16:41:17 +00:00
Kevin Lawton	a17d06abcb	Optimized the main cpu loop iCache checks to remove a redundant check. Commented out a number of instances of invalidate_prefetch_q(), for branches which do not change CS since the EIP window mechanism takes care of validating that EIP lands in the current page or not in the main cpu loop anyways. Fixed a couple cases (v8086 mode and real mode) of loading CS where the EIP page window was not invalidated in segment_ctrl_pro.cc. That may fix some aliasing problems reported before (OS2).	2003-05-10 22:25:55 +00:00
Stanislav Shwartsman	d1d2fb34f0	Fixed a number of compilation errors for the FPU-disabled case. Transferred fpu.cc from /fpu to /cpu.	2003-04-22 20:21:34 +00:00
Stanislav Shwartsman	7db893970c	Read attribute bits even for BxSplit11b opcodes. Moved the lock prefix check later in the fetchdecode function, when all attributes are ready.	2003-04-06 19:08:31 +00:00
Stanislav Shwartsman	a050c1ac7d	Reserved cpu attribute bit for 3DNOW instructions decoding	2003-04-05 16:40:55 +00:00
Stanislav Shwartsman	1e71c9e56e	Merged patch-unallowed-lock-cases patch. According to the Intel manuals: The LOCK prefix can be prepended only to the following instructions and only to those forms of the instructions where the destination operand is a memory operand: ADD, ADC, AND, BTC, BTR, BTS, CMPXCHG, CMPXCHG8B, DEC, INC, NEG, NOT, OR, SBB, SUB, XOR, XADD, and XCHG. If the LOCK prefix is used with one of these instructions and the source operand is a memory operand, an undefined opcode exception (#UD) will be generated. An undefined opcode exception will also be generated if the LOCK prefix is used with any instruction not in the above list. Checking of the LOCK prefix is done at the fetchDecode stage and does not burden Bochs's execution.	2003-04-05 12:16:53 +00:00
Christophe Bothamy	1a518b81fe	- add __attribute__((regparm(X))) performance trick with gcc on x86 on some cpu instructions (patch from Conn Clark) - performance improvement is 1% on win95 boot	2003-03-17 00:41:01 +00:00
Christophe Bothamy	50efc3b8c7	- apply Conn Clark's patch.perf-regparm-cclark : - it works only on x86 with gcc2.95+ - uses the GCC function attribute "regparm(n)" to declare that certain functions use the register calling convention - performance improvement is about 6%	2003-03-02 23:59:12 +00:00
Peter Tattam	94880d1412	Fix guest2host and related optimizations to work on 64 bit host. 1) fixed the type of "hostPageAddr" and associated typecasts. 2) fixed the type of "pages" and associated typecasts (overloaded variable) 3) patch to cpu.cc to calculate "eipPageBias" correctly in 64 bit mode	2003-02-28 02:37:18 +00:00
Peter Tattam	70d752c8c2	external debugger only: fixed ask() to be virtual to let a panic trap into external debugger	2003-02-26 02:41:30 +00:00
Stanislav Shwartsman	7fa75388a1	Added bx_cpuid value to the BX_CPU class to avoid any problems with BX_CPU_ID implementation	2003-02-13 15:51:22 +00:00
Stanislav Shwartsman	cdfc3cbce4	instrumentation enhancements: * renamed CPU_ID to BX_CPU_ID. with this new name there is no possibility for name contentions and BX_CPU_ID definition could be moved out to NEED_CPU_REG_SHORTCUTS block * returned back `unsigned BX_CPU::which_cpu(void)` function * added BX_CPU_ID parameter for BX_INSTR_PHY_READ(a20addr, len); BX_INSTR_PHY_WRITE(a20addr, len); now it will be BX_INSTR_PHY_READ(cpu_id, a20addr, len); BX_INSTR_PHY_WRITE(cpu_id, a20addr, len);	2003-02-13 15:04:11 +00:00
Bryce Denney	7336c891ee	- CPU_ID fix from Shai Fultheim, who writes: > CPU_ID is defined as > #define CPU_ID (BX_CPU_THIS_PTR local_apic.get_id()) > This is not true when the APIC name is changed (true in Linux). Please > change this to: > #define CPU_ID (BX_CPU_THIS - BX_CPU(0))	2003-02-09 13:30:39 +00:00
Christophe Bothamy	939b558fdf	- apply patch.sysenterexit-mrieker: - adds sysenter/sysexit support for cpu-level>=6 - enabled by ./configure --enable-sep	2003-01-20 20:10:31 +00:00
Stanislav Shwartsman	29ab05b4da	Removed duplicate SSE opcodes	2002-12-22 20:48:45 +00:00
Stanislav Shwartsman	1cd38bb7dd	Recommitted SSE code reorganization. Fix in FXSAVE/FXRSTOR opcodes -> If the OSFXSR bit in CR4 is not set, the FXRSTOR instruction does not restore the states of the XMM and MXCSR registers.	2002-12-22 20:13:00 +00:00
Stanislav Shwartsman	4906ffef7c	Clean Peter's commit with MOVNTDQ instruction implementation	2002-12-20 09:11:39 +00:00
Bryce Denney	9b2914fd1d	- Temporarily revert Stanislav's changes between 2002-12-18 and 2002-12-19. Because source files were added/removed it would require an update of the windows and macos project files, so I want to wait until after 2.0. M Makefile.in 1.51 back to 1.50 M cpu.h 1.121 back to 1.120 M fetchdecode.cc 1.37 back to 1.36 M fetchdecode64.cc 1.33 back to 1.32 M sse.cc 1.17 back to 1.16 A sse2.cc 1.27 back to 1.26 (added back) R sse_move.cc removed R sse_pfp.cc removed - to bring these changes back again, all we have to do is "cvs update -j tmp-before1 -j tmp-after1"	2002-12-19 05:53:18 +00:00
Stanislav Shwartsman	aa361badf2	Reorganized SSE/SSE2 code sse.cc -> general SSE stuff and SSE integer (MMX extensions) sse_move.cc -> memory transfer and shuffle opcodes sse_pfp.cc -> packed floating point operations	2002-12-18 22:33:44 +00:00
Christophe Bothamy	16ebfdb9e1	- update for macos compile	2002-12-12 15:29:45 +00:00
Stanislav Shwartsman	bcd57bdcaf	* Current duplicate SSE/SSE2 instructions list * MOVUPS_VpsWps (0f 10) = MOVUPD_VpdWpd (66 0f 10) = MOVDQU_VdqWdq (f3 0f 6f) MOVUPS_WpsVps (0f 11) = MOVUPD_WpdVpd (66 0f 11) = MOVDQU_WdqVdq (f3 0f 7f) MOVAPS_VpsWps (0f 28) = MOVAPD_VpdWpd (66 0f 28) = MOVDQA_VdqWdq (66 0f 6f) MOVAPS_WpsVps (0f 29) = MOVAPD_WpdVpd (66 0f 29) = MOVDQA_WdqVdq (66 0f 7f) MOVNTPS_MdqVps (0f 2b) = MOVNTPD_MdqVpd (66 0f 2b) MOVLPS_VpsMq (0f 12) = MOVLPD_VsdMq (66 0f 12) MOVLPS_MqVps (0f 13) = MOVLPD_MqVsd (66 0f 13) MOVHPS_VpsMq (0f 16) = MOVHPD_VpdMq (66 0f 16) MOVHPS_MqVps (0f 17) = MOVHPD_MqVpd (66 0f 17) ANDPS_VpsWps (0f 54) = ANDPD_VpdWpd (66 0f 54) = PAND_VpdWpd (66 0f db) ANDNPS_VpsWps (0f 55) = ANDNPD_VpdWpd (66 0f 55) = PANDN_VpdWpd (66 0f df) ORPS_VpsWps (0f 56) = ORPD_VpdWpd (66 0f 56) = POR_VpdWpd (66 0f eb) XORPS_VpsWps (0f 57) = XORPD_VpdWpd (66 0f 57) = PXOR_VpdWpd (66 0f ef) Removed dupes	2002-11-25 21:58:55 +00:00
Bryce Denney	dcedff8d46	- fix some minor compile bugs that appear when you mix up instrumentation, debugger, SMP, and x86-64. A few macros were missing the CPU_ID argument, and a few passed nonexistent variables to the instrumentation macros. - I changed CPU_ID into a plain old macro instead of an inline call to a trivial which_cpu() function, and removed which_cpu(). Modified Files: cpu/cpu.h cpu/ctrl_xfer64.cc debug/dbg_main.cc	2002-11-21 18:22:03 +00:00
Bryce Denney	add9107dae	- add BOCHSAPI to bxICache_c	2002-11-15 18:12:04 +00:00
Stanislav Shwartsman	ccbc8e0ef7	MOVAPS/MOVAPD have a different exceptions	2002-11-15 12:44:39 +00:00
Stanislav Shwartsman	7ccf1de78f	According to the Intel (and AMD) manuals, many different SSE/SSE2 opcodes have EXACTLY the same operation. Deleted first three redundant opcodes (move integer data): MOVLPS_VpsMq (0f 12) = MOVLPD_VsdMq (66 0f 12) MOVLPS_MqVps (0f 13) = MOVLPD_MqVsd (66 0f 13) MOVHPS_VpsMq (0f 16) = MOVHPD_VpdMq (66 0f 16) MOVHPS_MqVps (0f 17) = MOVHPD_MqVpd (66 0f 17) Still under examination: XORPS,XORPD ORPS,ORPD ANDPS,ANDPD ANDNPS,ANDNPD MOVUPS,MOVUPD	2002-11-13 22:24:03 +00:00
Stanislav Shwartsman	968b2744f4	According to the Intel (and AMD) manuals, many different SSE/SSE2 opcodes have EXACTLY the same operation. Deleted first three redundant opcodes: MOVAPS_VpsWps (0f 28) = MOVAPD_VpdWpd (66 0f 28) MOVAPS_WpsVps (0f 29) = MOVAPD_WpdVpd (66 0f 29) MOVNTPS_MdqVps (0f 2b) = MOVNTPD_MdqVpd (66 0f 2b) Still to be checked: XORPS,XORPD ORPS,ORPD ANDPS,ANDPD ANDNPS,ANDNPD MOVUPS,MOVUPD MOVLPS,MOVLPD MOVHPS,MOVHPD	2002-11-13 21:35:17 +00:00
Stanislav Shwartsman	5803e20240	Changed policy of SSE/SSE2 checking	2002-11-13 21:00:05 +00:00
Stanislav Shwartsman	4363745725	Implemented SSE2 integer instructions: PACKSSDW_VdqWdq PUNPCKHDQ_VdqWq PUNPCKHWD_VdqWq PUNPCKHBW_VdqWq PUNPCKHQDQ_VdqWq MOVD_EdVd MOVD_VdqEd	2002-11-08 12:47:24 +00:00
Peter Tattam	1bb5040031	Miscellaneous patches for Tattam's External Debugger. - Now compiles for plain ia-32 - Fixed some printf formatting for ia32 only. - Update to latest Win32 DLL - Added an ICEBP (Undoc 0xF8, INT 01) facility. - updated to use latest VGA refresh routine	2002-11-04 05:27:26 +00:00
Gregory Alexander	0e390b33f6	Double semicolons are confusing VisualAge.	2002-10-28 18:36:53 +00:00
Stanislav Shwartsman	b84f0bd0f2	This was not a cleanup. Those macros were intentionally there to offer a way to substitute more efficient code to do the RMW cases. At the moment, they just map to the normal functions. Sorry, restored the previous version ...	2002-10-25 18:26:29 +00:00
Stanislav Shwartsman	a0c1fd60e6	Just little cleanup of macro duplicating an existing code	2002-10-25 17:23:34 +00:00
Bryce Denney	cec9135e9f	- Apply patch.replace-Boolean rev 1.3. Every "Boolean" is now changed to a "bx_bool" which is always defined as Bit32u on all platforms. In Carbon specific code, Boolean is still used because the Carbon header files define it to unsigned char. - this fixes bug [ 623152 ] MacOSX: Triple Exception Booting win95. The bug was that some code in Bochs depends on Boolean to be a 32 bit value. (This should be fixed, but I don't know all the places where it needs to be fixed yet.) Because Carbon defined Boolean as an unsigned char, Bochs just followed along and used the unsigned char definition to avoid compile problems. This exposed the dependency on 32 bit Boolean on MacOS X only and led to major simulation problems, that could only be reproduced and debugged on that platform. - On the mailing list we debated whether to make all Booleans into "bool" or our own type. I chose bx_bool for several reasons. 1. Unlike C++'s bool, we can guarantee that bx_bool is the same size on all platforms, which makes it much less likely to have more platform-specific simulation differences in the future. (I spent hours on a borrowed MacOSX machine chasing bug 618388 before discovering that different sized Booleans were the problem, and I don't want to repeat that.) 2. We still have at least one dependency on 32 bit Booleans which must be fixed some time, but I don't want to risk introducing new bugs into the simulation just before the 2.0 release. 
Modified Files: bochs.h config.h.in gdbstub.cc logio.cc main.cc pc_system.cc pc_system.h plugin.cc plugin.h bios/rombios.c cpu/apic.cc cpu/arith16.cc cpu/arith32.cc cpu/arith64.cc cpu/arith8.cc cpu/cpu.cc cpu/cpu.h cpu/ctrl_xfer16.cc cpu/ctrl_xfer32.cc cpu/ctrl_xfer64.cc cpu/data_xfer16.cc cpu/data_xfer32.cc cpu/data_xfer64.cc cpu/debugstuff.cc cpu/exception.cc cpu/fetchdecode.cc cpu/flag_ctrl_pro.cc cpu/init.cc cpu/io_pro.cc cpu/lazy_flags.cc cpu/lazy_flags.h cpu/mult16.cc cpu/mult32.cc cpu/mult64.cc cpu/mult8.cc cpu/paging.cc cpu/proc_ctrl.cc cpu/segment_ctrl_pro.cc cpu/stack_pro.cc cpu/tasking.cc debug/dbg_main.cc debug/debug.h debug/sim2.cc disasm/dis_decode.cc disasm/disasm.h doc/docbook/Makefile docs-html/cosimulation.html fpu/wmFPUemu_glue.cc gui/amigaos.cc gui/beos.cc gui/carbon.cc gui/gui.cc gui/gui.h gui/keymap.cc gui/keymap.h gui/macintosh.cc gui/nogui.cc gui/rfb.cc gui/sdl.cc gui/siminterface.cc gui/siminterface.h gui/term.cc gui/win32.cc gui/wx.cc gui/wxmain.cc gui/wxmain.h gui/x.cc instrument/example0/instrument.cc instrument/example0/instrument.h instrument/example1/instrument.cc instrument/example1/instrument.h instrument/stubs/instrument.cc instrument/stubs/instrument.h iodev/cdrom.cc iodev/cdrom.h iodev/cdrom_osx.cc iodev/cmos.cc iodev/devices.cc iodev/dma.cc iodev/dma.h iodev/eth_arpback.cc iodev/eth_packetmaker.cc iodev/eth_packetmaker.h iodev/floppy.cc iodev/floppy.h iodev/guest2host.h iodev/harddrv.cc iodev/harddrv.h iodev/ioapic.cc iodev/ioapic.h iodev/iodebug.cc iodev/iodev.h iodev/keyboard.cc iodev/keyboard.h iodev/ne2k.h iodev/parallel.h iodev/pci.cc iodev/pci.h iodev/pic.h iodev/pit.cc iodev/pit.h iodev/pit_wrap.cc iodev/pit_wrap.h iodev/sb16.cc iodev/sb16.h iodev/serial.cc iodev/serial.h iodev/vga.cc iodev/vga.h memory/memory.h memory/misc_mem.cc	2002-10-25 11:44:41 +00:00
Bryce Denney	5e520261db	Add plugin support to Bochs by merging all the changes from the BRANCH_PLUGINS branch! Authors: Bryce Denney Christophe Bothamy Kevin Lawton (we grabbed a lot of plugin code from plex86) Testing help from: Volker Ruppert Don Becker (Psyon) Jeremy Parsons (Br'fin) The change log is too long to paste in here. To read the change log, do cvs log patches/patch.final-from-BRANCH_PLUGINS.gz All the changes and a detailed description are contained in a patch called patch.final-from-BRANCH_PLUGINS.gz. To look at the complete patch, do cvs upd -r1.1 patches/patch.final-from-BRANCH_PLUGINS.gz Then you will have a local copy of the patch, which you can gunzip and play with however you want. Modified Files: .bochsrc Makefile.in aclocal.m4 bochs.h config.h.in configure configure.in gdbstub.cc logio.cc main.cc pc_system.cc pc_system.h state_file.h bios/Makefile.in bios/rombios.c cpu/Makefile.in cpu/access.cc cpu/apic.cc cpu/arith16.cc cpu/arith32.cc cpu/arith8.cc cpu/cpu.cc cpu/cpu.h cpu/ctrl_xfer32.cc cpu/exception.cc cpu/fetchdecode.cc cpu/fetchdecode64.cc cpu/flag_ctrl.cc cpu/flag_ctrl_pro.cc cpu/init.cc cpu/io.cc cpu/logical16.cc cpu/logical32.cc cpu/logical8.cc cpu/paging.cc cpu/proc_ctrl.cc cpu/protect_ctrl.cc cpu/segment_ctrl_pro.cc cpu/shift16.cc cpu/shift32.cc cpu/stack64.cc cpu/string.cc cpu/tasking.cc debug/Makefile.in debug/dbg_main.cc disasm/Makefile.in doc/docbook/user/user.dbk dynamic/Makefile.in fpu/Makefile.in gui/Makefile.in gui/amigaos.cc gui/beos.cc gui/carbon.cc gui/control.cc gui/control.h gui/gui.cc gui/gui.h gui/keymap.cc gui/keymap.h gui/macintosh.cc gui/nogui.cc gui/rfb.cc gui/sdl.cc gui/sdlkeys.h gui/siminterface.cc gui/siminterface.h gui/term.cc gui/win32.cc gui/wx.cc gui/wxdialog.cc gui/wxdialog.h gui/wxmain.cc gui/wxmain.h gui/x.cc gui/keymaps/sdl-pc-de.map gui/keymaps/sdl-pc-us.map gui/keymaps/x11-pc-de.map instrument/example0/instrument.h instrument/example1/instrument.h instrument/stubs/instrument.cc 
instrument/stubs/instrument.h iodev/Makefile.in iodev/biosdev.cc iodev/biosdev.h iodev/cdrom.cc iodev/cmos.cc iodev/cmos.h iodev/devices.cc iodev/dma.cc iodev/dma.h iodev/eth_fbsd.cc iodev/eth_linux.cc iodev/eth_null.cc iodev/eth_tap.cc iodev/floppy.cc iodev/floppy.h iodev/guest2host.cc iodev/guest2host.h iodev/harddrv.cc iodev/harddrv.h iodev/iodebug.cc iodev/iodebug.h iodev/iodev.h iodev/keyboard.cc iodev/keyboard.h iodev/ne2k.cc iodev/ne2k.h iodev/parallel.cc iodev/parallel.h iodev/pci.cc iodev/pci.h iodev/pci2isa.cc iodev/pci2isa.h iodev/pic.cc iodev/pic.h iodev/pit.cc iodev/pit.h iodev/pit_wrap.cc iodev/pit_wrap.h iodev/sb16.cc iodev/sb16.h iodev/scancodes.cc iodev/scancodes.h iodev/serial.cc iodev/serial.h iodev/slowdown_timer.cc iodev/slowdown_timer.h iodev/unmapped.cc iodev/unmapped.h iodev/vga.cc iodev/vga.h memory/Makefile.in memory/memory.cc memory/memory.h memory/misc_mem.cc misc/bximage.c misc/niclist.c Added Files: README-plugins extplugin.h ltdl.c ltdl.h ltdlconf.h.in ltmain.sh plugin.cc plugin.h	2002-10-24 21:07:56 +00:00
Stanislav Shwartsman	c5f0ef8c76	Removed duplicated definition of BX_SEG_REGS	2002-10-16 22:10:07 +00:00
Stanislav Shwartsman	194952a53d	Merged BOCHS-SSE branch	2002-10-16 17:37:35 +00:00
Bryce Denney	c07f5836f3	- move definition of bx_address earlier, just after the Bit32u and Bit64u types are defined. This should ensure that bx_address is defined by the time it's needed.	2002-10-13 22:38:17 +00:00
Kevin Lawton	3183ab7102	Added some preliminary configure and config.h stuff for SSE/SSE2 for Stanislav. Also, some method prototypes and skeletal functions in access.cc for read/write double quadword features. Also cleaned up one warning in protect_ctrl.cc for non-64 bit compiles. There was an unused variable, only used for 64-bit.	2002-10-11 01:11:11 +00:00
Peter Tattam	b968c4e5c8	Latest round of patches/fixups to get 64 bit emulation further. This is an interim update to allow others to test. We have userland code running!!! (up to a point) Able to start executing "sash" as /sbin/init in userland from linux 64 bit kernel until it crashes trying to access a null pointer. No kernel panics though, just a segfault loop.	2002-10-08 14:43:18 +00:00
Kevin Lawton	b8d7f5c88e	Moved the asm() statements from the arithmetic instruction emulation into inline functions with asm() statements in cpu.h. This cleans up the *.cc code (which now doesn't have any asm()s in it), and centralizes the asm() code so constraints can be modified in one place. This also makes it easier to cover more instructions with asm()s for more efficient eflags handling.	2002-10-07 22:51:58 +00:00
Kevin Lawton	d7a16521c4	Replaced "return;" statements associated with bx_guard.special_unwind_stack hack with longjmp() back to cpu.cc main decode loop, and added a check in there to return control when bx_guard.special_unwind_stack is set (compiling with debugger enabled only). If in the debugger you try to execute further instructions (which you shouldn't), other fields need to be reset I would think, such as EXT and errorno, and have to make sure ESP/EIP are corrected properly. Basically, this hack is only good for examining the current situation of a nasty fault.	2002-10-06 22:08:18 +00:00
Kevin Lawton	b1d2f7ae48	Added a return value to handleAsyncEvent() so that requests to exit out of cpu_loop() and back to the caller can be honored. Previously, code in this function was a part of cpu_loop so a "return;" would already do that. Now, a value is passed back to cpu_loop() to denote such a request, and then a return is executed from cpu_loop(). I haven't tested this yet, but previously I must have broken certain debugging requests by moving the code to a separate function and not fixing the "return;" statements.	2002-10-05 14:51:25 +00:00
Peter Tattam	db0a37824c	Fixed elusive APIC interrupt problems when bochs compiled for P6 or later. Symptom: Linux kernel 2.4.19 would hang in random places. CPU still running, but in idle loop. Cause: if APIC interrupt occurred while a PIC interrupt was pending, the PIC interrupt would be lost. This is because either an APIC or PIC interrupt would trash any pending interrupt event because INTR is only a state, not an event queue. Temporary fix: reworked apic.cc to have its own copy of INTR state. cpu.cc now checks for both cpu.INTR and local_apic.INTR. Need to do further research to see if local_apic and pic can be integrated in such a way as to properly manage the combined effects of both devices accessing INTR state.	2002-10-05 10:25:31 +00:00
Kevin Lawton	0d22bbafc2	Added a new function writeEFlags() which takes a 32-bit eflags value and a change-mask, rather than passing all the boolean change flags as arguments. Recoded the POPF instruction in flag_ctrl.cc to use the new writeEFlags() function, and to make it more sane. Also, the old write_flags() and write_eflags() functions redirect to writeEFlags() for now. Later, when we get back in a development mode, it would be better to make all calls use the new function and get rid of the old ones.	2002-10-05 06:33:10 +00:00
Bryce Denney	d754550d47	- a boolean variable is represented by just 1 bit, 0=false or 1=true. We have been using the Boolean type for a number of multi-bit fields on the assumption that it is actually many bits wide. However, this assumption is unsafe and has caused some bugs that are hard to track down. - in the Carbon library on MacOS X, Boolean is defined to be an unsigned char. This has been causing some of the EFLAGS accessors to fail (bits 8-31) because they depended on Boolean being 32 bits wide. I changed these accessors to return Bit32u instead. I believe that this will finally fix [ 618388 ] Unable to boot under MacOS X. - It would be possible to create a bochs specific type for booleans (bx_bool), but it's cleaner to simply use "Boolean" when we actually mean a 1-bit true or false field, and Bit8u/Bit32u when it is a multibit field.	2002-10-04 22:25:22 +00:00
Kevin Lawton	66452e9898	Replaced tabs in cpu/*.{cc,h} files with spaces.	2002-10-04 17:04:33 +00:00
Kevin Lawton	f64eb0e16a	Changed the hot countdown timer in pc_system.* files to be 32-bits rather than 64. This is possible, because there is always an active null (heartbeat) timer, with periodicity of less than or equal to the maximum 32-bit int value. This generates a little less code in the hot part of cpu_loop, and saved about 3% execution time on a Win95 boot. Moved the asynchronous handling code from cpu_loop() to its own function since it's a long path. This neatened up the code a little (less gotos and all), and made it more clear to use a "while (1)" around the iterative code in cpu_loop().	2002-10-04 16:26:10 +00:00
Kevin Lawton	ee47fabac0	Committed new bochs internal timers (in pc_system.{cc,h}). These seem to be working better, are a more simple design, easier to understand, and AFAIK don't have race conditions in them like the old ones do. Re-coded the apic timer, to return cycle accurate values which vary with each iteration of a read from a guest OS. The previous implementation had very poor resolution. It also didn't check the mask bit to see if an apic timer interrupt should occur on countdown to 0. The apic timer now calls its own bochs timer, rather than tag on the one in iodev/devices.cc. I needed to use one new function which is an inline in pc_system.h. That would have to be added to the old pc_system.h if we have to back-out to it. Linux/x86-64 now boots until it hits two undefined opcodes: FXRSTOR (0f ae). This restores FPU, MMX, XMM and MXCSR registers from a 512-byte region of memory. We don't implement this yet. MOVNTDQ (66 0f e7). This is a move involving an XMM register. The 0x66 prefix is used so it's a double quadword, rather than MOVNTQ (0f e7) which operates on a single quadword. The Linux kernel panic is on the MOVNTDQ opcodes. Perhaps that's because that opcode is used in exception handling of the 1st? Looks like we need to implement some new instructions.	2002-10-03 15:47:13 +00:00
Bryce Denney	0d28420aa2	- provide dbg_xlate_linear2phy when running as GDB stub	2002-10-03 04:53:53 +00:00
Bryce Denney	be4005269b	- many parameters in cpu were being redefined if you stop simulation and restart another one in wxWindows. Fixed that. Also, on restart, the apic id's left over from the first run were causing panics. Fixed that. - modified: main.cc cpu/apic.cc cpu/cpu.h cpu/init.cc	2002-09-30 22:18:53 +00:00
Kevin Lawton	67721c48f4	The convenience functions protected_mode(), v8086_mode() and real_mode() now simply return a cached value which is set upon mode changes. The biggest problem was protected_mode() which did something like: return CR0.PM && ! EFLAGS.VM This adds up when it was being executed many times in branch functions etc. Now, cached values are set and sampled instead.	2002-09-29 22:38:18 +00:00
Kevin Lawton	a5537449cd	Split out reg-reg and reg-memory cases for a few other high-profile instructions, mainly variants of MOV. Had to update fetchdecode64 to keep it inline with the 32-bit mods.	2002-09-29 19:21:38 +00:00
Stanislav Shwartsman	d495bd75a6	After integration of the SplitMod11b changes Bochs failed to compile in SMP mode. I fixed the compilation errors in CVS; somebody please check whether the fix is proper.	2002-09-28 09:38:58 +00:00
Kevin Lawton	08a89fe7b6	Performance mod: I implemented a suggestion from Peter Tattam and Jas Sandys-Lumsdaine to split out common instructions into variants which deal with the mod=11b case (Reg-Reg) and the other cases (which do memory ops). Actually, I only split MOV_GwEw and MOV_GdEd for now. According to some instrumentation of a Win95 boot, they were the most frequently used opcode by far.	2002-09-28 05:38:11 +00:00
Kevin Lawton	13a1e55f20	Committed patches/patch-bochs-instrumentation from Stanislav. Some things changed in the ctrl_xfer.cc, fetchdecode.cc, and cpu.cc since the original patches, so I did some patch integration by hand. Check the placement of the macros BX_INSTR_FETCH_DECODE_COMPLETED() and BX_INSTR_OPCODE() in cpu.cc to make sure I got them right. Also, I changed the parameters to BX_INSTR_OPCODE() to update them to the new code. I put some comments before each of these to help determine if the placement is right. These macros are only compiled in if you are gathering instrumentation data from bochs, so they shouldn't affect others.	2002-09-28 00:54:05 +00:00
Stanislav Shwartsman	e6adebfe2d	Added MMX opcodes to x86-64 mode Fixed problem with fetching extra byte in ESCx opcodes if FPU is disabled	2002-09-27 09:56:40 +00:00
Kevin Lawton	47f2e7c404	Got rid of the KPL64Hacks macro. The fixes below eliminated it. Created 64-bit versions of some branch instructions and changed fetchdecode64.cc to use them instead. This keeps the #ifdef pollution down for 32-bit code and made fixing them easier. They needed to clear the upper bits of RIP for 16-bit operand sizes. They also should not have had a protection limit check in them, especially since that field is still 32-bit in cpu.h, so there's no way to set nominal 64-bit values. The 32-bit versions were also not honoring the upper 32-bits of RIP. LOOPNE64_Jb LOOPE64_Jb LOOP64_Jb JCXZ64_Jb Changed all occurrences of JCC_Jw/JCC_Jd in fetchdecode64.cc to use JCC_Jq, which was coded already. Both JMP_Jq and JCC_Jq are now fixed w.r.t. 16-bit opsizes and upper RIP bit clearing.	2002-09-27 07:01:02 +00:00
Kevin Lawton	6d74a334d6	64-bit bug#1: Instructions such as MOV_ALOq were always fetching 64-bit address opcode info, which was incorrect. Fixed. Got rid of BxImmediate_Oq. fetchdecode64.cc now uses BxImmediateO, like the fetch routine does. Addresses which are embedded in the opcode, have a size which depends on the current addressing size. For long-mode, this is either 64 (default) or 32 (AddrSize over-ride). BxImmediate_O now conditionally fetches based on AddrSize. 64-bit bug#2: In JMP_Jq(), when the current operand size is 16-bits, the upper dword of RIP was not being cleared. The semantics with this case are weird - one would think the top 48 bits would be cleared, but apparently only the top 32 bits are. Anyways, I fixed this. Replaced some of the messy immediate fetching (byte-by-byte) in fetchdecode64.cc with ReadHost{Q,D}WordFromLittleEndian() calls for cleanliness. Should do this for all the cases, plus the 32-bit stuff.	2002-09-26 21:32:26 +00:00
Peter Tattam	67082a5b50	Implemented SWAPGS instruction. Note that it is unusual to decode (see SGDT instruction)	2002-09-25 14:09:08 +00:00
Bryce Denney	8f9bec3919	- remove unused, and incorrect MSR fields	2002-09-25 13:26:04 +00:00
Peter Tattam	a0d90e9b39	Implemented SYSCALL and SYSRET as part of x86-64 emulation. Since the SYSCALL replaces the LOADALL instruction, it is incompatible with earlier CPU types. At moment, the SYSCALL is only enabled by x86-64 emulation, but the code can be incorporated in IA32 only emulations. Instructions added: 0F 05 SYSCALL (replaces LOADALL) 0F 07 SYSRET (new) TODO: restructure #if ... so that it can be used by non x86-64 emulations.	2002-09-25 12:54:41 +00:00
Kevin Lawton	3c09fdb363	I updated code that was using !!get_CF() (or other arithmetic flag) to use getB_CF() etc. getB_CF() and friends are only for a relatively small number of cases where a true boolean/binary number (0 or 1) is required rather than 0 or non-0 as is returned by get_CF().	2002-09-24 18:33:38 +00:00
Kevin Lawton	aeca26fc04	Declaration of loadSRegLMNominal() is now only defined for 64-bit.	2002-09-24 16:39:33 +00:00
Kevin Lawton	26ebda0775	Got rid of INIT_64_DESCRIPTOR in all places. Added/replaced it with loadSRegLMNominal() which should be used to load a segment register in long-mode with nominal values which are compatible with existing checks and expectations for descriptor cache values. Fixed 64-bit iret to not do a descriptor fetch if SS selector is null. Also load SS with loadSRegLMNominal() in the same case.	2002-09-24 16:35:44 +00:00
Bryce Denney	de0e58c2c5	These changes are from Peter Tattam - fix load_ss, remove load_ss_null - change the "#if KPL64Hacks" around msr stuff into "#if BX_IGNORE_BAD_MSR" - remove "#if KPL64Hacks" from BX_CPU_C::can_push - segment_ctrl_pro.cc: bug fix to ss == null handling in 64 bit mode Modified: cpu/cpu.h cpu/ctrl_xfer_pro.cc cpu/exception.cc cpu/proc_ctrl.cc cpu/segment_ctrl_pro.cc cpu/stack_pro.cc	2002-09-24 08:29:06 +00:00
Kevin Lawton	281e62d8b1	I integrated my hacks to get Linux/x86-64 booting. To keep these from interfering with a normal compile here's what I did. In config.h.in (which will generate config.h after a configure), I added a #define called KPL64Hacks: #define KPL64Hacks After running configure, you must set this by hand. It will default to off, so you won't get my hacks in a normal compile. This will go away soon. There is also a macro just after that called BailBigRSP(). You don't need to enable that, but you can. In many of the instructions which seemed like they could be hit by the fetchdecode64() process, but which also touched EIP/ESP, I inserted a macro. Usually this macro expands to nothing. If you like, you can enable it, and it will panic if it finds the upper bits of RIP/RSP set. This helped me find bugs. Also, I cleaned up the emulation in ctrl_xfer{8,16,32}.cc. There were some really old legacy code snippets which directly accessed operands on the stack with access_linear. Lots of ugly code instead of just pop_32() etc. Cleaning those up minimized the number of instructions which directly manipulate the stack pointer, which should help in refining 64-bit support.	2002-09-24 00:44:56 +00:00
Kevin Lawton	6e7a2e91f2	Added more x86 specific asm() code to directly handle eflags return values for some common instructions (like test/and/cmp). Only compiles in on x86 of course.	2002-09-22 22:22:16 +00:00
Bryce Denney	a785453194	- fixed another case of get_##flag##(void)	2002-09-22 19:06:46 +00:00
Bryce Denney	fda29cd55b	- in definition of ArithmeticalFlag, we had "getB_##flag##(void)", which says to paste getB_ with flag and then paste with (. It should be "getB_##flag(void)". Some preprocessors are complaining about pasting the symbol with the paren.	2002-09-22 19:03:24 +00:00
Kevin Lawton	b742ccec7e	Changed eflags accessors for get_?F() to use (val32 & (1<<N)) instead of (1 & (val32>>N)), and added a getB_?F() accessor for special cases which need a strict binary value (exactly 0 or 1). Most code only needed a value for logical comparison. I modified the special cases which do need a binary number for shifting and comparison between flags, to use the special getB_?F() accessor. Cleaned up memory.cc functions a little, now that all accesses are within a single page. Fixed a (not very likely encountered) bug in fetchdecode.cc (and fetchdecode64.cc) where a 2-byte opcode starting with a prefix starts at the last offset on a page. There were no checks on the segment overrides for a boundary condition. I added them. The eflags enhancements added just a tiny bit of performance.	2002-09-22 18:22:24 +00:00
Bryce Denney	5933d94c91	- add typecast to Bit32u to avoid lots of useless -Wall warnings. The constant expression on the right side of the comparison was turning out signed, while the expression on the left was unsigned.	2002-09-22 02:53:09 +00:00
Kevin Lawton	3bfeab23c9	Split out JZ/JNZ instructions from JCC because they were called so frequently. Coded asm() statements for INC/DEC_ERX() instructions. Cleaned up the iCache a little including a bug fix. The generation ID was decrementing the whole field including some high meta bits. That could roll over after 1 Billion cycles. I now only decrement if the field is valid, to save the write. I implemented inline functions which can serve the value of the arithmetic flags if they are cached, and redirect to the lazy_flags.cc routines if not. Most of this was just prep work for adding more asm() statements for native eflags processing when on x86.	2002-09-22 01:52:21 +00:00
Kevin Lawton	e2e219eda0	Modified the way that the register field (low 3 bits of a few opcodes also extended by the REX.B field on Hammer) is passed to instructions. I rearranged the bxInstruction_c to free up a field to be used to pass this info when mod-rm bytes are not used. This got rid of the ugly ((i->b1 & 7) + i->rex_b) code. Probably shaved just a very little run time off Hammer emulation, and even less on x86-32. The resultant is a little cleaner anyways.	2002-09-20 23:17:51 +00:00
Kevin Lawton	402d02974d	Moved the EFLAGS.RF check and clearing of inhibit_mask code in cpu.cc out of the main loop, and into the asynchronous events handling. I went through all the code paths, and there doesn't seem to be any reason for that code to be in the hot loop. Added another accessor for getting instruction data, called modC0(). A lot of instructions test whether the mod field of mod-nnn-rm is 0xc0 or not, ie., it's a register operation and not memory. So I flag this in fetchdecode{,64}.cc. This added on the order of 1% performance improvement for a Win95 boot. Macroized a few leftover calls to Write_RMV_virtual_xyz() that didn't get modified in the x86-64 merge. Really, they just call the real function for now, but I want to have them available to do direct writes with the guest2host TLB pointers.	2002-09-20 03:52:59 +00:00
Gregory Alexander	88e64f9521	Fix big endian compile problem.	2002-09-20 03:06:39 +00:00
Kevin Lawton	0cd7346b9c	- Added an instruction cache. Size is fixed for the moment, but if you hand edit cpu/cpu.h, and change BxICacheEntries, you can try different sizes. I'll make this more flexible with configure. For now, use "--enable-icache" with no parameters. - Modified fetchdecode.cc/fetchdecode64.cc just enough so that instructions which encode a direct address now use a memory resolution function which just sticks the immediate address into rm_addr. With cached instructions we need this.	2002-09-19 19:17:20 +00:00
Kevin Lawton	4e51dcae40	Converted all the remaining available separate fields in bxInstruction_c to bitfields. bxInstruction_c is now 24 bytes, including 4 for the memory addr resolution function pointer, and 4 for the execution function pointer (16 + 4 + 4). Coded more accessors, to abstract access from most code.	2002-09-18 08:00:43 +00:00
Kevin Lawton	6723ca9bf4	Moved more separate fields in the bxInstruction_c into bitfields with accessors. Had to touch a number of files to update the access using the new accessors. Moved rm_addr to the CPU structure, to slim down bxInstruction_c and to prevent future instruction caching from getting sprayed with writes to individual rm_addr fields. There only needs to be one. Though need to deal with instructions which have static non-modrm addresses, but which are using rm_addr since that will change. bxInstruction_c is down to about 40 bytes now. Trying to get down to 24 bytes.	2002-09-18 05:36:48 +00:00
Kevin Lawton	07b0df2a8a	Updated accessing of modrm/sib addressing information to use accessors. This lets me work on compressing the size of fetch-decode structure (now called bxInstruction_c). I've reduced it down to about 76 bytes. We should be able to do much better soon. I needed the abstraction of the accessors, so I have a lot of freedom to re-arrange things without making massive future changes. Lost a few percent of performance in these mods, but my main focus was to get the abstraction.	2002-09-17 22:50:53 +00:00
Bryce Denney	f1a3e0307a	- add #if BX_CPU_LEVEL>=4 around cr0.wp and cr4 so that i386 will compile	2002-09-17 22:14:33 +00:00
Stanislav Shwartsman	a43bd93b98	just little clean of the code	2002-09-17 14:36:39 +00:00
Kevin Lawton	3d4210fd3f	Got rid of a couple fields in BxInstruction_t that were no longer used. Also rearranged that struct a little to be more compressed. Over time, I'm going to reduce it further, for use with future accelerations.	2002-09-17 04:20:42 +00:00
Kevin Lawton	5eb4e247bc	Merged the final file ("paging.cc") from Peter Tattam's x86-64 enhancement to bochs. You can now configure with --enable-guest2host-tlb. Force the support of big pages (PSE) when x86-64 is configured. Reverted back to only one kind of TLB entry style, since everything is ported. Fixed one bug in io.cc with as_64 and the index registers. There are others, as noticed by Peter.	2002-09-16 20:23:38 +00:00
Bryce Denney	6a50742b20	- clean up ^M pollution from working in cygwin	2002-09-16 17:00:16 +00:00
Bryce Denney	42f412c43b	- MS VC++ did not accept initialization of static const fields in the class declaration, for example: static const unsigned os_64=0, as_64=0; After reading some suggestions on usenet, I changed these into enums instead, like this: enum { os_64=0, as_64=0 };	2002-09-16 15:21:51 +00:00
Kevin Lawton	278e27d5fe	Merged proc_ctrl.cc. Also fixed a bug in CR4 reloading; we were printing a message when a reserved bit was set, but not causing a #GP(0). As well, I force a new PAE support option to 1 when Hammer support is enabled.	2002-09-14 23:17:55 +00:00
Kevin Lawton	93d05990cc	Updated CR4 to use the patented Bryce bitfields accessor method for both cpu32 and cpu64, to make upcoming merging easier, and the code cleaner. Compiled for debug as well, and fixed CR4 for that also.	2002-09-14 19:21:41 +00:00
Kevin Lawton	6d4b3e0e4d	(cpu64) Merged 4 more files.	2002-09-14 17:29:47 +00:00
Kevin Lawton	e5dc75091b	(cpu64) Merged protect_ctrl.cc. For cpu64 there is a cpu field called cpu_mode. Now there is one for cpu32, but it is declared: static const unsigned cpu_mode=BX_MODE_IA32; This way the compiler can compile-out if-then-else clauses based on it, allowing for easier code sharing.	2002-09-13 21:08:54 +00:00
Bryce Denney	7ff21b5f30	- the implementation of accessors should not use BX_CPU_C_PREFIX. When static member functions are turned on, BX_CPU_C_PREFIX expands to nothing, and any method that uses BX_CPU_C_PREFIX instead of explicitly writing "BX_CPU_C::" will not be a member function at all. This makes it impossible for code outside the BX_CPU_C object to call the accessor because sometimes the method is at ptr_to_cpu->get_EIP() and other times you'd have to do just get_EIP(). The only way I've found to solve this is to remove the BX_CPU_C_PREFIX and write BX_CPU_C:: instead. - in debug/dbg_main.cc I removed the EBP, EIP, ESP, SP shortcuts. Now the accessors are used everywhere. Also I replaced a reference to the short-lived get_erx() accessor with ones that work: get_EAX(), etc. - with these changes the current cvs compiles with any combination of debugger enabled/disabled, SMP enabled/disabled, and x86-64 enabled/disabled.	2002-09-13 18:15:20 +00:00
Kevin Lawton	ac7ca2b035	Changed cpu64 calls to macros: BX_READ_8BIT_REG() --> BX_READ_8BIT_REGx() BX_WRITE_8BIT_REG() --> BX_WRITE_8BIT_REGx() They use an extra parameter "extended". I coded this as the macro without the "x" for cpu32 compiles. This allows for ease of merging and code sharing.	2002-09-13 17:04:14 +00:00
Kevin Lawton	bbb20f5d49	Got rid of get_bit{1,3,5,15} accessors to EFLAGS. They were only used by the debug functions, and those can get the entire eflags value in one shot now.	2002-09-13 05:03:37 +00:00
Kevin Lawton	b9d3791aa5	Integrated Stanislav's general register accessors, which model Bryce's eflags accessors.	2002-09-13 01:09:10 +00:00
Kevin Lawton	6655634179	I merged the cpu/cpu.h and cpu64/cpu.h files as well as the other header files. There no longer are any .h files in cpu64/. Had to make some changes to the .cc files for dealing with accesses to eip.	2002-09-13 00:15:23 +00:00
Christophe Bothamy	ea76dbe210	- fixed compile problem with gcc 2.95	2002-09-12 20:51:48 +00:00
Stanislav Shwartsman	647c1676e9	Added general registers accessors (like for EFLAGS)	2002-09-12 20:00:24 +00:00
Bryce Denney	5d9fa0844e	- rename "_long" to "dword" in eip structure in cpu64. - add get_erx() method to bx_gen_reg_t which returns the erx field of the structure (which has a different name in cpu and cpu64). Providing an accessor is one strategy for avoiding ugly "#ifdef BX_SUPPORT_X86_64" statements in the rest of the code. - cpu64/init.cc: the "eflags" before get_flag and set_flag is no longer correct. removed. - modified files: load32bitOShack.cc logio.cc cpu/cpu.h cpu64/apic.cc cpu64/cpu.h cpu64/init.cc cpu64/proc_ctrl.cc debug/dbg_main.cc	2002-09-12 18:52:14 +00:00
Bryce Denney	5fc31bcfda	- this revision changes the way eflags are accessed throughout the cpu and cpu64 directories. Instead of using the macros introduced in cpu.h rev 1.37 such as GetEFlagsDFLogical and SetEFlagsDF and ClearEFlagsDF, I made inline methods on the BX_CPU_C object that access the eflags fields. The problem with the macros is that they cannot be used outside the BX_CPU_C object. The macros have now been removed, and all references to eflags now use these new accessors. - I debated whether to put the accessors as members of the BX_CPU_C object or members of the bx_flags_reg_t struct. I chose to make them members of BX_CPU_C for two reasons: 1. the lazy flags are implemented as members of BX_CPU_C, and 2. the eflags are referenced in many many places and it is more compact without having to put eflags in front of each. (The real problem with compactness is having to write BX_CPU_THIS_PTR in front of everything, but that's another story.) - Kevin pointed out a major bug in my set accessor code. What a difference a little tilde can make! That is fixed now. - modified: load32bitOShack.cc debug/dbg_main.cc and in both cpu and cpu64 directories: cpu.cc cpu.h ctrl_xfer_pro.cc debugstuff.cc exception.cc flag_ctrl.cc flag_ctrl_pro.cc init.cc io.cc io_pro.cc proc_ctrl.cc soft_int.cc string.cc vm8086.cc	2002-09-12 18:10:46 +00:00
Bryce Denney	22eb32934a	- declare class BX_CPU_C early before its first use	2002-09-12 17:06:40 +00:00
Bryce Denney	450070850b	- the debugger was broken by recent changes in the cpu flags. To provide a consistent way of accessing these flags that works both inside and outside the BX_CPU class, I added inline accessor methods for each flag: assert_FLAG(), clear_FLAG(), set_FLAG(value), and get_FLAG() that returns its value. I use assert to mean "set the value to one" to avoid confusion, since there's also a set method that takes a value. - the eflags access macros (e.g. GetEFlagsDFLogical, ClearEFlagsTF) are now defined in terms of the inline accessors. In most cases it will result in the same code anyway. The major advantage of the accessors is that they can be used from inside or outside the BX_CPU object, while the macros can only be used from inside. - since almost all eflags were stored in val32 now, I went ahead and removed the if_, rf, and vm fields. Now the val32 bit is the "official" value for these flags, and they have accessors just like everything else. - init.cc: move the registration of registers until after they have been initialized so that the initial value of each parameter is correct. Modified files: debug/dbg_main.cc cpu/cpu.h cpu/debugstuff.cc cpu/flag_ctrl.cc cpu/flag_ctrl_pro.cc cpu/init.cc	2002-09-11 03:55:22 +00:00
Kevin Lawton	425ad824c0	I changed the TLB entry from 3 dwords to 4, and (when you compile with GCC) align them with the GCC special alignment attribute. Since there was then one available field, I split the protection attributes and native host pointers into their own fields. Before, with 3 dwords per TLB entry, some entries (about 3/8) were spanning two processor cache lines (assuming a 32-byte cache line). Now, they all fit within one cache line. Knocked about 1.4% off Win95 boot time, probably more off normal software runs.	2002-09-10 00:01:01 +00:00
Bryce Denney	be659a09b3	- check in Stanislav Shwartsman's patch "bochs-mmx.patch-endian-support". He writes: Detailed description: MMX instruction set support. Also supports BIG_ENDIAN systems. Tested on Solaris and HP1100. - modified files: configure.in cpu/Makefile.in cpu/cpu.h cpu/fetchdecode.cc cpu/proc_ctrl.cc fpu/fpu_system.h fpu/wmFPUemu_glue.cc - added files: cpu/i387.h cpu/mmx.cc	2002-09-09 16:11:25 +00:00
Kevin Lawton	0d7a5fdf3c	I rehashed the way the EFLAGS register was stored internally. All the EFLAGS bits used to be cached in separate fields. I left a few of them in separate fields for now - might remove them at some point also. When the arithmetic fields are known (ie they're not in lazy mode), they are all cached in a 32-bit EFLAGS image, just like the x86 EFLAGS register expects. All other eflags are store in the 32-bit register also, with a few also mirrored in separate fields for now. The reason I did this, was so that on x86 hosts, asm() statements can be #ifdef'd in to do the calculation and get the native eflags results very cheaply. Just to test that it works, I coded ADD_EdId() and ADD_EwIw() with some conditionally compiled asm()s for accelerated eflags processing and it works. -Kevin	2002-09-08 04:08:14 +00:00
Kevin Lawton	51c93e12a1	The paging unit gets notified of all CR0/CR3/CR4 updates so it can decide how to proceed. Some of those bits are necessary to make TLB invalidation decisions. INVLPG doesn't cause a whole TLB flush anymore, just one page. Some of the current CPU behaviours model the P6, especially on CR0 reloads. Earlier processors kept some pre-change pre-fetched instructions until a branch. We could probably model that by setting a flag, and letting the revalidate_prefetch_q function cause serialization. The TLB flush code only invalidates entries which are not already invalidated for the case where the TLB invalidation ID trick is not in use.	2002-09-07 05:21:28 +00:00
Kevin Lawton	491035fcb2	I extended the guest-to-host TLB acceleration across the Read-Modify-Write instructions. The first read phase stores the host pointer in the "pages" field if a direct use pointer is available. The Write phase first checks if a pointer was issued and uses it for a direct write if available. I chose the "pages" field since it needs to be checked by the write_RMW_virtual variants anyways and thus needs to be cached anyways. Mostly the mods were to access.cc, but I did also macro-ize the calls to write_RMW_virtual...() in files which use it and cpu.h. Right now, the macro is just a straight pass-through. I tried expanding it to a quick initial check for the pointer availability to do the write in-place, with a function call as a fall-back. That didn't seem to matter at all. Booting is not helped by this really. The upper bound of the gain is 5 or 6%, and that's only if you have a loop that looks like: label: add [eax], ebx ;; mega read-modify-write instruction jmp label ;; intensive loop.	2002-09-06 21:54:58 +00:00
Gregory Alexander	4f6039f533	Macroize BX_TLB_QUICK_INVALIDATE code. Kevin Lawton says he doesn't get a performance benefit. I'm not sure if I do. Either way, the difference isn't very large. This code may get removed if it turns out to be useless.	2002-09-06 19:21:55 +00:00
Gregory Alexander	1c3ae99300	Speed-up for TLB invalidates as proposed by Peter Tattam. I had been planning on this same thing in a similar form for the I$, so this made a lot of sense, and was easy to implement.	2002-09-06 14:58:56 +00:00
Bryce Denney	85b3dfe60f	- fix minor problems with static member function declarations: - bx_gen_reg cannot be declared with BX_SMF or it can't read gen_reg when static member functions are turned on. - use "BX_CPU_C_PREFIX" instead of "BX_CPU_C::" for get_segment_base. - the SMF (static member function) tricks are just plain weird. The only way to really be sure that you're not breaking something is to try compiling it with SMF on and with SMF off. e.g. "configure && make" and "configure --enable-processors=2 && make".	2002-09-05 20:16:40 +00:00
Stanislav Shwartsman	2d2651a0f3	Added some useful debug/information methods for BX_CPU class	2002-09-05 19:46:20 +00:00
Stanislav Shwartsman	611d983900	Added get_REGISTER functions for all registers	2002-09-05 19:12:02 +00:00
Kevin Lawton	f0c9896964	Now, when you compile with --enable-guest2host-tlb, non-paged mode uses the notion of the guest-to-host TLB. This has the benefit of allowing more uniform and streamlined acceleration code in access.cc which does not have to check if CR0.PG is set, eliminating a few instructions per guest access. Shaved just a little off execution time, as expected. Also, access_linear now breaks accesses which span two pages, into two calls to the physical memory routines, when paging is off, just like it always has for paging on. Besides being more uniform, this allows the physical memory access routines to know the complete data item is contained within a single physical page, and stop reapplying the A20ADDR() macro to pointers as it increments them. Perhaps things can be optimized a little more now there too... I renamed the routines to {read,write}PhysicalPage() as a reminder that these routines now operate on data solely within one page. I also added a little code so that the paging module is notified when the A20 line is tweaked, so it can dump whatever mappings it wants to.	2002-09-05 02:31:24 +00:00
Kevin Lawton	8a1baa6bb8	Added ::{read,write}_virtual_qword() functions as per Stanislav's request. I have not tested these functions, but they model the format and acceleration principles of the byte/word/dword functions. Give them a try on both little/big endian machines.	2002-09-04 20:23:54 +00:00
Kevin Lawton	d07c1c0bb0	I rehashed the way the paging code stores protection bits, so that a compare of the current access could be done more efficiently against the cached values, both in the normal paging routines, and in the accelerated code in access.cc. This cut down the amount of code path needed to get to direct use of a host address nicely, and speed definitely got a boost as a result, especially if you use the --enable-guest2host-tlb option. The CR0.WP flag was a real pain, because it imparts a complication on the way protections work. Fortunately it's not a high-change flag, so I just base the new cached info on the current CR0.WP value, and dump the TLB cache when it changes.	2002-09-04 08:59:13 +00:00
Kevin Lawton	3f2d28f86c	Added guest2host TLB tricks to read-modify-write variants of access routines in access.cc, completing the upgrade of those routines. You do need '--enable-guest2host-tlb', before you get the speedups for now. The guest2host mods seem pretty solid, though I do need to see what effects the A20 line has on this cache and the paging TLB in general.	2002-09-03 04:54:28 +00:00
Kevin Lawton	746f09b427	There's a bug in the repeated IO & mem copy speedups. I added --enable-repeat-speedups with default to disabled. Reconfigure/recompile and the speedup code will be #ifdef'd out for now. It manifested as junk written to the VGA screen while booting/running Windows. Also made some more mods to the main cpu loop. Moved the handling of EXT/errorno outside the main loop, much like the extra EIP/ESP commits were moved, for a little better performance. I changed the fetch_ptr/bytesleft method of fetching to a slightly different model, which calculates a window for which EIP will be valid (land on the current page), and a bias which when applied to EIP will be from 0..upper_page_limit. Speed is about the same for either method, but a pseudo-op/threaded-interpreter will plug in better with this and be faster.	2002-09-02 18:44:35 +00:00
Kevin Lawton	3d8e5f8b61	Removed the BX_FETCHDECODE_CACHE mods, and the patch that Bryce created for use of ensuring all mods were removed cleanly.	2002-09-01 23:02:36 +00:00
Kevin Lawton	3a5f338419	Integrated patches for: - Paging code rehash. You must now use --enable-4meg-pages to use 4Meg pages, with the default of disabled, since we don't well support 4Meg pages yet. Paging table walks model a real CPU more closely now, and I fixed some bugs in the old logic. - Segment check redundancy elimination. After a segment is loaded, reads and writes are marked when a segment type check succeeds, and they are skipped thereafter, when possible. - Repeated IO and memory string copy acceleration. Only some variants of instructions are available on all platforms, word and dword variants only on x86 for the moment due to alignment and endian issues. This is compiled in currently with no option - I should add a configure option. - Added a guest linear address to host TLB. Actually, I just stick the host address (mem.vector[addr] address) in the upper 29 bits of the field 'combined_access' since they are unused. Convenient for now. I'm only storing page frame addresses. This was the simplest form of such a TLB. We can likely enhance this. Also, I only accelerated the normal read/write routines in access.cc. Could also modify the read-modify-write versions too. You must use --enable-guest2host-tlb, to try this out. Currently speeds up Win95 boot time by about 3.5% for me. More ground to cover... - Minor mods to CPUI/MOV_CdRd for CMOV. - Integrated enhancements from Volker to getHostMemAddr() for PCI being enabled.	2002-09-01 20:12:09 +00:00
Gregory Alexander	1be5b1d46c	Added a linked list to further speed up icache invalidates. These should be pretty snappy now. It's time to generate some actual statistics. Modified Files: cpu/cpu.cc cpu/cpu.h cpu/init.cc memory/memory.cc	2002-06-05 21:51:30 +00:00
Gregory Alexander	c41505e342	Added a RPN directory for the cache to help make invalidates faster. Hopefully this won't slow things down too much. config.h.in cpu/cpu.cc cpu/cpu.h memory/memory.cc	2002-06-05 03:59:31 +00:00
Gregory Alexander	fda1b874e9	Check in FETCHDECODE Caching, with changes. Specific changes from the patch: 1.) renamed fdcache_eip to fdcache_ip, as it is using the RIP instead of the EIP. 2.) added a Boolean array fdcache_is32 which uses is32 to determine icache hits. Otherwise we could run 32-bit code as 16-bit or vice versa. Modified Files: config.h.in cpu/cpu.cc cpu/cpu.h memory/memory.cc	2002-06-03 22:39:11 +00:00
Bryce Denney	30aaf4088e	- commit patch.wxwindows.gz in the main branch. Now you can try out the wxwindows interface by just "configure --with-wx; make" Modified Files: Makefile.in bochs.h config.h.in configure configure.in load32bitOShack.cc logio.cc main.cc cpu/cpu.cc cpu/cpu.h debug/dbg_main.cc gui/Makefile.in gui/control.cc gui/gui.cc gui/siminterface.cc gui/siminterface.h gui/x.cc iodev/cdrom.cc iodev/keyboard.cc memory/misc_mem.cc Added Files: README-wxWindows wxbochs.rc gui/wx.cc gui/wxmain.cc gui/wxmain.h gui/bitmaps/cdromd.xpm gui/bitmaps/configbutton.xpm gui/bitmaps/copy.xpm gui/bitmaps/floppya.xpm gui/bitmaps/floppyb.xpm gui/bitmaps/mouse.xpm gui/bitmaps/paste.xpm gui/bitmaps/power.xpm gui/bitmaps/reset.xpm gui/bitmaps/snapshot.xpm Removed Files: patches/patch.wxwindows.gz	2002-04-18 00:22:20 +00:00
instinc	22dc1c4f96	added address of the caught watchpoint	2002-04-01 04:42:43 +00:00
Bryce Denney	640d71d017	- check in Zwane Mwaikambo's MSR patch: patch.msr.	2002-03-27 16:04:05 +00:00
Bryce Denney	b8ecf5b118	- apply patch.smp-sync-arb-ids. This patch adds a local APIC behavior that was missing before, the special "INIT Level Deassert" synchronize arbitration ID trick.	2002-03-25 01:58:34 +00:00
instinc	dabe63ef72	added a control variable for debugger to know if register tracing is required or not	2001-10-03 19:53:48 +00:00
Bryce Denney	daf2a9fb55	- add RCS Id to header of every file. This makes it easier to know what's going on when someone sends in a modified file.	2001-10-03 13:10:38 +00:00
Bryce Denney	6a1c01c8b5	- back out my poorly written patch.virtual-address-checks-overflow	2001-10-02 20:01:29 +00:00
Bryce Denney	67ebaaca87	- apply patch.virtual-addr-checks-overflow to fix bug [ #433759 ] virtual address checks can overflow > Bochs has been crashing in some cases when you try to access data which > overlaps the segment limit, when the segment limit is near the 32-bit > boundary. The example that came up a few times is reading/writing 4 bytes > starting at 0xffffffff when the segment limit was 0xffffffff. The > condition used to compare offset+length-1 with the limit, but > offset+length-1 was overflowing so the comparison went wrong. This patch > changes the condition so that it supports all segment limits except for > sizes 0,1,2,3 bytes. Dave and I figured that these sizes would not be > needed, while size 0xffffffff is used quite a lot.	2001-10-02 17:02:28 +00:00
Bryce Denney	fd7e7ee86c	- added debugger command "info fpu" which prints all FPU registers in an output format similar to gdb (when you do info all-registers). Also, if you do "info all" you get the CPU registers and the FPU registers. - added bx_cpu_c method called fpu_print_regs, which is implemented in wmFPUemu_glue.cc	2001-09-15 06:55:14 +00:00
Bryce Denney	ad11335293	- remove space after line continuation character. Thanks to Martijn Boekhorst <Martijn@boekhorst.net> for pointing it out.	2001-09-11 23:32:14 +00:00
Bryce Denney	3d29d5d614	- add instrumentation macros for begin and end opcode. These are usually defined to be empty, so there should be no effect except for instrumentation	2001-06-28 19:45:44 +00:00
Bryce Denney	f822257511	- there were cases where BX_APIC_SUPPORT were used and others where BX_SUPPORT_APIC were used. To follow the pattern used by other names like this, I changed them all to BX_SUPPORT_APIC. Thanks to Tom Lindström for chasing this down!	2001-06-12 13:07:43 +00:00
Bryce Denney	565fa8ea8e	- another speed boost: when not using SMP, use BX_CPU_C bx_cpu; BX_MEM_C bx_mem; and when more than one processor, use BX_CPU_C bx_cpu_array[BX_SMP_PROCESSORS]; BX_MEM_C bx_mem_array[BX_ADDRESS_SPACES]; The changeover is controlled by BX_SMP_PROCESSORS, but there are only a few code changes since nearly all code uses the BX_CPU(n) and BX_MEM(n) macros. - This turns out to make a 10% speed difference! With this revision, the CVS version now gets 95% of the performance of the 3/25/2000 snapshot, which I've been using as my baseline.	2001-06-05 17:35:08 +00:00
Bryce Denney	49664f7503	- parts of the SMP merge apparently broke the debugger and this revision tries to fix it. The shortcuts to register names such as AX and DL are #defines in cpu/cpu.h, and they are defined in terms of BX_CPU_THIS_PTR. When BX_USE_CPU_SMF=1, this works fine. (This is what bochs used for a long time, and nobody used the SMF=0 mode at all.) To make SMP bochs work, I had to get SMF=0 mode working for the CPU so that there could be an array of cpus. When SMF=0 for the CPU, BX_CPU_THIS_PTR is defined to be "this->" which only works within methods of BX_CPU_C. Code outside of BX_CPU_C must reference BX_CPU(num) instead. - to try to enforce the correct use of AL/AX/DL/etc. shortcuts, they are now only #defined when "NEED_CPU_REG_SHORTCUTS" is #defined. This is only done in the cpu/*.cc code.	2001-05-24 18:46:34 +00:00
Bryce Denney	e61d00351f	- merged BRANCH-smp-bochs into main branch. For details see comments in BRANCH-smp-bochs revisions. - The general task was to make multiple CPU's which communicate through their APICs. So instead of BX_CPU and BX_MEM, we now have BX_CPU(x) and BX_MEM(y). For an SMP simulation you have several processors in a shared memory space, so there might be processors BX_CPU(0..3) but only one memory space BX_MEM(0). For cosimulation, you could have BX_CPU(0) with BX_MEM(0), then BX_CPU(1) with BX_MEM(1). WARNING: Cosimulation is almost certainly broken by the SMP changes. - to simulate multiple CPUs, you have to give each CPU time to execute in turn. This is currently implemented using debugger guards. The cpu loop steps one CPU for a few instructions, then steps the next CPU for a few instructions, etc. - there is some limited support in the debugger for two CPUs, for example printing information from each CPU when single stepping.	2001-05-23 08:16:07 +00:00
Todd T.Fries	bdb89cd364	merge in BRANCH-io-cleanup. To see the commit logs for this use either cvsweb or cvs update -r BRANCH-io-cleanup and then 'cvs log' the various files. In general this provides a generic interface for logging. logfunctions:: is a class that is inherited by some classes, and also . allocated as a standalone global called 'genlog'. All logging uses . one of the ::info(), ::error(), ::ldebug(), ::panic() methods of this . class through 'BX_INFO(), BX_ERROR(), BX_DEBUG(), BX_PANIC()' macros . respectively. . . An example usage: . BX_INFO(("Hello, World!\n")); iofunctions:: is a class that is allocated once by default, and assigned as the iofunction of each logfunctions instance. It is this class that maintains the file descriptor and other output related code, at this point using vfprintf(). At some future point, someone may choose to write a gui 'console' for bochs to which messages would be redirected simply by assigning a different iofunction class to the various logfunctions objects. More cleanup is coming, but this works for now. If you want to see a lot of debugging output, in main.cc, change onoff[LOGLEV_DEBUG]=0 to =1. Comments, bugs, flames, to me: todd@fries.net	2001-05-15 14:49:57 +00:00
Bryce Denney	a6fef54678	- update copyright dates to 2001 for all mandrake headers - for bochs files with other header, replaced with current mandrake header	2001-04-10 02:20:02 +00:00
Bryce Denney	4e04f4cb58	- change all inline declarations to one of two macros: BX_C_INLINE or BX_CPP_INLINE. Then in config.h.in you can define these two as you wish.	2001-04-10 02:10:09 +00:00
cvs	beff63eb32	- entered original Bochs snapshot bochs-2000_0325a.tar.gz from ftp.bochs.com	2001-04-10 01:04:59 +00:00
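
Note on commit b742ccec7e above: the message distinguishes a "logical" flag accessor, (val32 & (1<<N)), from a strict binary one, (1 & (val32>>N)). The following is a minimal sketch of that distinction, not the actual Bochs code. The struct name Flags, the constants kCarryBit/kSignBit/kOverflowBit, and the helper less_than() are hypothetical names chosen for illustration; only val32 and the get_?F()/getB_?F() naming pattern come from the commit message. The bit positions are the architectural EFLAGS positions (CF=0, SF=7, OF=11).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified flags holder (assumption: not the real BX_CPU_C layout).
struct Flags {
    std::uint32_t val32 = 0;  // packed EFLAGS image

    static constexpr unsigned kCarryBit    = 0;   // CF
    static constexpr unsigned kSignBit     = 7;   // SF
    static constexpr unsigned kOverflowBit = 11;  // OF

    // Logical accessors: zero if clear, some nonzero value if set.
    // Only a mask is needed, no shift.
    std::uint32_t get_CF() const { return val32 & (1u << kCarryBit); }
    std::uint32_t get_SF() const { return val32 & (1u << kSignBit); }
    std::uint32_t get_OF() const { return val32 & (1u << kOverflowBit); }

    // Binary accessors: exactly 0 or 1, for arithmetic or flag-to-flag comparison.
    std::uint32_t getB_CF() const { return 1u & (val32 >> kCarryBit); }
    std::uint32_t getB_SF() const { return 1u & (val32 >> kSignBit); }
    std::uint32_t getB_OF() const { return 1u & (val32 >> kOverflowBit); }
};

// The signed "less than" condition is SF != OF. Comparing two flags against
// each other is one of the special cases that needs the binary accessors:
// with the logical ones, SF (0x80) and OF (0x800) would compare unequal
// even when both flags are set.
inline bool less_than(const Flags& f) {
    return f.getB_SF() != f.getB_OF();
}
```

A plain test such as if (f.get_CF()) works with either form, which is why most call sites could use the cheaper logical accessor; only comparisons between flags or arithmetic on flag values need getB_?F().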

1051 Commits