Bochs

Author	SHA1	Message	Date
Stanislav Shwartsman	ff79cbd596	Infrstructure change to support disasm of BxInstruction_c directly (without calling disasm) The end goal will be also merging of disasm and cpu decoder to one module and remove the disasm. Two bug fixes on the way: TBM: fixed 64-bit TBM instructions with memory access (did 32-bit load instead of 64-bit) BMI2: fixed operands order for PEXT/PDEP instructions AVX2: fixed gather instruction decoding bug from decoder alias commit	2013-09-24 05:21:00 +00:00
Stanislav Shwartsman	cc694377b9	Standartization of Bochs instruction handlers. Bochs instruction emulation handlers won't refer to direct fields of instructions like MODRM.NNN or MODRM.RM anymore. Use generic source/destination indications like SRC1, SRC2 and DST. All handlers are modified to support new notation. In addition fetchDecode module was modified to assign sources to instructions properly. Immediate benefits: - Removal of several duplicated handlers (FMA3 duplicated with FMA4 is a trivial example) - Simpler to understand fetch-decode code Future benefits: - Integration of disassembler into Bochs CPU module, ability to disasm bx_instruction_c instance (planned) Huge patch. Almost all source files wre modified.	2012-08-05 13:52:40 +00:00
Stanislav Shwartsman	002c86660a	reword all the CPU code in preparation for future CPU speedup implementation. Bochs emulation can be another 10-15% faster using technique described in paper "Fast Microcode Interpretation with Transactional Commit/Abort" http://amas-bt.cs.virginia.edu/2011proceedings/amasbt2011-p3.pdf	2011-07-06 20:01:18 +00:00
Stanislav Shwartsman	a02d8cfe67	cleanups, simplications, copyright updates	2011-04-23 20:39:27 +00:00
Stanislav Shwartsman	7d80a6ebe0	Adding Id and Rev property to all files	2011-02-24 21:54:04 +00:00
Stanislav Shwartsman	d60b7c0919	rename accessor for opcodeReg() in instruction	2010-12-06 21:45:56 +00:00
Stanislav Shwartsman	8308a47168	trying to get rid of b1() in instruction class	2010-09-24 21:15:16 +00:00
Stanislav Shwartsman	bd60e0264c	change Copyright to Bochs Project	2009-12-04 16:53:12 +00:00
Stanislav Shwartsman	9929e6ed78	- updated FSF address	2009-01-16 18:18:59 +00:00
Stanislav Shwartsman	a95c24b019	Some functions could be called only from 32 bit	2008-09-06 21:18:08 +00:00
Stanislav Shwartsman	b96f78dc0a	Some kind of big change in fetchdecode tables invented in order to compress the tables for better host data cache utilization	2008-08-29 19:23:03 +00:00
Stanislav Shwartsman	5dd02b26e3	Make even more efficient RmAddr calculation - good optimizing compiler could make more efficient code than it was before	2008-08-08 09:22:49 +00:00
Stanislav Shwartsman	3f5efb6475	Remove more duplicated methods	2008-07-13 10:06:07 +00:00
Stanislav Shwartsman	678ac970aa	Reorganize ctrl_xfer8.cc code, allows to inline branch32 method	2008-06-22 03:45:55 +00:00
Stanislav Shwartsman	167c7075fb	Use fastcall gcc attribute for all cpu execution functions - this pure "compiler helper" optimization brings additional 2% speedup to Bochs code	2008-03-22 21:29:41 +00:00
Stanislav Shwartsman	405fcfd75d	Reorganize 3-byte opcode tables - bigger tables but easier to maintain them	2008-02-29 03:02:03 +00:00
Stanislav Shwartsman	37fbb82baa	Cleanups. Move bxInstruction_c definition to separate file instr.h	2008-01-29 17:13:10 +00:00
Stanislav Shwartsman	7b80c5f481	I merged and succeded to remove some similar execution functions - less code, less chance for branch misprediction	2008-01-25 19:34:30 +00:00
Stanislav Shwartsman	d9984bb3a1	Eliminate BxResolve call from the heart of cpu loop and move into instructions that really require this calculation. Yes, it blows the code of EVERY CPU method but it has >15% speedup !	2008-01-10 19:37:56 +00:00
Stanislav Shwartsman	d830c301cf	Fixed 64-bit versions of LOOP instructions, some cleanups	2007-12-21 17:30:49 +00:00
Stanislav Shwartsman	5d4e32b8da	Avoid pointer params for every read_virtual_* except 16-byte SSE and 10-byte x87 reads	2007-12-20 20:58:38 +00:00
Stanislav Shwartsman	b516589e4e	Changes in write_virtual_* and pop_* functions -> avoid moving parameteres by pointer	2007-12-20 18:29:42 +00:00
Stanislav Shwartsman	0a1063ad77	Split GvEv opcode groups	2007-11-21 22:36:02 +00:00
Stanislav Shwartsman	cdc9a09090	Split more opcodes	2007-11-18 18:24:46 +00:00
Stanislav Shwartsman	5ec15df46d	Split more opcodes EbIb opcodes	2007-11-17 18:08:46 +00:00
Stanislav Shwartsman	d9e58bd598	split11b on opcode tables level - split almost eevery splittable instruction will be continued	2007-11-17 12:44:10 +00:00
Stanislav Shwartsman	351244d1ea	Rename splitmod11b methods	2007-11-16 08:30:22 +00:00
Stanislav Shwartsman	9db896d100	minor x86_64 fixes and cleanups	2007-01-12 22:47:21 +00:00
Stanislav Shwartsman	a4129e5341	Handle NULL_SEG_REG (no segment override) case in fetchdecode.cc	2006-05-24 20:57:37 +00:00
Stanislav Shwartsman	5c3fba4399	Support access to SMRAM in memory object Cleanup in CPU code	2006-03-26 18:58:01 +00:00
Stanislav Shwartsman	7b6c2587a9	Now devices could be compiled separatelly from CPU Averything that required cpu.h include now has it explicitly and there are a lot of files not dependant by CPU at all which will compile a lot faster now ...	2006-03-06 22:03:16 +00:00
Stanislav Shwartsman	51e03f071d	Fixed XLAT instruction for x86-64 Small optimization for lazy flags for ADD/ADC/SUB/SBB instructions Enable RETF64 for same privelege level return	2005-07-21 01:59:05 +00:00
Stanislav Shwartsman	ce8f1ade07	Some not really significant speedups	2005-06-21 17:01:21 +00:00
Stanislav Shwartsman	c026a90779	Unify coding style in CPU methods NO AFFECT ON EMULATION RESULTS	2005-05-20 20:06:50 +00:00
Stanislav Shwartsman	8f0cf91fff	This commit is the first commit in long series of changes the have several purposes: 1. Review and commit patch [ 896733 ] Lazy flags, for more instructions, only 1 src op May be partially, but I hope to get all ideas from patch in 2. Get Bochs speedup after lazy flags optimization 3. Most important for me: improve correctness of emulation by handling several undocumented EFLAGS modifications. And finally pass UFLAGS - Undefined Flags Test v 3.0 Copyright (C) Potemkin's Hackers Group (PHG) 1989,1995 The test still fails on > 50% of its checks.	2004-08-09 21:28:47 +00:00
Stanislav Shwartsman	3f7c794b26	commit patch 899972 data xfer performance patch V 2.0.4 2004-02-18 15:38 nobody psychosmur	2004-02-26 19:17:40 +00:00
Christophe Bothamy	091052e199	- reverting to previous revision (xfer8 1.15, xfer16 1.21, xfer32 1.21) as it breaks AMD64 support.	2003-05-08 17:56:48 +00:00
Christophe Bothamy	b3d16a48ef	- apply another speedup patch from Conn Clark. Notes from the author: Here is another one of my speed up patches. Unlike my previous speedups this one will help more platforms than just X86. It cleans up the Data Xfer instructions. Since the Data Xfer instructions are the most often executed instructions it gives a noticable boost in speed. The basic optimization technique was to eliminate intermediate variables and pass a pointer to the final destination or original source to the read_virtual_whatever and the write_virtual_whatever functions.	2003-05-03 16:19:07 +00:00
Christophe Bothamy	83b8bbedff	- fix REX MOVB immediate for x86_64 (patch by Arnd Bergmann) (bug [ 720776 ] REX MOVB immediate broken for x86_64)	2003-04-26 10:02:03 +00:00
Stanislav Shwartsman	b84f0bd0f2	This was not a cleanup. Those macros were intentionally there to offer a way to substitute more efficient code to do the RMW cases. At the moment, they just map to the normal functions. Sorry, restored the previous version ...	2002-10-25 18:26:29 +00:00
Stanislav Shwartsman	a0c1fd60e6	Just little cleanup of macro duplicating an existing code	2002-10-25 17:23:34 +00:00
Kevin Lawton	a5537449cd	Split out reg-reg and reg-memory cases for a few other high-profile instructions, mainly variants of MOV. Had to update fetchdecode64 to keep it inline with the 32-bit mods.	2002-09-29 19:21:38 +00:00
Kevin Lawton	402d02974d	Moved the EFLAGS.RF check and clearing of inhibit_mask code in cpu.cc out of the main loop, and into the asynchronous events handling. I went through all the code paths, and there doesn't seem to be any reason for that code to be in the hot loop. Added another accessor for getting instruction data, called modC0(). A lot of instructions test whether the mod field of mod-nnn-rm is 0xc0 or not, ie., it's a register operation and not memory. So I flag this in fetchdecode{,64}.cc. This added on the order of 1% performance improvement for a Win95 boot. Macroized a few leftover calls to Write_RMV_virtual_xyz() that didn't get modified in the x86-64 merge. Really, they just call the real function for now, but I want to have them available to do direct writes with the guest2host TLB pointers.	2002-09-20 03:52:59 +00:00
Kevin Lawton	4e51dcae40	Converted all the remaining available separate fields in bxInstruction_c to bitfields. bxInstruction_c is now 24 bytes, including 4 for the memory addr resolution function pointer, and 4 for the execution function pointer (16 + 4 + 4). Coded more accessors, to abstract access from most code.	2002-09-18 08:00:43 +00:00
Kevin Lawton	6723ca9bf4	Moved more separate fields in the bxInstruction_c into bitfields with accessors. Had to touch a number of files to update the access using the new accessors. Moved rm_addr to the CPU structure, to slim down bxInstruction_c and to prevent future instruction caching from getting sprayed with writes to individual rm_addr fields. There only needs to be one. Though need to deal with instructions which have static non-modrm addresses, but which are using rm_addr since that will change. bxInstruction_c is down to about 40 bytes now. Trying to get down to 24 bytes.	2002-09-18 05:36:48 +00:00
Kevin Lawton	07b0df2a8a	Updated accessing of modrm/sib addressing information to use accessors. This lets me work on compressing the size of fetch-decode structure (now called bxInstruction_c). I've reduced it down to about 76 bytes. We should be able to do much better soon. I needed the abstraction of the accessors, so I have a lot of freedom to re-arrange things without making massive future changes. Lost a few percent of performance in these mods, but my main focus was to get the abstraction.	2002-09-17 22:50:53 +00:00
Kevin Lawton	b9212fdafb	(cpu64) Merged data_xfer8.cc	2002-09-13 21:47:21 +00:00
Kevin Lawton	491035fcb2	I extended the guest-to-host TLB acceleration across the Read-Modify-Write instructions. The first read phase stores the host pointer in the "pages" field if a direct use pointer is available. The Write phase first checks if a pointer was issued and uses it for a direct write if available. I chose the "pages" field since it needs to be checked by the write_RMW_virtual variants anyways and thus needs to be cached anyways. Mostly the mods where to access.cc, but I did also macro-ize the calls to write_RMW_virtual...() in files which use it and cpu.h. Right now, the macro is just a straight pass-through. I tried expanding it to a quick initial check for the pointer availability to do the write in-place, with a function call as a fall-back. That didn't seemed to matter at all. Booting is not helped by this really. The upper bound of the gain is 5 or 6%, and that's only if you have a loop that looks like: label: add [eax], ebx ;; mega read-modify-write instruction jmp label ;; intensive loop.	2002-09-06 21:54:58 +00:00
Bryce Denney	daf2a9fb55	- add RCS Id to header of every file. This makes it easier to know what's going on when someone sends in a modified file.	2001-10-03 13:10:38 +00:00
Bryce Denney	49664f7503	- parts of the SMP merge apparantly broke the debugger and this revision tries to fix it. The shortcuts to register names such as AX and DL are #defines in cpu/cpu.h, and they are defined in terms of BX_CPU_THIS_PTR. When BX_USE_CPU_SMF=1, this works fine. (This is what bochs used for a long time, and nobody used the SMF=0 mode at all.) To make SMP bochs work, I had to get SMF=0 mode working for the CPU so that there could be an array of cpus. When SMF=0 for the CPU, BX_CPU_THIS_PTR is defined to be "this->" which only works within methods of BX_CPU_C. Code outside of BX_CPU_C must reference BX_CPU(num) instead. - to try to enforce the correct use of AL/AX/DL/etc. shortcuts, they are now only #defined when "NEED_CPU_REG_SHORTCUTS" is #defined. This is only done in the cpu/*.cc code.	2001-05-24 18:46:34 +00:00

1 2

53 Commits