Bochs

Author	SHA1	Message	Date
Alexander Krisak	a6cce95418	fixed compilation	2008-09-23 13:25:32 +00:00
Stanislav Shwartsman	db664c4012	more optimizations after fetchdecode	2008-09-16 20:57:16 +00:00
Stanislav Shwartsman	d7fdaaad5b	remove not needed index set	2008-09-16 19:22:13 +00:00
Stanislav Shwartsman	a9c77eb75d	Try to optimize individual instructions after fetchdecode	2008-09-16 19:20:03 +00:00
Stanislav Shwartsman	7566faf948	A bit simplify FPU decoding	2008-09-16 18:28:53 +00:00
Stanislav Shwartsman	d57a211df9	Fixed handling of prefixes for EMMS Small FPU optimization	2008-09-12 20:59:31 +00:00
Stanislav Shwartsman	b03f940807	optimize seg_override decoding	2008-09-08 16:15:59 +00:00
Stanislav Shwartsman	c1306f7d75	small non-significant speedups	2008-09-06 21:10:40 +00:00
Stanislav Shwartsman	b3b2f77675	Reduce size of Bochs static tables by changing from bx_bool (which is 32bit) to Bit8u	2008-09-06 18:21:29 +00:00
Stanislav Shwartsman	0cd11fd385	Updated instrumentation callbacks - removed fetchdecode_completed callback	2008-09-06 17:49:32 +00:00
Stanislav Shwartsman	a0e395188f	Fixed merge error	2008-08-29 20:43:05 +00:00
Stanislav Shwartsman	b96f78dc0a	Some kind of big change in fetchdecode tables invented in order to compress the tables for better host data cache utilization	2008-08-29 19:23:03 +00:00
Stanislav Shwartsman	a5a01c4b42	optimize LEAVE operation	2008-08-27 21:57:40 +00:00
Stanislav Shwartsman	d0803ebd10	branch_16 optimizations	2008-08-23 22:27:58 +00:00
Stanislav Shwartsman	991ae348cb	Clean invalidate_prefetch_q when not needed	2008-08-23 13:55:37 +00:00
Stanislav Shwartsman	70f363a05c	Unroll back 32-bit fetchdecode displ	2008-08-11 21:06:27 +00:00
Stanislav Shwartsman	a8adb36dc2	Implemented MOVBE Intel Atom(R) instruction	2008-08-11 18:53:24 +00:00
Stanislav Shwartsman	b61017e5b6	Split more opcodes using new LOAD technique	2008-08-10 21:16:12 +00:00
Stanislav Shwartsman	1da5943f1a	More use of LOAD_Ex method	2008-08-10 19:34:28 +00:00
Stanislav Shwartsman	0d90ab0478	Completely new way to handle LD+OP cases - allows to significantly reduce number of BX_CPU_C methods	2008-08-09 21:05:07 +00:00
Stanislav Shwartsman	24e0b53720	A more elegant way to have debug info for BxError without losing any performance	2008-08-09 19:18:09 +00:00
Stanislav Shwartsman	709d74728d	Call #UD exception directly instead of UndefinedOpcode function - for future use	2008-07-13 15:35:10 +00:00
Stanislav Shwartsman	0127415ba6	Clear some duplicated arithmetic opcodes - difference only in operands order	2008-07-13 09:59:59 +00:00
Stanislav Shwartsman	65275ffc02	Remove repeat speedups from 16-bit address size methods - they not gonna speed up anyway because of segment limit issue	2008-06-25 10:34:21 +00:00
Stanislav Shwartsman	a6fda9a971	Instrumentation code updated, some PANIC messages fixed	2008-06-23 02:56:31 +00:00
Stanislav Shwartsman	678ac970aa	Reorganize ctrl_xfer8.cc code, allows to inline branch32 method	2008-06-22 03:45:55 +00:00
Stanislav Shwartsman	7f82a536b3	Fixed code duplication during prefix decoding	2008-06-11 20:58:29 +00:00
Stanislav Shwartsman	aff775bce4	Small code optimization	2008-06-09 19:35:59 +00:00
Stanislav Shwartsman	ed4be45a8b	Split shift/rotate opcodes in 32-bit mode and 64-bit mode	2008-05-02 22:47:07 +00:00
Stanislav Shwartsman	f5780a5f5c	Hide some BX_MEM_C variables Optimize resolve16 methods - by reducing their amount again - reduce chance for misspredictin	2008-05-01 20:08:37 +00:00
Stanislav Shwartsman	81deffd65d	More fetchdecode fixes	2008-04-30 21:32:33 +00:00
Stanislav Shwartsman	e5b6f90b62	some fetchdecode fixes	2008-04-30 21:07:12 +00:00
Stanislav Shwartsman	64f2489afb	Correctly implement opcode group G11 i.e. instructions C6 and C7 should #UD when modrm nnn field != 0 (1st instr in the group	2008-04-24 21:52:28 +00:00
Stanislav Shwartsman	892fa99c6f	- prefetch hint should be NOP when use in register mode - #GP when trying to set reserved bits of CR4_HI in 64-bit mode - #GP when trying to set reserved bits of EFER MSR - clear upper part of RSI/RDI when executing rep instructions with 32-bit asize even if no repeat iterations were executed (because of RCX=0 for example) - write SYSENTER_EIP_MSR and SYSENTER_ESP_MSR as 64-bit when x86_64 supported - set MSR_FMASK reset value - MSR_FMASK should be 32-bit only - check for fetch permissions when doing ITLB lookup - #GP when trying to write non-canonical address to MSR_CSTAR or MSR_LSTAR - correct repeat instructions timing - mark TSS busy in TR after it is loaded	2008-04-16 16:44:06 +00:00
Stanislav Shwartsman	419dc57dbd	Complete MASKMOVDQU decoding fix	2008-04-16 05:56:55 +00:00
Stanislav Shwartsman	4f3f8608f7	Fixed MASKMOVDQU instruction decoding	2008-04-16 05:41:43 +00:00
Stanislav Shwartsman	fe59e0ae6a	Fixed comment in fetchdecode	2008-04-06 18:31:10 +00:00
Stanislav Shwartsman	1bdddc1f78	Split SHRD/SHLD instructions	2008-04-05 19:08:01 +00:00
Stanislav Shwartsman	5826e2843a	Inline pop/push functions Store only single byte of opcode in b1() - speedup shift instructions Code cleanups	2008-04-05 17:51:55 +00:00
Stanislav Shwartsman	2aaafa76a2	Reorganize fetchdecode tables with another level of redirection - a leap toward future improvements Currently no speedup and no slowdown - about the same results on my Bochs benchmarking A lot of code reorganization in fetchdecode	2008-04-04 22:39:45 +00:00
Stanislav Shwartsman	62e3728591	preparations for future optimizations - not necessarily a speedup now	2008-04-03 17:56:59 +00:00
Stanislav Shwartsman	3f2487a0af	Enabled tracing cross repeated instructions	2008-03-31 18:53:08 +00:00
Stanislav Shwartsman	255d512e29	Organize bxInstruction fields differently	2008-03-31 17:33:34 +00:00
Stanislav Shwartsman	14ff07b482	Small code cleanup	2008-03-29 09:58:23 +00:00
Stanislav Shwartsman	e48b398bee	Add NIL register and simplify more BxResolve work	2008-03-29 09:34:35 +00:00
Stanislav Shwartsman	167c7075fb	Use fastcall gcc attribute for all cpu execution functions - this pure "compiler helper" optimization brings additional 2% speedup to Bochs code	2008-03-22 21:29:41 +00:00
Stanislav Shwartsman	7e490699d4	Removing hooks for not-implemented SSE4A from the Bochs code.	2008-03-21 20:04:42 +00:00
Stanislav Shwartsman	946b7a369d	Added const to fetchPtr in cpu functions	2008-03-03 15:16:46 +00:00
Stanislav Shwartsman	5e7218b8c3	Fixed problem introduced by prev checkin + Fix break to debugger when executing HLT instruction	2008-02-29 05:39:40 +00:00
Stanislav Shwartsman	405fcfd75d	Reorganize 3-byte opcode tables - bigger tables but easier to maintain them	2008-02-29 03:02:03 +00:00
Stanislav Shwartsman	0f44b4f0ec	Fixes in MODRM tables	2008-02-15 12:23:49 +00:00
Stanislav Shwartsman	4fc0df26e8	a bit optimize and simplify x87 decoding	2008-02-14 18:59:41 +00:00
Stanislav Shwartsman	063d896226	Optimization in 16-bit resolve functions Fixes for hosts which can't support misaligned memory access	2008-02-07 20:43:13 +00:00
Stanislav Shwartsman	fb0ce45d28	Unpack more fields in bxInstruction_c -> this increase bxInstruction size by 4 bytes but I have no way but do it if want to support SSE5 dest override later	2008-02-04 21:28:53 +00:00
Stanislav Shwartsman	a2897933a3	white space cleanup	2008-02-02 21:46:54 +00:00
Stanislav Shwartsman	37fbb82baa	Cleanups. Move bxInstruction_c definition to separate file instr.h	2008-01-29 17:13:10 +00:00
Stanislav Shwartsman	7b80c5f481	I merged and succeeded in removing some similar execution functions - less code, less chance for branch misprediction	2008-01-25 19:34:30 +00:00
Stanislav Shwartsman	63d8d50cfc	code cleanup	2008-01-20 20:11:17 +00:00
Stanislav Shwartsman	8c9de8b4db	speculative tracing on fetchdecode level	2008-01-18 09:36:15 +00:00
Stanislav Shwartsman	d9984bb3a1	Eliminate BxResolve call from the heart of cpu loop and move into instructions that really require this calculation. Yes, it blows the code of EVERY CPU method but it has >15% speedup !	2008-01-10 19:37:56 +00:00
Stanislav Shwartsman	eee1a9030d	a bit simplify and optimize shift instructions print failed segment info in check_cs - more debug info	2007-12-30 20:16:35 +00:00
Stanislav Shwartsman	c3c9c40674	Move MaxFetch calculation into fetchdecode - simplify the logic	2007-12-22 17:17:40 +00:00
Stanislav Shwartsman	e9a148f9c4	Almost last instruction split -> CMOV in 16/32 bit modes	2007-12-21 18:24:19 +00:00
Stanislav Shwartsman	6ac7fa7106	MMX - modify masked write to RMW - faster execution CMPXCHG8B/16B - fixed possible problem. Instruction not allowed to fault after some part of it written to the memory	2007-12-19 23:21:11 +00:00
Stanislav Shwartsman	c9932e97eb	Fixes in resolve.cc -> reduce amount of resolve functions even more	2007-12-18 21:41:44 +00:00
Stanislav Shwartsman	fe2e0525da	More optimization for string instructions	2007-12-17 19:52:01 +00:00
Stanislav Shwartsman	de5838ce80	cleanups and fixes for Immediate_IbIb of SSE4A	2007-12-16 20:47:10 +00:00
Stanislav Shwartsman	1e843cb462	Decode SSE4A Rework immediate bytes decoding to make it faster	2007-12-15 17:42:24 +00:00
Stanislav Shwartsman	903f6dea35	Split setCC functions - makes code faster and simpler	2007-12-14 21:29:36 +00:00
Stanislav Shwartsman	d9a59c7a1f	Added ability to merge traces cross JCC branch instructions Makes traces longer -> emulation faster in average	2007-12-14 20:41:09 +00:00
Stanislav Shwartsman	05c7a1e61b	Fixed problem with trace cache enabled String instructions might confuse trace cache by finishing instruction execution method without actually completing an instruction (and advancing eip)	2007-12-13 18:42:31 +00:00
Stanislav Shwartsman	adda3befd3	Trace cache optimization merged	2007-12-09 18:36:05 +00:00
Stanislav Shwartsman	4c16dd71a8	Fixed compilation error in SMP mode	2007-12-07 09:38:42 +00:00
Stanislav Shwartsman	1bcf42baec	oops, fixed incorrect checkin	2007-12-01 16:59:36 +00:00
Stanislav Shwartsman	7ca78b88e9	configure/compile changes + small optimizations	2007-12-01 16:45:17 +00:00
Stanislav Shwartsman	8cfd17202a	some simple SSE code optimizations	2007-11-27 22:12:45 +00:00
Stanislav Shwartsman	c51888f43f	Split last BxLockable opcodes -> this allows to eliminate mod==0xc0 check from fetchdecode of every instruction reduce ACPU.CC dependencies - now that file doesn't depend of CPU	2007-11-25 20:22:10 +00:00
Stanislav Shwartsman	3daa468c02	Fixed comments in bit.cc Revert back lock prefix changes in fetchdecode - not all lockable instructions are splitted yet ;(	2007-11-23 16:37:06 +00:00
Stanislav Shwartsman	1dbe51a2fb	Split ENTER_IwBw function according to os32. Fixed ENTER/LEAVE in 64-bit mode	2007-11-22 17:33:06 +00:00
Stanislav Shwartsman	0a1063ad77	Split GvEv opcode groups	2007-11-21 22:36:02 +00:00
Stanislav Shwartsman	1af7010e50	Optimized memory access for 64-bit mode Starting convergence to new lazy flags scheme by Darek Mihocka (www.emulators.com). The new flags code is still being validated and perfected but I try to minimize the diff between 2 versions	2007-11-20 17:15:33 +00:00
Stanislav Shwartsman	2bd8958783	Change force_flags() implementation and make lazy flags a bit more lazy :)	2007-11-19 19:55:09 +00:00
Stanislav Shwartsman	d75a69fd2e	Remove BxResolve tables	2007-11-18 22:14:39 +00:00
Stanislav Shwartsman	fb61418307	optimize modrm/sib decoding	2007-11-18 21:38:58 +00:00
Stanislav Shwartsman	30f42d74f1	make sreg index tables static in fetchdecode and remove them from init.cc/cpu.h	2007-11-18 21:07:40 +00:00
Stanislav Shwartsman	bcaba54489	Merge resolve functions for 32 and 64-bit	2007-11-18 19:46:14 +00:00
Stanislav Shwartsman	57d2d14865	Split POP_Ev opcodes	2007-11-18 18:49:19 +00:00
Stanislav Shwartsman	cdc9a09090	Split more opcodes	2007-11-18 18:24:46 +00:00
Stanislav Shwartsman	613bad34ee	split MOVZX/MOVSX opcodes	2007-11-17 18:29:00 +00:00
Stanislav Shwartsman	5ec15df46d	Split more opcodes EbIb opcodes	2007-11-17 18:08:46 +00:00
Stanislav Shwartsman	d5a58e1df2	Split more opcodes - G3 group	2007-11-17 16:20:37 +00:00
Stanislav Shwartsman	d9e58bd598	split11b on opcode tables level - split almost every splittable instruction will be continued	2007-11-17 12:44:10 +00:00
Stanislav Shwartsman	abe3f4c5c2	Split one more opcode	2007-11-16 21:43:23 +00:00
Stanislav Shwartsman	b4b922809a	Move 3byte opcode decoding under Modrm condition	2007-11-16 20:49:51 +00:00
Stanislav Shwartsman	565e7f9868	Merge common fetchdecode groups. Add more comments to fetchdecode tables	2007-11-16 18:34:14 +00:00
Stanislav Shwartsman	393018cdf8	More split11b	2007-11-16 17:45:58 +00:00
Stanislav Shwartsman	351244d1ea	Rename splitmod11b methods	2007-11-16 08:30:22 +00:00
Stanislav Shwartsman	db02731cbf	Replace BxAnother attribute in fetchdecode by table lookup like it is done in disasm. This is done in preparation for a future huge fetchdecode change - all fetchdecode tables will be duplicated, with separate tables for ModC0 and others. So ALL instructions will enjoy SplitMod11b automatically (if they want). After splitting ALL instructions I hope to get at least a 20% speedup.	2007-11-15 17:57:56 +00:00
Stanislav Shwartsman	9da22f9b65	Remove redundant BxAnother attr from prefixes The meaning of BxAnother attr - instruction has modrm byte	2007-11-14 22:52:16 +00:00
Stanislav Shwartsman	0fa82afe1f	Bugfix and optimize BxResolve calls - bugfix in 64-bit mode	2007-11-13 17:30:54 +00:00
Stanislav Shwartsman	edfff23ca0	Split JCC methods to 16 different methods per branch condition	2007-11-12 18:20:15 +00:00
Stanislav Shwartsman	eea5023da8	small simplification for fetchdecode	2007-11-11 20:56:22 +00:00
Stanislav Shwartsman	2653d54e96	split 32-bit modermdata variable in BxInstruction_c to 4 Bit8u variables this way it is possible to save shifts and masking when accessing modrm fields	2007-11-08 18:21:37 +00:00
Stanislav Shwartsman	2f5fa07af3	small speedups	2007-11-07 10:40:40 +00:00
Stanislav Shwartsman	292153b30e	Fixed BranchImm cases in 64-bit mode	2007-10-22 17:41:41 +00:00
Stanislav Shwartsman	5445de19d1	Decoding : F2 and F3 prefix could override prefix 66 when determining SSE opcode	2007-10-20 10:56:44 +00:00
Stanislav Shwartsman	be9ad60ef3	cleanups	2007-10-11 22:44:17 +00:00
Stanislav Shwartsman	8adbbcf17c	Started first implementation of MONITOR/MWAIT	2007-10-11 21:29:01 +00:00
Stanislav Shwartsman	0dc4badfbb	Added SSE4A and SSE4_2 to disassembler Implemented POPCNT instruction	2007-09-19 19:38:10 +00:00
Stanislav Shwartsman	b64fc08c54	implement prefetch hint opcodes	2007-08-23 16:47:51 +00:00
Stanislav Shwartsman	4555cc9be3	ud2b opcode should have modrm byte	2007-08-18 13:51:16 +00:00
Stanislav Shwartsman	5189cfbf10	SSE4 support	2007-04-19 16:12:21 +00:00
Stanislav Shwartsman	e26609fa97	Support for Intel LSS/LFS/LGS in 64-bit mode TODO: have both AMD and Intel versions	2007-04-09 20:28:15 +00:00
Stanislav Shwartsman	ef542b3790	Learn to decode and disassemble VMX opcodes No fetchdecode support but everything is ready	2007-03-23 14:35:50 +00:00
Stanislav Shwartsman	c24627c00f	Implemented CLFLUSH instruction Set of minor fixes for correctness	2007-01-28 21:27:31 +00:00
Stanislav Shwartsman	acd1a05f6f	Fixed bugs for SSE3E execution and decoding	2007-01-25 21:44:35 +00:00
Stanislav Shwartsman	f8003098b1	Rename SSE4 to SSE3E to match intel docs. SSE4 coming later ;) Fixed "last prefix" for REX in 64-bit mode	2007-01-25 19:09:41 +00:00
Stanislav Shwartsman	9db896d100	minor x86_64 fixes and cleanups	2007-01-12 22:47:21 +00:00
Stanislav Shwartsman	5c21f7821f	Speed simulation between 3 to 5% by eliminating several checks from cpu loop. The checks were related to repeat instructions - handle them differently	2007-01-05 13:40:47 +00:00
Stanislav Shwartsman	6c3420a18b	Add debug prints before any #GP exception which only possible to be generated	2006-06-09 22:29:07 +00:00
Stanislav Shwartsman	a4129e5341	Handle NULL_SEG_REG (no segment override) case in fetchdecode.cc	2006-05-24 20:57:37 +00:00
Stanislav Shwartsman	fc799ab623	FetchDecode tables are constant. Marking them const implicitly will help to compiler/linker in optimization.	2006-05-12 18:03:26 +00:00
Stanislav Shwartsman	fe644dfcbf	- Code cleanup, remove x86-64 code from functions which cannot be called from x86-64 - Fix PANIC multiple SSE prefix decoding (fetchdecode and disasm) - More Bit32u -> bx_phy_address convert - Lazy flags optimization	2006-05-12 17:04:19 +00:00
Stanislav Shwartsman	20b14aefa6	Fix in BSWAP 64-bit mode - allow to use additional R8-R15 registers Also fixed code duplication story with BSWAP instruction	2006-05-07 18:58:47 +00:00
Stanislav Shwartsman	d69eba6c07	Split in/out instructions based on operand size	2006-05-07 18:27:36 +00:00
Stanislav Shwartsman	03eac64013	Added decoding of new SSE4 instructions (recently published in Intel docs) At least CPUID detects them correctly The code is never tested (still) ! (but should work fine)	2006-04-06 18:30:05 +00:00
Stanislav Shwartsman	9dc1790f07	Simplify and optimize fetchdecode methods. Now fetchdecode is simpler to understand and easier to modify, for example to support 3-byte opcodes (SSE4)	2006-04-05 20:52:40 +00:00
Stanislav Shwartsman	f8c3968d42	Changes list made after CVS service crash: - Fixed critical bug in CPU code added with one of the prev commits - Disasm support for SSE4 - Rename PNI->SSE3 everywhere in the code - Correctly decode, disassemble and execute 'XCHG R8, rAX' x86-64 instruction - Correctly decode, disassemble and execute multi-byte NOP 0F F1 opcode - Fixed ENTER and LEAVE instructions in x86-64 mode - Added ability to turn ON instruction trace, only GUI support is missed. Instruction trace could be enabled if Bochs was compiled with disasm - More changes Bit32u -> bx_phy_address - Complete preliminary implementation of SMM in Bochs, SMI is still PANICs but if you press 'continue' everything should work OK - Small code cleanup - Update CHANGES and user docs	2006-04-05 17:31:35 +00:00
Stanislav Shwartsman	f347ab97bf	Fixed CALL/JMP far through call gate 64 Decode SWAPGS and RDTSCP instructions Indent changes in fetchdecode	2006-03-22 20:47:11 +00:00
Stanislav Shwartsman	7b6c2587a9	Now devices could be compiled separately from CPU Everything that required cpu.h include now has it explicitly and there are a lot of files not dependent on CPU at all which will compile a lot faster now ...	2006-03-06 22:03:16 +00:00
Stanislav Shwartsman	38a7e0abea	0f 0d (3dnow prefetch instruction) should execute as NOP when running on Intel EM64T CPU and as prefetch on AMD	2005-11-11 21:09:02 +00:00
Stanislav Shwartsman	d1c722211e	Fix duplicate opcodes, fix opcode names and disasm bugs	2005-09-23 16:45:41 +00:00
Stanislav Shwartsman	37bd193337	Split PUSHF/POPF to 3 different methods according to op size. By the way fix VIP/VIF flags handling in POPF/PUSHF (future fix for VME)	2005-08-08 19:56:11 +00:00
Stanislav Shwartsman	8616109eb8	revert back not correct change in fetchdecode	2005-08-05 12:53:09 +00:00
Stanislav Shwartsman	8be190d848	Implemented RDTSCP instruction	2005-08-05 12:47:33 +00:00
Stanislav Shwartsman	954aae3f99	Speedup push/pop operations, they actually do not need to do can_push/can_pop checks, the same checks are already done in read/write_virtual methods Split push_seg_reg methods according to op size	2005-07-31 17:57:27 +00:00
Stanislav Shwartsman	2b5a812674	Split last bit.cc methods according to os16/32/64	2005-07-25 04:18:20 +00:00
Stanislav Shwartsman	ce8f1ade07	Some not really significant speedups	2005-06-21 17:01:21 +00:00
Stanislav Shwartsman	a86002a8bc	Improve Bochs instrumentation Small changes in APIC timer, should fix the bug report [ 957660 ] >>PANIC<< APIC: R(curr timer count): delta < initial	2005-04-29 21:28:59 +00:00
Stanislav Shwartsman	e6e9dd3825	Extend Bochs instrumentation Compatibility fixes	2005-03-17 20:50:57 +00:00
Stanislav Shwartsman	709b218c10	Reduce metaInfo initialization in fetchDecode	2005-03-01 21:44:01 +00:00
Stanislav Shwartsman	2bfc842c09	CPU fixes by Kevin Lawton	2005-02-16 21:27:21 +00:00
Stanislav Shwartsman	9492942ae6	In 64-bit mode, the CS, DS, ES, and SS segment overrides are ignored.	2005-02-12 19:25:33 +00:00
Stanislav Shwartsman	bbcc5e0e3a	Split BOUND instruction to two different according to operand size Coding style change	2005-01-28 20:50:48 +00:00
Stanislav Shwartsman	46bb3d8853	remove duplicated data arrays from CPU	2004-12-11 20:51:13 +00:00
Stanislav Shwartsman	5213e903bd	move duplicate opcode groups from fetchdecode*.cc to .h use common register accessor macros instead of direct register file structure access	2004-11-26 20:21:28 +00:00
Stanislav Shwartsman	69c0b06955	fixes in disassembler split REPEAT instructions according to opsize to speedup execution now each REPEATABLE instruction is split into 3 different instructions, one for 16-bit operand size, one for 32-bit and one for 64-bit. The correct instruction is chosen in the fetchdecode step.	2004-11-20 23:26:32 +00:00
Stanislav Shwartsman	08810d54c4	Fix fetchdecode for FPU instructions when FPU is not present	2004-11-12 16:47:35 +00:00
Stanislav Shwartsman	4f1f070c37	Fix comments for code	2004-10-08 19:29:04 +00:00
Stanislav Shwartsman	760a195c9d	* Fix LOCK prefix handling for x86-64 * Split BT*_EvGv functions to 3 different function according to exec mode	2004-09-17 20:47:19 +00:00
Stanislav Shwartsman	fc631037ff	remove obsolete comments from fetchdecode	2004-09-06 20:22:39 +00:00
Stanislav Shwartsman	77b3886f8b	Cleanup and optimize	2004-08-28 08:41:46 +00:00
Stanislav Shwartsman	4eea772270	LOADALL for cpu-level=2 in fetchdecode	2004-05-11 16:44:58 +00:00
Stanislav Shwartsman	3274e0dd12	Commit patch [ 950905 ] Do not PANIC on rare, bad input from user-mode by h.johansson with little changes and fixes	2004-05-10 21:05:51 +00:00
Stanislav Shwartsman	279d207d45	Fix fetchdecode bugs reported by Gilbert Netzer (opcode patches for x86_64 cpu)	2004-05-03 17:58:36 +00:00
Stanislav Shwartsman	cf6d1b8bd9	port some changes from spftfloat-fpu branch to the MT	2004-04-09 15:34:59 +00:00
Stanislav Shwartsman	0eb71999db	Added missed 287 opcodes which should be executed as NOP in 387+	2003-12-28 18:19:41 +00:00
Stanislav Shwartsman	9ccb363ec3	bochs style decode/execute of FPU instructions. With this coding style each instruction could be implemented separately even not together with current Bochs FPU emulator. Step-by-step I am going to transfer all FPU instructions from current Bochs FPU emulator to new style and remove an old bugged emulator. Anyway, now I could implement all currently missed FPU instructions without hacking wm-fpu-emu.	2003-12-27 13:50:06 +00:00
Stanislav Shwartsman	ac20b6405a	- FXSAVE/FXRSTOR instructions should be available in P6 mode - Added second UD2 opcode to fetchdecode - Added RDPMC instruction to fetchdecode - 'changes' updated	2003-10-24 18:34:16 +00:00
Stanislav Shwartsman	7f570b0150	Added PNI new streaming extensions instructions PNI could be enabled by setting BX_SUPPORT_PNI in config.h After the feature is fully validated I'll also add a configure option. The implementation is ~complete. I've missed only three new FPU opcodes of the FISTTP instruction and the MONITOR/MWAIT instructions. Enjoy ! ;)	2003-08-29 21:20:52 +00:00
Stanislav Shwartsman	254ad17328	Changes method of resolving opcode/attributes from group table New method more flexible and easy to understanding. Reorganizing fetchdecode code and make it more easy and understandable	2003-08-28 19:25:23 +00:00
Stanislav Shwartsman	6aa0a62fe7	Optimizing fetchdecode	2003-08-15 13:08:24 +00:00
Stanislav Shwartsman	96984cb6cb	Added missed fetchdecode table entry for SYSENTER/SYSEXIT	2003-06-20 08:58:12 +00:00
Stanislav Shwartsman	58efdfb31f	An illegal lock prefix was not checked for instructions without any attributes (i.e. without immediate, modrm or any other additional bytes except prefixes).	2003-06-12 17:01:37 +00:00
Stanislav Shwartsman	3c00944998	I hope this is the last one ...	2003-05-29 19:44:59 +00:00
Stanislav Shwartsman	f933d604d3	Fixed missed BxLockable for XCHG instruction	2003-05-29 17:15:08 +00:00
Stanislav Shwartsman	1d45167e5b	Merged NEW-INSTRUCTIONS branch	2003-05-15 16:41:17 +00:00
Volker Ruppert	79b811f23f	- fixed warnings in these files: cpu/fetchdecode.cc cpu/mmx.cc cpu/proc_ctrl.cc iodev/virt_timer.cc plugin.cc	2003-05-02 12:22:48 +00:00
Stanislav Shwartsman	446fca9ed0	Superfluous braces in initializers in fetchdecode.cc	2003-04-23 17:52:59 +00:00
Stanislav Shwartsman	40bd4f138b	Little style changes Eliminated i387_t alimit field (not used in FPU)	2003-04-16 18:38:53 +00:00
Stanislav Shwartsman	7db893970c	Read attributes bits even for BxSplit11b opcodes Move lock prefix check later in fetchdecode function when all attributes is ready.	2003-04-06 19:08:31 +00:00
Stanislav Shwartsman	1e71c9e56e	Merged patch-unallowed-lock-cases patch. According to the Intel manuals: The LOCK prefix can be prepended only to the following instructions and only to those forms of the instructions where the destination operand is a memory operand: ADD, ADC, AND, BTC, BTR, BTS, CMPXCHG, CMPXCHG8B, DEC, INC, NEG, NOT, OR, SBB, SUB, XOR, XADD, and XCHG. If the LOCK prefix is used with one of these instructions and the source operand is a memory operand, an undefined opcode exception (#UD) will be generated. An undefined opcode exception will also be generated if the LOCK prefix is used with any instruction not in the above list. Checking of the LOCK prefix is done in the fetchDecode stage and does not burden Bochs's execution.	2003-04-05 12:16:53 +00:00
Christophe Bothamy	1a518b81fe	- add __attribute__((regparm(X))) performance trick with gcc on x86 on some cpu instructions (patch from Conn Clark) - performance improvement is 1% on win95 boot	2003-03-17 00:41:01 +00:00
Stanislav Shwartsman	cdfc3cbce4	instrumentation enhancements: * renamed CPU_ID to BX_CPU_ID. with this new name there is no possibility for name contentions and BX_CPU_ID definition could be moved out to NEED_CPU_REG_SHORTCUTS block * returned back `unsigned BX_CPU::which_cpu(void)` function * added BX_CPU_ID parameter for BX_INSTR_PHY_READ(a20addr, len); BX_INSTR_PHY_WRITE(a20addr, len); now it will be BX_INSTR_PHY_READ(cpu_id, a20addr, len); BX_INSTR_PHY_WRITE(cpu_id, a20addr, len);	2003-02-13 15:04:11 +00:00
Christophe Bothamy	c6abf1d0d1	- fix old #if BX_SUPPORT_SYSENTEREXIT found by Stanislav. The sysenter/exit code was not called at all!	2003-01-20 21:30:00 +00:00
Christophe Bothamy	939b558fdf	- apply patch.sysenterexit-mrieker: - adds sysenter/sysexit support for cpu-level>=6 - enabled by ./configure --enable-sep	2003-01-20 20:10:31 +00:00
Stanislav Shwartsman	4b59ecbc62	Implemented SSE/SSE2 duplicate opcodes in more intelligent way ...	2002-12-22 21:48:23 +00:00
Stanislav Shwartsman	e73df72525	implementation of additional SSE/SSE2 instructions	2002-12-22 20:42:56 +00:00
Stanislav Shwartsman	4906ffef7c	Clean Peter's commit with MOVNTDQ instruction implementation	2002-12-20 09:11:39 +00:00
Bryce Denney	9b2914fd1d	- Temporarily revert Stanislav's changes between 2002-12-18 and 2002-12-19. Because source files were added/removed it would require an update of the windows and macos project files, so I want to wait until after 2.0. M Makefile.in 1.51 back to 1.50 M cpu.h 1.121 back to 1.120 M fetchdecode.cc 1.37 back to 1.36 M fetchdecode64.cc 1.33 back to 1.32 M sse.cc 1.17 back to 1.16 A sse2.cc 1.27 back to 1.26 (added back) R sse_move.cc removed R sse_pfp.cc removed - to bring these changes back again, all we have to do is "cvs update -j tmp-before1 -j tmp-after1"	2002-12-19 05:53:18 +00:00
Stanislav Shwartsman	aa361badf2	Reorganized SSE/SSE2 code sse.cc -> general SSE stuff and SSE integer (MMX extensions) sse_move.cc -> memory transfer and shuffle opcodes sse_pfp.cc -> packed floating point operations	2002-12-18 22:33:44 +00:00
Stanislav Shwartsman	bcd57bdcaf	* Current duplicate SSE/SSE2 instructions list * MOVUPS_VpsWps (0f 10) = MOVUPD_VpdWpd (66 0f 10) = MOVDQU_VdqWdq (f3 0f 6f) MOVUPS_WpsVps (0f 11) = MOVUPD_WpdVpd (66 0f 11) = MOVDQU_WdqVdq (f3 0f 7f) MOVAPS_VpsWps (0f 28) = MOVAPD_VpdWpd (66 0f 28) = MOVDQA_VdqWdq (66 0f 6f) MOVAPS_WpsVps (0f 29) = MOVAPD_WpdVpd (66 0f 29) = MOVDQA_WdqVdq (66 0f 7f) MOVNTPS_MdqVps (0f 2b) = MOVNTPD_MdqVpd (66 0f 2b) MOVLPS_VpsMq (0f 12) = MOVLPD_VsdMq (66 0f 12) MOVLPS_MqVps (0f 13) = MOVLPD_MqVsd (66 0f 13) MOVHPS_VpsMq (0f 16) = MOVHPD_VpdMq (66 0f 16) MOVHPS_MqVps (0f 17) = MOVHPD_MqVpd (66 0f 17) ANDPS_VpsWps (0f 54) = ANDPD_VpdWpd (66 0f 54) = PAND_VpdWpd (66 0f db) ANDNPS_VpsWps (0f 55) = ANDNPD_VpdWpd (66 0f 55) = PANDN_VpdWpd (66 0f df) ORPS_VpsWps (0f 56) = ORPD_VpdWpd (66 0f 56) = POR_VpdWpd (66 0f eb) XORPS_VpsWps (0f 57) = XORPD_VpdWpd (66 0f 57) = PXOR_VpdWpd (66 0f ef) Removed dupes	2002-11-25 21:58:55 +00:00
Stanislav Shwartsman	121de7d960	Fixed bug with decoding of Group15	2002-11-15 13:05:19 +00:00
Stanislav Shwartsman	ccbc8e0ef7	MOVAPS/MOVAPD have a different exceptions	2002-11-15 12:44:39 +00:00
Stanislav Shwartsman	7ccf1de78f	According to the Intel (and AMD) manuals a lot different SSE/SSE2 opcodes has EXACTLY the same operation. Deleted first three redundant opcodes (move integer data): MOVLPS_VpsMq (0f 12) = MOVLPD_VsdMq (66 0f 12) MOVLPS_MqVps (0f 13) = MOVLPD_MqVsd (66 0f 13) MOVHPS_VpsMq (0f 16) = MOVHPD_VpdMq (66 0f 16) MOVHPS_MqVps (0f 17) = MOVHPD_MqVpd (66 0f 17) Until under examination: XORPS,XORPD ORPS,ORPD ANDPS,ANDPD ANDNPS,ANDNPD MOVUPS,MOVUPD	2002-11-13 22:24:03 +00:00
Stanislav Shwartsman	968b2744f4	According to the Intel (and AMD) manuals a lot different SSE/SSE2 opcodes has EXACTLY the same operation. Deleted first three redundant opcodes: MOVAPS_VpsWps (0f 28) = MOVAPD_VpdWpd (66 0f 28) MOVAPS_WpsVps (0f 29) = MOVAPD_WpdVpd (66 0f 29) MOVNTPS_MdqVps (0f 2b) = MOVNTPD_MdqVpd (66 0f 2b) Until checking: XORPS,XORPD ORPS,ORPD ANDPS,ANDPD ANDNPS,ANDNPD MOVUPS,MOVUPD MOVLPS,MOVLPD MOVHPS,MOVHPD	2002-11-13 21:35:17 +00:00
Stanislav Shwartsman	4363745725	Implemented SSE2 integer instructions: PACKSSDW_VdqWdq PUNPCKHDQ_VdqWq PUNPCKHWD_VdqWq PUNPCKHBW_VdqWq PUNPCKHQDQ_VdqWq MOVD_EdVd MOVD_VdqEd	2002-11-08 12:47:24 +00:00
Bryce Denney	cec9135e9f	- Apply patch.replace-Boolean rev 1.3. Every "Boolean" is now changed to a "bx_bool" which is always defined as Bit32u on all platforms. In Carbon specific code, Boolean is still used because the Carbon header files define it to unsigned char. - this fixes bug [ 623152 ] MacOSX: Triple Exception Booting win95. The bug was that some code in Bochs depends on Boolean to be a 32 bit value. (This should be fixed, but I don't know all the places where it needs to be fixed yet.) Because Carbon defined Boolean as an unsigned char, Bochs just followed along and used the unsigned char definition to avoid compile problems. This exposed the dependency on 32 bit Boolean on MacOS X only and led to major simulation problems, that could only be reproduced and debugged on that platform. - On the mailing list we debated whether to make all Booleans into "bool" or our own type. I chose bx_bool for several reasons. 1. Unlike C++'s bool, we can guarantee that bx_bool is the same size on all platforms, which makes it much less likely to have more platform-specific simulation differences in the future. (I spent hours on a borrowed MacOSX machine chasing bug 618388 before discovering that different sized Booleans were the problem, and I don't want to repeat that.) 2. We still have at least one dependency on 32 bit Booleans which must be fixed some time, but I don't want to risk introducing new bugs into the simulation just before the 2.0 release. 
Modified Files: bochs.h config.h.in gdbstub.cc logio.cc main.cc pc_system.cc pc_system.h plugin.cc plugin.h bios/rombios.c cpu/apic.cc cpu/arith16.cc cpu/arith32.cc cpu/arith64.cc cpu/arith8.cc cpu/cpu.cc cpu/cpu.h cpu/ctrl_xfer16.cc cpu/ctrl_xfer32.cc cpu/ctrl_xfer64.cc cpu/data_xfer16.cc cpu/data_xfer32.cc cpu/data_xfer64.cc cpu/debugstuff.cc cpu/exception.cc cpu/fetchdecode.cc cpu/flag_ctrl_pro.cc cpu/init.cc cpu/io_pro.cc cpu/lazy_flags.cc cpu/lazy_flags.h cpu/mult16.cc cpu/mult32.cc cpu/mult64.cc cpu/mult8.cc cpu/paging.cc cpu/proc_ctrl.cc cpu/segment_ctrl_pro.cc cpu/stack_pro.cc cpu/tasking.cc debug/dbg_main.cc debug/debug.h debug/sim2.cc disasm/dis_decode.cc disasm/disasm.h doc/docbook/Makefile docs-html/cosimulation.html fpu/wmFPUemu_glue.cc gui/amigaos.cc gui/beos.cc gui/carbon.cc gui/gui.cc gui/gui.h gui/keymap.cc gui/keymap.h gui/macintosh.cc gui/nogui.cc gui/rfb.cc gui/sdl.cc gui/siminterface.cc gui/siminterface.h gui/term.cc gui/win32.cc gui/wx.cc gui/wxmain.cc gui/wxmain.h gui/x.cc instrument/example0/instrument.cc instrument/example0/instrument.h instrument/example1/instrument.cc instrument/example1/instrument.h instrument/stubs/instrument.cc instrument/stubs/instrument.h iodev/cdrom.cc iodev/cdrom.h iodev/cdrom_osx.cc iodev/cmos.cc iodev/devices.cc iodev/dma.cc iodev/dma.h iodev/eth_arpback.cc iodev/eth_packetmaker.cc iodev/eth_packetmaker.h iodev/floppy.cc iodev/floppy.h iodev/guest2host.h iodev/harddrv.cc iodev/harddrv.h iodev/ioapic.cc iodev/ioapic.h iodev/iodebug.cc iodev/iodev.h iodev/keyboard.cc iodev/keyboard.h iodev/ne2k.h iodev/parallel.h iodev/pci.cc iodev/pci.h iodev/pic.h iodev/pit.cc iodev/pit.h iodev/pit_wrap.cc iodev/pit_wrap.h iodev/sb16.cc iodev/sb16.h iodev/serial.cc iodev/serial.h iodev/vga.cc iodev/vga.h memory/memory.h memory/misc_mem.cc	2002-10-25 11:44:41 +00:00
Bryce Denney	5e520261db	Add plugin support to Bochs by merging all the changes from the BRANCH_PLUGINS branch! Authors: Bryce Denney Christophe Bothamy Kevin Lawton (we grabbed a lot of plugin code from plex86) Testing help from: Volker Ruppert Don Becker (Psyon) Jeremy Parsons (Br'fin) The change log is too long to paste in here. To read the change log, do cvs log patches/patch.final-from-BRANCH_PLUGINS.gz All the changes and a detailed description are contained in a patch called patch.final-from-BRANCH_PLUGINS.gz. To look at the complete patch, do cvs upd -r1.1 patches/patch.final-from-BRANCH_PLUGINS.gz Then you will have a local copy of the patch, which you can gunzip and play with however you want. Modified Files: .bochsrc Makefile.in aclocal.m4 bochs.h config.h.in configure configure.in gdbstub.cc logio.cc main.cc pc_system.cc pc_system.h state_file.h bios/Makefile.in bios/rombios.c cpu/Makefile.in cpu/access.cc cpu/apic.cc cpu/arith16.cc cpu/arith32.cc cpu/arith8.cc cpu/cpu.cc cpu/cpu.h cpu/ctrl_xfer32.cc cpu/exception.cc cpu/fetchdecode.cc cpu/fetchdecode64.cc cpu/flag_ctrl.cc cpu/flag_ctrl_pro.cc cpu/init.cc cpu/io.cc cpu/logical16.cc cpu/logical32.cc cpu/logical8.cc cpu/paging.cc cpu/proc_ctrl.cc cpu/protect_ctrl.cc cpu/segment_ctrl_pro.cc cpu/shift16.cc cpu/shift32.cc cpu/stack64.cc cpu/string.cc cpu/tasking.cc debug/Makefile.in debug/dbg_main.cc disasm/Makefile.in doc/docbook/user/user.dbk dynamic/Makefile.in fpu/Makefile.in gui/Makefile.in gui/amigaos.cc gui/beos.cc gui/carbon.cc gui/control.cc gui/control.h gui/gui.cc gui/gui.h gui/keymap.cc gui/keymap.h gui/macintosh.cc gui/nogui.cc gui/rfb.cc gui/sdl.cc gui/sdlkeys.h gui/siminterface.cc gui/siminterface.h gui/term.cc gui/win32.cc gui/wx.cc gui/wxdialog.cc gui/wxdialog.h gui/wxmain.cc gui/wxmain.h gui/x.cc gui/keymaps/sdl-pc-de.map gui/keymaps/sdl-pc-us.map gui/keymaps/x11-pc-de.map instrument/example0/instrument.h instrument/example1/instrument.h instrument/stubs/instrument.cc 
instrument/stubs/instrument.h iodev/Makefile.in iodev/biosdev.cc iodev/biosdev.h iodev/cdrom.cc iodev/cmos.cc iodev/cmos.h iodev/devices.cc iodev/dma.cc iodev/dma.h iodev/eth_fbsd.cc iodev/eth_linux.cc iodev/eth_null.cc iodev/eth_tap.cc iodev/floppy.cc iodev/floppy.h iodev/guest2host.cc iodev/guest2host.h iodev/harddrv.cc iodev/harddrv.h iodev/iodebug.cc iodev/iodebug.h iodev/iodev.h iodev/keyboard.cc iodev/keyboard.h iodev/ne2k.cc iodev/ne2k.h iodev/parallel.cc iodev/parallel.h iodev/pci.cc iodev/pci.h iodev/pci2isa.cc iodev/pci2isa.h iodev/pic.cc iodev/pic.h iodev/pit.cc iodev/pit.h iodev/pit_wrap.cc iodev/pit_wrap.h iodev/sb16.cc iodev/sb16.h iodev/scancodes.cc iodev/scancodes.h iodev/serial.cc iodev/serial.h iodev/slowdown_timer.cc iodev/slowdown_timer.h iodev/unmapped.cc iodev/unmapped.h iodev/vga.cc iodev/vga.h memory/Makefile.in memory/memory.cc memory/memory.h memory/misc_mem.cc misc/bximage.c misc/niclist.c Added Files: README-plugins extplugin.h ltdl.c ltdl.h ltdlconf.h.in ltmain.sh plugin.cc plugin.h	2002-10-24 21:07:56 +00:00
Stanislav Shwartsman	194952a53d	Merged BOCHS-SSE branch	2002-10-16 17:37:35 +00:00
Kevin Lawton	66452e9898	Replaced tabs in cpu/*.{cc,h} files with spaces.	2002-10-04 17:04:33 +00:00
Kevin Lawton	a5537449cd	Split out reg-reg and reg-memory cases for a few other high-profile instructions, mainly variants of MOV. Had to update fetchdecode64 to keep it inline with the 32-bit mods.	2002-09-29 19:21:38 +00:00
Stanislav Shwartsman	d495bd75a6	After integration of SplitMod11b changes Bochs failed to compile in SMP mode. I fixed the compilation errors in CVS; somebody please check whether the fix is proper.	2002-09-28 09:38:58 +00:00
Kevin Lawton	08a89fe7b6	Performance mod: I implemented a suggestion from Peter Tattam and Jas Sandys-Lumsdaine to split out common instructions into variants which deal with the mod=11b case (Reg-Reg) and the other cases (which do memory ops). Actually, I only split MOV_GwEw and MOV_GdEd for now. According to some instrumentation of a Win95 boot, they were the most frequently used opcode by far.	2002-09-28 05:38:11 +00:00
Kevin Lawton	13a1e55f20	Committed patches/patch-bochs-instrumentation from Stanislav. Some things changed in the ctrl_xfer.cc, fetchdecode.cc, and cpu.cc since the original patches, so I did some patch integration by hand. Check the placement of the macros BX_INSTR_FETCH_DECODE_COMPLETED() and BX_INSTR_OPCODE() in cpu.cc to make sure I got them right. Also, I changed the parameters to BX_INSTR_OPCODE() to update them to the new code. I put some comments before each of these to help determine if the placement is right. These macros are only compiled in if you are gathering instrumentation data from bochs, so they shouldn't affect others.	2002-09-28 00:54:05 +00:00
Stanislav Shwartsman	e6adebfe2d	Added MMX opcodes to x86-64 mode. Fixed problem with fetching extra byte in ESCx opcodes if FPU is disabled.	2002-09-27 09:56:40 +00:00
Stanislav Shwartsman	f987ad036e	Changed BxError to UndefinedOpcode function for UD2 opcode (0F 0B)	2002-09-26 18:58:50 +00:00
Peter Tattam	a0d90e9b39	Implemented SYSCALL and SYSRET as part of x86-64 emulation. Since the SYSCALL replaces the LOADALL instruction, it is incompatible with earlier CPU types. At the moment, the SYSCALL is only enabled by x86-64 emulation, but the code can be incorporated in IA32 only emulations. Instructions added: 0F 05 SYSCALL (replaces LOADALL) 0F 07 SYSRET (new) TODO: restructure #if ... so that it can be used by non x86-64 emulations.	2002-09-25 12:54:41 +00:00
Kevin Lawton	b742ccec7e	Changed eflags accessors for get_?F() to use (val32 & (1<<N)) instead of (1 & (val32>>N)), and added a getB_?F() accessor for special cases which need a strict binary value (exactly 0 or 1). Most code only needed a value for logical comparison. I modified the special cases which do need a binary number for shifting and comparison between flags, to use the special getB_?F() accessor. Cleaned up memory.cc functions a little, now that all accesses are within a single page. Fixed a (not very likely encountered) bug in fetchdecode.cc (and fetchdecode64.cc) where a 2-byte opcode starting with a prefix starts at the last offset on a page. There were no checks on the segment overrides for a boundary condition. I added them. The eflags enhancements added just a tiny bit of performance.	2002-09-22 18:22:24 +00:00
Kevin Lawton	3bfeab23c9	Split out JZ/JNZ instructions from JCC because they were called so frequently. Coded asm() statements for INC/DEC_ERX() instructions. Cleaned up the iCache a little including a bug fix. The generation ID was decrementing the whole field including some high meta bits. That could roll over after 1 Billion cycles. I now only decrement if the field is valid, to save the write. I implemented inline functions which can serve the value of the arithmetic flags if they are cached, and redirect to the lazy_flags.cc routines if not. Most of this was just prep work for adding more asm() statements for native eflags processing when on x86.	2002-09-22 01:52:21 +00:00
Kevin Lawton	e2e219eda0	Modified the way that the register field (low 3 bits of a few opcodes also extended by the REX.B field on Hammer) is passed to instructions. I rearranged the bxInstruction_c to free up a field to be used to pass this info when mod-rm bytes are not used. This got rid of the ugly ((i->b1 & 7) + i->rex_b) code. Probably shaved just a very little run time off Hammer emulation, and even less on x86-32. The resultant is a little cleaner anyways.	2002-09-20 23:17:51 +00:00
Kevin Lawton	402d02974d	Moved the EFLAGS.RF check and clearing of inhibit_mask code in cpu.cc out of the main loop, and into the asynchronous events handling. I went through all the code paths, and there doesn't seem to be any reason for that code to be in the hot loop. Added another accessor for getting instruction data, called modC0(). A lot of instructions test whether the mod field of mod-nnn-rm is 0xc0 or not, ie., it's a register operation and not memory. So I flag this in fetchdecode{,64}.cc. This added on the order of 1% performance improvement for a Win95 boot. Macroized a few leftover calls to Write_RMV_virtual_xyz() that didn't get modified in the x86-64 merge. Really, they just call the real function for now, but I want to have them available to do direct writes with the guest2host TLB pointers.	2002-09-20 03:52:59 +00:00
Kevin Lawton	0cd7346b9c	- Added an instruction cache. Size is fixed for the moment, but if you hand edit cpu/cpu.h, and change BxICacheEntries, you can try different sizes. I'll make this more flexible with configure. For now, use "--enable-icache" with no parameters. - Modified fetchdecode.cc/fetchdecode64.cc just enough so that instructions which encode a direct address now use a memory resolution function which just sticks the immediate address into rm_addr. With cached instructions we need this.	2002-09-19 19:17:20 +00:00
Kevin Lawton	4e51dcae40	Converted all the remaining available separate fields in bxInstruction_c to bitfields. bxInstruction_c is now 24 bytes, including 4 for the memory addr resolution function pointer, and 4 for the execution function pointer (16 + 4 + 4). Coded more accessors, to abstract access from most code.	2002-09-18 08:00:43 +00:00
Kevin Lawton	6723ca9bf4	Moved more separate fields in the bxInstruction_c into bitfields with accessors. Had to touch a number of files to update the access using the new accessors. Moved rm_addr to the CPU structure, to slim down bxInstruction_c and to prevent future instruction caching from getting sprayed with writes to individual rm_addr fields. There only needs to be one. Though need to deal with instructions which have static non-modrm addresses, but which are using rm_addr since that will change. bxInstruction_c is down to about 40 bytes now. Trying to get down to 24 bytes.	2002-09-18 05:36:48 +00:00
Kevin Lawton	07b0df2a8a	Updated accessing of modrm/sib addressing information to use accessors. This lets me work on compressing the size of fetch-decode structure (now called bxInstruction_c). I've reduced it down to about 76 bytes. We should be able to do much better soon. I needed the abstraction of the accessors, so I have a lot of freedom to re-arrange things without making massive future changes. Lost a few percent of performance in these mods, but my main focus was to get the abstraction.	2002-09-17 22:50:53 +00:00
Kevin Lawton	3d4210fd3f	Got rid of a couple fields in BxInstruction_t that were no longer used. Also rearranged that struct a little to be more compressed. Over time, I'm going to reduce it further, for use with future accelerations.	2002-09-17 04:20:42 +00:00
Kevin Lawton	d16fcfce91	(cpu64) Merged fetchdecode.cc. Also, I had some problems with circular dependencies between 3 cpu related libs that I need as part of this transition. I changed the "ar rv" to "ld -i -o" to do an incremental load instead of an archive. Hope this doesn't break any platforms. We can reset this later.	2002-09-13 23:59:24 +00:00
Bryce Denney	be659a09b3	- check in Stanislav Shwartsman's patch "bochs-mmx.patch-endian-support". He writes: Detailed description: MMX instruction set support. Also supports BIG_ENDIAN systems. Tested on Solaris and HP1100. - modified files: configure.in cpu/Makefile.in cpu/cpu.h cpu/fetchdecode.cc cpu/proc_ctrl.cc fpu/fpu_system.h fpu/wmFPUemu_glue.cc - added files: cpu/i387.h cpu/mmx.cc	2002-09-09 16:11:25 +00:00
Kevin Lawton	3f2d28f86c	Added guest2host TLB tricks to read-modify-write variants of access routines in access.cc, completing the upgrade of those routines. You do need '--enable-guest2host-tlb', before you get the speedups for now. The guest2host mods seem pretty solid, though I do need to see what effects the A20 line has on this cache and the paging TLB in general.	2002-09-03 04:54:28 +00:00
Bryce Denney	daf2a9fb55	- add RCS Id to header of every file. This makes it easier to know what's going on when someone sends in a modified file.	2001-10-03 13:10:38 +00:00
Todd T.Fries	2bbb1ef8eb	strip '\n' from BX_{INFO,DEBUG,ERROR,PANIC} don't need it, moved the output of it into the general io functions. saves space, as well as removes the confusing output if a '\n' is left off	2001-05-30 18:56:02 +00:00
Bryce Denney	49664f7503	- parts of the SMP merge apparently broke the debugger and this revision tries to fix it. The shortcuts to register names such as AX and DL are #defines in cpu/cpu.h, and they are defined in terms of BX_CPU_THIS_PTR. When BX_USE_CPU_SMF=1, this works fine. (This is what bochs used for a long time, and nobody used the SMF=0 mode at all.) To make SMP bochs work, I had to get SMF=0 mode working for the CPU so that there could be an array of cpus. When SMF=0 for the CPU, BX_CPU_THIS_PTR is defined to be "this->" which only works within methods of BX_CPU_C. Code outside of BX_CPU_C must reference BX_CPU(num) instead. - to try to enforce the correct use of AL/AX/DL/etc. shortcuts, they are now only #defined when "NEED_CPU_REG_SHORTCUTS" is #defined. This is only done in the cpu/*.cc code.	2001-05-24 18:46:34 +00:00
Bryce Denney	e61d00351f	- merged BRANCH-smp-bochs into main branch. For details see comments in BRANCH-smp-bochs revisions. - The general task was to make multiple CPU's which communicate through their APICs. So instead of BX_CPU and BX_MEM, we now have BX_CPU(x) and BX_MEM(y). For an SMP simulation you have several processors in a shared memory space, so there might be processors BX_CPU(0..3) but only one memory space BX_MEM(0). For cosimulation, you could have BX_CPU(0) with BX_MEM(0), then BX_CPU(1) with BX_MEM(1). WARNING: Cosimulation is almost certainly broken by the SMP changes. - to simulate multiple CPUs, you have to give each CPU time to execute in turn. This is currently implemented using debugger guards. The cpu loop steps one CPU for a few instructions, then steps the next CPU for a few instructions, etc. - there is some limited support in the debugger for two CPUs, for example printing information from each CPU when single stepping.	2001-05-23 08:16:07 +00:00
Todd T.Fries	bdb89cd364	merge in BRANCH-io-cleanup. To see the commit logs for this use either cvsweb or cvs update -r BRANCH-io-cleanup and then 'cvs log' the various files. In general this provides a generic interface for logging. logfunctions:: is a class that is inherited by some classes, and also allocated as a standalone global called 'genlog'. All logging uses one of the ::info(), ::error(), ::ldebug(), ::panic() methods of this class through 'BX_INFO(), BX_ERROR(), BX_DEBUG(), BX_PANIC()' macros respectively. An example usage: BX_INFO(("Hello, World!\n")); iofunctions:: is a class that is allocated once by default, and assigned as the iofunction of each logfunctions instance. It is this class that maintains the file descriptor and other output related code, at this point using vfprintf(). At some future point, someone may choose to write a gui 'console' for bochs to which messages would be redirected simply by assigning a different iofunction class to the various logfunctions objects. More cleanup is coming, but this works for now. If you want to see a lot of debugging output, in main.cc, change onoff[LOGLEV_DEBUG]=0 to =1. Comments, bugs, flames, to me: todd@fries.net	2001-05-15 14:49:57 +00:00
Bryce Denney	a6fef54678	- update copyright dates to 2001 for all mandrake headers - for bochs files with other header, replaced with current mandrake header	2001-04-10 02:20:02 +00:00
cvs	beff63eb32	- entered original Bochs snapshot bochs-2000_0325a.tar.gz from ftp.bochs.com	2001-04-10 01:04:59 +00:00

417 Commits