llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-09-14 20:57:01 +00:00

History

Hal Finkel 15c773b6f2 Add a scheduling model (with itinerary) for the PPC POWER7 This adds a scheduling model for the POWER7 (P7) core, and enables the machine-instruction scheduler when targeting the P7. Scheduling for the P7, like earlier ooo PPC cores, requires considering both dispatch group hazards, and functional unit resources and latencies. These are both modeled in a combined itinerary. Dispatch group formation is still handled by the post-RA scheduler (which still needs to be updated for the P7, but nevertheless does a pretty good job). One interesting aspect of this change is that I've also enabled to use of AA duing CodeGen for the P7 (just as it is for the embedded cores). The benchmark results seem to support this decision (see below), and while this is normally useful for in-order cores, and not for ooo cores like the P7, I think that the dispatch slot hazards are enough like in-order resources to make the AA useful. Test suite significant performance differences (where negative is a speedup, and positive is a regression) vs. the current situation: MultiSource/Benchmarks/BitBench/drop3/drop3 with AA: N/A without AA: -28.7614% +/- 19.8356% (significantly against AA) MultiSource/Benchmarks/FreeBench/neural/neural with AA: -17.7406% +/- 11.2712% without AA: N/A (significantly in favor of AA) MultiSource/Benchmarks/SciMark2-C/scimark2 with AA: -11.2079% +/- 1.80543% without AA: -11.3263% +/- 2.79651% MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt with AA: -41.8649% +/- 17.0053% without AA: -34.5256% +/- 23.7072% MultiSource/Benchmarks/mafft/pairlocalalign with AA: 25.3016% +/- 17.8614% without AA: 38.6629% +/- 14.9391% (significantly in favor of AA) MultiSource/Benchmarks/sim/sim with AA: N/A without AA: 13.4844% +/- 7.18195% (significantly in favor of AA) SingleSource/Benchmarks/BenchmarkGame/Large/fasta with AA: 15.0664% +/- 6.70216% without AA: 12.7747% +/- 8.43043% SingleSource/Benchmarks/BenchmarkGame/puzzle with AA: 82.2713% +/- 26.3567% without AA: 75.7525% +/- 41.1842% SingleSource/Benchmarks/Misc/flops-2 with AA: -37.1621% +/- 20.7964% without AA: -35.2342% +/- 20.2999% (significantly in favor of AA) These are 99.5% confidence intervals from 5 runs per configuration. Regarding the choice to turn on AA during CodeGen, of these results, four seem significantly in favor of using AA, and one seems significantly against. I'm not making this decision based on these numbers alone, but these results seem consistent with results I have from other tests, and so I think that, on balance, using AA is a win. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@195981 91177308-0d34-0410-b5e6-96231b3b80d8		2013-11-30 20:55:12 +00:00
..
Analysis	[PM] Split the CallGraph out from the ModulePass which creates the	2013-11-26 04:19:30 +00:00
AsmParser	Make it explicit that nulls are not allowed in names.	2013-11-19 21:12:39 +00:00
Bitcode
CodeGen	Reverse the order of eviction checks for possible compile time savings. No functionality.	2013-11-29 23:49:38 +00:00
DebugInfo	DebugInfo: Avoid emitting pubtype entries for type DIEs that just indirect to a type unit.	2013-11-26 00:22:37 +00:00
ExecutionEngine	Use the mangler consistently instead of using getGlobalPrefix directly.	2013-11-28 08:59:52 +00:00
IR	Fix spurious return introduced by my earlier patch to DebugInfo	2013-11-26 18:54:37 +00:00
IRReader
Linker
LTO
MC	The global prefix is always one char. Don't use a string for it.	2013-11-28 17:00:49 +00:00
Object
Option
Support	Lift self-copy protection up to the header file and add self-move	2013-11-26 00:54:44 +00:00
TableGen
Target	Add a scheduling model (with itinerary) for the PPC POWER7	2013-11-30 20:55:12 +00:00
Transforms	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)).	2013-11-28 21:38:05 +00:00
CMakeLists.txt
LLVMBuild.txt
Makefile