llvm-6502

mirror of https://github.com/c64scene-ar/llvm-6502.git synced 2024-12-14 11:32:34 +00:00

History

Elena Demikhovsky 1596373671 Shuffle optimization for AVX/AVX2. The current patch optimizes frequently used shuffle patterns and gives these instruction sequence reduction. Before: vshufps $-35, %xmm1, %xmm0, %xmm2 ## xmm2 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm2, %xmm2 ## xmm2 = xmm2[0,2,1,3] vextractf128 $1, %ymm1, %xmm1 vextractf128 $1, %ymm0, %xmm0 vshufps $-35, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm0, %xmm0 ## xmm0 = xmm0[0,2,1,3] vinsertf128 $1, %xmm0, %ymm2, %ymm0 After: vshufps $13, %ymm0, %ymm1, %ymm1 ## ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4] vshufps $13, %ymm0, %ymm0, %ymm0 ## ymm0 = ymm0[1,3,0,0,5,7,4,4] vunpcklps %ymm1, %ymm0, %ymm0 ## ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5] git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@159188 91177308-0d34-0410-b5e6-96231b3b80d8		2012-06-26 08:04:10 +00:00
..
Analysis	Enable the new LoopInfo algorithm by default.	2012-06-26 04:11:38 +00:00
Archive
AsmParser	Extend the IL for selecting TLS models (PR9788)	2012-06-23 11:37:03 +00:00
Bitcode	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen.	2012-06-24 13:32:01 +00:00
CodeGen	Update a bunch of stale comments that dated from when this folled the	2012-06-26 05:16:37 +00:00
DebugInfo	Fix a C++11 UDL conflict.	2012-04-04 20:33:56 +00:00
ExecutionEngine	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen.	2012-06-24 13:32:01 +00:00
Linker	Extend the IL for selecting TLS models (PR9788)	2012-06-23 11:37:03 +00:00
MC	EmitZerofill should take a 64-bit size or else it's chopping off large zero-filled global. rdar://11729134	2012-06-22 20:14:46 +00:00
Object	Added accessors for getting coff_relocation info	2012-06-18 19:47:16 +00:00
Support	PR13013: ELF Type identification fails for MSB type ELF files.	2012-06-25 14:48:43 +00:00
TableGen	Emit TableGen's header comment with C-style comments, so it can be used from C89 code.	2012-06-19 17:04:16 +00:00
Target	Shuffle optimization for AVX/AVX2.	2012-06-26 08:04:10 +00:00
Transforms	revert my previous commit (r159173), since as Eli pointed out, it's perfectly ok to mark realloc as noalias	2012-06-25 23:26:10 +00:00
VMCore	improve optimization of invoke instructions:	2012-06-25 17:11:47 +00:00
CMakeLists.txt
LLVMBuild.txt
Makefile