llvm-6502/test
Benjamin Kramer 2f8a6cdfa3 X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available.
pmuludq is slow, but it turns out that all the unpacking and packing of the
scalarized mul is even slower. 10% speedup on loop-vectorized paq8p.

git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@170985 91177308-0d34-0410-b5e6-96231b3b80d8
2012-12-22 16:07:56 +00:00
..
Analysis Improve the X86 cost model for loads and stores. 2012-12-21 01:33:59 +00:00
Archive
Assembler
Bindings/Ocaml
Bitcode
BugPoint
CodeGen X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available. 2012-12-22 16:07:56 +00:00
DebugInfo Move these files over to the debug info directory. 2012-12-21 00:03:42 +00:00
ExecutionEngine
Feature
FileCheck
Instrumentation [msan] Remove unreachable blocks before instrumenting a function. 2012-12-21 11:18:49 +00:00
Integer
JitListener
Linker
MC [mips] Fix encoding of BAL instruction. Also, fix assembler test case which 2012-12-21 23:13:59 +00:00
Object Teach MachO which sections contain code 2012-12-17 17:59:32 +00:00
Other
Scripts
TableGen
tools
Transforms Fix a bug in the code that checks if we can vectorize loops while using dynamic 2012-12-21 00:07:35 +00:00
Unit
Verifier
YAMLParser
CMakeLists.txt
lit.cfg
lit.site.cfg.in
Makefile
Makefile.tests
TestRunner.sh