Emmanuel Marty
410544f4e6
Fast 6502 LZSA2 depacker: smaller size, same speed
2020-01-03 10:01:23 +01:00
introspec
96b9933bd3
remove temporary label
2020-01-02 14:32:30 +00:00
introspec
9cd9fa5939
-2 bytes
...
(same speed)
2020-01-02 14:30:00 +00:00
Emmanuel Marty
8721c11041
Add faster LZSA2 depacker by jbrandwood
2019-12-24 12:02:34 +01:00
Emmanuel Marty
a46796b6a9
Fix assembling of 6502 fast v1 depacker
2019-12-24 10:16:01 +01:00
Peter Ferrie
302234a91b
fast v2
2019-10-31 23:01:00 -07:00
Peter Ferrie
9fc9a49d67
fast v1
2019-10-31 22:20:02 -07:00
mobygamer
30192238ea
Rewrite 8088 jumptable decompressor for maximum speed
...
This is a rewrite of LZSA1JMP.ASM to use a 256-element jumptable, which
allows the code to handle all of the hot paths (common cases) without
any branching. This not only reduces branches (which are very costly on
x86) to a bare minimum, but also grants us foreknowledge in a decode
path of what steps can be skipped.
The new code is 12.7% faster than the old code, and assembles to less
than 3K of object code and data.
2019-10-26 23:34:24 -05:00
introspec
d5d788946e
Added an option for unrolling long match copying
...
Usually useless and costing +57 bytes, this option can bring dramatic performance improvements on very compressible data dominated by long matches
2019-10-22 20:11:46 +01:00
introspec
495a12216f
-1 byte
...
Very slightly faster too
2019-10-11 00:23:43 +01:00
introspec
566e3a94e8
+0.2% speed
...
also, added an option to unroll LDIR for longer matches (which adds 38 bytes, but can be significantly faster for files with many long matches)
2019-10-10 22:50:23 +01:00
Emmanuel Marty
6a62f7d795
Update Z80 depackers changes history
2019-09-26 11:42:52 +02:00
Emmanuel Marty
681f78d1e8
Rename
2019-09-26 07:48:59 +02:00
Emmanuel Marty
8015ab8650
Rename
2019-09-26 07:48:44 +02:00
Emmanuel Marty
2f15298343
Rename
2019-09-26 07:48:33 +02:00
Emmanuel Marty
648a308d87
Rename
2019-09-26 07:48:19 +02:00
Emmanuel Marty
587a92f4ab
Rename Z80 depackers, add version history to LZSA1
2019-09-26 07:47:43 +02:00
Emmanuel Marty
7d9135c548
Update Z80 decompressors
2019-09-25 08:09:18 +02:00
Emmanuel Marty
9de7e930e9
Faster LZSA1 z80 decompression
2019-08-27 13:16:20 +02:00
uniabis
a807344343
2bytes shorter
2019-08-22 12:55:55 +09:00
introspec
be30cae636
-1 byte
...
slightly slower, but this is the size-optimized branch
2019-08-06 12:36:27 +01:00
introspec
44bff39de3
New faster and shorter decompressors
...
This update is mostly about better integration of improvements by uniabis, with spke contributing several smaller size optimizations.
2019-08-01 15:07:14 +01:00
introspec
e7bb1faece
Merge branch 'master' into master
2019-07-31 23:24:30 +01:00
introspec
51ef92cdab
incorporated improvements by uniabis
...
also, slightly faster decompression for fast packer in backwards mode
2019-07-31 20:42:47 +01:00
uniabis
8d0528fddc
hd64180 support
...
a bit faster, a bit smaller
2019-07-31 01:39:27 +09:00
Emmanuel Marty
0a04796b19
Fix for z80 LZSA2 fast backward depacker
2019-07-27 15:39:44 +02:00
introspec
ac3bf78273
fix a bug in the backward version of unlzsa2_fast_v1.asm
...
an INC HL slipped through
2019-07-27 14:14:54 +01:00
Emmanuel Marty
82edcb8bb5
Fix literal runs that are multiple of 256 bytes
2019-07-27 01:35:46 +02:00
Emmanuel Marty
ae4cc12aed
Use ACME syntax
2019-07-26 12:31:26 +02:00
Emmanuel Marty
fd70be918c
Merge pull request #19 from specke/master
...
Support for -b in Z80 decompressors
2019-07-24 20:09:48 +02:00
Emmanuel Marty
4835e4c26c
Support backward decompression
2019-07-24 20:08:23 +02:00
introspec
cca79e3e59
Delete unlzsa_small_v1.asm
2019-07-24 17:31:22 +01:00
introspec
607b26d337
Delete unlzsa_fast_v1.asm
2019-07-24 17:31:14 +01:00
introspec
fd61f403ad
LZSA1 decompressors with added support for -b.
2019-07-24 17:30:37 +01:00
introspec
fcfba056d2
Add files via upload
...
LZSA2 decompressors with support for -b option.
2019-07-24 17:28:39 +01:00
Emmanuel Marty
081a29a3db
Fix copying multiples of 256 bytes
2019-07-14 16:14:55 +02:00
Emmanuel Marty
710d7e05d6
Fix comments
2019-07-14 10:13:16 +02:00
Emmanuel Marty
0b540431fc
Fix comments
2019-07-14 10:12:32 +02:00
Emmanuel Marty
981b1d5925
NASM versions of Jim Leonard's speed-optimized depackers
2019-07-14 10:11:16 +02:00
Emmanuel Marty
19e8bc0468
Merge pull request #17 from MobyGamer/decompressor/8086_speed
...
Time-efficient LZSA2 decompressor
2019-07-14 09:28:40 +02:00
mobygamer
c38b582e73
Time-efficient LZSA2 decompressor
...
This commit provides a time-effecient LZSA2 decompressor for
the 8088 (and higher) CPU. Decompression speed is roughly 50% faster
than ZX7 on the same hardware.
2019-07-13 22:35:28 -05:00
Emmanuel Marty
6445d9ff6f
Merge pull request #16 from peterferrie/master - thanks!
...
smaller
2019-07-12 10:34:58 +02:00
Peter Ferrie
6797fe6268
smaller
2019-07-11 17:20:38 -07:00
Emmanuel Marty
7195104c73
Merge pull request #15 from MobyGamer/decompressor/8086_speed
...
Submit 8086-optimized decompressor using jump tables
2019-07-12 00:59:41 +02:00
mobygamer
f123a6d9df
Submit 8086-optimized decompressor using jump tables
...
Because the 8086's BIU is more efficient and can read a word in
a single operation, we can skew LZSA1 decompression faster on 8086
by using a jump table to eliminate 2 comparions.
2019-07-11 17:22:29 -05:00
Emmanuel Marty
4d7d90b893
Merge pull request #14 from MobyGamer/decompressor/8086_speed
...
Additional minor speedups
2019-07-11 09:01:14 +02:00
mobygamer
638c33b432
Additional minor speedups
2019-07-11 00:58:46 -05:00
Peter Ferrie
3bfe60de44
use BYTE
2019-07-10 18:03:42 -07:00
Peter Ferrie
fb53706361
smaller
2019-07-10 18:02:27 -07:00
Emmanuel Marty
d3d62c3bf0
Use BYTE
2019-07-11 01:52:13 +02:00
Peter Ferrie
7861ef1552
smaller
2019-07-10 16:04:42 -07:00
Peter Ferrie
8e26fa9cac
smaller
2019-07-10 10:57:41 -07:00
Peter Ferrie
a1841e5e5d
smaller
2019-07-10 10:48:15 -07:00
Peter Ferrie
68339ad961
smaller
2019-07-09 18:44:37 -07:00
Peter Ferrie
15c9059adf
smaller
2019-07-09 18:30:29 -07:00
Peter Ferrie
e85cb8a5bb
smaller
2019-07-09 16:49:01 -07:00
Emmanuel Marty
467cd1970f
Merge pull request #9 from MobyGamer/decompressor/8086_speed
...
Additional 1% speedup from deferring work
2019-07-09 20:42:17 +02:00
mobygamer
9a180a59f9
Additional 1% speedup from to introspec suggestions
2019-07-09 13:05:10 -05:00
Emmanuel Marty
b7b3ca1907
Merge pull request #8 from MobyGamer/decompressor/8086_speed - thank you!
...
8088-speed-optimized LZSA1 decompression
2019-07-08 09:17:51 +02:00
mobygamer
b0c9d61aab
8088-speed-optimized LZSA1 decompression
...
This commit contributes LZSA1 raw block format decompression code,
optimized for speed for the 8088 and higher processors. With
appropriate compression options (-f1 -m5), decompression speed is
comparable to LZ4.
2019-07-07 20:25:50 -05:00
Peter Ferrie
111403c052
a little more
2019-07-07 12:05:02 -07:00
Emmanuel Marty
fe0dcf107f
Use db rather than .byte
2019-07-04 18:09:29 +02:00
Peter Ferrie
32b98f3085
a bit faster, a bit smaller
2019-07-03 19:38:27 -07:00
Emmanuel Marty
da4f6f7882
Restore CLC
2019-07-03 23:39:05 +02:00
Peter Ferrie
6f46c60259
slightly faster
2019-07-03 12:11:50 -07:00
Emmanuel Marty
8d8f50a509
Don't use 80186 opcodes. Issue #3
2019-06-27 18:32:49 +02:00
Emmanuel Marty
5a531cd4ce
Remove
2019-06-21 17:40:33 +02:00
Emmanuel Marty
d1245c7d6a
Fix small typo in z80 depackers
2019-06-21 17:39:50 +02:00
Emmanuel Marty
931f5fff9b
Add Z80 decompressors for LZSA2
2019-06-10 13:24:15 +02:00
Emmanuel Marty
435e157239
Save 3 more bytes
2019-06-08 23:59:34 +02:00
Emmanuel Marty
aab82fa022
Save 9 bytes in 6502 LZSA1 depacker
2019-06-08 23:52:18 +02:00
Emmanuel Marty
659f6c14a9
Small improvement over 6502 LZSA2 depacker
2019-06-08 16:18:36 +02:00
Emmanuel Marty
79ed7bf91e
Further update LZSA2 format; avoid name conflicts
2019-06-08 13:35:03 +02:00
Emmanuel Marty
272f2e7a29
Update LZSA2 6502 and 8088 depackers
2019-06-07 23:22:34 +02:00
emmanuel-marty
d70830b525
Simplify nibble handling in LZSA2 8088 depacker
2019-05-11 11:42:00 +02:00
emmanuel-marty
8b7b4a2b4f
Check in LZSA2 implementation (ratio competitive with ZX7, faster decompression)
2019-05-09 16:51:29 +02:00
emmanuel-marty
97dd3ffc1f
Update fast Z80 decompressor
2019-04-25 17:19:58 +02:00
emmanuel-marty
2b9780bd65
Finalize lzsa1 compressed format, speed up and simplify decompression
2019-04-24 09:47:40 +02:00
emmanuel-marty
f837ed096e
Add size-optimized Z80 decompressor, contributed by spke
2019-04-21 08:15:43 +02:00
emmanuel-marty
4c5d44fff4
Improve hotpath of 6502 depacker further
2019-04-10 19:41:41 +02:00
emmanuel-marty
8c3fc6d652
Simplify short offset handling in 6502 depacker
2019-04-10 17:37:36 +02:00
emmanuel-marty
c220694531
Add 6502 decompressor
2019-04-10 17:30:24 +02:00
emmanuel-marty
02592cfe3b
Fix typo in 8088 decompressor comments
2019-04-10 17:30:13 +02:00
emmanuel-marty
be54a4f889
Check in optimized-for-speed Z80 decompressor contributed by spke
2019-04-07 00:01:42 +02:00
emmanuel-marty
e24320b23b
Save 1 byte in 8088 decompressor
2019-04-06 00:21:15 +02:00
emmanuel-marty
a785010448
Revert token to O|LLL|MMMM; revert to always shifting the match offset by 1; set raw block end marker as a large zero-size match
2019-04-05 23:16:05 +02:00
emmanuel-marty
1ef1ad8111
Reorganize token byte for faster decoding on 8-bit CPUs, without affecting the compression ratio
2019-04-05 11:58:44 +02:00
emmanuel-marty
c7692cf688
Store 16-bit lengths and match offsets directly, to simplify decompression on 8-bit CPUs without affecting the compression ratio
2019-04-05 10:42:06 +02:00
emmanuel-marty
0744ec99de
Unpack raw blocks in 8088 decompressor
2019-04-03 13:05:32 +02:00
emmanuel-marty
06396f5ba6
Save 2 bytes in 8088 decompressor
2019-04-02 13:21:45 +02:00
marty-emmanuel
e216b0c544
Initial checkin
2019-04-01 18:04:56 +02:00