Googles appar
Huvudmeny

Post a Comment On: cbloom rants

"09-25-11 - More on LZ String Matching"

5 Comments -

1 – 5 of 5
Blogger Shelwien said...

Do you have any comments about applying BWT suffix arrays for LZ matchfinding? For example, its used in bsdiff.

September 26, 2011 at 6:08 AM

Blogger cbloom said...

That's actually what I use for my optimal parsing LZ at the moment. I wrote a note about it here :

http://cbloomrants.blogspot.com/2010/06/06-17-10-suffix-array-neighboring-pair.html

The big advantage of it is that "divsufsort" by Yuta Mori is really superb.

It will be part of my test set.

September 26, 2011 at 8:53 AM

Blogger doynax said...

I know this is a little out of left-field but do you happen know of any algorithms for static dictionary coding? I usually work on embedded system with rather limited RAM where such algorithms would be useful.

Articles on dictionary coding usually mention such methods in passing, and there's plenty of papers on searching existing dictionaries of course, but I'm having trouble finding much of anything on generating them in the first place.

Sorry for bugging you but I'd appreciate a nudge in the right direction. I never did get the hang of this literature search thing and I usually manage to get stuck behind some paywall besides ;)

September 26, 2011 at 12:53 PM

Blogger cbloom said...

@doynax :

"Off-Line Dictionary-Based Compression"

Larsson & Moffat

September 26, 2011 at 3:41 PM

Blogger doynax said...

Thank you!

I had considered stripping off ineffective codewords from a LZ-78 dictionary in an ad-hoc fashion, but this approach should actually work ;)

It'll be interesting to see what sort of compression I'll get out of it on our data, especially without using much in the way of entropy coding.

September 27, 2011 at 11:31 PM

You can use some HTML tags, such as <b>, <i>, <a>

This blog does not allow anonymous comments.

Comment moderation has been enabled. All comments must be approved by the blog author.

You will be asked to sign in after submitting your comment.