View Single Post
Old 09-08-2010, 12:22 PM   #48
vugtitan
Addict
vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.vugtitan ought to be getting tired of karma fortunes by now.
 
Posts: 299
Karma: 1042776
Join Date: Aug 2010
Device: none
Quote:
Originally Posted by wesleyer View Post
Maybe you need to read a little bit more about Information Theory.

Here goes a treatise on it: aK

You just need to uncompress it.
Just had a brilliant idea after reading about the compression algorithm on wiki.

the problem is that the dictionary in say winrar/winzip records the location frequency and recurrence of each letter in a table of the compressed file and uses this table to uncompress it.

The dictionary cannot be compressed anymore as it stands.


for example:-

"Ask not what your country can do for you -- ask what you can do for your country."

The quote has 17 words, made up of 61 letters, 16 spaces, one dash and one period. If each letter, space or punctuation mark takes up one unit of memory, we get a total file size of 79 units. To get the file size down, we need to look for redundancies.

Immediately, we notice that:

* "ask" appears two times
* "what" appears two times
* "your" appears two times
* "country" appears two times
* "can" appears two times
* "do" appears two times
* "for" appears two times
* "you" appears two times



Now my idea is why not have a separate dictionary for the primary dictionary which will specify the primary dictionary's dataset.because the primary dictionary is 10 times smaller then the secondary dictionary which describes how to uncompress the primary dictionary and the primary dictionary will expand the compressed file.Thus the secondary dictionary simply ONLY relates to the primary dictionary and how to create it.

thus you would have 2 separate files on compressing rar1 and rar2 and rar2 only relates to rar1 dictionary's creation and then rar1 will recreate original file.

by this method you could achieve nearly unlimited compression for not just text but video too.just imagine a dvd video file downloading in seconds.

could you people give me a critical analysis of any mistakes in the above above idea?
vugtitan is offline   Reply With Quote