Tags: reverse_engineering 

Rating:

Step 1:
Extract the contents of the PyInstaller-generated executable file. The source code for ‘pyinstxtractor’ has been pasted inside the ex.py file.

Step 2:
Decompile the .pyc file using pycdas since pycdc won't work with the latest Python versions.

Step 3:
Use AI to reconstruct the source code part by part. Do not feed the entire text at once, as AI may approximate the code to the standard version, which won't help with the challenge.

Step 4:
After generating the entire code, you will notice that the compression algorithm is based on Huffman encoding. The differences compared to the standard Huffman code are outlined in the diary.

Step 5:
The algorithm encrypts characters based on their frequency of occurrence in the string provided in the diary. Input this frequency data into the source code to obtain the prefix codes used for each character.

Step 6:
Write a script to decrypt the given file using the extracted prefix codes.

Original writeup (https://github.com/CyberCell-Viit/VishwaCTF-24-Writeups/blob/main/VishwaCTF'24/Reverse%20Engineering/Bad%20Compression.pdf).