TIL you can de-obfuscate code with ChatGPT

TIL you can de-obfuscate code with ChatGPT (twitter.com)

225 points by alexandr1us 3 years ago | 106 comments

shever73 3 years ago |

https://twitter.com/AlexAlexandrius/status/16178852870484254...

So, all I've learned is that ChatGPT knows the obfuscated and de-obfuscated versions of code that it itself has generated.

letmevoteplease 3 years ago | |

I just tested it on a little snippet of my code obfuscated with https://obfuscator.io/ and it worked almost perfectly.

My original code:

  function resizeImage(img) {
    var maxHeight = 350;
    var ratio = 1;
    if (img.height > maxHeight) {
      ratio = maxHeight / img.height;
    }

    var width = img.width * ratio;
    var height = img.height * ratio;

    var canvas = document.createElement('canvas');
    canvas.height = height;
    canvas.width = width;
    var ctx = canvas.getContext('2d');
    ctx.drawImage(img, 0, 0, width, height);
    return canvas.toDataURL("image/jpeg", 0.8);
  }

ChatGPT's answer: https://i.imgur.com/5jgPMEd.png

sdflhasjd 3 years ago | | |

Interesting why it changes 350px to 600px.

Also interesting is how its explanation of the deobfuscated code, although broadly correct in terms of goal, doesn't accurately describe the steps. Almost as if it's disregarding the code altogether and merely describing another implementation of "resizeImage".
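A changed constant like that is easy to catch by running both versions on the same inputs. A minimal sketch, assuming the sizing logic is pulled out of the canvas code into a pure function; `originalSize`, `candidateSize` and `findMismatches` are hypothetical names, and 600 stands in for the value in ChatGPT's answer:

```javascript
// Sizing logic from the original resizeImage, without the canvas work.
function originalSize(img) {
  const maxHeight = 350;
  const ratio = img.height > maxHeight ? maxHeight / img.height : 1;
  return { width: img.width * ratio, height: img.height * ratio };
}

// Hypothetical stand-in for the "deobfuscated" version with the altered limit.
function candidateSize(img) {
  const maxHeight = 600;
  const ratio = img.height > maxHeight ? maxHeight / img.height : 1;
  return { width: img.width * ratio, height: img.height * ratio };
}

// Return the inputs on which the two versions disagree.
function findMismatches(reference, candidate, inputs) {
  return inputs.filter((img) => {
    const a = reference(img);
    const b = candidate(img);
    return a.width !== b.width || a.height !== b.height;
  });
}

const samples = [
  { width: 100, height: 100 },   // below both limits
  { width: 800, height: 400 },   // between 350 and 600
  { width: 1600, height: 1200 }, // above both limits
];
const mismatches = findMismatches(originalSize, candidateSize, samples);
console.log(mismatches.length + " of " + samples.length + " inputs disagree");
```

Only images taller than 350px expose the difference, so the sample set needs at least one input above the original limit.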

shever73 3 years ago | | |

Interesting, I tried it on the sample code at https://obfuscator.io, which was just:

    function hi() {
      console.log("Hello World!");
    }
    hi();

ChatGPT was unable to deobfuscate. Here's the answer: https://i.imgur.com/20XhPw6.png

buster 3 years ago | |

Which is not runnable in the first place. Interesting though, as this shows exactly the problem. It looks legit, but it's some generated fake text.

ok123456 3 years ago | | |

It tends to invent whole fake packages.

crizzlenizzle 3 years ago |

Is it correct though?

I’ve been toying around with ChatGPT for a few weeks now and I encountered a few situations in which ChatGPT was like 90% accurate at best. Things like suggesting snippets of configuration files or plugin research. It’s good to get an idea and get started somewhere, but I certainly cannot trust it blindly.

gooseus 3 years ago | |

What I've been telling everyone is that you cannot (should not) ask ChatGPT a question whose answer you cannot independently verify yourself.

This is kind of what makes it good for generating code, because everything it generates can be pretty quickly verified and validated by another machine (interpreter/compiler).

Makes it not so great for writing essays on books you didn't read, and especially for doing math you don't understand... because it can't do math AT ALL.

imtringued 3 years ago | | |

I was kind of thinking about this.

Let's hypothetically assume we have some sort of AGI and we can ask it to write programs and text and nothing else.

Is there anyone on this planet who would think that they don't need to look at the generated code? I mean imagine a manager simply feeding in tickets and getting a finished application out without ever knowing how it was produced.

The application is business critical and any kind of mistake could ruin his business which puts the manager at complete mercy of the AI.

Now you might say that this happens with humans as well but when humans cause problems we let other humans review and test their code.

AI causes problems? Let's add more humans. Wait a minute...

int_19h 3 years ago | | |

> everything it generates can be pretty quickly verified and validated by another machine

It can be verified in the sense that it builds, but that doesn't mean that it actually does what you asked it to do, or that it does it on all valid inputs. The worst bugs to track down are silent logic bugs.
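A small illustration of that kind of bug (my own example, not taken from any ChatGPT output): the function below runs without errors and gives the right answer on the obvious test, but is wrong as soon as the inputs have different digit counts.

```javascript
// Looks reasonable and runs cleanly, but Array.prototype.sort() compares
// elements as strings by default, so 10 sorts before 9.
function medianBuggy(values) {
  const sorted = [...values].sort();
  return sorted[Math.floor(sorted.length / 2)];
}

// Same function with a numeric comparator.
function medianFixed(values) {
  const sorted = [...values].sort((a, b) => a - b);
  return sorted[Math.floor(sorted.length / 2)];
}

console.log(medianBuggy([3, 1, 2]));  // 2, correct by luck
console.log(medianBuggy([9, 10, 1])); // 10, silently wrong
console.log(medianFixed([9, 10, 1])); // 9
```

Neither an interpreter nor a quick smoke test flags the first version; only a check against inputs you already know the answer to does.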

danenania 3 years ago | | |

For math, I'm kind of surprised that it can't recognize "this is math" and then handle that with normal calculations instead of the language model. I assume we'll see that before long.
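A rough sketch of what that routing could look like, assuming a simple "does this look like arithmetic?" check in front of the model. `askModel` is a hypothetical callback standing in for the language model, and the detection regex is deliberately naive:

```javascript
// Evaluate + - * / and parentheses with a small recursive-descent parser,
// so no eval() is involved.
function evaluateArithmetic(text) {
  const tokens = text.match(/\d+(?:\.\d+)?|[-+*/()]/g) || [];
  let pos = 0;

  function parseExpression() {
    let value = parseTerm();
    while (tokens[pos] === "+" || tokens[pos] === "-") {
      const op = tokens[pos++];
      const rhs = parseTerm();
      value = op === "+" ? value + rhs : value - rhs;
    }
    return value;
  }

  function parseTerm() {
    let value = parseFactor();
    while (tokens[pos] === "*" || tokens[pos] === "/") {
      const op = tokens[pos++];
      const rhs = parseFactor();
      value = op === "*" ? value * rhs : value / rhs;
    }
    return value;
  }

  function parseFactor() {
    const token = tokens[pos++];
    if (token === "(") {
      const value = parseExpression();
      if (tokens[pos++] !== ")") throw new SyntaxError("expected ')'");
      return value;
    }
    if (token === "-") return -parseFactor(); // unary minus
    if (token === undefined || !/^\d/.test(token)) {
      throw new SyntaxError("unexpected token: " + token);
    }
    return Number(token);
  }

  const result = parseExpression();
  if (pos !== tokens.length) throw new SyntaxError("trailing input");
  return result;
}

// Only digits, whitespace, operators, parentheses and dots count as "math".
const LOOKS_LIKE_MATH = /^[\d\s+\-*/().]+$/;

// Route arithmetic to the calculator and everything else to the model.
function answer(question, askModel) {
  if (LOOKS_LIKE_MATH.test(question) && /\d/.test(question)) {
    return String(evaluateArithmetic(question));
  }
  return askModel(question);
}

console.log(answer("12345 * 6789", () => "(model)")); // 83810205
console.log(answer("Why is the sky blue?", () => "(model)")); // (model)
```

The hard part in practice is the detection step, since real questions mix prose and numbers; this sketch only handles input that is nothing but an expression.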

boplicity 3 years ago | | |

I'm leaning towards using it for things I already know exactly how to do -- including a very clear idea of the result. In these contexts, it can save some mental workload / time.

Jeema101 3 years ago | |

Don't think so. There's clearly the beginning of a while loop near the top of the obfuscated version. There are no loops at all in the 'de-obfuscated' version.

cocomutator 3 years ago | | |

Here's GPT's own explanation of what the purpose of that while loop is:

---

This code uses JavaScript's `eval` function to obfuscate the code by looping over an array of strings and passing them as arguments to `eval` to create a variable. It also uses an anonymous function to obfuscate the code. The code is deobfuscated by replacing the `eval` function and the anonymous function with their respective strings.

bobkazamakis 3 years ago | | |

That's not generally a good enough indicator; plenty of obfuscation involves loops that otherwise aren't hit.

nabakin 3 years ago | | |

The while loop does not come from the original code. Probably part of the obfuscation https://twitter.com/AlexAlexandrius/status/16178998824000839...
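For anyone wondering how a loop can appear out of nowhere: a common trick in JavaScript obfuscators is to move string literals into a shuffled array and un-shuffle it at startup. The following is a hand-written, heavily simplified imitation of that pattern, not actual obfuscator.io output; real tools use an opaque checksum rather than the readable check assumed here:

```javascript
// The strings are stored rotated out of their expected order...
const table = ["Hello World!", "log"];

// ...and a loop at the top rotates the array until a check passes.
// This loop exists only in the obfuscated form, never in the original source.
while (table[0] !== "log") {
  table.push(table.shift());
}

// Call sites then look strings up by index instead of naming them.
const lookup = (index) => table[index];

function hi() {
  console[lookup(0)](lookup(1));
}
hi(); // prints "Hello World!"
```

So a correct deobfuscation can legitimately contain no loops at all, which is why the presence of a loop alone says little either way.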

seanhunter 3 years ago | |

Yeah, an example I was shown was Python code to process some data. It was 30 lines of correct-looking trivial boilerplate code, except for one regex to do the actual processing. The regex was hopelessly wrong.

Clearly if you didn't know how to write the other 29 lines of code there's no way you are going to be able to debug the regex.

vlunkr 3 years ago | | |

The optimistic way to look at it though is that it wrote the boring 29 lines that you didn't want to write and got you straight to the actual problem that needs solving.

Lutger 3 years ago | |

90% accurate sounds impressive, and it is, but it's still 100% incorrect almost always.

euroderf 3 years ago | | |

But does it follow the 80/20 rule?

In this case, 80% of the answer for 20% of the effort?

hangonhn 3 years ago | |

It has this really amazing and terrifying quality of being a really good bullshitter. I asked it an AWS question once and it gave me 4 very convincing-sounding answers. I went to try them. 2 of them were complete bullshit, as in the commands don't even exist. The only good answer was the one I already had. It's in this uncanny valley of bullshitting. Can be quite dangerous in some situations, especially if one is lulled into trusting it.

gjvc 3 years ago | | |

I recognize what you are describing and I actually think that its predisposition to doing this has become worse in the past week or so.

zahrc 3 years ago | |

In my experience, ChatGPT often comes up with pseudo syntax.

folkrav 3 years ago | | |

It often happens that ChatGPT will confidently give you something that _looks_ like what you're asking for despite it being awfully wrong - sometimes you can make it "understand" its mistake and correct it, sometimes not. It's usually not that far off, but trusting it blindly is just out of the question.

generalizations 3 years ago | |

It's good enough. I know zero PowerShell, but I know other languages enough to understand the common grammar. With ChatGPT I'm a fairly rapid PowerShell programmer right off the bat, as evidenced by the script I've been writing this afternoon. I don't know any of the (overcomplicated) syntax, but now I don't have to.

nabakin 3 years ago | |

This is the original code https://twitter.com/AlexAlexandrius/status/16178998824000839...

It's very similar to the deobfuscated version, but ChatGPT wrote the code in the first place

anigbrowl 3 years ago | |

Is it ready for production? Maybe not. Is it amazing and inevitably going to get better? Yes. Does it make a lot of human labor redundant in the very foreseeable future? Also yes.

weego 3 years ago | |

Define accurate?

It's just like any other AI system: it returns results as a best-effort approximation of accurate, with a % confidence that doesn't map well to binary outcomes.

So yes, it can be accurate. But there are scenarios where it must be strictly or binary correct, and it's not great at that bit.

influx 3 years ago | | |

I’ve had it confidently tell me to use Python libraries that don’t exist, pass parameters to methods that aren’t in the method signature, and to write code that had to be debugged and fixed.

I’m still excited to use it, but you have to know enough about coding to ensure correctness. It’s nowhere near possible for a non-coder to build a complicated app with it (so far).

rednerrus 3 years ago |

It's been my experience that ChatGPT gets things wrong sometimes. It's also been my experience that if you say "X isn't working as expected." it will do its best to fix the issues. It usually does a pretty good job of fixing it.

I had it write a handful of scripts for me yesterday. It got about 90% of it right on the first pass and 99% of it right on the second pass. You still need to have some understanding of what you're doing so you can see when things are wrong but man if it doesn't save you a lot of time.

ZyanWu 3 years ago |

Misleading, ChatGPT generated both obfuscated and de-obfuscated code

worldsavior 3 years ago | |

So what?

mlatu 3 years ago | | |

the "obfuscated" "version" doesn't even work.

alex literally asked chatgpt to come up with a math problem and its solution, both from whole cloth. and you ask so what.

well, everyone should ask "so what?" to alex.

sidewndr46 3 years ago | | |

A reversible function is not a very interesting development in the world.

mlatu 3 years ago |

i would rather ask it to give me the python source of a program capable of deobfuscating THAT string.

anything else is just naive, wishful thinking and a waste of time. you will have to deobfuscate the code manually anyway

but at least you got on HN

EDIT:

OMG...

the obfuscated code was generated by chatgpt

AND DOESN'T EVEN WORK

garrettjoecox 3 years ago | |

Are you just now realizing these LLMs can’t factually determine if what they output is correct?

twic 3 years ago | | |

No, i'm just now realizing that HN users can’t factually determine if what these LLMs output is correct.

whamlastxmas 3 years ago | | |

It actively makes shit up in my experience. Like outright lies about basic facts.

mlatu 3 years ago | | |

no, i am just now realizing what a clout hero alex is

teeray 3 years ago |

Every commenter is jumping in asking “is it correct?” Even if it’s not 100%, if it’s at least reasonably close, it could be a tremendous force-multiplier against obfuscation for someone with some familiarity with roughly what the code is trying to do.

shagie 3 years ago |

Fortunately, there's a good repository of obfuscated code.

https://www.ioccc.org

    int main(int b,char**i){long long n=B,a=I^n,r=(a/b&a)>>4,y=atoi(*++i),_=(((a^n/b)*(y>>T)|y>>S)&r)|(a^r);printf("%.8s\n",(char*)&_);}

If you want to get the same (or similar) results:

    curl https://api.openai.com/v1/completions \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer $OPENAI_API_KEY" \
      -d '{
      "model": "code-davinci-002",
      "prompt": "Deobfuscate the following C code:\n\nint main(int b,char**i){long long n=B,a=I^n,r=(a/b&a)>>4,y=atoi(*++i),_=(((a^n/b)*(y>>T)|y>>S)&r)|(a^r);printf(\"%.8s\\n\",(char*)&_);}",
      "temperature": 0,
      "max_tokens": 256,
      "top_p": 1,
      "frequency_penalty": 0,
      "presence_penalty": 0
    }'

That one loops on the response:

    The code is obfuscated with the following rules:

    * `B` is the value of `b`
    * `I` is the value of `i`
    * `T` is the value of `y`
    * `S` is the value of `_`

Changing to: "model": "text-davinci-003",

    int main(int b, char** i) {
        long long n = b, a = *i[0] ^ n, r = (a / b & a) >> 4, y = atoi(*++i), _ = (((a ^ n / b) * (y >> 8) | y >> 7) & r) | (a ^ r);
        printf("%.8s\n", (char*)&_);
    }

Which... isn't exactly helpful.

Asking it to explain it instead:

    This code is a C program that prints out a string. The program takes two parameters, b and i, and assigns them to variables n and a. Then the program performs various calculations on those variables and the parameters, such as shifting and bitwise operations. Finally, the program prints out the result of the calculations, which is stored in the variable _, as an 8-character string.

captainmuon 3 years ago |

Wow, I wonder how many "bytes of state" ChatGPT must have internally to be able to do that transform. Or does it guess from certain sequences and just write something plausible?

It would be interesting to test if it can solve "memory hard" problems, like repeated obfuscation or hashing, that would require large internal state. Then you could map how large the equivalent memory would be.
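A sketch of how the ground truth for such a probe could be generated, assuming Node.js and its built-in `crypto` module; `iteratedSha256` is a hypothetical helper name. The idea would be to ask the model for the result after n rounds and compare it with the real value as n grows:

```javascript
const { createHash } = require("crypto");

// Hash the seed, then hash the hex digest, and so on for the given rounds.
function iteratedSha256(seed, rounds) {
  let value = seed;
  for (let i = 0; i < rounds; i++) {
    value = createHash("sha256").update(value, "utf8").digest("hex");
  }
  return value;
}

// Reference answers for increasing depth; each round depends on all of the
// previous output, so there is no shortcut that skips intermediate state.
for (const rounds of [1, 2, 4, 8]) {
  console.log(rounds, iteratedSha256("abc", rounds));
}
```

Whether a failure at some depth really measures "internal state" rather than just the model's inability to compute a hash at all is an open question; even a single round may already be out of reach.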

azatom 3 years ago |

Two types of comments:

- "the result doesn't even work...": yeah, even being a rubber duck is amazing. what the hell do you expect, did you pay for it too?

- "wow, amazing...": not really, best case it is a google without (direct) advertisement that found the original code or very similar parts.

tried it with my own obfuscated code, not from the net... could not get anything out of it

dom96 3 years ago |

Also interested about its accuracy. I built a C/C++ obfuscator[0] a while back, I'll have to make a note to see how well ChatGPT deobfuscates its output.

[0] - https://picheta.me/obfuscator

Double_a_92 3 years ago |

More like: "ChatGPT can pretend to deobfuscate the code"

yodon 3 years ago | |

Are you disagreeing with the deobfuscation that ChatGPT produced?

shever73 3 years ago | | |

According to Alex's own tweet, ChatGPT originally produced the obfuscated code too.

In my head, this is like asking someone to translate "Hello" into French and then asking them to translate "Bonjour" back to English. It proves nothing about capabilities or usefulness.

zekica 3 years ago | | |

I'm disagreeing that the obfuscated code even works. You shouldn't be able to deobfuscate code that doesn't work and get resulting code that works.

KomoD 3 years ago | | |

It's just guessing and happened to be right once

cinntaile 3 years ago |

That's amazing, I'll have to try this. Did you verify that it's correct?

omgomgomgomg 3 years ago |

And the thing does not learn, it does not save the good answers somewhere nor does it discard the bad ones.

I have asked it to generate some macOS-compatible VBA code. Oh dear, that "macro" has never worked so far.

jcadam 3 years ago |

I asked ChatGPT to write me a Fibonacci sequence generator suitable for entry into the obfuscated C contest. And it actually spit out something reasonably well obfuscated (used lots of macros).

arriu 3 years ago |

Could this be taken to the extreme by asking it to decompile assembly?

pathartl 3 years ago | |

I've been using it paired with Ghidra to give me a better idea of what's going on. It helped me create a no-CD crack for an old game.

amrb 3 years ago | | |

Silly idea, though: would it be helpful in creating the server-side code for an online-only game?

I've seen a project for Battlefield 3, though I already have the feeling it's a team effort at minimum?

amrb 3 years ago | |

From my own research, CTF tools like angr can build ASTs, so I'm working on the assumption that I can train the AI to review interesting parts of the execution tree. Happy to get feedback or papers, since this has been the most interesting find so far: https://arxiv.org/abs/1906.12029

pskkk 3 years ago | |

I previously did this. I wrote a naive integer factorization program in C, compiled it, extracted the disassembly and intentionally broke it. It generated a working C function that was almost correct given the broken assembly. I then “talked” with it to improve the code, and even suggested that the original disassembly contained an error. It was surprisingly good.

Note: I broke the disassembly intentionally because when I presented the original disassembly it immediately outputted the/a C program to factorize integers.

the_mitsuhiko 3 years ago | |

I have used ChatGPT somewhat successfully to decompile assembly into C and C++. It makes a lot of mistakes, but despite all of this it's very helpful.

1970-01-01 3 years ago |

Soundness implies validity, but validity does not imply soundness.

RektBoy 3 years ago |

Give it a binary obfuscated with VMProtect

KomoD 3 years ago |

Wasn't really a particularly difficult sample... You know what it does by just looking at it.

dsabanin 3 years ago |

I have to say, I find all the comments dismissing ChatGPT hilarious. I read them in a funny grandpa voice. However, we should look past the insignificant details. The main achievement is that we now have a really capable unstructured text-to-computer interface. We can hook it up to anything and it will give us answers with whatever properties we desire, in whatever shape we can think of.