I wrote my bachelor’s thesis on something tangential: some researchers found that, in some very specific circumstances, it was possible to train a classifier to do author attribution (i.e. figure out who wrote a program) based solely on the compiled binaries that person produced. I don’t think the technique has been used for anything actually useful, but it’s cool to see that individual coding style survives the compilation process, so much so that you can tell one person’s compiled programs apart from another’s.


Do you mean the whole binary or just the text segment/instructions?

I ask because I think this gets a lot easier if you can look at the symbol table, strings, and code-signing certificate.



