https://people.eecs.berkeley.edu/~dawnsong/papers/2012%20On%20the%20Feasibility%20of%20Internet-Scale%20Author%20Identification.pdf
in 2012 they were able to identify the correct poster out of a set of one hundred thousand posters based on the text corpus alone: about 20% of the time outright, and with over 80% precision when the classifier was allowed to skip the cases it wasn't sure about. and [b]classifiers are one of the few areas of technology that have exponentially increased in power since 2012[/b] and have a massive industry behind them.
of course a short post like on 4chan can only hold a small number of bits, and even things like uhh the pigeonhole principle can be used to say "wait a minute how many combinations of 7 word posts are there even!" (well it's a lot actually) so there's not much information in one single post.... or is there? well yes there is! And in the example from 2012 that i skimmed it seems they were using blog posts which aren't usually that long anyway..
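rough back of the envelope for the "how many 7 word posts" question. this assumes a made-up working vocabulary of 10,000 words and that every word is picked independently, so treat it as an upper bound (real posts are way more predictable than that). `post_capacity` is just a name i picked for the sketch:

```python
import math

def post_capacity(vocab_size: int, words: int) -> tuple[int, float]:
    """Return (number of possible posts, bits of information) for a post
    of `words` words, each drawn independently from `vocab_size` words."""
    combos = vocab_size ** words
    bits = words * math.log2(vocab_size)
    return combos, bits

# vocab_size=10_000 is an assumed figure for illustration, not a measurement
combos, bits = post_capacity(10_000, 7)
print(f"{combos:.3e} possible 7 word posts, roughly {bits:.0f} bits at most")
```

the point is only that the ceiling for a short post is pretty high, how much of it actually says something about the author is a separate question.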
and it's much harder to identify someone from 100,000 posters than from kissu's small population of what? like 100 posters? (maybe kissumins can shed some light on this, im also interested in any kind of analytics that they do collect)
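to put a number on "much harder": picking one person out of N equally likely candidates takes log2(N) bits of identifying information. a quick sketch, where the 100 is only my guess at the kissu population and `bits_to_identify` is a made-up helper name:

```python
import math

def bits_to_identify(population: int) -> float:
    """Bits needed to single out one member of `population`,
    assuming every member is equally likely beforehand."""
    return math.log2(population)

# 100_000 is the corpus size from the 2012 paper; 100 is a guess, not real data
for population in (100_000, 100):
    print(f"{population} posters: about {bits_to_identify(population):.1f} bits")
```

so under those assumptions the small board needs about 10 fewer bits per person than the 100,000 author case.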
also the difference between stylometry and fingerprinting is that any random person with access to kissu can implement a stylo attack, and they don't need access to the server.
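to show how little it takes, here's a toy sketch of a stylo attack using nothing but the python standard library. this is NOT the method from the 2012 paper, it just compares character trigram counts with cosine similarity, and a real attack would use far more text and better features. the function names, author labels and sample posts are all made up for illustration:

```python
from collections import Counter
import math

def char_ngrams(text: str, n: int = 3) -> Counter:
    """Count overlapping character n-grams after lowercasing and
    collapsing runs of whitespace."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

def guess_author(unknown: str, known_posts: dict[str, list[str]]) -> str:
    """Return the known author whose combined posts look most like `unknown`."""
    target = char_ngrams(unknown)
    profiles = {
        author: char_ngrams(" ".join(posts))
        for author, posts in known_posts.items()
    }
    return max(profiles, key=lambda author: cosine(target, profiles[author]))

# hypothetical scraped posts, publicly readable, no server access involved
known_posts = {
    "anon_a": ["uhh well i dunno... maybe its fine??",
               "uhh i guess thats ok... dunno"],
    "anon_b": ["I believe this is correct. However, one must consider the alternative.",
               "However, I must disagree with that assessment."],
}
print(guess_author("uhh dunno... maybe thats fine i guess", known_posts))
```

everything it needs is text that anyone can read off the board, which is the whole difference from fingerprinting.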
this is probably the 15th time I have made a post about stylometry on kissu, but also people keep making these threads about identifying people so I keep having to make them.
another cool fact about stylometry attacks is that you can do totally cross platform attacks. for example finding someone's 4chan posts, or other forum posts... or imagine the case where you have a large enough corpus built that you manage to find their facebook account! from the 2012 example using 100,000 people this seems kind of feasible.. Add together other bits of data you can find out about them: usual posting time, descriptions about their life (to gather demographic data)
using a few posts you might be able to find out exactly who someone is
Imagine the kind of abilities a place like the NSA has, which is just a bunch of giant airplane hangars filled with nerds who love to think about this stuff all the time and have access to every single internet post and transaction in the entire world. It doesn't seem ridiculous to me that the NSA could take the text from one kissu post and then use that to get your SSN in a few hours if they wanted to.
very exciting!