What are the implications of using hacked data for research?
A short thread inspired by the fact that, before AWs took it down, #Parler was extensively hacked and user data was leaked.
The #Parler dataset seems crazy interesting for doing research, and my first reaction after the breach was to shre it with other #CompSocSci ppl.
However, I started having second thoughts, so what follows is to organize ideas and have it somewhere I can look back to.
2/n
Generally speaking, as far as the ethics of research goes a good advice would be to handle hacked data with caution.
First of all, there's an issue of quality. Data might be altered or incomplete, and the source cannot be considered accountable (assuming src is anonymous).
3/n
Secondly and more importantly, a researcher using the data would probably be violating users’ consent and acting against the data collector's will.
Finally, users’ privacy is at stake, since researchers could see material that users didn’t agree for other people to see.
4/n
Sharing private information without consent might put people at risk of harm.
This is all the more true in cases such as the #ParlerHack, where the leaked information is of particularly sensitive nature, and there’s a high risk of unintended consequences.
5/n