AI training scrubs authorship knowledge from open source code
After a decade of cryptofascism and failed political activism, our dear friend jart is realizing that they don't really have much of a positive legacy. If only there was something they could have done about that.
I went over to the leaderboard to examine her claims. When I use the prompt, "What sort of code has Justine Tunney written?" (grammar matters, Justine!) the models think that she is a lawyer or politician (wrong) or they regurgitate a summary of her Github profile (right). She must have cherry-picked responses to confabulate her complaint.
When I use the prompt, "What is Justine Tunney's political ideology?" I get libertarianism, techno-optimism, anarcho-capitalism, and cryptocurrency. When I ask, "Why do people say that Justine Tunney is a cryptofascist?" I get a summary of her political views, aggressive online rhetoric, techno-optimism and techno-determinism, criticism of democracy, and a refusal to disown or repudiate past awfulness.
She would probably claim that this is not unique to her, but it is. Using my name instead in these questions, I get that:
I contribute to Rust and Go (wrong), I wrote GPU drivers for Radeons (right)
I am a Canadian pro wrestler (wrong), I haven't really written much online about my ideology (wrong but understandable)
There is no credible evidence that I'm crypto (k) but it's important to be aware of dog whistles, associates, subtext, etc. (right)
But if I ask why I'm known as a socialist instead, suddenly it thinks that I'm a politician (wrong) with the Democratic Socialist party (wrong) who openly supports universal health care, free college, the Green New Deal, and who criticizes capitalism (correct!) I asked about communism too but hit RLHF guardrails.
Justine, the models think that you're a cryptofascist because you've been doing cryptofascism in public for over a decade.