Censorship gone awry on Reddit: the aftermath of our r/science AMA

dhimmel (66) in #science • 7 years ago (edited)

You may remember my announcement earlier this week of our AMA (Ask Me Anything) on Reddit Science. Overall, the AMA, which was about Sci-Hub and our recent study on its coverage, was a success and generated lots of interesting discussion:

From the outside, it looks like everything ran smoothly. The AMA generated 65 comments and has a score of 116 (upvotes − downvotes). There are plenty of high-impact quotes, but my favorite comes from coauthor Thomas Munro, who wrote:

lucaxx85's questions themselves illustrate how paywalls raise costs, by allowing authors to externalize these ruinous costs to society: a vast public subsidy — tens of billions of dollars a year — [for] the concealment of publicly-funded research from the public. We argue that Sci-Hub is hastening the end of this grotesque situation.

However, on the inside, the AMA was anything but smooth. For those who are unfamiliar with r/science AMAs, they are scheduled events where Redditors pose questions to a pre-approved group of scientists. In this case, all coauthors from our study were given the login to the eLife_AMA account, and we were all instructed to respond using this account. The AMA was organized by the journal eLife and the r/science moderators.

@greenescientist and I were sitting back-to-back in the Childhood Cancer Data Lab in #Philly. Outside, a nor'easter raged. We were typing away furiously, answering questions. Miserlou was also in the office, and asked, "Guys, why are you ignoring my question?" What question!? We went over to Miserlou's workstation, and here's the comment we saw:

However, from our viewpoint there was nothing. Miserlou's comment had been shadowbanned! Shadowbanning is when a comment appears to have posted normally for its creator while logged in, but does not appear for anyone else. The goal of shadowbanning is to censor content without its creator realizing. Not only was Miserlou's question a fantastic one, but he's had this Reddit account since 2007 and has accumulated 17,692 post karma and 12,879 comment karma, in addition to being a moderator for 17 subreddits.

Given that Miserlou was shadowbanned, we decided to check whether our own comments from the eLife_AMA account showed up if we were logged out. Many did not show up. So here we were leading this AMA and Reddit was deceiving us into thinking we were having a conversation, while we were really just posting to the void. Then things got even more ridiculous. Two of @greenescientist's comments that had initially posted got deleted. When we were logged in, we saw:

But everyone else saw (Internet Archive snapshot):

At this point, we almost called off the whole AMA. What's the point of doing it if all of our comments were going to be censored or deleted? Was there a troll moderator that wanted to sabotage the event?

I took a few emergency actions. First, I tweeted the issue. I wanted to establish documentation on an independent platform in case this was an adversarial situation. Second, I messaged the mods. Third, I posted a comment from the eLife_AMA account describing the issue:

Of course, this comment was shadowbanned. Therefore, I edited the initial thread description to include:

We have temporarily stopped responding to comments using the eLife_AMA account because our comments are getting deleted. See this tweet thread for details. Can a mod please leave a reply on this comment when this issue has been fixed?

Eventually, a moderator got back to us. Here is our entire exchange which transpired over the next several hours:

So according to the moderator, it was the automated "automod" rather than a human saboteur that was censoring our comments. This negatively impacted the AMA, since many comments would take several hours to post (only once a moderator had un-banned them). Hence, back-and-forth conversation wasn't really possible.

Interestingly, I believe the comment shown above where I detail the censorship was eventually un-banned by a mod but is now back to being banned. However, maybe I was just confused since shadowbanning is so misleading and annoying. The number of false positives is absolutely unacceptable and indicates a major issue with r/science's automod.

One reason I suspect our posts were flagged is that we use links. Yes, since we're scientists and believe in crediting sources, we link a lot. I've also had this problem with Disqus, where my comments with links always get marked as spam. People complain about fake news, but combating fake news requires a culture of attributing sources, which starts with encouraging (and not discouraging) hyperlinking. At least Disqus, however, is slightly more forthcoming that your comment has been flagged as spam:

Why decentralized social networks are the solution?

Could this situation have happened were we using Steem? Yes and no. On Steem, the blockchain is permissionless and decentralized. Therefore, if we were commenting on a Steem post to do an AMA, anyone could comment (within the limits of their bandwidth, which depends on their Power). Now, certain comments may get downvoted. Each explorer (like https://steemit.com, https://busy.org/, or https://stage.steemiz.io/) is able to sort and hide comments however it'd like.

This means that if I were unhappy with the filtering of one Steem explorer, I could switch to another one. Unlike on Reddit, there is no possibility that comments could be deleted or hidden from the world. This means that all censorship is elective. For example, I may want to use an explorer that censors certain types of content, such as offensive, illegal, or low-quality content. However, neither I nor the content is beholden to any specific frontend. For example, I certainly wouldn't use an explorer that censored content with such carelessness as r/science's automod. And with Steem I have a choice.
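To make the idea of elective filtering concrete, here is a minimal Python sketch. It is not Steem's actual API: the `Comment` class, the `SHARED_COMMENTS` list standing in for the blockchain, and the two frontend functions are all hypothetical names chosen for illustration. The point it tries to show is that every frontend reads the same shared data and only decides what to display.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Comment:
    author: str
    body: str
    net_votes: int


# Hypothetical stand-in for the shared content layer (the blockchain).
# Frontends read from it but never delete from it.
SHARED_COMMENTS = [
    Comment("alice", "A sourced question: https://example.org/paper", 12),
    Comment("bob", "low effort reply", -4),
]


def show_everything(comments: List[Comment]) -> List[Comment]:
    """A permissive frontend: show every comment, highest votes first."""
    return sorted(comments, key=lambda c: c.net_votes, reverse=True)


def hide_downvoted(comments: List[Comment]) -> List[Comment]:
    """A stricter frontend: hide comments with a negative vote total."""
    return [c for c in comments if c.net_votes >= 0]
```

A reader who dislikes the stricter view can switch to the permissive one and see `bob`'s comment again, because filtering happens at display time and the shared list is left unchanged.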

r/science has 18.4 million subscribers and has been around for 11 years. How could such a popular forum have such terrible and erratic censorship? I think the main issue is that Reddit is a centralized platform. Therefore, subreddits and users rely heavily on Reddit Inc. In this instance, it seems that r/science developed a custom "automod" program, which has its problems. Decentralized protocols allow for much more innovation, since no permission is needed to innovate. Check out SteemTools to see how many services and applications have been built on Steem since its creation in 2016.

Let's check back in 5 years and see whether online communities still aggregate on centralized platforms or whether decentralized, incentivized platforms are home to the best communities and best AMAs!

This post and its images are released under a Creative Commons Attribution 4.0 License, so feel free to reuse or repost it anywhere for any purpose as long as you link to this post.

#reddit #censorship #steem #ama

liminalphase (37) 6 years ago

Thanks for the thoughtful post! I am a mod for some subs on Reddit and noticed some confusion about the site in your post and follow-up comments. So I thought it might be helpful to discuss how Reddit works with regards to these processes and what is specific to /r/science.

A removed comment is not shadowbanning. Anytime you have a comment removed for any reason on Reddit it usually still shows up to you but not others. But neither you nor /u/miserlou were shadowbanned so no worries there! Banning and shadowbanning are very specific terms within Reddit but neither apply to this context.
Automod is not specific to /r/science. It is a mechanism that is now built into Reddit that allows subs to insert specific phrases or websites that will auto-pull the comment. For example, racial slurs or known spam sites. But it will also auto pull really short comments (ex: if you just say "ok") or comments with a lot of links. The link issue is not something that /r/science moderators can change and is something you may run into for every sub. One way around this is to message mods when you post a link-heavy comment and it can be manually released. The amount of karma you've accumulated may also impact this issue (low karma will auto-pull a comment on many big subs.)
Moderators can neither edit your comment nor control whether its links are followed. Either the entire comment stays or we can remove it. There is no in-between option.
Each sub has its own rules for content. You can't post cat photos in /r/dogs, for example, because the point of that sub is to curate photos of dogs. /r/science is (in)famous for their strict moderation rules of no jokes/pop culture, no hate speech, no pseudo-science, no fights, etc. Even Redditors who don't frequent the sub are well aware of this and know that if they run afoul of those rules their comment will be removed. For better or worse this sub-by-sub set of rules is the normative culture of Reddit and people expect a need to codeswitch.
/r/science is also unique because to enforce this strict conversational curation they have over 1,000 moderators. Most only have permission to remove comments while only a handful have additional permissions. This helps handle hot-button posts that garner tons of racist or sexist comments (for example) but it does slow down response time for actions that require higher permissions (such as approving removed comments.) Most subs simply aren't big enough to warrant that kind of team. Higher level mods periodically survey their activities and strip permissions if they are moderating in ways that aren't in line with their rules. I can't tell you definitively whether any of your comments were grabbed by low-level mods or by automod, but I don't see any rule violations and I do see links. So automod is the logical assumption.
Some subs do alert users when their comments are removed but in my experience the subs with millions of users do not. This is mostly due to a volume issue. In small subs it is easy for me to give people a heads-up and manage responses. But if a post has a hundred removed comments due to actual rule violations (i.e. valid removals) it would take a lot of manpower to respond to each query or response.
I also see from a comment that you posted a text-post to /r/science, but their rules do not allow that. I suspect it was removed by auto-mod. Many subs do allow text-posts and figuring out where to target your content is part of just getting to know platforms and sub-cultures.
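As a rough illustration of the kind of rule-based filtering described above, here is a minimal Python sketch. It is not Reddit's actual automod configuration or code: every name, word list, and threshold below is a hypothetical placeholder, chosen only to show how simple hand-written rules can pull a well-sourced comment along with spam.

```python
import re
from dataclasses import dataclass
from typing import Optional

# All values below are hypothetical, not Reddit's or r/science's settings.
BLOCKED_PHRASES = {"buy cheap pills"}
BLOCKED_DOMAINS = {"spam.example"}
MIN_LENGTH = 5    # comments shorter than this are pulled
MAX_LINKS = 3     # comments with more links than this are pulled
MIN_KARMA = 10    # authors below this karma are pulled

URL_PATTERN = re.compile(r"https?://([^/\s]+)")


@dataclass
class Comment:
    body: str
    author_karma: int


def removal_reason(comment: Comment) -> Optional[str]:
    """Return the first rule the comment trips, or None if it passes."""
    text = comment.body.strip().lower()
    domains = URL_PATTERN.findall(text)
    if any(phrase in text for phrase in BLOCKED_PHRASES):
        return "blocked phrase"
    if any(domain in BLOCKED_DOMAINS for domain in domains):
        return "blocked domain"
    if len(text) < MIN_LENGTH:
        return "too short"
    if len(domains) > MAX_LINKS:
        return "too many links"
    if comment.author_karma < MIN_KARMA:
        return "low karma"
    return None
```

Under these assumed thresholds, a high-karma author who cites four sources in one comment trips the "too many links" rule, which matches the false positives described in the post.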

All of this does lead to very interesting debates for platforms from Facebook to steem. Do sites have obligations to deal with harassment, violent threats, and illegal content? If so, how do you build that moderation into the system without censoring inappropriately?

What about less obviously problematic content? One reason /r/science says they have such strict rules is that they've worked with communications scholars who showed through peer-reviewed research that pseudoscience and/or aggressiveness in comments meaningfully impacts how readers interpret the science in the associated posts. This research is why most major science news outlets have removed their commenting sections, btw. So what is the appropriate way to cultivate discussion that doesn't feed pseudoscience and/or science dismissal?

I certainly don't have the answer. Reddit is also very frustrating in that we moderators have been begging them for better moderation tools for years. Modmail is awful. Automod is a very blunt instrument, as you discovered. It is hard to sort through notifications. It is simply not well set up for moderation, yet that responsibility falls on the shoulders of volunteers. If your experiences frustrated you please consider dropping an email to the Reddit admins telling them to give moderators halfway decent mod tools! :)

tibra (59) 7 years ago (edited)

Automod, etc, is nonsense. Whenever anybody tells you A.I., or any buzzword, or a "smart" script censored your comment on a social media platform, be very skeptical. Very skeptical.

Very few of these are fully automated. The technology isn't there yet. (Deep learning, with or without convolution layers or reservoirs, doesn't produce much in the models of objects with which it would interact. This is problematic for a semantic web. Even less so a script or a typical bot.)

A group of people are usually there. There is some automation, true, very likely, but it's never to the extent alleged. Rather they don't want to reveal they disliked your comment/post/content. Because you might take action in response.

The same rationale as for shadowbanning. As opposed to banning.

Organizations can claim "whoopsie" and thereby excuse and deny real hostile behavior against those to whom they provide a service.

In general, alleging that censorship is part of some automatic procedure was a trick used by censors in the days of Stalin, to reduce pushback against censorship, as described for example by Abdurakman Avtorhanov (The technology of power, New York: Praeger, 1959). Even mere delays are very effective at breaking the formation of a consensus. Or creating instead another consensus. Timing matters. Indeed timing is the unstable element determining phase change in networks.

dhimmel (66) 7 years ago

Thanks @tibra for your thoughtful reply. While I agree with many of your points regarding censorship generally, I'm not sure how much they apply to this situation.

It doesn't make much sense that Reddit mods would be interested in censoring us. After all, the mods were the ones who invited us to do the AMA. Now there are multiple mods, and while most mods were on-board with our AMA, perhaps a few went rogue and deleted comments.

It seems to me that the automod feature is extraordinarily simple and most likely does not use deep learning or any other type of machine learning. My guess is that it's just a set of human-designed rules. One of those rules is to flag posts with links in them, perhaps with a few more criteria. This rule was likely added to combat spam (https://en.wikipedia.org/wiki/Spamdexing#Link_spam), since links in Reddit threads are likely a decent way to influence SEO. However, it is a heavy-handed solution because it catches both the best (comments with sources) and the worst (link spam) content.

Perhaps a better solution would be for Reddit, when a post is classified as spam, to use rel="nofollow" (https://support.google.com/webmasters/answer/96569?hl=en) for hyperlinks in that post. That way, the spammers would see little benefit. Furthermore, once a flagged post received a certain score, the nofollow precaution could be removed. Now I'm not sure whether Reddit's platform provides r/science this flexibility. This brings me back to my point of why it's important to separate out the content layer into a decentralized database (i.e. blockchain) and allow any frontend to be built on top.
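The nofollow idea can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Reddit renders links: the function name `render_link`, its parameters, and the `TRUST_SCORE` threshold are all hypothetical.

```python
from html import escape

# Hypothetical threshold: the score at which a flagged comment is trusted again.
TRUST_SCORE = 5


def render_link(url: str, text: str, flagged_as_spam: bool, score: int) -> str:
    """Render a hyperlink, adding rel="nofollow" while a flagged comment is unproven."""
    needs_nofollow = flagged_as_spam and score < TRUST_SCORE
    rel = ' rel="nofollow"' if needs_nofollow else ""
    return f'<a href="{escape(url, quote=True)}"{rel}>{escape(text)}</a>'
```

With this approach a flagged comment stays visible to readers, so sourced replies are not lost, while link spammers gain little search-ranking benefit until the community has upvoted the comment past the threshold.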

grisotti (11) 7 years ago

Nowadays censorship is becoming a very real problem on the Internet. More and more platforms are taking an aggressive stance against shared content. Reddit makes some very controversial decisions in this regard, as it recently shut down all darknet-related subreddits. On the one hand, it's pretty easy to justify Reddit's actions when it comes to removing illegal content from its platform. Judging by its recent statement, the company takes a zero-tolerance position on drugs, firearms, sexual services, stolen goods, etc. Most people would be happy to accept this approach, as most of these topics should be avoided because of their illegal nature. On the other hand, we must admit that it is censorship in its purest form. Regardless of the illegality of the above-mentioned topics, there is no reason to prevent people from thinking or talking about them. In fact, Reddit's decision will push more people to other platforms that are not actively controlled. Whether or not it was a reasonable decision has, understandably, yet to be determined.

Daniel Himmelstein tweeted @ 21 Mar 2018 - 19:28 UTC

Daniel Himmelstein tweeted @ 21 Mar 2018 - 19:17 UTC