Abstract
Hateful content online is a concern for social media platforms, policymakers, and the public. This has led high-profile content platforms, such as Facebook, to adopt algorithmic content-moderation systems; however, the impact of algorithmic moderation on user perceptions is unclear. We experimentally test the extent to which the type of content being removed (profanity vs hate speech) and the explanation given for its removal (no explanation vs link to community guidelines vs specific explanation) influence user perceptions of human and algorithmic moderators. Our preregistered study encompasses representative samples (N = 2870) from the United States, the Netherlands, and Portugal. Contrary to expectations, our findings suggest that algorithmic moderation is perceived as more transparent than human, especially when no explanation is given for content removal. In addition, sending users to community guidelines for further information on content deletion has negative effects on outcome fairness and trust.
Original language | English |
---|---|
Pages (from-to) | 2595-2617 |
Number of pages | 23 |
Journal | New media & society |
Volume | 25 |
Issue number | 10 |
DOIs | |
Publication status | Published - 2023 |
Keywords
- Artificial intelligence
- Content moderation
- Cross-country
- Experiment
- Hate speech
- Profanity
- Social media