Anthropic explains how Claude's AI constitution protects it against adversarial inputs
Who needs a human in the training loop?
By A. Tarantola, 05.10.2023

Bipartisan bill would require that social networks have 'clear' content policies
The Internet PACT Act would also modify Section 230 to require takedowns of illegal content.
By J. Fingas, 02.17.2023

Google is making free anti-terrorism moderation tools for smaller websites
The technology may be needed to obey local content laws.
By J. Fingas, 01.04.2023

Twitch halts paid stream boosts after viewers abuse them to push porn
The new program lets viewers boost their favorite streams when they purchase subscriptions and bits.
By A. Khalid, 04.02.2022

GGWP is an AI system that tracks and fights in-game toxicity
AAA studios won’t fix the problem, so esports pro Dennis Fong is stepping up.
By J. Conditt, 03.22.2022

Instagram Live creators can now bring in moderators to handle trolls
The feature could make broadcasts a more positive experience for almost everyone.
By K. Holt, 03.12.2022

Twitter reportedly knew Spaces could be misused due to a lack of moderation
Company executives forged ahead with the audio chat feature despite warnings, according to 'The Washington Post.'
By K. Holt, 12.11.2021

Twitch offers slightly more information about suspensions
But stops frustratingly short of just explicitly saying why.
By D. Cooper, 08.10.2021

Senate Republicans vote to subpoena Facebook and Twitter CEOs
Up for discussion: the companies' alleged blocking of a controversial 'New York Post' article.
By N. Ingraham, 10.22.2020

Sony tries to clear up confusion over voice chat recording on PS5
PlayStation 5 gamers can submit 40-second recorded clips to address harassment or abuse.
By R. Lawler, 10.17.2020

Reddit has banned nearly 7,000 hateful subreddits since June 29th
‘People are weirdly creative about how to be mean to each other.’
By C. Fisher, 08.20.2020

Pinterest's moderation doesn't catch some abusive and false material
Its emphasis on hiding content has issues.
By J. Fingas, 07.13.2020

Twitch forms review board to strengthen moderation policies
... and spit-shine its reputation.
By J. Conditt, 05.15.2020

Facebook will pay content moderators $52 million in PTSD settlement
Each of the 11,250 plaintiffs will receive at least $1,000.
By K. Holt, 05.13.2020

YouTube will temporarily increase automated content moderation
The automated system will remove some videos without human review during the coronavirus outbreak.
By C. Fisher, 03.17.2020

TikTok will stop using China-based moderators to screen foreign content
It wants to reassure other countries that their videos are safe.
By J. Fingas, 03.16.2020

YouTube reportedly considered screening all YouTube Kids videos
Numerous scandals marred its 2019.
By S. Dent, 12.26.2019

YouTube AI thought robot fighting was animal cruelty
Are the robots gaining empathy for their brethren?
By M. DeAngelis, 08.22.2019

An Amazon employee might have listened to your Alexa recording
Amazon apparently uses human-transcribed recordings to refine Alexa's capabilities.
By M. Moon, 04.11.2019

Steam mods will filter 'off-topic review bombs' from ratings
But the reviews will stay up on its site.
By R. Lawler, 03.17.2019