- Perspective is an API that makes it easier to host better conversations. The API uses machine learning models to score the perceived impact a comment might have on a conversation. Developers and publishers can use this score to give real-time feedback to commenters, help moderators do their job, or allow readers to more easily find relevant information, as illustrated in two experiments below. We’ll be releasing more machine learning models later in the year, but our first model identifies whether a comment could be perceived as “toxic” to a discussion.
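For the curious, calling the API amounts to POSTing a small JSON body and reading a 0-to-1 probability back. A minimal sketch below, assuming the publicly documented `comments:analyze` endpoint and field names; `API_KEY` is a placeholder, and no network call is actually made here.

```python
import json

# Hedged sketch of a Perspective AnalyzeComment request. Endpoint and field
# names are taken from the public docs but should be treated as assumptions;
# API_KEY is a placeholder you'd replace with your own key.
API_URL = ("https://commentanalyzer.googleapis.com/v1alpha1/"
           "comments:analyze?key=API_KEY")

payload = {
    "comment": {"text": "whatever you say in an indoor voice"},
    "requestedAttributes": {"TOXICITY": {}},
}
body = json.dumps(payload)

# POST `body` to API_URL with Content-Type: application/json; the response
# carries the toxicity probability at
#   response["attributeScores"]["TOXICITY"]["summaryScore"]["value"]
print(body)
```

The score is a probability that a reader would perceive the comment as toxic, not a binary verdict, which matters for how a moderation UI should present it.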
This is a project out of Google's think tank. You can test how the API scores things using the text box near the bottom of the page; it's interesting to see how a single word can change the 'perceived toxicity'.
Run it down this thread: 1) I'm a toxic mutherfucker 2) Because I cuss a lot 3) and whatever you say in an indoor voice, it isn't toxic. I'm not a machine learning guy, but I don't know that I'd describe the problem as "strictness" so much as "lack of context." 18%, by the way; 13% without point 1 and 11% without point 2.
Paper. They're just looking at small windows of the text, building (very sparse) vectors along the lines of "1.0 if this sequence of n words/characters appeared in the text, 0.0 if not" and doing some voodoo with it. This is the sort of classifier marketing firms use to guess whether Twitter feels positively or negatively about something. You're not going to be able to do fine-grained classification of short texts that way and, unsurprisingly, "toxicity" looks a lot like vehemence.
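A minimal sketch of that representation, assuming plain word n-grams (the paper also uses character n-grams, and the function names here are mine): each text becomes a sparse binary bag of features, so any classifier on top keys on surface word choice rather than context.

```python
def ngram_features(text, n_range=(1, 2)):
    """Return the set of word n-grams present in `text`.

    Membership in this set is the "1.0 if this sequence of n words
    appeared, 0.0 if not" feature described above.
    """
    words = text.lower().split()
    feats = set()
    for n in range(n_range[0], n_range[1] + 1):
        for i in range(len(words) - n + 1):
            feats.add(" ".join(words[i:i + n]))
    return feats

# Two short comments with opposite sentiment share most of their features,
# which is why short-text classification this way stays coarse-grained.
a = ngram_features("you are an idiot")
b = ngram_features("you are an inspiration")
print(sorted(a & b))  # → ['an', 'are', 'are an', 'you', 'you are']
```

Swapping one word only flips a handful of features, so "toxicity" ends up tracking the presence of charged vocabulary, much like the sentiment classifiers mentioned above.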
Hm. I thought for a moment that it would be cool to introduce such a meter to Reddit (because it can be a harsh place) and to Hubski (to improve comment quality further, not that it needs it much), but then it occurred to me: what if the result is wrong? The code is in its infancy at most. What would the implications of such a mistake be for the human writing the comment? Suppose you write something important, and the mechanism rates it as "toxic". Would it stop you from posting it? Would it seed doubt in you? Could it bully you into not posting anything further on the matter (because it's a machine, and machines are "never wrong")? On the more scientific side of things, I wonder what it would make of select phrases from philosophy books after analyzing the whole book. For example, would it treat "God is dead, and we killed him" as toxic after seeing the reasoning that led to the phrase?