New algorithm scans through escort ads to find victims of trafficking

Scientists have developed an algorithm that scans online ads for escorts to identify human traffickers and their victims.

The algorithm, called InfoShield, searches for escort ads with strong similarities, which can be a sign of trafficking.

Per the study paper:

The majority of ads suspected of HT [human trafficking] are written by one person, who is controlling ads for four-six different victims at a time. By looking for small clusters of ads that contain similar phrasing, rather than analyzing standalone ads, we’re finding the groups of ads that are most likely to be organized activity, which is a strong signal of HT.

Law enforcement agencies typically look for HT cases manually. InfoShield is designed to speed up the search by detecting mini-clusters of ads, grouping them together, and summarizing the common parts.

“Our algorithm can put the millions of advertisements together and highlight the common parts,” said study co-author Christos Faloutsos in a statement. “If they have a lot of things in common, it’s not guaranteed, but it’s highly likely that it is something suspicious.”

TNW City Coworking space - Where your best work happens

A workspace designed for growth, collaboration, and endless networking opportunities in the heart of tech.

Book a tour now

[Read: 3 new technologies ecommerce brands can use to connect better with customers]

Researchers at Carnegie Mellon University and McGill University adapted InfoShield from an algorithm used to spot anomalies in data, such as typos in hospital patient information.

In tests on escort listings that had already been identified as advertising victims of trafficking, InfoShield correctly flagged the ads with 84% precision. In addition, it didn’t incorrectly identify any of the listings as trafficking ads.

However, the team had to keep their findings private to protect the victims. To prove their algorithm worked, they applied it to tweets created by bots, which also typically tweet the same information in similar ways.

They found that InfoShield was also highly accurate at detecting the bots:

Moreover, it is scalable, requiring about eight hours for 4 million documents on a stock laptop.

The team now hopes to see their research help victims of trafficking, and ultimately reduce human suffering.

You can read the study paper here.

Greetings Humanoids! Did you know we have a newsletter all about AI? You can subscribe to it right here.

Story by Thomas Macaulay

Managing editor

Thomas is the managing editor of TNW. He leads our coverage of European tech and oversees our talented team of writers. Away from work, he e (show all) Thomas is the managing editor of TNW. He leads our coverage of European tech and oversees our talented team of writers. Away from work, he enjoys playing chess (badly) and the guitar (even worse).

Get the TNW newsletter

Get the most important tech news in your inbox each week.

New algorithm scans through escort ads to find victims of trafficking

Get the TNW newsletter

Also tagged with

Kembara closes €750M first close to fuel growth of European deep tech startups

Synthesia’s valuation jumps to $4B after $200M raise

Discover TNW All Access

Bananas, champagne, and robots: Why automation still needs humans

Engineering’s AI reality check