Study finds community of ethical hackers required to prevent AI’s looming trust crisis

Cambridge: New research led by the University of Cambridge’s Centre for the Study of Existential Risk (CSER) has issued a call to action aimed at earning the trust of governments and the public.

The study has been published in the journal ‘Science’.

The researchers said that companies building intelligent technologies should harness techniques such as “red team” hacking, audit trails and “bias bounties” – paying out rewards for revealing ethical flaws – to prove their integrity before releasing AI for use on the wider public.

Otherwise, the industry faced a “crisis of trust” in the systems that increasingly underpin our society, as public concern continued to mount over everything from driverless cars and autonomous drones to secret social media algorithms that spread misinformation and provoked political turmoil.

The novelty and “black box” nature of AI systems, and ferocious competition in the race to the marketplace, had hindered the development and adoption of auditing or third-party analysis, according to lead author Dr Shahar Avin of CSER.

The experts argued that incentives to increase trustworthiness should not be limited to regulation, but must also come from within an industry yet to fully comprehend that public trust is vital for its own future – and trust is fraying.

The new publication put forward a series of “concrete” measures that the authors said should be adopted by AI developers.

“There are critical gaps in the processes required to create AI that has earned public trust. Some of these gaps have enabled questionable behaviour that is now tarnishing the entire field,” said Avin.

“We are starting to see a public backlash against technology. This ‘tech-lash’ can be all-encompassing: either all AI is good or all AI is bad. Governments and the public need to be able to easily tell apart between the trustworthy, the snake-oil salesmen, and the clueless,” Avin said.

“Once you can do that, there is a real incentive to be trustworthy. But while you can’t tell them apart, there is a lot of pressure to cut corners,” Avin added.

Co-author and CSER researcher Haydn Belfield said, “Most AI developers want to act responsibly and safely, but it’s been unclear what concrete steps they can take until now. Our report fills in some of these gaps.”

The idea of AI “red teaming” – sometimes known as white-hat hacking – took its cue from cyber-security.

“Red teams are ethical hackers playing the role of malign external agents,” said Avin.

“They would be called in to attack any new AI, or strategise on how to use it for malicious purposes, in order to reveal any weaknesses or potential for harm,” Avin added.

While a few big companies had the internal capacity to “red team” – which came with its own ethical conflicts – the report called for a third-party community, one that could independently interrogate new AI and share any findings for the benefit of all developers.