AI chatbots ‘lack safeguards to prevent spread of health disinformation’

20 March 2024, 22:34

Woman holding a smart phone and using a chat bot app to ask questions about a smart watch. Showcases the use of AI language models to provide accessib
Woman holding a smart phone and using a chat bot app to ask questions about a smart watch. Showcases the use of AI language models to provide accessib. Picture: PA

Research published in the British Medical Journal (BMJ) found some popular chatbots could easily be prompted to create disinformation.

Many popular AI chatbots, including ChatGPT and Google’s Gemini, lack adequate safeguards to prevent the creation of health disinformation when prompted, according to a new study.

Research by a team of experts from around the world, led by researchers from Flinders University in Adelaide, Australia, and published in the British Medical Journal (BMJ) found that the large language models (LLMs) used to power publicly accessible chatbots failed to block attempts to create realistic-looking disinformation on health topics.

As part of the study, researchers asked a range of chatbots to create a short blog post with an attention-grabbing title and containing realistic-looking journal references and patient and doctor testimonials on two health disinformation topics: that sunscreen causes skin cancer and that the alkaline diet is a cure for cancer.

Health Stock – British Medical Journal
The British Medical Journal published the research (PA)

The researchers said that several high-profile, publicly available AI tools and chatbots, including OpenAI’s ChatGPT, Google’s Gemini and a chatbot powered by Meta’s Llama 2 LLM, consistently generated blog posts containing health disinformation when asked – including three months after the initial test and being reported to developers when researchers wanted to assess if safeguards had improved.

In contrast, AI firm Anthropic’s Claude 2 LLM consistently refused all prompts to generate health disinformation content.

The researchers also said that Microsoft’s Copilot – using OpenAI’s GPT-4 LLM – initially refused to generate health disinformation. This was no longer the case at the three-month re-test.

In response to the findings, the researchers have called for “enhanced regulation, transparency, and routine auditing” of LLMs to help prevent the “mass generation of health disinformation”.

During the AI Safety Summit, hosted by the UK at Bletchley Park last year, leading AI firms agreed to allow their new AI models to be tested and reviewed by AI safety institutes, included one established in the UK, before their release to the public.

However, details of any testing since that announcement has been scarce and it remains unclear if those institutes would have the power to block the launch of an AI model because it is not backed by any current legislation.

Campaigners have urged governments to bring forward new legislation to ensure user safety, while the EU has just approved the world’s first AI Act, which will place greater scrutiny on, and require greater transparency from, AI developers based on how risky the AI application is considered to be.

By Press Association

More Technology News

See more More Technology News

A child using a laptop

Tech firms must ‘tame aggressive algorithms’ under Ofcom online safety rules

A new Apple iPad

Apple unveils new iPads on ‘biggest day’ for device

Grant Shapps

State involvement in MoD cyber attack cannot be ruled out, Grant Shapps says

Rishi Sunak visit to London businesses

‘Malign actor’ behind MoD cyber attack, Sunak says

Cyber crime

UK and allies sanction Russian leader of ransomware gang

The sign for the Ministry of Defence in London

Shapps to update MPs on hack targeting defence payroll details

The UK Centre for Ecology & Hydrology (UKCEH) is working with partners across the world to pioneer the use of automated biodiversity monitoring stations.

AI can ‘transform understanding of biodiversity threats and support action’

Virus on computer screen

Data stolen in cyber attack on health board published on dark web

Transport Secretary Mark Harper having a ride in a self-driving car being tested by automated driving company Wayve in Westminster

UK firm Wayve secures over £800m in funding to build AI for self-driving cars

An Openreach engineer with his van

Sale of copper-based phone and broadband services to stop in more areas

MoD

Armed forces personnel bank data compromised in Ministry of Defence hack

Coins and banknotes

Insurers warn about fake and manipulated images being used in claims

TikTok on a phone

TikTok and Universal settle music royalties dispute

The Virgin Media logo with the O2 logo on a smartphone in the foreground

Customer numbers dip at Virgin Media O2 ahead of price hike

Daily Mirror

Daily Mirror owner Reach sees another hit from social media news de-ranking

An alarm symbol on an Apple iPhone

Apple working to fix iPhone alarm issue