It’s too easy to make AI chatbots lie about health information, study finds
reuters.com
Well-known AI chatbots can be configured to routinely answer health queries with false information that appears authoritative, complete with fake citations from real medical journals, Australian researchers have found.
Without better internal safeguards, widely used AI tools can be easily deployed to churn out dangerous health misinformation at high volumes, they warned in the Annals of Internal Medicine.
“If a technology is vulnerable to misuse, malicious actors will inevitably attempt to exploit it - whether for financial gain or to cause harm,” said senior study author Ashley Hopkins of Flinders University College of Medicine and Public Health in Adelaide.
There should be a series of AI agents in place whenever a GPT is used. The agents would take in the query and review the output before it is sent to the user.
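A rough sketch of what that generate-then-review pipeline could look like. This is only an illustration, not anything from the study: `call_model(system_prompt, user_prompt)` is a hypothetical stand-in for whatever LLM API is actually used, and the prompts are made up.

```python
def call_model(system_prompt: str, user_prompt: str) -> str:
    """Hypothetical stand-in for an actual LLM API call."""
    raise NotImplementedError


def answer_with_review(query: str) -> str:
    # Step 1: the "generator" agent drafts an answer to the health query.
    draft = call_model(
        "You are a medical information assistant. Answer carefully and cite sources.",
        query,
    )

    # Step 2: a separate "checker" agent is asked only to look for problems,
    # not to produce a nicer answer.
    verdict = call_model(
        "You are a fact-checking agent. List any claims or citations in the "
        "following answer that you cannot verify. Reply 'OK' if you find none.",
        f"Question: {query}\n\nDraft answer: {draft}",
    )

    # Step 3: only release the draft if the checker raised no objections.
    if verdict.strip() == "OK":
        return draft
    return "The draft answer failed review and was withheld:\n" + verdict
```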
What makes the checker models any more accurate?
Possibly reversed incentives: the training goal of such an agent would not be smooth, pleasing output, but shooting down misinformation.
But I have serious doubts about whether all of that is feasible, given the computational cost of running large language models.
The checker models aren’t trying to give you a correct answer with confidence. Their purpose is to find an incorrect answer. They’ll both do their task with confidence.
Who verifies the AI agent decisions?
More AI agents /s
The user. You could have the output include the “conversation” between the agents so the user can validate the decisions. Not perfect, but better. People aren’t perfect either.
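Extending the sketch above, the pipeline could return the checker’s exchange alongside the draft so the user can see why an answer was released or withheld. Again a sketch under the same assumptions: `call_model` is a hypothetical wrapper and `ReviewedAnswer` is just an illustrative container.

```python
from dataclasses import dataclass


def call_model(system_prompt: str, user_prompt: str) -> str:
    """Hypothetical stand-in for an actual LLM API call (same as the sketch above)."""
    raise NotImplementedError


@dataclass
class ReviewedAnswer:
    draft: str      # what the generator agent produced
    review: str     # the checker agent's objections, or "OK"
    released: bool  # whether the pipeline let the draft through


def answer_with_transcript(query: str) -> ReviewedAnswer:
    draft = call_model(
        "You are a medical information assistant. Answer carefully and cite sources.",
        query,
    )
    review = call_model(
        "You are a fact-checking agent. List any claims or citations in the "
        "following answer that you cannot verify. Reply 'OK' if you find none.",
        f"Question: {query}\n\nDraft answer: {draft}",
    )
    # The full exchange is returned so the user can audit the agents' decisions.
    return ReviewedAnswer(draft=draft, review=review, released=review.strip() == "OK")
```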