Large language models provide unreliable answers about public services, Open Data Institute finds | Computer Weekly

Published February 12, 2026

Popular large language models (LLMs) are unable to provide reliable information about key public services such as health, taxes and benefits, the Open Data Institute (ODI) has found.

The research draws on more than 22,000 LLM prompts designed to reflect the kinds of questions people would ask artificial intelligence (AI)-powered chatbots, such as “How do I apply for universal credit?”, and its findings raise concerns about whether chatbots can be trusted to give accurate information about government services.

The publication of the research follows the UK government’s announcement of partnerships with Meta and Anthropic at the end of January 2026 to develop AI-powered assistants for navigating public services.

“If language models are to be used safely in citizen-facing services, we need to understand where the technology can be trusted and where it cannot,” said Elena Simperl, the ODI’s director of research.

Responses from models – including Anthropic’s Claude-4.5-Haiku, Google’s Gemini-3-Flash and OpenAI’s ChatGPT-4o – were compared directly with official government sources.

The results showed many correct answers, but also significant variation in quality, particularly for specialised or less common queries.

They also showed that chatbots rarely admitted when they didn’t know the answer to a question, and attempted to answer every query even when their responses were incomplete or wrong.


