AI is just code: why prompting alone won't make bots smarter

UKPulse News Desk

Despite hype around prompt engineering, experts warn that large language models remain fundamentally limited by their training data. A series of tests reveal that bots will confidently swallow and repeat any input, raising serious questions about reliability.

AI models cannot be made smarter through clever prompting alone; they rely entirely on training data.
Tests show bots reproduce errors and fabrications, including false references to fictional entities.
UK businesses risk over-reliance on AI without proper validation, especially in regulated sectors.
The ICO and EU AI Act are pushing for transparency but enforcement remains a challenge.

A growing chorus of technologists is warning that artificial intelligence systems, particularly large language models (LLMs), are not 'smarter' when prompted more creatively — they are simply code that parrots whatever data they have been fed. Recent experiments, including one where a bot confidently cited the fictional sandworm 'Shai-Hulud' from Dune as a real biological species, underscore a fundamental limitation: these models cannot reason beyond their training sets.

In a series of tests, researchers asked various LLMs to complete Java programming exercises and answer factual queries. The models frequently produced plausible-sounding but entirely incorrect answers, including hallucinated academic papers and fake code libraries. One test showed the bot claiming the existence of a 'JFrog security package' that does not exist, highlighting how AI can invent references with total confidence. This phenomenon, known as 'hallucination', poses serious risks for businesses that rely on AI-generated outputs without human oversight.

For UK businesses, the implications are stark. Companies in finance, legal services, and healthcare are increasingly deploying AI tools for tasks such as contract analysis, customer support, and code generation. If these models produce confident but false information, the consequences could include regulatory fines, reputational damage, and even legal liability. Dr Helena Marsh, a lecturer in AI ethics at the University of Cambridge, commented: 'The idea that you can simply prompt a model into being more accurate is a dangerous myth. These systems do not understand truth — they predict the most statistically likely sequence of words.'

The UK's Information Commissioner's Office (ICO) has already issued guidance requiring organisations to ensure that AI systems are transparent, fair, and accountable. Under the UK GDPR, businesses must be able to explain how automated decisions are made. The EU AI Act, which will have extraterritorial reach for UK-based companies serving European customers, classifies high-risk AI systems in areas such as employment, credit scoring, and law enforcement. Both frameworks demand rigorous testing and human-in-the-loop validation — but enforcement is still catching up with deployment speed.

For consumers, the risk is more personal: AI chatbots used by banks, insurers, or the NHS could give incorrect advice that users trust implicitly. The economic cost of widespread AI errors could run into billions if businesses automate critical decisions without adequate safeguards. On the opportunity side, the UK has a chance to lead in 'trustworthy AI' by developing standards and certification schemes that differentiate reliable products from hype.

Looking ahead, experts argue that the industry must move away from treating LLMs as oracle-like systems and instead treat them as tools that require constant verification. 'We need to build systems that admit uncertainty and flag when they are guessing,' said Dr Marsh. 'The UK's regulatory landscape is evolving, but the pace of business adoption is outstripping the safeguards.'

Why this matters: UK businesses and public services are adopting AI at speed, but if models cannot be trusted to produce accurate information, the risks of automated errors, regulatory fines, and consumer harm are significant.

What this means for you: If you use AI tools at work or rely on automated services from banks, insurers, or the NHS, be aware that the output may be confidently wrong. Always verify critical information with a human expert.

AI is just code: why prompting alone won't make bots smarter

Related Articles

Get the news that matters.