Ai Report Trends and Insights v1.0
Executive summary
Key findings
In 2024, AI transitioned from an emerging technology to business-critical. Companies across industries have rapidly integrated AI applications into their operations to enhance efficiency, innovation, and customer engagement. However, as AI becomes more deeply embedded in these processes, its limitations and the challenges it introduces are increasingly apparent.

While AI is widely adopted across sectors, concerns about its reliability, accuracy, and overall effectiveness continue growing. These concerns have been amplified by high-profile system failures over the past year, exposing critical vulnerabilities in software systems. As a result, businesses are increasingly re-evaluating how technologies are tested, maintained, and trusted to deliver the expected quality consistently.

Leapwork spoke to 401 senior and technical professionals in the US and UK, half of which were C-Suite executives, to understand two critical components of the new AI era in the context of the digital enterprise:

1. Testing of AI: How businesses will build trust in the AI applications they are integrating

This report provides decision-makers with a comprehensive overview of the current state of AI and software quality, offering essential insights and solutions for businesses to adapt and consistently deliver exceptional user and customer experiences at scale, now that AI is a critical part of the equation.

78% of companies see testing AI as essential, which aligns with the majority already encountering significant issues.

The continued need for human oversight in AI-driven QA

Amid a year of outages that have taken social media platforms offline, stopped fast food deliveries, stranded passengers at airports, ceased bank operations and postal deliveries, global conditions are ripe for a potentially rocky AI revolution.

With 85% of companies having integrated AI into their tech stack in the past year, there’s cause for concern that the number of IT failures is on the rise. Already, 68% of companies have faced performance and reliability issues with their AI applications.

“There have been too many outages this year alone, many of which affected millions of customers for big brands. We’ve been given a wake-up call no one can ignore,” says Christian Brink Frederiksen, CEO of Leapwork. “What makes digital infrastructure today so tricky to test is the copious amount of complex, interconnected applications. A tiny error in one application could have a monumental cascading effect and shut down businesses.”

Like any software, there is growing recognition that AI applications need thorough testing to prevent business disruptions. An overwhelming 78% of companies agree the AI apps need better testing; in fact, 77% of C-Suite executives said testing is critical to ensuring their performance, accuracy, and reliability.

“For all its advancements, AI has limitations, and I think people are coming around to that fact pretty quickly,” says Robert Salesas, CTO at Leapwork. “The rapid automation enabled by AI can dramatically increase output, but without thorough testing, this could also lead to more software vulnerabilities, especially in untested applications. It makes sense that C-Suite executives would be especially sensitive to this because of the implications for customer experience and negative publicity.

[Chart: No 15%; Yes 83%, 88%, 86%]

“There’s an opportunity here for cross-industry collaboration to ensure more testing tools are up to scratch for the challenges of the modern world where AI apps are more and more widespread.”

For now, AI has limitations, and integration failure in particular was the most commonly cited issue, named by 22% of C-Suite executives.

Despite the urgent need for reliable AI, only 16% of companies believe their testing processes are efficient.

This reveals a troubling gap in quality assurance that raises an important question: If today’s testing methods are falling short, how can the industry ensure that AI delivers on the promised benefits?

As companies grapple with the inherent bugs and limitations of AI, a consideration emerges: the potential of AI-augmented testing tools to effectively tackle the unique challenges posed by AI applications.

While AI-augmented testing tools are gaining traction across industries, their true potential in enhancing AI reliability and performance is yet to be fully realized. Leapwork’s findings reveal that the trust is there: 68% of overall respondents say they trust the results that AI-augmented tools provide. But isn’t there a paradox in using AI to address concerns about the very technology it is meant to test? This is a crucial aspect we’ll explore further in this report, as leveraging these tools could be key to mitigating the risks associated with AI integration.

started adopting AI-augmented testing tools – an overwhelming 79% now use them.

The fact that so many also trust the results of these tools indicates an understanding amongst early adopters of their possibilities and limitations, especially as trust is even higher amongst technical leaders (72% vs. 64% for C-Suite).

But a closer look at the survey results reveals gaps between industries: there’s significantly more trust placed in AI-augmented testing within the technology industry (80%) than in retail (53%).

“With retail, it’s easy to think about the mega retailers and forget about the smaller boutique vendors who have less familiarity with AI,” says Salesas.

[Chart: No 21%; Yes 81%, 85%, 76%]

“Not every retailer is tech-first like Amazon, and there’s likely a cultural gap at play here: tech companies are at the forefront of AI development and implementation, which means they get first dibs on the talent who is more likely to have a deeper understanding of the tools’ capabilities and limitations. On a practical level, there are also still many retail operations that rely on older systems that don’t integrate seamlessly with AI-augmented testing tools, and the stakes of failures are high when any errors can directly impact customer satisfaction and sales. Retail environments themselves can be enormously diverse and complex – and the customers even more diverse – which might be giving professionals in the industry pause about trusting AI’s ability to test all situations accurately.”

Still, as trust in these tools grows, it naturally raises a broader reflection on how adoption of AI-augmented testing tools will affect the role of human testers in the long term.

Like every industry and trade impacted by AI, QA teams now face the question, ‘what will happen to humans?’ Leapwork’s findings suggest humans are unlikely to disappear from the testing equation anytime soon. In fact, over two-thirds of C-Suite executives (68%) believe that testing will need human validation for the foreseeable future, and almost every single IT Director (92%) agrees.

“There’s always going to be some variation in how technical teams and C-Suite executives perceive the need for human validation,” says Salesas. “For IT teams, there’s a natural concern about job security as AI tools evolve, but the focus should be on how these tools can enhance their roles rather than replace them. On the other hand, C-Suite executives are looking at business operations more broadly, with optimism about how technology can improve efficiency. While their perspectives may differ, both groups agree that human input will remain a critical part of the testing process.”

“I believe that the synergy between AI and human expertise represents a transformative partnership in software testing. AI tools can significantly enhance efficiency, allowing technical teams to focus on innovation and ideation rather than the repetitive details of testing in an increasingly complex software

Interestingly, one sector is an outlier: manufacturing. While most respondents still believe in human validation, the number is a lot lower (56%) than in other sectors like technology/software (79%), finance and banking (83%), and healthcare (85%).

“Manufacturing is centered today around maximizing automation – it’s all about standardization, repetitive processes, efficiency, and a strong desire to keep costs low. This could explain why respondents in the sector perceive less need for human intervention. It speaks to a range of different priorities, regulations, environments, and operational characteristics of these sectors. Finance and healthcare come with considerably strict and unique compliance and safety requirements that may be nudging those sectors towards a stronger preference for human supervision.”

Over half of C-Suite executives (53%) say the tools have increased the number of new roles compared to just over a third (36%) of technical leaders. These perceptions also vary based on sector: 52% of respondents in healthcare are seeing new jobs created compared to only 36% in government. About 57% of respondents in manufacturing report a reduction in jobs.

No matter how AI impacts human testing roles, the trend is clear: AI-augmented testing tools continue to gain popularity. Most organizations (74%) foresee increased investment in AI-augmented testing tools in their organization in the next year. The degree to which they agree varies from role to role. For example, most CTOs (77%) expect increased investment into the tools, but far fewer CIOs (58%) do.

“Generally speaking, CTOs tend to focus much more on emerging technologies and how to apply them to the business. That goes a long way to explain why they might advocate more strongly for AI-augmented tools and AI in general. They’re thinking about the long-term vision and staying competitive in that future,” says Salesas. “CIOs, on the other hand, are more about the day-to-day IT operations – particularly the short-to-medium term. They manage a far larger remit of operations and competing priorities. This might make CIOs more reluctant to take any big leaps on tools that may require further skills and financial investment to fully take advantage of.”

“I believe that a critical component to swaying executives who are unconvinced about the value of AI-augmented testing will be to present tools that are intuitive to use, making them accessible not just for technical teams but also for business users. When the skills gap is so wide, you can’t afford for things to be difficult to adopt.”

This leads us back to the pressing question: If easy-to-learn AI-augmented testing tools are the answer to buggy AI apps, is there a paradox in relying on the same technology that’s already causing issues to test itself?

The resolution to this potential paradox lies in understanding that AI-augmented testing tools are not simply another layer of AI. In the right context, they can be used to address specific shortcomings of AI systems by providing rigorous, unbiased assessments of their performance.

[Chart: No 6%; Yes 74%, 87%, 78%; visible category labels: Director of IT, IT Manager / Lead]

This is where Leapwork’s approach comes into play. Leapwork’s approach to using AI to test AI is highly effective and designed to build trust, not just in your AI applications but across all the applications that make up your business processes.

Understanding how AI can validate AI

Leapwork’s AI capabilities are designed to assess how well generative AI handles specific tasks and responds to user-defined prompts. They do this by comparing the AI-generated responses against predefined, human-crafted expectations. This ensures that the outputs of your AI applications are consistent, accurate, and aligned with your intended results.

For instance, consider an AI chatbot designed to answer user queries. If asked, “Do you offer international shipping?” (a question that could be phrased in various ways such as “Can you ship to other countries?” or “Do you deliver overseas?”), the AI Validate block ensures that every response, which might also vary linguistically, still accurately conveys the correct answer: “Yes, we offer international shipping to select countries.” By automating this validation process, Leapwork helps you avoid the pitfalls of AI unreliability, providing a robust framework for ensuring that your AI applications deliver the right results every time.
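
To make the idea of checking varied responses against a human-written expectation more concrete, here is a minimal Python sketch. It is not Leapwork's AI Validate implementation, whose internals this report does not describe. The `Expectation` structure, the `validate` function, and the keyword lists are all hypothetical, and a simple keyword check stands in for the semantic comparison a real tool would presumably perform.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Expectation:
    """A human-written expectation for one question (hypothetical structure)."""
    required: tuple   # every one of these words must appear in the response
    forbidden: tuple  # none of these words may appear in the response


def normalize(text: str) -> str:
    """Lowercase the text and replace punctuation with spaces."""
    return re.sub(r"[^a-z0-9\s]", " ", text.lower())


def validate(response: str, expectation: Expectation) -> bool:
    """Return True if the response passes the keyword-based expectation.

    Assumption: whole-word keyword matching is only a stand-in for a
    real semantic comparison between the response and the expectation.
    """
    words = set(normalize(response).split())
    has_required = all(word in words for word in expectation.required)
    has_forbidden = any(word in words for word in expectation.forbidden)
    return has_required and not has_forbidden


# Expectation for "Do you offer international shipping?" (illustrative only).
shipping = Expectation(
    required=("yes", "international"),
    forbidden=("no", "not", "cannot"),
)

for reply in (
    "Yes, we offer international shipping to select countries.",
    "No, we do not ship internationally.",
):
    print(reply, "->", "Pass" if validate(reply, shipping) else "Fail")
```

A keyword check like this would wrongly fail a correct paraphrase such as “Yes, we deliver overseas,” which is exactly the kind of linguistic variation the passage above says a validation step must tolerate. That limitation is why a production tool would need a meaning-level comparison rather than word matching.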

[Figure: example flow with the blocks Start, D365 Retail Login, Read Excel (Range “A1:C8”, sheet: “Sheet1”), Verify D365 Match..., and AI Validate; outcomes Pass, Not Valid, and Fail; fields Product Number and Price]

Report methodology
The research was conducted by Censuswide, which gathered responses from 401 respondents across US and UK organizations. These included 201 C-Suite executives (CTOs/CIOs) and 200 technical leads (including VPs of IT, Directors of IT, IT Managers, software engineering leads, and QA test managers/directors).