

The Amazon AMET Payments team, serving approximately 10 million customers across five countries in the Middle East and North Africa, has reduced test case generation time from one week to mere hours. This was made possible by implementing the multi-agent AI solution SAARAM, which also improved the quality of test coverage.
If routine quality checks are taking weeks for your QA engineers, this is already a prime case for automation. Message our manager — he will run a free analysis of your business and niche and show exactly how to get a real business result from an AI agent in your case, not a nice-looking picture. Message the manager
The AMET Payments team releases an average of five new features monthly. Each feature requires comprehensive test case generation, which traditionally consumed one week of manual effort per project. Quality assurance engineers spent this time analyzing business requirement documents (BRDs), design documents, UI mocks, and historical test preparations—a process that required one full-time engineer annually merely for test case creation.
Initial attempts using single-agent AI systems often produced generic outputs like “verify payment works correctly” instead of the specific, actionable test cases the QA team required. For example, specific details were needed, such as “verify that when a UAE customer selects cash on delivery (COD) for an order above 1,000 AED with a saved credit card, the system displays the COD fee of 11 AED and processes the payment through the COD gateway with order state transitioning to ‘pending delivery.’”
Amazon developed SAARAM (QA Lifecycle App), a multi-agent AI solution that helps reduce test case generation from one week to hours. SAARAM is built using Amazon Bedrock with Claude Sonnet by Anthropic and the Strands Agents SDK.
A key breakthrough came from a human-centric approach: instead of asking “How should AI think about testing?”, the team asked “How do experienced humans think about testing?”. This led to detailed research into the cognitive workflows of senior QA professionals, who do not process documents holistically but work through specialized mental phases.
SAARAM consists of specialized agents, each focusing on a specific aspect of the testing process, mimicking expert approaches:
The system went through several iterations to overcome context length limitations, reduce hallucinations, and ensure scalability.
The implementation of SAARAM led to the following key results:
The solution is already being used by the AMET QA team and is positioned for expansion across other QA teams in the International Emerging Stores and Payments (IESP) Org.
Source: aws.amazon.com
Want to learn how AI agents can accelerate your testing processes or other routine tasks? Message the manager. We will analyze your case for free and suggest the optimal solution.