The podcast explores the challenges posed by flaky tests in software development: tests that pass or fail unpredictably without any change to the code under test. These tests undermine confidence in CI/CD pipelines by obscuring whether code changes are safe, and they consume significant time spent investigating false failures. Key qualities of reliable tests include self-checking behavior, fast feedback, isolation, and consistent results. The discussion highlights that flakiness often stems from shared state (e.g., database inconsistencies), race conditions, or environmental differences, and it emphasizes rooting out these systemic issues rather than merely masking symptoms.
Strategies for managing flaky tests include temporarily removing them from CI pipelines, assigning ownership to specific individuals for accountability, and prioritizing fixes for critical tests. Suggested stopgaps include rerunning failed tests (up to three times) and setting a "time-to-live" policy so that unresolved flaky tests are eventually removed. Addressing root causes involves refactoring toward a test pyramid structure, favoring smaller, isolated tests, and deleting tests that no longer add value. The episode also touches on the role of AI in testing: it can generate test ideas or help debug code, but it should not replace human judgment, because it can produce overconfident or erroneous suggestions. Overall, the focus is on treating flakiness as an opportunity to improve system robustness and code reliability rather than as an unavoidable hurdle.