Testing Microservices: Lessons from the beShera EdTech Platform

beShera is a social EdTech platform built on a microservices architecture. User onboarding, course management, social interactions, examinations, and AI-powered learning features all run as separate services communicating over APIs.

This was the most complex system I've tested. Here's what I learned.

Why Microservices Testing Is Different

In a monolith, data flows through one codebase. You break something, you see it. In microservices, a bug in the notification service might only appear when the exam service fires an event, which only happens under specific user state conditions.

The real bugs are at the integration points.

My Testing Strategy

1. Map the Service Boundaries First

The map answers three questions:

Which service owns which data?
How do services communicate (REST, events, or both)?
What happens when one service is slow or down?
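
A boundary map like this can be kept as data, so the tests can query it. The sketch below is only an illustration: the service names come from this post, but the entities, the dependency edges, and the helper names are assumptions, not beShera's actual architecture.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ServiceBoundary:
    name: str
    owns: frozenset                      # entities this service is the source of truth for
    calls: frozenset = frozenset()       # services it calls synchronously (REST)
    listens_to: frozenset = frozenset()  # services whose events it consumes


# Hypothetical entries; a real map would come from architecture docs and code.
SERVICE_MAP = [
    ServiceBoundary("user", owns=frozenset({"profile", "credentials"})),
    ServiceBoundary("exam", owns=frozenset({"exam_attempt"}),
                    calls=frozenset({"user"})),
    ServiceBoundary("notification", owns=frozenset({"notification"}),
                    calls=frozenset({"user"}), listens_to=frozenset({"exam"})),
]


def ownership_conflicts(services):
    """Return {entity: [owners]} for entities claimed by more than one service."""
    owners = {}
    for svc in services:
        for entity in svc.owns:
            owners.setdefault(entity, []).append(svc.name)
    return {e: sorted(names) for e, names in owners.items() if len(names) > 1}


def directly_affected_by(services, failing):
    """Services that call `failing` or consume its events: candidates for outage tests."""
    return sorted(s.name for s in services
                  if failing in s.calls or failing in s.listens_to)
```

With this in place, `directly_affected_by(SERVICE_MAP, "user")` lists the services worth re-testing when the user service is slow or down.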

This map became the source of truth for all my edge-case tests.

2. Contract Testing for APIs

Does Service B return what Service A expects?
What happens when Service B returns an unexpected field?
What happens when Service B adds a new required field?

I found a real bug here: the exam service expected user_id as an integer, but the user service started returning it as a string after a migration. Tests passed in isolation. The integration broke.
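
The user_id mismatch is the kind of drift a consumer-side contract check catches. Here is a minimal sketch, assuming the exam service reads two fields from the user service response. The `email` field, the `EXAM_SERVICE_EXPECTS` table, and the `contract_violations` helper are all made up for illustration; a real setup would more likely use a dedicated contract-testing tool.

```python
# Hypothetical consumer contract: the fields the exam service reads from a
# user-service response, and the Python type it expects for each.
EXAM_SERVICE_EXPECTS = {"user_id": int, "email": str}


def contract_violations(payload, expected):
    """Return human-readable violations; an empty list means the payload conforms.

    Extra fields in the payload are tolerated, because a consumer should
    ignore fields it does not read.
    """
    problems = []
    for field, expected_type in expected.items():
        if field not in payload:
            problems.append(f"missing field: {field}")
            continue
        value = payload[field]
        # bool is a subclass of int in Python, so reject it explicitly for int fields.
        wrong_type = not isinstance(value, expected_type) or (
            expected_type is int and isinstance(value, bool)
        )
        if wrong_type:
            problems.append(
                f"{field}: expected {expected_type.__name__}, got {type(value).__name__}"
            )
    return problems
```

Running this against a response captured from the real provider, rather than a hand-written mock, is what makes the check useful: a mock keeps returning the old integer forever.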

3. End-to-End Critical Paths

I defined 5 critical user journeys and tested them end-to-end:

New user onboarding: register → verify email → complete profile → enroll in first course
Course completion: watch all modules → pass exam → receive certificate
Social interaction: post → comment → like → notification delivery
AI-assisted learning: start AI session → complete → progress saved
Account recovery: forgot password → reset → re-login → session intact

Each journey touched 3–7 microservices. Any one of them could break the entire flow.
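
Because one failing service breaks the whole journey, it helps when the test reports which step broke. The sketch below shows one way to structure that. The `run_journey` helper and the in-memory step functions are hypothetical stand-ins; real steps would call the services through their APIs.

```python
def run_journey(name, steps, context=None):
    """Run steps in order; stop at the first failure and report where it broke.

    Each step is a (label, callable) pair. The callable receives a shared
    context dict and raises an exception on failure.
    """
    context = {} if context is None else context
    for index, (label, step) in enumerate(steps, start=1):
        try:
            step(context)
        except Exception as exc:
            return {"journey": name, "passed": False,
                    "failed_step": label, "step_number": index, "error": str(exc)}
    return {"journey": name, "passed": True,
            "failed_step": None, "step_number": len(steps), "error": None}


# In-memory stand-ins for the onboarding steps (assumed behavior, not real clients).
def register(ctx):
    ctx["user_id"] = 1


def verify_email(ctx):
    ctx["verified"] = True


def complete_profile(ctx):
    if not ctx.get("verified"):
        raise RuntimeError("email not verified")
    ctx["profile_complete"] = True


def enroll_in_first_course(ctx):
    if not ctx.get("profile_complete"):
        raise RuntimeError("profile incomplete")
    ctx["enrolled"] = True


ONBOARDING = [
    ("register", register),
    ("verify email", verify_email),
    ("complete profile", complete_profile),
    ("enroll in first course", enroll_in_first_course),
]
```

Calling `run_journey("onboarding", ONBOARDING)` returns a result dict, and a failure names the first step that raised.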

4. Chaos Testing (Informal)

What happens to an active exam session if the network drops mid-test?
What if the user closes the browser during payment?
What if the same user logs in from two devices simultaneously?

The simultaneous session test found a real bug: both sessions stayed active, and actions from one session sometimes overwrote the other's progress.
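
One common guard against this kind of lost update is optimistic concurrency: every write must name the version it was based on. The sketch below shows the general technique, not the fix the team shipped. `ProgressStore` and `StaleWriteError` are hypothetical names, and an in-memory dict stands in for the real database.

```python
class StaleWriteError(Exception):
    """Raised when a write is based on an outdated version of the record."""


class ProgressStore:
    def __init__(self):
        self._records = {}  # user_id -> (version, progress dict)

    def read(self, user_id):
        """Return (version, progress); version 0 means nothing saved yet."""
        version, progress = self._records.get(user_id, (0, {}))
        return version, dict(progress)

    def write(self, user_id, expected_version, progress):
        """Save progress only if the caller saw the latest version."""
        current_version, _ = self.read(user_id)
        if expected_version != current_version:
            raise StaleWriteError(
                f"expected version {expected_version}, current is {current_version}"
            )
        new_version = current_version + 1
        self._records[user_id] = (new_version, dict(progress))
        return new_version
```

A two-device test then reads from both sessions, writes from one, and asserts that the second write is rejected instead of silently overwriting the first.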

The Bug I'm Most Proud of Finding

During social interaction testing, I found that unliking a post you had never liked still decremented the like count. With no likes on the post, the count went to -1, -2, and so on.

The root cause: the frontend sent the unlike API call optimistically before confirming whether the user had actually liked the post. The backend didn't validate current like state before decrementing.

Impact: High. A public-facing counter showing negative numbers is both a UX bug and a data integrity issue.
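
The missing backend check can be sketched as follows. This is an assumed design rather than beShera's actual code: the `PostLikes` class is hypothetical, and it derives the count from the set of users who liked the post, so the count cannot drop below zero.

```python
class PostLikes:
    """Tracks who liked a post; the count is derived, never decremented directly."""

    def __init__(self):
        self._liked_by = set()

    @property
    def count(self):
        return len(self._liked_by)

    def like(self, user_id):
        """Return True if the like was recorded, False if it already existed."""
        if user_id in self._liked_by:
            return False
        self._liked_by.add(user_id)
        return True

    def unlike(self, user_id):
        """Return True if a like was removed, False if the user had not liked the post."""
        if user_id not in self._liked_by:
            return False
        self._liked_by.remove(user_id)
        return True
```

The regression test for the original bug is short: call `unlike` for a user who never liked the post and assert that `count` is still 0.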

Key Takeaways

Test services in isolation first, then together — find the unit bugs before the integration bugs
Idempotency matters — every write operation should be safe to call twice
Event-driven bugs are time-delayed — not everything shows up immediately
State management across services is the hardest problem — test every state transition explicitly
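
The idempotency takeaway can be turned into a reusable check. The sketch below uses a hypothetical `assert_idempotent` helper and in-memory stand-ins for an enrollment write and a counter; in a real suite, the operation would be an API call and the state would be read back from the service.

```python
import copy


def assert_idempotent(operation, read_state):
    """Apply `operation` twice and check the second call leaves state unchanged."""
    operation()
    after_first = copy.deepcopy(read_state())
    operation()
    after_second = read_state()
    assert after_first == after_second, (
        f"not idempotent: {after_first!r} -> {after_second!r}"
    )


# Stand-in state: enrollment is a set insert (safe to repeat),
# while a raw counter increment is not.
enrollments = set()
view_counter = {"views": 0}


def enroll():
    enrollments.add(("user-1", "course-1"))


def bump_views():
    view_counter["views"] += 1
```

`assert_idempotent(enroll, lambda: enrollments)` passes, while the same check on `bump_views` raises an `AssertionError`, which is exactly the signal you want for a write that is not safe to retry.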