High-Level Design (HLD / Distributed Systems) Interview Questions & Answers — Cracked Java
// Module · 28 topics · 70 questions


Distributed systems and large-scale architecture interview preparation. Fundamentals (CAP, consistency, replication), building blocks (caches, queues, databases), and full system designs across FAANG, EU, and regional interview styles.

Estimated time: ~60h
01 · Mid · 5 questions

The HLD Interview Framework

The canonical 4-step approach (requirements → estimation → high-level design → deep dive), capacity-estimation method, time budgeting, and how FAANG / EU / regional styles differ.

02 · Senior · 5 questions

CAP Theorem, PACELC, Consistency Models

CAP stated correctly, the PACELC extension, the consistency spectrum (linearizable → eventual), ACID vs BASE, and quorum reads/writes.

03 · Senior · 6 questions

Replication & Partitioning (Sharding)

Single-leader / multi-leader / leaderless replication, sync vs async, replication lag, sharding strategies, consistent hashing, resharding, and hot partitions.

04 · Senior · 5 questions

Caching Strategies

Cache layers, the four caching patterns, eviction policies, cache stampede mitigation, invalidation, Redis vs Memcached, and two-tier (L1/L2) caching.

05 · Senior · 5 questions

Message Queues & Event Streaming

Queue vs log, delivery guarantees, ordering, backpressure, dead-letter queues, the outbox pattern, and CDC. RabbitMQ vs Kafka.

06 · Senior · 4 questions

Database Choices — SQL vs NoSQL

When to choose relational, document, key-value, wide-column, graph, search, or time-series stores, and the cost of polyglot persistence.

07 · Mid · 4 questions

API Design — REST, gRPC, GraphQL, WebSockets

REST principles and idempotency, REST vs gRPC vs GraphQL, real-time options (WebSockets/SSE/long polling), versioning, and pagination.

08 · Senior · 3 questions

Rate Limiting & Throttling (HLD perspective)

Distributed rate limiting with Redis, per-user/IP/key granularity, edge vs application limits, the API-gateway role, and 429 + Retry-After.

09 · Mid · 3 questions

Load Balancing

L4 vs L7, balancing algorithms, consistent hashing for cache-aware routing, sticky sessions, health checks, and LB redundancy.

10 · Senior · 4 questions

Microservices, Service Mesh, API Gateway

Monolith vs microservices vs modular monolith, service discovery, API gateway, service mesh, distributed tracing, and the Saga + outbox patterns.

11 · Senior · 4 questions

Storage Systems — Disk, RAM, Object Storage

The storage hierarchy and latency numbers, B-trees vs LSM-trees, row vs columnar storage, object/block/file storage, and tiered hot/warm/cold data.

12 · Mid · 3 questions

CDN & Edge

What a CDN does, edge cache invalidation, edge compute, anycast vs DNS geo-routing, HTTP/2 and HTTP/3, and TLS termination at the edge.

13 · Senior · 4 questions

Observability — Logging, Metrics, Tracing

The three pillars, structured logging, metric types, distributed tracing with OpenTelemetry, and SLI/SLO/SLA with error budgets.

14 · Mid · 1 question

Design a URL Shortener (TinyURL / bit.ly)

The most common HLD warm-up. ID-generation strategies, SQL vs NoSQL choice, caching, custom URLs, and analytics — with FAANG / EU / regional variants.

15 · Senior · 1 question

Design Twitter / Threads (Social Feed)

Timeline generation (pull / fan-out-on-write / hybrid), feed ranking, the celebrity (hot-user) problem, media storage, and caching layers.

16 · Senior · 1 question

Design a Chat / Messaging System (WhatsApp / Telegram)

1:1 and group chat, online presence, delivery/read receipts, media, WebSocket vs polling, and sharding by chat ID.

17 · Senior · 1 question

Design a Ride-Sharing System (Uber / Lyft)

Geospatial indexing (geohash / S2 / H3), driver-rider matching, surge pricing, real-time location tracking, and payment integration.

18 · Senior · 1 question

Design Instagram / Photo-Sharing

Feed generation, image storage and delivery (S3 + CDN), thumbnails, stories (24h TTL), likes/comments at scale, and hashtag search.

19 · Senior · 1 question

Design Netflix / YouTube (Video Streaming)

Video encoding pipeline, adaptive bitrate streaming (HLS/DASH), CDN delivery, recommendations, and view counting at scale.

20 · Senior · 1 question

Design a Notification Service (at scale)

Channel abstraction, template service, user preferences, a delivery queue with retries and DLQ, per-channel rate limiting, and provider failover.

21 · Senior · 1 question

Design a Web Crawler

URL frontier (priority + dedup), politeness (robots.txt, per-domain rate limit), DNS caching, content storage, duplicate detection, and distributed coordination.

22 · Senior · 1 question

Design a Search Autocomplete / Typeahead

Trie data structure, prefix caching, top-K suggestions, ranking by popularity/personalization, and the trend-update pipeline.

23 · Senior · 1 question

Design a Distributed Key-Value Store (Dynamo-style)

Consistent hashing, virtual nodes, replication, vector clocks, read repair, Merkle trees, gossip, and sloppy quorum.

24 · Senior · 1 question

Design a Distributed Job Scheduler (Cron at Scale)

Job queue, worker assignment, leader election (ZooKeeper/etcd), missed-run handling, distributed locks, idempotency, and monitoring.

25 · Senior · 1 question

Design Google Docs / Collaborative Editor

Operational Transformation vs CRDTs, real-time sync over WebSockets, presence, cursor sharing, version history, and offline editing.

26 · Senior · 1 question

Design a Payment System

Idempotency keys, distributed transactions via Saga, reconciliation, double-entry bookkeeping, fraud hooks, and processor integration (Stripe/Click/Payme model).

27 · Senior · 1 question

Design an Exam Prep Platform (PrepHub-style)

Exam session management, real-time scoring, leaderboards, payment integration, content delivery, and anti-cheat/proctoring — an end-to-end real-system case.

28 · Mid · 1 question

How to Practice HLD

How many problems to solve, the internalizable template, per-section time-boxing, handling "I don't know" gracefully, and how to signal seniority.
