Skip to content

Architecture Atlas

The Architecture Atlas explains why YOLO-Toys looks the way it does: thin routes, a central manager, explicit handler dispatch, and normalization boundaries that let heterogeneous model families share one service contract.

Read this chapter when: you need the system map, the request lifecycle, and the decision boundaries that make extension safe.

Questions this chapter answers

  • Why not expose one endpoint per model family?
  • Why centralize model resolution through the registry?
  • Where does normalization happen, and what does it cost?
  • How does the runtime stay extensible without becoming opaque?
  • How does the middleware stack order reflect production concerns?
  • Why is the cache operationally aware rather than just time-based?
  1. Start with System Overview
  2. Continue to Request Lifecycle
  3. Read Handler Topology for execution boundaries
  4. Read Middleware Stack for operational layers
  5. Read Config Injection for settings flow
  6. Read Model Cache for caching strategy
  7. Finish with the ADR set to understand intentional trade-offs

Released under the MIT License.