Architecture
This directory contains architectural documentation for podcast_scraper — the current
system design, quality constraints, testing approach, data contracts, and the platform
vision for where the system is heading.
Current state
| Document |
Purpose |
| Architecture |
System design — pipeline flow, module map, configuration, ways to run, ADR index |
| Non-Functional Requirements |
Quality constraints — performance, security, reliability, observability, maintainability, scalability |
| Testing Strategy |
Test pyramid, patterns, decision criteria, CI integration |
Target state
| Document |
Purpose |
| Platform Architecture Blueprint |
Platform vision — multi-tenant platform, distributed ML, two-tier deployment, observability, deployment lifecycle. Concrete RFCs are broken out from individual sections as implementation begins. |
Data contracts (ontology specifications)
| Folder |
Contents |
| gi/ |
Grounded Insight Layer (GIL) ontology — node/edge types, grounding contract, gi.schema.json |
| kg/ |
Knowledge Graph (KG) ontology — entities, topics, relationships, kg.schema.json |
Diagrams
Generated architecture visualizations. See diagrams/ for the full
list and regeneration instructions.