ADR-048: Centralized Model Registry¶

Context & Problem Statement¶

Model architecture limits (e.g., 1024 for BART, 16384 for LED) are currently hardcoded throughout the codebase:

This leads to:

Maintenance Burden: Adding new models requires updating hardcoded values in multiple files
Inconsistency Risk: Limits can drift out of sync between files
No Single Source of Truth: Model capabilities are scattered and undocumented
Error-Prone: Easy to use wrong limits for new or unknown models
Limited Extensibility: Hard to add new model types without code changes

We adopt a Centralized Model Registry to store model architecture limits and capabilities.

Model Registry: Single source of truth for all model architecture limits stored in src/podcast_scraper/providers/ml/model_registry.py.
ModelCapabilities Dataclass: Structured, type-safe model capability information (max context window, model type, etc.).
O(1) Lookup: Registry provides fast lookup by model ID/alias.
Pattern-Based Fallbacks: Intelligent guessing for unknown models (e.g., "bart-*" → BART limits).
Safe Defaults: Conservative defaults for unknown models.
Extensibility: Runtime registration for custom models.

Single Source of Truth: Model capabilities are documented and maintained in one place
Eliminates Hardcoded Values: Removes scattered hardcoded limits throughout codebase
Maintainability: Adding new models requires updating one place, not multiple files
Consistency: Limits can't drift out of sync
Extensibility: New models can be registered without code changes
Type Safety: Structured, type-safe model capability information

Keep Hardcoded Values: Rejected as it leads to maintenance burden and inconsistency.
Dynamic Detection Only: Rejected as it requires model loading and doesn't work for unknown models.
Configuration Files: Rejected as it adds complexity and doesn't provide type safety.

Module: src/podcast_scraper/providers/ml/model_registry.py - Model registry
Pattern: Registry pattern with pattern-based fallbacks
ModelCapabilities: Max context window, model type (BART, LED, T5, etc.), aliases
Lookup: O(1) lookup by model ID/alias
Fallbacks: Pattern-based guessing (e.g., "bart-" → BART limits, "led-" → LED limits)
Defaults: Conservative defaults for unknown models (e.g., 512 tokens)
Extensibility: Runtime registration via register_model() function
Model-Agnostic: Handles both test and production models identically
Status: 🟢 Implemented — model_registry.py with ModelRegistry and ModelCapabilities