
CI/CD Workflows

This document provides detailed documentation for all GitHub Actions workflows.

For architecture and optimization strategies, see Overview. For resource usage and limits, see Resource Usage. For local development setup, see Local Development.


CI/CD Pipeline

Overview

The Podcast Scraper project uses GitHub Actions for continuous integration and deployment. The CI/CD pipeline consists of seven workflows that automate testing, code quality checks, security scanning, Docker validation, documentation deployment, dependency updates, and metrics collection.

Workflows Summary

| Workflow | File | Purpose | Trigger |
|---|---|---|---|
| Python Application | python-app.yml | Main CI pipeline with testing, linting, and builds | Push/PR to main (only when Python/config files change) |
| Documentation Deploy | docs.yml | Build and deploy MkDocs documentation to GitHub Pages | Push to main, PR with doc changes, manual |
| CodeQL Security | codeql.yml | Security vulnerability scanning | Push/PR to main (only when code/workflow files change), scheduled weekly |
| Docker Build & Test | docker.yml | Build and test Docker images | Push to main (all), PRs (Dockerfile/.dockerignore only) |
| Snyk Security Scan | snyk.yml | Dependency and Docker image vulnerability scanning | Push/PR to main, scheduled weekly (Mondays), manual |
| Nightly Comprehensive | nightly.yml | Full test suite with comprehensive metrics collection | Scheduled daily (2 AM UTC), push to release branches, manual |
| Dependabot | dependabot.yml | Automated dependency update PRs | Scheduled weekly (Mondays), monthly for Docker |

Complete Pipeline Visualization

```mermaid
graph TB
    subgraph "Trigger Events"
        T1[Push to main]
        T2[Pull Request to main]
        T3[Schedule]
        T4[Manual Dispatch]
    end

    subgraph "Python Application Workflow"
        P1[Lint Job]
        P2[Test Job]
        P3[Docs Build Job]
        P4[Build Package Job]
    end

    subgraph "Docs Workflow"
        D1[Build Docs]
        D2[Deploy to Pages]
    end

    subgraph "CodeQL Workflow"
        C1[Security Analysis]
    end

    subgraph "Docker Workflow"
        DOCK1[Build Docker Image]
        DOCK2[Test Docker Image]
    end

    subgraph "Snyk Workflow"
        SNYK1[Dependencies Scan]
        SNYK2[Docker Scan]
    end

    T1 --> P1 & P2 & P3 & P4
    T2 --> P1 & P2 & P3 & P4
    T1 --> D1 --> D2
    T1 --> C1
    T1 --> DOCK1 --> DOCK2
    T1 --> SNYK1 & SNYK2
```

The Python Application workflow triggers when any of these paths change:

  • **.py - Python source files
  • tests/** - Test files
  • pyproject.toml - Project configuration
  • Makefile - Build configuration
  • Dockerfile, .dockerignore - Docker files
  • .github/workflows/python-app.yml - Workflow itself

Skips when: Only documentation, markdown, or non-code files change

Workflow Trigger Conditions

Each workflow only runs when specific files are modified:

python-app.yml Workflow:

  • **.py (any Python file)
  • tests/** (any test file)
  • pyproject.toml
  • Makefile
  • Dockerfile
  • .dockerignore
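
For reference, filters like these are declared under the workflow's `on:` block. A minimal sketch of the shape (the actual python-app.yml may differ):

```yaml
# Sketch of path-filtered triggers; illustrative, not the exact file.
on:
  push:
    branches: [main]
    paths:
      - "**.py"
      - "tests/**"
      - "pyproject.toml"
      - "Makefile"
      - "Dockerfile"
      - ".dockerignore"
  pull_request:
    branches: [main]
    paths:
      - "**.py"
      - "tests/**"
      - "pyproject.toml"
      - "Makefile"
      - "Dockerfile"
      - ".dockerignore"
```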

docker.yml Workflow:

  • Push to main: All changes (Dockerfile, .dockerignore, pyproject.toml, *.py)
  • Pull requests: Only Dockerfile or .dockerignore changes

snyk.yml Workflow:

  • **.py
  • pyproject.toml
  • Dockerfile
  • .dockerignore

codeql.yml Workflow:

  • **.py
  • .github/workflows/**

docs.yml Workflow:

  • docs/**, mkdocs.yml, **.py, README.md, .github/workflows/docs.yml (see the Documentation Deployment Workflow section for details)
  • Note: PRs with doc changes only build the docs; deployment runs only on push to main

Job Details

This is the main CI pipeline (python-app.yml): it ensures code quality, runs tests, builds documentation, and validates the package.

Always-Running Jobs

1. Lint Job (Always Runs)

Purpose: Perform quick code quality checks without heavy ML dependencies

When: Both PRs and push to main

Duration: ~1-2 minutes

Steps:

  1. Checkout code
  2. Set up Python 3.11 with pip caching
  3. Set up Node.js 20 for markdown linting
  4. Install dev dependencies (excluding ML packages)
  5. Install markdownlint-cli
  6. Run all lint checks:
     - make format-check - Black & isort formatting validation
     - make lint - flake8 code linting
     - make lint-markdown - Markdown file linting
     - make type - mypy static type checking
     - make security - bandit & pip-audit security scanning
  7. Run code quality analysis (continue-on-error):
     - radon cc - Cyclomatic complexity analysis
     - radon mi - Maintainability index
     - interrogate - Docstring coverage
     - vulture - Dead code detection
     - codespell - Spell checking
  8. Save metrics to reports/ for dashboard consumption
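
A condensed sketch of these steps in workflow syntax (step layout and the setup-node usage are assumptions; the make targets come from the list above):

```yaml
steps:
  - uses: actions/checkout@v4
  - uses: actions/setup-python@v5
    with:
      python-version: "3.11"
      cache: "pip"
  - uses: actions/setup-node@v4      # Node.js 20 for markdown linting
    with:
      node-version: "20"
  - run: pip install -e .[dev]       # dev dependencies only, no ML packages
  - run: npm install -g markdownlint-cli
  - run: make format-check lint lint-markdown type security
```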

Why separate from test? Linting is much faster without ML dependencies, providing quick feedback.


2. test-unit Job (Always Runs)

Purpose: Run unit tests separately for maximum parallelization

When: Both PRs and push to main

Duration: ~2-5 minutes

What it runs: pytest tests/unit/ -v --tb=short -n auto

  • Unit tests: Runs the entire tests/unit/ suite (no marker filtering)
  • Network isolation: Enforced and verified
  • Import verification: Ensures modules work without ML dependencies
  • Parallel execution: Uses -n auto for speed

Key Features:

  • No ML dependencies: Fast execution (dev dependencies only)
  • Network isolation: Enforced via pytest-socket (--disable-socket --allow-hosts=127.0.0.1,localhost)
  • Test count verification: Ensures at least 50 unit tests run
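
Putting the pieces together, the unit-test invocation with the network guard looks roughly like this (a sketch; the exact step in python-app.yml may differ, and the flags require pytest-xdist and pytest-socket):

```yaml
- name: Run unit tests (no ML deps, network isolated)
  run: |
    pytest tests/unit/ -v --tb=short -n auto \
      --disable-socket --allow-hosts=127.0.0.1,localhost
```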

PR-Only Jobs (Fast Feedback)

3. test-integration-fast Job (Runs on PRs only, with exceptions)

Purpose: Run critical path integration tests for fast PR feedback

When: Pull requests only (not on push to main; skipped for docs-only PRs)

Conditional Execution:

```yaml
if: |
  github.event_name == 'pull_request' &&
  github.event.pull_request.head.ref != 'fix/ai-guidelines-linting' &&
  !contains(github.event.pull_request.head.ref, 'docs/')
```

  • Network guard: --disable-socket --allow-hosts=127.0.0.1,localhost
  • Re-runs enabled: 2 retries with 1s delay for flaky tests
  • Parallel execution: Uses -n auto for speed
  • Duration monitoring: --durations=20 shows the 20 slowest tests

Key Features:

  • Fast feedback: Critical path tests only (not all integration tests)
  • Includes slow critical path tests: Critical path tests must run to validate core functionality
  • Slow test monitoring: Use --durations=20 output to identify and optimize slow tests
  • Network isolation: Enforced via pytest-socket
  • Re-runs: Handles flaky tests automatically
  • ML dependencies: Installs ML dependencies (required for real Whisper in integration tests)
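
Combining the features above, the invocation is roughly as follows (a sketch, assuming the `critical_path` marker described later in this document; the flags require pytest-xdist, pytest-rerunfailures, and pytest-socket):

```yaml
- name: Run critical path integration tests
  run: |
    pytest tests/integration/ -m critical_path -n auto \
      --reruns 2 --reruns-delay 1 --durations=20 \
      --disable-socket --allow-hosts=127.0.0.1,localhost
```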

4. test-e2e-fast Job (Runs on PRs only, with exceptions)

Purpose: Run critical path E2E tests for fast PR feedback

When: Pull requests only (not on push to main; skipped for docs-only PRs)

Conditional Execution:

```yaml
if: |
  github.event_name == 'pull_request' &&
  github.event.pull_request.head.ref != 'fix/ai-guidelines-linting' &&
  !contains(github.event.pull_request.head.ref, 'docs/')
```

  • Network guard: --disable-socket --allow-hosts=127.0.0.1,localhost
  • Re-runs enabled: 2 retries with 1s delay for flaky tests
  • Parallel execution: Uses -n auto for speed
  • Duration monitoring: --durations=20 shows the 20 slowest tests

Key Features:

  • Fast feedback: Critical path tests only (not all E2E tests)
  • Includes slow critical path tests: Critical path tests must run to validate core functionality
  • Slow test monitoring: Use --durations=20 output to identify and optimize slow tests
  • Network isolation: Enforced via pytest-socket
  • Re-runs: Handles flaky tests automatically
  • ML dependencies: Installs ML dependencies (required for real Whisper in E2E tests)
  • Real ML models: Uses real Whisper, spaCy, and Transformers models (no mocks)

Main Branch Only Jobs

5. preload-ml-models Job (Main Branch Only)

Purpose: Preload ML models to ensure they are cached for integration and E2E tests

When: Push to main branch only

Conditional Execution:

```yaml
if: github.event_name == 'push' && github.ref == 'refs/heads/main'
```

Variants:

  • make preload-ml-models - Test models (Whisper tiny, small BART, en_core_web_sm)
  • make preload-ml-models-production - Production models (Whisper base, BART-large-cnn, LED-large-16384)

Key Features:

  • Dependency for full test jobs: test-integration and test-e2e depend on this job
  • ML model caching: Ensures models are available before test jobs run
  • Efficiency: Models are cached, so subsequent jobs can use them immediately
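
A sketch of how the preload job could cache models between runs, assuming actions/cache and the cache directories mentioned in the disk-cleanup step later in this document (the actual job may differ):

```yaml
- name: Cache ML models
  uses: actions/cache@v4
  with:
    path: |
      ~/.cache/huggingface
      ~/.cache/torch
      ~/.cache/whisper
    key: ml-models-${{ hashFiles('pyproject.toml') }}

- name: Preload ML models (test variants)
  run: make preload-ml-models
```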

6. test-integration-slow Job (Main Branch Only)

Purpose: Run slow integration tests (includes slow/ml_models tests)

When: Push to main branch only

Conditional Execution:

```yaml
if: github.event_name == 'push' && github.ref == 'refs/heads/main'
needs: [preload-ml-models]
```

  • ML dependencies: Required (includes ML packages)

Key Features:

  • Slow tests only: Focuses on tests that require ML models or are slow
  • Re-runs: Handles flaky tests automatically
  • ML model caching: Uses models preloaded by preload-ml-models job
  • Test count verification: Ensures at least 10 slow integration tests run

7. test-e2e Job (Main Branch Only)

Purpose: Run all E2E tests (full suite)

When: Push to main branch only

Conditional Execution:

```yaml
if: github.event_name == 'push' && github.ref == 'refs/heads/main'
needs: [preload-ml-models]
```

Key Features:

  • Complete validation: All E2E tests run (critical path + non-critical)
  • Network isolation: Enforced via pytest-socket
  • Re-runs: Handles flaky tests automatically
  • ML model caching: Uses models preloaded by preload-ml-models job
  • Test count verification: Ensures at least 50 E2E tests run

8. Docs Build Job (Always Runs)

Purpose: Validate documentation builds correctly

When: Both PRs and push to main

Duration: ~2-3 minutes

Steps:

  1. Checkout code
  2. Set up Python 3.11 with pip caching (docs + pyproject.toml)
  3. Install documentation dependencies + ML packages (needed for mkdocstrings API docs)
  4. Build documentation: make docs
     - Runs mkdocs build --strict
     - Generates API documentation from docstrings
     - Validates all internal links

Note: This job runs in the main workflow to validate docs on every PR, even when the PR doesn't touch doc files.


9. Build Package Job (Always Runs)

Purpose: Validate the package can be built for distribution

When: Both PRs and push to main

Duration: ~1-2 minutes

Steps:

  1. Checkout code
  2. Set up Python 3.11 with pip caching
  3. Install build tools (pip install build)
  4. Build package: make build
     - Creates source distribution (.tar.gz)
     - Creates wheel distribution (.whl)
     - Validates pyproject.toml configuration
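
In workflow terms this amounts to little more than the following (a sketch; `make build` wrapping `python -m build` is inferred from the dependency diagram later in this document):

```yaml
- name: Build package
  run: |
    pip install build
    make build   # produces dist/*.tar.gz and dist/*.whl via python -m build
```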

10. coverage-unified Job (Main Branch Only)

Purpose: Combine coverage from all test jobs and publish to GitHub Pages

When: Push to main branch only (after test-unit completes)

Duration: ~3-5 minutes

Steps:

  1. Download coverage artifacts from all test jobs (unit, integration, e2e)
  2. Combine coverage reports into unified report
  3. Generate unified coverage summary in job summary
  4. Generate code quality metrics (complexity, maintainability, docstrings)
  5. Collect pipeline performance metrics (if ML dependencies available)
  6. Generate metrics JSON (metrics/latest.json)
  7. Update metrics history (metrics/history.jsonl)
  8. Generate HTML dashboard (metrics/index.html)
  9. Upload unified coverage to Codecov
  10. Publish metrics to GitHub Pages

Outputs:

  • reports/coverage-unified.xml - Combined coverage report
  • metrics/latest.json - Current metrics for dashboard
  • metrics/history.jsonl - Historical trend data
  • metrics/index.html - Interactive dashboard

11. deploy-metrics Job (Main Branch Only)

Purpose: Deploy metrics dashboard to GitHub Pages

When: After coverage-unified completes (even if it fails)

Steps:

  1. Deploy metrics to GitHub Pages using actions/deploy-pages@v4

Result: Dashboard accessible at https://[username].github.io/podcast_scraper/metrics/
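
A sketch of the deploy job's shape, assuming the standard permissions actions/deploy-pages@v4 requires (the `if: always()` mirrors the "even if it fails" behavior described above):

```yaml
deploy-metrics:
  needs: [coverage-unified]
  if: always()              # deploy even if coverage-unified failed
  permissions:
    pages: write
    id-token: write
  environment:
    name: github-pages
  runs-on: ubuntu-latest
  steps:
    - uses: actions/deploy-pages@v4
```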


Additional Jobs (Conditional)

12. docker-build Job (Runs if Docker files changed)

Purpose: Build and test Docker image

Conditional Execution:

  • Only runs if PR modifies: Dockerfile, .dockerignore, pyproject.toml, or *.py

Duration: ~5-8 minutes

Steps:

  1. Checkout code
  2. Free disk space
  3. Set up Docker Buildx
  4. Build Docker image (default)
  5. Build Docker image (multiple models variant)
  6. Test Docker image (help command)
  7. Test Docker image (version check)
  8. Test Docker image (error handling)
  9. Validate Dockerfile with hadolint

Runs in parallel with: All other jobs

13. snyk-dependencies Job (Runs if code/Docker files changed)

Purpose: Security scan of Python dependencies

Conditional Execution:

  • Only runs if PR modifies: **.py, pyproject.toml, Dockerfile, or .dockerignore

Duration: ~3-5 minutes

Steps:

  1. Checkout code
  2. Set up Python 3.11
  3. Install dependencies (including ML)
  4. Run Snyk scan (Python dependencies)
  5. Upload results to GitHub Code Scanning

Note: continue-on-error: true (doesn't fail PR)

14. snyk-docker Job (Runs if code/Docker files changed)

Purpose: Security scan of Docker image

Conditional Execution:

  • Only runs if PR modifies: **.py, pyproject.toml, Dockerfile, or .dockerignore
  • Skips on scheduled runs: if: github.event_name != 'schedule'

Duration: ~8-12 minutes

Steps:

  1. Checkout code
  2. Free disk space
  3. Set up Docker Buildx
  4. Build Docker image
  5. Verify Docker image exists
  6. Run Snyk scan (Docker image)
  7. Upload results to GitHub Code Scanning

Note: continue-on-error: true (doesn't fail PR)

15. snyk-monitor Job (Runs on PRs and scheduled runs)

Purpose: Monitor dependencies for security updates

Conditional Execution:

```yaml
if: github.event_name == 'pull_request' || github.event_name == 'schedule'
```

16. analyze Job (Runs if Python/workflow files changed)

Purpose: CodeQL security analysis

Conditional Execution:

  • Only runs if PR modifies: **.py or .github/workflows/**

Duration: ~10-15 minutes

Steps:

  1. Checkout repository
  2. Initialize CodeQL (for Python and Actions)
  3. Perform CodeQL analysis

Runs in parallel with: All other jobs

Critical Path Testing Strategy

On pull requests, multiple test jobs run in parallel to provide fast feedback:

Critical Path Jobs (Run in Parallel)

Purpose: Quick pass/fail signal on critical path tests

Jobs:

  • test-unit - All unit tests (~2-5 min)
  • test-integration-fast - Critical path integration tests only (~5-8 min)
  • test-e2e-fast - Critical path E2E tests only (~8-12 min)

Features:

  • Run in parallel (no waiting)
  • Use critical_path marker to filter tests
  • Network isolation enforced
  • Re-runs enabled for flaky test handling
  • No coverage overhead (faster execution)

When to check: After ~8-12 minutes for early feedback

Why This Design?

  1. Fast feedback: Critical path jobs provide early signal if something is broken
  2. Parallel execution: All critical path jobs run simultaneously, no sequential waiting
  3. Simplicity: No separate coverage job - each test job handles its own validation
  4. Efficiency: Critical path tests focus on what matters most for fast feedback

What About Non-Critical Tests?

Non-critical path integration and E2E tests are excluded from PRs for faster feedback. They run on the main branch only:

  • All integration tests: Run in test-integration job on main branch (after preload-ml-models)
  • All E2E tests: Run in test-e2e job on main branch (after preload-ml-models)

This ensures:

  • ✅ Fast PR feedback (critical path tests only)
  • ✅ Full validation on main branch (all tests run)
  • ✅ Balance between speed and coverage

Jobs That DO NOT Run on PRs

preload-ml-models Job:

  • Condition: if: github.event_name == 'push' && github.ref == 'refs/heads/main'
  • Only runs on: Push to main branch
  • Does NOT run on: Pull requests
  • Purpose: Preload ML models for full test suite jobs

test-integration Job:

  • Condition: if: github.event_name == 'push' && github.ref == 'refs/heads/main'
  • Only runs on: Push to main branch
  • Does NOT run on: Pull requests
  • Runs: All integration tests (full suite)
  • Dependencies: needs: [preload-ml-models]

test-e2e Job:

  • Condition: if: github.event_name == 'push' && github.ref == 'refs/heads/main'
  • Only runs on: Push to main branch
  • Does NOT run on: Pull requests
  • Runs: All E2E tests (full suite)
  • Dependencies: needs: [preload-ml-models]

When Do Slow Jobs Run? (Important Clarification)

"Push to main" means code is actually committed to the main branch. This happens in two scenarios:

  1. PR is merged → Code is pushed to main → Slow jobs run
  2. Direct push to main (bypassing PR) → Slow jobs run

Timeline: PR Creation → Merge → Slow Jobs

```text
┌─────────────────────────────────────────────────────────────┐
│ PHASE 1: During PR Review                                    │
├─────────────────────────────────────────────────────────────┤
│ ✅ Fast jobs run: lint, test-unit, test-integration-fast,    │
│    test-e2e-fast, docs, build                                │
│                                                              │
│ ❌ Full suite jobs DO NOT run:                               │
│   - preload-ml-models                                        │
│   - test-integration                                         │
│   - test-e2e                                                 │
│                                                              │
│ Purpose: Fast feedback during code review                    │
└─────────────────────────────────────────────────────────────┘
                          ↓
                    PR is Merged
                          ↓
┌─────────────────────────────────────────────────────────────┐
│ PHASE 2: After Merge (Push to Main)                          │
├─────────────────────────────────────────────────────────────┤
│ ✅ Fast jobs run again (lint, test-unit, docs, build)        │
│                                                              │
│ ✅ Full suite jobs NOW run:                                  │
│   - preload-ml-models (preloads ML models)                   │
│   - test-integration (all integration tests)                 │
│   - test-e2e (all E2E tests)                                 │
│                                                              │
│ Purpose: Full test suite validation after merge              │
└─────────────────────────────────────────────────────────────┘
```

If the full suite fails after merge:

  • The main branch will show a failed status
  • You'll get a notification
  • You can either:
    - Fix the issue in a new PR
    - Revert the merge if it's critical

Note: Slow test failures don't prevent the merge (they run after merge), but they do alert you to issues that need attention.

Comparison: PR vs Push to Main

| Job | During PR Review | After Merge (Push to Main) |
|---|---|---|
| lint | ✅ | ✅ |
| test-unit | ✅ | ✅ |
| test-integration-fast | ✅ (with exceptions) | Does NOT run |
| preload-ml-models | Does NOT run | Runs after merge |
| test-integration | Does NOT run | Runs after merge |
| test-e2e-fast | ✅ (with exceptions) | Does NOT run |
| test-e2e | Does NOT run | Runs after merge |
| docs | ✅ | ✅ |
| build | ✅ | ✅ |
| docker-build | ✅ (if Dockerfile/.dockerignore change) | ✅ (all changes) |
| snyk-* | ✅ (if files match) | ✅ (if files match) |
| analyze | ✅ (if files match) | ✅ (if files match) |

Key Difference:

  • PR Review: Fast critical path tests only (quick feedback, ~10-15 min total)
  • After Merge: Full test suite only (complete validation, no redundant fast tests)

PR Execution Timeline

Typical PR Flow (All Jobs Run):

```text
Time 0 - PR opened or updated
├─ All jobs start in parallel:
│  ├─ lint (1-2 min)
│  ├─ test-unit (2-5 min)
│  ├─ test-integration-fast (5-8 min)
│  ├─ test-e2e-fast (8-12 min)
│  ├─ docs (2-3 min)
│  ├─ build (1-2 min)
│  ├─ docker-build (5-8 min) [if Docker files changed]
│  ├─ snyk-dependencies (3-5 min) [if code changed]
│  ├─ snyk-docker (8-12 min) [if Docker files changed]
│  ├─ snyk-monitor (2-3 min)
│  └─ analyze (10-15 min) [if Python/workflow files changed]
│
Time ~8-12 min - All critical path jobs complete
└─ PR status updated

Time ~15-20 min - All jobs complete (if no failures)
└─ PR status updated
```

  • Total time: ~20-25 minutes when CodeQL runs (analyze is the bottleneck)

PR Status Checks

The PR will show a status check for each job that runs. All checks must pass for the PR to be mergeable (unless branch protection rules allow otherwise).

Required Checks (Always Run):

  • ✅ lint
  • ✅ test-unit
  • ✅ test-integration-fast (PRs only, unless docs-only PR)
  • ✅ test-e2e-fast (PRs only, unless docs-only PR)
  • ✅ docs
  • ✅ build
  • ✅ snyk-monitor

Optional Checks (Run if files match):

  • ⚠️ docker-build (PRs: only if Dockerfile/.dockerignore changed)
  • ⚠️ snyk-dependencies (if code changed)
  • ⚠️ snyk-docker (if Docker files changed)
  • ⚠️ analyze (if Python/workflow files changed)

Note: Snyk jobs have continue-on-error: true, so they won't block PRs even if they fail.

Re-runs for Flaky Tests

Some test jobs include automatic re-runs to handle flaky tests:

Configuration:

  • --reruns 2: Retry failed tests up to 2 times (3 total attempts)
  • --reruns-delay 1: Wait 1 second between retries
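
These flags come from the pytest-rerunfailures plugin; a minimal sketch of their use in a test step:

```yaml
- name: Run integration tests with automatic re-runs
  run: |
    pytest tests/integration/ -n auto \
      --reruns 2 --reruns-delay 1
```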

Which jobs use re-runs:

  • test-integration-fast (PRs only): Critical path integration tests
  • test-integration (main branch): All integration tests
  • test-e2e-fast (PRs only): Critical path E2E tests
  • test-e2e (main branch): All E2E tests
  • test-unit (always): No re-runs (unit tests should be stable)
  • preload-ml-models (main branch): No re-runs (setup job)

How it works:

  1. Test runs and fails
  2. Wait 1 second
  3. Retry test (attempt 1)
  4. If still fails, wait 1 second
  5. Retry test (attempt 2)
  6. If all attempts fail, mark as FAILED

Why re-runs?

  • Integration and E2E tests can be flaky due to timing, I/O, or resource contention
  • Re-runs reduce false negatives from transient failures
  • Improves CI reliability without masking real issues

Dependency Management

```mermaid
graph TD
    A[pyproject.toml] --> B{Job Type}

    B -->|Lint Job| C[dev dependencies only]
    C --> C1[black]
    C --> C2[flake8]
    C --> C3[mypy]
    C --> C4[bandit]

    B -->|test-unit Job| C2a[dev dependencies only]
    C2a --> C2A[pytest]
    C2a --> C2B[No ML deps - fast]
    C2a --> C2C[Import check script]

    B -->|test-integration-fast Job| E2[dev + pytest-socket]
    E2 --> E2A[pytest]
    E2 --> E2B[pytest-socket for network guard]
    E2 --> E2C[No ML deps - critical path only]

    B -->|test-e2e-fast Job| E5[dev + pytest-socket]
    E5 --> E5A[pytest]
    E5 --> E5B[pytest-socket for network guard]
    E5 --> E5C[No ML deps - critical path only]

    B -->|preload-ml-models Job Main| F[dev + ml dependencies]
    F --> F1a[make preload-ml-models]
    F --> F2a[Cache ML models]

    B -->|test-integration Job Main| D[dev + ml dependencies]
    D --> D1[pytest]
    D --> D2[transformers]
    D --> D3[torch]
    D --> D4[whisper]
    D --> D5[spacy]
    D --> D6[Uses preloaded models]

    B -->|test-e2e Job Main| E3[dev + ml dependencies + pytest-socket]
    E3 --> E3A[pytest]
    E3 --> E3B[pytest-socket for network guard]
    E3 --> E3C[transformers]
    E3 --> E3D[torch]
    E3 --> E3E[whisper]
    E3 --> E3F[spacy]

    B -->|Docs Job| E[docs + ml dependencies]
    E --> E1[mkdocs-material]
    E --> E2D[mkdocstrings]
    E --> E3G[ML packages for API docs]

    B -->|Build Job| Fb[build tools only]
    Fb --> F1b[python -m build]
```

Documentation Deployment Workflow

File: .github/workflows/docs.yml

Triggers:

  • Push to main branch (build + deploy)
  • Pull requests to main with doc changes (build only)
  • Manual dispatch (workflow_dispatch)

Path Filters:

  • docs/** - Documentation files
  • mkdocs.yml - MkDocs configuration
  • **.py - Python files (needed for API documentation)
  • README.md - Main readme file
  • .github/workflows/docs.yml - Workflow itself

Sequential Pipeline (Build → Deploy)

Unlike the Python app workflow, this has a sequential dependency:

```mermaid
graph LR
    A[Trigger] --> B[Build Job]
    B -->|Generate Site| C[Upload Artifact]
    C -->|Only on push to main| D[Deploy Job]
    D -->|Deploy| E[GitHub Pages]

    style B fill:#87CEEB
    style D fill:#90EE90
```

1. Build Job

Purpose: Build MkDocs site from documentation sources

Runs: On all triggers (push, PR, manual)

Steps:

  1. Check out repository
  2. Set up Python 3.11 with pip caching
  3. Install documentation dependencies
  4. Build MkDocs site with strict mode (mkdocs build --strict)
  5. Upload artifact (only on push to main)

Output: .build/site/ directory containing the static website

2. Deploy Job

Purpose: Deploy built site to GitHub Pages

Runs: Only on push to main (conditional)

Depends on: Build job must succeed

Steps:

  1. Deploy to GitHub Pages using the uploaded artifact

Environment:

  • Name: github-pages
  • URL: Deployment URL exposed as output

Concurrency Control

```yaml
concurrency:
  group: "pages"
  cancel-in-progress: true
```

CodeQL Security Workflow

File: .github/workflows/codeql.yml

Triggers:

  • Push to main branch (only when code or workflow files change)
  • Pull Requests to main branch (only when code or workflow files change)
  • Scheduled: Every Thursday at 13:17 UTC (weekly security scan)

Path Filters:

  • **.py - All Python source files
  • .github/workflows/** - GitHub Actions workflow files

Skips when: Only documentation or non-code files change

Matrix Strategy (Parallel Language Analysis)

CodeQL analyzes multiple languages in parallel using a matrix:

```mermaid
graph TB
    A[CodeQL Workflow] --> B{Matrix Strategy}

    B --> C[Python Analysis]
    B --> D[GitHub Actions Analysis]

    C --> C1[Initialize CodeQL]
    C --> C2[Analyze Python Code]
    C --> C3[Upload Results]

    D --> D1[Initialize CodeQL]
    D --> D2[Analyze Actions YAML]
    D --> D3[Upload Results]

    style C fill:#FFE4B5
    style D fill:#FFE4B5
```

| Language | Build Mode | Purpose |
|---|---|---|
| Python | none | Analyze application code for security vulnerabilities |
| GitHub Actions | none | Analyze workflow YAML files for security issues |

Build Mode: none means no compilation required (interpreted languages)
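
A sketch of this matrix in workflow syntax (action versions are assumptions; `python` and `actions` are the CodeQL language identifiers):

```yaml
strategy:
  matrix:
    include:
      - language: python
        build-mode: none
      - language: actions
        build-mode: none
steps:
  - uses: actions/checkout@v4
  - uses: github/codeql-action/init@v3
    with:
      languages: ${{ matrix.language }}
      build-mode: ${{ matrix.build-mode }}
  - uses: github/codeql-action/analyze@v3
```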

CodeQL Job Details

Per-language steps:

  1. Checkout repository
  2. Initialize CodeQL tools
  3. Scan codebase for security vulnerabilities
  4. Upload results to GitHub Security tab

Scan Categories:

  • SQL injection
  • Cross-site scripting (XSS)
  • Path traversal
  • Code injection
  • Hardcoded secrets
  • Unsafe deserialization
  • And more...

Schedule Details

```yaml
schedule:
  - cron: '17 13 * * 4'
```

Runs: Every Thursday at 13:17 UTC

Purpose: Catch newly discovered vulnerabilities in dependencies


Docker Build & Test Workflow

File: .github/workflows/docker.yml

Triggers:

  • Push to main: All changes (Dockerfile, .dockerignore, pyproject.toml, *.py)
  • Pull requests: Only when Dockerfile or .dockerignore change

Path Filters:

  • Push to main:
    - Dockerfile, .dockerignore - Docker-related files
    - pyproject.toml - Project configuration
    - *.py - Python source files
  • Pull requests:
    - Dockerfile - Main Dockerfile
    - .dockerignore - Docker ignore file

Optimization: Docker build no longer runs on PRs for Python-only changes, saving ~10 minutes per PR. Full Docker validation still runs on merge to main.

Optimization Implementation

The Docker workflow uses a hybrid optimization approach to balance fast PR feedback with comprehensive validation:

PR Triggers (Conditional):

  • Docker build runs on PRs only when Dockerfile or .dockerignore change
  • Python code changes (pyproject.toml, *.py) no longer trigger Docker build on PRs
  • Result: ~10 minutes saved per PR for Python-only changes

Lightweight Build on PRs:

  • When PRs do trigger (Dockerfile changes), they use a lightweight build:
    - Build only the default image (skip the multi-model build)
    - Skip model preloading (PRELOAD_ML_MODELS=false)
  • Result: ~3-5 minutes per PR (vs 10+ minutes for a full build)

Optimized Full Build on Main:

  • Push to main triggers an optimized full build:
    - Parallel builds: Both images (default + multi-model) build simultaneously using a matrix strategy
    - Scoped caching: Separate cache scopes for each image type for better cache hits
    - BuildKit cache mounts: Already implemented in the Dockerfile for pip and model caches
  • Result: ~5 minutes saved on main (parallel execution vs sequential)

Performance Improvements:

  • PRs: ~10 minutes saved (no build for Python-only changes) + ~5-7 minutes saved (lightweight build when triggered)
  • Main: ~5 minutes saved (parallel builds) + faster rebuilds (better caching)

Workflow Purpose

Validates that Docker images can be built correctly and pass basic smoke tests. Ensures Docker configuration remains functional as code evolves.

Docker Job Details

Docker Build Jobs

Purpose: Build Docker images and run smoke tests

Duration:

  • PR builds (fast): ~3-5 minutes (lightweight, single image, no model preloading)
  • Main builds (full): LLM-only ~5-10 minutes; ML variant with production preload is much longer (large downloads; parallel matrix)

PR Build (docker-build-fast):

  • Runs only when Dockerfile or .dockerignore change
  • Builds only the LLM-only image (INSTALL_EXTRAS=, no ML deps)
  • Fast feedback for Dockerfile changes

Main Build (docker-build-full):

  • Runs on push to main branch
  • Uses a matrix strategy for parallel builds:
    - LLM-only: INSTALL_EXTRAS= (no ML deps)
    - ML variant: INSTALL_EXTRAS=ml, PRELOAD_ML_MODELS=true (production preload bundle)
  • Scoped caching per image type for better cache hits
  • Full validation with all models
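
A sketch of the matrix-based full build, assuming docker/build-push-action and the build args named above (variant names and exact inputs are illustrative):

```yaml
docker-build-full:
  if: github.event_name == 'push' && github.ref == 'refs/heads/main'
  runs-on: ubuntu-latest
  strategy:
    matrix:
      include:
        - variant: llm-only
          install_extras: ""
          preload: "false"
        - variant: ml
          install_extras: "ml"
          preload: "true"
  steps:
    - uses: actions/checkout@v4
    - uses: docker/setup-buildx-action@v3
    - uses: docker/build-push-action@v6
      with:
        context: .
        push: false
        build-args: |
          INSTALL_EXTRAS=${{ matrix.install_extras }}
          PRELOAD_ML_MODELS=${{ matrix.preload }}
        cache-from: type=gha,scope=docker-${{ matrix.variant }}
        cache-to: type=gha,mode=max,scope=docker-${{ matrix.variant }}
```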

Steps:

  1. Checkout code
  2. Free disk space (removes unnecessary system packages)
  3. Set up Docker Buildx
  4. Build Docker image(s):
     - Uses Dockerfile
     - Caches layers using GitHub Actions cache
     - BuildKit cache mounts for pip (model files are baked into the image layer)
     - Tags: podcast-scraper:test (default) or podcast-scraper:multi-model
  5. Test Docker images:
     - Test --help command
     - Test --version command
     - Test error handling (no args should show an error)
  6. Validate Dockerfile with hadolint:
     - Lints Dockerfile for best practices
     - Catches common Docker anti-patterns

Docker Image Tests:

  • ✅ Help command works
  • ✅ Version command works
  • ✅ Error handling works (no args shows error)
  • ✅ Multiple model preloading works

Dockerfile Validation:

  • ✅ Hadolint linting passes
  • ✅ Best practices enforced

Snyk Security Scan Workflow

File: .github/workflows/snyk.yml

Triggers:

  • Push to main branch (only when code/Docker files change)
  • Pull Requests to main branch (only when code/Docker files change)
  • Scheduled: Every Monday at 00:00 UTC (weekly security scan)
  • Manual dispatch (workflow_dispatch)

Path Filters:

  • **.py - Python source files
  • pyproject.toml - Project configuration
  • Dockerfile, .dockerignore - Docker files

Skips when: Only documentation or unrelated files change

Snyk Workflow Purpose

Provides comprehensive security scanning for both Python dependencies and Docker images using Snyk's vulnerability database.

Snyk Job Details

1. Snyk Dependencies Scan

Purpose: Scan Python dependencies for known vulnerabilities

Runs: On all triggers (push, PR, schedule, manual)

Steps:

  1. Checkout code
  2. Set up Python 3.11 with pip caching
  3. Install dependencies (pip install -e .[dev,ml])
  4. Run Snyk scan on Python dependencies:
     - Scans installed packages from pyproject.toml
     - Severity threshold: high (only reports high/critical issues)
     - Generates SARIF file for GitHub Code Scanning integration
  5. Upload results to GitHub Code Scanning:
     - Results appear in the Security tab
     - Integrated with GitHub's security dashboard

Output: SARIF file uploaded to GitHub Code Scanning

2. Snyk Docker Scan

Purpose: Scan Docker image for vulnerabilities

Runs: On push/PR (not on schedule)

Steps:

  1. Checkout code
  2. Free disk space
  3. Set up Docker Buildx
  4. Build Docker image:
     - Tags as podcast-scraper:snyk-scan
     - Uses GitHub Actions cache
  5. Verify Docker image exists
  6. Run Snyk scan on Docker image:
     - Scans built image for vulnerabilities
     - Severity threshold: high
     - Generates SARIF file
  7. Upload results to GitHub Code Scanning

Output: SARIF file uploaded to GitHub Code Scanning

3. Snyk Monitor

Purpose: Monitor dependencies for ongoing security tracking

Runs: On PRs and scheduled runs (not on push)

Steps:

  1. Checkout code
  2. Set up Python 3.11 with pip caching
  3. Install dependencies
  4. Run Snyk monitor:
     - Tracks dependencies in the Snyk dashboard
     - Enables ongoing monitoring and alerts

Purpose: Provides long-term security monitoring and alerts for new vulnerabilities

Snyk Schedule Details

```yaml
schedule:
  - cron: '0 0 * * 1'
```

Runs: Every Monday at 00:00 UTC

Purpose: Weekly security scan to catch newly discovered vulnerabilities

Integration with GitHub Security

  • Results uploaded to GitHub Code Scanning
  • Appears in Security tab
  • Integrated with Dependabot alerts
  • SARIF format for standardized reporting

Dependabot Automated Dependency Updates

File: .github/dependabot.yml

Purpose: Automatically create pull requests to update dependencies, keeping the project secure and current.

Dependabot Overview

Dependabot automatically monitors dependencies and creates PRs for updates. Unlike Snyk and pip-audit which alert about vulnerabilities, Dependabot creates PRs to update dependencies automatically.

Key Difference

| Tool | Function |
|---|---|
| Snyk/pip-audit | Alerts about vulnerable dependencies |
| Dependabot | Creates PRs to update dependencies automatically |

Configuration

Dependabot is configured to monitor three package ecosystems:

  1. Python dependencies (pip) - Weekly updates on Mondays
  2. GitHub Actions - Weekly updates on Mondays
  3. Docker - Monthly updates

Python Dependencies Configuration

Schedule:

  • Interval: Weekly
  • Day: Monday
  • PR Limit: 5 open PRs maximum

Grouping Strategy:

  • dev-dependencies group: Development tools (pytest, black, isort, flake8, mypy, bandit, radon, vulture, interrogate, codespell)
    - Updates: Minor and patch versions
    - Rationale: Dev tools can be updated together safely
  • ml-dependencies group: ML libraries (torch, transformers, openai-whisper, spacy)
    - Updates: Patch versions only
    - Rationale: ML libraries have frequent breaking changes, so only patches are automated

Labels:

  • dependencies
  • automated

Commit Message:

  • Prefix: deps
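
A sketch of the pip section of .github/dependabot.yml matching the settings above (the group patterns are illustrative):

```yaml
version: 2
updates:
  - package-ecosystem: "pip"
    directory: "/"
    schedule:
      interval: "weekly"
      day: "monday"
    open-pull-requests-limit: 5
    groups:
      dev-dependencies:
        patterns: ["pytest*", "black", "isort", "flake8", "mypy", "bandit"]
        update-types: ["minor", "patch"]
      ml-dependencies:
        patterns: ["torch", "transformers", "openai-whisper", "spacy"]
        update-types: ["patch"]
    labels: ["dependencies", "automated"]
    commit-message:
      prefix: "deps"
```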

GitHub Actions Configuration

Schedule:

  • Interval: Weekly
  • Day: Monday
  • PR Limit: 3 open PRs maximum

Labels:

  • dependencies
  • ci/cd
  • automated

Commit Message:

  • Prefix: ci

Docker Configuration

Schedule:

  • Interval: Monthly
  • PR Limit: No limit (monthly schedule naturally limits PRs)

Labels:

  • dependencies
  • docker
  • automated

Configuration Decisions

| Decision | Rationale |
|---|---|
| Weekly schedule (Monday) | Aligns with Snyk schedule, gives a week to review |
| 5 PR limit for pip | Avoids PR flood, prioritizes important updates |
| Grouped updates | Reduces noise (dev tools together, ML libs together) |
| Patch-only for ML | ML libraries have frequent breaking changes |
| GitHub Actions updates | Keeps CI actions current and secure |

How It Works

  1. Dependabot checks for updates according to the schedule
  2. Creates PRs for available updates (respecting PR limits)
  3. Groups related dependencies to reduce PR noise
  4. Labels PRs with dependencies and automated for easy filtering
  5. Assigns reviewer (chipi) for review

Reviewing Dependabot PRs

What to check:

  • ✅ CI passes (all tests, linting, security scans)
  • ✅ No breaking changes (check changelog/release notes)
  • ✅ Compatibility with current codebase
  • ✅ For ML dependencies: Test with actual models if patch update

When to merge:

  • ✅ CI passes
  • ✅ No obvious breaking changes
  • ✅ PR description looks safe

When to close:

  • ❌ Breaking changes that require code updates
  • ❌ Known compatibility issues
  • ❌ Update conflicts with other work

Integration with Security Tools

Dependabot works alongside existing security tools:

  • Snyk: Alerts about vulnerabilities → Dependabot creates PRs to fix them
  • pip-audit: Identifies vulnerable packages → Dependabot can update them
  • CodeQL: Security scanning → Dependabot keeps dependencies current

Workflow:

  1. Snyk/pip-audit identifies vulnerable dependency
  2. Dependabot creates PR to update to secure version
  3. CI validates the update
  4. Merge PR to resolve vulnerability

Accessing Dependabot

  • PRs: View all Dependabot PRs via Dependabot updates
  • Alerts: Security alerts appear in Security tab
  • Configuration: .github/dependabot.yml

Parallel Execution Summary

Workflow Independence

All workflows run independently and in parallel when triggered:

  • Python Application Workflow (PRs: 6 parallel jobs - lint, unit, fast integration, fast E2E, docs, build; Main: lint, unit, preload-ml-models, full integration, full E2E, docs, build, plus coverage-unified and deploy-metrics)
  • Documentation Deployment Workflow (sequential: build → deploy)
  • CodeQL Workflow (matrix: Python + Actions analysis in parallel)
  • Docker Workflow (single job: build → test → validate)
  • Snyk Workflow (3 jobs: dependencies scan, Docker scan, monitor - run based on trigger type)

Parallel Workflow Execution

Each workflow is independent and can run simultaneously:

  • Python app workflow doesn't wait for Docker
  • Docker workflow doesn't wait for Snyk
  • CodeQL runs independently of other workflows
  • Documentation workflow runs independently

This maximizes parallelism and reduces total CI time.

Parallel Execution Details

✅ Completely Parallel

Within Python Application Workflow - Pull Requests:

```text
├── Lint Job (1-2 min)
├── test-unit (2-5 min) - All unit tests
├── test-integration-fast (5-8 min) - Critical path only
├── test-e2e-fast (8-12 min) - Critical path only
├── Docs Job (2-3 min)
└── Build Job (1-2 min)
```

Within Python Application Workflow - Push to Main:

```text
├── Lint Job (1-2 min)
├── test-unit Job (2-5 min) - No ML deps, fast
├── preload-ml-models (2-5 min) - Preloads ML models for full suite
├── test-integration (5-10 min) - Full suite, includes re-runs, ML deps, needs preload-ml-models
├── test-e2e (20-30 min) - Full suite, includes re-runs, ML deps, network guard, needs preload-ml-models
├── Docs Job (2-3 min)
└── Build Job (1-2 min)
```

Within CodeQL Workflow (matrix):

```text
├── Python Analysis
└── Actions Analysis
```

  • All three workflows (Python app, docs, CodeQL) trigger independently
  • They run in parallel when triggered by the same event

❌ Sequential

Documentation Workflow:

Build Job → Deploy Job

Performance Optimizations

1. Pip Caching

All workflows use pip caching to speed up dependency installation:

```yaml
- uses: actions/setup-python@v5
  with:
    python-version: "3.11"
    cache: "pip"
    cache-dependency-path: pyproject.toml
```

2. Selective Dependency Installation

Each job installs only the dependency groups it needs:

```mermaid
graph TD
    A[Dependency Strategy] --> B[Lint: dev only]
    A --> C[Test: dev + ml]
    A --> D[Docs: docs + ml]
    A --> E[Build: build tools only]

    B --> F[Fast: 2-3 min]
    C --> G[Slow: 10-15 min]
    D --> H[Medium: 3-5 min]
    E --> I[Fast: 2-3 min]
```

3. Disk Space Management

The test job proactively frees ~30GB of disk space before installing ML dependencies:

```bash
sudo rm -rf /usr/share/dotnet
sudo rm -rf /usr/local/lib/android
sudo rm -rf /opt/ghc
# ... more cleanup
rm -rf ~/.cache/huggingface
rm -rf ~/.cache/torch
rm -rf ~/.cache/whisper
```

Workflow Triggers Matrix

| Workflow | Push to main | PR to main | Schedule | Manual | Doc Changes | Code Changes |
|---|---|---|---|---|---|---|
| Python Application | ✅ (code only) | ✅ (code only) | ❌ | ❌ | ❌ | ✅ |
| Documentation Deploy | ✅ (deploy) | ✅ (build only) | ❌ | ✅ | ✅ | ✅ (API docs) |
| CodeQL Security | ✅ (code only) | ✅ (code only) | ✅ Weekly | ❌ | ❌ | ✅ |

Path-Based Optimization Strategy

Strategy Overview

All workflows implement intelligent path-based filtering to ensure CI/CD runs only when necessary. This optimization dramatically reduces unnecessary CI runs, saves compute resources, and provides faster feedback.

Decision Matrix

When you change files, here's what runs:

| Files Changed | Python App | Docs Deploy | CodeQL | Reasoning |
|---|---|---|---|---|
| Only docs/ | ❌ Skip | ✅ Run | ❌ Skip | Docs changes don't require code validation |
| Only .py files | ✅ Run | ✅ Run | ✅ Run | Code changes need full validation + API docs rebuild |
| Only README.md | ❌ Skip | ✅ Run | ❌ Skip | README is included in docs site |
| pyproject.toml | ✅ Run | ❌ Skip | ❌ Skip | Config changes affect dependencies/build |
| Dockerfile | ✅ Run | ❌ Skip | ❌ Skip | Docker builds depend on package validation |
| .github/workflows/ | ✅ (if python-app.yml) | ✅ (if docs.yml) | ✅ Run | Workflow changes need validation |
| Mixed changes | ✅ Run | ✅ Run | ✅ Run | Any match triggers the workflow |

Benefits

Time Savings:

  • Docs-only change: ~18 minutes saved (only 3-5 min for docs vs. 20+ min for everything)
  • README-only change: ~18 minutes saved
  • Config-only change: ~5 minutes saved (skips docs and CodeQL)

Resource Savings:

  • ~70% fewer runner minutes for documentation updates
  • ~30GB less disk space operations per docs-only change
  • Reduced ML dependency installations

Developer Experience:

  • ✅ Faster feedback loop for documentation updates
  • ✅ Clear separation: code changes = full CI, docs changes = docs only
  • ✅ No wasted time waiting for unrelated checks

Examples

Example 1: Documentation Update

```bash
# You change only: docs/api/REFERENCE.md
git commit -m "Update API documentation"
```

What runs:

  • docs.yml runs (3-5 min)
  • python-app.yml skipped
  • codeql.yml skipped

Total CI time: ~3-5 minutes (vs. 20+ minutes before)

Example 2: Python Code Change

```bash
# You change: downloader.py
git commit -m "Fix download retry logic"
```

What runs:

  • python-app.yml runs (lint, test, docs, build)
  • docs.yml runs (API docs need rebuild)
  • codeql.yml runs (security scan on code)

Total CI time: ~15-20 minutes (all workflows needed)

Example 3: Mixed Changes

```bash
# You change: docs/index.md AND service.py
git commit -m "Update docs and fix service"
```

What runs:

  • ✅ All workflows run (code changed = full validation needed)

Total CI time: ~15-20 minutes (appropriate for code changes)

Minimal Docs CI/CD Validation ✅

The system now passes the "minimal docs CI/CD" requirement:

When changing ONLY documentation files:

  • ✅ Docs build and deploy (required)
  • ❌ NO Python linting
  • ❌ NO Python testing
  • ❌ NO security scanning
  • ❌ NO package building

Status: ✅ VALIDATED - Optimization complete


CI/CD Evolution Highlights

Key Improvements Over Time

  1. Path-Based Workflow Filtering ⭐ NEW
     - Intelligent path filtering prevents unnecessary workflow runs
     - Docs-only changes skip Python testing, linting, and security scanning
     - Saves ~18 minutes per docs-only commit
     - ~70% reduction in runner minutes for documentation updates

  2. Parallel Job Execution
     - Separated lint, unit tests, integration tests, docs, and build into independent parallel jobs
     - Reduced total CI time from ~20 minutes sequential to ~15 minutes parallel (limited by the slowest job)
     - Unit tests run fast (2-3 min) without ML dependencies; integration tests run in parallel (10-15 min)

  3. Smart Dependency Management
     - Lint job runs without ML dependencies for fast feedback (2-3 min)
     - Unit test job runs without ML dependencies for fast feedback (2-3 min)
     - Integration test job includes the full ML stack for complete validation
     - Separate dependency groups in pyproject.toml: [dev], [ml], [docs]

  4. ML Dependency Import Verification ⭐ NEW
     - Automatic check that unit tests can import modules without ML dependencies
     - Prevents modules from importing ML deps at top level (which would break CI)
     - Script: scripts/tools/check_unit_test_imports.py
     - Runs before unit tests in CI, catches issues early

  5. Comprehensive Security Scanning
     - CodeQL for static analysis (Python + Actions)
     - Snyk for dependency and Docker image scanning
     - Scheduled weekly scans for newly discovered vulnerabilities
     - Bandit & pip-audit in lint job for immediate feedback
     - Multiple layers of security validation

  6. Documentation as Code
     - Docs build validated on every PR
     - Automatic deployment to GitHub Pages on merge
     - API documentation auto-generated from docstrings

  7. Resource Optimization
     - Pip caching reduces dependency install time
     - Proactive disk space management
     - Post-test cache cleanup

  8. Developer Experience
     - Fast lint feedback (~2-3 min)
     - Clear separation of concerns (lint vs test)
     - make ci command to run full CI suite locally

Nightly Comprehensive Tests Workflow

File: .github/workflows/nightly.yml

Triggers:

  • Scheduled: Daily at 2 AM UTC (cron: 0 2 * * *)
  • Push to release branches: release/** (e.g., release/2.4, release/2.5)
  • Manual dispatch: workflow_dispatch
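
In workflow syntax these triggers look like the following (a sketch mirroring the list above):

```yaml
on:
  schedule:
    - cron: "0 2 * * *"   # daily at 2 AM UTC
  push:
    branches:
      - "release/**"
  workflow_dispatch:
```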

Purpose: Run comprehensive test suite with full metrics collection, trend tracking, and reporting.

Release Branch Testing

As of issue #248, the nightly workflow also runs on push to release branches (e.g., release/2.4, release/2.5).

Why Release Branch Testing?

  • Early detection: Catch issues in release branches before they affect releases
  • Metrics collection: Track test health and coverage for release branches
  • Release confidence: Ensure stable branches maintain quality
  • Immediate feedback: No need to wait for scheduled run

When It Runs:

  • Scheduled: Daily at 2 AM UTC on default branch (main)
  • On Push: When commits are pushed to any release/** branch
  • Manual: Via workflow_dispatch for any branch

Artifacts:

  • Test results, coverage reports, and metrics are uploaded for both scheduled and release branch runs
  • 90-day retention for all artifacts

Nightly Overview

The nightly workflow implements RFC-025 Layer 3 (Comprehensive Analysis). Unlike the main branch jobs which focus on fast validation, the nightly workflow:

  • Runs all tests (unit + integration + e2e, including slow/ml_models)
  • Collects comprehensive metrics (JUnit XML, coverage reports, JSON reports)
  • Tracks historical trends (metrics history file)
  • Publishes metrics to GitHub Pages (accessible via public URL)
  • Generates job summaries with key metrics

Metrics Collection Strategy

Current Implementation: Deep metrics collection (trends, history, dashboards) is centered in the nightly workflow; on main, the coverage-unified job publishes unified coverage and basic CI metrics.

Why centralize deep metrics in the nightly workflow?

  1. Simplicity: One place for comprehensive metrics collection, easier to maintain
  2. Performance: Main branch jobs stay fast (minimal metrics overhead)
  3. Sufficiency: Daily metrics are enough for trend tracking
  4. Cost: Less artifact storage and processing

What Gets Collected:

  • JUnit XML (reports/junit.xml) - Test results and timing
  • Coverage XML (reports/coverage.xml) - Coverage data
  • Coverage HTML (reports/coverage-html/) - Human-readable coverage reports
  • JSON Report (reports/pytest.json) - Structured test metrics
  • Slowest Tests (--durations=20) - Identifies performance bottlenecks
  • Metrics JSON (metrics/latest.json) - Aggregated metrics for consumption
  • History File (metrics/history.jsonl) - Historical trend data

Metrics Publishing:

  • Metrics are published to the gh-pages branch
  • Accessible via GitHub Pages: https://[username].github.io/podcast_scraper/metrics/
  • metrics/latest.json - Latest metrics (machine-readable)
  • metrics/history.jsonl - Historical trends (line-delimited JSON)

Nightly Workflow Jobs

nightly-tests Job

Purpose: Run all tests with comprehensive metrics collection using production ML models

Duration: ~4-5 hours (includes all tests with production models: Whisper base, BART-large-cnn, LED-large-16384)

Important: OpenAI/LLM provider tests are excluded from nightly runs to avoid API costs (see issue #183). The nightly build uses only local ML models (Whisper, spaCy, Transformers). This excludes ~50 tests marked with @pytest.mark.llm.

Steps:

  1. Checkout code (full history for trend tracking)
  2. Free disk space
  3. Set up Python 3.11
  4. Cache ML models (for faster execution)
  5. Install full dependencies (including ML + pytest-json-report)
  6. Preload production ML models via make preload-ml-models-production (if cache miss)
  7. Run regular tests (unit + integration + e2e, excluding nightly-only and LLM tests):
     - Marker: -m "not nightly and not llm"
  8. Run nightly-only tests via make test-nightly:
     - Marker: -m "nightly and not llm"
     - Uses production models: Whisper base, BART-large-cnn, LED-large-16384
     - Processes all 15 episodes across 5 podcasts (p01-p05)
     - Validates full pipeline with production-quality models
  9. Run all tests with comprehensive metrics:
     - --junitxml=reports/junit.xml (JUnit XML for test results)
     - --json-report --json-report-file=reports/pytest.json (structured JSON metrics)
     - --cov-report=xml:reports/coverage.xml (coverage XML)
     - --cov-report=html:reports/coverage-html (coverage HTML)
     - --cov-report=term-missing (terminal coverage report)
     - --durations=20 (slowest tests identification)
     - --reruns 2 --reruns-delay 1 (flaky test handling)
     - --disable-socket --allow-hosts=127.0.0.1,localhost (network guard)
  10. Generate job summary with key metrics: test summary (total, passed, failed, skipped, duration), coverage percentage, slowest tests (top 10), flaky test detection, metric alerts
  11. Generate metrics (scripts/dashboard/generate_metrics.py): extracts metrics from pytest JSON and coverage XML, creates metrics/latest.json with trends and alerts, updates metrics/history.jsonl, calculates trends, detects deviations
  12. Generate HTML dashboard (scripts/dashboard/generate_dashboard.py): creates an interactive Chart.js dashboard with current metrics, trends, alerts, slowest tests, and flaky test details; outputs metrics/index.html
  13. Upload artifacts (test reports, coverage reports, metrics) with 90-day retention
  14. Upload metrics artifact for GitHub Pages deployment

deploy-metrics Job

Purpose: Deploy metrics to GitHub Pages

Runs: After nightly-tests completes (even if tests fail)

Steps:

  1. Deploy metrics to GitHub Pages using actions/deploy-pages@v4:
     - Automatically publishes the metrics/ directory from the artifact
     - Accessible via: https://[username].github.io/podcast_scraper/metrics/
     - Includes latest.json, history.jsonl, and index.html (dashboard)

Metrics Dashboard:

  • URL: https://[username].github.io/podcast_scraper/metrics/index.html
  • Type: Unified dashboard with a data source selector (CI or Nightly)
  • Features:
    - Current metrics display (tests, coverage, runtime, flaky tests)
    - Interactive trend charts (runtime, coverage, test count, flaky tests)
    - Alerts section (regressions, coverage drops, flaky test increases)
    - Slowest tests table
    - Flaky tests table (tests that passed on rerun)
    - Historical data visualization (last 30 runs)

Result: Metrics accessible via public URL

Concurrency Control:

  • Uses concurrency: group: "pages-metrics" with cancel-in-progress: false
  • Allows multiple nightly runs to accumulate metrics without conflicts
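
Expressed in workflow syntax:

```yaml
concurrency:
  group: "pages-metrics"
  cancel-in-progress: false   # let runs queue instead of cancelling each other
```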

Nightly Workflow Performance Benchmarks

Reference timing from January 2026 (with cache hits):

| Job | Duration | Notes |
|---|---|---|
| preload-ml-models-nightly | 3:30 | Fast with cache hit |
| nightly-lint | 3:00 | Runs in parallel |
| nightly-docs | 2:40 | Runs in parallel |
| nightly-test-unit | 1:40 | No ML deps, runs in parallel |
| nightly-test-integration | 6:30 | After preload |
| nightly-test-e2e | 11:30 | After preload |
| nightly-only-tests | 64:00 | 77% of total time |
| Total wall time | ~80 min | Critical path limited |

Critical Path:

```text
preload (3:30) → e2e (11:30) → nightly-only (64:00) → metrics
                                                    ≈ 80 min total
```

Key Observations:

  • nightly-only-tests dominates at 64 minutes (production models: bart-large-cnn, led-large-16384)
  • Parallel jobs (lint, docs, unit) complete while preload runs
  • With cold cache (first run), preload can take 15-20 minutes for model downloads (~10GB)
  • Integration and E2E tests benefit from pre-cached models

Optimization Notes:

  • Unit tests have no ML dependencies → run immediately without waiting for preload
  • Production models are much larger than test models → explains long nightly-only runtime
  • Further optimization would require splitting nightly-only-tests or using faster hardware

nightly-deps-analysis Job

Purpose: Analyze module dependencies and detect architectural issues

Duration: ~3-5 minutes

Runs: In parallel after lint and build jobs pass

Steps:

  1. Checkout code
  2. Set up Python 3.11
  3. Install dependencies (including pydeps)
  4. Generate dependency graphs:
     - make deps-graph - Simplified graph (clustered, max-bacon=2)
     - make deps-graph-full - Full dependency graph
  5. Check for circular imports via make deps-check-cycles
  6. Run dependency analysis script (python scripts/tools/analyze_dependencies.py --report):
     - Analyzes import patterns
     - Checks thresholds (max 15 imports per module)
     - Generates a JSON report
  7. Upload dependency artifacts (90-day retention):
     - reports/deps*.svg - Dependency graphs (visual)
     - reports/deps*.json - Analysis report (structured)

Configuration:

  • continue-on-error: true - Don't fail nightly if dependency analysis has issues
  • Artifacts retained for 90 days for tracking architecture changes over time

Output Files:

  • reports/deps-simple.svg - Simplified dependency graph (easy to understand)
  • reports/deps-full.svg - Complete dependency graph (detailed)
  • reports/deps-analysis.json - Structured analysis with issues and metrics

Key Metrics Tracked:

| Metric | Description | Threshold |
|---|---|---|
| Circular imports | Cycles in import graph | 0 |
| Fan-out | Modules a file imports | <15 |
| Module count | Total modules analyzed | Monitor |

Benefits:

  • 📊 Visualize dependencies: SVG graphs show module relationships
  • 🔍 Detect cycles: Catch circular imports automatically
  • 📈 Track architecture: Monitor coupling and dependencies over time
  • 🚨 Early warnings: Identify modules exceeding import thresholds

Metrics vs Main Branch Jobs

Main Branch Jobs (python-app.yml):

  • ✅ Fast validation (pass/fail)
  • ⚠️ Limited metrics (unified coverage via coverage-unified, no deep trend analysis)
  • ✅ Quick feedback (5-20 minutes)
  • Purpose: Validate code works

Nightly Workflow (nightly.yml):

  • ✅ Comprehensive metrics collection
  • ✅ Historical trend tracking
  • ✅ Full test suite (including slow tests)
  • ✅ Metrics publishing to GitHub Pages
  • Purpose: Deep analysis and trend tracking

Why This Design?

  1. Fast PR feedback: Main branch jobs focus on validation, not analysis
  2. Comprehensive analysis: Nightly workflow does deep metrics collection
  3. Resource efficiency: Metrics collection only once per day
  4. Simplicity: Clear separation of concerns

Accessing Metrics

Via GitHub Pages Dashboard:

  • Unified Dashboard: https://[username].github.io/podcast_scraper/metrics/index.html
  • Use the dropdown selector to switch between CI and Nightly data sources
  • Interactive charts, alerts, and comprehensive metrics visualization

Via JSON API (Machine-Readable):

  • CI metrics: https://[username].github.io/podcast_scraper/metrics/latest-ci.json
  • CI history: https://[username].github.io/podcast_scraper/metrics/history-ci.jsonl
  • Nightly metrics: https://[username].github.io/podcast_scraper/metrics/latest-nightly.json
  • Nightly history: https://[username].github.io/podcast_scraper/metrics/history-nightly.jsonl

Via GitHub Actions:

  • Job summaries show key metrics in workflow run
  • Artifacts available for download (90-day retention)

Via Automation:

```bash
# Fetch latest CI metrics
curl https://[username].github.io/podcast_scraper/metrics/latest-ci.json | jq

# Fetch latest nightly metrics
curl https://[username].github.io/podcast_scraper/metrics/latest-nightly.json | jq

# Check CI history (last entry)
curl https://[username].github.io/podcast_scraper/metrics/history-ci.jsonl | tail -1 | jq

# Check nightly history (last entry)
curl https://[username].github.io/podcast_scraper/metrics/history-nightly.jsonl | tail -1 | jq
```

Quick Reference

Workflow Files

```text
.github/workflows/
├── python-app.yml    # Main CI (lint, test, docs, build)
├── docs.yml          # Documentation deployment
├── codeql.yml        # Security scanning
├── docker.yml        # Docker build & test
├── snyk.yml          # Snyk security scans
└── nightly.yml       # Nightly comprehensive tests
```

Local Commands

```bash
# Individual checks
make format-check lint type security test docs build

# Auto-fix formatting
make format

# Run the full CI suite locally
make ci
```