Report Aggregation Architecture - Developer Guide¶

This guide explains how to work with FinWiz's report aggregation architecture, which follows the AI Minimalism principle: use Python for deterministic tasks and reserve AI exclusively for analysis requiring reasoning.

Table of Contents¶

Architecture Overview
Adding New Crew Types
Creating New Templates
Extending Export Schemas
Python vs AI Task Decisions
Testing Guidelines
Common Patterns

Architecture Overview¶

Core Principles¶

Pydantic-First: All crew outputs validated with strict Pydantic schemas
Python for Determinism: HTML generation and data consolidation use Jinja2 templates and Python functions (NO AI)
File-Based Data Passing: Pass file paths (not data) between crews to avoid context limits
Concurrent Execution: All SME crews run in parallel for maximum performance
Clean Architecture: Clear separation between analysis (AI) and presentation (Python)

Data Flow¶

Text Only

AI Analysis → JSON Export → Python Template → HTML Report
                ↓
         Consolidation (Python) → Final Report (Python)

Adding New Crew Types¶

Step 1: Define Pydantic Export Schema¶

Create a new export schema in src/finwiz/schemas/crew_exports.py:

Python

from pydantic import BaseModel, Field
from typing import List, Dict, Any
from datetime import datetime

class MyNewCrewExport(CrewExportBase):
    """Export schema for MyNewCrew analysis."""
    crew_name: str = Field(default="my_new_crew")

    # Analysis Results
    analysis_data: Dict[str, Any] = Field(..., description="Main analysis results")
    risk_assessment: RiskAssessmentStandardized

    # Scores and Grades
    composite_score: float = Field(..., ge=0.0, le=1.0)
    grade: str = Field(..., pattern="^(A\\+|A|B|C|D|F)$")

    # Recommendations
    recommendation: str = Field(..., pattern="^(BUY|HOLD|SELL)$")
    confidence: float = Field(..., ge=0.0, le=1.0)
    rationale: str = Field(..., min_length=50)

    # Metadata
    data_sources: List[str]
    report_html_path: str
    report_json_path: str

    model_config = {
        "extra": "forbid",  # Reject unknown fields
        "str_strip_whitespace": True
    }

Key Requirements:

Extend CrewExportBase for common fields
Use extra='forbid' for strict validation
Include report_html_path and report_json_path
Add field validators for complex validation
Document all fields with descriptions

Step 2: Add Final Reporter to Crew¶

Add a final reporter agent to your crew that generates the export:

Python

# In src/finwiz/crews/my_new_crew/my_new_crew.py

from finwiz.utils.agent_validators import final_reporter
from finwiz.schemas.crew_exports import MyNewCrewExport
import json
from pathlib import Path

@final_reporter
@agent
def investment_reporter(self) -> Agent:
    """Final reporter that creates validated export."""
    return Agent(
        config=self.agents_config["investment_reporter"],
        tools=[],  # MUST be empty - enforced by decorator
        verbose=True
    )

@task
def generate_export_task(self) -> Task:
    """Generate Pydantic-validated export object."""
    return Task(
        description="""
        Consolidate all analysis findings from context and create a validated export.

        Steps:
        1. Extract analysis data from context
        2. Create MyNewCrewExport object with all required fields
        3. Validate against Pydantic schema
        4. Save JSON to output/reports/{session_id}/my_new_crew/{ticker}_export.json
        5. Return the export object

        CRITICAL: All fields must be populated with actual data, not placeholders.
        """,
        expected_output="Validated MyNewCrewExport object saved to JSON",
        agent=self.investment_reporter(),
        async_execution=False  # Final task must be synchronous
    )

Key Requirements:

Use @final_reporter decorator to enforce empty tools
Final task must be synchronous (async_execution=False)
Save JSON to standardized path: output/reports/{session_id}/{crew_name}/{ticker}_export.json
Validate against Pydantic schema before saving

Step 3: Update Flow to Generate HTML¶

Update the Flow to call Python HTML generation after crew execution:

Python

# In src/finwiz/flows/flow_orchestrator.py

from finwiz.tools.html_report_generator import HTMLReportGenerator

@listen("initialize_flow")
def execute_my_new_crew(self) -> dict[str, Any]:
    """Execute MyNewCrew and generate HTML report."""
    # Execute crew
    crew = MyNewCrew()
    result = crew.crew().kickoff(inputs={"ticker": self.state.ticker})

    # Get JSON export path
    json_path = f"output/reports/{self.state.session_id}/my_new_crew/{self.state.ticker}_export.json"

    # Generate HTML from JSON using Python template
    generator = HTMLReportGenerator()
    html_path = generator.generate_crew_report(
        crew_name="my_new_crew",
        export_data=json.loads(Path(json_path).read_text()),
        output_path=json_path.replace("_export.json", "_report.html")
    )

    # Store paths in state
    if "my_new_crew" not in self.state.crew_export_paths:
        self.state.crew_export_paths["my_new_crew"] = []
    self.state.crew_export_paths["my_new_crew"].append(json_path)

    if "my_new_crew" not in self.state.crew_html_paths:
        self.state.crew_html_paths["my_new_crew"] = []
    self.state.crew_html_paths["my_new_crew"].append(html_path)

    return {"crew_name": "my_new_crew", "json_path": json_path, "html_path": html_path}

Step 4: Update Consolidation¶

Update ReportConsolidator to handle the new crew type:

Python

# In src/finwiz/utils/report_consolidator.py

from finwiz.schemas.crew_exports import MyNewCrewExport

class ReportConsolidator:
    def consolidate_reports(self, crew_export_paths: Dict[str, List[str]]) -> ConsolidatedReportExport:
        """Consolidate all crew exports."""
        # ... existing code ...

        # Load MyNewCrew exports
        my_new_analyses = []
        if "my_new_crew" in crew_export_paths:
            my_new_analyses = self._load_exports(
                crew_export_paths["my_new_crew"],
                MyNewCrewExport
            )

        # Create consolidated export
        consolidated = ConsolidatedReportExport(
            session_id=self.session_id,
            # ... existing fields ...
            my_new_analyses=my_new_analyses
        )

        return consolidated

Creating New Templates¶

Step 1: Create Base Template (if needed)¶

If you need custom styling, extend the base template:

{% raw %}

HTML

<!-- src/finwiz/templates/crew_reports/base.html -->
<!DOCTYPE html>
<html lang="fr">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>{% block title %}Rapport FinWiz{% endblock %}</title>
    <style>
        /* Professional CSS styling */
        body {
            font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
            line-height: 1.6;
            max-width: 1200px;
            margin: 0 auto;
            padding: 20px;
            background-color: #f5f5f5;
        }

        /* Light/Dark mode support */
        @media (prefers-color-scheme: dark) {
            body {
                background-color: #1a1a1a;
                color: #e0e0e0;
            }
        }

        /* Grade colors */
        .grade-a-plus { color: #27ae60; font-weight: bold; }
        .grade-a { color: #2ecc71; }
        .grade-b { color: #f39c12; }
        .grade-c { color: #e67e22; }
        .grade-d { color: #e74c3c; }
        .grade-f { color: #c0392b; font-weight: bold; }

        /* Responsive tables */
        table {
            width: 100%;
            border-collapse: collapse;
            margin: 20px 0;
            background-color: white;
        }

        @media (prefers-color-scheme: dark) {
            table {
                background-color: #2a2a2a;
            }
        }
    </style>
    {% block extra_styles %}{% endblock %}
</head>
<body>
    {% block content %}{% endblock %}
</body>
</html>

{% endraw %}

Step 2: Create Crew-Specific Template¶

Create a template for your crew in src/finwiz/templates/crew_reports/:

{% raw %}

HTML

<!-- src/finwiz/templates/crew_reports/my_new_crew_report.html -->
{% extends "crew_reports/base.html" %}

{% block title %}Analyse {{ data.ticker }} - {{ data.crew_name }}{% endblock %}

{% block content %}
<h1>📊 Analyse {{ data.ticker }}</h1>
<p><strong>Date:</strong> {{ data.analysis_date }}</p>
<p><strong>Classe d'actif:</strong> {{ data.asset_class }}</p>

<section>
    <h2>Recommandation</h2>
    <p class="grade-{{ data.grade.lower().replace('+', '-plus') }}">
        {% if data.recommendation == "BUY" %}✅{% elif data.recommendation == "SELL" %}❌{% else %}⏸️{% endif %}
        <strong>{{ data.recommendation }}</strong> - Grade {{ data.grade }}
    </p>
    <p><strong>Score composite:</strong> {{ "%.2f"|format(data.composite_score) }}</p>
    <p><strong>Confiance:</strong> {{ "%.0f"|format(data.confidence * 100) }}%</p>
</section>

<section>
    <h2>Analyse</h2>
    <p>{{ data.rationale }}</p>
</section>

<section>
    <h2>Évaluation des risques</h2>
    <table>
        <tr>
            <th>Risque</th>
            <th>Score</th>
        </tr>
        <tr>
            <td>Risque global</td>
            <td>{{ data.risk_assessment.overall_risk_score }}/5</td>
        </tr>
        <tr>
            <td>Risque systématique</td>
            <td>{{ data.risk_assessment.systematic_risk }}/5</td>
        </tr>
        <tr>
            <td>Risque idiosyncratique</td>
            <td>{{ data.risk_assessment.idiosyncratic_risk }}/5</td>
        </tr>
    </table>
</section>

<section>
    <h2>Sources de données</h2>
    <ul>
    {% for source in data.data_sources %}
        <li>{{ source }}</li>
    {% endfor %}
    </ul>
</section>

<footer>
    <p><small>Généré le {{ data.analysis_date }} par FinWiz</small></p>
</footer>
{% endblock %}

{% endraw %}

Template Best Practices:

Use French for all user-facing text
Include responsive design for mobile/desktop
Support light/dark mode with CSS media queries
Use semantic HTML (sections, tables, lists)
Include grade color classes
Add generation timestamp in footer

Step 3: Register Template in HTMLReportGenerator¶

Update HTMLReportGenerator to load your template:

Python

# In src/finwiz/tools/html_report_generator.py

class HTMLReportGenerator:
    def generate_crew_report(
        self,
        crew_name: str,
        export_data: dict,
        output_path: str
    ) -> str:
        """Generate HTML report from crew export data."""
        # Map crew names to templates
        template_map = {
            "stock_crew": "stock_report.html",
            "etf_crew": "etf_report.html",
            "crypto_crew": "crypto_report.html",
            "my_new_crew": "my_new_crew_report.html",  # Add your template
        }

        template_name = template_map.get(crew_name)
        if not template_name:
            raise ValueError(f"No template found for crew: {crew_name}")

        # Load and render template
        template = self.jinja_env.get_template(f"crew_reports/{template_name}")
        html_content = template.render(data=export_data)

        # Save HTML
        Path(output_path).parent.mkdir(parents=True, exist_ok=True)
        Path(output_path).write_text(html_content, encoding='utf-8')

        return output_path

Extending Export Schemas¶

Adding New Fields¶

To add fields to an existing export schema:

Python

# In src/finwiz/schemas/crew_exports.py

class StockCrewExport(CrewExportBase):
    # ... existing fields ...

    # Add new field
    new_metric: float = Field(
        ...,
        ge=0.0,
        le=100.0,
        description="New metric description"
    )

    # Add field validator
    @field_validator('new_metric')
    @classmethod
    def validate_new_metric(cls, v: float) -> float:
        """Validate new metric is reasonable."""
        if v < 0 or v > 100:
            raise ValueError("New metric must be between 0 and 100")
        return v

Adding Nested Models¶

For complex data structures, create nested Pydantic models:

Python

class DetailedAnalysis(BaseModel):
    """Nested model for detailed analysis."""
    metric_1: float
    metric_2: float
    summary: str

    model_config = {"extra": "forbid"}

class StockCrewExport(CrewExportBase):
    # ... existing fields ...

    # Use nested model
    detailed_analysis: DetailedAnalysis

Schema Versioning¶

When making breaking changes, version your schemas:

Python

class StockCrewExportV2(CrewExportBase):
    """Version 2 of StockCrewExport with breaking changes."""
    schema_version: str = Field(default="2.0")

    # ... new fields ...

Python vs AI Task Decisions¶

Decision Framework¶

Use this checklist to decide whether a task should use AI or Python:

Use Python (NOT AI) For¶

Is this task deterministic? (same input = same output)
Can this be expressed as a template?
Is this just data transformation?
Is this a calculation or validation?
Can a junior developer implement this in Python?

If you answered YES to any question, use Python, not AI.

Use AI ONLY For¶

Does this require reasoning or judgment?
Does this involve interpreting complex data?
Does this require synthesis of multiple sources?
Does this involve natural language understanding?
Does this require creative content generation?

Examples¶

❌ WRONG: Using AI for HTML Generation¶

Python

@task
def generate_html_report(self) -> Task:
    return Task(
        description="Generate HTML report from JSON data",
        agent=self.reporter(),  # AI agent
        # WRONG: Wasting LLM calls on template rendering
    )

✅ CORRECT: Using Python Template¶

Python

def generate_html_report(json_data: dict) -> str:
    """Generate HTML report using Jinja2 template."""
    template = jinja_env.get_template('report.html')
    return template.render(data=json_data)
    # CORRECT: Fast, cheap, testable

❌ WRONG: Using AI for Data Consolidation¶

Python

@task
def consolidate_reports(self) -> Task:
    return Task(
        description="Read all crew reports and consolidate them",
        agent=self.aggregator(),  # AI agent
        # WRONG: Wasting LLM calls on file reading
    )

✅ CORRECT: Using Python Function¶

Python

def consolidate_reports(file_paths: list[str]) -> ConsolidatedReport:
    """Consolidate crew reports using Python."""
    reports = []
    for path in file_paths:
        with open(path) as f:
            report = CrewReport.model_validate_json(f.read())
            reports.append(report)
    return ConsolidatedReport(reports=reports)
    # CORRECT: Fast, cheap, testable

Cost-Benefit Analysis¶

Task Type	AI Cost	Python Cost	AI Time	Python Time	Reliability
HTML Generation	$0.10-0.30	$0	5-15s	0.01s	95%
Data Consolidation	$0.05-0.15	$0	3-10s	0.001s	95%
Template Rendering	$0.10-0.30	$0	5-15s	0.01s	95%
Calculations	$0.05-0.20	$0	2-8s	0.001s	95%
Python Total	-	$0	-	<0.1s	100%

Savings per execution: $0.30-0.95 per task × 4-8 tasks = $1.20-7.60

Testing Guidelines¶

Unit Testing Export Schemas¶

Python

# tests/unit/schemas/test_crew_exports.py

import pytest
from finwiz.schemas.crew_exports import MyNewCrewExport
from pydantic import ValidationError

def test_should_validate_valid_export():
    """Test that valid export data passes validation."""
    export = MyNewCrewExport(
        ticker="AAPL",
        asset_class="stock",
        composite_score=0.85,
        grade="A",
        recommendation="BUY",
        confidence=0.9,
        rationale="Strong fundamentals and growth prospects",
        data_sources=["Yahoo Finance", "SEC EDGAR"],
        report_html_path="output/reports/session/my_new_crew/AAPL_report.html",
        report_json_path="output/reports/session/my_new_crew/AAPL_export.json"
    )

    assert export.ticker == "AAPL"
    assert export.grade == "A"

def test_should_reject_invalid_grade():
    """Test that invalid grade is rejected."""
    with pytest.raises(ValidationError) as exc_info:
        MyNewCrewExport(
            ticker="AAPL",
            grade="Z",  # Invalid grade
            # ... other fields ...
        )

    assert "grade" in str(exc_info.value)

def test_should_reject_extra_fields():
    """Test that extra fields are rejected (extra='forbid')."""
    with pytest.raises(ValidationError) as exc_info:
        MyNewCrewExport(
            ticker="AAPL",
            unknown_field="value",  # Extra field
            # ... other fields ...
        )

    assert "extra fields not permitted" in str(exc_info.value)

Unit Testing HTML Generation¶

Python

# tests/unit/tools/test_html_report_generator.py

import pytest
from finwiz.tools.html_report_generator import HTMLReportGenerator

def test_should_generate_html_from_export(mocker, tmp_path):
    """Test HTML generation from crew export."""
    # Arrange
    generator = HTMLReportGenerator()
    export_data = {
        "ticker": "AAPL",
        "grade": "A",
        "recommendation": "BUY",
        # ... other fields ...
    }
    output_path = tmp_path / "report.html"

    # Act
    result_path = generator.generate_crew_report(
        crew_name="my_new_crew",
        export_data=export_data,
        output_path=str(output_path)
    )

    # Assert
    assert output_path.exists()
    html_content = output_path.read_text()
    assert "AAPL" in html_content
    assert "Grade A" in html_content
    assert "BUY" in html_content

Unit Testing Consolidation¶

Python

# tests/unit/utils/test_report_consolidator.py

import pytest
from finwiz.utils.report_consolidator import ReportConsolidator

def test_should_consolidate_crew_exports(mocker, tmp_path):
    """Test consolidation of multiple crew exports."""
    # Arrange
    consolidator = ReportConsolidator(session_id="test_session")

    # Create mock JSON files
    stock_export = tmp_path / "stock_export.json"
    stock_export.write_text('{"ticker": "AAPL", "grade": "A", ...}')

    crew_export_paths = {
        "stock_crew": [str(stock_export)]
    }

    # Act
    consolidated = consolidator.consolidate_reports(crew_export_paths)

    # Assert
    assert consolidated.session_id == "test_session"
    assert len(consolidated.stock_analyses) == 1
    assert consolidated.stock_analyses[0].ticker == "AAPL"

Common Patterns¶

Pattern 1: Crew with Final Reporter¶

Python

class MyNewCrew:
    @final_reporter
    @agent
    def investment_reporter(self) -> Agent:
        return Agent(
            config=self.agents_config["investment_reporter"],
            tools=[],  # Enforced empty
            verbose=True
        )

    @task
    def generate_export_task(self) -> Task:
        return Task(
            description="Generate validated export from context",
            expected_output="MyNewCrewExport object saved to JSON",
            agent=self.investment_reporter(),
            async_execution=False  # Final task must be sync
        )

Pattern 2: Flow Method with HTML Generation¶

Python

@listen("initialize_flow")
def execute_crew_with_html(self) -> dict[str, Any]:
    """Execute crew and generate HTML report."""
    # 1. Execute crew
    crew = MyNewCrew()
    result = crew.crew().kickoff(inputs=inputs)

    # 2. Get JSON path
    json_path = f"output/reports/{session_id}/{crew_name}/{ticker}_export.json"

    # 3. Generate HTML using Python
    generator = HTMLReportGenerator()
    html_path = generator.generate_crew_report(
        crew_name=crew_name,
        export_data=json.loads(Path(json_path).read_text()),
        output_path=json_path.replace("_export.json", "_report.html")
    )

    # 4. Store paths in state
    self.state.crew_export_paths[crew_name] = [json_path]
    self.state.crew_html_paths[crew_name] = [html_path]

    return {"json_path": json_path, "html_path": html_path}

Pattern 3: Python Consolidation Function¶

Python

def consolidate_reports(crew_export_paths: Dict[str, List[str]]) -> ConsolidatedReportExport:
    """Pure Python consolidation (NO AI)."""
    # 1. Load and validate exports
    stock_analyses = _load_exports(crew_export_paths.get("stock_crew", []), StockCrewExport)
    etf_analyses = _load_exports(crew_export_paths.get("etf_crew", []), ETFCrewExport)

    # 2. Create consolidated export
    consolidated = ConsolidatedReportExport(
        session_id=session_id,
        stock_analyses=stock_analyses,
        etf_analyses=etf_analyses,
        consolidation_date=datetime.now()
    )

    # 3. Save to JSON
    output_path = f"output/reports/{session_id}/consolidated_report.json"
    Path(output_path).write_text(consolidated.model_dump_json(indent=2))

    return consolidated

Pattern 4: Jinja2 Template with French Localization¶

{% raw %}

HTML

{% extends "crew_reports/base.html" %}

{% block content %}
<h1>📊 Analyse {{ data.ticker }}</h1>

<section>
    <h2>Recommandation</h2>
    <p class="grade-{{ data.grade.lower().replace('+', '-plus') }}">
        {% if data.recommendation == "BUY" %}
            ✅ <strong>ACHETER</strong>
        {% elif data.recommendation == "SELL" %}
            ❌ <strong>VENDRE</strong>
        {% else %}
            ⏸️ <strong>CONSERVER</strong>
        {% endif %}
        - Grade {{ data.grade }}
    </p>
</section>

<section>
    <h2>Évaluation des risques</h2>
    <p>Score de risque global: {{ data.risk_assessment.overall_risk_score }}/5</p>
</section>
{% endblock %}

{% endraw %}

Summary¶

The report aggregation architecture provides:

✅ Cost Savings: $7-13 per execution by using Python instead of AI
✅ Performance: 136-290 seconds faster per execution
✅ Quality: 100% consistent formatting with templates
✅ Testability: Full unit test coverage for all components
✅ Maintainability: Clear separation of concerns (AI for analysis, Python for presentation)

Follow these patterns to extend the architecture while maintaining these benefits.

Version: 1.0 Last Updated: 2025-01-25 Related Docs:

Architecture Design (internal spec)
Requirements (internal spec)
Implementation Tasks (internal spec)