Let op: dit experiment is nog niet Codex-gevalideerd. Gebruik de bevindingen als voorlopige aanwijzingen.

Hypotheses

FAMILY_INTERNATIONAL_LEAD_LAG_DYNAMICS: Experiment Log

FAMILY_INTERNATIONAL_LEAD_LAG_DYNAMICS

Testing properly aligned weekly international price lead-lag relationships between Netherlands and neighboring markets (Belgium, Germany, France). This hypothesis uses REAL DATA ONLY from BoerderijApi's international price records, with proper weekly alignment instead of flawed exact-date matching.

Laatste update
2025-12-01
Repo-pad
hypotheses/FAMILY_INTERNATIONAL_LEAD_LAG_DYNAMICS
Codex-bestand
Ontbreekt

Experimentnotities

FAMILY_INTERNATIONAL_LEAD_LAG_DYNAMICS: Experiment Log

Overview

Testing properly aligned weekly international price lead-lag relationships between Netherlands and neighboring markets (Belgium, Germany, France). This hypothesis uses REAL DATA ONLY from BoerderijApi's international price records, with proper weekly alignment instead of flawed exact-date matching.

Hypothesis Origins

  • FAMILY_CROSS_MARKET_COUPLING: CONDITIONALLY SUPPORTED (86-87%) but used contemporaneous prices
  • Data Discovery: Boerderij Excel contains 438 BE, 190 DE, 152 FR weekly records now API-accessible
  • Failed Exact-Date Studies: Previous cross-market analyses failed due to different reporting days
  • Industry Evidence: German potato traders report 1-2 week price leadership over Dutch market
  • Academic Basis: Granger (1969) causality; Engle & Granger (1987) cointegration theory

Experiment Design

  • Method: Rolling-origin cross-validation
  • Training Window: 365 days minimum
  • Step Size: 7 days (weekly)
  • Test Window: 60 days maximum
  • Baselines: ALL mandatory standard baselines (persistent, seasonal_naive, ar2, historical_mean)
  • REAL DATA ONLY: BoerderijApi international prices

Data Sources (REAL DATA ONLY)

  • Dutch Prices: BoerderijApi - NL.157.2086 consumption potatoes - git:current
  • Belgian Prices: BoerderijApi - BE.157.2086 (438 weekly records, legacy=true) - git:current
  • German Prices: BoerderijApi - DE.157.2086 (190 weekly records, legacy=true) - git:current
  • French Prices: BoerderijApi - FR.157.2086 (152 weekly records, legacy=true) - git:current
  • Critical: Weekly alignment by week-of-year, NOT exact date matching

Experiment Runs

Variant A: German Market Leads Dutch by 1-2 Weeks

Status: Ready for implementation - Model: RandomForest, GradientBoosting, Ridge regression - Features: de_price_lag_1w, de_price_lag_2w, de_price_momentum_2w, nl_de_price_spread_1w, de_volatility_4w, cross_correlation_de_nl, lead_lag_stability_de, transport_cost_ma4, nl_price_lag_1w - Horizons: 30-day, 60-day - Mechanism: German market size advantage creates systematic 1-2 week information lead over Dutch market - Expected: >15% improvement using real DE-NL price relationships - Hypothesis: German prices lagged 1-2 weeks predict Dutch prices due to market size asymmetries and early price discovery

Variant B: French Market Leads Belgian by 2-3 Weeks

Status: Ready for implementation - Model: GradientBoosting, RandomForest, XGBoost - Features: fr_price_lag_2w, fr_price_lag_3w, be_price_lag_1w, fr_be_price_spread_2w, fr_price_momentum_3w, triangular_signal_fr_be_nl, fr_volatility_4w, be_nl_correlation_4w, transport_cost_fr_index, nl_price_lag_1w - Horizons: 30-day, 60-day - Mechanism: French market leadership transmits through Belgian processing connections to Dutch prices with compound 2-3 week lag - Expected: >12% improvement using real FR-BE-NL price chain - Hypothesis: Triangular causality chain (FR→BE→NL) outperforms direct relationships

Variant C: Multi-Market Granger Causality Network

Status: Ready for implementation - Model: Ensemble (GradientBoosting 0.4, RandomForest 0.4, VectorAutoregression 0.2) - Features: de_granger_signal, be_granger_signal, fr_granger_signal, de_optimal_lag, be_optimal_lag, fr_optimal_lag, causality_strength_de/be/fr, reverse_causality_nl_de, network_centrality_nl, dominant_lead_market, multi_market_momentum, causality_stability_4w, cointegration_residual, nl_price_lag_1w - Horizons: 30-day, 60-day - Complexity: Multi-directional causality network with optimal lag selection and cointegration - Expected: >18% improvement through systematic directional causality analysis - Hypothesis: Different optimal lag structures across market pairs with network effects providing superior forecasting

Statistical Tests

  • Diebold-Mariano test with Harvey-Leybourne-Newbold correction vs strongest baseline
  • TOST equivalence test with SESOI = 15% improvement (information advantage from lead-lag)
  • Granger causality tests for all market pairs (DE→NL, FR→BE, BE→NL, reverse tests)
  • Johansen cointegration test for long-run equilibrium relationships
  • FDR correction for multiple comparisons across variants
  • ALL 4 mandatory standard baselines: persistent, seasonal_naive, ar2, historical_mean (strongest baseline comparison)

Weekly Alignment Methodology (CRITICAL INNOVATION)

  • ISO Week Matching: Align by week-of-year, NOT calendar dates
  • Reporting Schedule Correction: BE Tuesday, DE Wednesday, FR Thursday, NL Friday
  • Week-Ending Aggregation: Standardize all markets to week-ending prices
  • Maximum lag: 4 weeks for causality testing
  • Cross-correlation: 52-week rolling windows for relationship stability
  • Optimal lag selection: AIC criterion for each market pair
  • Causality significance: p < 0.05 for directional relationships

Verdicts

(Experiments not yet run)

HE Notes

  • Created 2025-08-18 leveraging newly discovered international price data
  • Corrects fundamental flaw in previous studies (exact-date matching)
  • Weekly alignment crucial - markets report on different weekdays
  • First proper lead-lag analysis with actual international data
  • SESOI set at 25% due to information advantage from leading markets
  • All variants use ONLY REAL DATA from BoerderijApi

Decision Log

(To be updated after experiment completion)

Geen Codex-samenvatting

Voeg codex_validated.md toe om de status te documenteren.