FAMILY_AUTOML_PATTERN_DISCOVERY - Experiment Log

Overview

Objective: Use AutoML frameworks to automatically discover hidden patterns and beat the current 53.7% improvement baseline at 12-week horizons through computational brute force and automated feature engineering.

Status: DEVELOPMENT
Priority: CRITICAL
Target Performance: 60%+ improvement (vs current 53.7% baseline)

Variants

Variant A: AutoGluon comprehensive ensemble with 1000+ automated features
Variant B: H2O AutoML with deep learning and gradient boosting focus
Variant C: TPOT genetic programming for evolutionary pattern discovery

Experimental Plan

Data Requirements

Primary: BoerderijApi weekly Dutch potato prices (1,203 observations)
Secondary: NDVI satellite data, international markets, weather accumulation, storage indicators
Validation: USE ONLY REAL DATA from repository interfaces - NO SYNTHETIC DATA
Horizon: 12-week (84 days) - proven optimal for maximum improvement

Feature Engineering Strategy

Automated Feature Explosion: Generate 1000+ features programmatically
Price transformations: lags 1-52, moving averages, differences, ratios
Polynomial interactions: 2nd and 3rd order combinations
Fourier transforms: seasonal cycles at multiple frequencies
Wavelet decomposition: multi-scale temporal analysis
Technical indicators: RSI, MACD, Bollinger Bands
Cross-market features: international spreads, correlations
AutoML Framework Testing:
AutoGluon: 2-hour comprehensive ensemble search
H2O AutoML: 2-hour deep learning + GBM focus
TPOT: 4-hour genetic programming evolution
Validation Protocol:
Rolling-origin cross-validation with corrected baselines
Test against ALL 4 standard baselines: persistent, seasonal_naive, ar2, historical_mean
Statistical testing: DM + HLN, TOST vs SESOI, FDR correction
Compare against STRONGEST baseline (lowest error)

Success Criteria

Primary: Beat 53.7% baseline by 5%+ relative improvement (58.7%+ total)
Statistical: p < 0.05 vs strongest baseline after FDR correction
Practical: Improvement exceeds 8% SESOI threshold
Discovery: Identify top 20 predictive features humans wouldn't consider

Experiment Results

Results will be appended here as variants are completed

Decision Log

Decision summary will be added after all variants complete

FAMILY_AUTOML_PATTERN_DISCOVERY - Experiment Log

Experimentnotities

FAMILY_AUTOML_PATTERN_DISCOVERY - Experiment Log

Overview

Variants

Experimental Plan

Data Requirements

Feature Engineering Strategy

Success Criteria

Experiment Results

Decision Log

Geen Codex-samenvatting