Domain isolation prevents cross-contamination. The same stacking architecture class behind Netflix Prize and Airbnb's market pricing engine — applied to neighborhood-level prediction.
Seven models train independently on separate proprietary signal domains. Each generates held-out out-of-fold predictions passed as meta-features to Layer 1. Cross-domain feature injection is forbidden — it violates stacking isolation.
Stacks all domain signals into gentrification, growth, and structural decline probabilities. Two variants: lagged (full signal history, data ceiling 2024) and no-lag (near real-time signals only, ceiling 2026).
Different signal types lead neighborhood change by different margins. Domain lookback windows are calibrated to this causal cascade — each signal layer anchored at its validated lead time, with zero overlap between feature and label windows.
Home value velocity, income velocity, and proxy features excluded from classification features. Three leakage types enforced: temporal (feature overlaps label window), definitional (feature encodes label formula), and proxy (r > 0.7 with excluded outcomes).
Signal types lead neighborhood change by different margins. The ensemble is calibrated to this ordering — not flat feature averaging.
Each domain model is anchored at its validated lead time — zero overlap between feature and label windows.
The design choices that make these AUC numbers usable for capital decisions — not just benchmark artifacts.
We'll walk through the domain-level accuracy breakdown for your target metro areas.
Request a Demo