Sam Fredrickson ea8c2092a6 Mega-squash

Implement some basic backtesting.

Switch to simpler log-based projection.

Add improved vol calculation.

Implement trend smoothing.

Add Claude's notes on current model.

Run projections and backtests in parallel.

Add market maturity projection adjustments.

Improve volume handling in market maturity calculations.

New backtest suite proposed by Claude.

Fix create_plots() output.

Merge branch 'new-backtests' into market-maturity-backtests

Add era-aware market maturity adjustments.

Add updated notes from Claude.

Add justfile to simplify organizing results for comparison.

Run more projections from various start dates.

Tuning session; removed market maturity.

The market maturity score only complicated the model with no clear
benefit. Still working on getting the various backtests tuned.

Add Claude's notes from recent session.

More helpful additions to workflow.

New systematic backtest framework.

Add notes on new backtesting framework.

Use ruff linter.

Tweak the backtests.

* Start from 2011 instead of 2013.
* Validate over two years instead of one.

Improve uncertainty estimation.

Add Claude's notes from the last revision.

Add adaptive volatility window.

New, streamlined NOTES.

Fix projection plot bugs.

Update prices.csv.

Actually use long-term vol in adaptive calculation.

Use more conservative 1e-6 to prevent division by zero.

it's an error not to provide halving dates

warn when val period shorter than projection period

Update prices.csv

Manage output files in less-janky fashion.

Use market fundamentals intsead of empirical era adjustments.

Improve CI coverage.

Use S2F metrics for trend analysis.

update prices

Merge branch 'next' into mkt-fndm

Update prices.

Merge branch 'next' into mkt-fndm

Add CDPR plot.

Merge branch 'next' into mkt-fndm

Update prices.

Slight optimization to cdpr plot gen.

Update prices.

Merge branch 'next' into mkt-fndm

Add price to CDPR plot.

Update prices.

Add .private to .gitignore.

Update prices.

2024-12-19 23:58:42 -08:00

9.3 KiB

Raw Permalink Blame History

Bitcoin Price Model Documentation

Model Overview

A probabilistic price projection model combining log returns analysis, cycle awareness, and Monte Carlo simulation. The model generates projected price ranges with confidence intervals, balancing short-term market dynamics with long-term cyclical patterns.

Core Design Principles

1. Return Analysis

Uses log returns for better handling of exponential growth
Combines multiple timeframes for volatility estimation
Implements adaptive window sizing based on market conditions
Handles volatility clustering through regime-aware adjustments

2. Cycle Integration

Recognizes Bitcoin's ~4 year (1460 day) halving cycle
Maps historical returns to cycle positions (0-1 scale)
Adjusts expectations based on position in cycle
Handles transitions between cycles with uncertainty scaling

3. Market Era Recognition

Three distinct eras with specific characteristics:

Early (2013-2017): Higher base volatility, conservative trends
Transition (2017-2020): Futures market introduction period
Mature (2020+): Institutional participation, reduced base volatility

4. Uncertainty Estimation

Generates both point estimates and confidence intervals
Adapts uncertainty based on market conditions
Uses asymmetric volatility response
Implements dynamic confidence interval calibration

Architecture

Key Components

Trend Analysis (analyze_trends)
- Calculates cycle-position-specific returns
- Applies position-aware smoothing
- Handles cycle boundaries
Volatility Estimation (calculate_volatility)
- Adaptive window sizing
- Multi-timeframe integration
- Era-specific scaling
- Regime detection and response
Price Projection (project_prices)
- Monte Carlo simulation engine
- Dynamic uncertainty scaling
- Confidence interval calculation
- Trend integration
Projection Adjustment (get_projection_adjustments)
- Time-varying uncertainty scaling
- Market condition response
- Cycle position awareness
- Minimum uncertainty bounds

Model Performance & Validation

Performance Characteristics

Normal Market Conditions

MAPE: 30-40% typical
95% CI Coverage: ~95%
68% CI Coverage: ~73%
Best performance in mature market periods (2020+)
Most reliable for 3-6 month horizons

Stress Periods

MAPE: 30-60%
95% CI Coverage: ~95%
68% CI Coverage: ~76%
Wider but well-calibrated confidence intervals
Maintains reliability through increased uncertainty

Key Strengths

Consistent confidence interval coverage
Rapid adaptation to volatility changes
Robust handling of cycle transitions
Well-calibrated uncertainty estimates

Known Limitations

Higher error during market structure changes
Increased uncertainty in early cycle periods
Limited incorporation of external factors
May underestimate extreme events

Validation Framework

Backtest Configuration

Minimum training period: 8 years
Validation period: 2 years
Rolling window approach
Separate evaluation of normal/stress periods

Key Test Periods

Cycle Transitions
- Pre/post halving periods
- Historical halvings (2016, 2020, 2024)
- Cycle peak/trough transitions
Market Structure Changes
- Futures introduction (2017)
- Institution adoption (2020-2021)
- Major market events (e.g., COVID crash)
Recent History
- 2021 bull market
- 2022 drawdown
- 2024 recovery

Validation Metrics

Accuracy Measures
- MAPE (Mean Absolute Percentage Error)
- RMSE (Root Mean Square Error)
- Maximum deviation
Calibration Measures
- Confidence interval coverage
- Uncertainty estimation accuracy
- Regime transition handling
Stability Measures
- Parameter sensitivity
- Training period dependence
- Regime change response

Technical Implementation

Core Functions

Volatility Calculation

def calculate_volatility(df, short_window=30, medium_window=90, long_window=180):
    """
    Adaptive volatility calculation combining multiple timeframes.

    Features:
    - Dynamic window sizing based on market conditions
    - Era-specific scaling factors
    - Regime-aware adjustments
    - Robust error handling and fallbacks
    """

Key parameters:

short_window: Fast response (default 30 days)
medium_window: Primary estimate (default 90 days)
long_window: Stability baseline (default 180 days)

Adaptive features:

Windows shrink in high volatility periods
Expand during low volatility
Minimum size constraints for stability
Weighted combination based on regime

Cycle Position

def get_cycle_position(date, halving_dates):
    """
    Calculate position in halving cycle (0 to 1).
    0 = halving event
    1 = just before next halving
    """

Position calculation:

Linear interpolation between halvings
Special handling for pre-first-halving
Extension mechanism for future cycles
Built-in boundary condition handling

Price Projection

def project_prices(df, days_forward=365, simulations=1000,
                  confidence_levels=[0.95, 0.68]):
    """
    Generate price projections with confidence intervals.

    Core simulation parameters:
    - Number of paths: 1000
    - Confidence levels: 95% and 68%
    - Dynamic uncertainty scaling
    """

Data Requirements

Input Data

Minimum fields:

Date
Close price
Trading volume (optional)
High/Low (optional)

Format requirements:

Daily data preferred
Sorted chronologically
No missing dates
Prices > 0

Training Data

Minimum requirements:

2 years for basic operation
8 years recommended
Must include at least one cycle transition
Should span multiple market regimes

Error Handling

Data Validation

Missing value detection and interpolation
Outlier identification
Zero/negative price handling
Volume anomaly detection

Runtime Guards

Minimum data length checks
Window size validation
Numerical stability checks
Regime transition handling

Fallback Mechanisms

Simple volatility calculation
Default uncertainty estimates
Conservative parameter sets
Standard cycle assumption

Memory and Performance

Optimization Features

Efficient numpy operations
Vectorized calculations where possible
Smart data windowing
Caching of intermediate results

Resource Usage

Typical requirements for 10-year dataset:

Memory: ~100MB
CPU: ~2-5 seconds per projection
Storage: Negligible

Parallelization

Multiprocessing support for backtests
Independent path simulation
Multiple period analysis
Backtest parallelization

Development History & Evolution

Major Versions

Version 1.0 (Initial Implementation)

Basic log return analysis
Fixed volatility windows
Simple cycle position calculation
Base Monte Carlo simulation

Version 2.0 (Market Structure)

Added era-based adjustments
Improved cycle handling
Multiple timeframe volatility
Enhanced Monte Carlo engine

Version 3.0 (Current)

Adaptive volatility windows
Dynamic uncertainty scaling
Improved regime detection
Enhanced confidence interval calibration

Key Improvements

Volatility Estimation

Fixed → Adaptive Windows
- Initial: Fixed 30/90/180 day windows
- Current: Dynamic sizing based on regime
- Result: Better regime transition handling
Uncertainty Calibration
- Initial: Fixed scaling factors
- Current: Market-aware dynamic scaling
- Result: More reliable confidence intervals
Era Recognition
- Initial: Single model for all periods
- Current: Era-specific adjustments
- Result: Better handling of market evolution

Simulation Engine

Path Generation
- Initial: Basic random walks
- Current: Regime-aware path simulation
- Result: More realistic price trajectories
Confidence Intervals
- Initial: Fixed width
- Current: Dynamic, asymmetric intervals
- Result: Better calibrated uncertainty

Failed Experiments

1. Complex Regime Detection

Attempted multiple indicator fusion
Added excessive complexity
Reduced model stability
Reverted to simpler approach

2. Machine Learning Integration

Tested neural network components
Reduced interpretability
Inconsistent improvements
Kept traditional statistical approach

3. External Factor Integration

Tried incorporating macro indicators
Added noise to projections
Complicated parameter estimation
Maintained focus on price dynamics

Recent Improvements (2024)

Adaptive Volatility Windows

Implementation: Dynamic window sizing
Purpose: Better regime handling
Results:
- Improved 95% CI coverage to ~95%
- Better stress period handling
- More reliable uncertainty estimates

Performance Metrics

Normal Periods:

MAPE: 39.9%
RMSE: $12,007
95% CI Coverage: 95.9%
68% CI Coverage: 72.5%

Stress Periods:

MAPE: 32.8%
RMSE: $12,794
95% CI Coverage: 95.2%
68% CI Coverage: 76.1%

Future Directions

Short Term

Fine-tune adaptive parameters
Improve transition period handling
Enhanced backtest framework
Additional regime indicators

Medium Term

Cycle strength indicators
Volume analysis integration
Improved documentation
Performance optimization

Long Term

Real-time adaptation framework
Advanced regime detection
Market microstructure integration
External API integration

9.3 KiB Raw Permalink Blame History