Project: PathFindN

A mathematical expression game that doubles as a dataset generator. Born from AI training and reviewing work, PathFindN asks players to find valid mathematical "paths" to a target integer N, with mandatory reasoning for each submission. Every accepted path and its reasoning are stored for potential use in anonymized research datasets.

Play PathFindN →

PathValidator: The Core System

The entire game hinges on one question: is this expression safe to evaluate, and does it actually equal N? The PathValidator handles both through a multi-stage pipeline that combines Python AST analysis with SymPy computer algebra verification.

PathValidator pipeline: expression input through normalize, AST parse, whitelist check, constraints, safe_eval, SymPy CAS verify, canonical hash, and storage.

Validation Pipeline

Normalize — Unicode cleanup (π to pi, ^ to **, |x| to abs(x))
AST Parse — Python ast.parse in eval mode
Whitelist Check — Only whitelisted operators (+, -, *, /, //, **, %), functions (sqrt, log, sin, cos, floor, ceil, gcd, etc.), and constants (pi, e, tau) are allowed. Attribute access, method calls, dunders, and imports are all blocked.
Constraint Check — Optional daily mutators: forbidden digits, allowed operator subsets, minimum expression length
safe_eval — Numeric evaluation via whitelisted AST walker using cmath/math
SymPy CAS Verify — simplify(expr - N) == 0 catches symbolic identities that numeric checks miss
Canonical Hash — SHA256(sp.srepr(parse_expr(normalized))) for expression uniqueness. 1+2 and 2+1 canonicalize to the same hash.

CAS verification is the primary acceptance path. Numeric fallback (within ±0.0001) catches expressions SymPy cannot simplify. Both paths feed into the same canonical hash for deduplication.

Integer Regime System

Target numbers are drawn from difficulty tiers based on the formula IR_n = 5 × 11^n-1 (Principle of Self-Interaction):

Regime	Range	Scale
d₀	0 – 4	Axiomatic
d₁	5 – 55	Tens
d₂	56 – 605	Hundreds
d₃	606 – 6,655	Thousands
d₄	6,656 – 73,205	Ten-thousands
d₅	73,206 – 805,255	Hundred-thousands
d₆	805,256 – 8,857,805	Millions

Challenge modes: Standard (stay in your regime) and Ladder (start at d₀ and advance on each accepted path).

Stack

Python (Flask), SymPy, PostgreSQL (Supabase), Supabase Auth (JWT), Stripe for billing. FREE tier gets 3 attempts/day; SUPPORTER gets 5.

Dataset Generation Angle

Every submission is stored with its expression, canonical hash, reasoning text, regime, and challenge mode. The game is the data collection mechanism. Players generate novel mathematical paths that no scraper or synthetic generator would produce, because the reasoning requirement forces genuine mathematical thinking rather than brute-force enumeration.