Hash Tables & EAI
How the biggest breakthrough in computer science can help us build a better AI
[Note: All content *not* in block quotes is generated by the GSNV-GPT engine. Comments and questions are for paid subscribers only and should be directed to the engine.]
This morning, while watching the first segment of the following video, I realized that this breakthrough work on hash tables offered a deeper design principle for AI — the move from optimization of the present to evaluation of future states.
Here are the cited paper and the video, followed by the article generated by GSNV-GPT.
[TL;DR: A 2025 breakthrough in hashing just handed us a perfect toy-model for Evaluative AI—and for markets.
Old open-addressed hashing is greedy: it optimizes the present by taking the closest available slot. That works until density rises, slack evaporates, and then search costs spike—because the algorithm has been quietly deforming the future cost surface the entire time.
The new result (Farach-Colton, Krapivin, Kuszmaul) shows you can do something radically different without reordering history: spend extra compute now to evaluate where “good” placements still exist, refuse bad local minima, and “snap” into a region that preserves shallow future access. Net: amortized costs stabilize (flat), tails are controlled, and the system stops manufacturing volatility in its own internal geometry.
Design law: Optimization consumes slack; evaluation protects reachability. That is the core move an Evaluative AI must learn—and it’s also the difference between liquidity consumption that creates market spikes vs. execution that preserves future optionality.]
Elastic Hashing, or: Why Myopic Optimization Manufactures Spikes
A small 2025 breakthrough in data structures makes a big point for Evaluative AI—and for any domain (financial markets included) where history is sticky and tail risk is real.
In “Optimal Bounds for Open Addressing Without Reordering” (on arXiv), Martín Farach-Colton, Andrew Krapivin, and William Kuszmaul revisit open-addressed hashing under a constraint that is philosophically—and institutionally—nontrivial: no reordering after insertion. You don’t get to rewrite the past.
What they show is a clean separation you can carry far beyond hashing:
Algorithms that optimize the present (greedy “closest”) can force future costs into spikes.
Algorithms that evaluate future position (non-greedy “best”) can stabilize cost.
The “best” here has a geometric valence: it is the placement that preserves a shallow access geometry, even if it requires expensive present-time evaluation.
1) The old regime: optimize-now under a fixed local metric
Classic open addressing with uniform probing has a simple ethic: take the first free slot in the probe sequence. Under load $1-\delta$, this yields amortized expected probe complexity on the order of $\Theta(\log \delta^{-1})$ in the traditional analysis and folklore. That folklore hardened into “optimality” claims for greedy schemes, canonized in Andrew C. Yao’s 1985 “Uniform hashing is optimal” in the Journal of the ACM.
But notice what “optimal” meant in that era’s conceptual posture:
You optimize the present insertion step according to a local notion of “closest” (first available in the sequence).
You accept that near saturation the world turns adversarial: collision structure accumulates, and the tail thickens.
This is the general myopia pattern: present optimality is purchased by silently mortgaging the future metric.
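The myopia pattern is easy to reproduce. Below is a toy simulation (mine, not the paper's model): greedy open addressing with a per-key pseudo-random probe sequence, filling a table to load $1-\delta$ with $\delta = 0.05$ and tracking insertion cost as slack evaporates:

```python
import random

def insert_greedy(table, key):
    """Greedy open addressing: walk the key's probe sequence and take
    the first free slot ("closest"), however deep it happens to be."""
    rng = random.Random(key)  # stand-in for a uniform probe sequence
    probes = 0
    while True:
        probes += 1
        slot = rng.randrange(len(table))
        if table[slot] is None:
            table[slot] = key
            return probes

n = 4096
table = [None] * n
# Fill to 95% load, recording how many probes each insertion cost.
costs = [insert_greedy(table, k) for k in range(int(0.95 * n))]

early = sum(costs[: n // 2]) / (n // 2)  # avg probes while slack is plentiful
late = sum(costs[-200:]) / 200           # avg probes as slack evaporates
print(f"early avg probes: {early:.2f}, late avg probes: {late:.2f}")
```

Under the standard analysis, the expected cost of a single insertion at load $\alpha$ is about $1/(1-\alpha)$, so the late average should sit roughly an order of magnitude above the early one.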
2) The new regime: evaluation as a different causal stance
The paper’s “elastic hashing” is non-greedy. Its central move is not a clever constant-factor trick—it is a change in stance:
Decouple “deliberation cost” from “stored future cost”
Elastic hashing allows insertion to do many probes (evaluation), but it fights to keep future search probe complexity low. It explicitly leverages the fact that insertion probe effort and lookup probe depth need not be the same variable.
What “best” means (as opposed to “closest”)
Elastic hashing partitions the array into geometrically shrinking regions and engineers a multi-scale geometry in which “good placements” are those that land shallow in this induced metric. The insertion procedure probes far enough to estimate whether a good placement is reachable, and if not, it “snaps” into a region designed to remain slack—protecting future access geometry rather than consuming it.
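The real construction is delicate; purely to make the stance concrete, here is a loose toy sketch (my simplification, with invented region sizes, probe budgets, and a slack floor; not the authors' algorithm) of bounded evaluation plus snap-to-slack:

```python
import random

def insert_elastic_sketch(levels, key, base_budget=8, slack_floor=0.1):
    """Toy non-greedy insertion (NOT the paper's algorithm): probe a bounded
    number of slots per region, refuse any non-final region whose slack has
    eroded below the floor, and 'snap' down a level when needed."""
    probes = 0
    last = len(levels) - 1
    for i, region in enumerate(levels):
        free = sum(1 for x in region if x is None)
        if i < last and free / len(region) < slack_floor:
            continue  # refuse: placing here would steepen future lookups
        rng = random.Random((key, i))
        for _ in range(base_budget * (i + 1)):  # deeper regions get more effort
            probes += 1
            slot = rng.randrange(len(region))
            if region[slot] is None:
                region[slot] = key
                return probes
    return None  # out of budget everywhere: effectively saturated

# Geometrically shrinking regions (n = 4096 total slots).
levels = [[None] * 2048, [None] * 1024, [None] * 512, [None] * 512]
costs = [insert_elastic_sketch(levels, k) for k in range(2000)]
successes = sum(1 for c in costs if c is not None)
```

The point of the sketch is the refusal step: insertion spends bounded evaluation and declines placements in a region whose slack has eroded, protecting the access geometry instead of consuming it.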
The headline result (the stabilizing claim)
They achieve, without reordering:
Amortized expected probe complexity $O(1)$ (flat average cost)
Worst-case expected probe complexity $O(\log \delta^{-1})$ (controlled tail)
with matching lower bounds indicating the worst-case rate is essentially tight.
Read that slowly: in a history-bound system, the “evaluate” stance makes the distribution of costs more stable. This is not merely “faster on average.” It is a direct attack on spikiness.
(They also give “funnel hashing,” disproving a long-standing conjecture about greedy worst-case expected probe complexity—interesting historically, but the deeper meta-lesson is the evaluation pivot.)
3) The general EAI design principle
Here’s the principle, stated as densely as possible, in “EAI-native” terms:
EAI Principle: Spend present attention to preserve future reachability.
When the system is dense and path-dependent, the agent’s job is not to minimize $C_{\text{now}}$ under a fixed local metric, but to maintain a stable cost geometry over the future action manifold—especially the tails.
Formalization (cost geometry, not scalar reward)
Let $s_t$ be state, $a_t$ action, and let the environment+agent co-produce a future cost geometry $G_{t\to T}$ (a distribution, not a number), e.g.:
expected effort for retrieval / intervention,
variance and tail thickness of that effort,
probability of cliff events (phase-boundary crossings),
loss of reachability (unreachable basins in possibility space).
Then evaluative control is:
$$a_t = \arg\min_a \left( C_{\text{now}}(s_t, a) + \lambda\,\Phi(G_{t\to T}) \right),$$
where $\Phi$ penalizes spikiness: heavy tails, discontinuities, basin loss, fragility.
Elastic hashing is a minimal constructive witness that $\Phi$-first design can dominate greedy present-optimization when you cannot reorder history.
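The objective fits in a few lines. Here $\Phi$ is one simple choice (mean plus an average of the worst tail; the text leaves $\Phi$'s exact form open), and all the toy numbers are invented:

```python
def phi(future_costs, tail_q=0.95):
    """Spikiness penalty Phi: mean of the future cost distribution plus the
    average of its worst (1 - tail_q) fraction (one simple choice)."""
    xs = sorted(future_costs)
    cut = int(tail_q * len(xs))
    tail = sum(xs[cut:]) / len(xs[cut:])
    return sum(xs) / len(xs) + tail

def evaluative_choice(actions, lam=1.0):
    """a_t = argmin_a ( C_now(s_t, a) + lam * Phi(G_{t->T}) )"""
    return min(actions, key=lambda a: a["c_now"] + lam * phi(a["future"]))

# Invented toy numbers: 'greedy' is cheap now but manufactures a heavy
# tail later; 'evaluative' pays more now for a flat future cost surface.
actions = [
    {"name": "greedy",     "c_now": 1.0, "future": [1.0] * 95 + [50.0] * 5},
    {"name": "evaluative", "c_now": 3.0, "future": [2.0] * 100},
]
greedy_pick = min(actions, key=lambda a: a["c_now"])  # present-only metric
eval_pick = evaluative_choice(actions)
```

A present-only metric picks the cheap action; the $\Phi$-regularized objective refuses it because of the tail it would create.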
Operational corollary: the Slack Reservoir invariant
Dense systems fail by buffer erosion:
hash tables: slack $\delta n$ shrinks → collisions amplify → depths jump
agentic systems: uncertainty/capacity slack shrinks → brittleness jumps
institutions/markets: liquidity/capital slack shrinks → impact/volatility jumps
Elastic hashing stabilizes by maintaining structured slack (reservoir regions kept intentionally less full so the system can “snap” into low-cost basins). That is evaluation as architecture.
So the invariant is:
Treat slack as first-class. Not as leftover.
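A minimal sketch of the invariant (names and numbers are invented): slack is tracked as explicit state with a protected floor, and the controller refuses locally cheap actions that would breach it:

```python
def choose(actions, slack, slack_floor=2.0):
    """Pick the cheapest-now action whose slack draw still leaves the
    reservoir above its protected floor; slack is first-class state."""
    ok = [a for a in actions if slack - a["slack_use"] >= slack_floor]
    return min(ok, key=lambda a: a["c_now"]) if ok else None  # None: hold

actions = [
    {"name": "aggressive", "c_now": 1.0, "slack_use": 4.0},  # cheap, drains slack
    {"name": "patient",    "c_now": 3.0, "slack_use": 0.5},  # pricier, preserves it
]
# Floor 2.0: the aggressive action would leave slack at 1.0, so it is refused.
pick = choose(actions, slack=5.0)
```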
4) EAI failure modes (named, not implied)
If you want this to be usable as doctrine, name the predictable pathologies:
Myopia / local greed: “closest” masquerades as “best.”
Buffer erosion: slack consumed until the system crosses a phase boundary.
Tail manufacturing: rare events become catastrophically expensive because the cost geometry steepened silently.
History lock-in: early cheap commitments constrain future reachability (no reordering).
Procyclicality: success reinforces the very behaviors that later amplify spikes.
Elastic hashing is valuable because it isolates these failure modes in a clean computational setting: no psychology, no narrative—just mechanics.
5) Why this matters for financial markets (not as metaphor, as structure)
Markets are “no reordering” systems: trades execute; inventories, funding constraints, and order-book states persist; path dependence is real. The relevant risk is not average cost—it’s cost spikes under stress.
Two canonical finance references already formalize the core pattern:
Liquidity spirals: market liquidity and funding liquidity can be mutually reinforcing; under some conditions liquidity can “suddenly dry up,” producing discontinuities and feedback-driven spikes—exactly the tail story in different clothing. Markus K. Brunnermeier and Lasse Heje Pedersen model these dynamics explicitly.
Optimal execution: you don’t optimize the “present fill”; you trade off expected cost against risk across time, constructing an efficient frontier over liquidation schedules—explicitly acknowledging that myopic aggression can increase future impact and volatility exposure. Robert Almgren and Neil Chriss are the canonical statement of that posture.
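The execution half of that pattern can be caricatured in a few lines. This is a stylized toy in the Almgren-Chriss spirit with invented parameters, not their actual model:

```python
def execution_tradeoff(total, n_slices, eta=0.1, sigma=1.0):
    """Stylized cost/risk tradeoff: temporary impact is quadratic per
    slice, while slower schedules carry more inventory risk."""
    v = total / n_slices                   # shares per slice
    impact_cost = eta * v ** 2 * n_slices  # sum_k eta * v_k^2
    # inventory risk: sigma^2 times the sum of squared remaining positions
    risk = sigma ** 2 * sum(((n_slices - k) * v) ** 2
                            for k in range(1, n_slices))
    return impact_cost, risk

fast = execution_tradeoff(100, 1)   # liquidate at once: max impact, no risk
slow = execution_tradeoff(100, 10)  # slice the order: less impact, more risk
print(fast, slow)
```

Immediate liquidation maximizes impact and eliminates inventory risk; slicing does the reverse. The efficient frontier lives between those endpoints.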
So the transfer is structural:
Greedy “closest” ↔ myopic liquidity consumption / immediate execution
Saturation ↔ thin books / funding constraints / margin pressure
Probe-depth spikes ↔ impact/volatility spikes
Elastic evaluation ↔ bounded evaluation to preserve future reachability (routing, slicing, buffering, switching regimes)
The point is not that markets are hash tables. The point is that tail stability requires evaluative stance whenever the cost surface is endogenously shaped by your own past actions.
6) A contact test (to keep this from becoming rhetoric)
If you want this to land as more than a pretty analogy, here are concrete “EAI contact tests” implied by the principle:
Tail metric instrumentation: do you track not just mean cost, but variance, kurtosis, and regime-switch indicators of your action-cost distribution?
Slack accounting: do you represent “capacity / liquidity / uncertainty margin” explicitly as a state variable with protected thresholds?
Regime switching policy: do you have a rule that refuses locally available actions when they steepen future gradients (i.e., threshold-based “snap” behavior)?
Path-dependence audit: can you quantify irreversibility (no-reordering constraints) and show how policy protects reachability under those constraints?
Elastic hashing passes these tests in miniature: it instruments and enforces a cost-geometry norm by design, and its theorems show the stabilization payoff.
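The first test is cheap to make concrete. A minimal sketch of tail-metric instrumentation (the metric choices are mine):

```python
def tail_metrics(costs):
    """Instrument the action-cost distribution: mean, variance, and excess
    kurtosis as a crude spike/regime indicator."""
    n = len(costs)
    mean = sum(costs) / n
    var = sum((c - mean) ** 2 for c in costs) / n
    if var == 0.0:
        return {"mean": mean, "var": 0.0, "kurtosis": 0.0}
    m4 = sum((c - mean) ** 4 for c in costs) / n
    return {"mean": mean, "var": var, "kurtosis": m4 / var ** 2 - 3.0}

flat = tail_metrics([1.0] * 100)            # stable cost geometry
spiky = tail_metrics([1.0] * 99 + [100.0])  # one manufactured tail event
print(flat["kurtosis"], spiky["kurtosis"])
```

A mean-only monitor barely distinguishes the two series; the kurtosis term flags the manufactured tail immediately.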

