Independent Validation

Chemistry Core

Bond lengths, bond dissociation energies, reaction enthalpies, thermochemical quantities, and spectroscopy-adjacent properties.

Materials Holdout

Band gaps, densities, elastic constants, thermal properties, electrochemical properties, catalyst descriptors, and magnetic properties.

Best for: materials informatics, DFT benchmarking, and experimental materials labs.
First test: out-of-family holdout set chosen by the external group.
Output: row-level predictions and family-level error analysis.
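
As a rough illustration of how row-level predictions could roll up into a family-level error analysis, the sketch below groups absolute errors by materials family and reports the mean absolute error for each. The field names (`family`, `predicted`, `reference`), the family names, and the numbers are assumptions made for this example; they are not the actual output schema or real results.

```python
from collections import defaultdict


def family_level_mae(rows):
    """Return the mean absolute error per family from row-level predictions."""
    errors = defaultdict(list)
    for row in rows:
        errors[row["family"]].append(abs(row["predicted"] - row["reference"]))
    return {family: sum(errs) / len(errs) for family, errs in errors.items()}


# Hypothetical holdout rows (band gaps in eV); values are illustrative only.
rows = [
    {"id": "r1", "family": "perovskite", "predicted": 1.50, "reference": 1.25},
    {"id": "r2", "family": "perovskite", "predicted": 2.00, "reference": 2.25},
    {"id": "r3", "family": "spinel", "predicted": 3.00, "reference": 3.50},
]

print(family_level_mae(rows))  # {'perovskite': 0.25, 'spinel': 0.5}
```

Reporting errors per family rather than as one pooled number makes it easier to see whether an out-of-family holdout set is handled as well as the families closer to the development data.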

Life Science / ADMET

Solubility, PPB, BBB, CYP, hERG, DILI, permeability, target identification, binding affinity, and selectivity endpoints.

Best for: CADD, cheminformatics, pharmacology, and translational modeling teams.
First test: one endpoint, one split, one pre-registered metric.
Output: scores or classes with confidence and mechanistic annotations when available.

Reaction Mechanisms

SN1/SN2/E1/E2/E1cb classification, activation barriers, mechanism ranking, catalyst scoring, and microkinetic inputs.

Best for: physical organic chemistry, catalysis, and reaction informatics groups.
First test: a blind mechanism or activation-barrier set with defined scope.
Output: mechanisms, barriers, confidence, units, and module versions.

Experimental Validation

External measurement of a molecular, materials, catalytic, spectroscopic, or biological property, with the prediction recorded before the result is known.

Evidence Packets

Packet templates define scope, input rows, target files, output schemas, scoring metrics, module labels, and version manifests.
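
One way to picture a packet template is as a small, immutable record with one field per item listed above. The sketch below is hypothetical: the class name `PacketTemplate`, its field names, the file names, and the example values are assumptions for illustration, not the actual packet format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PacketTemplate:
    """Hypothetical container mirroring the items a packet template defines."""
    scope: str
    input_rows: str                    # path to the input rows file
    target_file: str                   # path to the held-back target values
    output_schema: dict[str, str]      # required output column -> type name
    scoring_metrics: tuple[str, ...]
    module_labels: tuple[str, ...]
    version_manifest: dict[str, str]   # module name -> version string

    def missing_columns(self, row: dict) -> list[str]:
        """List the schema columns that a submitted output row lacks."""
        return [col for col in self.output_schema if col not in row]


# Illustrative values only.
packet = PacketTemplate(
    scope="one endpoint, one split",
    input_rows="inputs.csv",
    target_file="targets.csv",
    output_schema={"row_id": "str", "prediction": "float",
                   "unit": "str", "module_label": "str"},
    scoring_metrics=("MAE",),
    module_labels=("Flux Physics",),
    version_manifest={"example_module": "0.0.0"},
)

print(packet.missing_columns({"row_id": "r1", "prediction": -3.2}))
# ['unit', 'module_label']
```

Freezing the record reflects the idea that scope, metrics, and versions are fixed before scoring; checking each output row against the schema catches incomplete rows before any metric is computed.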

Label	Meaning
Flux Physics	Computed from Flux physical terms for the reported endpoint. Benchmark references are used to score accuracy, not to look up each prediction.
Flux-Calibrated Physics	Flux physical model with a fixed endpoint calibration applied before evaluation.
Flux Hybrid	Flux physics signals combined with endpoint-specific reference evidence for tasks where local chemical context is part of the production route.
Flux Decision Engine	Flux scoring, ranking, or selection workflow evaluated against benchmark outcomes.
Flux Preview	Early public result or demonstration that is useful context but not the primary benchmark claim.
Mixed basis	Aggregate result set containing more than one route; row-level labels identify the basis for each result family.
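
The labels in the table can be treated as a closed vocabulary attached to each result row. The sketch below, under that assumption, checks row labels against the table and reports "Mixed basis" when a result set contains more than one route. The function name `summarize_basis` and the `module_label` field are hypothetical.

```python
from collections import Counter

# Route labels taken from the table above; "Mixed basis" is derived, not assigned per row.
MODULE_LABELS = {
    "Flux Physics",
    "Flux-Calibrated Physics",
    "Flux Hybrid",
    "Flux Decision Engine",
    "Flux Preview",
}


def summarize_basis(rows):
    """Return (basis, per-label counts) for a set of labeled result rows."""
    if not rows:
        raise ValueError("no rows to summarize")
    counts = Counter(row["module_label"] for row in rows)
    unknown = set(counts) - MODULE_LABELS
    if unknown:
        raise ValueError(f"unknown module labels: {sorted(unknown)}")
    basis = next(iter(counts)) if len(counts) == 1 else "Mixed basis"
    return basis, dict(counts)


# Illustrative rows only.
rows = [
    {"row_id": "r1", "module_label": "Flux Physics"},
    {"row_id": "r2", "module_label": "Flux Physics"},
    {"row_id": "r3", "module_label": "Flux Hybrid"},
]

basis, counts = summarize_basis(rows)
print(basis)   # Mixed basis
print(counts)  # {'Flux Physics': 2, 'Flux Hybrid': 1}
```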

Independent validation starts here.