Numbers you can check. Methods you can challenge.
FluxMateria benchmarks include accuracy metrics, comparison methodology, and validation datasets.
FluxMateria benchmarks include accuracy metrics, comparison methodology, and validation datasets.
FluxMateria holds #1 accuracy on 3 ADMET endpoints, achieves 100% reaction mechanism classification, and delivers sub-1% error across 16 materials properties — with zero fitted parameters, no training data, and no GPU. Every number below is published with full methodology. Reproduce any result yourself.
How we test, what datasets we use, how we measure. No hidden assumptions.
Against established tools where possible. Fair comparisons, same test sets.
Where predictions are most and least reliable. We tell you the boundaries.
Enough detail that you could replicate. Trust but verify.
Summary of performance across core capabilities.
Single + multiple bonds across p, d, and s-block. Zero fitted parameters.
Singles, doubles, and triples. 870/906 within 1.0%. Zero fitted parameters.
Single-threaded, no GPU. Scales linearly with cores for batch jobs.
PPB, BBB, solubility, metabolism, permeability, hERG, DILI, CYP. 3 are #1 SOTA.
Band gap 0.7 eV MAE (1,048 materials). Core holdout 1.2%. Gemstone color 19/19.
Detailed performance data for each capability.
178K compounds validated via LOO across 8 hybrid endpoints. 3 are #1 SOTA.
This benchmark validates the battery-native decision layer as a screening and prototype-handoff engine, not as a replacement for electrochemical lab validation.
Where FluxMateria predictions are most and least reliable.
We document the boundaries of reliable prediction space:
Confidence indicators in predictions reflect these boundaries. Low confidence = verify experimentally.
We want you to verify our claims.
Benchmark datasets and evaluation scripts are available to pilot participants.
Request Pilot Access