The results
| Property |
FluxMateria vs experiment |
DFT (PBE) vs experiment |
Verdict |
| Lattice constant a |
MAPE 0.2% · median 0.1% |
Reference (lattice fixed at experiment) |
Strong composition-only lattice prediction |
| Band gap |
MAPE 7.6% · median 1.2% |
MAPE 45.1% · median 50.7% |
Engine beats PBE |
| Magnetic moment |
MAPE 3.6% (Fe, Ni) |
MAPE 9.0% (Fe, Ni) |
Engine beats PBE on Fe/Ni |
| Speed per material |
~3 ms (per call) |
seconds–minutes (PBE) · hours (hybrid / GW) |
~25,000× measured |
What the numbers say
Lattice constant. 14 of 15 materials match experiment to within 1%; only TiO2-rutile sits just outside that band at 1.1%. The structural channel is tight: median error 0.1%, MAPE 0.2%. Earlier passes had remaining error concentrated in wurtzite in-plane lattice (GaN, ZnO ~6%) and layered systems (graphite, h-BN ~8%, MoS2-2H +32%); the latest structural-geometry refinements bring all of those under 1%.
Band gap. Engine median error 1.2%; DFT-PBE median 50.7%. The engine matches Si, Ge, GaAs, GaN, ZnO, MoS2, NaCl to within 0–7% of experiment. PBE's well-known wide-gap underestimate shows up at MgO (3.13 eV vs 7.83 experimental, −60%), h-BN (3.84 vs 5.96, −36%), and ZnO (0.93 vs 3.37, −72%) — the engine doesn't inherit that systematic. The remaining engine outliers are MgO (−26.6%, wide-gap ionic class under audit) and h-BN (−33.3%).
Magnetic moment. Fe: experiment 2.22 μB, engine 2.26 (+1.9%), DFT 2.20 (−1.0%). Ni: experiment 0.62 μB, engine 0.65 (+5.4%), DFT 0.72 (+16.8%). Both engine moments come from a single composition-only call. This is a direct DFT-vs-engine comparison on a known-hard property — spin-polarised PBE on FM transition metals is not a cheap calculation, and the engine matches Fe's moment more tightly than DFT does, while DFT in turn matches Ni a bit looser than the engine.
Speed. The DFT side of the Tier 2 EOS sweep took about 22 minutes on a modern laptop CPU. The engine completed the same 15-material panel in 1.4 seconds total wall time, with typical per-material calls around 3 ms. The benchmark page reports the per-material speedup at ~25,000× (or ~950× if you charge the engine's one-time 600 ms first-call import to the 15-material panel).