← Benchmarks | Electronic Structure

d-Band Center Descriptor BENCHMARK

The d-band center is the single most important descriptor in transition-metal catalysis. FluxMateria predicts it from atomic quantum numbers and surface geometry — across pure metals, facets, and binary alloys — with no DFT inputs.

100 cases benchmarked
0 fitted parameters
0 DFT inputs
Multi-source literature validated
0.197 eV
Combined MAE
across 100 multi-source cases
0.154 eV
Facet-surface MAE
41 surfaces across 7 facet geometries
90%
Within 0.5 eV
inside DFT method-to-method spread
0
Fitted parameters
no ML training route

Headline Results

Combined and per-category accuracy across the 100-case benchmark.

Set n MAE (eV) RMSE (eV) ≤0.2 eV ≤0.3 eV ≤0.5 eV
Combined benchmark 100 0.197 0.278 71% 81% 90%
Pure transition metals 27 0.223 0.299 63% 74% 89%
Facet-specific surfaces 41 0.154 0.206 83% 90% 93%
Binary alloys 32 0.230 0.334 62% 75% 88%

Error Breakdown

Per-period and per-facet residuals let users see where the predictor is strongest and where the known physics limits kick in.

By period

3d (period 4)n = 320.141 eV
4d (period 5)n = 340.181 eV
5d (period 6)n = 340.266 eV

3d is cleanest. 5d carries larger residuals because spin–orbit and relativistic physics become material.

By facet

(0001)n = 20.076 eV
(211)n = 60.115 eV
stepn = 10.113 eV
(110)n = 120.135 eV
(100)n = 120.186 eV
(111)n = 80.187 eV

Low-coordination and stepped surfaces are predicted tightly — exactly where real catalysis lives.

Comparison to ML Baselines

Three standard regressors were fitted on the same atomic descriptors with 5-fold cross-validation. The FluxMateria route is evaluated directly on the same held-out descriptor rows.

Model CV MAE (eV) CV RMSE (eV) Notes
Linear regression 0.432 0.556 OLS with bias term on the standardized feature set.
k-Nearest Neighbors (k = 3) 0.388 0.657 Inverse-distance weighted, standardized features.
Random Forest (20 × depth 4) 0.295 0.470 Bootstrap aggregation with square-root feature bagging.
FluxMateria physics engine 0.223 0.299 No ML training route; work-function metadata caveat disclosed.

Feature set for all four models: atomic number, period, group, d-electron count, filling fraction, metallic radius, work function. Cross-validation on the 27-element pure-TM set with identical splits. The physics-derived predictor wins against every ML baseline trained on the same descriptors.

State-of-the-Art Context

Where FluxMateria lands compared to established methods for the same quantity.

Method Type MAE (eV) Source
FluxMateria — pure TMs First-principles, no fitting 0.223 this work
FluxMateria — facets First-principles, no fitting 0.154 this work
FluxMateria — combined First-principles, no fitting 0.197 this work
PBE-DFT (typical single method) DFT 0.15 – 0.30 method-to-method spread
SISSO surrogate ML on DFT features 0.20 – 0.30 Ghiringhelli et al., 2017
Neural-network surrogate ML on DFT features 0.15 – 0.25 Batchelor et al., 2019
CGCNN (graph neural network) GNN on DFT + structure 0.15 – 0.20 Xie & Grossman, 2018

FluxMateria matches DFT-trained ML surrogates on pure-TM and facet accuracy — without requiring a DFT training set and without any fitted coefficients.

Methodology

The benchmark measures the operational FluxMateria predictor — the same path exposed to the universal material engine — against independent literature data.

1. First-principles prediction

The d-band center is computed directly from atomic quantum numbers and surface coordination, with closed-shell and relativistic corrections for heavier series.

  • Input: element symbol + optional facet
  • No DFT calculation required
  • No training set or fitted parameters
  • Same function used in the production engine

2. Three-layer benchmark

Every reported number is measured on published literature values — nothing is fit or held out of the predictor.

  • 27 pure transition metals (3d / 4d / 5d plus group 3)
  • 41 facet-specific surfaces: (111), (100), (110), (211), (0001), steps
  • 32 binary alloys across Pt/Pd/Au/Ag/Ni/Co/Rh families

Important scope note: this benchmark establishes FluxMateria as a physics-based predictor for the central descriptor in transition-metal catalysis. It does not claim to replace dedicated surface-science DFT for mechanism-level questions, reactor kinetics, or high-throughput screening at full scale — its role is to give a fast, physically grounded starting point that does not depend on trained ML surrogates.

Literature Sources

Every predicted value was compared against one or more of these independent studies.

HN14

Hammer & Nørskov, Chem. Rev. 114, 4259 (2014)

K04

Kitchin, Nørskov, Barteau, Chen, J. Chem. Phys. 120, 10240 (2004)

GN09

Greeley, Stephens, Bondarenko et al., Nat. Chem. 1, 552 (2009)

N95

Hammer & Nørskov, Surf. Sci. 343, 211 (1995)

CM20

Chen & Mavrikakis, Chem. Rev. 121, 1007 (2021)

Method-to-method spread

The five sources disagree by 0.05–0.35 eV per element across DFT methods, slab thicknesses, and facet choices. FluxMateria lands inside that window on the majority of cases.

Known Limitations

Honest bounds on where the predictor is weaker — published up front.

Group 3 f-block boundary

Y, La, Ce show ~0.5 eV residuals because 4f/5d orbital mixing is not yet modelled. Not recommended for rare-earth catalysis.

Pt-3d skin alloys

Pt₃Ni, Pt₃Co, Pt₃Fe under-predict the ligand-induced deepening by 0.6–0.9 eV — a multi-body electronic coupling effect on the roadmap.

Period 6 (5d)

Larger residuals than 3d/4d. Spin–orbit splitting for open-shell 5d (W, Re) is the next physics upgrade.

Single-source elements

Sc, Cr, Mn, Hf, Ta, Nb, Zr, Tc, La, Ce have only one literature entry for cross-check. More sources will sharpen the validation.

Artifacts

d-Band benchmark summary (JSON)
Machine-readable snapshot with all 100 cases, per-category metrics, ML baselines, SOTA comparison, and literature sources.
Download JSON
d-Band benchmark report (Markdown)
Human-readable report with the same numbers, formatted for easy inclusion in publications or decks.
Download MD
Catalyst scoring benchmark
The d-band descriptor feeds into the full catalyst ranking and inverse-discovery stack, benchmarked separately.
Open benchmark

From descriptor to catalyst discovery

This benchmark establishes the d-band center as a production-grade physics descriptor. It feeds the catalyst scoring layer and downstream discovery workflows inside the FluxMateria platform.

Catalyst scoring benchmark Catalyst case study Back to Materials module

Benchmark basis

d-band centers are computed from Flux surface and electronic descriptors. Benchmark references are used to measure descriptor accuracy.

Flux Physics