← Benchmarks | Electronic Structure

d-Band Center Descriptor BENCHMARK

The d-band center is the single most important descriptor in transition-metal catalysis. FluxMateria predicts it from atomic quantum numbers and surface geometry — across pure metals, facets, and binary alloys — with no DFT inputs.

100 cases benchmarked

0 fitted parameters

0 DFT inputs

Multi-source literature validated

Download JSON Download summary (MD)

0.197 eV

Combined MAE

across 100 multi-source cases

0.154 eV

Facet-surface MAE

41 surfaces across 7 facet geometries

90%

Within 0.5 eV

inside DFT method-to-method spread

Fitted parameters

no ML training route

Headline Results

Combined and per-category accuracy across the 100-case benchmark.

Set	n	MAE (eV)	RMSE (eV)	≤0.2 eV	≤0.3 eV	≤0.5 eV
Combined benchmark	100	0.197	0.278	71%	81%	90%
Pure transition metals	27	0.223	0.299	63%	74%	89%
Facet-specific surfaces	41	0.154	0.206	83%	90%	93%
Binary alloys	32	0.230	0.334	62%	75%	88%

Error Breakdown

Per-period and per-facet residuals let users see where the predictor is strongest and where the known physics limits kick in.

By period

3d (period 4)	n = 32	0.141 eV
4d (period 5)	n = 34	0.181 eV
5d (period 6)	n = 34	0.266 eV

3d is cleanest. 5d carries larger residuals because spin–orbit and relativistic physics become material.

By facet

(0001)	n = 2	0.076 eV
(211)	n = 6	0.115 eV
step	n = 1	0.113 eV
(110)	n = 12	0.135 eV
(100)	n = 12	0.186 eV
(111)	n = 8	0.187 eV

Low-coordination and stepped surfaces are predicted tightly — exactly where real catalysis lives.

Comparison to ML Baselines

Three standard regressors were fitted on the same atomic descriptors with 5-fold cross-validation. The FluxMateria route is evaluated directly on the same held-out descriptor rows.

Model	CV MAE (eV)	CV RMSE (eV)	Notes
Linear regression	0.432	0.556	OLS with bias term on the standardized feature set.
k-Nearest Neighbors (k = 3)	0.388	0.657	Inverse-distance weighted, standardized features.
Random Forest (20 × depth 4)	0.295	0.470	Bootstrap aggregation with square-root feature bagging.
FluxMateria physics engine	0.223	0.299	No ML training route; work-function metadata caveat disclosed.

Feature set for all four models: atomic number, period, group, d-electron count, filling fraction, metallic radius, work function. Cross-validation on the 27-element pure-TM set with identical splits. The physics-derived predictor wins against every ML baseline trained on the same descriptors.

State-of-the-Art Context

Where FluxMateria lands compared to established methods for the same quantity.

Method	Type	MAE (eV)	Source
FluxMateria — pure TMs	First-principles, no fitting	0.223	this work
FluxMateria — facets	First-principles, no fitting	0.154	this work
FluxMateria — combined	First-principles, no fitting	0.197	this work
PBE-DFT (typical single method)	DFT	0.15 – 0.30	method-to-method spread
SISSO surrogate	ML on DFT features	0.20 – 0.30	Ghiringhelli et al., 2017
Neural-network surrogate	ML on DFT features	0.15 – 0.25	Batchelor et al., 2019
CGCNN (graph neural network)	GNN on DFT + structure	0.15 – 0.20	Xie & Grossman, 2018

FluxMateria matches DFT-trained ML surrogates on pure-TM and facet accuracy — without requiring a DFT training set and without any fitted coefficients.

Methodology

The benchmark measures the operational FluxMateria predictor — the same path exposed to the universal material engine — against independent literature data.

1. First-principles prediction

The d-band center is computed directly from atomic quantum numbers and surface coordination, with closed-shell and relativistic corrections for heavier series.

Input: element symbol + optional facet
No DFT calculation required
No training set or fitted parameters
Same function used in the production engine

2. Three-layer benchmark

Every reported number is measured on published literature values — nothing is fit or held out of the predictor.

27 pure transition metals (3d / 4d / 5d plus group 3)
41 facet-specific surfaces: (111), (100), (110), (211), (0001), steps
32 binary alloys across Pt/Pd/Au/Ag/Ni/Co/Rh families

Important scope note: this benchmark establishes FluxMateria as a physics-based predictor for the central descriptor in transition-metal catalysis. It does not claim to replace dedicated surface-science DFT for mechanism-level questions, reactor kinetics, or high-throughput screening at full scale — its role is to give a fast, physically grounded starting point that does not depend on trained ML surrogates.

Literature Sources

Every predicted value was compared against one or more of these independent studies.

HN14

Hammer & Nørskov, Chem. Rev. 114, 4259 (2014)

K04

Kitchin, Nørskov, Barteau, Chen, J. Chem. Phys. 120, 10240 (2004)

GN09

Greeley, Stephens, Bondarenko et al., Nat. Chem. 1, 552 (2009)

N95

Hammer & Nørskov, Surf. Sci. 343, 211 (1995)

CM20

Chen & Mavrikakis, Chem. Rev. 121, 1007 (2021)

Method-to-method spread

The five sources disagree by 0.05–0.35 eV per element across DFT methods, slab thicknesses, and facet choices. FluxMateria lands inside that window on the majority of cases.

Known Limitations

Honest bounds on where the predictor is weaker — published up front.

Group 3 f-block boundary

Y, La, Ce show ~0.5 eV residuals because 4f/5d orbital mixing is not yet modelled. Not recommended for rare-earth catalysis.

Pt-3d skin alloys

Pt₃Ni, Pt₃Co, Pt₃Fe under-predict the ligand-induced deepening by 0.6–0.9 eV — a multi-body electronic coupling effect on the roadmap.

Period 6 (5d)

Larger residuals than 3d/4d. Spin–orbit splitting for open-shell 5d (W, Re) is the next physics upgrade.

Single-source elements

Sc, Cr, Mn, Hf, Ta, Nb, Zr, Tc, La, Ce have only one literature entry for cross-check. More sources will sharpen the validation.

Artifacts

d-Band benchmark summary (JSON)

Machine-readable snapshot with all 100 cases, per-category metrics, ML baselines, SOTA comparison, and literature sources.

Download JSON

d-Band benchmark report (Markdown)

Human-readable report with the same numbers, formatted for easy inclusion in publications or decks.

Download MD

Catalyst scoring benchmark

The d-band descriptor feeds into the full catalyst ranking and inverse-discovery stack, benchmarked separately.

Open benchmark

From descriptor to catalyst discovery

This benchmark establishes the d-band center as a production-grade physics descriptor. It feeds the catalyst scoring layer and downstream discovery workflows inside the FluxMateria platform.

Catalyst scoring benchmark Catalyst case study Back to Materials module

Benchmark basis

d-band centers are computed from Flux surface and electronic descriptors. Benchmark references are used to measure descriptor accuracy.

Flux Physics