Beyond the benchmark: continuous alloys and doping
The 0.237 eV MAE number is what the headline benchmark measures, but the
real value of the predictor for industrial users is the
continuous piece. Every working semiconductor device
ships as an alloy or a doped composition, not a stoichiometric endpoint.
The same single predictor handles both directly, with the same
zero-training, zero-fitted-parameter contract.
Alloy band-gap engineering
Type a fractional composition and the predictor decomposes it into
integer-stoichiometry endpoints, evaluates each via the same
composition-only path that gives 0.237 eV on the cohort above, and
interpolates with Vegard's law. No system-specific bowing
constants; no fit. Sample verified outputs:
| Composition | Application | FluxMateria | Experiment |
| In0.53Ga0.47As | 1.55 µm telecom photonics, lattice-matched to InP | 0.73 eV | 0.74 eV |
| Hg0.4Cd0.6Te | SWIR infrared detector | 0.78 eV | 0.60 eV |
| In0.8Ga0.2As | High-electron-mobility transistor channel | 0.54 eV | 0.50 eV |
| Zr1.33Ta0.67N1.63O1.89 | Bi-axis quaternary oxynitride | 2.52 eV | 2.48 eV |
| Bi0.04Te0.06Pb0.98Se0.98 | Bi/Te-codoped PbSe thermoelectric | 0.19 eV | 0.28 eV |
Doping → carrier concentration and Fermi level
A dilute dopant in a fractional formula is auto-classified as donor /
acceptor / isoelectronic by comparing the dopant's bonding signature
to the host site it replaces. The module returns activation energy,
carrier concentration n or p at the user's temperature, Fermi-level
offset from the band edge, and Burstein–Moss channel-filling shift
— all in the same millisecond call:
| Doped composition | Application | Type | Notes |
| In1.95Sn0.05O3 | ITO transparent conductor | n-type | Sn donor on In site |
| Zn0.98Al0.02O | AZO transparent conductor | n-type | Al donor on Zn site |
| Ga0.999Mg0.001N | p-GaN LED layer | p-type | Mg acceptor on Ga site |
| Ga0.999Si0.001N | n-GaN LED contact | n-type | Si donor on Ga site |
| Cd0.99Cu0.01Te | p-CdTe solar absorber | p-type | Cu acceptor on Cd site |
| Si0.999P0.001 | n+ Si | n-type | P donor on Si site |
| Zn0.99Mn0.01O | Dilute magnetic semiconductor | isoelectronic | Mn matches Zn host valency — no carriers |
Practical applications enabled
Photovoltaic absorber design (CIGS, perovskite tandems),
visible-spectrum LED engineering (InGaN, AlGaInP), telecom photonics
(InGaAsP at 1.3 / 1.55 µm), IR detector cutoff tuning (HgCdTe),
power-electronics doped channels (n-GaN, p-GaN, doped β-Ga2O3),
transparent conductors (ITO, AZO, ATO), thermoelectric alloy
optimisation, and dilute magnetic semiconductors — all from a
composition string + temperature, in roughly a millisecond per query.
Carrier concentration: 99.0% within factor 3 on 197 Hall measurements
Extending past type classification, the predictor scores against a
curated 197-entry Hall-measurement cohort spanning Si, Ge, GaAs, InP,
InAs, InSb, GaSb, GaN, AlAs, AlSb, GaP, ZnO, ZnS, ZnSe, ZnTe, CdS,
CdSe, CdTe, PbS, PbSe, PbTe, SiC, diamond, ITO, AZO, FTO and their
compensated combinations. Curated from Sze, Madelung, Adachi,
Pearton, Look, NSM Ioffe.
100%
Type classification (n / p / intrinsic)
99.0%
Within factor 3 of measured n (novel-material mode)
0.000 dex
Median log10 error
197
Hall-measurement entries
Mobility & conductivity
The same single call returns electron and hole mobilities, and the
full conductivity σ = q n μ, at any temperature.
100% of predictions land within a factor of 3 of
literature mobilities across Si, Ge, GaAs, InP, InAs, GaN, CdTe, ZnO,
and other canonical semiconductors. Pair with carrier-concentration
output to size a transistor channel, rank transparent conductors, or
triage thermoelectric leads.
Band-edge alignment for heterojunctions
Pass any two compositions and the predictor returns the
conduction-band offset ΔEc, valence-band offset
ΔEv, and the Type I / II / III
alignment classification — the key inputs for HEMT, LED
quantum well, multi-junction solar, and IR-detector stack design.
~88% type classification accuracy on canonical
heterostructure benchmarks (AlGaAs / GaAs, AlGaN / GaN, CdSe / ZnS,
InAs / GaSb broken-gap, SiGe, CdTe / ZnTe, etc.) with eV magnitudes
inside typical DFT band-offset uncertainty (~0.3 eV).
Ab initio on materials that don't exist yet
A separate "novel-material" benchmark mode disables every
measured-property lookup the module would normally use, and forces
the predictor to derive the entire bundle — band gap, carrier
concentration, mobility, conductivity, band-edge alignment —
from composition alone. On the same 197-entry Hall cohort and the
same heterostructure benchmark set, the novel-material mode
reproduces the known-material accuracy to within a few percentage
points on every metric:
| Metric | Known-material accuracy | Novel-material accuracy |
| Hall carrier within factor 3 | 98.5% | 99.0% |
| Hall carrier type classification | 100% | 100% |
| Mobility within factor 3 | 100% | 100% |
| Band-offset Type I / II / III | 87.5% | 87.5% |
This is the contract that separates this pipeline from
interpolation-based surrogates: the prediction for a never-measured
composition is generated by the same physics that scores against
measured materials, and lands at the same accuracy. The known /
novel split is exposed as a benchmark mode so users can verify it
on any composition.