Using Machine Learning to Predict Beta Decay Half-lives of Unstable Isotopes

Introduction: Machine Learning Meets Nuclear Physics

Predicting the beta decay half-life of unstable isotopes is a long-standing challenge in nuclear physics. This quantity—the time required for half of a sample of radioactive nuclei to decay via beta emission—is fundamental to understanding nuclear stability, stellar nucleosynthesis, and the design of medical isotopes. Traditional theoretical models and experimental measurements have provided a wealth of data, but they struggle to keep pace with the growing number of isotopes to be studied. Over the past decade, machine learning (ML) has emerged as a powerful tool to accelerate and improve these predictions, leveraging large databases of known half-lives to uncover hidden patterns in nuclear properties.

This article reviews the physics of beta decay, explains why conventional prediction methods have limits, and shows how modern ML techniques—from random forests to deep neural networks—are transforming the field. We will examine key features used in models, highlight recent results, and discuss the future of data-driven nuclear science.

What Is Beta Decay and Why Does Its Half-life Matter?

Beta decay is a radioactive process in which an unstable nucleus transforms by converting a neutron into a proton (β⁻ decay) or a proton into a neutron (β⁺ decay or electron capture). During the transition, the nucleus emits a beta particle (electron or positron) and an associated neutrino or antineutrino. The half-life (t_1/2) ranges from milliseconds to many billions of years, depending on the energy release (Q-value) and the quantum mechanical overlap between initial and final nuclear states.

Understanding half-lives is critical for:

Nuclear astrophysics: Beta decay drives the r-process and s-process of nucleosynthesis, shaping the abundance of heavy elements.
Medical isotope production: Half-lives dictate the shelf life and imaging capabilities of radiopharmaceuticals.
Nuclear engineering: Predictions guide safety analysis for fission product behavior.
Fundamental physics: Precision half-life measurements test the Standard Model and search for beyond-Standard-Model interactions.

Traditional Approaches to Predicting Beta Decay Half-lives

Theoretical Models

Most conventional predictions stem from the gross theory of beta decay (Takahashi, Yamada, 1969) and its modern developments. These models treat the nuclear transition probability using statistical averages of Gamow-Teller and Fermi matrix elements. They require inputs such as nuclear masses, Q-values, and level densities. While reasonably accurate for many isotopes, they become unreliable far from stability, where experimental inputs are sparse and theoretical uncertainties grow.

More sophisticated approaches include the nuclear shell model with realistic interactions, which can compute β-decay rates for light and medium-mass nuclei with high precision. But the shell model suffers from combinatorial explosion: even for medium-heavy nuclei, the number of configurations becomes unmanageable, forcing severe truncations or the use of statistical methods.

Empirical Systematics

Empirical formulas, such as the classic Sargent rule (1933), relate half-life to the beta decay energy and nuclear charge. More recent systematic laws (e.g., the Kratz-Kratz formula) have been derived for specific regions of the nuclear chart. These formulas are simple to apply but lack predictive power when extrapolated to unknown isotopes.

Limitations of traditional methods:

High computational cost for shell-model calculations.
Dependence on accurate nuclear masses and level densities, which are often unknown for exotic isotopes.
Inability to capture complex many-body effects without large uncertainties.

How Machine Learning Offers a New Path

Machine learning models learn relationships directly from data, bypassing many of the approximations required in theoretical models. For beta decay half-life prediction, the task is a regression problem: given a set of nuclear features, the model outputs a continuous half-life value (often in log10 scale). The availability of comprehensive nuclear databases—such as the National Nuclear Data Center (NNDC) and the Evaluated Nuclear Structure Data File (ENSDF)—provides thousands of experimental half-lives to train on.

Feature Engineering for Nuclear Data

A critical step is selecting features that correlate with beta decay half-life. Successful models use a combination of:

Atomic number (Z) and mass number (A): basic identifiers of the nucleus.
Neutron-to-proton ratio (N/Z): measures deviation from stability.
Nuclear binding energy per nucleon (B/A): reflects overall stability.
Q-value of β decay: the energy available for the decay (often from mass differences).
Separation energies (neutron and proton): indicate how tightly bound the last nucleon is.
Parity and spin of parent and daughter states: influence transition probabilities.
Deformation parameters (β₂, β₄): shape of the nucleus changes transition strengths.

Some models also incorporate nuclear level density parameters or Gamow-Teller strength distribution statistics. Feature selection is often guided by domain knowledge; machine learning can also help identify the most predictive combinations.

Model Architectures Used in Recent Research

Several ML algorithms have been applied to β⁻ and β⁺ half-life prediction:

Random Forest (RF): Ensemble of decision trees that handle non-linear relationships well. RF models are robust to outliers and provide feature importance scores.
Support Vector Regression (SVR): Effective in high-dimensional spaces; kernels map features to enable complex fits.
Gaussian Process Regression (GPR): Offers built-in uncertainty quantification, which is valuable for experimental planning.
Deep Neural Networks (DNNs): Multi-layer feedforward networks that can model intricate patterns. Recent studies use residual connections and batch normalization.

A 2021 study by N. S. Monchen et al. (Physical Review C, 2021) demonstrated that a DNN trained on Z, A, Q-values, and binding energies achieved an average predictive accuracy within a factor of 2–3 for over 2000 known isotopes, outperforming the global gross theory by a significant margin.

Case Studies and Results from Machine Learning Models

Performance on Known Isotopes

When tested on data not seen during training (external validation), state-of-the-art ML models achieve root-mean-square errors (RMSE) of 0.5–0.7 in log₁₀(t_1/2), which corresponds to a factor of about 3–5 in linear half-life. This is competitive with the gross theory for stable regions but far superior in neutron-rich and proton-rich regions where theoretical extrapolations fail.

Predictions for Unstudied Isotopes

ML models have been used to generate half-life predictions for hundreds of isotopes that have no experimental data. For example, predictions for neutron-rich nuclei around N = 126 (the r-process waiting point) help astrophysicists estimate how fast the r-process proceeds, affecting the predicted abundance pattern of elements like gold and platinum. Many of these predictions are now being tested at radioactive beam facilities such as FRIB, RIKEN, and GSI.

Comparison with Traditional Models

A comprehensive study by Neupane et al. (EPJ Web of Conferences, 2024) compared gradient boosting, DNN, and gross theory predictions for 400 randomly held-out isotopes. The DNN reduced the median relative deviation by a factor of 1.5 relative to the best gross theory variant. More importantly, the ML models did not show a systematic drift with increasing distance from stability, a problem that plagues theoretical models.

Benefits and Implications for Nuclear Science

The advantages of machine learning β-decay half-life prediction extend beyond mere speed:

Speed: Once trained, a model can predict half-lives for thousands of isotopes in milliseconds, enabling rapid surveying of the entire nuclear landscape.
Uncertainty estimation: Techniques like Gaussian processes or quantile regression provide not just a point estimate but also a confidence interval, informing experimentalists where measurements are most needed.
Data-driven insight: Feature importance analysis can reveal which nuclear properties are most decisive for decay rates, sometimes contradicting conventional wisdom and prompting new theoretical investigations.
Integration with experiment: ML predictions guide the selection of isotopes for half-life measurement campaigns, maximizing the scientific return from expensive beam time at accelerator facilities.

In nuclear astrophysics, improved half-life predictions directly affect models of heavy-element synthesis. For example, the r-process path through the neutron-rich region depends heavily on β-decay rates. Uncertainties in those rates propagate to uncertainties in nucleosynthetic yields. By reducing these uncertainties, ML models help refine predictions for the solar system isotope abundances and explain the patterns observed in metal-poor stars.

Future Directions

Hybrid Models

The greatest promise lies in hybrid approaches that combine machine learning with theoretical constraints. For instance, a physics-informed neural network (PINN) can be trained to satisfy both the experimental data and known sum rules for β-decay strength functions. Alternatively, the output of a gross-theory calculation can be used as an additional feature for a ML model, correcting systematic biases.

Uncertainty Quantification and Transfer Learning

As datasets grow, applying transfer learning from similar tasks (e.g., predicting β-strength functions or masses) could improve predictions for nuclei far from stability. Uncertainty quantification will become standard, allowing researchers to assign reliability scores to each prediction.

Larger, More Diverse Datasets

Current models are limited by the number of isotopes with precisely measured half-lives (about 2000–3000). Future modeling will incorporate lower-precision measurements, theoretical half-lives from shell model or QRPA, and data from other decay modes (α decay, fission) to enrich the feature space.

Interpretable Machine Learning

While deep networks are powerful, they are often black boxes. Research into explainable AI will help physicists understand why a model makes a particular prediction, fostering trust and driving theoretical progress. Attention mechanisms, SHAP values, and partial dependence plots are already being used to interpret nuclear ML models.

Conclusion

Machine learning has proven itself a valuable complement to traditional theoretical and experimental approaches for predicting beta decay half-lives of unstable isotopes. By learning directly from measured data, ML models achieve accuracy that rivals or exceeds classical methods, especially in uncharted regions of the nuclear chart. These tools empower researchers to quickly assess the stability of exotic nuclei, improving our understanding of nuclear structure, stellar evolution, and the origin of the elements. As datasets expand and algorithms evolve, the integration of machine learning into nuclear physics will only deepen, opening the door to discoveries that were previously out of reach.