Machine Learning: Science and Technology

ISSN: 2632-2153

OPEN ACCESS

Machine Learning: Science and Technology is a multidisciplinary open access journal that bridges the application of machine learning across the sciences with advances in machine learning methods and theory as motivated by physical insights.

Submit an article opens in new tab Track my article opens in new tab

RSS

Median submission to first decision before peer review 3 days

Median submission to first decision after peer review 49 days

Impact factor 6.8

Citescore 7.1

Full list of journal metrics

Open all abstracts, in this tab

The following article is Open access

A quantum inspired approach to learning dynamical laws from data—block-sparsity and gauge-mediated weight sharing

J Fuksa et al 2024 Mach. Learn.: Sci. Technol. 5 025064

View article, A quantum inspired approach to learning dynamical laws from data—block-sparsity and gauge-mediated weight sharing PDF, A quantum inspired approach to learning dynamical laws from data—block-sparsity and gauge-mediated weight sharing

Recent years have witnessed an increased interest in recovering dynamical laws of complex systems in a largely data-driven fashion under meaningful hypotheses. In this work, we propose a scalable and numerically robust method for this task, utilizing efficient block-sparse tensor train representations of dynamical laws, inspired by similar approaches in quantum many-body systems. Low-rank tensor train representations have been previously derived for dynamical laws of one-dimensional systems. We extend this result to efficient representations of systems with K-mode interactions and controlled approximations of systems with decaying interactions. We further argue that natural structure assumptions on dynamical laws, such as bounded polynomial degrees, can be exploited in the form of block-sparse support patterns of tensor-train cores. Additional structural similarities between interactions of certain modes can be accounted for by weight sharing within the ansatz. To make use of these structure assumptions, we propose a novel optimization algorithm, block-sparsity restricted alternating least squares with gauge-mediated weight sharing. The algorithm is inspired by similar notions in machine learning and achieves a significant improvement in performance over previous approaches. We demonstrate the performance of the method numerically on three one-dimensional systems—the Fermi–Pasta–Ulam–Tsingou system, rotating magnetic dipoles and point particles interacting via modified Lennard–Jones potentials, observing a highly accurate and noise-robust recovery.

https://doi.org/10.1088/2632-2153/ad4f4e

The following article is Open access

Investigating the ability of PINNs to solve Burgers' PDE near finite-time blowup

Dibyakanti Kumar and Anirbit Mukherjee 2024 Mach. Learn.: Sci. Technol. 5 025063

View article, Investigating the ability of PINNs to solve Burgers' PDE near finite-time blowup PDF, Investigating the ability of PINNs to solve Burgers' PDE near finite-time blowup

Physics Informed Neural Networks (PINNs) have been achieving ever newer feats of solving complicated Partial Differential Equations (PDEs) numerically while offering an attractive trade-off between accuracy and speed of inference. A particularly challenging aspect of PDEs is that there exist simple PDEs which can evolve into singular solutions in finite time starting from smooth initial conditions. In recent times some striking experiments have suggested that PINNs might be good at even detecting such finite-time blow-ups. In this work, we embark on a program to investigate this stability of PINNs from a rigorous theoretical viewpoint. Firstly, we derive error bounds for PINNs for Burgers' PDE, in arbitrary dimensions, under conditions that allow for a finite-time blow-up. Our bounds give a theoretical justification for the functional regularization terms that have been reported to be useful for training PINNs near finite-time blow-up. Then we demonstrate via experiments that our bounds are significantly correlated to the $\ell_2$ -distance of the neurally found surrogate from the true blow-up solution, when computed on sequences of PDEs that are getting increasingly close to a blow-up.

https://doi.org/10.1088/2632-2153/ad51cd

The following article is Open access

Deep artificial neural network-powered phase field model for predicting damage characteristic in brittle composite under varying configurations

Hoang-Quan Nguyen et al 2024 Mach. Learn.: Sci. Technol. 5 025062

View article, Deep artificial neural network-powered phase field model for predicting damage characteristic in brittle composite under varying configurations PDF, Deep artificial neural network-powered phase field model for predicting damage characteristic in brittle composite under varying configurations

This work introduces a novel artificial neural network (ANN)-powered phase field model, offering rapid and precise predictions of fracture propagation in brittle materials. To improve the capabilities of the ANN model, we incorporate a loop of conditions into its core to regulate the absolute percentage error for each observation point, that filters and consistently selects the most accurate outcome. This algorithm enables our model to better adapt to the highly sensitive validation data arising from varying configurations. The effectiveness of the approach is illustrated through three examples involving changes in the microgeometry and material properties of steel fiber-reinforced high-strength concrete structures. Indeed, the predicted outcomes from the improved ANN phase field model in terms of stress–strain relationship, and crack propagation path demonstrates an outperformance compared with that based on the extreme gradient boosting method, a leading regression machine learning technique for tabular data. Additionally, the introduced model exhibits a remarkable speed advantage, being 180 times faster than traditional phase field simulations, and provides results at nearly any fiber location, demonstrating superiority over the phase field model. This study marks a significant advancement in the application of artificial intelligence for accurately predicting crack propagation paths in composite materials, particularly in cases involving the relative positioning of the fiber and initial crack location.

https://doi.org/10.1088/2632-2153/ad52e8

The following article is Open access

The twin peaks of learning neural networks

Elizaveta Demyanenko et al 2024 Mach. Learn.: Sci. Technol. 5 025061

View article, The twin peaks of learning neural networks PDF, The twin peaks of learning neural networks

Recent works demonstrated the existence of a double-descent phenomenon for the generalization error of neural networks, where highly overparameterized models escape overfitting and achieve good test performance, at odds with the standard bias-variance trade-off described by statistical learning theory. In the present work, we explore a link between this phenomenon and the increase of complexity and sensitivity of the function represented by neural networks. In particular, we study the Boolean mean dimension (BMD), a metric developed in the context of Boolean function analysis. Focusing on a simple teacher-student setting for the random feature model, we derive a theoretical analysis based on the replica method that yields an interpretable expression for the BMD, in the high dimensional regime where the number of data points, the number of features, and the input size grow to infinity. We find that, as the degree of overparameterization of the network is increased, the BMD reaches an evident peak at the interpolation threshold, in correspondence with the generalization error peak, and then slowly approaches a low asymptotic value. The same phenomenology is then traced in numerical experiments with different model classes and training setups. Moreover, we find empirically that adversarially initialized models tend to show higher BMD values, and that models that are more robust to adversarial attacks exhibit a lower BMD.

https://doi.org/10.1088/2632-2153/ad524d

The following article is Open access

Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy

Thomas Penfold et al 2024 Mach. Learn.: Sci. Technol. 5 021001

View article, Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy PDF, Machine-learning strategies for the accurate and efficient analysis of x-ray spectroscopy

Computational spectroscopy has emerged as a critical tool for researchers looking to achieve both qualitative and quantitative interpretations of experimental spectra. Over the past decade, increased interactions between experiment and theory have created a positive feedback loop that has stimulated developments in both domains. In particular, the increased accuracy of calculations has led to them becoming an indispensable tool for the analysis of spectroscopies across the electromagnetic spectrum. This progress is especially well demonstrated for short-wavelength techniques, e.g. core-hole (x-ray) spectroscopies, whose prevalence has increased following the advent of modern x-ray facilities including third-generation synchrotrons and x-ray free-electron lasers. While calculations based on well-established wavefunction or density-functional methods continue to dominate the greater part of spectral analyses in the literature, emerging developments in machine-learning algorithms are beginning to open up new opportunities to complement these traditional techniques with fast, accurate, and affordable 'black-box' approaches. This Topical Review recounts recent progress in data-driven/machine-learning approaches for computational x-ray spectroscopy. We discuss the achievements and limitations of the presently-available approaches and review the potential that these techniques have to expand the scope and reach of computational and experimental x-ray spectroscopic studies.

https://doi.org/10.1088/2632-2153/ad5074

Machine Learning: Science and Technology

Journal links

Journal information

Machine Learning: Science and Technology

Most read

Latest articles

Review articles

Accepted manuscripts

Trending

Trending on Altmetric

Journal links

Journal information