Scalable Machine Learning Analysis of Parker Solar Probe Solar Wind Data

Publications

Daniela Martin, Connor Oโ€™Brien, Valmir P. Moraes Filho, Jinsu Hong, Jasmine R. Kobayashi, Evangelia Samara & Joseph Gallego
arXiv preprint arXiv:2510.21066 (submitted 24 Oct 2025)

Abstract:
We present a scalable machine learning framework for analyzing Parker Solar Probe (PSP) solar wind data using distributed processing and the quantum-inspired Kernel Density Matrices (KDM) method. The PSP dataset (2018--2024) exceeds 150 GB, challenging conventional analysis approaches. Our framework leverages Dask for large-scale statistical computations and KDM to estimate univariate and bivariate distributions of key solar wind parameters, including solar wind speed, proton density, and proton thermal speed, as well as anomaly thresholds for each parameter. We reveal characteristic trends in the inner heliosphere, including increasing solar wind speed with distance from the Sun, decreasing proton density, and the inverse relationship between speed and density. Solar wind structures play a critical role in enhancing and mediating extreme space weather phenomena and can trigger geomagnetic storms; our analyses provide quantitative insights into these processes. This approach offers a tractable, interpretable, and distributed methodology for exploring complex physical datasets and facilitates reproducible analysis of large-scale in situ measurements. Processed data products and analysis tools are made publicly available to advance future studies of solar wind dynamics and space weather forecasting. The code and configuration files used in this study are publicly available to support reproducibility.

Links:


๐Ÿ“Œ BibTeX citation

@article{martin2025scalablePSP,
  title   = {Scalable Machine Learning Analysis of Parker Solar Probe Solar Wind Data},
  author  = {Martin, Daniela and O'Brien, Connor and Moraes Filho, Valmir P. and Hong, Jinsu and Kobayashi, Jasmine R. and Samara, Evangelia and Gallego, Joseph},
  journal = {arXiv preprint},
  year    = {2025},
  eprint  = {2510.21066},
  archivePrefix = {arXiv},
  primaryClass = {cs.LG},
  url     = {https://arxiv.org/abs/2510.21066}
}

ยฉ 2026 This website is copyright. Created by All Shaman team.