Efficient Sampling Algorithms for Approximate Computation of High-Dimensional Data in Scientific Computing

Authors

  • Bronisław Lucjan Latocha Faculty of Information Technology, Casimir Pulaski University of Radom, Radom, 26-600, Poland
  • Celina Hajdukowa Faculty of Information Technology, Casimir Pulaski University of Radom, Radom, 26-600, Poland
  • Czesława Latocha Faculty of Informatics, Jan Kochanowski University, Kielce, 25-020, Poland

DOI:

https://doi.org/10.64972/dea.2025.v4i2.2005d:57-71

Keywords:

High-Dimensional Data, Efficient Sampling, Approximate Computation, Scientific Computing, Statistical Analysis

Abstract

As the dimensionality of high-dimensional data increases, scientific computing faces many challenges. There is an urgent need for high-quality approximation methods. To address the high-dimensional data approximation problems frequently encountered in fields such as environmental monitoring, genomics, and remote sensing, this paper introduces a new class of efficient sampling algorithms. In order to improve the speed and reliability of data-based statistics, the aforementioned two methods were chosen. A large number of experiments have shown that the above methods perform well in synthesizing Gaussian fields, hyperspectral images, and single-cell gene expression matrices. According to the results, the proposed algorithm achieved an average approximation error of less than 0.018, even with a sampling ratio of only 5%. It outperforms traditional importance sampling (0.043) and uniform random sampling (0.071), and exhibits linear scalability in terms of computation time and memory. To demonstrate that reliable and accurate estimators can still be used under adaptive weighting and feedback mechanisms, ablation experiments and sensitivity analyzes were also conducted. These findings provide new standards for the sample economy and accuracy of data-intensive scientific processes. The study provides the scientific community with a reproducible and open workflow to help accelerate research and gain deeper insights into high-dimensional data.

Downloads

Published

2025-04-25

How to Cite

Latocha, B. L., Hajdukowa, C., & Latocha, C. (2025). Efficient Sampling Algorithms for Approximate Computation of High-Dimensional Data in Scientific Computing. Data Engineering and Applications, 4(2), 5d:57–71. https://doi.org/10.64972/dea.2025.v4i2.2005d:57-71

Issue

Section

Articles

Similar Articles

1 2 3 4 5 > >> 

You may also start an advanced similarity search for this article.