衛星ビッグデータからの知識発見を可能とするインタラクティブ統計分析手法の開発
项目来源
项目主持人
项目受资助机构
立项年度
立项时间
项目编号
项目级别
研究期限
受资助金额
学科
学科代码
基金类别
关键词
参与者
参与机构
1.Adaptive Segmentation and Statistical Analysis for Multivariate Big Data Forecasting
- 关键词:
- Big data;Budget control;Complex networks;Computational complexity;Finance;Higher order statistics;Image segmentation;Multivariant analysis;Predictive analytics;Time series ;Time series analysis;Big data analytic;Data analytics;Data optimization;High fluctuation;High volumes;Multivariate time series;Predictive models;Recursive window segmentation;Univariate;Univariate time series
- Fomo, Desmond;Sato, Aki-Hiro
- 《Big Data and Cognitive Computing》
- 2025年
- 9卷
- 11期
- 期刊
Forecasting high-volume, univariate, and multivariate longitudinal data streams is a critical challenge in Big Data systems, especially with constrained computational resources and pronounced data variability. However, existing approaches often neglect multivariate statistical complexity (e.g., covariance, skewness, kurtosis) of multivariate time series or rely on recency-only windowing that discards informative historical fluctuation patterns, limiting robustness under strict resource budgets. This work makes two core contributions to big data forecasting. First, we establish a formal, multi-dimensional framework for quantifying "data bigness" across statistical, computational, and algorithmic complexities, providing a rigorous foundation for analyzing resource-constrained problems. Second, guided by this framework, we extend and validate the Adaptive High-Fluctuation Recursive Segmentation (AHFRS) algorithm for multivariate time series. By incorporating higher-order statistics such as covariance, skewness, and kurtosis, AHFRS improves predictive accuracy under strict computational budgets. We validate the approach in two stages. First, a real-world case study on a univariate Bitcoin time series provides a practical stress test using a Long Short-Term Memory (LSTM) network as a robust baseline. This validation reveals a significant increase in forecasting robustness, with our method reducing the Root Mean Squared Error (RMSE) by more than 76% in a challenging scenario. Second, its generalizability is established on synthetic multivariate data sets in Finance, Retail, and Healthcare using standard statistical models. Across domains, AHFRS consistently outperforms baselines; in our multivariate Finance simulation, RMSE decreases by up to 62.5% in Finance and Mean Absolute Percentage Error (MAPE) drops by more than 10 percentage points in Healthcare. These results demonstrate that the proposed framework and AHFRS advances the theoretical modeling of data complexity and the design of adaptive, resource-efficient forecasting pipelines for real-world, high-volume data ecosystems. © 2025 by the authors.
...2.Spatial Big Data Infrastructure for the Japanese Hyper-spectral Imager SUIte (HISUI) based on World Grid Square Statistics
- 关键词:
- Mesh generation;Metadata;Spectrum analyzers;Steganography;Tropics;Data infrastructure;Descriptive statistics;Hyper spectral sattelite data;Hyper-spectral imager SUIte;HyperSpectral;MESHSTATS;Sattelite;Spectral imagers;World grid square statistic
- Sato, Aki-Hiro
- 《2024 IEEE International Conference on Big Data, BigData 2024》
- 2024年
- December 15, 2024 - December 18, 2024
- Washington, DC, United states
- 会议
This study proposes a framework to manage HISUI data (spatiotemporal and spectral big data) as spatial statistics regarding time and spectrum. This study focuses on spatial Big Data infrastructure for developing and deploying interactive Web data applications of HISUI hyperspectral data from a constructive point of view.The study aims to construct a system for spatiotemporal spectrum analysis and visualization based on the proposed technique. Such a spatial Big Data infrastructure assists in developing algorithms that enable users to interactively analyze, classify, and detect temporal changes on the Earth's surface. As a result, users can conduct statistical analysis of spatio-temporal and spectral characteristics on hyperspectral images.The World Grid Square statistics on descriptive statistics of reflectances for each band are produced by parallel computation with high-speed calculation methods and provided via Web API on MESHSTATS. The response is formatted as CSV with World Grid Square code, sample size, and descriptive statistics selected by type for 185 bands.The proposed method enables users to analyze hyperspectral satellite data captured as multi-channel raster data with different observation directions, times, and weather conditions by converted into Grid Square statistics. It is concluded that our data analytics platform enables users to conduct analysis on Grid Square statistics about HISUI hyperspectral satellite data with statistical significance. © 2024 IEEE.
...
