CINDI:基于条件归一化流的电网数据条件插补与噪声数据完整性修复框架 / CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data
1️⃣ 一句话总结
这篇论文提出了一个名为CINDI的无监督概率框架,它首次将异常检测和数据修复统一到一个端到端的系统中,通过建模数据的精确条件概率,能够自动识别并修复电网等复杂时间序列数据中的噪声和异常,同时保持数据的物理和统计特性。
Real-world multivariate time series, particularly in critical infrastructure such as electrical power grids, are often corrupted by noise and anomalies that degrade the performance of downstream tasks. Standard data cleaning approaches often rely on disjoint strategies, which involve detecting errors with one model and imputing them with another. Such approaches can fail to capture the full joint distribution of the data and ignore prediction uncertainty. This work introduces Conditional Imputation and Noisy Data Integrity (CINDI), an unsupervised probabilistic framework designed to restore data integrity in complex time series. Unlike fragmented approaches, CINDI unifies anomaly detection and imputation into a single end-to-end system built on conditional normalizing flows. By modeling the exact conditional likelihood of the data, the framework identifies low-probability segments and iteratively samples statistically consistent replacements. This allows CINDI to efficiently reuse learned information while preserving the underlying physical and statistical properties of the system. We evaluate the framework using real-world grid loss data from a Norwegian power distribution operator, though the methodology is designed to generalize to any multivariate time series domain. The results demonstrate that CINDI yields robust performance compared to competitive baselines, offering a scalable solution for maintaining reliability in noisy environments.
CINDI:基于条件归一化流的电网数据条件插补与噪声数据完整性修复框架 / CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data
这篇论文提出了一个名为CINDI的无监督概率框架,它首次将异常检测和数据修复统一到一个端到端的系统中,通过建模数据的精确条件概率,能够自动识别并修复电网等复杂时间序列数据中的噪声和异常,同时保持数据的物理和统计特性。
源自 arXiv: 2603.11745