计算机科学
元数据
笔记本电脑
数据质量
Python(编程语言)
管道(软件)
数据库
软件
实时计算
数据挖掘
光伏系统
缺少数据
工程类
操作系统
公制(单位)
运营管理
电气工程
机器学习
作者
Bennet Meyers,Elpiniki Apostolaki-Iosifidou,Laura T. Schelhas
标识
DOI:10.1109/pvsc45281.2020.9300847
摘要
The increasing volume of photovoltaic system performance data is creating opportunities for remotely monitoring system health and optimizing operations and maintenance activities. However, this data often arrives in a variety of formats, with varying levels of quality, and missing metadata. The inability to automatically ingest, quality check, filter, and flag data across a large number of unique PV systems is a major roadblock to unlocking the potential of PV system performance data at scale. We have developed an automatic data processing pipeline application, written in open source Python. This software takes as an input generic, tabular PV performance time series data, typically a power signal at some sampling frequency, possibly with some missing or corrupted data. Using advanced data algorithms, large, high-frequency data sets can be standardized, cleaned, filtered, and flagged for basic operational issues in seconds on a standard laptop, with no need for user input or system metadata.
科研通智能强力驱动
Strongly Powered by AbleSci AI