EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training

自回归模型基础（证据）通才与专种培训（气象学）计算机科学人工智能脑电图心理学机器学习计量经济学数学政治学地理神经科学生物生态学气象学栖息地法学

作者

Tongtian Yue,Shuning Xue,Xuange Gao,Yepeng Tang,Longteng Guo,Jie Jiang,Jing Liu

出处

期刊：Cornell University - arXiv 日期：2024-10-14

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.48550/arxiv.2410.19779

摘要

Electroencephalogram (EEG) signals are pivotal in providing insights into spontaneous brain activity, highlighting their significant importance in neuroscience research. However, the exploration of versatile EEG models is constrained by diverse data formats, outdated pre-training paradigms, and limited transfer learning methods, only leading to specialist models on single dataset. In this paper, we introduce EEGPT, the first generalist EEG foundation model designed to address these challenges. First, we propose an electrode-wise modeling strategy that treats each electrode as a fundamental unit, enabling the integration of diverse EEG datasets collected from up to 138 electrodes, amassing 37.5M pre-training samples. Second, we develop the first autoregressive EEG pre-trained model, moving away from traditional masked autoencoder approaches to a next signal prediction task that better captures the sequential and temporal dependencies of EEG data. We also explore scaling laws with model up to 1.1B parameters: the largest in EEG research to date. Third, we introduce a multi-task transfer learning paradigm using a learnable electrode graph network shared across tasks, which for the first time confirms multi-task compatibility and synergy. As the first generalist EEG foundation model, EEGPT shows broad compatibility with various signal acquisition devices, subjects, and tasks. It supports up to 138 electrodes and any combination thereof as input. Furthermore, we simultaneously evaluate it on 5 distinct tasks across 12 benchmarks. EEGPT consistently outperforms existing specialist models across all downstream tasks, with its effectiveness further validated through extensive ablation studies. This work sets a new direction for generalist EEG modeling, offering improved scalability, transferability, and adaptability for a wide range of EEG applications. The code and models will be released.

求助该文献

最长约 10秒，即可获得该文献文件

EEGPT: Unleashing the Potential of EEG Generalist Foundation Model by Autoregressive Pre-training

今日热心研友