To address the challenges in identifying effective fault features and achieving sufficient diagnostic accuracy and robustness in variable-speed printing press bearings, where complex mixed-condition vibration signals exhibit non-stationarity, strong nonlinearity, ambiguous time-frequency characteristics, and overlapping fault features across multiple operating conditions, this paper proposes an adaptive optimization signal decomposition method combined with dual-modal time-series and image deep feature fusion for variable-speed multi-condition bearing fault diagnosis. First, to overcome the strong parameter dependency and significant noise interference of traditional adaptive decomposition algorithms, the Crested Porcupine Optimization Algorithm is introduced to adaptively search for the optimal noise amplitude and integration count of ICEEMDAN for effective signal decomposition. IMF components are then screened and reorganized based on correlation coefficients and variance contribution rates to enhance fault-sensitive information. Second, multidimensional time-domain features are extracted in parallel to construct time-frequency images, forming time-sequence-image bimodal inputs that enhance fault representation across different dimensions. Finally, a dual-branch deep learning model is developed: the time-sequence branch employs gated recurrent units to capture feature evolution trends, while the image branch utilizes SE-ResNet18 with embedded channel attention mechanisms to extract deep spatial features. Multimodal feature fusion enables classification recognition. Validation using a bearing self-diagnosis dataset from variable-speed hybrid operation and the publicly available Ottawa variable-speed bearing dataset demonstrates that this method achieves high-accuracy fault identification and strong generalization capabilities across diverse variable-speed hybrid operating conditions.