The susceptibility of deep neural networks (DNNs) to adversarial manipulation has been extensively verified in numerous studies. Within black-box attack settings, since the internal parameters of the target model are inaccessible, adversaries typicall...
The susceptibility of deep neural networks (DNNs) to adversarial manipulation has been extensively verified in numerous studies. Within black-box attack settings, since the internal parameters of the target model are inaccessible, adversaries typically rely on surrogate models to approximate the decision boundary and subsequently craft adversarial inputs. Nevertheless, the reliance on a single surrogate model often leads to local optima, thereby weakening the cross-model transferability of the crafted adversarial examples. To address this issue, we introduce an adversarial example generation framework termed Attention-based Multi-model feature integration (AMA). Our approach leverages attention weights obtained from intermediate feature representations of surrogate models, integrates attention maps from multiple models to identify common discriminative features, and then applies an optimization strategy to perturb these features, thus overcoming the limitations inherent in single-model surrogates. Experimental evaluations demonstrate that AMA achieves up to 17.4% improvement over the baseline, with an average attack success rate of 58.0% against diverse defense models—surpassing the strongest baseline by 5.1%. These results highlight the effectiveness of our method in enhancing both adversarial strength and transferability.