Adaptive adversarial augmentation for molecular property prediction

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

This paper introduces a novel adversarial data augmentation technique that enhances the performance of molecular property prediction tasks. The proposed method, Adversarial Augmentation to Influential Sample (AAIS), addresses classification problems involving imbalanced data in multitask scenarios where traditional augmentation techniques often prove ineffective. Data augmentation is conducted by a distributionally robust optimization, so its performance is less dependent on the available data and the number of tasks. Particularly, we devise an adaptive augmentation that identifies data points that significantly influence model training using the influence function. We found that these data points are located near the decision boundary, and the adaptive augmentation locally flattens the decision boundary, leading to the robustness of prediction. The proposed method was evaluated using benchmark datasets commonly employed for property prediction based on molecular structures. We confirmed that the developed adaptive adversarial augmentation approach effectively improves model prediction performance by 1%–15% in AUC and 1%–35% in F1-score. Our method effectively addresses the class imbalance problem and enhances performance in graph-level tasks by augmenting influential graph features. These findings suggest that AAIS offers a flexible tool for improving the generalization ability and accuracy of the model, particularly in complex or imbalanced molecular prediction tasks.

Original languageEnglish
Article number126512
JournalExpert Systems with Applications
Volume270
DOIs
StatePublished - 25 Apr 2025

Keywords

  • Adversarial augmentation
  • Graph neural network
  • Influence function
  • Molecular properties prediction

Fingerprint

Dive into the research topics of 'Adaptive adversarial augmentation for molecular property prediction'. Together they form a unique fingerprint.

Cite this