BounDr.E: Predicting Drug-likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization

Citations

SCOPUS

0

초록

The advent of generative AI now enables largescale de novo design of molecules, but identifying viable drug candidates among them remains an open problem. Existing drug-likeness prediction methods often rely on ambiguous negative sets or purely structural features, limiting their ability to accurately classify drugs from nondrugs. In this work, we introduce BOUNDR.E: a novel modeling of drug-likeness as a compact space surrounding approved drugs through a dynamic one-class boundary approach. Specifically, we enrich the chemical space through biomedical knowledge alignment, and then iteratively tighten the drug-like boundary by pushing nondrug-like compounds outside via an ExpectationMaximization (EM)-like process. Empirically, BOUNDR.E achieves 10% F1-score improvement over the previous state-of-the-art and demonstrates robust cross-dataset performance, including zero-shot toxic compound filtering. Additionally, we showcase its effectiveness through comprehensive case studies in large-scale in silico screening. Our codes and constructed benchmark data under various schemes are provided at: github.com/eugenebang/boundr e. © 2025, ML Research Press. All rights reserved.

제목
BounDr.E: Predicting Drug-likeness via Biomedical Knowledge Alignment and EM-like One-Class Boundary Optimization
저자
Bang, DongminSung, InyoungPiao, YinhuaLee, SangseonKim, Sun
발행일
2025
유형
Conference paper
저널명
Proceedings of Machine Learning Research
267
페이지
2858 ~ 2893