Machine Learning for Chemical Reactivity: The Importance of Failed Experiments

Strieth-Kalthoff, Felix; Sandfort, Frederik; Kühnemund, Marius; Schäfer, Felix R.; Kuchen, Herbert; Glorius, Frank


Abstract

Assessing the outcomes of chemical reactions in a quantitative fashion has been a cornerstone across all synthetic disciplines. Classically approached through empirical optimization, data-driven modelling bears an enormous potential to streamline this process. However, such predictive models require significant quantities of high-quality data, the availability of which is limited: Main reasons for this include experimental errors and, importantly, human biases regarding experiment selection and result reporting. In a series of case studies, we investigate the impact of these biases for drawing general conclusions from chemical reaction data, revealing the utmost importance of “negative” examples. Eventually, case studies into data expansion approaches showcase directions to circumvent these limitations—and demonstrate perspectives towards a long-term data quality enhancement in chemistry.

Keywords
Cross-Coupling; Data Bias; Machine Learning; Reaction Data; Yield Prediction



Publication type
Research article (journal)

Peer reviewed
Yes

Publication status
Published

Year
2022

Journal
Angewandte Chemie - International Edition

Volume
61

Issue
29

Language
English

ISSN
1433-7851

DOI