Home Artificial Intelligence-Assisted Chemistry
Article Publicly Available

Artificial Intelligence-Assisted Chemistry

  • Ahmed Saeed , Peter Hotchkiss

    Peter Hotchkiss <peter.hotchkiss@opcw.org> is the science policy adviser at the Organisation for the Prohibition of Chemical Weapons and secretary to the OPCW’s Scientific Advisor Board. ORCID 0000-0002-1229-1090.

    EMAIL logo
    , Jeremy Frey , Hemda Garelick

    Hemda Garelick, Prof of Environmental Science and Public Health Education at Middlesex University, London, UK. Past President of IUPAC Div VI ‘Chemistry and the Environment’. ORCID 0000-0003-4568-2300

    , Vladimir Gubala , Bipul Saha , Russell Boyd

    Russell Boyd is the Alexander McLeod Professor Emeritus in computational chemistry at Dalhousie University. ORCID 0000-0001-6802-2976.

    and Jonathan E Forman

    Jonathan Forman is a Science and Technology Advisor at Pacific Northwest National Laboratory in the USA. He formerly served as the OPCW Science Policy Adviser from 2013-2019. ORCID 0000-0002-5379-812X 

Published/Copyright: August 14, 2023
Become an author with De Gruyter Brill

An International Workshop on “Artificial Intelligence Assisted Chemistry” was held at the headquarters of the Organisation for the Prohibition of Chemical Weapons (OPCW) in The Hague, The Netherlands on 16 and 17 June 2022. The program was jointly organized by the OPCW’s Scientific Advisory Board (SAB) and IUPAC, with representatives from the Committee on Chemistry and Industry (Bipul Saha), the Interdivisional Committee on Green Chemistry for Sustainable Development (Jonathan Forman), the Physical and Biophysical Chemistry Division (Jeremy Frey), Chemistry and the Environment Division (Hemda Garelick), and the Chemistry and Human Health Division (Vladimir Gubala). The objectives of the workshop were to (a) provide insight into current and future applications of Artificial Intelligence in Chemistry and (b) generate provocative discussions on the future of this emerging field and its potential impacts, both positive and negative, on the chemical sciences and society.

The SAB is tasked with monitoring the developments in science and technology of relevance to the Chemical Weapons Convention (CWC). The SAB’s guidance enables the OPCW Director-General to render science and technology advice relevant to the CWC, to the Conference of the States Parties, the Executive Council and States Parties. Every five years the OPCW holds a Special Session of the Conference of the States Parties to Review the Operation of the Chemical Weapons Convention, otherwise known as a Review Conference. These Review Conferences serve as fora for the assessment and evaluation of the CWC’s implementation and setting long-term views by providing strategic guidance to the OPCW. Particular focus is given to the verification regime, especially given the CWC’s guidance that the OPCW consider measures to make use of advances in science and technology in undertaking its verification activities (Article VIII, paragraph 6), the changing context within which the regime is implemented and scientific and technological advances in chemistry, engineering, and biotechnology. The Fifth Review Conference was scheduled for 15-19 May 2023.

To ensure that the OPCW Technical Secretariat and States Parties are apprised, the SAB prepares a report before each Review Conference which analyses relevant developments in science and technology over the past five years and presents recommendations and observations that the SAB considers to be important for the review of the operation of the Convention and its future implementation.

 
        “Chemistry of the Future” by DALL.E2 for IUPAC and OPCW (3 March 2023)

“Chemistry of the Future” by DALL.E2 for IUPAC and OPCW (3 March 2023)

IUPAC and OPCW have collaborated on various activities and workshops over the years [1, 2, 3] . The relationship was further strengthened in 2016 by the joint signing of a Memorandum of Understanding (MOU) officially establishing the relationship [4]. The MOU outlines the framework for cooperation between the two organisations with a view towards achieving common objectives and to providing benefits to their respective programmes and areas of work. These areas include, inter alia, science, education and outreach, ethics, and peaceful uses of chemistry. This workshop on AI-assisted chemistry is the latest example of this long-standing collaboration.

Summary of the Workshop

The workshop provided insight into current and future applications of AI in chemistry, with a focus on areas of chemistry of relevance to the OPCW. Topics such as synthetic design and retrosynthetic analysis, reaction conditions and optimisation, drug discovery, adoption and impact of AI in and on the chemical industry, and AI in agricultural chemistry were all considered via both presentations by world experts as well as during moderated discussions throughout the 1.5-day workshop. A brief overview of the presentations and discussions follows.

The workshop kicked off with an overview of the agenda and the impetus for convening on the AI theme. This was provided on behalf of the SAB by Board member Ahmed Saeed and Peter Hotchkiss of the OPCW. This opening presentation briefly touched on the various areas in which AI is impacting society before focusing on the many ways it is enabling the chemical sciences.

Artificial Intelligence in Drug Discovery

The first invited speaker was Sean Ekins of Collaborations Pharmaceuticals Inc, USA. Collaborations Pharmaceuticals uses AI to develop clinical candidates for rare, neglected, and unmet therapeutic needs. However, in a meeting with science, industry, and arms control communities, they started to consider how the technology they use may be misused in the development of chemical and biological weapons. His team and some colleagues from the Swiss Federal Institute for NBC-Protection (Spiez Laboratory) and Kings College London recently published the results of a thought experiment in which his team used a pre-existing machine learning model for bioactivity and rat acute LD50 (in order to identify toxic molecules) along with a generative algorithm trained on the ChEMBL database to develop molecules with the aim of assessing if the algorithm could generate molecules similar to nerver agent VX. In just 6 hours the generative algorithm produced 40,000 molecules including VX, related known analogues as well as thousands of other molecules that looked similar.

The work, published in March 2022 in Nature Machine Intelligence, garnered a lot of global and media attention allowing Ekins to both present the team’s work to various audiences, but also the story of the work and how they learned a lot about the potential for misuse that this technology allows [5]. Ekins mentioned that the number of companies using AI for drug discovery is growing and AI is used across many sectors such as consumer products (e.g., to identify for non-animal testing options for cosmetics) and animal health (e.g., to discover cost effective new treatments). He emphasized the importance of security and ethics to control misuse of AI.

Artificial, Augmented and Automated chemistry

Jeremy Frey of the University of Southampton, UK spoke about machine learning in chemistry and how it is useful for a host of applications to include drug design, materials design, complex mixture and formulation design, and device design. However, large amounts of quality data are required for machine learning to be successful. Obtaining these data is a major driving force for the development of new synthetic and characterisation techniques. In posing the question of whether scientific discovery can be automated, Frey indicated that he thought so, but that automation is just one piece to the puzzle. Frey also highlighted that despite the opportunities and efficiencies that artificial, augmented, and automated chemistry may offer, that students should not forsake traditional chemistry. These computer-aided techniques and approaches are only tools, and the human mind will still be needed in these processes (i.e., keeping the human in the loop).

Use of neural networks and symbolic AI in chemical synthesis planning

Marwin Segler of Microsoft Corp explained how the AI/machine learning landscape has evolved in last few years through different stages, from automated synthesis planning and molecular design, to property prediction of new molecules, to readily available open-source code and cloud computing. He suggested that AI should be used for designing property profiles, not compounds per se. In reviewing the history of computer-aided synthesis planning, Segler noted that AI can serve as an idea generator for competent chemists and software is openly available [6]. Some limitations of current approaches were also highlighted, such as: the difficulties in predicting reaction conditions, experimental success and scale-up, and the subsequent limitations of reaction models. However, these will continue to improve with more data and combining this with automated approaches, such as robot-assisted synthesis/experimentation.

Segler believes that, in the next 5-10 years, we should expect to see:

  1. Synthesis planning, reaction optimization and new molecule generation using AI [7]

  2. An increased ability to identify new targets and binding sites

  3. More accurate atomistic simulations

  4. Algorithms better at extrapolation

He concluded by saying that chemistry expertise will still be needed in the future, but tools like AI will make some of these complex undertakings more accessible.

Application of Artificial Intelligence in Agricultural Chemistry and Agriculture

Bipul Saha of the Sagar Group of Institutions in India spoke about the application of AI in the discovery and synthesis of safe and highly effective crop protection chemicals, listing a number of recent use-cases such as the discovery of a new selective herbicide for sorghum (Benquitrione). AI can also be used in the prediction of risk of resistance and toxicity of new molecules, saving time and money when developing a new agricultural chemical. When it comes to formulation development, machine learning can be used for prediction of physical stability, dispersion rate and stability of emulsion, etc.

 
          Participants in the International Workshop on “Artificial Intelligence Assisted Chemistry” held at OPCW in The Hague, The Netherlands on 16-17 June 2022. Facilitations for remote presentations and attendance were also provided.

Participants in the International Workshop on “Artificial Intelligence Assisted Chemistry” held at OPCW in The Hague, The Netherlands on 16-17 June 2022. Facilitations for remote presentations and attendance were also provided.

Saha noted that the population of the world is expected to grow to 9.7 billion by the year 2050. To feed this population, agricultural output must increase by at least 50%. At the same time, availability of agricultural land is decreasing due to urbanization and other factors. There is also the increased potential for substantial loss of crops due to pests, crop diseases and weeds.

AI can therefore play an important role in areas such as:

  1. Early diagnosis of disease and fungal growth in crops to reduce loss of yield

  2. Monitoring crop heath to diagnose nutrient deficiency, moisture content, and pest damage

  3. Advising farmers on the right time to sow fields for maximum yield

  4. Helping farmers in selective removal of weeds to reduce cost and environmental impact

  5. Reducing labor needs by incorporating autonomous robots in crop harvesting

  6. Predicting crop yields and forecasting prices to help empower farmers

One recent example of the impact of AI is in the collaboration in India between the International Crops Research Institute for the Semi-Arid Tropics (ICRISAT) and Microsoft Corp to generate sowing advice for farmers resulting in a 30% increase in yield. There are other companies and organisations in India looking to harness the power of AI to strengthen the agricultural industry.

Self-Optimising Reactors for Industry 4.0 Process Development

Richard Bourne of the University of Leeds spoke on self-optimising reactors. He indicated that, compared to conventional methods, self-optimizing reactors:

  1. Provide a cheaper, faster and greener route to reaction optimization

  2. Enhance researcher capabilities by removing the need for labour intensive experimentation

  3. Align with growing use of flow chemistry in the chemicals and pharmaceuticals industry

Bourne highlighted an example from his laboratory in which 46 experiments were conducted in 37 hours using only 200 mg of sample per experiment, allowing for the most efficient reaction conditions to be determined [8]. This experiment highlights the efficiency a self-optimising reactor can realise.

Bourne also identified some potential risks and opportunities for OPCW in automated DMTA (design, make, test and analysis) cycles. It is clear that automated DMTA can positively impact the pharmaceutical industry, but could it also be used maliciously in CW research? It may also prove beneficial in developing automated processes to chemical destruction and remediation and could allow for more remote processes which tend to be inherently safer to human life.

Bourne in summary explained that automation of flow chemistry for process development will enable enhanced productivity, reproducibility, data analysis, and exploration of chemical space.

Sustainability Implications of AI in the Chemical Industry

Prof Yuan Yao of Yale University began by explaining how AI is helping society achieve the United Nations Sustainable Development Goals. There are also sustainability implications of AI in the chemical industry. Yao and her team have reviewed relevant literature and categorized their findings by applications, including AI applications to R&D, unit operations, chemical plants and supply chains. Most studies assessed the energy, economic, and safety implications of AI applications, while relatively few have looked at the environmental impacts of AI [9]. Previous studies also did not consider the impacts of AI systems themselves (e.g., AI infrastructure). As AI has received increasing interest in the chemical industry, it is important to evaluate the potential environmental, economic, and societal impacts of different AI applications.

Yao presented a conceptual framework to assess the sustainability impacts of AI, which includes approaches from industrial ecology, economics, and engineering. As an emerging tool, AI can also be used to support a better understanding of the sustainability of chemicals and materials. This was illustrated with a case study integrating machine learning and chemical process simulation for bio-based materials [10]. Similar applications were identified for bioenergy and other bio-based chemicals [11].

Use of AI in molecular design, reaction planning and reaction execution

Connor Coley of the Massachusetts Institute of Technology gave the final invited talk of the workshop. The Coley Research group focuses on developing new computational strategies to accelerate molecular discovery, with particular emphasis on small molecule therapeutic discovery. Among the major themes of the group are computer-aided molecular design and data-driven predictive chemistry. He noted that most molecular discovery workflows proceed iteratively through design-make-test-analyze cycles, exemplified by lead optimization in the pharmaceutical industry.

Table 1

Workshop participants (* indicates speaker, 1 indicates IUPAC representative, 2 indicates SAB member)

Name Affiliation
Crister Åstot2 FOI, Sweden
Khaldoun Bachari2 Algerian Public Scientific and Technical Research Centre in The Physico-Chemical-CRAPC, Algeria
Renate Becker-Arnold2 BASF, Germany
Elma Biscotti2 Scientific and Technical Research Institute for Defense, Argentina
Anne Bossée2 DGA CBRN Defense, France
Richard Bourne* Leeds University, United Kingdom
Thomas Chamberlain Leeds University, UK
Connor Coley* Massachusetts Institute of Technology, USA
Stefano Costanzi American University, USA
Vladimir Dimitrov2 Bulgarian Academy of Sciences, Bulgaria
Sean Ekins* Collaborations Pharmaceuticals, Inc, USA
Raza Ellahi2 Defence Science & Technology Organization, Pakistan
Jonathan Forman1 Pacific Northwest National Laboratory, USA
Jeremy Frey*1 University of Southampton, UK
Hemda Garelick1 Middlesex University, UK
Vladimir Gubala1 University of Kent, UK
Matteo Guidotti1,2 National Research Council, Italy
Peter Hotchkiss Organisation for the Prohibition of Chemical Weapons
Wilford Jwalshik2 Institute of Chartered Chemists of Nigeria, Nigeria
Robert Kristovich2 United States Army DEVCOM Chemical Biological Center, USA
Andrea Leisewitz2 Universidad San Sebastián, Chile
Imee Martinez2 University of the Philippines-Diliman, Philippines
Pia Mueller Leeds University, UK
Elisa Souza Orth2 University of Paraná, Brazil
Günter Povoden2 European Union CBRN Risk Mitigation Centres of Excellence, Austria
Ines Primožič2 University of Zagreb, Croatia
Ahmed Saeed2 Sudan University of Science and Technology, Sudan
Bipul Behari Saha*1 Sagar Group of Institutions, India
Marwin Segler* Microsoft Corporation
Yuan Yao* Yale University, USA

Coley noted that the canonical approach to computer-aided molecular design is virtual screening. In virtual screening, large pre-enumerated libraries of molecules are evaluated and the ones with the most favorable values are advanced to experimental testing. A second approach to computer-aided molecular design is generation. Generative models perform a computational optimization by proposing new molecules, estimating their performance (typically with a QSPR model) and revising the rules by which they generate molecules to bias generation towards higher-performing candidates.

For either approach, synthesis is an essential consideration. Virtual libraries are often “make-on-demand” libraries enumerated using chemical transformation rules we believe to be robust and generative models produce new compounds for which we must plan synthetic routes. He said that rational molecular design is an iterative, multi-objective optimization, and computation can accelerate discovery by making it easier to generate and test hypotheses. He further mentioned that although AI can help us to design molecules and develop retro-synthetic pathways, it can’t do everything and anything [12]. He raised the following questions:

  1. What catalysts, solvents, reagents, temperature do we use?

  2. What concentrations and reaction time do we use?

  3. What kind of flask/vessel do we use?

  4. What order do we add species in, and how quickly or slowly do we add?

  5. For multi-step sequences, what isolation steps are required?

In conclusion, Coley remarked that AI-assisted chemistry is changing how researchers approach the design and discovery of new functional molecules. The dual use concern is obvious but should not be overblown. Computational predictions of molecular properties are just that—predictions—and one should always consider what a model’s domain of applicability and uncertainty is, or whether biases during training are influencing its predictions. Computer-aided retrosynthesis to medchem-like compounds works well for data-driven programs; the distribution shift when working with small, highly-functionalized compounds presents a challenge (e.g., fentanyl is easy, nerve agents are hard).

Moderated Discussions

In addition to the invited speakers, there were several moderated discussion sessions to probe particpants’ thoughts on where the use of AI in the sciences is headed and what opportunities and concerns these may present. Discussions were moderated by Hemda Garelick, Vladimir Gubala and Jonathan Forman. These discussions probed several areas of relevance to the work of the OPCW, including synthesis automation, AI in Chemical Safety and Security Management, AI and sustainable technologies and green chemistry, Life Cycle analysis, and AI ethics.

Conclusion and Next Steps

All participants appreciated the workshop program. AI already plays an important role in the field of chemistry, and its adoption and role in the chemical sciences will only expand. Noting this, we should focus on developing positive applications and aspects of AI.

While limitations remain, AI-based synthesis planning makes continued progress. The conversation should hasten with AI ethics institutes and other experts regarding guidance for ethical guidelines to prevent the misuse of AI in the sciences—the Hague Ethical Guidelines provides a framework from which to have these deliberations [13]. Increased training and awareness of scientists to recognize potential for dual-use concerns of AI-assisted chemistry will be critical. Increased security around the software methods and tools that enable AI-assisted chemistry may be appropriate.

www.opcw.org/resources/science-and-technology

Über die Autoren

Peter Hotchkiss

Peter Hotchkiss <> is the science policy adviser at the Organisation for the Prohibition of Chemical Weapons and secretary to the OPCW’s Scientific Advisor Board. ORCID 0000-0002-1229-1090.

Hemda Garelick

Hemda Garelick, Prof of Environmental Science and Public Health Education at Middlesex University, London, UK. Past President of IUPAC Div VI ‘Chemistry and the Environment’. ORCID 0000-0003-4568-2300

Russell Boyd

Russell Boyd is the Alexander McLeod Professor Emeritus in computational chemistry at Dalhousie University. ORCID 0000-0001-6802-2976.

Jonathan E Forman

Jonathan Forman is a Science and Technology Advisor at Pacific Northwest National Laboratory in the USA. He formerly served as the OPCW Science Policy Adviser from 2013-2019. ORCID 0000-0002-5379-812X 

Referencers

1. https://iupac.org/100/stories/a-partnership-of-science-and-diplomacy-to-eliminate-chemical-weapons/ Search in Google Scholar

2. Mark Cesa, et al. “OPCW-IUPAC Workshop on Innovative Technologies for Chemical Security” Pure Appl. Chem., vol. 90, no. 10, 2018, pp. 1501-1506. https://doi.org/10.1515/pac-2018-070110.1515/pac-2018-0701Search in Google Scholar

3. Building Broader and Deeper Links Between OPCW and IUPAC, poster (2019) https://www.opcw.org/sites/default/files/documents/2019/07/OPCW-IUPAC%20Cooperation.pdfSearch in Google Scholar

4. >https://www.opcw.org/sites/default/files/documents/Industry/OPCW_IUPAC_Signed_MoU_01Dec2016.pdf or https://iupac.org/iupac-opcw-take-partnership-new-level/ (News 1 Dec 2016)Search in Google Scholar

5. Urbina, F., Lentzos, F., Invernizzi, C. et al. A teachable moment for dual-use. Nat Mach Intell 4, 607 (2022). https://doi.org/10.1038/s42256-022-00511-6.10.1038/s42256-022-00511-6Search in Google Scholar PubMed PubMed Central

6. Segler, Preuss, Waller, Nature, 2018, (555), 604–610. 10.1038/nature25978Search in Google Scholar PubMed

7. Strieth-Kalthoff, Sandfort, Segler, Glorius, Chem. Soc. Rev., 2020, 49, 6154-6168.10.1039/C9CS00786ESearch in Google Scholar

8. Andrew J. Parrott, Richard A. Bourne, Geoffrey R. Akien, Derek J. Irvine, Martyn Poliakoff, 2011, Angew. Chem. Int. Ed. 50(16), 3788-3792. https://doi.org/10.1002/anie.201100412.10.1002/anie.201100412Search in Google Scholar PubMed

9. Liao, M., Lan, K and Y. Yao (2021). Sustainability implications of artificial intelligence in the chemical industry: A conceptual framework. Journal of Industry Ecology. 26: 164–182. https://doi.org/10.1111/jiec.13214.10.1111/jiec.13214Search in Google Scholar

10. Liao, M.; Kelley, S.; Yao, Y. ACS Sustainable Chem. Eng. 2020, 8, 2, 1252-1261. https://pubs.acs.org/doi/full/10.1021/acssuschemeng.9b06522.10.1021/acssuschemeng.9b06522Search in Google Scholar

11. Liao, M.; Yao, Y. GCB-Bioenergy. 2021, 13, 5, 774-802. https://onlinelibrary.wiley.com/doi/full/10.1111/gcbb.12816.10.1111/gcbb.12816Search in Google Scholar

12. Coley, C. W. Trends in Chemistry. DOI: 10.1016/j.trechm.2020.11.004 (2020).10.1016/j.trechm.2020.11.004Search in Google Scholar

13. >https://www.opcw.org/hague-ethical-guideli Search in Google Scholar

Online erschienen: 2023-08-14
Erschienen im Druck: 2023-07-01

© 2023 IUPAC & De Gruyter. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License. For more information, please visit: http://creativecommons.org/licenses/by-nc-nd/4.0/

Downloaded on 8.10.2025 from https://www.degruyterbrill.com/document/doi/10.1515/ci-2023-0314/html
Scroll to top button