Large Language Models in Life Sciences

Project Charter

Explore the use of Large Language Models (LLMs) for biological research and define best practices for doing so, using target discovery and validation as the initial use case.

The Challenge

The use of Large Language Models (LLMs), such as GPT-4, presents a transformative opportunity for pharmaceutical R&D, particularly in target discovery and validation. Target discovery—a foundational process in drug development—requires the synthesis of large, complex datasets and the integration of proprietary research within the broader context of public information.

Proposed Approach:
We aim to harness prompt-tuned LLMs combined with Retrieval-Augmented Generation (RAG) methodologies to generate plain-English answers to typical target discovery questions. By focusing on highly structured public datasets, this project will establish a scalable, open-source pipeline tailored for the demands of target discovery.
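As a rough illustration of the retrieval-augmented pattern described above, the sketch below retrieves a few rows from a structured dataset and places them in a prompt that asks for a plain-English answer. It is a minimal sketch under stated assumptions, not the project's pipeline: the `Record` schema, the placeholder rows (`GENE_A`, `disease_x`, and so on), the token-overlap scoring, and the prompt wording are all hypothetical, and no model is actually called.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """One row of a structured dataset (hypothetical schema)."""
    target: str
    disease: str
    evidence: str


# Placeholder rows; the names are deliberately fictitious.
RECORDS = [
    Record("GENE_A", "disease_x", "genetic association"),
    Record("GENE_B", "disease_x", "differential expression"),
    Record("GENE_C", "disease_y", "genetic association"),
]


def tokenize(text):
    """Lowercase the text and split it into a set of word-like tokens."""
    return set(re.findall(r"[a-z0-9_]+", text.lower()))


def retrieve(question, records, k=2):
    """Return up to k records that share at least one token with the question.

    Token overlap stands in for a real retriever (for example, embedding
    search); it is an assumption made to keep the sketch self-contained.
    """
    q_tokens = tokenize(question)
    scored = [
        (len(q_tokens & tokenize(f"{r.target} {r.disease} {r.evidence}")), r)
        for r in records
    ]
    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [r for score, r in ranked if score > 0][:k]


def build_prompt(question, records):
    """Assemble a prompt that restricts the answer to the retrieved rows."""
    context = "\n".join(
        f"- {r.target} | {r.disease} | {r.evidence}" for r in records
    )
    return (
        "Answer in plain English using only the records below.\n"
        f"Records:\n{context}\n"
        f"Question: {question}\n"
    )


question = "Which targets are linked to disease_x?"
hits = retrieve(question, RECORDS)
prompt = build_prompt(question, hits)
print(prompt)
```

In a real pipeline the assembled prompt would be sent to an LLM, and the retrieved rows could be kept alongside the answer so that each statement can be traced back to its source record.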

Key Objectives:

  1. Guidelines for LLM Integration: Develop a robust framework outlining the most effective strategies for deploying LLMs in target discovery, ensuring reproducibility and transparency.
  2. Open-Source Innovation: Deliver a practical, community-driven pipeline leveraging LLMs for target discovery, reducing redundancy and promoting collaborative advancements in biological research.

Why Target Discovery?
Target discovery is relevant across pharmaceutical R&D and exemplifies the challenges LLMs can address: mining vast and intricate datasets to produce actionable insights. Solving these challenges paves the way for broader applications of LLMs in scientific and industrial research.

Through this initiative, we aim to define the role of LLMs in pre-competitive research, demonstrating their potential to accelerate drug discovery and enhance collaboration across the life sciences sector.

Get involved

Talk to our project managers to learn more and get involved

Project Supporter

  • AbbVie