A white circular icon with lines and nodes representing a network

Datafairy Bioassay Annotation

Project Charter

This project aims to convert unstructured assay protocol descriptions into a high-quality FAIR data set, and create standards for this information.

Process: From Ideation to Delivery

Project Workflow
  • Icon of a lightbulb

    Ideation

  • Icon of 3 people with speech bubbles

    Discussion & Validation

  • An icon of a clipboard with a graph and checklist

    Problem Statement & Business Case Written

  • An icon of a hand with a pound sign

    Member Funded Project

  • An icon of 3 people with stars above their heads

    Project Live

The Challenge

This project will:

  • Revolutionize biotech R&D by standardizing research methods and improving reproducibility.
  • Save time for scientists by reducing assay planning efforts and avoiding failed experiments.
  • Help data scientists harmonize datasets, cleanse data, and enable advanced analytics, ML, and AI.
  • Foster precompetitive collaborations and interoperable scientific data initiatives.
  • Lower internal bioassay curation costs and simplify regulatory submissions.
  • Aid assay kit vendors and CROs by increasing visibility in public databanks like ChEMBL and PubChem.
  • Improve scientific publications’ quality through common assay annotation standards and public data banks.
  • Maximize research funders’ ROI by increasing the value of funded science.

The project do this by addressing key issues:

  • Over 1.4 million assay protocols in publications lack suitable formats for automated mining.
  • Research organizations spend 4–12 weeks per assay selecting, setting up, and validating protocols, often leading to wasted efforts.
  • Current manual curation methods are labor-intensive, while automated systems remain inaccurate.
  • Many organizations already convert unstructured assay protocols into machine-readable forms, duplicating costly efforts.
  • Obsolete assay protocols and evolving technologies hinder reproducibility and historical data interpretation.

What will the project deliver?

  • Share costs for converting published bioassay data into accurate, machine-readable FAIR data using a community-defined model.
  • Develop a FAIR data model based on public ontologies like BioAssay Ontology and promote it as an industry standard.
  • Ensure FAIR data is publicly accessible after a brief exclusivity period for partners.

Get involved

Talk to our project manager to learn more and get involved

Contact Us

Project Supporters

  • GSK logo
  • Abbvie logo
  • Astrazeneca Logo
  • Novartis logo
  • Roche blue logo with hexagonal border on white background
Survey

Labs of the Future Survey 2025

What will the labs of the future look like?

A global survey highlighting the changes in lab technology and development

Read More

Our Events

28 Jan 2025

Get to Know the Pistoia Alliance

Book Now
30 Jan 2025

Partner Webinar – Beyond RAG: The Future of Gen AI and Knowledge Graphs

Book Now
Lab of the Future March 2025 Banner with event details
10 Mar 2025

Partner Event – Lab of the Future USA Congress 2025

Book Now
25 Mar 2025

Pistoia Alliance 2025 London Conference

Book Now
Blue event banner for Lab of the Future conference with event time and location
30 Sep 2025

Partner Event – Lab of the Future Congress Europe 2025

Book Now
Boston Bay image Conference 2025
11 Nov 2025

Pistoia Alliance USA Conference 2025

Book Now