Pharma companies are collaborating to boost the power of artificial intelligence (AI) in drug discovery by allowing access to proprietary structural data to train a large language model. Each of the partners is contributing data from several thousand experimentally determined protein:ligand interactions, creating one of the most diverse datasets and the richest chemistry assembled to date for model training.