Deep Learning with Proteins
A Hugging Face blog post covering the application of deep learning techniques to protein science, likely covering protein language models, structure prediction, and related tooling. Published in late 2022, this sits in the context of AlphaFold2's impact and the emerging ecosystem of protein ML models. The post likely surveys models, datasets, and frameworks available for computational biology on the Hugging Face platform.
Related guides (3)
Related events (8)
AlphaFold: Five Years of Impact
DeepMind published a retrospective on AlphaFold's five-year impact on biological research and scientific discovery. The post surveys how the protein structure prediction system has accelerated science globally since its initial release. As a tier-1 source anniversary piece, it likely highlights cumulative usage statistics, downstream research enabled, and future directions.
ESMFold2: The Bitter Lesson is Coming for Proteins — Alex Rives, BioHub
A Latent Space interview/commentary piece featuring Alex Rives of BioHub discussing ESMFold2 and the application of the 'bitter lesson' (scale and general methods beating hand-crafted inductive bias) to protein structure prediction and biology. The piece covers the tension between dataset scale versus domain-specific inductive bias in biological ML, and touches on world models and programmable biology. This represents a significant perspective from a leading researcher in protein language models on the next generation of biological foundation models.
Hugging Face Teams Up with Protect AI: Enhancing Model Security for the ML Community
Hugging Face has announced a partnership with Protect AI to improve security for machine learning models hosted on the platform. The collaboration aims to address vulnerabilities in model files and supply chain risks that affect the broader ML community. Specific details about the technical implementation and scope of the security enhancements are not provided in the available content.
AlphaFold Reveals Structure of Key Heart Disease Protein
DeepMind has used AlphaFold to determine the structure of a key protein implicated in heart disease. The announcement highlights a new scientific application of AlphaFold's protein structure prediction capabilities to cardiovascular research. This represents a continued expansion of AlphaFold's impact on biomedical discovery beyond its initial structural biology applications.
Hugging Face Machine Learning Demos on arXiv
Hugging Face announced an integration allowing ML demos to be linked or embedded directly on arXiv paper pages. This lowers the barrier between research publication and interactive model demonstration. The feature connects academic papers to live Spaces or model demos hosted on Hugging Face.
Accelerating Protein Language Model ProtST on Intel Gaudi 2
A Hugging Face blog post details the acceleration of ProtST, a protein language model, on Intel's Gaudi 2 AI accelerator hardware. The post covers the technical integration and performance results of running this specialized biological ML model on Gaudi 2. This represents an intersection of domain-specific AI (protein modeling) and alternative AI hardware ecosystems.
PLAID: Repurposing Protein Folding Models for Multimodal Protein Generation with Latent Diffusion
PLAID is a generative model that simultaneously produces protein 1D sequences and 3D all-atom structures by learning a diffusion model over the latent space of ESMFold, a protein folding model. It requires only sequence data for training—leveraging databases 2-4 orders of magnitude larger than structure databases—and decodes structure at inference via frozen folding model weights. The approach supports compositional prompting by function and organism, addressing practical drug-design constraints like humanization and solubility. A companion compression model, CHEAP, addresses the high-dimensionality of transformer latent spaces to make the diffusion training tractable.
DeepSeek releases DeepSeek-OCR vision-language model on Hugging Face
DeepSeek has released DeepSeek-OCR, a multilingual image-text-to-text model on Hugging Face, built on the DeepSeek-VL-v2 architecture. The model targets OCR and image feature extraction tasks and has accumulated over 2.4 million downloads and 3,275 likes, indicating significant community uptake. This represents an open-weights multimodal release from a major Chinese AI lab.


