Making a Web App Generator with Open ML Models
A Hugging Face blog post demonstrates how to build a web application generator using open-source ML models. The tutorial covers using language models to generate functional web app code from natural language descriptions. This represents an early practical example of code generation pipelines built on open-weights models for end-to-end application development.
Related guides (3)
Related events (8)
Making ML-powered web games with Transformers.js
This Hugging Face blog post demonstrates how to build machine learning-powered web games using Transformers.js, enabling in-browser inference without a server backend. The post covers practical implementation patterns for running transformer models directly in the browser via WebAssembly and WebGL. It serves as both a tutorial and a showcase of client-side ML deployment capabilities.
Open-Source Text Generation & LLM Ecosystem at Hugging Face
Hugging Face published a blog post surveying the open-source LLM ecosystem as of mid-2023, covering text generation models, tooling, and deployment patterns available on the platform. The post highlights the breadth of open-weight models and associated infrastructure for inference and fine-tuning. It serves as a reference overview of the state of open-source LLMs at that point in time.
From OpenAI to Open LLMs with Messages API on Hugging Face
Hugging Face's Text Generation Inference (TGI) now supports an OpenAI-compatible Messages API, enabling developers to switch from OpenAI models to open-weight LLMs with minimal code changes. The integration allows existing OpenAI SDK users to point their client at Hugging Face endpoints by changing only the base URL and model name. This lowers the migration barrier for teams wanting to self-host or use open models while retaining familiar tooling.
Introducing the Synthetic Data Generator - Build Datasets with Natural Language
Hugging Face has launched a Synthetic Data Generator tool that allows users to create datasets using natural language descriptions. The tool is designed to lower the barrier for dataset creation, enabling practitioners to generate training data without writing code. This is relevant to the broader trend of synthetic data as a scalable alternative to manual data collection and annotation.
Practical 3D Asset Generation: A Step-by-Step Guide
A Hugging Face blog post providing a practical walkthrough of AI-based 3D asset generation workflows. The guide covers step-by-step techniques for generating 3D content using machine learning models. This represents applied multimodal/generative AI work targeting creative and game development use cases.
Assisted Generation: a new direction toward low-latency text generation
Hugging Face introduces assisted generation (speculative decoding) as a practical technique for reducing LLM inference latency. The approach uses a smaller draft model to propose token candidates that a larger model then verifies in parallel, enabling multiple tokens to be accepted per forward pass. The blog post explains the mechanism and demonstrates integration into the Hugging Face Transformers library.
3D Asset Generation: AI for Game Development #3
This Hugging Face blog post covers AI-driven 3D asset generation techniques relevant to game development workflows. It is part of a series exploring practical ML applications in game creation pipelines. The post likely surveys current tools and models for generating 3D content from text or image inputs.
GLM-5.1 Open-Weights Model Targets Long-Running Agentic Tasks; Andrew Ng on Coding Agent Acceleration by Software Domain
Z.ai released GLM-5.1, an open-weights mixture-of-experts LLM (754B total / 40B active parameters) designed for sustained agentic coding tasks lasting up to eight hours, featuring iterative planning-execution-evaluation loops with thousands of tool calls. The model claims top open-weights performance on Artificial Analysis Intelligence Index and SWE-Bench Pro, available under MIT license via HuggingFace. The accompanying editorial by Andrew Ng offers a tiered framework for how much coding agents accelerate different software work categories—frontend most, then backend, infrastructure, and research least—with practical implications for team organization. A secondary item references data-center opposition and LLM helpfulness failure modes.


