Interconnects interviews Finbarr Timbers on frontier post-training recipes
Interconnects (Nathan Lambert) publishes interview #18 with Finbarr Timbers reviewing frontier post-training recipes. The conversation likely covers RLHF, preference optimization, and related techniques used by leading labs. Timbers is a practitioner with direct experience in post-training at frontier scale.
Related guides (2)
Related events (8)
AINews: How to Land a Job at a Frontier Lab (on Pretraining)
A Latent Space AINews digest published on a quiet day before Google I/O highlights a notable blog post about landing jobs at frontier AI labs, with a focus on pretraining. The piece appears to surface career and technical insights relevant to the pretraining domain at major AI organizations. The timing suggests it is a low-activity news day filler ahead of a major industry event.
Interconnects commentary on Claude Fable 5 and AI safety power politics
Nathan Lambert's Interconnects newsletter analyzes Claude Fable 5 and what he frames as new 'AI safety fables,' examining the power politics surrounding frontier AI systems. The piece appears to engage with Anthropic's model releases and safety narratives in a critical or interpretive frame. As a tier-2 commentary source, this reflects ongoing discourse about how frontier labs construct and communicate safety claims.
20x Faster TRL Fine-tuning with RapidFire AI
RapidFire AI claims to achieve 20x faster fine-tuning throughput using TRL (Transformer Reinforcement Learning library) compared to standard configurations. The announcement appears on the Hugging Face blog, suggesting integration or compatibility with the HF ecosystem. No additional technical details are available from the body of the post, but the claim targets a significant pain point in LLM post-training workflows.
torchtune: PyTorch Native Post-Training Library for LLMs
Meta's PyTorch team introduces torchtune, a PyTorch-native library for post-training LLMs that emphasizes modularity, hackability, and direct access to underlying PyTorch components. The library supports fine-tuning, experimentation, and deployment-oriented workflows across distributed training settings. Benchmarked against popular frameworks Axolotl and Unsloth, torchtune demonstrates competitive performance and memory efficiency while maintaining flexibility for research iteration. The paper presents design principles, model builders, training recipes, and distributed training stack details.
Andon Labs on building frontier evals: VendingBench and evaluating Claude models
Latent Space interviews Lukas Petersson and Axel Backlund of Andon Labs, the creators of VendingBench, about their approach to building real-world AI evaluations. The conversation covers their experience evaluating Claude models across the capability spectrum from Haiku to Mythos, and their methodology for constructing durable frontier evals. The episode is notable for touching on a speculative or unreleased Claude model tier called 'Mythos.'
Opus 4.6, Codex 5.3, and the post-benchmark era
A Interconnects commentary piece examining how to compare frontier AI models in 2026, using Anthropic's Opus 4.6 and OpenAI's Codex 5.3 as case studies. The piece appears to argue that traditional benchmarks are no longer sufficient for distinguishing model capabilities at the frontier. This reflects a broader industry shift toward more nuanced, task-specific evaluation methods.
Import AI 461: Alignment concerns, FrontierCode benchmark, and synthetic research interns
Import AI issue 461 covers three topics: a claim that AI alignment is not on track, a new benchmark or dataset called FrontierCode, and work on synthetic research interns (likely LLM-based agents simulating research assistants). The newsletter is a weekly digest by Jack Clark that synthesizes developments across the AI/ML landscape. The alignment framing and synthetic agent research angle are both substantive signals worth tracking.
LoRA Training Scripts of the World, Unite!
Hugging Face published a blog post consolidating and comparing advanced LoRA fine-tuning scripts for Stable Diffusion XL, covering techniques such as pivotal tuning, custom captions, and various regularization strategies. The post aims to unify fragmented community training approaches into a more coherent set of best practices. It serves as a practical guide for practitioners fine-tuning SDXL models with LoRA adapters.

