5GitHub Trending (AI/LLM filtered)·29d ago

TimesFM: Google Research Pretrained Time-Series Foundation Model

TimesFM is a pretrained foundation model developed by Google Research specifically for time-series forecasting tasks. The repository has accumulated over 20,000 GitHub stars with 99 new stars today, indicating sustained community interest. It represents Google Research's effort to apply the foundation model paradigm to time-series data rather than language or vision.

Frontier Model Releases Enterprise Deployment Patterns Google Research TimesFM

Related guides (2)

Frontier Model ReleasesTopic guide

Frontier Model Releases: The Race From Language to Action

Read asBeginner In-depth

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Related events (8)

7arXiv · cs.AI·1mo ago·source ↗

Toto 2.0: Open-Weights Time Series Foundation Models Demonstrate Scaling Laws from 4M to 2.5B Parameters

Datadog releases Toto 2.0, a family of five open-weights time series forecasting models ranging from 4M to 2.5B parameters, demonstrating consistent forecast quality improvements with scale. The models achieve state-of-the-art results on three benchmarks: BOOM (observability), GIFT-Eval (general-purpose), and TIME (contamination-resistant). The release includes architectural details, a u-muP hyperparameter transfer pipeline, and all base checkpoints under Apache 2.0 license.

Training Infrastructure Frontier Model Releases Toto 2.0 GIFT-Eval TIME +5 more

7arXiv · cs.AI·29d ago·source ↗

Foundation Model for Wearable Health Data Pretrained on 1 Trillion Minutes from 5 Million Participants

Researchers propose a large-scale foundation model for wearable health data, pretrained on over one trillion minutes of unlabeled sensor signals from five million participants. The model demonstrates systematic performance improvements across 35 health prediction tasks spanning cardiovascular, metabolic, sleep, and mental health domains, with joint scaling of model capacity and data volume. A 'classroom' of LLM agents autonomously searches downstream predictive head configurations, and the resulting embeddings are integrated into a Personal Health Agent validated by 1,860 clinician ratings. The work establishes label-efficient few-shot learning and generative capabilities for daily health metric estimation.

Frontier Model Releases Evaluation and Benchmarking LLM Agent Classroom Personal Health Agent few-shot learning +4 more

4Hugging Face Blog·1mo ago·source ↗

Probabilistic Time Series Forecasting with Transformers

This Hugging Face blog post introduces probabilistic time series forecasting using Transformer-based models available in the Hugging Face ecosystem. It covers the application of attention-based architectures to sequential prediction tasks with uncertainty quantification. The post serves as a tutorial and capability demonstration for time series modeling within the Transformers library.

Agent and Tool Ecosystem Probabilistic Time Series Forecasting Hugging Face Transformers Hugging Face

5arXiv · cs.LG·19d ago·source ↗

TxFM: Masked Autoencoding Foundation Model for RNA-seq Gene Expression Representation

The paper introduces TxFM, a self-supervised masked autoencoder model for transcriptomic (RNA-seq) data representation learning, trained on a curated 1.4M-sample corpus called DiverseRNA-1.4M. TxFM outperforms existing transcriptomic foundation models trained on datasets over 100x larger, addressing the known problem of deep models underperforming linear baselines on gene expression data. The work provides ablation studies identifying critical architecture choices and argues that careful data curation combined with inductive self-supervised learning is sufficient for strong transfer performance in transcriptomics.

Evaluation and Benchmarking DiverseRNA-1.4M masked autoencoding RNA sequencing +2 more

4arXiv · cs.AI·5d ago·source ↗

Benchmark of deep learning architectures for multi-horizon behavioural forecasting in mobile health

A new arXiv preprint benchmarks six deep learning architectures, two zero-shot foundation models, and statistical baselines on multi-horizon behavioural forecasting from wearable and smartphone data across 800+ participants. Key findings include: no single architecture dominates (PatchTST leads among trained models), TimesFM matches or exceeds trained models zero-shot especially in low-data regimes, and participant-level fine-tuning reduces per-feature RMSE by 16–60%. The study is the first to jointly evaluate modern deep learning, foundation models, and personalisation for this domain.

Evaluation and Benchmarking A Comparative Study of Deep Learning Architectures for Multi-Horizon Behavioural Forecasting for Mobile Health TimesFM TCN +1 more

4Hugging Face Blog·1mo ago·source ↗

PatchTSMixer in HuggingFace

Hugging Face introduces PatchTSMixer, a lightweight MLP-Mixer-based model for multivariate time-series forecasting, now available in the Transformers library. The model is designed for efficient patch-based mixing of temporal and channel information. This integration expands Hugging Face's time-series modeling capabilities alongside the previously added PatchTST model.

Agent and Tool Ecosystem Hugging Face Transformers Hugging Face MLP-Mixer +2 more

6arXiv · cs.AI·29d ago·source ↗

Temporally Ordered Pre-training Improves LLM Factual Freshness (Kairos)

Researchers from Kyutai pre-train 6B-parameter models on temporally ordered Common Crawl snapshots and compare them against standard shuffled pre-training baselines. They introduce a benchmark of over 7,000 temporally grounded questions to evaluate whether models correctly associate facts with their corresponding time periods. Results show sequentially trained models match shuffled baselines on general language understanding while exhibiting more up-to-date and temporally precise factual knowledge. Code, checkpoints, and datasets are released under the Kairos project.

Training Infrastructure Frontier Model Releases Kyutai Common Crawl temporally ordered pre-training +3 more

5arXiv · cs.AI·1mo ago·source ↗

Distilling Tabular Foundation Models for Structured Health Data

This paper investigates knowledge distillation from tabular foundation models (TFMs) to lightweight student models for healthcare applications. The authors address context leakage in in-context TFMs via stratified out-of-fold teacher labeling, evaluating across 19 healthcare datasets, 6 TFM teachers, and 4 student families. Distilled students retain at least 90% of teacher AUC while running 26× faster on CPU, with preserved calibration and fairness properties. Multi-teacher ensembles do not consistently outperform the best single teacher.

Evaluation and Benchmarking Inference Economics knowledge distillation Stratified Out-of-Fold Teacher Labeling AUC +2 more