4arXiv cs.AI (Artificial Intelligence)·1mo ago

INSHAPE: Instance-Level Shapelets for Interpretable Time-Series Classification

INSHAPE is a new interpretable time-series classification framework that discovers variable-length discriminative temporal patterns specific to individual instances rather than across the full dataset population. It models temporal dependencies among non-overlapping segments and bridges local and global interpretability via a bottom-up aggregation into prototypical shapelets. Evaluated on 128 UCR and 30 UEA benchmark datasets, INSHAPE outperforms state-of-the-art shapelet-based methods while offering more intuitive instance-level explanations.

Evaluation and Benchmarking UEA Time Series Classification Repository shapelet-based time-series classification UCR Time Series Archive INSHAPE

Related guides (1)

Evaluation and BenchmarkingTopic guide

Evaluation and Benchmarking: How We Measure AI — and Why It Keeps Getting Harder

Read asBeginner In-depth

Related events (8)

3Hugging Face Blog·1mo ago·source ↗

Multivariate Probabilistic Time Series Forecasting with Informer

A Hugging Face blog post introduces the Informer model for multivariate probabilistic time series forecasting. The post covers the architecture and usage of Informer, which uses a sparse attention mechanism (ProbSparse) to handle long sequences more efficiently than standard Transformers. It demonstrates how to use the model via the Hugging Face Transformers library for forecasting tasks.

Agent and Tool Ecosystem Informer Hugging Face Transformers ProbSparse Attention +1 more

5arXiv · cs.LG·23d ago·source ↗

AREA: Attribute Extraction and Aggregation for CLIP-Based Class-Incremental Learning

AREA is a new method for CLIP-based Class-Incremental Learning (CIL) that decomposes the classification process into attribute extraction and aggregation stages to combat catastrophic forgetting. Extraction is stabilized by anchoring visual and textual attributes on a hyperspherical embedding space via principal geodesic analysis, while aggregation uses lightweight task-specific experts regularized by a variational information bottleneck. Inference employs optimal transport routing over task attribute manifolds. The method is reported to consistently outperform state-of-the-art CIL approaches and is accepted at ICML 2026.

Evaluation and Benchmarking Multimodal Progress Optimal Transport AREA Principal Geodesic Analysis +5 more

6Berkeley Ai Research (Bair) Blog·1mo ago·source ↗

SPEX and ProxySPEX: Scalable Interaction Discovery for LLM Interpretability

Researchers from BAIR introduce SPEX (Spectral Explainer) and ProxySPEX, algorithms for identifying influential feature, data, and model-component interactions in LLMs at scale. The approach exploits sparsity, low-degreeness, and hierarchy properties to reframe interaction discovery as a sparse recovery problem using tools from signal processing and coding theory. ProxySPEX achieves comparable performance to SPEX with roughly 10x fewer ablations by leveraging hierarchical structure. The methods are evaluated on feature attribution (sentiment analysis), data attribution, and mechanistic interpretability tasks, outperforming marginal methods like LIME at long context lengths.

Long Context Evolution Evaluation and Benchmarking GPT-4o mini Faith-Shap LIME +5 more

7arXiv · cs.LG·9d ago·source ↗

Interpretability-based pipeline for auditing and shaping post-training learning signals

Researchers introduce a data-centric post-training pipeline that applies interpretability methods to preference datasets before optimization, surfacing latent concepts that separate preferred from dispreferred generations. The approach unifies several interpretability-based training protocols as feature or data interventions that shape reward signals. Empirically, the pipeline diagnoses undesirable signals such as sycophancy and over-stylization, mitigates off-target learning, and can amplify desired properties like safety behaviors and model personality. The work reframes post-training from opaque scalar reward optimization into an auditable, concept-level sculpting process.

Evaluation and Benchmarking AI Safety Research Anatomy of Post-Training: Using Interpretability to Characterize Data and Shape the Learning Signal +1 more

4arXiv · cs.AI·11d ago·source ↗

Pose-ICL: 3D-aware in-context learning for pose-controllable image generation of custom subjects

Researchers introduce Pose-ICL, a tuning-free framework for generating images of user-specified subjects with accurate pose control. The method uses Surface-Anchored Position Embedding (SAPE) to give 2D diffusion models explicit 3D awareness by anchoring image tokens to volumetric bounding box surface coordinates. Evaluations on 3D assets and real-world subjects show improvements over existing methods in both pose accuracy and identity consistency. The framework is designed for compatibility with existing Diffusion Transformer (DiT) models.

Multimodal Progress Surface-Anchored Position Embedding Pose-ICL Pose-ICL: 3D-Aware In-Context Learning for Pose-Controllable Subject Customization

5arXiv · cs.AI·4d ago·source ↗

Internal Oppenheim-Lim test reveals phase/sign identity codes shared across image classifier architectures

A new arXiv preprint applies a causal intervention inspired by Oppenheim and Lim (1981) to probe whether trained image classifiers encode identity in Fourier phase rather than magnitude within their hidden layers. By transplanting phase or sign components between images at chosen layers in PRISM2D, GFNet, ViT-B/16, and ResNet-50, the authors find that predictions follow the phase/sign donor across all tested architectures, with image-specific magnitude largely dispensable. ResNet-50 requires a pre-ReLU intervention to reveal a latent sign code, exposing how rectification and readout geometry shape the basis in which the code is expressed. The findings offer a mechanistic account of the texture–shape gap between CNNs and attention-based models.

Evaluation and Benchmarking ViT-B/16 GFNet PRISM2D +2 more

4Hugging Face Blog·1mo ago·source ↗

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Hugging Face introduces SetFitABSA, an extension of the SetFit few-shot learning framework for Aspect-Based Sentiment Analysis (ABSA). The approach enables fine-grained sentiment classification at the aspect level with minimal labeled data. This builds on SetFit's contrastive sentence-transformer training paradigm, adapting it to the structured ABSA task of identifying sentiment toward specific aspects within text.

Agent and Tool Ecosystem Aspect-Based Sentiment Analysis SetFitABSA SetFit +1 more

5arXiv · cs.AI·1mo ago·source ↗

IVGT: Implicit Visual Geometry Transformer for Neural Scene Representation

IVGT is a new neural architecture that implicitly models continuous 3D geometry from unposed multi-view images without requiring explicit pointmap regression. It learns a continuous neural scene representation in a canonical coordinate system, supporting SDF-based surface queries and color prediction via lightweight decoders. The model is trained with multi-dataset joint optimization using 2D supervision and 3D geometric regularization, achieving strong generalization across mesh reconstruction, novel view synthesis, depth/normal estimation, and camera pose estimation tasks.

Frontier Model Releases Multimodal Progress IVGT Signed Distance Function (SDF)Neural Radiance Field (NeRF)+1 more