SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
Hugging Face introduces SetFitABSA, an extension of the SetFit few-shot learning framework for Aspect-Based Sentiment Analysis (ABSA). The approach enables fine-grained sentiment classification at the aspect level with minimal labeled data. This builds on SetFit's contrastive sentence-transformer training paradigm, adapting it to the structured ABSA task of identifying sentiment toward specific aspects within text.
Related guides (2)
Related events (8)
SetFit: Efficient Few-Shot Learning Without Prompts
SetFit is a framework for few-shot text classification that fine-tunes Sentence Transformers on small labeled datasets without requiring prompts or large language models. The approach generates contrastive sentence pairs from few examples, fine-tunes a dense embedding model, and then trains a lightweight classifier head. It achieves competitive accuracy with GPT-3-scale models using far fewer parameters and labeled examples.
Sentence Transformers Joins Hugging Face
Sentence Transformers, a widely-used library for generating sentence embeddings and semantic similarity, is officially joining Hugging Face. This integration brings the popular embedding framework under the Hugging Face ecosystem, likely enabling tighter integration with the Hub, datasets, and other HF tooling. The move consolidates a key component of the NLP/embedding pipeline within the dominant open-source AI platform.
Training and Finetuning Sparse Embedding Models with Sentence Transformers
Hugging Face published a tutorial on training and fine-tuning sparse embedding models using the Sentence Transformers library. Sparse embeddings offer an alternative to dense vector representations for retrieval tasks, potentially improving interpretability and efficiency. The post covers the tooling and workflows available in Sentence Transformers for producing sparse encoders suitable for search and RAG pipelines.
SubFit: Submodule-Level Fitted Residual Replacement for LLM Compression
SubFit introduces a post-training LLM compression method that operates at the submodule level (Attention and FeedForward separately) rather than full layers, and selects components non-contiguously. The approach replaces removed submodules with lightweight fitted residual bypasses calibrated on small data. Evaluated across ten LLMs at sparsity levels from 12.5% to 37.5%, SubFit retains 84.6% of dense downstream accuracy at 25% sparsity versus 81.6% for the strongest baseline, while reducing perplexity degradation from 4.34x to 2.42x and delivering measurable inference speedup and KV-cache savings.
Stance Detection in Prediction Market Commentary via Counterfactual Augmentation and Market Context
This paper introduces the first stance detection system applied to prediction market commentary (Polymarket), addressing extreme class imbalance (8.7% anti-market comments) through LLM-driven counterfactual augmentation using the Anthropic API. RoBERTa-base is fine-tuned across a 4×3 ablation covering input configurations and augmentation doses. Key findings: market context is the dominant factor (raising 3-class Anti recall from 0.10 to 0.45), 50% synthetic augmentation is optimal, and full augmentation (100%) consistently degrades performance. Attention-based interpretability supports all three findings mechanistically.
Train and Fine-Tune Sentence Transformers Models
This Hugging Face blog post provides a technical guide on training and fine-tuning Sentence Transformers models for producing dense sentence embeddings. It covers dataset preparation, loss function selection, and training configuration using the sentence-transformers library. The post targets practitioners building semantic search, clustering, or similarity systems.
Temporal Simultaneity Predicts Annotation Quality in Setswana Sentiment Corpora
Researchers present a Setswana sentiment dataset of 3,565 tweets annotated by three native speakers across eight batches, finding that inter-annotator agreement (IAA) declines sharply over time despite an aggregate Kappa of 0.76. The dominant predictor of agreement quality is temporal simultaneity: tweets labeled within one minute achieve κ=0.98 versus κ=0.65 for those labeled more than a day apart. The study also benchmarks multilingual encoders and proprietary models including GPT-5 and Gemini on three-class sentiment classification, with GPT-5 few-shot achieving the best result at 62.2 macro-F1. The dataset, timestamps, and analysis code are released to support reproducible quality auditing for African language NLP.
ActiveSAM: Training-free open-vocabulary segmentation via image-conditional class pruning on SAM 3
ActiveSAM is a training-free, zero-shot inference framework that wraps Segment Anything Model 3 (SAM 3) to perform open-vocabulary semantic segmentation more efficiently. It estimates an image-conditioned active class subset at low resolution before running full-resolution decoding only on retained classes, using bucketed prompt multiplexing and margin-aware background calibration. Across eight benchmarks, it outperforms the prior state-of-the-art SegEarth-OV3 by ~1.4 mIoU on average while running up to 5.5x faster on large-vocabulary datasets, with strong robustness to image corruption relevant to autonomous driving and embodied AI.

