4Hugging Face Blog·1mo ago

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Hugging Face introduces SetFitABSA, an extension of the SetFit few-shot learning framework for Aspect-Based Sentiment Analysis (ABSA). The approach enables fine-grained sentiment classification at the aspect level with minimal labeled data. This builds on SetFit's contrastive sentence-transformer training paradigm, adapting it to the structured ABSA task of identifying sentiment toward specific aspects within text.

Agent and Tool Ecosystem Aspect-Based Sentiment Analysis SetFitABSA SetFit Hugging Face

Related guides (2)

Hugging Face

Hugging Face: The Home of Open-Source AI

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

SetFit: Efficient Few-Shot Learning Without Prompts

SetFit is a framework for few-shot text classification that fine-tunes Sentence Transformers on small labeled datasets without requiring prompts or large language models. The approach generates contrastive sentence pairs from few examples, fine-tunes a dense embedding model, and then trains a lightweight classifier head. It achieves competitive accuracy with GPT-3-scale models using far fewer parameters and labeled examples.

Evaluation and Benchmarking Agent and Tool Ecosystem GPT-3 SetFit Hugging Face +1 more

5Hugging Face Blog·1mo ago·source ↗

Sentence Transformers Joins Hugging Face

Sentence Transformers, a widely-used library for generating sentence embeddings and semantic similarity, is officially joining Hugging Face. This integration brings the popular embedding framework under the Hugging Face ecosystem, likely enabling tighter integration with the Hub, datasets, and other HF tooling. The move consolidates a key component of the NLP/embedding pipeline within the dominant open-source AI platform.

Enterprise Deployment Patterns Agent and Tool Ecosystem Hugging Face Sentence Transformers

5Hugging Face Blog·1mo ago·source ↗

Training and Finetuning Sparse Embedding Models with Sentence Transformers

Hugging Face published a tutorial on training and fine-tuning sparse embedding models using the Sentence Transformers library. Sparse embeddings offer an alternative to dense vector representations for retrieval tasks, potentially improving interpretability and efficiency. The post covers the tooling and workflows available in Sentence Transformers for producing sparse encoders suitable for search and RAG pipelines.

Inference Economics Agent and Tool Ecosystem Sparse Embedding Models Hugging Face Sentence Transformers

6arXiv · cs.CL·18d ago·source ↗

SubFit: Submodule-Level Fitted Residual Replacement for LLM Compression

SubFit introduces a post-training LLM compression method that operates at the submodule level (Attention and FeedForward separately) rather than full layers, and selects components non-contiguously. The approach replaces removed submodules with lightweight fitted residual bypasses calibrated on small data. Evaluated across ten LLMs at sparsity levels from 12.5% to 37.5%, SubFit retains 84.6% of dense downstream accuracy at 25% sparsity versus 81.6% for the strongest baseline, while reducing perplexity degradation from 4.34x to 2.42x and delivering measurable inference speedup and KV-cache savings.

Training Infrastructure Evaluation and Benchmarking FeedForward submodule KV Cache SubFit +7 more

4arXiv · cs.CL·23d ago·source ↗

Stance Detection in Prediction Market Commentary via Counterfactual Augmentation and Market Context

This paper introduces the first stance detection system applied to prediction market commentary (Polymarket), addressing extreme class imbalance (8.7% anti-market comments) through LLM-driven counterfactual augmentation using the Anthropic API. RoBERTa-base is fine-tuned across a 4×3 ablation covering input configurations and augmentation doses. Key findings: market context is the dominant factor (raising 3-class Anti recall from 0.10 to 0.45), 50% synthetic augmentation is optimal, and full augmentation (100%) consistently degrades performance. Attention-based interpretability supports all three findings mechanistically.

Agent and Tool Ecosystem RoBERTa counterfactual data augmentation Polymarket +2 more

3Hugging Face Blog·1mo ago·source ↗

Train and Fine-Tune Sentence Transformers Models

This Hugging Face blog post provides a technical guide on training and fine-tuning Sentence Transformers models for producing dense sentence embeddings. It covers dataset preparation, loss function selection, and training configuration using the sentence-transformers library. The post targets practitioners building semantic search, clustering, or similarity systems.

Agent and Tool Ecosystem Hugging Face Sentence Transformers

4arXiv · cs.CL·24d ago·source ↗

Temporal Simultaneity Predicts Annotation Quality in Setswana Sentiment Corpora

Researchers present a Setswana sentiment dataset of 3,565 tweets annotated by three native speakers across eight batches, finding that inter-annotator agreement (IAA) declines sharply over time despite an aggregate Kappa of 0.76. The dominant predictor of agreement quality is temporal simultaneity: tweets labeled within one minute achieve κ=0.98 versus κ=0.65 for those labeled more than a day apart. The study also benchmarks multilingual encoders and proprietary models including GPT-5 and Gemini on three-class sentiment classification, with GPT-5 few-shot achieving the best result at 62.2 macro-F1. The dataset, timestamps, and analysis code are released to support reproducible quality auditing for African language NLP.

Evaluation and Benchmarking Agent and Tool Ecosystem Inter-Annotator Agreement Randolph's Free-Marginal Kappa Setswana Sentiment Dataset +3 more

5arXiv · cs.AI·4d ago·source ↗

ActiveSAM: Training-free open-vocabulary segmentation via image-conditional class pruning on SAM 3

ActiveSAM is a training-free, zero-shot inference framework that wraps Segment Anything Model 3 (SAM 3) to perform open-vocabulary semantic segmentation more efficiently. It estimates an image-conditioned active class subset at low resolution before running full-resolution decoding only on retained classes, using bucketed prompt multiplexing and margin-aware background calibration. Across eight benchmarks, it outperforms the prior state-of-the-art SegEarth-OV3 by ~1.4 mIoU on average while running up to 5.5x faster on large-vocabulary datasets, with strong robustness to image corruption relevant to autonomous driving and embodied AI.

Evaluation and Benchmarking Inference Economics VILA-Lab Segment Anything Model 2 ActiveSAM +1 more