Entity · paper

Multi-Source Cybersecurity Logs: An ATT&CK-Labeled Dataset and SLM Evaluation

paperactivemulti-source-cybersecurity-logs-an-att-ck-labeled-dataset-and-slm-evaluation-be4591f7·1 events·first seen Jun 17, 2026

Aliases: Multi-Source Cybersecurity Logs: An ATT&CK-Labeled Dataset and SLM Evaluation

Co-occurring entities

CICIDS Llama 3.2 LoRA Phi-4-mini Qwen2.5-1.5B MITRE ATT&CK UNSW-NB15 Atlas

More like this (12)

MITRE ATT&CK Multi-Agentic System Leveraging Open-Source LLMs to Mitigate Disinformation Threats Online Safety Monitoring for LLMs Cybersecurity Task Evaluation Beyond Success Rate: Cost-Aware Evaluation of Offensive and Defensive Security Agents Detecting and Countering Malicious Uses of Claude: March 2025 Words Speak Louder Than Code: Investigating Cognitive Heuristics in LLM-Based Code Vulnerability Detection Automated Compliance Mapping in Cloud Security with Domain-Adapted Sentence Transformers 2026 State of AI Traffic and Cyberthreat Benchmark Report What Do Safety-Aligned LLMs Learn From Mixed Compliance Demonstrations?Rethinking Penetration Testing for AI-Enabled Systems: From Resource Compromise to Behavioral Objective Violation CM-LRS

Recent events (1)

5arXiv · cs.LG·Jun 17, 2026·source ↗

Multi-source cybersecurity log dataset with ATT&CK labels and SLM fine-tuning evaluation

Researchers introduce a new multi-source cybersecurity log dataset of 870 sessions (~2.3M events) capturing system, network, and browser activity on Windows endpoints, with per-entry MITRE ATT&CK technique labels across 12 tactics and 53 techniques. The dataset addresses gaps in existing public datasets (CICIDS, UNSW-NB15, ATLAS) that lack combined multi-source coverage with fine-grained ATT&CK labeling. Three small language models (Qwen2.5-1.5B, Llama-3.2-3B, Phi-4-Mini) were fine-tuned with LoRA on the dataset, achieving chunk classification accuracy of 90–97% versus ~8% for base variants, though ATT&CK technique identification remained harder at 42% exact-match accuracy.

Evaluation and Benchmarking AI Safety Research Multi-Source Cybersecurity Logs: An ATT&CK-Labeled Dataset and SLM Evaluation CICIDS Llama 3.2 +6 more