Multi-Source Cybersecurity Logs: An ATT&CK-Labeled Dataset and SLM Evaluation
multi-source-cybersecurity-logs-an-att-ck-labeled-dataset-and-slm-evaluation-be4591f7·1 events·first seen 6h agoAliases: Multi-Source Cybersecurity Logs: An ATT&CK-Labeled Dataset and SLM Evaluation
Co-occurring entities
More like this (12)
Recent events (1)
Multi-source cybersecurity log dataset with ATT&CK labels and SLM fine-tuning evaluation
Researchers introduce a new multi-source cybersecurity log dataset of 870 sessions (~2.3M events) capturing system, network, and browser activity on Windows endpoints, with per-entry MITRE ATT&CK technique labels across 12 tactics and 53 techniques. The dataset addresses gaps in existing public datasets (CICIDS, UNSW-NB15, ATLAS) that lack combined multi-source coverage with fine-grained ATT&CK labeling. Three small language models (Qwen2.5-1.5B, Llama-3.2-3B, Phi-4-Mini) were fine-tuned with LoRA on the dataset, achieving chunk classification accuracy of 90–97% versus ~8% for base variants, though ATT&CK technique identification remained harder at 42% exact-match accuracy.