Entity · dataset

ASTRA

datasetactiveastra-4ffc83bb·1 events·first seen May 28, 2026

Aliases: ASTRA

Co-occurring entities

Code as a Weapon Prompt Bank CySecBench RedCode MalwareBench AdvBench Scam2Prompt JailbreakBench RMCBench Fleiss' Kappa

More like this (12)

Astera Institute AstrBot Astral FunASR AASIST AuRA AISPA SAST ALER-TI ACROS ARIS ASRD

Recent events (1)

6arXiv · cs.CL·May 28, 2026·source ↗

Consensus-Labeled Prompt Bank for Measuring Coding-Model Compliance with Malicious-Code Requests

This paper introduces a large, consensus-labeled benchmark of 6,675 prompts drawn from eight existing corpora (ASTRA, CySecBench, AdvBench, JailbreakBench, MalwareBench, RedCode, RMCBench, Scam2Prompt) to evaluate whether coding-specialized LLMs refuse malicious requests. A key contribution is the distinction between requests for executable malicious code (4,748 prompts) versus harmful security knowledge (1,923 prompts), arguing that coding models should face a stricter refusal standard given their outputs can be directly weaponized. A five-judge consensus protocol achieves Fleiss' kappa of 0.767, providing a reliability-quantified substrate for cross-corpus compliance measurement that the field has previously lacked.

Evaluation and Benchmarking AI Safety Research Code as a Weapon Prompt Bank CySecBench RedCode +8 more