Entity · benchmark

MNIST

benchmarkactivemnist-b117d755·6 events·first seen May 18, 2026

Aliases: MNIST

Co-occurring entities

NeuronSoup ResNet-18 Q-DIBA Input-Aware Dynamic Backdoor Attack Against Quantum Neural Networks Fashion-MNIST National Institute of Standards and Technology AI Risk Management Framework Anthropic Hugging Face adversarial training human uncertainty alignment model calibration dataset cartography soft-label learning CIFAR-10 Random Coding Flow Matching Watermarking

More like this (12)

Fashion-MNIST Colored MNIST PneumoniaMNIST ImageNet CIFAR-10 ImageNet-100 ResNet AlexNet TensorFlow CIFAR-100 neural network image classifiers MATH

Recent events (6)

4arXiv · cs.LG·Jul 17, 2026·source ↗

NeuronSoup: Asynchronous shared-neuron temporal graph architecture evolved without backpropagation

NeuronSoup is a neural computation architecture that replaces synchronous layer-by-layer processing with asynchronous, delay-mediated signal propagation through a pool of physically shared neurons, optimized entirely by a genetic algorithm rather than gradient descent. On MNIST digit classification using frozen ResNet18 features, the evolved network achieves 85.9% test accuracy with 204 active paths through 266 hidden neurons, fitting in 115 KB. The architecture requires no differentiable computation graph, adapts computation depth per sample, and discovers lateral pathway interactions without explicit engineering. The authors argue genetic algorithms are the appropriate optimizer for this problem class and discuss why CMA-ES fails at this scale.

Evaluation and Benchmarking MNIST NeuronSoup ResNet-18

4arXiv · cs.LG·Jul 14, 2026·source ↗

Q-DIBA: First input-aware dynamic backdoor attack against Quantum Neural Networks

Researchers introduce Q-DIBA, the first input-aware dynamic backdoor attack targeting Quantum Neural Networks (QNNs), addressing limitations of prior fixed-trigger quantum backdoor methods. The approach jointly trains a classical trigger generator and a victim QNN using a three-mode mini-batch strategy and an ensemble density contrastive loss operating on post-ansatz quantum states before measurement. Experiments on MNIST and Fashion-MNIST demonstrate high attack success rates, stealthiness, and resilience against defenses including spectral-signature detection and fine-tuning. The work highlights a novel security threat relevant to near-term quantum machine learning deployments.

AI Safety Research Q-DIBA MNIST Input-Aware Dynamic Backdoor Attack Against Quantum Neural Networks +1 more

5Anthropic News·Jun 3, 2026·source ↗

Anthropic proposes ambitious federal funding increase for NIST AI measurement and standards

Anthropic published a policy proposal in April 2023 calling for a significant increase in federal funding for the National Institute of Standards and Technology (NIST) to support AI measurement, evaluation, and standards work. The post argues that rigorous AI capability and risk measurement is a prerequisite for effective regulation, and outlines a concrete funding program building on NIST's existing AI Risk Management Framework and related work. Anthropic frames this as a 'shovel-ready' complement to broader AI governance proposals, recommending at minimum a $15 million increase over FY2023 levels.

AI Safety Research Regulatory Developments MNIST National Institute of Standards and Technology AI Risk Management Framework +1 more

3Hugging Face Blog·May 19, 2026·source ↗

How to Train Your Model Dynamically Using Adversarial Data

This Hugging Face blog post describes a methodology for dynamically training models using adversarial data, likely in the context of improving robustness against adversarial examples. The post covers techniques for generating and incorporating adversarial inputs during the training loop to improve model resilience. Published in mid-2022, it targets practitioners looking to harden ML models against distribution shift and adversarial attacks.

AI Safety Research MNIST Hugging Face adversarial training

5arXiv · cs.CL·May 19, 2026·source ↗

Controlled Audit of Human vs. Synthetic Soft-Labels for Calibration and Uncertainty Alignment

This paper presents a controlled study disentangling the effects of human soft-labels from label mode-shift corrections in soft-label learning, using MNIST and a synthetic variant. The authors find that human soft-labels primarily act as a regularizer improving calibration on difficult samples and promoting stable training convergence, rather than simply correcting mislabeled data. Dataset cartography analysis shows models trained on human soft-labels mirror human uncertainty patterns, while those trained on synthetic labels fail to align. The work provides a diagnostic testbed for evaluating human-AI uncertainty alignment.

Evaluation and Benchmarking AI Safety Research MNIST human uncertainty alignment model calibration +3 more

5arXiv · cs.LG·May 18, 2026·source ↗

Dynamics-Level Watermarking of Flow Matching Models with Random Codes

This paper proposes embedding watermarks directly into the velocity field (continuous dynamics) of flow matching generative models, rather than into weights or outputs. The method uses key-dependent perturbations added during training, formulated as random coding over a continuous channel, allowing black-box message recovery at detection time. The perturbation is designed to leave the generated distribution unchanged. Experiments on MNIST and CIFAR-10 demonstrate reliable message recovery, preserved generation quality, and chance-level decoding without the secret key.

Evaluation and Benchmarking AI Safety Research MNIST CIFAR-10 Random Coding +2 more