paper
A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise
paperactiveprovisional
a-diffusion-approximation-for-temporal-difference-learning-with-linear-features-under-markovian-noise-01e6b946·1 events·first seen 7h agoAliases: A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise
Co-occurring entities
More like this (12)
Temporal Difference LearningDenoising Diffusion Probabilistic ModelsBeyond Fully Random Masking: Attention-Guided Denoising and Optimization for Diffusion Language ModelsSelf-Augmenting Retrieval for Diffusion Language ModelsKnowledge Editing in Masked Diffusion Language ModelsLESS: Mutual-Stability Sampling for Diffusion Language ModelsMasked Diffusion Modelslikelihood approximationFlashbackCL: Mitigating Temporal Forgetting in Federated Learningmultivariate time series representation learningdiffusion posterior samplingdiscrete diffusion models
Recent events (1)
SDE approximation for TD learning with linear features under Markovian noise
A new arXiv preprint replaces the classical ODE description of linear TD(0) learning with a stochastic differential equation (SDE) approximation that accounts for Markovian sampling noise. The model separates contraction dynamics governed by the projected Bellman operator from the influence of Markovian long-run covariance, providing a theoretical explanation for the constant-stepsize error floor. The work is a theoretical contribution to the foundations of reinforcement learning policy evaluation.