Approximate dynamic programming using fluid and diffusion approximations with applications to power management

Creators: Chen, Wei; Huang, Dayu; Kulkarni, Ankur A.; Unnikrishnan, Jayakrishnan; Zhu, Quanyan; Mehta, Prashant; Meyn, Sean; Wierman, Adam

Style

An error occurred while generating the citation.

Abstract

TD learning and its refinements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only within a prescribed finite-dimensional function class. Thus, the question that always arises is how should the function class be chosen? The goal of this paper is to propose an approach for TD learning based on choosing the function class using the solutions to associated fluid and diffusion approximations. In order to illustrate this new approach, the paper focuses on an application to dynamic speed scaling for power management.

Additional Information

Attached Files

Published - 05399685.pdf

Submitted - 1307.1759.pdf

Files

1307.1759.pdf

Files (4.2 MB)

Name	Size	Download all
1307.1759.pdf md5:f9835f86ad208fdf0563b19284d46a5d	3.0 MB	Preview Download
05399685.pdf md5:2d120ae78662e0d0050fcb5f0b3bcda9	1.2 MB	Preview Download

Additional details

	All versions	This version
Views	0	0
Downloads	0	0
Data volume	0 Bytes	0 Bytes