On solutions of Kolmogorov’s equations for non-homogeneous jump Markov processes & Sufficiency of Markov policies in continuous-time Markov decision processes

dc.identifier.uri	http://hdl.handle.net/11401/77454
dc.description.sponsorship	This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.	en_US
dc.format	Monograph
dc.format.medium	Electronic Resource	en_US
dc.language.iso	en_US
dc.publisher	The Graduate School, Stony Brook University: Stony Brook, NY.
dc.type	Dissertation
dcterms.abstract	A basic fact in the theory of discrete-time Markov decision processes is that for any policy there exists a Markov policy with the same marginal state-action distributions. This fact implies that the study of control problems with multiple criteria and constraints that are determined by marginal distributions (e.g., expected total discounted and non-discounted costs, average cost per unit time) can be restricted to the set of Markov policies. This dissertation presents a similar result for continuous-time Markov decision processes (CTMDPs). In CTMDPs with Borel state and action spaces and unbounded transition and cost rates, for an arbitrary policy, we construct a Markov policy such that the marginal distributions on the state-action pairs are the same for both policies. This fact implies that the expected cost rates at each time instant are equal for these two policies. Thus, the constructed Markov policy performs as well as the original policy for problems with multiple criteria and constraints that are determined by marginal distributions. The proof consists of two major steps. The first step describes the properties of solutions to Kolmogorov's equations for jump Markov processes. In particular, for given transition intensities, the three approaches to constructing a jump Markov process, namely (i) via the compensator of the random measure of a multivariate point process, (ii) as a minimal solution of Kolmogorov's backward equation, and (iii) as a minimal solution of Kolmogorov's forward equation, define the same transition function. If the jump Markov process associated with the transition function has no accumulation points, then the transition function is the unique solution of both of Kolmogorov's equations. The second step applies these results to CTMDPs and establishes that the marginal distributions on the state for both policies satisfy Kolmogorov's forward equation defined by the Markov policy. This fact immediately implies that the marginal distributions on the state for both policies coincide if the transition intensities corresponding to the Markov policy are bounded. In the general case, it is possible to consider a sequence of policies with bounded transition intensities that converges to the original policy. The proof for the general case follows from these approximations.
dcterms.available	2017-09-20T16:52:43Z
dcterms.contributor	Rachev, Svetlozar	en_US
dcterms.contributor	Feinberg, Eugene	en_US
dcterms.contributor	Takhtajan, Leon	en_US
dcterms.contributor	Hu, Jiaqiao	en_US
dcterms.creator	Mandava, Manasa
dcterms.dateAccepted	2017-09-20T16:52:43Z
dcterms.dateSubmitted	2017-09-20T16:52:43Z
dcterms.description	Department of Applied Mathematics and Statistics.	en_US
dcterms.extent	70 pg.	en_US
dcterms.format	Application/PDF	en_US
dcterms.format	Monograph
dcterms.identifier	http://hdl.handle.net/11401/77454
dcterms.issued	2015-05-01
dcterms.language	en_US
dcterms.provenance	Made available in DSpace on 2017-09-20T16:52:43Z (GMT). No. of bitstreams: 1 Mandava_grad.sunysb_0771E_12389.pdf: 485096 bytes, checksum: 8bba2fc73bee1133d6c3595ef519c26e (MD5) Previous issue date: 2015	en
dcterms.publisher	The Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subject	Applied mathematics
dcterms.subject	Compensator, continuous-time Markov decision process, Jump Markov process, Kolmogorov's equation, Markov policies
dcterms.title	On solutions of Kolmogorov’s equations for non-homogeneous jump Markov processes & Sufficiency of Markov policies in continuous-time Markov decision processes
dcterms.type	Dissertation

Files in this item

Name: Mandava_grad.sunysb_0771E_12389.pdf
Size: 473.7Kb
Format: application/pdf

This item appears in the following Collection(s)

Stony Brook Theses and Dissertations Collection [4009]
