Show simple item record

On solutions of Kolmogorov’s equations for non-homogeneous jump Markov processes & Sufficiency of Markov policies in continuous-timeMarkov decision processes

dc.identifier.urihttp://hdl.handle.net/11401/77454
dc.description.sponsorshipThis work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree.en_US
dc.formatMonograph
dc.format.mediumElectronic Resourceen_US
dc.language.isoen_US
dc.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dc.typeDissertation
dcterms.abstractA basic fact in the theory of Discrete-time Markov Decision Processes is that for any policy there exists a Markov policy with the same marginal state-action distributions. This fact implies that the study of control problems with multiple criteria and constraints that are determined by marginal distribution (for e.g. expected total discounted and non-discounted costs, average cost per unit time) can be restricted to the set of Markov policies. This dissertation presents a similar result for Continuous-Time Markov Decision Processes (CTMDPs). In CTMDPs with Borel state and action spaces, unbounded transition and cost rates, for an arbitrary policy, we construct a Markov policy such that the marginal distribution on the state-action pairs is the same for both the policies. This fact implies that the expected cost rates at each time instant are equal for these two policies. Thus, the constructed Markov policy performs equally to the original policy for problems with multiple criteria and constraints that are determined by marginal distribution. The proof consists of two major steps: The first step describes the properties of solutions to Kolmogorov's equations for jump Markov processes. In particular, for given transition intensities, the three approaches to construct a jump Markov process: (i) via the compensator of the random measure of a multivariate point process, (ii) as a minimal solution of Kolmogorov's backward equation, and (iii) as a minimal solution of Kolmogorov's forward equation define the same transition function. If the jump Markov process associated with the transition function has no accumulation points, then it is the unique solution of both Kolmogorov's equations. The second step applies these results to CTMDPs and establishes that the marginal distribution on the state for both the policies satisfy Kolmogorov's forward equation defined by the Markov policy. This fact immediately implies that the marginal distributions on the state for both the policies coincide if the transition intensities corresponding to the Markov policy are bounded. In the general case, it is possible to consider a sequence of policies with bounded transition intensities and that converge to the original policy. The proof for the general case follows from these approximations.
dcterms.available2017-09-20T16:52:43Z
dcterms.contributorRachev, Svetlozaren_US
dcterms.contributorFeinberg, Eugeneen_US
dcterms.contributorTakhtajan, Leonen_US
dcterms.contributorHu, Jiaqiao.en_US
dcterms.creatorMandava, Manasa
dcterms.dateAccepted2017-09-20T16:52:43Z
dcterms.dateSubmitted2017-09-20T16:52:43Z
dcterms.descriptionDepartment of Applied Mathematics and Statistics.en_US
dcterms.extent70 pg.en_US
dcterms.formatApplication/PDFen_US
dcterms.formatMonograph
dcterms.identifierhttp://hdl.handle.net/11401/77454
dcterms.issued2015-05-01
dcterms.languageen_US
dcterms.provenanceMade available in DSpace on 2017-09-20T16:52:43Z (GMT). No. of bitstreams: 1 Mandava_grad.sunysb_0771E_12389.pdf: 485096 bytes, checksum: 8bba2fc73bee1133d6c3595ef519c26e (MD5) Previous issue date: 2015en
dcterms.publisherThe Graduate School, Stony Brook University: Stony Brook, NY.
dcterms.subjectApplied mathematics
dcterms.subjectCompensator, continuous-time Markov decision process, Jump Markov process, Kolmogorov's equation, Markov policies
dcterms.titleOn solutions of Kolmogorov’s equations for non-homogeneous jump Markov processes & Sufficiency of Markov policies in continuous-timeMarkov decision processes
dcterms.titleOn solutions of Kolmogorov’s equations for non-homogeneous jump Markov processes & Sufficiency of Markov policies in continuous-timeMarkov decision processes
dcterms.typeDissertation


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record