Approximation of lorenz-optimal solutions in multiobjective markov decision processes

Patrice Perny, Paul Weng, Judy Goldsmith, Josiah P. Hanna

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

8 Scopus citations

Abstract

We consider the problem of finding small representative sets of Lorenz-optimal policies for MOMDPs (Multiobjective Markov Decision Processes). MDPs model planning under uncertainty; MOMDPs are MDPs with multiple reward functions. Most work on MOMDPs finds sets of Paretooptimal policies, i.e., policies whose expected discounted total reward vectors cannot be improved on one objective without being downgraded on another objective. Unfortunately, if we allow randomized policies, the Pareto set of policies may be infinite; even if we restrict to deterministic policies, there are families of MOMDPs where the sizes of the Pareto sets grow exponentially in the number of MDP states (Ogryczak, Pemy, and Weng 2011). Earlier work looks at polynomial-sized representative samples of the Pareto set (Papadimitriou and Yannakakis 2000; Chatterjee, Majumdar, and Henzinger 2006). Here, we seek a stronger notion than Pareto optimality. It is based on Lorenz dominance, a partial preference order refining Pareto dominance while including an idea of fairness in preferences. It is used for the measurement of inequalities in mathematical economics (Shorrocks 1983), for example to compare income distributions over a population. In our context, it can be used to compare reward vectors by inspecting how they distribute rewards over objectives. We describe algorithms for finding small, representative subsets of the Lorenz-optimal policies for MOMDPs.

Original languageEnglish
Title of host publicationLate-Breaking Developments in the Field of Artificial Intelligence - Papers Presented at the 27th AAAI Conference on Artificial Intelligence, Technical Report
Pages92-94
Number of pages3
StatePublished - 2013
Event27th AAAI Conference on Artificial Intelligence, AAAI 2013 - Bellevue, WA, United States
Duration: Jul 14 2013Jul 18 2013

Publication series

NameAAAI Workshop - Technical Report
VolumeWS-13-17

Conference

Conference27th AAAI Conference on Artificial Intelligence, AAAI 2013
Country/TerritoryUnited States
CityBellevue, WA
Period7/14/137/18/13

Funding

FundersFunder number
National Science Foundation (NSF)CCF-1049360, EF-0850237

    ASJC Scopus subject areas

    • General Engineering

    Fingerprint

    Dive into the research topics of 'Approximation of lorenz-optimal solutions in multiobjective markov decision processes'. Together they form a unique fingerprint.

    Cite this