
Approximation of lorenz-optimal solutions in multiobjective markov decision processes

Research output: Conference contribution, peer-reviewed

9 Citations (Scopus)

Abstract

We consider the problem of finding small representative sets of Lorenz-optimal policies for MOMDPs (Multiobjective Markov Decision Processes). MDPs model planning under uncertainty; MOMDPs are MDPs with multiple reward functions. Most work on MOMDPs finds sets of Pareto-optimal policies, i.e., policies whose expected discounted total reward vectors cannot be improved on one objective without being downgraded on another objective. Unfortunately, if we allow randomized policies, the Pareto set of policies may be infinite; even if we restrict to deterministic policies, there are families of MOMDPs where the sizes of the Pareto sets grow exponentially in the number of MDP states (Ogryczak, Perny, and Weng 2011). Earlier work looks at polynomial-sized representative samples of the Pareto set (Papadimitriou and Yannakakis 2000; Chatterjee, Majumdar, and Henzinger 2006). Here, we seek a stronger notion than Pareto optimality. It is based on Lorenz dominance, a partial preference order refining Pareto dominance while including an idea of fairness in preferences. It is used for the measurement of inequalities in mathematical economics (Shorrocks 1983), for example to compare income distributions over a population. In our context, it can be used to compare reward vectors by inspecting how they distribute rewards over objectives. We describe algorithms for finding small, representative subsets of the Lorenz-optimal policies for MOMDPs.
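As an illustration of how Lorenz dominance refines Pareto dominance, the following is a minimal sketch, not the algorithm from the paper. It assumes the common generalized Lorenz definition for maximization: sort a reward vector's components in increasing order, take cumulative sums, and compare the resulting vectors by Pareto dominance. The function names and the sample reward vectors are hypothetical.

```python
from itertools import accumulate


def pareto_dominates(v, w):
    """True if v is at least as good as w on every objective and strictly better on one."""
    return all(a >= b for a, b in zip(v, w)) and any(a > b for a, b in zip(v, w))


def lorenz_vector(v):
    """Cumulative sums of the components of v sorted in increasing order (assumed definition)."""
    return list(accumulate(sorted(v)))


def lorenz_dominates(v, w):
    """v Lorenz-dominates w if its Lorenz vector Pareto-dominates that of w."""
    return pareto_dominates(lorenz_vector(v), lorenz_vector(w))


def lorenz_optimal(vectors):
    """Keep the vectors that no other vector in the collection Lorenz-dominates."""
    return [v for v in vectors if not any(lorenz_dominates(w, v) for w in vectors)]


# Hypothetical reward vectors over two objectives.
candidates = [(5, 5), (2, 8), (8, 2), (4, 4)]
print(pareto_dominates((5, 5), (2, 8)))   # False: incomparable under Pareto dominance
print(lorenz_dominates((5, 5), (2, 8)))   # True: same total, more evenly distributed
print(lorenz_optimal(candidates))         # [(5, 5)]
```

In this toy example three of the four vectors are Pareto-optimal, but only the balanced vector (5, 5) is Lorenz-optimal, which shows why the Lorenz-optimal set can be much smaller than the Pareto set.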

Original language: English
Title of host publication: Late-Breaking Developments in the Field of Artificial Intelligence - Papers Presented at the 27th AAAI Conference on Artificial Intelligence, Technical Report
Pages: 92-94
Number of pages: 3
Publication status: Published - 2013
Event: 27th AAAI Conference on Artificial Intelligence, AAAI 2013 - Bellevue, WA, United States
Duration: Jul 14 2013 to Jul 18 2013

Publication series

Name: AAAI Workshop - Technical Report
Volume: WS-13-17

Conference

Conference: 27th AAAI Conference on Artificial Intelligence, AAAI 2013
Country/Territory: United States
City: Bellevue, WA
Period: 7/14/13 to 7/18/13

Funding

Funder: National Science Foundation (NSF)
Funder numbers: CCF-1049360, EF-0850237

ASJC Scopus subject areas

• General Engineering
