English
Français

Se connecter
Login UniNEMot de passe

English
Français

Se connecter
Login UniNEMot de passe

Accueil
Université de Neuchâtel
Personnes
Dimitrakakis, Christos

Informations complémentaires

Options

Vignette d'image

Dimitrakakis, Christos

Nom

Dimitrakakis, Christos

Affiliation principale

Institut d'informatique

Fonction

Professor

Email

christos.dimitrakakis@unine.ch

Identifiants

https://libra.unine.ch/handle/123456789/30936

_

0000-0002-5367-5189

Résultat de la recherche

Visualisation Par type
Visualisation par date

1 Résultats

Filtres

Aristide Tossou

Debabrota Basu 1

Dimitrakakis, Christos 1

Recherche du nom d'auteur

Soumettre

Institut d'informatique 1

Rechercher des institutions

Soumettre

Artificial Intelligence (cs.AI)

Computer Science and Game Theory (cs.GT) 1

Machine Learning 1

Machine Learning (cs.LG) 1

Recherche d'un sujet

Soumettre

journal article 1

Recherche d'un type

Soumettre

Réinitialiser les filtres

Paramètres

Trier par

Résultats par page

Voici les éléments 1 - 1 sur 1

Accès libre
Near-optimal Optimistic Reinforcement Learning using Empirical Bernstein Inequalities
(2019)
Aristide Tossou
;
Debabrota Basu
;
Dimitrakakis, Christos
We study model-based reinforcement learning in an unknown finite communicating Markov decision process. We propose a simple algorithm that leverages a variance based confidence interval. We show that the proposed algorithm, UCRL-V, achieves the optimal regret O~(DSAT−−−−−−√) up to logarithmic factors, and so our work closes a gap with the lower bound without additional assumptions on the MDP. We perform experiments in a variety of environments that validates the theoretical bounds as well as prove UCRL-V to be better than the state-of-the-art algorithms.

Présentation du portail

Guide d'utilisation

Stratégie Open Access

Directive Open Access

La recherche à l'UniNE

Service information scientifique & bibliothèques
Rue Emile-Argand 11
2000 Neuchâtel
contact.libra@unine.ch

Propulsé par DSpace, DSpace-CRIS & 4Science | v2022.02.00