Advanced search options

Advanced Search Options 🞨

Browse by author name (“Author name starts with…”).

Find ETDs with:

in
/  
in
/  
in
/  
in

Written in Published in Earliest date Latest date

Sorted by

Results per page:

You searched for id:"handle:10012/14963". One record found.

Search Limiters

Last 2 Years | English Only

No search limiters apply to these results.

▼ Search Limiters


University of Waterloo

1. Jhunjhunwala, Aman. Policy Extraction via Online Q-Value Distillation.

Degree: 2019, University of Waterloo

Recently, deep neural networks have been capable of solving complex control tasks in certain challenging environments. However, these deep learning policies continue to be hard to interpret, explain and verify which limits their practical applicability. Decision Trees lend themselves well to explanation and verification tools but are not easy to train especially in an online fashion. The aim of this thesis is to explore online tree construction algorithms and demonstrate the technique and effectiveness of distilling reinforcement learning policies into a Bayesian tree structure. We introduce Q-BSP Trees and an Ordered Sequential Monte Carlo training algorithm that helps condense the Q-function from fully trained Deep Q-Networks into the tree structure. QBSP Forests generate partitioning rules that transparently reconstruct the value function for all possible states. It convincingly beats performance benchmarks provided by earlier policy distillation methods resulting in performance closest to the original Deep Learning policy.

Record DetailsSimilar RecordsGoogle PlusoneFacebookTwitterCiteULikeMendeleyreddit

APA · Chicago · MLA · Vancouver · CSE | Export to Zotero / EndNote / Reference Manager

APA (6th Edition):

Jhunjhunwala, A. (2019). Policy Extraction via Online Q-Value Distillation. (Thesis). University of Waterloo. Retrieved from http://hdl.handle.net/10012/14963

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Chicago Manual of Style (16th Edition):

Jhunjhunwala, Aman. “Policy Extraction via Online Q-Value Distillation.” 2019. Thesis, University of Waterloo. Accessed September 19, 2019. http://hdl.handle.net/10012/14963.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

MLA Handbook (7th Edition):

Jhunjhunwala, Aman. “Policy Extraction via Online Q-Value Distillation.” 2019. Web. 19 Sep 2019.

Vancouver:

Jhunjhunwala A. Policy Extraction via Online Q-Value Distillation. [Internet] [Thesis]. University of Waterloo; 2019. [cited 2019 Sep 19]. Available from: http://hdl.handle.net/10012/14963.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Council of Science Editors:

Jhunjhunwala A. Policy Extraction via Online Q-Value Distillation. [Thesis]. University of Waterloo; 2019. Available from: http://hdl.handle.net/10012/14963

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

.