Title : An equivalence between adaptive dynamic programming with a critic and backpropagation through time.

Pub. Date : 2013 Dec

PMID : 24805225






1 Functional Relationship(s)
1. Sentence : We extend DHP into a new algorithm that we call Value-Gradient Learning, VGL(lambda), and prove equivalence of an instance of the new algorithm to Backpropagation Through Time for Control with a greedy policy.
   Compound Name : dhp
   Protein Name : high density lipoprotein binding protein
   Organism : Homo sapiens