mirror of https://github.com/google-deepmind/deepmind-research.git
synced 2026-05-31 13:05:40 +08:00
Fix file name in README for gamma=1 images.
PiperOrigin-RevId: 294215051
committed by Diego de Las Casas
parent 48ba42f792
commit d0efbec03a
+2 -2
@@ -145,7 +145,7 @@ low performance.<br>
For 10 replicas without TVT and with gamma equal to 1, performance of the RMA
agent without TVT is improved, but is unstable and never consistently goes above
6.<br>
# 

### Active-visual-match
Across 10 replicas, we found that the TVT agents get to a score of 10,
@@ -162,7 +162,7 @@ For 10 replicas wihtout TVT and with gamma equal to 1, performance of the RMA
agent without TVT
is considerably worse, suggesting the behavior learnt from later phases does not
result in undirected exploration in the first phase.
# 

## Citing this work