mirror of
https://github.com/google-deepmind/deepmind-research.git
synced 2026-05-10 13:27:17 +08:00
Fix file name in README for gamma=1 images.
PiperOrigin-RevId: 294215051
This commit is contained in:
committed by
Diego de Las Casas
parent
48ba42f792
commit
d0efbec03a
+2
-2
@@ -145,7 +145,7 @@ low performance.<br>
|
||||
For 10 replicas without TVT and with gamma equal to 1, performance of the RMA
|
||||
agent without TVT is improved, but is unstable and never consistently goes above
|
||||
6.<br>
|
||||
# 
|
||||
# 
|
||||
|
||||
### Active-visual-match
|
||||
Across 10 replicas, we found that the TVT agents get to a score of 10,
|
||||
@@ -162,7 +162,7 @@ For 10 replicas wihtout TVT and with gamma equal to 1, performance of the RMA
|
||||
agent without TVT
|
||||
is considerably worse, suggesting the behavior learnt from later phases does not
|
||||
result in undirected exploration in the first phase.
|
||||
# 
|
||||
# 
|
||||
|
||||
## Citing this work
|
||||
|
||||
|
||||
Reference in New Issue
Block a user