Louise Deason
49def83d1d
Adds the code for "gated_linear_networks" to the files release.bara.sky and README.md for public release on GitHub.
...
PiperOrigin-RevId: 338219746
2020-10-21 09:16:25 +00:00
Arthur Guez
c45af649a7
Fix arXiv link.
...
PiperOrigin-RevId: 338119053
2020-10-20 22:59:38 +01:00
David Pfau
b2b1386a4d
Add citation info for GEOMANCER after NeurIPS acceptance
...
PiperOrigin-RevId: 338045024
2020-10-20 22:58:51 +01:00
Arthur Guez
245211f318
Initial release of himo.
...
PiperOrigin-RevId: 337916498
2020-10-19 23:53:53 +01:00
Victoria Krakovna
fefa95eb1f
Added flags and parameters for the UVFA approximation to relative reachability (for the future tasks paper)
...
PiperOrigin-RevId: 336881122
2020-10-13 16:57:49 +01:00
Victoria Krakovna
2e48a73ee4
add explanation and requirements for running the UVFA approximation for the future tasks paper
...
PiperOrigin-RevId: 336880872
2020-10-13 16:56:57 +01:00
Victoria Krakovna
0b9372d5e6
add UVFA approximation for relative reachability penalty for the future tasks paper
...
PiperOrigin-RevId: 336851625
2020-10-13 13:01:02 +01:00
Victoria Krakovna
bc398d8004
Added functionality for running on new environment variants in the future tasks paper
...
PiperOrigin-RevId: 336680745
2020-10-13 13:00:11 +01:00
Peter Wirnsberger
1d86410c90
Release of "learned_free_energy_estimation".
...
PiperOrigin-RevId: 336058713
2020-10-12 16:05:37 +01:00
Charlie Nash
c82e368e0e
Makes run.sh executable
...
PiperOrigin-RevId: 334373303
2020-10-12 16:03:28 +01:00
Raphael Koster
58a3ac707d
Fix repeated words in readme.
...
PiperOrigin-RevId: 334196593
2020-10-12 16:03:27 +01:00
Alexander Novikov
99976cfaa9
An example of training a CRR agent with TPU support
...
PiperOrigin-RevId: 334117027
2020-10-12 16:03:27 +01:00
Jake VanderPlas
0e5237df2a
Use jax.api.device_put_sharded() in place of private JAX APIs.
...
PiperOrigin-RevId: 332514384
2020-09-23 16:43:36 +01:00
Mehdi Mirza
1d763e0beb
Update the link to the arXiv tech report of physics planning games.
...
PiperOrigin-RevId: 332458733
2020-09-23 16:43:12 +01:00
Mihaela Rosca
fd75024d61
Fix scratchgan requirements and update README.
...
PiperOrigin-RevId: 332257241
2020-09-18 15:33:39 +01:00
Alvaro Sanchez-Gonzalez
30434080cc
Minor README changes.
...
PiperOrigin-RevId: 332027406
2020-09-16 18:02:10 +01:00
Alvaro Sanchez-Gonzalez
603a238733
Fixes bug in integration test setup which was making travis fail.
...
PiperOrigin-RevId: 332020695
2020-09-16 18:01:41 +01:00
Alvaro Sanchez-Gonzalez
0d8c06196b
Adding "Learning to Simulate Complex Physics with Graph Networks".
...
PiperOrigin-RevId: 332003336
2020-09-16 15:59:22 +01:00
Alvaro Sanchez-Gonzalez
15d063b1ae
Minor README changes.
...
PiperOrigin-RevId: 331572243
2020-09-16 15:58:56 +01:00
Alvaro Sanchez-Gonzalez
d1f410c717
Adding README for the open sourced code.
...
PiperOrigin-RevId: 331568645
2020-09-16 15:58:29 +01:00
Alvaro Sanchez-Gonzalez
94f98470b9
Making integration test pull an actual dataset and do a few steps of training and evaluation.
...
PiperOrigin-RevId: 331524333
2020-09-16 15:58:01 +01:00
Mehdi Mirza
ef672a0db9
Adds physics_planning_games.
...
PiperOrigin-RevId: 331167767
2020-09-11 18:21:07 +01:00
Florent Altché
7e7255eed1
Export typing annotations when available.
...
PiperOrigin-RevId: 328527159
2020-09-11 18:18:51 +01:00
Thomas Keck
85187de3dc
Fix python dependencies for glassy dynamics.
...
PiperOrigin-RevId: 328493638
2020-08-26 16:59:08 +01:00
Florent Altché
9bd09fcace
Minor fix to type annotations.
...
PiperOrigin-RevId: 328491975
2020-08-26 16:58:44 +01:00
Florent Altché
486d8caf21
Update glass_dynamics to Python 3 imports.
...
PiperOrigin-RevId: 328363319
2020-08-26 16:55:54 +01:00
Florent Altché
8457046b2c
Add checkpoints from the ablation study.
...
PiperOrigin-RevId: 328023346
2020-08-26 16:54:56 +01:00
Charlie Nash
22c3daff19
Adds PolyGen to the public DeepMind Research GitHub repository
...
PiperOrigin-RevId: 327802622
2020-08-21 15:42:20 +01:00
Florent Altché
56031adaa4
Release of Bootstrap Your Own Latent.
...
PiperOrigin-RevId: 327631980
2020-08-21 08:22:52 +00:00
Florent Altché
2314aa74d5
Update BYOL readme with checkpoints URL and note on batchnorm init.
...
PiperOrigin-RevId: 327614354
2020-08-21 08:22:27 +00:00
Zafarali Ahmed
58ee8555ed
Fix typo in the word "reinforcement learning"
...
PiperOrigin-RevId: 327006404
2020-08-21 08:21:58 +00:00
Saran Tunyasuvunakool
b89cc0f495
[catch_carry] Add DOI and article number to BibTeX citation.
...
PiperOrigin-RevId: 327006370
2020-08-21 08:21:32 +00:00
Florent Altché
63fa5e72d5
Fix initial convolution channels not multiplied by width_multiplier.
...
PiperOrigin-RevId: 327005191
2020-08-21 08:21:04 +00:00
Clara Huiyi Hu
923ad3cff0
This notebook illustrates the CIFAR-10 experiments in the paper:
...
PiperOrigin-RevId: 326141025
2020-08-21 08:16:49 +00:00
Daniel J. Mankowitz
84321ad894
Add correct link to RWRL paper.
...
PiperOrigin-RevId: 326012834
2020-08-11 15:00:26 +01:00
Saran Tunyasuvunakool
7a07dc8e47
Release of code and dataset accompanying the SIGGRAPH 2020 publication "Catch & Carry: Reusable Neural Controllers for Vision-Guided Whole-Body Tasks".
...
PiperOrigin-RevId: 325790467
2020-08-10 15:49:14 +01:00
Shaobo Hou
d687ecc9fb
Update data URL.
...
PiperOrigin-RevId: 325116144
2020-08-06 03:05:07 +01:00
Shaobo Hou
60550a5bc6
Add a colab for generating figures.
...
Export training curves to file and fix some inconsistencies.
PiperOrigin-RevId: 324825810
2020-08-06 03:04:41 +01:00
Sergio Gomez
99aaa6930a
Move to dopamine-rl version 3.1.2 in RL Unplugged
...
PiperOrigin-RevId: 324071731
2020-08-03 09:16:16 +00:00
Louise Deason
e67cf45868
Release of MEMO dataset
...
PiperOrigin-RevId: 324000883
2020-08-03 09:15:49 +00:00
Mehdi Mirza
c4af01fa75
Add board games to physics_planning_games.
...
PiperOrigin-RevId: 323799306
2020-08-03 09:15:12 +00:00
Shaobo Hou
a24bda5ed0
Add GPE/GPI experiments.
...
PiperOrigin-RevId: 323750949
2020-07-29 14:36:59 +01:00
Alexander Novikov
59c0cf5044
Fix colab links
...
PiperOrigin-RevId: 323321348
2020-07-29 14:36:20 +01:00
Saran Tunyasuvunakool
69d8db961e
Replace "pip install dm_control[locomotion_mazes]" commands with "pip install dm_control".
...
The [locomotion_mazes] extra has been deprecated in dm_control, and the labmaze dependency is now on the main dm_control package.
PiperOrigin-RevId: 322820779
2020-07-23 19:02:45 +01:00
Sergio Gomez
ed1b814008
Fix problem with gsutil command to copy Atari data to local folder
...
PiperOrigin-RevId: 322759837
2020-07-23 19:02:05 +01:00
Sergio Gomez
1ea4cc033c
Add RL Unplugged data loading code and examples
...
PiperOrigin-RevId: 321746296
2020-07-22 16:49:30 +01:00
Sergio Gomez
bd29e1b710
Add image and citation links to RL Unplugged README
...
PiperOrigin-RevId: 321197757
2020-07-15 12:50:10 +01:00
Louise Deason
8188882a82
Release of RL Unplugged
...
PiperOrigin-RevId: 320985969
2020-07-13 17:48:24 +00:00
Charlie Nash
fd4cb5a29b
Adding code for PolyGen.
...
PiperOrigin-RevId: 320927671
2020-07-13 17:47:53 +00:00
David Pfau
a499eb9be7
Changed "published" to "described" in README to reflect preprint status
...
PiperOrigin-RevId: 318282884
2020-07-13 17:47:02 +00:00