---
datasets:
- commaai/commabody
pipeline_tag: robotics
---
This model was trained on a larger version of the commabody dataset (194 minutes in total).
It includes a [VQGAN](https://github.com/CompVis/taming-transformers) encoder/decoder fine-tuned from ImageNet-pretrained weights. It compresses 160x256 images to a 10x16 grid of tokens.
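The compression figures above imply a fixed downsampling factor and a fixed number of tokens per frame. The following is a minimal sketch of that arithmetic only; `token_grid` is a hypothetical helper written for illustration and is not part of this model's code or of taming-transformers.

```python
def token_grid(height, width, grid_h, grid_w):
    """Return (downsample_h, downsample_w, tokens_per_frame) for an image-to-token grid.

    Hypothetical helper for illustration; it only does the arithmetic
    implied by the image size and token grid stated in this model card.
    """
    if height % grid_h or width % grid_w:
        raise ValueError("image size must be an integer multiple of the token grid")
    return height // grid_h, width // grid_w, grid_h * grid_w


# Sizes taken from the model card: 160x256 images, 10x16 tokens.
factor_h, factor_w, tokens_per_frame = token_grid(160, 256, 10, 16)
print(factor_h, factor_w, tokens_per_frame)  # 16 16 160
```

Each token therefore covers a 16x16 pixel patch, and every frame contributes 160 image tokens to the sequence.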
It also includes a GPT-2 model trained to predict the next frame, wheel speeds, and actions. It can be used as either a simulator or a policy. More details are in [our blog post](https://blog.comma.ai/a-drive-in-the-office/).
<video title="imagined rollouts from 3s of context" controls>
<source src="https://blog.comma.ai/img/body_patrol/rollouts.webm" type="video/webm">
</video>
You can run it on a comma body using [our example script in body-jim](https://github.com/commaai/body-jim/blob/master/examples/roam.py).