ReBaBot

ReBaBot is an extension of the original BaBot project by Johan Link, introducing Reinforcement Learning (RL) as an alternative to the traditional PID controller for ball balancing control.

Instead of relying on manually tuned PID gains, ReBaBot uses RL algorithms to choose the control actions that stabilize and balance the ball.

A special thanks goes to Johan Link for the original BaBot idea and implementation, which served as the foundation for this project.

Project Overview

The goal of ReBaBot is to replace the classical PID-based control system with a Reinforcement Learning approach. The system learns how to balance a ball dynamically on a platform through interaction and reward feedback.
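
To make "reward feedback" concrete, here is a minimal sketch of what a balancing reward could look like. It is an illustration only, not the reward used in this project (see report.pdf for the actual design). The function name `balance_reward`, the state layout (ball position and velocity relative to the platform center), and the `velocity_weight` parameter are all assumptions.

```python
import math


def balance_reward(x: float, y: float, vx: float = 0.0, vy: float = 0.0,
                   velocity_weight: float = 0.1) -> float:
    """Hypothetical reward: 0 when the ball rests at the center, negative otherwise.

    x, y   -- assumed ball position relative to the platform center
    vx, vy -- assumed ball velocity
    """
    distance = math.hypot(x, y)   # how far the ball is from the center
    speed = math.hypot(vx, vy)    # how fast the ball is moving
    # Penalize both distance and speed so the agent prefers a still, centered ball.
    return -(distance + velocity_weight * speed)
```

With a reward of this shape, an agent that keeps the ball centered and still collects the highest return, which is the behavior a well-tuned PID controller aims for as well.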

We experimented with two RL algorithms:

Soft Actor-Critic (SAC)
Proximal Policy Optimization (PPO)

The RL model is responsible for generating the control actions that would normally be computed by the PID controller.
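
One way to picture this swap is to treat both controllers as functions with the same signature: ball state in, platform command out. The sketch below is an assumption-laden illustration, not the project's code. The state layout `(x, y)`, the action layout, the names `AxisPID`, `make_pid_controller`, and `make_rl_controller`, and the clipping limit are all hypothetical; `policy` stands for any callable, such as a wrapper around a trained SAC or PPO model.

```python
from typing import Callable, Tuple

State = Tuple[float, float]    # assumed: ball position (x, y)
Action = Tuple[float, float]   # assumed: platform tilt command per axis
Controller = Callable[[State], Action]


class AxisPID:
    """Textbook discrete PID for a single axis."""

    def __init__(self, kp: float, ki: float, kd: float, dt: float) -> None:
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error: float) -> float:
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def make_pid_controller(kp: float, ki: float, kd: float, dt: float) -> Controller:
    pid_x, pid_y = AxisPID(kp, ki, kd, dt), AxisPID(kp, ki, kd, dt)

    def act(state: State) -> Action:
        x, y = state
        # The target is the platform center, so the error is (0 - position).
        return pid_x.step(-x), pid_y.step(-y)

    return act


def make_rl_controller(policy: Callable[[State], Action], limit: float = 1.0) -> Controller:
    def act(state: State) -> Action:
        ax, ay = policy(state)
        # Clip the policy output to an assumed safe command range.
        return (max(-limit, min(limit, ax)), max(-limit, min(limit, ay)))

    return act
```

Because both factories return a `Controller`, the rest of a control loop would not need to know whether the command came from a PID law or from a learned policy.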

System Architecture

A Raspberry Pi 5 is connected directly to the robot’s control board.
The Raspberry Pi runs the Reinforcement Learning inference/training logic.
Communication between the Raspberry Pi and the robot is handled via serial communication.
Sensor feedback and state information are sent to the RL agent, which responds with control actions.
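
The exchange described above can be sketched as a read-act-write loop. The wire format here is purely an assumption for illustration (one ASCII line `x,y` per state and one line `ax,ay` per action); the real protocol is defined by the project's firmware and scripts. `decode_state`, `encode_action`, and `run_steps` are hypothetical names. `port` can be any object with `readline()` and `write()` methods, which is the interface a pyserial `serial.Serial` object exposes.

```python
from typing import Callable, Tuple

State = Tuple[float, float]    # assumed: ball position (x, y)
Action = Tuple[float, float]   # assumed: platform command per axis


def decode_state(line: bytes) -> State:
    """Parse an assumed 'x,y' ASCII line sent by the control board."""
    x_text, y_text = line.decode("ascii").strip().split(",")
    return float(x_text), float(y_text)


def encode_action(action: Action) -> bytes:
    """Format an action as an assumed 'ax,ay' ASCII line."""
    return "{:.3f},{:.3f}\n".format(*action).encode("ascii")


def run_steps(port, controller: Callable[[State], Action], n_steps: int) -> int:
    """Run up to n_steps control cycles; return how many actions were sent."""
    sent = 0
    for _ in range(n_steps):
        line = port.readline()
        if not line:  # empty read: timeout or closed port
            break
        action = controller(decode_state(line))
        port.write(encode_action(action))
        sent += 1
    return sent
```

Keeping parsing and formatting in separate functions makes it possible to exercise the loop against an in-memory stand-in for the port before connecting the real robot.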

Repository Structure

report.pdf → Detailed explanation of the project, implementation, and experiments (recommended reading for full reproduction)
3D_Models → STL files of the robot (both first and second versions)
datasets → CSV files containing the individual runs of PPO and SAC, and the PID datasets
tensorboard_training_logs → TensorBoard log files
plot_result.py → Script to plot the training results
models → Saved RL models

🙏 Acknowledgements

This project is built upon the original BaBot system by Johan Link.
We sincerely thank him for the initial idea that made this work possible.

