Need someone with experience in reinforcement learning to help me with my project.
- Building extensions of an existing algorithm based on another implementation
- Testing the performance on different PyBullet environments
- Requires extensive knowledge of PyTorch and actor-critic models
The extension needs to follow the method described in the attached paper.