Because we may want to feed real-time feedback into the performance, we need to benchmark all three stages of the pipeline:

1. 2D pose estimation
2. 3D pose estimation
3. Pose generation with the pre-trained model
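One way to time the three stages separately is sketched below. It is a minimal illustration, not the project's actual benchmark: `benchmark_pipeline`, `summarize`, the stage names, and the placeholder stage functions are all hypothetical, and it assumes each stage is a callable that takes the previous stage's output. The first few frames are discarded as warm-up, since model loading and caching tend to inflate early measurements.

```python
import statistics
import time
from typing import Any, Callable, Dict, List, Sequence, Tuple

# A stage is a (name, function) pair; each function consumes the
# previous stage's output. These names are illustrative assumptions.
Stage = Tuple[str, Callable[[Any], Any]]


def benchmark_pipeline(
    stages: Sequence[Stage],
    frames: Sequence[Any],
    warmup: int = 2,
) -> Dict[str, List[float]]:
    """Run every frame through all stages and record per-stage wall time in seconds."""
    if len(frames) <= warmup:
        raise ValueError("need more frames than warm-up iterations")

    timings: Dict[str, List[float]] = {name: [] for name, _ in stages}
    for i, frame in enumerate(frames):
        data = frame
        for name, fn in stages:
            start = time.perf_counter()
            data = fn(data)
            elapsed = time.perf_counter() - start
            # Skip warm-up frames so one-off setup costs do not skew the stats.
            if i >= warmup:
                timings[name].append(elapsed)
    return timings


def summarize(timings: Dict[str, List[float]]) -> Dict[str, Dict[str, float]]:
    """Reduce raw samples to mean and worst-case latency in milliseconds."""
    return {
        name: {
            "mean_ms": statistics.mean(samples) * 1000.0,
            "max_ms": max(samples) * 1000.0,
        }
        for name, samples in timings.items()
    }


if __name__ == "__main__":
    # Placeholder stages standing in for the real models (assumption).
    stages: List[Stage] = [
        ("pose_2d", lambda frame: [(0.0, 0.0)] * 17),
        ("pose_3d", lambda kp2d: [(x, y, 0.0) for x, y in kp2d]),
        ("generation", lambda kp3d: kp3d),
    ]
    frames = list(range(20))  # stand-in for video frames
    for name, stats in summarize(benchmark_pipeline(stages, frames)).items():
        print(f"{name}: mean {stats['mean_ms']:.3f} ms, max {stats['max_ms']:.3f} ms")
```

For a real-time budget, the sum of the per-stage means has to stay below the frame interval (about 33 ms at 30 fps), and the maximum shows how bad the worst frame gets. If a stage runs on a GPU, the measurement is only meaningful when the stage waits for the device to finish before returning.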