Hello IK Team,
I've recently got myself a new NVIDIA RTX 4070 card and was a little bummed by the capturing performance. Some people on forums claim that a extende capture takes at around 30 minutes on their GTX 1650 which i find hard to believe.
Currently the advanced training takes 45min and 30 seconds on my system. What's odd is that the GPU usage stays at around 30% only so i believe that the performance could be improved greatly by optimizing the usage.
Some assumtions.
- The batch size for the training data is set too low on purpose so that weaker systems don't run out of memory or can even handle the training
- the used pytorch or maybe xformers (if used) are not optimized for NVIDIA cards?
Is there anything else that could limit the GPU usage, because it feels like driving in first gear at a max of 10 mph.
My CPU usage is at 50% while training