Reduce the number of times load_weights is called in GhostTrainer's advance function #4931

Closed
nolan-dev opened this issue Feb 9, 2021 · 3 comments
Assignees
Labels
request Issue contains a feature request.

Comments

@nolan-dev

Is your feature request related to a problem? Please describe.
When I train policies with a large number of weights using self-play, a lot of time is spent in the load_weights function called from here: https://github.com/Unity-Technologies/ml-agents/blob/master/ml-agents/mlagents/trainers/ghost/trainer.py#L277
I'm still familiarizing myself with the code, but intuitively it doesn't seem necessary to load weights every time advance is called, which is what appears to happen now.

Describe the solution you'd like
Here's the code where load_weights is called frequently:

            try:
                policy = internal_policy_queue.get_nowait()
                self.current_policy_snapshot[brain_name] = policy.get_weights()
            except AgentManagerQueue.Empty:
                pass
            if next_learning_team in self._team_to_name_to_policy_queue:
                name_to_policy_queue = self._team_to_name_to_policy_queue[
                    next_learning_team
                ]
                if brain_name in name_to_policy_queue:
                    behavior_id = create_name_behavior_id(
                        brain_name, next_learning_team
                    )
                    policy = self.get_policy(behavior_id)
                    policy.load_weights(self.current_policy_snapshot[brain_name])
                    name_to_policy_queue[brain_name].put(policy)

My current impression is that name_to_policy_queue[brain_name].put(policy) only needs to be called when there's a policy update, and a policy update only occurs when the internal policy queue has a policy in it. If that's right, the solution may be to replace

except AgentManagerQueue.Empty:
    pass

with

except AgentManagerQueue.Empty:
    continue

When I do that, I get around a 30% speed increase. However, I haven't spent enough time with the mlagents code to be sure the change doesn't affect functionality.
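
For context, this is how the same block reads with that one-line change; when the internal queue is empty, the rest of the loop body for this brain is skipped. This is just a sketch based on the excerpt above, with the enclosing per-brain loop omitted:

try:
    policy = internal_policy_queue.get_nowait()
    self.current_policy_snapshot[brain_name] = policy.get_weights()
except AgentManagerQueue.Empty:
    # No new policy arrived from the trainer this step, so skip the
    # load_weights/put below instead of repeating it on every advance call.
    continue
if next_learning_team in self._team_to_name_to_policy_queue:
    name_to_policy_queue = self._team_to_name_to_policy_queue[
        next_learning_team
    ]
    if brain_name in name_to_policy_queue:
        behavior_id = create_name_behavior_id(
            brain_name, next_learning_team
        )
        policy = self.get_policy(behavior_id)
        policy.load_weights(self.current_policy_snapshot[brain_name])
        name_to_policy_queue[brain_name].put(policy)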

Describe alternatives you've considered
Unfortunately, I don't think there's a way around load_weights being an expensive function for models with a large number of weights.

Thanks

@nolan-dev nolan-dev added the request Issue contains a feature request. label Feb 9, 2021
@andrewcoh andrewcoh self-assigned this Feb 9, 2021
@andrewcoh

Hi @nolan-dev

Thank you very much for raising this. Your pass -> continue solution works as long as the learning team doesn't change, but I don't believe it will work when the learning team is swapped.

I think that with a few additional flags, though, we can address this properly and get the speedup you're reporting. I'll follow up on this thread with a fix.
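
Roughly, the idea would be to only load and push a policy when new weights arrived from the internal trainer or the learning team changed. A rough sketch (variable names like weights_updated and self._last_pushed_team are placeholders here, not the final implementation):

# Hypothetical sketch of the flag-based approach; names are placeholders.
team_changed = next_learning_team != self._last_pushed_team  # hypothetical bookkeeping
try:
    policy = internal_policy_queue.get_nowait()
    self.current_policy_snapshot[brain_name] = policy.get_weights()
    weights_updated = True
except AgentManagerQueue.Empty:
    weights_updated = False
if (
    (weights_updated or team_changed)
    and next_learning_team in self._team_to_name_to_policy_queue
):
    name_to_policy_queue = self._team_to_name_to_policy_queue[next_learning_team]
    if brain_name in name_to_policy_queue:
        behavior_id = create_name_behavior_id(brain_name, next_learning_team)
        policy = self.get_policy(behavior_id)
        policy.load_weights(self.current_policy_snapshot[brain_name])
        name_to_policy_queue[brain_name].put(policy)
        self._last_pushed_team = next_learning_team  # hypothetical bookkeeping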

@andrewcoh

Closing this as it has been addressed in #4934

@github-actions

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Mar 19, 2021